Hado van Hasselt
Hado van Hasselt
Research Scientist, Google DeepMind
Verified email at google.com - Homepage
TitleCited byYear
Deep reinforcement learning with double Q-learning
H van Hasselt, A Guez, D Silver
AAAI Conference on Artificial Intelligence, 2094-2100, 2016
15532016
Dueling Network Architectures for Deep Reinforcement Learning
Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas
The 33rd International Conference on Machine Learning, 1995–2003, 2016
8832016
Double Q-learning
H van Hasselt
Advances in Neural Information Processing Systems, 2613-2621, 2010
402*2010
Rainbow: Combining improvements in deep reinforcement learning
M Hessel, J Modayil, H van Hasselt, T Schaul, G Ostrovski, W Dabney, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
3812018
Starcraft ii: A new challenge for reinforcement learning
O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ...
arXiv preprint arXiv:1708.04782, 2017
2582017
Reinforcement learning in continuous action spaces
H van Hasselt, MA Wiering
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007 …, 2007
2052007
Reinforcement Learning in Continuous State and Action Spaces
H van Hasselt
Reinforcement Learning: State of the Art, 207-251, 2012
1552012
Successor features for transfer in reinforcement learning
A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ...
Advances in neural information processing systems, 4055-4065, 2017
1382017
Distributed prioritized experience replay
D Horgan, J Quan, D Budden, G Barth-Maron, M Hessel, H van Hasselt, ...
arXiv preprint arXiv:1803.00933, 2018
1322018
The predictron: End-to-end learning and planning
D Silver, H van Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1232017
Ensemble algorithms in reinforcement learning
MA Wiering, H van Hasselt
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 38 …, 2008
1182008
Deep Reinforcement Learning in Large Discrete Action Spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt
1122015
A theoretical and empirical analysis of Expected Sarsa
H van Seijen, H van Hasselt, S Whiteson, M Wiering
Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL'09 …, 2009
962009
Learning values across many orders of magnitude
H van Hasselt, A Guez, M Hessel, V Mnih, D Silver
Advances in Neural Information Processing Systems 29 (NIPS 2016), 2016
652016
Insights in reinforcement learning
HP van Hasselt
Hado van Hasselt, 2011
65*2011
Meta-gradient reinforcement learning
Z Xu, HP van Hasselt, D Silver
Advances in neural information processing systems, 2396-2407, 2018
60*2018
Weighted importance sampling for off-policy learning with linear function approximation
AR Mahmood, H van Hasselt, RS Sutton
Advances in Neural Information Processing Systems 27, 2014
562014
Using continuous action spaces to solve discrete problems
H van Hasselt, MA Wiering
Neural Networks, 2009. IJCNN 2009. International Joint Conference on, 1149-1156, 2009
412009
Observe and look further: Achieving consistent performance on atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
382018
Adaptive serious games using agent organizations
J Westra, H van Hasselt, F Dignum, V Dignum
International Workshop on Agents for Games and Simulations, 206-220, 2009
322009
The system can't perform the operation now. Try again later.
Articles 1–20