Mayank Daswani
Title | Cited by | Year
Self-modification of policy and utility function in rational agents
T Everitt, D Filan, M Daswani, M Hutter
International Conference on Artificial General Intelligence, 1-11, 2016
Cited by: 18, Year: 2016
Feature reinforcement learning: state of the art
M Daswani, P Sunehag, M Hutter
Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
Cited by: 11, Year: 2014
A definition of happiness for reinforcement learning agents
M Daswani, J Leike
International Conference on Artificial General Intelligence, 231-240, 2015
Cited by: 7, Year: 2015
Q-learning for history-based reinforcement learning
M Daswani, P Sunehag, M Hutter
MIT Press, 2013
Cited by: 7, Year: 2013
Feature Reinforcement Learning using Looping Suffix Trees
M Daswani, P Sunehag, M Hutter
JMLR Workshop and Conference Proceedings: EWRL 2012 24, 11-24, 2012
Cited by: 6, Year: 2012
Reinforcement learning with value advice
M Daswani, P Sunehag, M Hutter
Proceedings of the 6th Asian Conference on Machine Learning, 2014
Cited by: 1, Year: 2014
Generic Reinforcement Learning Beyond Small MDPs
M Daswani
The Australian National University, 2015
Year: 2015