Mayank Daswani
Title
Cited by
Cited by
Year
Self-modification of policy and utility function in rational agents
T Everitt, D Filan, M Daswani, M Hutter
International Conference on Artificial General Intelligence, 1-11, 2016
212016
Feature reinforcement learning: State of the art
M Daswani, P Sunehag, M Hutter
Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
112014
A definition of happiness for reinforcement learning agents
M Daswani, J Leike
International Conference on Artificial General Intelligence, 231-240, 2015
72015
Q-learning for history-based reinforcement learning
M Daswani, P Sunehag, M Hutter
MIT Press, 2013
72013
Feature Reinforcement Learning using Looping Suffix Trees
M Daswani, P Sunehag, M Hutter
JMLR Workshop and Conference Proceedings : EWRL 2012 24, 11-24, 2012
62012
Reinforcement learning with value advice
M Daswani, P Sunehag, M Hutter
Asian Conference on Machine Learning, 299-314, 2015
22015
Generic Reinforcement Learning Beyond Small MDPs
M Daswani
The Australian National University, 2015
2015
The system can't perform the operation now. Try again later.
Articles 1–7