Daniel J. Mankowitz
Daniel J. Mankowitz
Google Deepmind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
In Proc. Association for the Advancement of Artificial Intelligence (AAAI), 2017
1752017
Learn what not to learn: Action elimination with deep reinforcement learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in Neural Information Processing Systems, 3562-3573, 2018
40*2018
Challenges of real-world reinforcement learning
G Dulac-Arnold, D Mankowitz, T Hester
arXiv preprint arXiv:1904.12901, 2019
362019
Transfer in deep reinforcement learning using successor features and generalised policy improvement
A Barreto, D Borsa, J Quan, T Schaul, D Silver, M Hessel, D Mankowitz, ...
arXiv preprint arXiv:1901.10964, 2019
362019
Adaptive Skills Adaptive Partitions (ASAP)
DJ Mankowitz, TA Mann, S Mannor
Neural Information Processing Systems (NIPS), Barcelona, Spain, 2016
362016
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
342018
Time-Regularized Interrupting Options (TRIO)
D Mankowitz, T Mann, S Mannor
Proceedings of the 31st International Conference on Machine Learning (ICML …, 2014
33*2014
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems, 3135-3145, 2017
232017
Unicorn: Continual learning with a universal, off-policy agent
DJ Mankowitz, A Žídek, A Barreto, D Horgan, M Hessel, J Quan, J Oh, ...
arXiv preprint arXiv:1802.08294, 2018
212018
Universal successor features approximators
D Borsa, A Barreto, J Quan, D Mankowitz, R Munos, H van Hasselt, ...
arXiv preprint arXiv:1812.07626, 2018
102018
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
102018
Mobile device-based cellular network coverage analysis using crowd sourcing
JD Mankowitz, AJ Paverd
2011 IEEE EUROCON-International Conference on Computer as a Tool, 1-6, 2011
102011
BRISK-based visual feature extraction for resource constrained robots
DJ Mankowitz, S Ramamoorthy
Robot Soccer World Cup, 195-206, 2013
72013
Iterative hierarchical optimization for misspecified problems (IHOMP)
DJ Mankowitz, TA Mann, S Mannor
arXiv preprint arXiv:1602.03348, 2016
62016
Soft-robust actor-critic policy-gradient
E Derman, DJ Mankowitz, TA Mann, S Mannor
arXiv preprint arXiv:1803.04848, 2018
42018
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, A Abdolmaleki, JT Springenberg, ...
arXiv preprint arXiv:1906.07516, 2019
32019
CFORB: Circular FREAK-ORB Visual Odometry
DJ Mankowitz, E Rivlin
arXiv preprint arXiv:1506.05257, 2015
32015
Learning when to Switch Between Skills in High Dimensional Domains
T Mann, D Mankowitz, S Mannor
Workshop on Learning for General Competency in Video Games - AAAI 2015, 2015
32015
BRISK-based Visual Landmark Localisation using Nao Humanoid Robots
DJ Mankowitz
University of Edinburgh, 2012
32012
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1905.09700, 2019
22019
The system can't perform the operation now. Try again later.
Articles 1–20