Tom Zahavy

Cited by

	All	Since 2019
Citations	1998	1753
h-index	20	19
i10-index	31	31

Citations per year
Year	Citations
2016	34
2017	58
2018	143
2019	171
2020	257
2021	308
2022	401
2023	454
2024	160

Public access (based on funding mandates)

3 articles available
0 articles not available

Co-authors

Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research	Verified email at technion.ac.il
Daniel J. Mankowitz	Google DeepMind	Verified email at google.com
Satinder Singh	Google DeepMind / U. of Michigan	Verified email at umich.edu
Sebastian Flennerhag	Research Scientist at DeepMind	Verified email at google.com
Chen Tessler	Research Scientist, NVIDIA Research	Verified email at nvidia.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Mordechai Segev	Solid State Institute, Physics Department and Electrical Engineering Department Technion - Israel	Verified email at technion.ac.il
Alex Dikopoltsev	Quantum Optoelectronics Group, Department of Physics, ETH	Verified email at phys.ethz.ch
Brendan O'Donoghue	Stanford University, Google DeepMind	Verified email at alumni.stanford.edu
Zhongwen Xu	Tencent	Verified email at tencent.com
Oren Cohen	Professor of Physics, Technion, Israel	Verified email at technion.ac.il
Vivek Veeriah	Google DeepMind	Verified email at google.com
David Silver	DeepMind, UCL	Verified email at google.com
Matteo Hessel	Research Engineer, Google DeepMind	Verified email at google.com
Junhyuk Oh	Research Scientist, DeepMind	Verified email at google.com
Nadav Merlis	Postdoctoral Fellow @ CREST, ENSAE Paris	Verified email at ensae.fr
Alessandro Magnani	Walmartlabs	Verified email at walmartlabs.com
Tom Schaul	Senior Staff Scientist, DeepMind	Verified email at nyu.edu
Valentin Dalibard	University of Cambridge	Verified email at cl.cam.ac.uk
Yannick Schroecker	DeepMind	Verified email at google.com

Tom Zahavy

Other names: Tom Ben Zion Zahavy

Staff Research Scientist, Google DeepMind

Verified email at deepmind.com

Reinforcement Learning


Title	Cited by	Year
A deep hierarchical approach to lifelong learning in Minecraft. C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor. Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	432	2017
Graying the black box: Understanding DQNs. T Zahavy, N Ben-Zrihem, S Mannor. International conference on machine learning (ICML), 1899-1908, 2016	319	2016
Learn what not to learn: Action elimination with deep reinforcement learning. T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor. Advances in neural information processing systems 31, 2018	232	2018
Deep learning reconstruction of ultrashort pulses. T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ... Optica 5 (5), 666-673, 2018	163	2018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce. T Zahavy, A Krishnan, A Magnani, S Mannor. Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	105*	2018
A self-tuning actor-critic algorithm. T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ... Advances in neural information processing systems 33, 20913-20924, 2020	78	2020
Bootstrapped meta-learning. S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh. International Conference on Learning Representations (ICLR) 2022, 2021	66	2021
Shallow updates for deep reinforcement learning. N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor. Advances in Neural Information Processing Systems 30, 2017	52	2017
Reward is enough for convex MDPs. T Zahavy, B O'Donoghue, G Desjardins, S Singh. Advances in Neural Information Processing Systems 34, 25746-25759, 2021	50	2021
Online limited memory neural-linear bandits with likelihood matching. O Nabati, T Zahavy, S Mannor. International Conference on Machine Learning, 7905-7915, 2021	37*	2021
Discovery of options via meta-learned subgoals. V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	35	2021
Ensemble robustness and generalization of stochastic deep learning algorithms. T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor. arXiv preprint arXiv:1602.02389, 2016	34*	2016
Discovering Evolution Strategies via Meta-Black-Box Optimization. R Tjarko Lange, T Schaul, Y Chen, T Zahavy, V Dalibard, C Lu, S Singh, ... International Conference on Learning Representations (ICLR) 2023, 2022	30*	2022
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality. T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... International Conference on Learning Representations (ICLR) 2023, 2022	27	2022
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot. R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ... Optics express 28 (5), 7528-7538, 2020	25	2020
Emphatic algorithms for deep reinforcement learning. R Jiang, T Zahavy, Z Xu, A White, M Hessel, C Blundell, H Van Hasselt. International Conference on Machine Learning (ICML), 5023-5033, 2021	22	2021
Online Apprenticeship Learning. L Shani, T Zahavy, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence, 2021	22	2021
Discovering a set of policies for the worst case reward. T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ... International Conference on Learning Representations (ICLR) 2021, 2021	22	2021
Visualizing dynamics: from t-SNE to semi-MDPs. NB Zrihem, T Zahavy, S Mannor. Workshop on Human Interpretability in Machine Learning, ICML (WHI 2016), 2016	21*	2016
Balancing constraints and rewards with meta-gradient D4PG. DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann. International Conference on Learning Representations (ICLR) 2021, 2020	20	2020

Articles 1–20
