Aviral Kumar

Cited by

	All	Since 2019
Citations	8591	8583
h-index	30	30
i10-index	41	41

Citations per year: 2019: 54, 2020: 328, 2021: 1137, 2022: 2134, 2023: 3223, 2024: 1695

Public access (based on funding mandates)

20 articles available
0 articles not available

Co-authors

Sergey Levine - UC Berkeley, Physical Intelligence - Verified email at eecs.berkeley.edu
George Tucker - Google Brain - Verified email at google.com
Chelsea Finn - Stanford University, Google - Verified email at cs.stanford.edu
Anikait Singh - Stanford University - Verified email at stanford.edu
Tianhe Yu - Google DeepMind - Verified email at google.com
Yevgen Chebotar - Figure AI - Verified email at figure.ai
Aurick Zhou - Waymo - Verified email at berkeley.edu
Rishabh Agarwal - Senior Research Scientist, Google DeepMind - Verified email at google.com
Xue Bin Peng - Assistant Professor, Simon Fraser University, NVIDIA - Verified email at sfu.ca
Kevin Swersky - Google Brain - Verified email at cs.toronto.edu

Aviral Kumar

Google DeepMind

Verified email at berkeley.edu

Machine Learning, Reinforcement Learning


Title	Cited by	Year
Offline reinforcement learning: Tutorial, review, and perspectives on open problems. S Levine, A Kumar, G Tucker, J Fu. arXiv preprint arXiv:2005.01643, 2020	1620	2020
Conservative Q-learning for offline reinforcement learning. A Kumar, A Zhou, G Tucker, S Levine. Advances in Neural Information Processing Systems 33, 1179-1191, 2020	1456	2020
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction. A Kumar, J Fu, G Tucker, S Levine. NeurIPS 2019, arXiv:1906.00949, 2019	916	2019
D4RL: Datasets for deep data-driven reinforcement learning. J Fu, A Kumar, O Nachum, G Tucker, S Levine. arXiv preprint arXiv:2004.07219, 2020	898	2020
Gemini: a family of highly capable multimodal models. Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	463	2023
Advantage-weighted regression: Simple and scalable off-policy reinforcement learning. XB Peng, A Kumar, G Zhang, S Levine. arXiv preprint arXiv:1910.00177, 2019	409	2019
COMBO: Conservative offline model-based policy optimization. T Yu, A Kumar, R Rafailov, A Rajeswaran, S Levine, C Finn. Advances in Neural Information Processing Systems 34, 28954-28967, 2021	319	2021
Trainable calibration measures for neural networks from kernel mean embeddings. A Kumar, S Sarawagi, U Jain. International Conference on Machine Learning, 2805-2814, 2018	258	2018
Graph Normalizing Flows. J Liu, A Kumar, J Ba, J Kiros, K Swersky. NeurIPS 2019, arXiv:1905.13177, 2019	256*	2019
OPAL: Offline primitive discovery for accelerating offline reinforcement learning. A Ajay, A Kumar, P Agrawal, S Levine, O Nachum. arXiv preprint arXiv:2010.13611, 2020	152	2020
Diagnosing Bottlenecks in Deep Q-learning Algorithms. J Fu, A Kumar, M Soh, S Levine. International Conference on Machine Learning (ICML) 2019, https://arxiv.org …, 2019	146	2019
Conservative safety critics for exploration. H Bharadhwaj, A Kumar, N Rhinehart, S Levine, F Shkurti, A Garg. arXiv preprint arXiv:2010.14497, 2020	119	2020
When should we prefer offline reinforcement learning over behavioral cloning? A Kumar, J Hong, A Singh, S Levine. arXiv preprint arXiv:2204.05618, 2022	113*	2022
DisCor: Corrective feedback in reinforcement learning via distribution correction. A Kumar, A Gupta, S Levine. Advances in Neural Information Processing Systems 33, 18560-18572, 2020	104	2020
COG: Connecting new skills to past experience with offline reinforcement learning. A Singh, A Yu, J Yang, J Zhang, A Kumar, S Levine. arXiv preprint arXiv:2010.14500, 2020	95	2020
Why generalization in RL is difficult: Epistemic POMDPs and implicit partial observability. D Ghosh, J Rahme, A Kumar, A Zhang, RP Adams, S Levine. Advances in Neural Information Processing Systems 34, 25502-25515, 2021	91	2021
Calibration of Encoder Decoder Models for Neural Machine Translation. A Kumar, S Sarawagi. https://arxiv.org/abs/1903.00802, 2019	84	2019
Reward-conditioned policies. A Kumar, XB Peng, S Levine. arXiv preprint arXiv:1912.13465, 2019	81	2019
A workflow for offline model-free robotic reinforcement learning. A Kumar, A Singh, S Tian, C Finn, S Levine. arXiv preprint arXiv:2109.10813, 2021	80	2021
One solution is not all you need: Few-shot extrapolation via structured MaxEnt RL. S Kumar, A Kumar, S Levine, C Finn. Advances in Neural Information Processing Systems 33, 8198-8210, 2020	79	2020
