Tor Lattimore

Cited by

	All	Since 2019
Citations	6983	6505
h-index	38	35
i10-index	67	62

1600

800

400

1200

20132014201520162017201820192020202120222023202424 28 53 57 95 182 349 763 1219 1438 1576 1154

Public access

View all

23 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Marcus HutterResearcher@DeepMind & Professor at ANUVerified email at anu.edu.au
Botao HaoOpenAIVerified email at openai.com
Andras GyorgyDeepMindVerified email at google.com
Laurent OrseauResearch Scientist at Google DeepMindVerified email at google.com
Branislav KvetonAmazonVerified email at amazon.com
Eren SezenerDeepMindVerified email at google.com
Ian OsbandOpenAIVerified email at openai.com
Christoph DannResearch Scientist, GoogleVerified email at google.com
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Joel VenessGoogle DeepMindVerified email at google.com
Julian ZimmertGoogle ResearchVerified email at google.com
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton UniversityVerified email at princeton.edu
Avishkar BhoopchandResearch Engineer, DeepMindVerified email at google.com
Agnieszka Grabska BarwińskaDeepMindVerified email at google.com
Peter TothAI ResearchVerified email at techcombank.com.vn
Benjamin Van RoyStanford UniversityVerified email at stanford.edu
Satinder SinghGoogle DeepMind / U. of MichiganVerified email at umich.edu
Johannes KirschnerSwiss Data Science Center, ETH ZurichVerified email at sdsc.ethz.ch
Dale SchuurmansUniversity of Alberta, Google DeepMindVerified email at cs.ualberta.ca

Tor Lattimore

DeepMind

Verified email at google.com - Homepage

machine learning learning theory reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bandit algorithms T Lattimore, C Szepesvári Cambridge University Press, 2020	2867	2020
Unifying PAC and regret: Uniform PAC bounds for episodic reinforcement learning C Dann, T Lattimore, E Brunskill Advances in Neural Information Processing Systems 30, 2017	315	2017
Causal bandits: Learning good interventions via causal inference F Lattimore, T Lattimore, MD Reid Advances in neural information processing systems 29, 2016	272*	2016
Degenerate feedback loops in recommender systems R Jiang, S Chiappa, T Lattimore, A György, P Kohli Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 383-390, 2019	228	2019
Learning with good feature representations in bandits and in rl with a generative model T Lattimore, C Szepesvari, G Weisz International conference on machine learning, 5662-5670, 2020	190	2020
Behaviour suite for reinforcement learning I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ... arXiv preprint arXiv:1908.03568, 2019	180	2019
PAC bounds for discounted MDPs T Lattimore, M Hutter Algorithmic Learning Theory: 23rd International Conference, ALT 2012, Lyon …, 2012	144	2012
The end of optimism? an asymptotic analysis of finite-armed linear bandits T Lattimore, C Szepesvari Artificial Intelligence and Statistics, 728-737, 2017	137	2017
Conservative bandits Y Wu, R Shariff, T Lattimore, C Szepesvári International Conference on Machine Learning, 1254-1262, 2016	126	2016
On explore-then-commit strategies A Garivier, T Lattimore, E Kaufmann Advances in Neural Information Processing Systems 29, 2016	121	2016
A geometric perspective on optimal representations for reinforcement learning M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ... Advances in neural information processing systems 32, 2019	104	2019
Model selection in contextual stochastic bandit problems A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ... Advances in Neural Information Processing Systems 33, 10328-10337, 2020	98	2020
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	78	2019
Toprank: A practical algorithm for online stochastic ranking T Lattimore, B Kveton, S Li, C Szepesvari Advances in Neural Information Processing Systems 31, 2018	73	2018
Linear bandits with stochastic delayed feedback C Vernade, A Carpentier, T Lattimore, G Zappella, B Ermis, M Brueckner International Conference on Machine Learning, 9712-9721, 2020	71	2020
Near-optimal PAC bounds for discounted MDPs T Lattimore, M Hutter Theoretical Computer Science 558, 125-143, 2014	70	2014
The sample-complexity of general reinforcement learning T Lattimore, M Hutter, P Sunehag International Conference on Machine Learning, 28-36, 2013	70	2013
Bounded Regret for Finite-Armed Structured Bandits T Lattimore, R Munos	69	2014
Adaptive exploration in linear contextual bandit B Hao, T Lattimore, C Szepesvari International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	66	2020
An information-theoretic approach to minimax regret in partial monitoring T Lattimore, C Szepesvári Conference on Learning Theory, 2111-2139, 2019	66	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors