Brian Tanner

Cited by

	All	Since 2019
Citations	788	235
h-index	11	7
i10-index	11	7

2004200520062007200820092010201120122013201420152016201720182019202020212022202320247 31 16 34 29 27 38 56 57 51 45 37 36 27 52 37 41 52 56 38 11

Brian Tanner

Research Engineer, DeepMind

Verified email at google.com

Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
RL-Glue: Language-independent software for reinforcement-learning experiments B Tanner, A White The Journal of Machine Learning Research 10, 2133-2136, 2009	169	2009
Temporal-difference networks RS Sutton, B Tanner Advances in neural information processing systems 17, 2004	153	2004
Protecting against evaluation overfitting in empirical reinforcement learning S Whiteson, B Tanner, ME Taylor, P Stone 2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011	141	2011
Hierarchical heuristic search revisited RC Holte, J Grajkowski, B Tanner International Symposium on Abstraction, Reformulation, and Approximation …, 2005	66	2005
Using Predictive Representations to Improve Generalization in Reinforcement Learning. EJ Rafols, MB Ring, RS Sutton, B Tanner IJCAI, 835-840, 2005	65	2005
Report on the 2008 reinforcement learning competition S Whiteson, B Tanner, A White AI Magazine 31 (2), 81-81, 2010	57	2010
Td (λ) networks: temporal-difference networks with eligibility traces B Tanner, RS Sutton Proceedings of the 22nd international conference on Machine learning, 888-895, 2005	35	2005
Temporal-Difference Networks with History. B Tanner, RS Sutton IJCAI, 865-870, 2005	31	2005
Dynamic coalition formation in robotic soccer J Anderson, B Tanner, J Baltes Proceedings of the AAAI-04 Workshop on Forming and Maintaining Coalitions …, 2004	20	2004
Reward-respecting subtasks for model-based reinforcement learning RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Artificial Intelligence 324, 104001, 2023	19	2023
Grounding Abstractions in Predictive State Representations. B Tanner, V Bulitko, A Koop, C Paduraru IJCAI, 1077-1082, 2007	14	2007
Reinforcement learning from teammates of varying skill in robotic soccer J Anderson, B Tanner, J Baltes FIRA Robot World Congress, 2004	5	2004
Forming and Maintaining Coalitions & Teams in Adaptive Multiagent Systems LK Soh, JE Anderson AAAI Workshop, San Jose CA, 2004	4	2004
Peer reinforcement in homogeneous and heterogeneous multi-agent learning J Anderson, B Tanner, R Wegner Proceedings of the IASTED International Conference on Artificial …, 2002	4	2002
Temporal-difference networks RS Sutton, B Tanner arXiv preprint arXiv:1504.05539, 2015	3	2015
Exploiting opportunities through dynamic coalitions in robotic soccer J Anderson, R Wegner, B Tanner Proceedings of the AAAI International Workshop on Coalition Formation in …, 2002	2	2002
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint) RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024		2024
Evaluating Agents using Social Choice Theory M Lanctot, K Larson, Y Bachrach, L Marris, Z Li, A Bhoopchand, ... arXiv preprint arXiv:2312.03121, 2023		2023
Numerical Optimization: Project Report New Objectives for Predictive Representations B Tanner		2005
Name of Author: Brian Timothy Tanner Title of Thesis: Temporal-Difference Networks Degree: Master of Science Year this Degree Granted: 2005 BT Tanner		2005

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by