Follow
Brian Tanner
Brian Tanner
Research Engineer, DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
RL-Glue: Language-independent software for reinforcement-learning experiments
B Tanner, A White
The Journal of Machine Learning Research 10, 2133-2136, 2009
1692009
Temporal-difference networks
RS Sutton, B Tanner
Advances in neural information processing systems 17, 2004
1532004
Protecting against evaluation overfitting in empirical reinforcement learning
S Whiteson, B Tanner, ME Taylor, P Stone
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
1412011
Hierarchical heuristic search revisited
RC Holte, J Grajkowski, B Tanner
International Symposium on Abstraction, Reformulation, and Approximation …, 2005
662005
Using Predictive Representations to Improve Generalization in Reinforcement Learning.
EJ Rafols, MB Ring, RS Sutton, B Tanner
IJCAI, 835-840, 2005
652005
Report on the 2008 reinforcement learning competition
S Whiteson, B Tanner, A White
AI Magazine 31 (2), 81-81, 2010
572010
Td (λ) networks: temporal-difference networks with eligibility traces
B Tanner, RS Sutton
Proceedings of the 22nd international conference on Machine learning, 888-895, 2005
352005
Temporal-Difference Networks with History.
B Tanner, RS Sutton
IJCAI, 865-870, 2005
312005
Dynamic coalition formation in robotic soccer
J Anderson, B Tanner, J Baltes
Proceedings of the AAAI-04 Workshop on Forming and Maintaining Coalitions …, 2004
202004
Reward-respecting subtasks for model-based reinforcement learning
RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ...
Artificial Intelligence 324, 104001, 2023
192023
Grounding Abstractions in Predictive State Representations.
B Tanner, V Bulitko, A Koop, C Paduraru
IJCAI, 1077-1082, 2007
142007
Reinforcement learning from teammates of varying skill in robotic soccer
J Anderson, B Tanner, J Baltes
FIRA Robot World Congress, 2004
52004
Forming and Maintaining Coalitions & Teams in Adaptive Multiagent Systems
LK Soh, JE Anderson
AAAI Workshop, San Jose CA, 2004
42004
Peer reinforcement in homogeneous and heterogeneous multi-agent learning
J Anderson, B Tanner, R Wegner
Proceedings of the IASTED International Conference on Artificial …, 2002
42002
Temporal-difference networks
RS Sutton, B Tanner
arXiv preprint arXiv:1504.05539, 2015
32015
Exploiting opportunities through dynamic coalitions in robotic soccer
J Anderson, R Wegner, B Tanner
Proceedings of the AAAI International Workshop on Coalition Formation in …, 2002
22002
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint)
RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024
2024
Evaluating Agents using Social Choice Theory
M Lanctot, K Larson, Y Bachrach, L Marris, Z Li, A Bhoopchand, ...
arXiv preprint arXiv:2312.03121, 2023
2023
Numerical Optimization: Project Report New Objectives for Predictive Representations
B Tanner
2005
Name of Author: Brian Timothy Tanner Title of Thesis: Temporal-Difference Networks Degree: Master of Science Year this Degree Granted: 2005
BT Tanner
2005
The system can't perform the operation now. Try again later.
Articles 1–20