Follow
Sam Toyer
Sam Toyer
Verified email at berkeley.edu - Homepage
Title
Cited by
Cited by
Year
Variational discriminator bottleneck: Improving imitation learning, inverse RL, and GANs by constraining information flow
XB Peng, A Kanazawa, S Toyer, P Abbeel, S Levine
ICLR 2019, 2018
2622018
Action Schema Networks: Generalised Policies with Deep Learning
S Toyer, F Trevizan, S Thiebaux, L Xie
AAAI Conference on Artificial Intelligence (AAAI), 2018
1242018
Asnets: Deep learning for generalised planning
S Toyer, S Thiébaux, F Trevizan, L Xie
Journal of Artificial Intelligence Research 68, 1-68, 2020
912020
Human pose forecasting via deep Markov models
S Toyer, A Cherian, T Han, S Gould
International Conference on Digital Image Computing: Techniques and …, 2017
582017
imitation: Clean imitation learning implementations
A Gleave, M Taufeeque, J Rocamonde, E Jenner, SH Wang, S Toyer, ...
arXiv preprint arXiv:2211.11972, 2022
552022
The MAGICAL Benchmark for Robust Imitation
S Toyer, R Shah, A Critch, S Russell
NeurIPS 2020, 2020
532020
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
S Toyer, O Watkins, EA Mendes, J Svegliato, L Bailey, T Wang, I Ong, ...
ICLR 2024, 2023
522023
A strongreject for empty jailbreaks
A Souly, Q Lu, D Bowen, T Trinh, E Hsieh, S Pandey, P Abbeel, ...
arXiv preprint arXiv:2402.10260, 2024
342024
An Empirical Investigation of Representation Learning for Imitation
X Chen, S Toyer, C Wild, S Emmons, I Fischer, KH Lee, N Alex, SH Wang, ...
NeurIPS 2021, Datasets and Benchmarks Track, 2021
302021
The imitation library for imitation learning and inverse reinforcement learning
S Wang, S Toyer, A Gleave, S Emmons
252020
A primer on maximum causal entropy inverse reinforcement learning
A Gleave, S Toyer
arXiv preprint arXiv:2203.11409, 2022
222022
Publishing and Using Earth Observation Data with the RDF Data Cube and the Discrete Global Grid System
D Brizhinev, S Toyer, K Taylor
https://www.w3.org/TR/eo-qb/, 2017
202017
Guiding search with generalized policies for probabilistic planning
W Shen, F Trevizan, S Toyer, S Thiébaux, L Xie
Proceedings of the International Symposium on Combinatorial Search 10 (1 …, 2019
172019
Derail: Diagnostic environments for reward and imitation learning
P Freire, A Gleave, S Toyer, S Russell
arXiv preprint arXiv:2012.01365, 2020
102020
Computer vision training using paired image data
S Gould, S Toyer, D Reiner
US Patent App. 16/360,954, 2019
72019
seals: Suite of environments for algorithms that learn specifications
A Gleave, P Freire, S Wang, S Toyer
62020
Variational discriminator bottleneck: improving imitation learning
XB Peng, A Kanazawa, S Toyer, P Abbeel, S Levine
Inverse RL, and GANs by Constraining Information Flow.[(accessed on 29 …, 2018
52018
Generalised policies for probabilistic planning with deep learning
S Toyer
Research and development, honours thesis, Research School of Computer …, 2017
42017
Action Schema Networks–IPC Version
M Hao, S Toyer, R Wang, S Thiébaux, F Trevizan
Tenth International Planning Competition (IPC-10) Learning Track: Planner …, 2023
32023
Single image completion from retrieved image collections
S Gould, S Toyer, D Reiner
US Patent 10,885,628, 2021
22021
The system can't perform the operation now. Try again later.
Articles 1–20