Follow
Zuyue Fu
Title
Cited by
Cited by
Year
Actor-critic provably finds Nash equilibria of linear-quadratic mean-field games
Z Fu, Z Yang, Y Chen, Z Wang
International Conference on Learning Representations, 2019
592019
Instrumental variable value iteration for causal offline reinforcement learning
L Liao, Z Fu, Z Yang, Y Wang, M Kolar, Z Wang
arXiv preprint arXiv:2102.09907, 2021
412021
Single-timescale actor-critic provably finds globally optimal policy
Z Fu, Z Yang, Z Wang
International Conference on Learning Representations, 2020
412020
Offline reinforcement learning with instrumental variables in confounded markov decision processes
Z Fu, Z Qi, Z Wang, Z Yang, Y Xu, MR Kosorok
arXiv preprint arXiv:2209.08666, 2022
162022
Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation
Z Liu, Y Zhang, Z Fu, Z Yang, Z Wang
International conference on machine learning, 14094-14138, 2022
14*2022
False Correlation Reduction for Offline Reinforcement Learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
9*2023
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games
H Guo, Z Fu, Z Yang, Z Wang
International Conference on Machine Learning, 3899-3909, 2021
82021
Convergent reinforcement learning with function approximation: A bilevel optimization perspective
Z Yang, Z Fu, K Zhang, Z Wang
62018
Sample elicitation
J Wei, Z Fu, Y Liu, X Li, Z Yang, Z Wang
International Conference on Artificial Intelligence and Statistics, 2692-2700, 2021
22021
A two-fold structural classification method for determining the accurate ensemble of protein structures
P Tan, Z Fu, L Petridis, S Qian, D You, D Wei, J Li, L Hong
Communications in Computational Physics 25 (4), 2018
12018
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Z Fu, Z Qi, Z Yang, Z Wang, L Wang
arXiv preprint arXiv:2212.12167, 2022
2022
Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics
S Zheng, L Wang, S Qiu, Z Fu, Z Yang, C Szepesvari, Z Wang
The Eleventh International Conference on Learning Representations, 2022
2022
On the Optimality and Complexity of Reinforcement Learning
Z Fu
Northwestern University, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–13