Title | Authors | Venue | Cited by | Year |
Online Robust Reinforcement Learning with Model Uncertainty | Y Wang, S Zou | Advances in Neural Information Processing Systems 34, 2021 | 72 | 2021 |
Policy gradient method for robust reinforcement learning | Y Wang, S Zou | International Conference on Machine Learning, 23484-23526, 2022 | 43 | 2022 |
A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems | S He, Y Wang, S Han, S Zou, F Miao | 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 27* | 2023 |
Finite-sample analysis of Greedy-GQ with linear function approximation under Markovian noise | Y Wang, S Zou | Conference on Uncertainty in Artificial Intelligence, 11-20, 2020 | 25 | 2020 |
Non-asymptotic analysis for two time-scale TDC with general smooth function approximation | Y Wang, S Zou, Y Zhou | Advances in Neural Information Processing Systems 34, 9747-9758, 2021 | 15* | 2021 |
Robust average-reward Markov decision processes | Y Wang, A Velasquez, G Atia, A Prater-Bennette, S Zou | AAAI 2023, 2023 | 6 | 2023 |
Robust constrained reinforcement learning | Y Wang, F Miao, S Zou | arXiv preprint arXiv:2209.06866, 2022 | 5 | 2022 |
Model-free robust average-reward reinforcement learning | Y Wang, A Velasquez, GK Atia, A Prater-Bennette, S Zou | International Conference on Machine Learning, 36431-36469, 2023 | 3 | 2023 |
Finite-time error bounds for Greedy-GQ | Y Wang, Y Zhou, S Zou | Machine Learning, 1-38, 2024 | 1 | 2024 |
Data-driven robust multi-agent reinforcement learning | Y Wang, Y Wang, Y Zhou, A Velasquez, S Zou | 2022 IEEE 32nd International Workshop on Machine Learning for Signal …, 2022 | 1 | 2022 |
Achieving the Minimax Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach | Y Wang, J Xiong, S Zou | arXiv preprint arXiv:2305.13289, 2023 | | 2023 |