Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks
S Arora, SS Du, W Hu, Z Li, R Wang
arXiv preprint arXiv:1901.08584, 2019
Towards understanding the role of over-parametrization in generalization of neural networks
B Neyshabur, Z Li, S Bhojanapalli, Y LeCun, N Srebro
arXiv preprint arXiv:1805.12076, 2018
On exact computation with an infinitely wide neural net
S Arora, SS Du, W Hu, Z Li, RR Salakhutdinov, R Wang
Advances in Neural Information Processing Systems, 8139-8148, 2019
Learning in games: Robustness of fast convergence
DJ Foster, Z Li, T Lykouris, K Sridharan, E Tardos
Advances in Neural Information Processing Systems, 4734-4742, 2016
Theoretical analysis of auto rate-tuning by batch normalization
S Arora, Z Li, K Lyu
arXiv preprint arXiv:1812.03981, 2018
Solving marginal MAP problems with NP oracles and parity constraints
Y Xue, Z Li, S Ermon, CP Gomes, B Selman
Advances in Neural Information Processing Systems, 1127-1135, 2016
Stability of generalized two-sided markets with transaction thresholds
Z Li, Y Liu, P Tang, T Xu, W Zhan
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017
An exponential learning rate schedule for deep learning
Z Li, S Arora
arXiv preprint arXiv:1910.07454, 2019
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
R Kuditipudi, X Wang, H Lee, Y Zhang, Z Li, W Hu, R Ge, S Arora
Advances in Neural Information Processing Systems, 14574-14583, 2019
Enhanced Convolutional Neural Tangent Kernels
Z Li, R Wang, D Yu, SS Du, W Hu, R Salakhutdinov, S Arora
arXiv preprint arXiv:1911.00809, 2019
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
S Arora, SS Du, Z Li, R Salakhutdinov, R Wang, D Yu
arXiv preprint arXiv:1910.01663, 2019
Understanding Generalization of Deep Neural Networks Trained with Noisy Labels
W Hu, Z Li, D Yu
arXiv preprint arXiv:1905.11368, 2019
Online Improper Learning with an Approximation Oracle
E Hazan, W Hu, Y Li, Z Li
Advances in Neural Information Processing Systems, 5657-5665, 2018
Implicit Regularization of Normalization Methods
X Wu, E Dobriban, T Ren, S Wu, Z Li, S Gunasekar, R Ward, Q Liu
arXiv preprint arXiv:1911.07956, 2019
