Di He
Microsoft Research
Verified email at microsoft.com
Title
Cited by
Year
Dual learning for machine translation
D He, Y Xia, T Qin, L Wang, N Yu, TY Liu, WY Ma
Advances in neural information processing systems, 820-828, 2016
Cited by: 458*; Year: 2016
A theoretical analysis of NDCG ranking measures
Y Wang, L Wang, Y Li, D He, W Chen, TY Liu
Proceedings of the 26th Annual Conference on Learning Theory (COLT 2013) 8, 6, 2013
Cited by: 286*; Year: 2013
FRAGE: Frequency-agnostic word representation
C Gong, D He, X Tan, T Qin, L Wang, TY Liu
Advances in Neural Information Processing Systems, 1334-1345, 2018
Cited by: 75; Year: 2018
Multilingual neural machine translation with knowledge distillation
X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu
arXiv preprint arXiv:1902.10461, 2019
Cited by: 64; Year: 2019
Layer-wise coordination between encoder and decoder for neural machine translation
T He, X Tan, Y Xia, D He, T Qin, Z Chen, TY Liu
Advances in Neural Information Processing Systems, 7944-7954, 2018
Cited by: 52; Year: 2018
Non-autoregressive machine translation with auxiliary regularization
Y Wang, F Tian, D He, T Qin, CX Zhai, TY Liu
AAAI 2019, 2019
Cited by: 44; Year: 2019
Non-autoregressive neural machine translation with enhanced decoder input
J Guo, X Tan, D He, T Qin, L Xu, TY Liu
Proceedings of the AAAI Conference on Artificial Intelligence 33, 3723-3730, 2019
Cited by: 42; Year: 2019
A game-theoretic machine learning approach for revenue maximization in sponsored search
D He, W Chen, L Wang, TY Liu
arXiv preprint arXiv:1406.0728, 2014
Cited by: 41; Year: 2014
Towards binary-valued gates for robust LSTM training
Z Li, D He, F Tian, W Chen, T Qin, L Wang, TY Liu
ICML 2018, 2018
Cited by: 31; Year: 2018
Decoding with value networks for neural machine translation
D He, H Lu, Y Xia, T Qin, L Wang, TY Liu
Advances in Neural Information Processing Systems, 178-187, 2017
Cited by: 31; Year: 2017
Adversarially robust generalization just requires more unlabeled data
R Zhai, T Cai, D He, C Dan, K He, J Hopcroft, L Wang
arXiv preprint arXiv:1906.00555, 2019
Cited by: 30; Year: 2019
Incorporating BERT into neural machine translation
J Zhu, Y Xia, L Wu, D He, T Qin, W Zhou, H Li, TY Liu
arXiv preprint arXiv:2002.06823, 2020
Cited by: 24; Year: 2020
Hint-based training for non-autoregressive translation
Z Li, D He, F Tian, T Qin, L Wang, TY Liu
Cited by: 22*; Year: 2018
Towards a deep and unified understanding of deep neural models in NLP
C Guan, X Wang, Q Zhang, R Chen, D He, X Xie
International Conference on Machine Learning, 2454-2463, 2019
Cited by: 20; Year: 2019
Fast structured decoding for sequence models
Z Sun, Z Li, H Wang, D He, Z Lin, Z Deng
Advances in Neural Information Processing Systems, 3016-3026, 2019
Cited by: 20; Year: 2019
Dense information flow for neural machine translation
Y Shen, X Tan, D He, T Qin, TY Liu
arXiv preprint arXiv:1806.00722, 2018
Cited by: 20; Year: 2018
On layer normalization in the transformer architecture
R Xiong, Y Yang, D He, K Zheng, S Zheng, C Xing, H Zhang, Y Lan, ...
arXiv preprint arXiv:2002.04745, 2020
Cited by: 17; Year: 2020
Beyond error propagation in neural machine translation: Characteristics of language also matter
L Wu, X Tan, D He, F Tian, T Qin, J Lai, TY Liu
arXiv preprint arXiv:1809.00120, 2018
Cited by: 15; Year: 2018
Understanding and improving transformer from a multi-particle dynamic system point of view
Y Lu, Z Li, D He, Z Sun, B Dong, T Qin, L Wang, TY Liu
arXiv preprint arXiv:1906.02762, 2019
Cited by: 14; Year: 2019
Sentence level recurrent topic model: letting topics speak for themselves
F Tian, B Gao, D He, TY Liu
arXiv preprint arXiv:1604.02038, 2016
Cited by: 13; Year: 2016