Qinghao Ye

Cited by

	All	Since 2019
Citations	1518	1516
h-index	14	14
i10-index	17	17

720

360

180

540

202120222023202446 195 711 560

Public access

View all

7 articles

1 article

available

not available

Based on funding mandates

Co-authors

Guang Yang, IEEE Senior MemberAssociate Professor & UKRI Future Leaders Fellow, Bioengineering/Imperial-X, Imperial College LondonVerified email at imperial.ac.uk
Jun XiaDepartment of Radiology, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Center.Verified email at email.szu.edu.cn
Zhangming NiuMindRank, Imperial College LondonVerified email at mindrank.ai
Weiping Ding (Associate Editor of TNN...Nantong University(Full Professor, Ph.D, IEEE Senior Member)Verified email at ntu.edu.cn
Chengjia WangUniversity of EdinburghVerified email at ed.ac.uk
Li Yuan, 袁粒Peking University, School of ECE, Shenzhen Graduate SchoolVerified email at pku.edu.cn
Ling Shao, Fellow of IEEE/IAPRGeneral Terminus Technologies; Former CEO of IIAI, Initiator/Provost & EVP of MBZUAIVerified email at inceptioniai.org
Yuan GaoStaff Engineer, Alibaba Group, Damo AcademyVerified email at alibaba-inc.com

Qinghao Ye

DAMO Academy, Alibaba Group; University of California, San Diego

Verified email at alibaba-inc.com

Computer Vision Multimodal Learning Video Understanding


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Unbox the Black-box for the Medical Explainable AI via Multi-modal and Multi-centre Data Fusion: A Mini-Review, Two Showcases and Beyond G Yang, Q Ye, J Xia Information Fusion, 2021	416	2021
mPLUG-Owl: Modularization empowers large language models with multimodality Q Ye, H Xu, G Xu, J Ye, M Yan, Y Zhou, J Wang, A Hu, P Shi, Y Shi, C Li, ... arXiv preprint arXiv:2304.14178, 2023	413	2023
Exploring global diverse attention via pairwise temporal relation for video summarization P Li, Q Ye, L Zhang, L Yuan, X Xu, L Shao Pattern Recognition 111, 107677, 2021	89	2021
Explainable AI For COVID-19 CT Classifiers: An Initial Comparison Study Q Ye, J Xia, G Yang IEEE International Symposium on Computer-Based Medical Systems (CBMS 2021), 2021	82	2021
mplug-owl2: Revolutionizing multi-modal large language model with modality collaboration Q Ye, H Xu, J Ye, M Yan, H Liu, Q Qian, J Zhang, F Huang, J Zhou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	73	2024
mPLUG-2: A modularized multi-modal foundation model across text, image and video H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu, C Li, B Bi, Q Qian, W Wang, G Xu, ... Proceedings of International Conference on Machine Learning, 2023	73	2023
Hitea: Hierarchical temporal-aware video-language pre-training Q Ye, G Xu, M Yan, H Xu, Q Qian, J Zhang, F Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	47	2023
mplug-docowl: Modularized multimodal large language model for document understanding J Ye, A Hu, H Xu, Q Ye, M Yan, Y Dan, C Zhao, G Xu, C Li, J Tian, Q Qi, ... arXiv preprint arXiv:2307.02499, 2023	45	2023
Evaluation and analysis of hallucination in large vision-language models J Wang, Y Zhou, G Xu, P Shi, C Zhao, H Xu, Q Ye, M Yan, J Zhang, J Zhu, ... arXiv preprint arXiv:2308.15126, 2023	43	2023
Robust Weakly Supervised Learning for COVID-19 Recognition Using Multi-Center CT Images Q Ye, Y Gao, W Ding, Z Niu, C Wang, Y Jiang, M Wang, EF Fang, ... Applied Soft Computing, 2021	34	2021
Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion Q Ye, X Shen, Y Gao, Z Wang, Q Bi, P Li, G Yang International Conference on Computer Vision (ICCV 2021), 2021	33	2021
All grains, one scheme (AGOS): Learning multigrain instance representation for aerial scene classification Q Bi, B Zhou, K Qin, Q Ye, GS Xia IEEE Transactions on Geoscience and Remote Sensing 60, 1-17, 2022	30	2022
Ureader: Universal ocr-free visually-situated language understanding with multimodal large language model J Ye, A Hu, H Xu, Q Ye, M Yan, G Xu, C Li, J Tian, Q Qian, J Zhang, Q Jin, ... Association for Computational Linguistics: EMNLP 2023, 2841–2858, 2023	28	2023
Systematic and comprehensive automated ventricle segmentation on ventricle images of the elderly patients: a retrospective study X Zhou* (Co-First), Q Ye* (Co-First), Y Jiang, M Wang, Z Niu, ... Frontiers in Aging Neuroscience 12, 461, 2020	21*	2020
Can Clinical Symptoms and Laboratory Results Predict CT Abnormality? Initial Findings Using Novel Machine Learning Techniques in Children With COVID-19 Infections H Ma* (Co-First), Q Ye* (Co-First), W Ding, Y Jiang, M Wang, Z Niu, ... Frontiers in Medicine 8, 855, 2021	14*	2021
Hallucination augmented contrastive learning for multimodal large language model C Jiang, H Xu, M Dong, J Chen, W Ye, M Yan, Q Ye, J Zhang, F Huang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	13	2024
Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, and Jingren Zhou. mplug-2: A modularized multi-modal foundation model across text, image and video H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu, C Li Proceedings of International Conference of Machine Learning (ICML) 2, 2023	12	2023
mplug-paperowl: Scientific diagram analysis with the multimodal large language model A Hu, Y Shi, H Xu, J Ye, Q Ye, M Yan, C Li, Q Qian, J Zhang, F Huang arXiv preprint arXiv:2311.18248, 2023	9	2023
AI-based Medical e-Diagnosis for Fast and Automatic Ventricular Volume Measurement in the Patients with Normal Pressure Hydrocephalus X Zhou* (Co-First), Q Ye* (Co-First), X Yang, J Chen, H Ma, X Jun, JD Ser, ... Neural Computing and Applications, 2022	8*	2022
Youku-mplug: A 10 million large-scale chinese video-language dataset for pre-training and benchmarks H Xu, Q Ye, X Wu, M Yan, Y Miao, J Ye, G Xu, A Hu, Y Shi, G Xu, C Li, ... arXiv preprint arXiv:2306.04362, 2023	6	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors