Unbox the Black-box for the Medical Explainable AI via Multi-modal and Multi-centre Data Fusion: A Mini-Review, Two Showcases and Beyond G Yang, Q Ye, J Xia Information Fusion, 2021 | 416 | 2021 |
mPLUG-Owl: Modularization empowers large language models with multimodality Q Ye, H Xu, G Xu, J Ye, M Yan, Y Zhou, J Wang, A Hu, P Shi, Y Shi, C Li, ... arXiv preprint arXiv:2304.14178, 2023 | 413 | 2023 |
Exploring global diverse attention via pairwise temporal relation for video summarization P Li, Q Ye, L Zhang, L Yuan, X Xu, L Shao Pattern Recognition 111, 107677, 2021 | 89 | 2021 |
Explainable AI For COVID-19 CT Classifiers: An Initial Comparison Study Q Ye, J Xia, G Yang IEEE International Symposium on Computer-Based Medical Systems (CBMS 2021), 2021 | 82 | 2021 |
mplug-owl2: Revolutionizing multi-modal large language model with modality collaboration Q Ye, H Xu, J Ye, M Yan, H Liu, Q Qian, J Zhang, F Huang, J Zhou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 73 | 2024 |
mPLUG-2: A modularized multi-modal foundation model across text, image and video H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu, C Li, B Bi, Q Qian, W Wang, G Xu, ... Proceedings of International Conference on Machine Learning, 2023 | 73 | 2023 |
Hitea: Hierarchical temporal-aware video-language pre-training Q Ye, G Xu, M Yan, H Xu, Q Qian, J Zhang, F Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 47 | 2023 |
mplug-docowl: Modularized multimodal large language model for document understanding J Ye, A Hu, H Xu, Q Ye, M Yan, Y Dan, C Zhao, G Xu, C Li, J Tian, Q Qi, ... arXiv preprint arXiv:2307.02499, 2023 | 45 | 2023 |
Evaluation and analysis of hallucination in large vision-language models J Wang, Y Zhou, G Xu, P Shi, C Zhao, H Xu, Q Ye, M Yan, J Zhang, J Zhu, ... arXiv preprint arXiv:2308.15126, 2023 | 43 | 2023 |
Robust Weakly Supervised Learning for COVID-19 Recognition Using Multi-Center CT Images Q Ye, Y Gao, W Ding, Z Niu, C Wang, Y Jiang, M Wang, EF Fang, ... Applied Soft Computing, 2021 | 34 | 2021 |
Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion Q Ye, X Shen, Y Gao, Z Wang, Q Bi, P Li, G Yang International Conference on Computer Vision (ICCV 2021), 2021 | 33 | 2021 |
All grains, one scheme (AGOS): Learning multigrain instance representation for aerial scene classification Q Bi, B Zhou, K Qin, Q Ye, GS Xia IEEE Transactions on Geoscience and Remote Sensing 60, 1-17, 2022 | 30 | 2022 |
Ureader: Universal ocr-free visually-situated language understanding with multimodal large language model J Ye, A Hu, H Xu, Q Ye, M Yan, G Xu, C Li, J Tian, Q Qian, J Zhang, Q Jin, ... Association for Computational Linguistics: EMNLP 2023, 2841–2858, 2023 | 28 | 2023 |
Systematic and comprehensive automated ventricle segmentation on ventricle images of the elderly patients: a retrospective study X Zhou* (Co-First), Q Ye* (Co-First), Y Jiang, M Wang, Z Niu, ... Frontiers in Aging Neuroscience 12, 461, 2020 | 21* | 2020 |
Can Clinical Symptoms and Laboratory Results Predict CT Abnormality? Initial Findings Using Novel Machine Learning Techniques in Children With COVID-19 Infections H Ma* (Co-First), Q Ye* (Co-First), W Ding, Y Jiang, M Wang, Z Niu, ... Frontiers in Medicine 8, 855, 2021 | 14* | 2021 |
Hallucination augmented contrastive learning for multimodal large language model C Jiang, H Xu, M Dong, J Chen, W Ye, M Yan, Q Ye, J Zhang, F Huang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 13 | 2024 |
Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, and Jingren Zhou. mplug-2: A modularized multi-modal foundation model across text, image and video H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu, C Li Proceedings of International Conference of Machine Learning (ICML) 2, 2023 | 12 | 2023 |
mplug-paperowl: Scientific diagram analysis with the multimodal large language model A Hu, Y Shi, H Xu, J Ye, Q Ye, M Yan, C Li, Q Qian, J Zhang, F Huang arXiv preprint arXiv:2311.18248, 2023 | 9 | 2023 |
AI-based Medical e-Diagnosis for Fast and Automatic Ventricular Volume Measurement in the Patients with Normal Pressure Hydrocephalus X Zhou* (Co-First), Q Ye* (Co-First), X Yang, J Chen, H Ma, X Jun, JD Ser, ... Neural Computing and Applications, 2022 | 8* | 2022 |
Youku-mplug: A 10 million large-scale chinese video-language dataset for pre-training and benchmarks H Xu, Q Ye, X Wu, M Yan, Y Miao, J Ye, G Xu, A Hu, Y Shi, G Xu, C Li, ... arXiv preprint arXiv:2306.04362, 2023 | 6 | 2023 |