Hao Zhang

Cited by

	All	Since 2019
Citations	3012	3011
h-index	14	14
i10-index	17	17

1700

850

425

1275

202220232024185 1693 1127

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Feng LiPhD student, Hong Kong University of Science and TechnologyVerified email at connect.ust.hk
Shilong LiuPhD student, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Lei ZhangInternational Digital Economy Academy (IDEA)Verified email at idea.edu.cn
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondVerified email at microsoft.com
Lionel NiChair Professor of Data Science and Analytics, HKUST(Guangzhou)Verified email at ust.hk
Tianhe RenInternational Digital Economy Academy (IDEA)Verified email at idea.edu.cn
Chunyuan LiMicrosoft Research, RedmondVerified email at microsoft.com
Xueyan ZouPhD Student at UW-MadisonVerified email at wisc.edu
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Hongyang LiSouth China University of TechnologyVerified email at mail.scut.edu.cn
Heung-Yeung ShumMicrosoftVerified email at microsoft.com
Huaizhe XuHong Kong University of Science and TechnologyVerified email at connect.ust.hk
Ailing ZengVerified email at idea.edu.cn
Peize SunThe University of Hong KongVerified email at connect.hku.hk
Junchi YanFIET & Prof., Shanghai Jiao Tong University (2018-), RSM of IBM Research (2011-2018)Verified email at cs.sjtu.edu.cn

Hao Zhang

Other names张浩

The Hong Kong University of Science and Technology

Verified email at connect.ust.hk - Homepage

AI Computer Vision Multi-modality Object detection


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
DINO: Detr with improved denoising anchor boxes for end-to-end object detection H Zhang, F Li, S Liu*, L Zhang, H Su, J Zhu, LM Ni, HY Shum International Conference on Learning Representations (ICLR), 2023, 2022	740	2022
Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, C Li, J Yang, H Su, J Zhu, ... arXiv preprint arXiv:2303.05499, 2023	542	2023
DAB-DETR: Dynamic anchor boxes are better queries for DETR S Liu, F Li, H Zhang, X Yang, X Qi, H Su, J Zhu, L Zhang International Conference on Learning Representations (ICLR), 2022, 2022	482	2022
Dn-detr: Accelerate detr training by introducing query denoising F Li, H Zhang, S Liu, J Guo, LM Ni, L Zhang The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR …, 2022	425	2022
Segment everything everywhere all at once X Zou, J Yang, H Zhang, F Li, L Li, J Gao, YJ Lee NeurIPS 2023, 2023	229	2023
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation F Li, H Zhang, S Liu, L Zhang, LM Ni, HY Shum The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023, 2022	216	2022
A simple framework for open-vocabulary segmentation and detection H Zhang, F Li, X Zou, S Liu, C Li, J Gao, J Yang, L Zhang ICCV 2023, 2023	67	2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity F Li, H Zhang, P Sun, X Zou, S Liu, J Yang, C Li, L Zhang, J Gao arXiv preprint arXiv:2307.04767, 2023	65	2023
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V J Yang, H Zhang, F Li, X Zou, C Li, J Gao arXiv preprint arXiv:2310.11441, 2023	49	2023
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models F Li, H Zhang, YF Zhang, S Liu, J Guo, LM Ni, PC Zhang, L Zhang arXiv preprint arXiv:2203.01922, 2022	30	2022
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR F Li, A Zeng, S Liu, H Zhang, H Li, L Zhang, LM Ni CVPR 2023, 2023	29	2023
Llava-plus: Learning to use tools for creating multimodal agents S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ... arXiv preprint arXiv:2311.05437, 2023	26	2023
MP-Former: Mask-Piloted Transformer for Image Segmentation H Zhang, F Li, H Xu, S Huang, S Liu, LM Ni, L Zhang The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023, 2023	25	2023
Grounded sam: Assembling open-world models for diverse visual tasks T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen, X Huang, Y Chen, F Yan, ... arXiv preprint arXiv:2401.14159, 2024	18	2024
Detection Transformer with Stable Matching S Liu, T Ren, J Chen, Z Zeng, H Zhang, F Li, H Li, J Huang, H Su, J Zhu, ... ICCV 2023, 2023	14	2023
Multi-relation message passing for multi-label text classification M Ozmen, H Zhang, P Wang, M Coates ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	11	2022
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding S Liu, Y Liang, F Li, S Huang, H Zhang, H Su, J Zhu, L Zhang AAAI 2023, 2022	10	2022
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models H Zhang, H Li, F Li, T Ren, X Zou, S Liu, S Huang, J Gao, L Zhang, C Li, ... arXiv preprint arXiv:2312.02949, 2023	7	2023
A Strong and Reproducible Object Detector with Only Public Datasets T Ren, J Yang, S Liu, A Zeng, F Li, H Zhang, H Li, Z Zeng, L Zhang arxiv, 2023	6	2023
Introducing Depth into Transformer-based 3D Object Detection H Zhang, H Li, A Zeng, F Li, S Liu, X Liao, L Zhang arXiv preprint arXiv:2302.13002, 2023	6*	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors