Clip2video: Mastering video-text retrieval via image clip H Fang, P Xiong, L Xu, Y Chen arXiv preprint arXiv:2106.11097, 2021 | 296 | 2021 |
Triple-GAN: Progressive face aging with triple translation loss H Fang, W Deng, Y Zhong, J Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 49 | 2020 |
Mlfw: A database for face recognition on masked faces C Wang, H Fang, Y Zhong, W Deng Chinese Conference on Biometric Recognition, 180-188, 2022 | 31 | 2022 |
Generate to adapt: Resolution adaption network for surveillance face recognition H Fang, W Deng, Y Zhong, J Hu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 25 | 2020 |
Transferring image-clip to video-text retrieval via temporal relations H Fang, P Xiong, L Xu, W Luo IEEE Transactions on Multimedia 25, 7772-7785, 2022 | 18 | 2022 |
Dynamic training data dropout for robust deep face recognition Y Zhong, W Deng, H Fang, J Hu, D Zhao, X Li, D Wen IEEE Transactions on Multimedia 24, 1186-1197, 2021 | 17 | 2021 |
LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling K Ma, X Zang, Z Feng, H Fang, C Ban, Y Wei, Z He, Y Li, H Sun Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
CLIP2Video: Mastering video-text retrieval via image CLIP. CoRR abs/2106.11097 (2021) H Fang, P Xiong, L Xu, Y Chen arXiv preprint arXiv:2106.11097, 2021 | 6 | 2021 |
Alignment and generation adapter for efficient video-text understanding H Fang, Z Yang, Y Wei, X Zang, C Ban, Z Feng, Z He, Y Li, H Sun Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 4 | 2023 |
Trusted unified feature-neighborhood dynamics for multi-view classification H Huang, C Qin, Z Liu, K Ma, J Chen, H Fang, C Ban, H Sun, Z He arXiv preprint arXiv:2409.00755, 2024 | 3 | 2024 |
Beyond uncertainty: Evidential deep learning for robust video temporal grounding K Ma, H Huang, J Chen, H Chen, P Ji, X Zang, H Fang, C Ban, H Sun, ... arXiv preprint arXiv:2408.16272, 2024 | 3 | 2024 |
Adaptive re-balancing network with gate mechanism for long-tailed visual question answering H Chen, R Liu, H Fang, X Zhang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 3 | 2021 |
Semantic Segmentation of Aerial Image Using Fully Convolutional Network J Yang, Y Jiang, H Fang, Z Jiang, H Zhang, S Hao Image and Graphics Technologies and Applications: 13th Conference on Image …, 2018 | 3 | 2018 |
Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval H Fang, Z Yang, X Zang, C Ban, Z He, H Sun, L Zhou Proceedings of the 31st ACM International Conference on Multimedia, 3847-3856, 2023 | 2 | 2023 |
A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search X Zang, W Gao, G Li, H Fang, C Ban, Z He, H Sun Proceedings of the 31st ACM International Conference on Multimedia, 7737-7746, 2023 | 1 | 2023 |
GOAL: Grounded text-to-image Synthesis with Joint Layout Alignment Tuning Y Li, H Fang, Z Feng, K Ma, C Ban, X Zang, LX Zhou, Z He, J Chen, J Hu, ... Proceedings of the 32nd ACM International Conference on Multimedia, 7055-7064, 2024 | | 2024 |
BoViLA: Bootstrapping Video-Language Alignment via LLM-Based Self-Questioning and Answering J Chen, K Ma, H Huang, J Shen, H Fang, X Zang, C Ban, Z He, H Sun, ... arXiv preprint arXiv:2410.02768, 2024 | | 2024 |
Disentangle and denoise: Tackling context misalignment for video moment retrieval K Ma, H Fang, X Zang, C Ban, L Zhou, Z He, Y Li, H Sun, Z Feng, X Hou arXiv preprint arXiv:2408.07600, 2024 | | 2024 |
ProTA: Probabilistic Token Aggregation for Text-Video Retrieval H Fang, X Zang, C Ban, Z Feng, L Zhou, Z He, Y Li, H Sun ICME 2024, 2024 | | 2024 |
Augmented Face Representation Learning via Transitive Distillation H Fang, W Deng, Y Zhong, J Hu, D Zhao, X Li, D Wen 2021 16th IEEE International Conference on Automatic Face and Gesture …, 2021 | | 2021 |