Luowei Zhou
Title
Cited by
Cited by
Year
Towards automatic learning of procedures from web instructional videos
L Zhou, C Xu, JJ Corso
AAAI Conference on Artificial Intelligence, 2017
2402017
End-to-end dense video captioning with masked transformer
L Zhou, Y Zhou, JJ Corso, R Socher, C Xiong
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
2332018
Unified vision-language pre-training for image captioning and vqa
L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao
Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 13041 …, 2020
2072020
Watch what you just said: Image captioning with text-conditional attention
L Zhou, C Xu, P Koch, JJ Corso
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, 305-313, 2017
92*2017
Grounded video description
L Zhou, Y Kalantidis, X Chen, JJ Corso, M Rohrbach
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
892019
Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction
L Zhou, N Louis, JJ Corso
British Machine Vision Conference, 2018
452018
Multiagent reinforcement learning with sparse interactions by negotiation and knowledge transfer
L Zhou, P Yang, C Chen, Y Gao
IEEE transactions on cybernetics 47 (5), 1238-1250, 2016
382016
Less is more: Clipbert for video-and-language learning via sparse sampling
J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
312021
Dense video captioning
Y Zhou, L Zhou, C Xiong, R Socher
US Patent 10,542,270, 2020
212020
A balanced heuristic mechanism for multirobot task allocation of intelligent warehouses
L Zhou, Y Shi, J Wang, P Yang
Mathematical Problems in Engineering 2014, 2014
152014
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
H Huang, L Zhou, W Zhang, JJ Corso, C Xu
arXiv preprint arXiv:1812.05637, 2018
7*2018
Cluster-former: Clustering-based sparse transformer for long-range dependency encoding
S Wang, L Zhou, Z Gan, YC Chen, Y Fang, S Sun, Y Cheng, J Liu
arXiv preprint arXiv:2009.06097, 2020
6*2020
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
arXiv preprint arXiv:2106.04632, 2021
52021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
52021
CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning
L Zhou, J Liu, Y Cheng, Z Gan, L Zhang
arXiv preprint arXiv:2104.00285, 2021
42021
Language-Driven Video Understanding
L Zhou
12020
A Balanced Heristic Auction Method for Multi-Robot Task Allocation of Intelligent Warehouse [J]
Y Shi, L Zhou, J Wang, P YANG, C CHEN
Control and Decision 10 (15), 280-085, 2014
12014
Dense video captioning
Y Zhou, L Zhou, C Xiong, R Socher
US Patent 10,958,925, 2021
2021
Temporally Guided Articulated Hand Pose Tracking in Surgical Videos
N Louis, L Zhou, SJ Yule, RD Dias, M Manojlovich, FD Pagani, ...
arXiv preprint arXiv:2101.04281, 2021
2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training—-Supplement Material
M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu
The system can't perform the operation now. Try again later.
Articles 1–20