Title | Authors | Venue | Cited by | Year
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-world Multi-task Agents | Z Wang, S Cai, G Chen, A Liu, X Ma, Y Liang | NeurIPS 2023 | 299* | 2023
Rethinking Graph Neural Architecture Search from Message-passing | S Cai, L Li, J Deng, B Zhang, ZJ Zha, L Su, Q Huang | CVPR 2021 | 66* | 2021
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models | Z Wang, S Cai, A Liu, Y Jin, J Hou, B Zhang, H Lin, Z He, Z Zheng, Y Yang, ... | Workshop on Agent Learning in Open-Endedness (ALOE) at NeurIPS 2023 | 51 | 2023
Open-World Multi-task Control Through Goal-aware Representation Learning and Adaptive Horizon Prediction | S Cai, Z Wang, X Ma, A Liu, Y Liang | CVPR 2023 | 28 | 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos | S Cai, B Zhang, Z Wang, X Ma, A Liu, Y Liang | ICLR 2024, Spotlight Presentation | 19 | 2023
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning | Z Liu, J Deng, L Li, S Cai, Q Xu, S Wang, Q Huang | ACM MM 2020, Oral Presentation | 19 | 2020
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings | B Li, S Cai, W Liu, P Zhang, Q He, M Hua, Z Yi | WACV 2023 | 12 | 2023
Automatic Relation-aware Graph Network Proliferation | S Cai, L Li, X Han, J Luo, ZJ Zha, Q Huang | CVPR 2022, Oral Presentation | 11 | 2022
Edge-featured Graph Neural Architecture Search | S Cai, L Li, X Han, Z Zha, Q Huang | arXiv preprint arXiv:2109.01356 | 7 | 2021
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition | S Cai, L Li, X Han, S Huang, Q Tian, Q Huang | TNNLS 2023 | 5 | 2023
Groot-1.5: Learning to follow multi-modal instructions from weak supervision | S Cai, B Zhang, Z Wang, X Ma, A Liu, Y Liang | Multi-modal Foundation Model meets Embodied AI Workshop @ ICML 2024 | 1 | 2024
Inductive State-Relabeling Adversarial Active Learning with Heuristic Clique Rescaling | B Zhang, L Li, S Wang, S Cai, ZJ Zha, Q Tian, Q Huang | TPAMI 2024 | | 2024
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents | Z Wang, S Cai, Z Mu, H Lin, C Zhang, X Liu, Q Li, A Liu, X Ma, Y Liang | NeurIPS 2024 | | 2024