CompeteSMoE--Effective Training of Sparse Mixture of Experts via Competition Q Pham, G Do, H Nguyen, TT Nguyen, C Liu, M Sartipi, BT Nguyen, ... arXiv preprint arXiv:2402.02526, 2024 | 11 | 2024 |
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts G Do, K Le, Q Pham, T Nguyen, TN Doan, BT Nguyen, C Liu, ... The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 | 11 | 2023 |
Competesmoe–effective training of sparse mixture of experts via competition, 2024 Q Pham, G Do, H Nguyen, T Nguyen, C Liu, M Sartipi, BT Nguyen, ... Cited on, 1, 0 | 2 | |
SimSMoE: Solving Representational Collapse via Similarity Measure G Do, H Le, T Tran 2025 Annual Conference of the Nations of the Americas Chapter of the …, 2025 | | 2025 |
On the effectiveness of discrete representations in sparse mixture of experts G Do, K Pham, H Le, T Tran arXiv preprint arXiv:2411.19402, 2024 | | 2024 |