Against The Achilles' Heel: A Survey on Red Teaming for Generative Models L Lin, H Mu, Z Zhai, M Wang, Y Wang, R Wang, J Gao, Y Zhang, W Che, ... arXiv preprint arXiv:2404.00629, 2024 | | 2024 |
Demystifying Instruction Mixing for Fine-tuning Large Language Models R Wang, M Wu, Y Wang, X Han, C Zhang, H Li, T Baldwin arXiv e-prints, arXiv: 2312.10793, 2023 | | 2023 |