The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016 KA Lee, V Hautamäki, T Kinnunen, A Larcher, C Zhang, A Nautsch, ... Interspeech, 1328-1332, 2017 | 27 | 2017 |
Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification J Ma, V Sethu, E Ambikairajah, KA Lee Interspeech, 2016 | 11 | 2016 |
Generalized variability model for speaker verification J Ma, V Sethu, E Ambikairajah, KA Lee IEEE Signal Processing Letters, 2018 | 9 | 2018 |
Duration compensation of i‐vectors for short duration speaker verification J Ma, V Sethu, E Ambikairajah, KA Lee Electronics Letters 53 (6), 405-407, 2017 | 9 | 2017 |
Speaker-phonetic vector estimation for short duration speaker verification J Ma, V Sethu, E Ambikairajah, KA Lee 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 8 | 2018 |
The I4U submission to the 2016 NIST speaker recognition evaluation KA Lee, H Sun, S Aleksandr, W Guangsen Proceedings of the NIST SRE 2016 Workshop, San Diego, CA, 2016 | 5 | 2016 |
Parallel Speaker and Content Modelling for Text-dependent Speaker Verification J Ma, S Irtza, K Sriskandaraja, V Sethu, E Ambikairajah | 5 | 2016 |
Incorporating Local Acoustic Variability Information into Short Duration Speaker Verification. J Ma, V Sethu, E Ambikairajah, KA Lee INTERSPEECH, 1502-1506, 2017 | 3 | 2017 |
An end-to-end far-field keyword spotting system with neural beamforming X Ji, L Lu, F Fang, J Ma, L Zhu, J Li, D Zhao, M Liu, F Jiang 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 2 | 2021 |
V2a-mapper: A lightweight solution for vision-to-audio generation by connecting foundation models H Wang, J Ma, S Pascual, R Cartwright, W Cai Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15492 …, 2024 | 1 | 2024 |
A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model D Zhao, J Ma, L Lu, J Li, X Ji, L Zhu, F Fang, M Liu, F Jiang arXiv preprint arXiv:2401.02673, 2024 | | 2024 |
Low latency transformers for speech processing J Ma, S Pan, D Chandran, A Fanelli, R Cartwright arXiv preprint arXiv:2302.13451, 2023 | | 2023 |
Vairable Scale LoCaPE: A Practical High Resolution Spectral Zooming Tool for Mnova J Ma, E Aboutanios, C Cobas ANZMAG 2019, 2019 | | 2019 |
Modelling and compensation techniques for short duration speaker verification J Ma UNSW Sydney, 2019 | | 2019 |
Use of Uncertainty Propagation in Twin Model GPLDA for Short Duration Speaker Verification J Ma, V Sethu, E Ambikairajah, KA Lee Australasian International Conference on Speech Science and Technology (17th …, 2018 | | 2018 |
System Description for MCE 2018 J Ma, V Sethu, E Ambikairajah system 1, 10.46, 0 | | |