End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions Paper • 2601.17640 • Published 26 days ago • 5
Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis Paper • 2601.14417 • Published 30 days ago • 5
VoxCog: Towards End-to-End Multilingual Cognitive Impairment Classification through Dialectal Knowledge Paper • 2601.07999 • Published Jan 12 • 1
tiantiaf/whisper-large-v3-msp-podcast-emotion-dim Audio Classification • 2B • Updated Aug 10, 2025 • 658 • 2
tiantiaf/whisper-large-v3-msp-podcast-emotion Audio Classification • 2B • Updated Aug 10, 2025 • 948 • 5
Vox-Profile Collection This collection includes implementations of the models described in the Vox-Profile benchmark (https://arxiv.org/pdf/2505.14648). • 14 items • Updated Dec 2, 2025 • 2