Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs Paper • 2603.21573 • Published 8 days ago • 1
CPRT Collection Compositional Privacy Risk Taxonomy: Benchmark and Models • 3 items • Updated about 11 hours ago • 1
FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System Paper • 2603.10420 • Published 20 days ago • 6
Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data Paper • 2603.07534 • Published 22 days ago • 5
AlexXu811/child-adult-joint-asr-diarization Automatic Speech Recognition • 0.2B • Updated Jan 31 • 49 • 2
End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions Paper • 2601.17640 • Published Jan 25 • 5
Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis Paper • 2601.14417 • Published Jan 20 • 5
VoxCog: Towards End-to-End Multilingual Cognitive Impairment Classification through Dialectal Knowledge Paper • 2601.07999 • Published Jan 12 • 1