TTS & Speech to Text - a samsam55 Collection

TTS & Speech to Text

updated Oct 16, 2025

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3, 2025 • 11

ResembleAI/chatterbox

Text-to-Speech • Updated Sep 23, 2025 • 506k • 1.4k

thewh1teagle/phonikud

0.3B • Updated Aug 24, 2025 • 318 • 1

UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15, 2025 • 62