Rising1 sources· last seen 7h ago· first seen 7h ago

Fish Audio Releases S2: open-source, controllable and expressive TTS model

Fish Audio is open-sourcing S2, where you can direct voices for maximum expressivity with precision using natural language emotion tags like \[whispers sweetly\] or \[laughing nervously\]. You can generate multi-speaker dialogue in one pass, time-to-first-audio is 100ms, and 80+ languages are suppor

Lead: r/LocalLLaMABigness: 29fishaudioopen-sourcecontrollableexpressive

Open primary source

📡 Coverage

1 news source

🟠 Hacker News

🔴 Reddit

197 upvotes across 1 sub

📈 Google Trends

Full methodology: How scoring works

Receipts (all sources)

Fish Audio Releases S2: open-source, controllable and expressive TTS model

REDDIT · r/LocalLLaMA · 7h ago · ⬆ 197 · 💬 32

score 130