Rising · 1 source · last seen 7h ago · first seen 7h ago
Fish Audio Releases S2: open-source, controllable and expressive TTS model
Fish Audio is open-sourcing S2, where you can direct voices for maximum expressivity with precision using natural language emotion tags like \[whispers sweetly\] or \[laughing nervously\]. You can generate multi-speaker dialogue in one pass, time-to-first-audio is 100ms, and 80+ languages are suppor
Lead: r/LocalLLaMA · Bigness: 29 · fishaudio · open-source · controllable · expressive
📡 Coverage: 10 · 1 news source
🟠 Hacker News: 0
🔴 Reddit: 70 · 197 upvotes across 1 sub
📈 Google Trends: 0
Receipts (all sources)
Fish Audio Releases S2: open-source, controllable and expressive TTS model
REDDIT · r/LocalLLaMA · 7h ago · ⬆ 197 · 💬 32
score 130