Draft:Fish Audio


Fish Audio
Developer(s)Hanabi AI
Initial release2024
Stable release
S1 / S1-mini
TypeText-to-speech
LicenseProprietary (platform), Open source (selected models)
Websitefish.audio

Fish Audio is a text-to-speech (TTS) platform developed by the American artificial intelligence research company Hanabi AI. The platform provides speech synthesis and speech recognition using machine learning. Several of its TTS models have been released as open source on GitHub and Hugging Face.[1][2]

History

[edit]

Fish Audio was launched on April 29, 2024 with the release of Fish Speech v1.0.0, an open-source text-to-speech model.[3][4]

On September 12, Fish Audio released Fish Speech v1.4, trained on approximately 700,000 hours of multilingual audio data.[5] Versions v1.5 and v1.6 followed later in December 2024 and March 2025.[6]

In March 2025, Hanabi AI, the developer of Fish Audio, was accepted into the HF0 Residency startup accelerator as part of its W25 cohort.[7]

On 2 June 2025, the platform introduced Fish Audio S1 (also known as OpenAudio S1), a 4 billion parameter model available on its web service. A distilled 0.5 billion parameter version, S1-mini, was released as open source on Hugging Face.[8][9]

Products

[edit]
  • Fish Speech v1.0–v1.6 – successive text-to-speech models released between 2024 and 2025, with multilingual support.[10]
  • Fish Audio S1 – large-scale text-to-speech model (4B parameters) released in June 2025.[11]
  • Fish Audio S1-mini – distilled version of S1 (0.5B parameters), released in June 2025 as open source on Hugging Face.[12]

Reception

[edit]

36Kr reported in 2025 that Fish Audio had achieved around US$5 million in annual recurring revenue, citing Hanabi AI as an example of a lean AI company reaching notable scale.[13] MarkTechPost described Fish Speech v1.4 as a multilingual open-source TTS model with instant voice cloning and low-latency output.[14] Fish Audio has also been included in the Text-to-Speech Arena leaderboard maintained by ArtificialAnalysis.ai, which compares different speech synthesis systems.[15]

References

[edit]
  1. ^ "Releases · fishaudio/Fish-speech". GitHub.
  2. ^ "Fishaudio (Fish Audio)". 25 March 2025.
  3. ^ "Release v1.0.0 · fishaudio/Fish-speech". GitHub.
  4. ^ https://fish.audio/
  5. ^ "Fish Audio Introduces Fish Speech 1.4: A Powerful, Open-Source Text-to-Speech Model with Multilingual Support, Instant Voice Cloning, and Lightning-Fast Performance". 13 September 2024.
  6. ^ "Releases · fishaudio/Fish-speech". GitHub.
  7. ^ https://www.bloomberg.com/news/features/2025-04-15/hf0-startup-accelerator-uses-meditation-to-push-founders?embedded-checkout=true
  8. ^ "Fishaudio/Openaudio-s1-mini · Hugging Face".
  9. ^ "Fish Audio Releases OpenAudio S1: A New Benchmark for AI Voice with Professional Dubbing Actor Quality".
  10. ^ "Releases · fishaudio/Fish-speech". GitHub.
  11. ^ "Fishaudio/Fish-speech". GitHub.
  12. ^ "Fishaudio/Openaudio-s1-mini · Hugging Face".
  13. ^ "新增10家上榜Ai应用小团队,他们靠"交付结果"年入千万美元-36氪".
  14. ^ "Fish Audio Introduces Fish Speech 1.4: A Powerful, Open-Source Text-to-Speech Model with Multilingual Support, Instant Voice Cloning, and Lightning-Fast Performance". 13 September 2024.
  15. ^ "Text to Speech Model Arena | Artificial Analysis".