Speech to Text Open Source

资讯

ChatTTS a new open source AI voice text-to-speech AI model

ChatTTS is an open-source AI voice text-to-speech (TTS) model that has gained significant popularity on GitHub due to its impressive features and user-friendly design. This model is specifically ...

Nature7月

Striving for open-source and equitable speech-to-speech translation

Striving for open-source and equitable speech-to-speech translation US technology company Meta has produced an AI model that can directly translate speech in one language to speech in another.

Slator6 天

Meet Whispering, an Open‑Source, Local‑First Transcription App

In the rapidly evolving world of speech-to-text technology, Whispering is a new entrant with a clear commitment to openness ...

Slator4 天

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

7 天on MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good ...

Microsoft's road to total AI domination continues with an interesting looking open-source project called VibeVoice. This text ...

Geeky Gadgets5月

OpenAI AI Audio : TTS Speech-to-Text Audio Integrated Agents

TL;DR Key Takeaways : OpenAI has introduced advanced speech-to-text and text-to-speech models, improving transcription accuracy, speed, and customization for dynamic voice interactions.

Analytics India Magazine13 天

Microsoft Unveils VibeVoice, an Open-Source Text-to-Speech AI Model

VibeVoice can produce up to 90 minutes of synthetic dialogue with as many as four distinct speakers. GraphRAG is emerging as ...

CNET2 年

Meta’s New AI Can Translate Speech and Text for Nearly 100 Languages

Meta says that it's the biggest open-source multimodal dataset, containing 270,000 hours' worth of mined speech and text alignment on which its AI was trained.

Hackaday7 年

DIY Text-to-Speech With Raspberry Pi - Hackaday

In the meantime, we have small, cheap computers and plenty of open source software to turn them into document readers. [rgrokett] built a RaspPi text reader to help an aging parent maintain their ...

VentureBeat5月

OpenAI's new voice AI model gpt-4o-transcribe lets you add speech to ...

EliseAI, a company focused on property management automation, found that OpenAI’s text-to-speech model enabled more natural and emotionally rich interactions with tenants.

当前正在显示可能无法访问的结果。

隐藏无法访问的结果