Speech to Text Open Source

News

ChatTTS a new open source AI voice text-to-speech AI model

ChatTTS is an open-source AI voice text-to-speech (TTS) model that has gained significant popularity on GitHub due to its impressive features and user-friendly design. This model is specifically ...

Slator · 4 days ago

Meet Whispering, an Open‑Source, Local‑First Transcription App

In the rapidly evolving world of speech-to-text technology, Whispering is a new entrant with a clear commitment to openness ...

TechCrunch · 9 years ago

Crowdsourced project aims to add text-to-speech to Wikipedia

An open source project hopes to draw on crowdsourced contributions to make Wikipedia more accessible by adding text to speech synthesis that will enable users of the online encyclopedia to have ...

6 days ago on MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good ...

"VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as ...

Slator · 2 days ago

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

Nature · 7 months ago

Striving for open-source and equitable speech-to-speech translation

US technology company Meta has produced an AI model that can directly translate speech in one language to speech in another.

Analytics India Magazine · 11 days ago

Microsoft Unveils VibeVoice, an Open-Source Text-to-Speech AI Model

VibeVoice can produce up to 90 minutes of synthetic dialogue with as many as four distinct speakers.

Geeky Gadgets · 5 months ago

OpenAI AI Audio : TTS Speech-to-Text Audio Integrated Agents

TL;DR Key Takeaways: OpenAI has introduced advanced speech-to-text and text-to-speech models, improving transcription accuracy, speed, and customization for dynamic voice interactions.

Hackaday · 7 years ago

DIY Text-to-Speech With Raspberry Pi - Hackaday

In the meantime, we have small, cheap computers and plenty of open source software to turn them into document readers. [rgrokett] built a RaspPi text reader to help an aging parent maintain their ...

VentureBeat · 5 months ago

OpenAI's new voice AI model gpt-4o-transcribe lets you add speech to ...

EliseAI, a company focused on property management automation, found that OpenAI’s text-to-speech model enabled more natural and emotionally rich interactions with tenants.
