Speech to Text Open Source

News

Microsoft Unveils VibeVoice, an Open-Source Text-to-Speech AI Model

VibeVoice can produce up to 90 minutes of synthetic dialogue with as many as four distinct speakers. ...

Slator - 7 days ago

Meet Whispering, an Open‑Source, Local‑First Transcription App

In the rapidly evolving world of speech-to-text technology, Whispering is a new entrant with a clear commitment to openness ...

Slator - 5 days ago

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

TweakTown - 6 days ago

Microsoft's VibeVoice uses AI to create 90-minute podcasts with multiple speakers

VibeVoice is a new open-source AI tool that can generate a full 90-minute audio podcast recording with multiple speakers from ...

TechRadar - 19 days ago

Best speech-to-text app of 2025 - TechRadar

We list the best speech-to-text apps to make it simple and easy to dictate straight to your documents. Speech-to-text used to be regarded as very niche, used mainly by busy people who ...

13 days ago

Alibaba Introduces Open-Source Model For Digital Human Video Generation

S2V brings portraits to life. Alibaba has unveiled Wan2.2-S2V (Speech-to-Video), its latest open-source model designed for ...

WinBuzzer - 6 days ago

Microsoft Releases VibeVoice Open-Source AI Model for Generating Multi-Speaker Podcasts

Microsoft has launched VibeVoice, a new open-source AI model capable of generating up to 90 minutes of multi-speaker audio ...

9 days ago on MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good ...

Microsoft's road to total AI domination continues with an interesting-looking open-source project called VibeVoice. This text ...

Reuters - 14 days ago

How to Choose a Speech-to-Text Converter | Reuters

Key features, accuracy, and usability factors to consider when selecting the right speech-to-text converter for your needs ...

Sohu - 8 days ago

The Shock Release of the Open Source Speech Large Model Step-Audio 2 mini: A New Milestone ...

Recently, StepFun (Jieyue Xingchen) officially launched its latest open-source end-to-end large speech model, Step-Audio 2 mini. The release of this model not only marks a significant breakthrough in speech ...

9 hours ago

Bilibili's Self-Developed Voice Generation Model IndexTTS-2.0 Officially Open-Sourced ...

In contrast, IndexTTS-2.0 introduces a mechanism for precise duration control, achieving efficient duration management for the first time within an autoregressive framework. This innovation makes the ...

InfoWorld - 11 days ago

OpenAI adds MCP and SIP support to gpt-realtime for smarter voice-based agents

The new API features will help enterprises build autonomous, multimodal voice agents with remote tool access, PBX integration ...
