Speech to Text Project

Regtechtimes on MSN

Understanding How Audio and Video Transcription Converts Speech into Clear Text

In today’s digital world, audio and video content is everywhere. From lectures and podcasts to webinars and meetings, spoken ...

Yahoo

Project Gutenberg puts 5,000 audiobooks online for free using synthetic speech

Open book repository Project Gutenberg has turned thousands of its titles into audiobooks practically overnight using synthetic speech, available now for download or streaming on multiple services.

Geeky Gadgets

OpenAI AI Audio : TTS Speech-to-Text Audio Integrated Agents

OpenAI has introduced a series of AI audio models, fundamentally redefining how voice-based AI can be integrated into modern applications wit&h ChatGPT. These advancements include state-of-the-art ...

Engadget

Meta’s open-source speech AI recognizes over 4,000 spoken languages

Meta has created an AI language model that (in a refreshing change of pace) isn’t a ChatGPT clone. The company’s Massively Multilingual Speech (MMS) project can recognize over 4,000 spoken languages ...

SiliconANGLE

Amazon researchers develop cutting-edge Base TTS text-to-speech model

Amazon.com Inc. researchers have developed a new text-to-speech model, Base TTS, that can pronounce words more naturally than earlier neural networks. TechCrunch reported the project late Wednesday.

TechSpot

Project Gutenberg releases 5,000 free audiobooks using neural text-to-speech technology

Forward-looking: Audiobooks have gained popularity in recent years due to their accessibility, but recording them can be difficult and expensive. Researchers recently demonstrated an automated method ...

Hackaday

Robust Speech-to-Text, Running Locally On Quest VR Headset

[saurabhchalke] recently released whisper.unity, a Unity package that implements whisper locally on the Meta Quest 3 VR headset, bringing nearly real-time transcription of natural speech to the device ...

CNET

Speech Accessibility Project Aims to Make Voice Recognition More Inclusive

Tech giants are teaming up with researchers at the University of Illinois to improve speech recognition for people with disabilities. Abrar's interests include phones, streaming, autonomous vehicles, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results