Speech Recognition in Python

Top 10 Open Source Python Libraries for Voice Agents in 2025

Overview Open source Python libraries empower developers to build advanced, customizable voice agents with full ...

Progress and Prospects for Speech Recognition in Broadcasting

Speaker-independent large-vocabulary continuous speech recognition is now in use in a closed-captioned broadcasting service for certain NHK news, music, sports, and lifestyle programming. The system ...

Geeky Gadgets

NVIDIA Parakeet 2 vs OpenAI Whisper: Which AI Speech Recognition Model Wins?

What if the race to perfect AI speech recognition wasn’t just about accuracy but also speed and usability? In a world where audio-to-text transcription powers everything from virtual meetings to ...

Science Daily

Automatic speech recognition on par with humans in noisy conditions

Are humans or machines better at recognizing speech? A new study shows that in noisy conditions, current automatic speech recognition (ASR) systems achieve remarkable accuracy and sometimes even ...

Science Daily

Machine listening: Making speech recognition systems more inclusive

One group commonly misunderstood by voice technology are individuals who speak African American English, or AAE. Researchers designed an experiment to test how AAE speakers adapt their speech when ...

VentureBeat

Mistral’s Voxtral goes beyond transcription with summarization, speech-triggered functions

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Mistral released an open-sourced voice ...

CU Boulder News & Events

Fine-tuning a Strong Language model to Enable Classroom Speech Recognition

Postdoctorate Viet Anh Trinh led a project within Strand 1 to develop a novel neural network architecture that can both recognize and generate speech. He has since moved on from iSAT to a role at ...

EDN

IoT: GenAI voice helps generate speech recognition models

A new generative AI feature brings voice recognition to tiny devices with a text-to-speech (TTS) synthetic dataset generation capability. It enables developers to generate synthetic speech data with ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results