OpenAI Launches Whisper API for Text and Translation

OpenAI has launched the Whisper API, a hosted version of the open-source Whisper speech-to-text model it launched in September. Whisper is priced at US$0.006 per minute. It is an automated speech recognition system that OpenAI claims enables "robust" transcription in multiple languages and translation from those languages into English. Receives files in a variety of formats including M4A, MP3, MP4, MPEG, MPGA, WAV and WEBM

Also Read: Hexa Receives $20.5 Million Investment

The Whisper API is the same great model you can get as open source

Many organizations have developed highly capable speech recognition systems that are at the heart of the software and services offered by tech giants like Google, Amazon and Meta. It has reportedly been trained on 680,000 hours of multilingual and "multitasking" data collected from the Web, leading to better recognition of unique accents, background noise and technical jargon.

However, Whisper has its limitations. The system is trained on large amounts of noisy data. So OpenAI warns that Whisper can add words to its transcriptions that are not actually spoken. Whisper may also not perform equally well across languages, as it suffers from a higher error rate when it comes to speakers of languages that are not well represented in the training data.

This last part is nothing new in the world of speech recognition. Biases have long plagued even the best systems, according to a 2020 Stanford study that found systems from Amazon, Apple, Google, IBM and Microsoft. Despite this, OpenAI sees Whisper's transcription capabilities being used to improve existing applications, services, products and tools. The AI-powered language learning app Speak is already using the Whisper API to power a new in-app virtual speech assistant. If OpenAI can tap into the huge speech-to-text market, it could be highly profitable for the Microsoft-backed company.

OpenAI Launches Whisper API for Text and Translation

The Whisper API is the same great model you can get as open source

Mizanplus Kitchens Receives $1 Million Investment

AI Startup Sahara AI Receives $43 Million Investment

Kiteworks Receives $456 Million Investment for Sensitive Data Security

EliseAI Receives $75 Million Investment

No comments yet for this news, be the first one!...

Leave a Comment:

Mizanplus Kitchens Receives $1 Million Investment

AI Startup Sahara AI Receives $43 Million Investment

Kiteworks Receives $456 Million Investment for Sensitive Data Security

EliseAI Receives $75 Million Investment

Syfe Receives $27 Million New Investment

Gowit Enhances Advertising Platform with $1.3 Million Investment

Gaussion Receives €10.9 Million Investment

OW Smell Made Digital Receives 22 Million Pounds of Investment!

AI-Powered Marketing Platform Userled Receives 4 Million Pound Pre-Seed Investment

Clare&me Receives €3.7 Million Investment

Anduril Industries Receives $1.5 Billion Investment and Reaches $14 Billion Valuation

LightSolver Receives €12.5 Million from the European Innovation Council

Stori Receives $212 Million Investment and Unicorn Status

OpenAI Launches Whisper API for Text and Translation

The Whisper API is the same great model you can get as open source

Related Posts

No comments yet for this news, be the first one!...

Leave a Comment: