Whisper (OpenAI)

Whisper (OpenAI)

Translate audio or video to text with language translation

Description

Whisper is an open-source automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It is designed to be robust to accents, background noise and technical language, and can transcribe and translate speech in multiple languages into English. It is a simple end-to-end approach, implemented as an encoder-decoder Transformer. It is also capable of performing language identification and phrase-level timestamps. It is designed to be easy to use and have high accuracy, allowing developers to add voice interfaces to more applications.

GitHub Repository

Note: This is a GitHub repository, meaning that it is code that someone created and made available for others to use. It typically requires some technical knowledge to set up and run.

Explore Similar AI Tools

View VideoDubber
VideoDubber

VideoDubber

A tool to translate and clone voices in videos.

Paid
Translation
View Translate.Video
Translate.Video

Translate.Video

Translate videos with just 1-Click

Freemium
Speech-To-Text
View OpenL
OpenL

OpenL

A tool for translation.

Freemium
Translation
View BlipCut AI Video Translator
BlipCut AI Video Translator

BlipCut AI Video Translator

A platform to translate, dub, and generate subtitles for multilingual videos.

Paid
Translation

AI news twice a week

Join 230,000+ readers getting the most important AI news and coolest tools every Wednesday and Friday.