Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
-
Updated
May 19, 2024 - Python
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
a self-hosted webui for 30+ generative ai
Speech To Text AI using Whisper with Wake word detection
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
OpenAI .NET sdk - Azure OpenAI, ChatGPT, Whisper, and DALL-E
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
End-to-end platform for building voice first multimodal agents
Bash function to ease the transcription of audio files with OpenAI's whisper.
This project implement end to end realtime vietnamese speech recognition with PhoWhisper in Backend and frontend in React Native
Port of OpenAI's Whisper model in C/C++
(project) Implementing a GPT model with dimensional speech recognition
Uchinoko Studio is a web application designed to facilitate real-time voice conversations with AI.
The simplest way to serve AI/ML models in production
Transcribe is OpenAI's chatGPT based real time transcription, conversation, Language learning platform. It provides live transcripts from microphone and speaker. It generates a suggested conversation response using OpenAI's GPT API. It will read out the responses, simulating a real live conversation in English or another language.
Faster Whisper transcription with CTranslate2
A Telegram bot that turns voice memos into progress reports
Created by OpenAI
Released August 2021
Latest release 6 months ago