Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
-
Updated
May 17, 2024 - Python
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.
Anaouder mouezh e Brezhoneg gant Vosk
AI voice assistant made with python and powered by Gemini, Mistral and PHI-3
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
Faster Whisper transcription with CTranslate2
video captioning software
Python-based application designed to convert speech to text in real-time.
簡單的yt影片或本機語音文件轉文字程式-獲取高品質的數據集
Open Voice OS Status Page
TikTok video scraping and multimodal content analysis tool.
OBS plugin for local speech recognition and captioning using AI
🧠 Leon is your open-source personal assistant.
A Python library for solving reCAPTCHA v2 and v3 with Playwright
A simple speech to text script using openai and python
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!
Service app for transcribing WhatsApp voice messages. Powered by whatsmeow and openai/whisper.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."