speech-to-text

Here are 2,852 public repositories matching this topic...

alibaba-damo-academy / FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated May 17, 2024
Python

jackwuwei / gptspeaker

The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds, much like Apple Siri, Amazon Alexa, Google Nest Home, Mi XiaoAi, etc.

raspberry-pi ai smarthome chatbot tts speech-recognition speech-to-text voice-assistant chatgpt

Updated May 17, 2024
Python

gweltou / vosk-br

Breton speech recognition with Vosk

speech-to-text stt breton-language breton vosk vosk-models

Updated May 17, 2024
Python

Robitx / gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]

plugin vim ai lua neovim voice nvim openai cursor speech-to-text gpt copilot whisper tabnine gpt-4 gpt4 llm chatgpt codeium

Updated May 17, 2024
Lua

NafisRayan / AI-Voice-Assistant

AI voice assistant made with Python and powered by Gemini, Mistral, and Phi-3

python nlp text-to-speech ai voice voice-recognition gemini speech-recognition speech-to-text voice-control mistral voice-assistant phi huggingface llm

Updated May 17, 2024
Python

zh-plus / openlrc

Transcribe and translate voice into LRC subtitle files using Whisper and LLMs (GPT, Claude, et al.).

python lyrics speech-to-text whisper transcribe voice-to-text lyrics-generator subtitle-translation openai-api auto-subtitle faster-whisper openlrc

Updated May 17, 2024
Python

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated May 17, 2024
Python

sri0606 / VidCaptio

video captioning software

gui subtitles speech-to-text spee videocaptioning

Updated May 17, 2024
Python

a3ro-dev / voiceTypingApp

Python-based application designed to convert speech to text in real-time.

python script side-project project python3 speech-synthesis speech-recognition speech-to-text speech-processing dad pyttsx3 googlespeech googlespeechapi

Updated May 17, 2024
Python

win10ogod / whisper-yt.transcribe

A simple program that converts YouTube videos or local audio files to text, for building high-quality datasets

nlp youtube dataset speech-to-text whisper

Updated May 17, 2024
Python

OpenVoiceOS / status

Open Voice OS Status Page

status text-to-speech translator monitoring alerting cuda sam nvidia tts uptime stats speech-to-text stt piper ovos upptime openvoiceos fasterwhisper mimic3

Updated May 17, 2024
Markdown

kariemoorman / tiktok-analyzer

TikTok video scraping and multimodal content analysis tool.

nlp docker sentiment-analysis video-processing face-detection object-detection speech-to-text snyk tiktok

Updated May 17, 2024
Python

occ-ai / obs-localvocal

OBS plugin for local speech recognition and captioning using AI

plugin ai speech-to-text obs whisper

Updated May 17, 2024
C++

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated May 17, 2024
Python

Xewdy444 / Playwright-reCAPTCHA

A Python library for solving reCAPTCHA v2 and v3 with Playwright

library recaptcha solver asyncio speech-to-text playwright

Updated May 17, 2024
Python

WeNeedCoffee / Speech-To-Text

A simple speech-to-text script using OpenAI and Python

speech-recognition openai speech-to-text whisper auto-typing

Updated May 17, 2024
Python

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated May 17, 2024
C++

botbahlul / crx-live-translate

Chrome/Edge browser extension that can recognize any live audio/video stream, translate it for free (using the unofficial online Google Translate API), and display it as a live caption or live subtitle.

javascript chrome edge voice-recognition speech-recognition browser-extension speech-to-text google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated May 17, 2024
JavaScript

hoehermann / whatsmeow-transcribe

Service app for transcribing WhatsApp voice messages. Powered by whatsmeow and openai/whisper.

speech-to-text whatsapp-bot

Updated May 17, 2024
Go

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated May 16, 2024
Python
