Champion at Brainhack TIL 2023: Team 10000SGDMRT
-
Updated
May 29, 2024 - Jupyter Notebook
Champion at Brainhack TIL 2023: Team 10000SGDMRT
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Production First and Production Ready End-to-End Speech Recognition Toolkit
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
تفريغ المواد المرئية أو المسموعة إلى نصوص
Bash function to ease the transcription of audio files with OpenAI's whisper.
Multilingual Multitask Multipurpose Medical Speech Recognition
A proactive version of STT engine for Commbase
A reactive and remote-ready version of STT engine for Commbase
Live speech to text transcription.
Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/TV shows transcripts, Youtube Video transcripts, Online sources. The corpus has 14, 438 utterances culminating into over 24 hours of speech.
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.
OpenAI Whisper ASR Webservice API
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
CMUSphinx Website
Interactive web tool for automatically ⚙️ transcribing and subtitling videos from URL or file uploads in your chosen language. The transcript appears alongside the video player, complete with embedded subtitles.
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."