automatic-speech-recognition

NavodPeiris / speechlib

speechlib is a library that performs speaker diarization, transcription, and speaker recognition on an audio file to create transcripts labeled with actual speaker names.

ai automatic-speech-recognition transcription speaker-recognition speaker-verification speaker-diarization whisper-ai faster-whisper

Updated May 27, 2024
Python

ieasybooks / tafrigh

Transcribing visual or audio materials into text

python youtube subtitles srt vtt automatic-speech-recognition whisper audio-processing asr stable-whisper faster-whisper ctranslate2 whisper-jax

Updated May 26, 2024
Python

MooersLab / bash-whisper-transcription

Bash function to ease the transcription of audio files with OpenAI's Whisper.

audio bash automation automatic-speech-recognition speech-to-text beginner-friendly stt whisper automate-the-boring-stuff asr bash-function audio-messages audio-file-trancription

Updated May 25, 2024
Python

leduckhai / MultiMed

Multilingual Multitask Multipurpose Medical Speech Recognition

machine-learning natural-language-processing deep-learning artificial-intelligence automatic-speech-recognition

Updated May 25, 2024
Python

mydroidandi / commbase-stt-whisper-proactive-p

A proactive version of the STT engine for Commbase.

python engine speech-recognition automatic-speech-recognition speech-to-text stt asr commbase libcommbase commbase-stt-whisper-p commbase-stt-vosk-p

Updated May 25, 2024
Python

mydroidandi / commbase-stt-whisper-reactive-p

A reactive and remote-ready version of the STT engine for Commbase.

android python ssh raspberry-pi remote-control engine assistant speech-recognition recorder automatic-speech-recognition stt assistive-technology remote-access-tool secure-shell openai-whisper commbase

Updated May 25, 2024
Python

EricApgar / live-speech-to-text

Live speech to text transcription.

raspberry-pi offline automatic-speech-recognition asr hugging-face

Updated May 24, 2024
Python

th-schmidt / whisply

Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!

subtitles speech-recognition automatic-speech-recognition speech-to-text whisper-ai

Updated May 24, 2024
Python

csikasote / BembaSpeech

This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: literature books, radio/TV show transcripts, YouTube video transcripts, and online sources. The corpus has 14,438 utterances totaling over 24 hours of speech.

automatic-speech-recognition low-resource-languages bemba

Updated May 23, 2024

matiuste / DistriBlock

[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.

machine-learning automatic-speech-recognition uncertainty-quantification adversarial-examples

Updated May 23, 2024
Python

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated May 20, 2024
Python

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

speech-recognition automatic-speech-recognition speech-processing speech-separation speech-enhancement far-field-speech-recognition diarization multi-speaker-asr meeting-transcription

Updated May 16, 2024
Python

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

music-information-retrieval automatic-speech-recognition speech-to-text audio-processing music-ai music-processing large-language-models foundational-models speech-ai audio-ai large-audio-models speech-llms large-language-model-speech

Updated May 14, 2024

bricewalker / Hey-Jetson

Deep learning-based automatic speech recognition with attention for the Nvidia Jetson.

Updated May 20, 2024
Jupyter Notebook

QubitPi / cmusphinx.github.io

CMUSphinx Website

jekyll documentation automatic-speech-recognition cmusphinx

Updated May 9, 2024
HTML

LD239 / WebTranscript

Interactive web tool for automatically ⚙️ transcribing and subtitling videos from URL or file uploads in your chosen language. The transcript appears alongside the video player, complete with embedded subtitles.

open-source web translation video-player video-annotation automatic-translation webvtt web-tool automatic-speech-recognition transcripts whisper web-tools transcript-editor automatic-transcription subtitles-generator webvtt-subtitles whisper-ai

Updated May 7, 2024
JavaScript

Here are 287 public repositories matching this topic...

aliencaocao / TIL-2023

winstxnhdw / CapGen

TensorSpeech / TensorFlowASR

wenet-e2e / wenet

NavodPeiris / speechlib

ieasybooks / tafrigh

MooersLab / bash-whisper-transcription

leduckhai / MultiMed

mydroidandi / commbase-stt-whisper-proactive-p

mydroidandi / commbase-stt-whisper-reactive-p

EricApgar / live-speech-to-text

th-schmidt / whisply

csikasote / BembaSpeech

matiuste / DistriBlock

ahmetoner / whisper-asr-webservice

chimechallenge / chime-utils

EmulationAI / awesome-large-audio-models

bricewalker / Hey-Jetson

QubitPi / cmusphinx.github.io

LD239 / WebTranscript
