#

vad

Here are 87 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 9, 2024
Python

RuntimeAudioImporter

gtreshchev / RuntimeAudioImporter

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

audio plugin mp3 audio-files audio-player mp3-player vad audio-formats unreal-engine ue4 blueprints audio-converter unreal-engine-4 voice-activity-detection ue4-plugin bink ue5 unreal-engine-5 ue5-plugin

Updated Jun 9, 2024
C++

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 9, 2024
Python

mgonzs13 / whisper_ros

silero-vad + whisper.cpp (speech-to-text) for ROS 2

speech-recognition vad speech-to-text ros2 voice-activity-detection whisper-cpp ggml

Updated Jun 5, 2024
C++

JarbasHiveMind / HiveMind-voice-sat

OpenVoiceOS Voice Satellite

voice-commands voice-recognition voice-chat vad mycroft voice-control stt voice-assistant hivemind wake-word-detection ovos wake-word openvoiceos

Updated Jun 4, 2024
Python

Yifei-ZHAO96 / Tr-VAD

Tr-VAD: An Efficient Transformer based Voice Activity Detection Model

vad voice-activity-detection

Updated Jun 4, 2024
Python

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Jun 3, 2024
Python

Acervans / lastfm_RS

LastFM recommendation with sentiment analysis (Bachelor Thesis Project)

nlp sentiment-analysis vad recommender-system lastfm-api

Updated Jun 3, 2024
Jupyter Notebook

LZ9 / Minerva

Minerva是一个便捷的音频工具，支持快速进行录音（PCM/MP3/WAV）和VAD端点检测识别，并保存活动语音。

mp3 wav recording vad pcm audioplayer

Updated May 23, 2024
C

emmanuelinfante / SubtitlesEveryone

Transcribe Like a Pro, Without Paying a Penny!

Updated May 9, 2024
Jupyter Notebook

IntendedConsequence / vadc

Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech

pytorch vad voice-activity-detection onnxruntime tinygrad silero-vad

Updated May 23, 2024
C++

thurti / vad-audio-worklet

Voice Activity Detection (VAD) AudioWorklet

speech vad voice-activity-detection audioworklet audioworkletprocessor

Updated Apr 18, 2024
JavaScript

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Apr 8, 2024
Python

shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

deep-learning speech-recognition vad speech-to-text whisper asr tensorrt voice-activity-detection tensorrt-llm

Updated Apr 5, 2024
Jupyter Notebook

aria

lef-fan / aria

A local and uncensored AI entity.

python bot text-to-speech ai deep-learning speech pytorch assistant vad speech-to-text voice-assistant large-language-models llm

Updated Mar 28, 2024
Python

eja / wav2vad

A command line tool for voice activity detection.

Updated Mar 23, 2024
C++

smacke / ffsubsync

Automagically synchronize subtitles with video.

Updated Mar 18, 2024
Python

baabaaox / go-webrtcvad

WebRTC Voice Activity Detection for Golang

go golang webrtc cgo vad webrtcvad

Updated Feb 17, 2024
C++

gkonovalov / android-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Feb 12, 2024
C

EtienneAb3d / WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Feb 6, 2024
Python

Improve this page

Add a description, image, and links to the vad topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."