#

speaker-diarization

Here are 102 public repositories matching this topic...

espnet / espnet

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 22, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated May 22, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated May 23, 2024
Jupyter Notebook

alibaba-damo-academy / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated May 23, 2024
Python

uis-rnn

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Aug 28, 2023
Python

awesome-diarization

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Mar 22, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated May 21, 2024
Jupyter Notebook

IBM-Cloud / chatbot-watson-android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

android java ibm-watson-services conversation-service watson chatbot dialog speech intent workspace entity conversation android-studio speaker-recognition watson-services ibm-watson speaker-diarization ibm-cloud ibm-cloud-solutions

Updated Nov 17, 2021
Java

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Apr 22, 2024
Python

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

speaker-recognition speaker-diarization uis-rnn ghostvlad vgg-speaker-recognition

Updated Jul 1, 2021
Python

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated May 23, 2024
Python

SpectralCluster

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

python machine-learning clustering unsupervised-learning constrained-clustering speaker-diarization spectral-clustering unsupervised-clustering auto-tune

Updated Jan 9, 2024
Python

diart

juanmc2005 / diart

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jan 4, 2024
Python

yinruiqing / pyannote-whisper

whisper asr speaker-diarization meeting-summarization pyannote chatgpt

Updated May 11, 2024
Python

manojpamk / pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-recognition speaker-verification speaker-diarization speaker-embeddings

Updated Nov 11, 2020
Python

alibaba-damo-academy / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker rdino cnceleb

Updated May 23, 2024
Python

hitachi-speech / EEND

End-to-End Neural Diarization

machine-learning deep-learning chainer end-to-end kaldi speaker-diarization eend

Updated Aug 30, 2021
Python

VidyasagarMSC / WatBot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Updated Dec 13, 2018
Java

cvqluu / TDNN

Time delay neural network (TDNN) implementation in Pytorch using unfold method

pytorch speech-recognition speaker-recognition speaker-verification speech-processing asr speaker-diarization tdnn x-vector

Updated Nov 21, 2019
Python

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition speaker-verification source-separation speaker-diarization speaker-identification

Updated Mar 20, 2024
Python

Improve this page

Add a description, image, and links to the speaker-diarization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speaker-diarization topic, visit your repo's landing page and select "manage topics."