Collection of broadcast news video clips
-
Updated
Mar 20, 2017
Collection of broadcast news video clips
Semi Supervised Speaker Diarization with Gaussian Mixture Models
Scripts for LIUM SpkDiarization tools
Repository holding various implementation of specific NMF methods for speaker diarization
Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies on efficient use of temporal information from extracted audio features.
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.
🔭 Speaker diarization via transfer learning
Pytorch implementation of Generalized End-to-End Loss for speaker verification
Sample codes of Google Cloud Speech API's speaker diarization feature
Time delay neural network (TDNN) implementation in Pytorch using unfold method
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Our group's submission to the first DIHARD speaker diarization challenge held as a special session in INTERSPEECH '18.
speaker diarization in phone recording/电话录音中的说话人分离
Multimodal speaker diarization using pre-trained audio-visual synchronization model
Speaker diarization simulation built with python
speaker diarization using spectralcluster and Deeplearning
Add a description, image, and links to the speaker-diarization topic page so that developers can more easily learn about it.
To associate your repository with the speaker-diarization topic, visit your repo's landing page and select "manage topics."