AnKaS: Development and Analysis of the Database of Livvi-Karelian Speech Annotations [INTERSPEECH 2024]
Updated Mar 13, 2024 - JavaScript
[Interspeech 2022] Tutorial - Learning from Weak Labels
Interspeech 2020 Accented English Speech Recognition Competition Data
Official repo for "Multi-Corpus Emotion Recognition Method based on Cross-Modal Gated Attention Fusion" in INTERSPEECH 2024
Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.
The implementation code for the paper "Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries"
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
Baseline Experiments for ReMASC dataset.
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
Interspeech 2019 experiments
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Reading list for research topics in Sound AI
A deep accent recognition network
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!