End-to-End Speech Processing Toolkit
Updated May 24, 2024 - Python
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
A PyTorch-based Speech Toolkit
Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning
On-device Speech-to-Intent engine powered by deep learning
A project that uses the XLM-RoBERTa model for intent detection, intended as a resource for developers building and fine-tuning intent detection models.
Open-source code for the ICONIP 2023 paper “A deep joint model of Multi-Scale intent-slots Interaction with Second-Order Gate for SLU”.
Real-time Spoken Language Understanding for Orthopedic Training in Virtual Reality
"An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding", accepted at INTERSPEECH 2023.
Learning a common representation space from speech and text for cross-modal retrieval given textual queries and speech files.
Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"
Dataset Release for Intent Classification from Speech
Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"
A spoken question answering dataset on SQuAD
Dataset Release for Phone Number Entity capture task
Real-time Japanese speech recognition translator using wav2vec2
Library for training visually-grounded models of spoken language understanding.
Cross-domain slot filling task with BERT
ODSQA: Open-Domain Spoken Question Answering Dataset