Alibaba Damo Academy

FunASR Public

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 3.8k 436

FunClip Public

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Python 1.8k 174

3D-Speaker Public

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 740 56

KAN-TTS Public

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 442 71

FunCodec Public

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 291 23

former3d Public

Python 95 9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alibaba Damo Academy

Popular repositories

Repositories

People

Top languages

Most used topics