Skip to content

Popular repositories

  1. FunASR FunASR Public

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Python 3.8k 436

  2. FunClip FunClip Public

    Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

    Python 1.8k 174

  3. 3D-Speaker 3D-Speaker Public

    A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

    Python 740 56

  4. KAN-TTS KAN-TTS Public

    KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

    Python 442 71

  5. FunCodec FunCodec Public

    FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

    Python 291 23

  6. former3d former3d Public

    Python 95 9

Repositories

Showing 10 of 22 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…