@IS2AI

ISSAI

Institute of Smart Systems and Artificial Intelligence

Popular repositories

  1. Kazakh_TTS Public

    An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers h…

    Shell 109 20

  2. SpeakingFaces Public

    A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer …

    Python 75 8

  3. TurkicASR Public

    A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

    Python 52 6

  4. ISSAI_SAIDA_Kazakh_ASR Public

    The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 …

    Shell 43 6

  5. TurkicTTS Public

    A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.

    Python 39 3

  6. thermal-facial-landmarks-detection Public

    SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.

    Jupyter Notebook 34 7

Repositories

Showing 10 of 53 repositories
  • Enhancing-Ambient-Assisted-Living-with-Multi-Modal-Vision-and-Language-Models Public

    This project aims to detect abnormal behaviour or emergencies using a vision-language model (VLM), a large language model (LLM), a human detection model, and text-to-speech (TTS) and speech-to-text (STT) models. The framework can detect subtle signs of an emergency and actively interact with the user to make an accurate decision.

    0 0 0 0 Updated May 22, 2024
  • TatarSCR Public

    An Open-Source Speech Commands Dataset for the Tatar Language

    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated May 16, 2024
  • COHI-O365 Public

    A synthetic fisheye dataset, described as the most diverse in number of images, labels, and classes, released with source code and models, along with a real dataset for benchmark testing.

    0 MIT 0 0 0 Updated May 12, 2024
  • KazQAD Public

    An open-source Kazakh Question Answering Dataset

    1 CC-BY-SA-4.0 0 0 0 Updated Apr 23, 2024
  • TatarTTS Public

    TatarTTS: An Open-Source Text-to-Speech Synthesis Dataset for the Tatar Language

    2 CC-BY-4.0 1 1 0 Updated Apr 20, 2024
  • KazSAnDRA Public

    An open-source Kazakh Sentiment Analysis Dataset of Reviews and Attitudes (KazSAnDRA) and baseline sentiment classification models

    Python 2 0 0 0 Updated Apr 19, 2024
  • OpenThermalPose Public

    An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines

    0 MIT 0 0 0 Updated Apr 15, 2024
  • KazEmoTTS Public

    An open-source Kazakh Emotional Text-to-Speech Dataset

    Python 19 3 0 0 Updated Apr 2, 2024
  • KazParC Public

    An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish

    Jupyter Notebook 3 0 0 0 Updated Mar 29, 2024
  • thermal-facial-landmarks-detection Public

    SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.

    Jupyter Notebook 34 MIT 7 0 0 Updated Mar 11, 2024