ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
A reactive and remote-ready version of the STT engine for Commbase
A proactive version of the STT engine for Commbase
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Faster Whisper transcription with CTranslate2
RyMind is a system integrating NLP and (computer vision) face recognition.
Speech-to-text AI using Whisper with wake word detection
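A fully automated (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and finally merges the subtitles with the video to produce the translated video.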
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
Gita Summarizer extracts key insights from the Bhagavad Gita, aiding comprehension. Built with Streamlit, Azure Speech-to-Text, and LLMWare for summarization.
🧠 Leon is your open-source personal assistant.
MuskanAi is a personal digital assistant capable of performing automation tasks such as controlling your devices, browsing the internet, and emotional understanding.
An open-source NLP-as-a-service project focused on providing state-of-the-art systems with ease. Training and inference via simple Docker commands.
Learn languages through subtitles. Upload your favorite subtitles. Voice speech.
This project implements end-to-end real-time Vietnamese speech recognition, with PhoWhisper in the backend and a React Native frontend.
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.