kaldi-asr/kaldi is the official location of the Kaldi project.
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Port of OpenAI's Whisper model in C/C++
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System
A PyTorch-based Speech Toolkit
🧠 Leon is your open-source personal assistant.
💬 Speech recognition for your site
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
Gathers machine learning and TensorFlow deep learning models for NLP problems (1.13 < TensorFlow < 2.0)
Translate a video from one language to another and add dubbing.
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
Lingvo
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.