# speech-recognition

Here are 4,614 public repositories matching this topic...

alibaba-damo-academy / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated May 22, 2024
Python

alibaba-damo-academy / FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated May 22, 2024
Python

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Updated May 22, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 22, 2024
Python

openvinotoolkit / openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated May 22, 2024
C++

espnet / espnet

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 21, 2024
Python

botbahlul / crx-live-translate

Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!

javascript chrome edge voice-recognition speech-recognition browser-extension speech-to-text google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated May 21, 2024
JavaScript

botbahlul / js-live-audio-video-translate

HTML Web template that can RECOGNIZE any live audio/video streaming (using Chrome webkitSpeechRecognition API) then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE

javascript html web voice-recognition speech-recognition google-translate web-template google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated May 21, 2024
JavaScript

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by Whisper models!

golang ui web ai subtitles webapp speech-recognition speech-to-text transcription stt whisper audio-to-text sveltekit web-whisper

Updated May 21, 2024
Svelte

alex-vt / WhisperInput

Offline voice input panel & keyboard with punctuation for Android.

offline speech-recognition speech-to-text android-app whisper-ai

Updated May 21, 2024
Java

huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audio speech-recognition whisper

Updated May 21, 2024
Python

code2k13 / pipico_speech_recognition

This repository contains code and instructions to implement single-word speech recognition on any board running CircuitPython.

python machine-learning tensorflow speech-recognition neural-networks digital-signal-processing circuitpython tinyml rp2040

Updated May 21, 2024
Jupyter Notebook

vinzh05 / VietGPT-VoiceBot-A-Vietnamese-Speech-Recognition-Chatbot

VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.

python speech-recognition speech-to-text chatgpt

Updated May 21, 2024
HTML

semperai / amica

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

ai computer-vision tts speech-recognition assistant-chat-bots llm

Updated May 21, 2024
TypeScript

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated May 21, 2024
Python

souradipp76 / Speaker-recognition

Segment speech sequences based on speaker transitions, using ML and DSP.

machine-learning speech-recognition digital-signal-processing speaker-recognition

Updated May 21, 2024
JavaScript

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated May 21, 2024
C

anujlunawat / Personal-Desktop-Assistant-Jarvis

Personal Desktop Assistant, Jarvis, built using Python.

Updated May 21, 2024
Python

andybi7676 / reborn-uasr

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

reinforcement-learning speech-recognition unsupervised-learning

Updated May 21, 2024
Python

piaseckijulian / Sentinel

🚀AI Voice Chatbot

ai sentinel speech-recognition

Updated May 22, 2024
Python
