aastroza/whisper-echoes

WhisperEchoes [Proof of Concept]

Experience the wonder of conversation without language barriers. This nimble audio translator lends you the power to seamlessly converse in a multitude of languages. It listens, interprets, and echoes back in the language of your choice—effortlessly, accurately, instantly.

WhisperEchoes is a real-time translation application that uses OpenAI's SDK and Instructor for on-the-fly speech-to-text and text-to-speech conversion. It is designed for direct communication between two people who do not share a common language: each utterance is translated and spoken aloud immediately, so each party can speak and listen in their native language.
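The flow described above (speech-to-text, translation, text-to-speech) can be pictured with a minimal sketch. It is not the project's actual code: `Backends`, `relay`, and the stand-in lambdas are hypothetical names chosen for illustration, and the fake backends only exist so the example runs offline without any API key.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class Backends:
    """Hypothetical container for the three stages (not the project's API)."""
    transcribe: Callable[[bytes, str], str]    # (audio, source_lang) -> text
    translate: Callable[[str, str, str], str]  # (text, source_lang, target_lang) -> text
    synthesize: Callable[[str, str], bytes]    # (text, target_lang) -> audio


def relay(audio: bytes, source: str, target: str, backends: Backends) -> Tuple[str, str, bytes]:
    """Run one utterance through speech-to-text, translation, and text-to-speech."""
    heard = backends.transcribe(audio, source)
    translated = backends.translate(heard, source, target)
    spoken = backends.synthesize(translated, target)
    return heard, translated, spoken


# Stand-in backends so the sketch runs offline. A real session would call a
# speech-recognition, a translation, and a speech-synthesis service here.
fake = Backends(
    transcribe=lambda audio, lang: audio.decode("utf-8"),
    translate=lambda text, src, dst: f"[{src}->{dst}] {text}",
    synthesize=lambda text, lang: text.encode("utf-8"),
)

heard, translated, spoken = relay(
    "Había una vez un perro volador".encode("utf-8"), "es", "en", fake
)
print(heard)
print(translated)
```

Keeping the three stages behind plain callables is only one possible design; it makes each stage easy to swap or to test without network access.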

Use Case

In this example, a Spanish speaker said: "Había una vez un perro volador" ("Once upon a time there was a flying dog").

Features

  • Fast Translation: Engage in conversations with speakers of any supported language using immediate audio translation.
  • OpenAI API Integration: Uses the OpenAI API for speech recognition and synthesis.
  • User-Friendly Interface: Simple terminal interface powered by Textual.
  • Continuous Interaction: Designed for ongoing dialogue, allowing for multiple exchanges in a single session.

Installation

conda create --name whisper-echoes -c conda-forge python=3.11
conda activate whisper-echoes
pip install -r requirements.txt

Usage

To start a real-time translation session for a conversation, run the following command with two ISO 639-1 language codes:

python app.py <language_code_1> <language_code_2>

Example for translating a conversation between an English speaker and a Spanish speaker:

python whisper_echoes.py en es

When the application starts, it prompts each user in turn to press 'Enter' and then speak. It then translates the speech and reads the translation aloud in the other user's language.
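The turn-taking described above can be sketched as a loop that swaps speaker and listener after every exchange. This is an illustration under assumptions, not the application's real control flow: `turn_order`, `run_session`, and the `handle_turn` callback are hypothetical names, and the demo replaces the keyboard prompt and the audio handling with no-ops so that it runs unattended.

```python
from itertools import cycle, islice
from typing import Callable, Iterator, List, Optional, Tuple


def turn_order(lang_a: str, lang_b: str) -> Iterator[Tuple[str, str]]:
    """Yield (speaker, listener) language pairs, swapping roles every turn."""
    return cycle([(lang_a, lang_b), (lang_b, lang_a)])


def run_session(
    lang_a: str,
    lang_b: str,
    handle_turn: Callable[[str, str], None],
    wait: Callable[[str], object] = input,
    max_turns: Optional[int] = None,
) -> None:
    """Prompt before each turn, then pass it to handle_turn(speaker, listener).

    With max_turns=None the loop runs until the process is interrupted.
    """
    turns = turn_order(lang_a, lang_b)
    if max_turns is not None:
        turns = islice(turns, max_turns)
    for speaker, listener in turns:
        wait(f"[{speaker} -> {listener}] Press Enter, then speak: ")
        handle_turn(speaker, listener)


# Offline demo: no microphone or keyboard, just a log of who spoke to whom.
log: List[Tuple[str, str]] = []
run_session(
    "en", "es",
    handle_turn=lambda speaker, listener: log.append((speaker, listener)),
    wait=lambda prompt: None,
    max_turns=3,
)
print(log)  # [('en', 'es'), ('es', 'en'), ('en', 'es')]
```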

Supported Languages

Afrikaans (af), Arabic (ar), Armenian (hy), Azerbaijani (az), Belarusian (be), Bosnian (bs), Bulgarian (bg), Catalan (ca), Chinese (zh), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Finnish (fi), French (fr), Galician (gl), German (de), Greek (el), Hebrew (he), Hindi (hi), Hungarian (hu), Icelandic (is), Indonesian (id), Italian (it), Japanese (ja), Kannada (kn), Kazakh (kk), Korean (ko), Latvian (lv), Lithuanian (lt), Macedonian (mk), Malay (ms), Marathi (mr), Maori (mi), Nepali (ne), Norwegian (no), Persian (fa), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Serbian (sr), Slovak (sk), Slovenian (sl), Spanish (es), Swahili (sw), Swedish (sv), Tagalog (tl), Tamil (ta), Thai (th), Turkish (tr), Ukrainian (uk), Urdu (ur), Vietnamese (vi), and Welsh (cy).
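As an illustration of how the two command-line codes could be checked against the list above, here is a minimal sketch using Python's standard `argparse`. It is an assumption about one reasonable approach, not the project's own argument handling; `SUPPORTED_CODES`, `language_code`, and `parse_args` are hypothetical names.

```python
import argparse

# The 57 ISO 639-1 codes from the list above.
SUPPORTED_CODES = frozenset(
    "af ar hy az be bs bg ca zh hr cs da nl en et fi fr gl de el he hi hu is id "
    "it ja kn kk ko lv lt mk ms mr mi ne no fa pl pt ro ru sr sk sl es sw sv tl "
    "ta th tr uk ur vi cy".split()
)


def language_code(value: str) -> str:
    """Normalize a code to lowercase and reject anything not in the list."""
    code = value.strip().lower()
    if code not in SUPPORTED_CODES:
        raise argparse.ArgumentTypeError(f"unsupported language code: {value!r}")
    return code


def parse_args(argv=None) -> argparse.Namespace:
    """Parse two positional language codes (hypothetical CLI layout)."""
    parser = argparse.ArgumentParser(description="Two-way spoken translation")
    parser.add_argument("language_code_1", type=language_code)
    parser.add_argument("language_code_2", type=language_code)
    return parser.parse_args(argv)


args = parse_args(["en", "ES"])
print(args.language_code_1, args.language_code_2)  # en es
```

Passing an unlisted code such as `xx` makes `argparse` print a usage error and exit, which gives the user feedback before any audio is recorded.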

License

This project is distributed under the MIT License. For more details, see the LICENSE.md file.
