Text to Speech MLX model. #767

javileyes · 2024-05-09T09:35:35Z

There are MLX models for text generation (llama 3) and for text recognition (whisper) but I think that to have a complete NLP environment it would be necessary to create a text to scpeech MLX. How would it be possible to create, for example, an MLX model of facebook/fastspeech2-en-ljspeech?

awni · 2024-05-09T13:10:46Z

It should be possible. There is a port of Suno's Bark model already: https://github.com/j-csc/mlx_bark

I think it still depends on PyTorch for the encodec model though.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text to Speech MLX model. #767

Text to Speech MLX model. #767

javileyes commented May 9, 2024

awni commented May 9, 2024

Text to Speech MLX model. #767

Text to Speech MLX model. #767

Comments

javileyes commented May 9, 2024

awni commented May 9, 2024