Projects

Some Speech and Audio AI projects.


[Jul 23, 2024]

For finetuning pre-trained vocos model with mels generated from any tts decoder use.

Vocos with finetuning


[Jun 8, 2023]

Speech to text, text to intent. REST API application that can be deployed in cloud.

Speech to intent classification


[May 28, 2023]

This is an experiment to check if we can clone a voice for the VITS tts. Here we will use tts models from MMS.

VITS with MMS for TTS


[Dec 16, 2020]

A speech synthesis system with prosody embeddings.

Universal TTS