PyTorch implementation of Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This fork has been instrumented with Weights & Biases to enable experiment tracking, prediction logging, dataset and model versioning, and hyperparameter optimization.
This implementation uses the LJSpeech dataset.
- NVIDIA GPU + CUDA cuDNN
- Run `pip install -r requirements.txt`
- Run `wandb init` to configure your working directory to log to Weights & Biases.
- Run `python register-data.py` to create a reference Artifact pointing to the LJSpeech dataset (the Artifact calls involved are sketched after this list).
- Run `python split-data.py` to create a versioned train/validation split of the data.
- Run `python register-model ...` to log pre-trained tacotron and waveglow models as Artifacts to Weights & Biases.
- Run `python train.py <dataset-artifact>` to warm-start train tacotron2 on the dataset you created (consuming an Artifact is sketched below).
- Run `python inference.py <tacotron-artifact> <waveglow-artifact> <text>` to run inference on a text file containing newline-delimited sentences. The inference results will be logged to Weights & Biases as a `wandb.Table` (see the final sketch below).
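For orientation, the registration scripts boil down to logging W&B Artifacts. The sketch below shows the general shape of those calls, assuming hypothetical project, artifact, and file names; it is not a copy of `register-data.py` or `register-model`.

```python
import wandb

# Hypothetical project and job names for illustration.
run = wandb.init(project="tacotron2", job_type="register")

# A reference Artifact stores a checksummed pointer to the data,
# leaving the files where they already live.
data_art = wandb.Artifact("ljspeech", type="dataset")
data_art.add_reference("file:///data/LJSpeech-1.1")  # hypothetical local path
run.log_artifact(data_art)

# Pre-trained weights can be logged the same way, uploading the file itself.
model_art = wandb.Artifact("tacotron2-pretrained", type="model")
model_art.add_file("tacotron2_statedict.pt")  # hypothetical checkpoint filename
run.log_artifact(model_art)

run.finish()
```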
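On the consuming side, passing `<dataset-artifact>` to `train.py` presumably resolves to a `use_artifact` call, which both records the dataset version in the run's lineage and fetches a local copy. A minimal sketch, with an assumed artifact name:

```python
import wandb

run = wandb.init(project="tacotron2", job_type="train")

# Declaring the input ties this run to the exact dataset version used.
dataset = run.use_artifact("ljspeech-split:latest")  # assumed name:alias
data_dir = dataset.download()  # local directory containing the split

# ...construct filelists and dataloaders from data_dir, then train...
run.finish()
```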
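Finally, the `wandb.Table` logged by `inference.py` pairs each input sentence with its synthesized audio. A minimal sketch of that logging pattern, where the column names, input filename, and waveform are placeholders (LJSpeech audio is 22050 Hz):

```python
import numpy as np
import wandb

run = wandb.init(project="tacotron2", job_type="inference")

table = wandb.Table(columns=["text", "audio"])  # hypothetical column names
with open("sentences.txt") as f:                # newline-delimited input
    for sentence in f:
        waveform = np.zeros(22050)              # stand-in for synthesized audio
        table.add_data(sentence.strip(), wandb.Audio(waveform, sample_rate=22050))

run.log({"inference_samples": table})
run.finish()
```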
- WaveGlow: a faster-than-real-time flow-based generative network for speech synthesis.
- nv-wavenet: a faster-than-real-time WaveNet.
This implementation uses code from the following repos: Keith Ito and Prem Seetharaman, as described in our code.
We are inspired by Ryuichi Yamamoto's Tacotron PyTorch implementation.
We are thankful to the Tacotron 2 paper authors, especially Jonathan Shen, Yuxuan Wang, and Zongheng Yang.