Pytorch Conformer

Pytorch implementation of conformer model with training script for end-to-end speech recognition on the LibriSpeech dataset.

Usage

Train model from scratch:

python train.py --data_dir=./data --train_set=train-clean-100 --test_set=test_clean --checkpoint_path=model_best.pt

Resume training from checkpoint

python train.py --load_checkpoint --checkpoint_path=model_best.pt

Train with mixed precision:

python train.py --use_amp

For a full list of command line arguments, run python train.py --help. Smart batching is used by default but may need to be disabled for larger datasets. For valid train_set and test_set values, see torchaudio's LibriSpeech dataset. The model parameters default to the Conformer (S) configuration. For the Conformer (M) and Conformer (L) models, refer to the table below:

Other Implementations

TODO:

Language Model (LM) implementation
Multi-GPU support
Support for full LibriSpeech960h train set
Support for other decoders (ie: transformer decoder, etc.)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
model.py		model.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

model.py

model.py

train.py

train.py

utils.py

utils.py

Repository files navigation

Pytorch Conformer

Usage

Train model from scratch:

Resume training from checkpoint

Train with mixed precision:

Other Implementations

TODO:

About

Releases

Packages

Languages

License

jreremy/conformer

Folders and files

Latest commit

History

Repository files navigation

Pytorch Conformer

Usage

Train model from scratch:

Resume training from checkpoint

Train with mixed precision:

Other Implementations

TODO:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages