nnsvs-english-support

The original NNSVS English support.

GUIDE NOW AVAILABLE

Support for English NNSVS Dataset Creation
Now maintained by Intunist.

Support for UK, US, and Australian English has been fully confirmed. You are not necessarily expected to use all of the phonemes if your dialect lacks them. You may either drop those phonemes or label them for compatibility with existing scores.

This repo contains the files required to create an English dataset for use in NNSVS. Additionally, instructions and examples are provided (or will be provided) for the labeling process.

The phoneme set is based on arpabet has been carefully selected to be compatible with as many English-speaking dialects as possible.

NOTE: A pretrained model for English is not available at this time. Expect 8+ hours of audio to be required for a very high quality result. The estimated amount of audio required for a somewhat decent model is about 4 hours (without silence). A prototype can be made with less (~1 hour).

An example of a model trained with a 40 minute dataset, provided by HydroChromatic.

Severse.English.40m.mp4

And an 8.5m example from an earlier release.

Hydro_English_8.5m_Example_reverb.mp4

Additional Info and Directions

The HED file was written by hand for NNSVS, it may not work in other tools as-is. Two hed files are included. A minimal and full hed file. Both should work for NNSVS but using the full hed is suggested.

"DynamiVox English NN Phoneme Set v[x.x.x].txt" is a list of DynamiVox phonemes along with explanations of these phonemes. Example words are provided for both US and UK English. If no UK example is provided then the example is the same for both. This phoneme set is NOT COMPLETELY COMPATIBLE with Arpabet. There are several additions/changes that cause the DynamiVox phoneme set to not be 1:1. However, the basic phoneme section should be about the same.

"DynamiVox Phone Ref.txt" is a cleaned up phoneme document for reference while labeling.

the /dic folder contains several files:
/english.conf - phoneme information
/amepd_dvx.table - DYVAUX (heavily) modified AMEPD dictionary
/blank.table - basic dictionary for phonetic training
/table credit.txt - credits for dictionaries

the /LAB_EXAMPLES folder contains EXAMPLES ON HOW TO LABEL singing. These micro-datasets cannot be trained and are for reference only. Usage of these datasets beyond this is strictly prohibited. Multiple dialects are included to assist in properly labeling speech. Please refer to "LABEL_HOWTO.txt" in the /LAB folder for more information.

You can change the hed file in config.yaml, this is found at /train/config.yaml.

NOTE: please change the value of "in_dim" in /train/conf/train/*/model/*.yaml The values should be set based on the values provided in the HED files. Training will not work if these values are incorrect.

A the time of writing, NNSVS doesn't appear to support multi-syllable words in the table. The UST/score will need to be written phonetically. For phonetic usts, don't use english.table, use blank.table. Otherwise it may try to match phonemes to pronuncations and fail with ValueError: could not broadcast input array from shape (###,476) into shape (###,476).

Version 0.6.0 adds support for ENUNU's flag system for swapping voice timbre.

Here is a list of the currently supported flags:

Flag	Purpose
F	falsetto
H	head voice
SF	soft
ST	strong
OPN	open_wide_vowel
CLS	closed_narrow_vowel
W	whisper_devoiced
BR	bright_resonance
DR	dark_resonance
N	nasal
1	additional_1
2	additional_2
3	additional_3

Japanese support

v0.9.0 adds supplimental japanese support. This can be used with a flag (ex: 1) to made a multi-lang model.

JP Phone	EN Phone
a	aa
e	eh
i	iy
o	ow
u	uw
A	ah
E	ey
I	ih
O	ao
U	uh
N	en
r	dx
ts	th
[C]y	[C] y

Name		Name	Last commit message	Last commit date
Latest commit History 325 Commits
LAB_EXAMPLES		LAB_EXAMPLES
dic		dic
hed		hed
shiro_files		shiro_files
.gitattributes		.gitattributes
LICENSE.md		LICENSE.md
PHONEMES.md		PHONEMES.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LAB_EXAMPLES

LAB_EXAMPLES

dic

dic

hed

hed

shiro_files

shiro_files

.gitattributes

.gitattributes

LICENSE.md

LICENSE.md

PHONEMES.md

PHONEMES.md

README.md

README.md

Repository files navigation

nnsvs-english-support

GUIDE NOW AVAILABLE

Additional Info and Directions

Japanese support

About

Releases

License

intunist/nnsvs-english-support

Folders and files

Latest commit

History

Repository files navigation

nnsvs-english-support

Additional Info and Directions

Japanese support

About

Topics

Resources

License

Stars

Watchers

Forks