Module for preparing text data for TTS data collections, specifically for the Icelandic language.

Installation

The code should be mostly supported by any Python 3.* but Python 3.6 or higher is recommended. Install dependencies by simply running pip install -r requirements.txt

Introduction

The code includes methods helpful for all major steps in the text collection side of TTS data collecting:

Gathering data: utt_tools.py contains methods for parsing utterances from some of the open Icelandic text datasets.
Preprocessing: A preprprocessing routine is also found in utt_tools.py, designed specifically for these datasets.
Grapheme-to-Phoneme prediction: A Sequitur wrapper is found in sequitur_tools.py. A pretrained model file is also included in this repository under pron_data. The model path is configured in conf.py.
Scoring: prondata.py contains a large class, PronData.py. It has multiple methods that can prove convenient to others but most important is Prondata.score() which scores the utterances based on length and diphone density.

The PyTorch G2P model

Make sure dependencies are installed
Create a directory ./data and place the pronounciation dictionary there, e.g. ./data/prondict_ice.txt. An Icelandic pronounciation dictionary is available (as of writing this) at terra:/data/prondict_sr/v1/frambordabok_asr_v1.txt
Run python3 main.py which will use default arguments. Look at the documented code to make changes to parameters.
Results will be default be placed under ./results/<experiment_name> in the form of a torch state dictionary, mdl.ckpt. Currently logging is only in the form of printing using e.g. print(...). To store logs run python3 main.py > log.txt to save the logs.

Testing

Simple tests are found in tests.py which additionally demonstrate some of the main operations in this module.

Reading lists

Under reading_lists are 4 reading lists, varying in length. rl_full.txt contains a very high diphone coverage, containing almost 20 instances of every valid Icelandic diphone.

Licence

Licensed under the Apache 2.0 license.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Module for preparing text data for TTS data collections, specifically for the Icelandic language.

Installation

Introduction

The PyTorch G2P model

Testing

Reading lists

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
other		other
pron_data		pron_data
reading_lists		reading_lists
tests		tests
torchG2P		torchG2P
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bin_tools.py		bin_tools.py
conf.py		conf.py
prondata.py		prondata.py
requirements.txt		requirements.txt
sequitur_tools.py		sequitur_tools.py
utt_tools.py		utt_tools.py

License

cadia-lvl/tts_data

Folders and files

Latest commit

History

Repository files navigation

Module for preparing text data for TTS data collections, specifically for the Icelandic language.

Installation

Introduction

The PyTorch G2P model

Testing

Reading lists

Licence

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages