Santa Barbara

From MLML
Jump to: navigation, search

Santa Barbara

Location

The dataset was already on the MLML server

Audio

The audio was already in .wav format, and on the server

Alignment

Santa Barbara transcription files (.trn) are aligned only at the utterance level. A script was written to parse this into .textgrid format. Utterances that had less than a .15 second pause between each other were collapsed into a single utterance. The Librispeech English dictionary was used with MFA to align these utterances. A new acoustic model was trained.

Importing

Since the data was converted to textgrids, an importer already exists for PolyglotDB.