Corpus de Investigación en Español de México del Posgrado de Ingeniería Eléctrica y Servicio Social (CIEMPIESS)

From MLML
Jump to: navigation, search

To see all corpus holdings, click here.

The Corpus de Investigación en Español de México del Posgrado de Ingeniería Eléctrica y Servicio Social (CIEMPIESS) is a corpus of Mexico City Spanish developed as part of the CIEMPIESS-UNAM Project, created by the Speech Processing Laboratory of the Faculty of Engineering of the National Autonomous University of Mexico. It offers 17.5 hours of recorded speech obtained from FM radio broadcasts, totalling 16 717 utterances. As part of the CIEMPIESS-UNAM Project, a lexicon was created using grapheme-to-phoneme correspondences and the text has been word-aligned in Praat TextGrids.

External Link

Reference

Hernández-Mena, Carlos D. and José A. Herrera-Camacho. 2014. "CIEMPIESS: A new open-sourced mexican spanish radio corpus". Proceedings of the European Language Resources Association: 371-375.