DIMEx100 (Diálogos Inteligentes Multimodales en Español)
To see all corpus holdings, click here.
The DIMEx100 Corpus offers read speech for Mexican Spanish. In total, 100 speakers read 50 sentences each in addition to a sample of 10 sentences read by all 100 speakers, with the sentences being selected based on phoneme representation. The speakers range from 18 to 36 years old, with 87% of the speakers being undergraduate students and the rest being graduate students. Out of the hundred speakers, 82 were born in Mexico City as well as living there, while the other 12 lived elsewhere in the country. The speaker sample is well balanced for gender (49 men and 51 women). The data are accompanied by time-aligned transcriptions, including phonemic ones using the MexBet system.
- Pineda, Luis A., Hayde Castellanos, Javier Cuétara, Lucian Galescu, Janet Juárez, Joaquim Llisterri, Patricia Pérez, and Luis Villaseñor. 2010. The Corpus DIMEx100: transcription and evaluation. Lang Resources & Evaluation 44: 347–370. DOI 10.1007/s10579-009-9109-9.
- Pineda, Luis A., Luis Villaseñor Pineda, Javier Cuétara, hayde Castellanos, and Ivonne López. 2004. DIMEx100: A New Phonetic and Speech Corpus for Mexican Spanish. IBERAMIA 2004, LNAI 2215, pp. 974-983.