DIMEx100 (Diálogos Inteligentes Multimodales en Español)

The DIMEx100 Corpus offers read speech for Mexican Spanish. Each of the 100 speakers read 50 individual sentences plus a common set of 10 sentences read by all speakers; the sentences were selected for phoneme representation. The speakers range from 18 to 36 years old; 87% are undergraduate students and the rest are graduate students. Of the 100 speakers, 82 were born and lived in Mexico City, while the other 18 lived elsewhere in the country. The speaker sample is well balanced for gender (49 men and 51 women). The data are accompanied by time-aligned transcriptions, including phonemic ones using the MexBet system.

External Links