C-Oral Rom (Integrated Reference Corpora for Spoken Romance Languages)

From MLML
Jump to: navigation, search

To see all corpus holdings, click here.

The C-Oral Rom Corpus is a corpus of spontaneous speech from four Romance languages (French, Italian, Portuguese and Spanish) totaling around 1.2 million words. Both a text transcription and prosodic alignment are available alongside the audio files, allowing for cross-linguistic comparisons. Social information (gender, age, geographical region, education and occupation) for each speaker is additionally available for analyses.

External Links