Nordisk språkteknologi Corpora (NST)

To see all corpus holdings, click here.

The Nordisk Språkteknologi Corpora (NST) are a collection of 38 corpora (lexicons, speech data, n-gram models and written data) for Danish, Norwegian and Swedish. Where a language uses more than one orthography, there is a separate corpus for each orthography.

External Links