> Phonological Corpus of Czech

Fonologický korpus češtiny (Phonological Corpus of Czech)

The phonological corpus of Czech consists of two parts: a lexical subcorpus and a textual subcorpus. The lexical subcorpus is
a phonologically and phonetically transcribed and annotated database of contemporary Czech lexis. It contains more than 275,000 lexical items which are recorded in the major Czech dictionaries and is supplemented with several smaller databases that map mainly proper nouns. The textual subcorpus is a phonological and phonetical transcription of 67 texts containing more than 3.2 million words.

The website includes a complete quantitative phonological analysis of both subcorpora (phoneme frequency, phoneme combinations, syllable types, etc.).