Fonologický korpus češtiny (Phonological Corpus of Czech)

The phonological corpus of Czech consists of two parts: a lexical subcorpus and a textual subcorpus. The lexical subcorpus is a phonologically and phonetically transcribed and annotated database of contemporary Czech lexis. It contains more than 275,000 lexical items recorded in the major Czech dictionaries, and it is supplemented with several smaller databases that mainly cover proper nouns. The textual subcorpus is a phonological and phonetic transcription of 67 texts containing more than 3.2 million words.

The website includes a complete quantitative phonological analysis of both subcorpora (phoneme frequency, phoneme combinations, syllable types, etc.).
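
As a rough illustration of what counts such as phoneme frequency and phoneme combinations involve, the following is a minimal Python sketch. It is not the corpus's own tooling: the function names, the input format (each word as a list of phoneme symbols), and the three-word sample are assumptions made only for this example.

```python
from collections import Counter


def phoneme_stats(transcriptions):
    """Count single phonemes and adjacent phoneme pairs.

    `transcriptions` is an iterable of words, each given as a list of
    phoneme symbols (an assumed input format, not the corpus's own).
    """
    phonemes = Counter()
    pairs = Counter()
    for word in transcriptions:
        phonemes.update(word)
        # Pair each phoneme with its successor inside the same word.
        pairs.update(zip(word, word[1:]))
    return phonemes, pairs


def relative_frequencies(counts):
    """Convert raw counts into proportions of the total."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {item: n / total for item, n in counts.items()}


# Hypothetical toy input: the Czech words "pes", "les", "sen".
sample = [["p", "e", "s"], ["l", "e", "s"], ["s", "e", "n"]]
phonemes, pairs = phoneme_stats(sample)
shares = relative_frequencies(phonemes)
```

Pairs are counted within words only, so no combination spans a word boundary; an analysis of running text might choose differently.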
