ANALIZA FLEKSYJNA TEKSTÓW HISTORYCZNYCH
I ZMIENNOŚĆ FLEKSJI POLSKIEJ
Z PERSPEKTYWY DANYCH KORPUSOWYCH
The subject matter of this paper is Chronofl eks, a computer system (http:// chronofl eks.nlp.ipipan.waw.pl/) modelling Polish infl ection based on a corpus material. The system visualises changes of infl ectional paradigms of individual lexemes over time and enables examination of the variability of the frequency of infl ected form groups distinguished based on various criteria. Feeding Chronofl eks with corpus data required development of IT tools to ensure an infl ectional processing sequence of texts analogous to the ones used for modern language; they comprise a transcriber, a morphological analyser, and a tagger. The work was performed on data from three historical periods (1601–1772, 1830–1918, and modern ones) elaborated in independent projects. Therefore, fi nding a common manner of describing data from the individual periods was a signifi cant element of the work. Keywords: electronic text corpus – natural language processing – infl ection of Polish – history of language