Stylometrická analýza církevněslovanských textů české provenience
Stylometric Analysis of the Church Slavonic Texts of Czech Origin
Author(s): Radek Čech, Miroslav VeprekSubject(s): Language and Literature Studies, Theoretical Linguistics, Studies of Literature, Lexis, Czech Literature, Western Slavic Languages
Published by: AV ČR - Akademie věd České republiky - Slovanský ústav and Euroslavica
Keywords: stylometric analysis; Czech Church Slavonic; token length; lexical diversity; cluster analysis;
Summary/Abstract: The paper presents a pilot study of stylometric analysis of Czech Church Slavonic texts. The aim of the study is to measure similarities / differences among texts based on selected quantitative characteristics. Specifically, the average token length (ATL), moving average type-token ratio (MATTR), and text distances determined by normalized frequencies of the most frequent words (MFW) are applied. For the analysis, we used a corpus of twelve Church Slavonic literary writings attributed (with various probability) to Czech authors in the 10th and 11th centuries. In addition, two more textual sources were added (Codex Suprasliensis and the Life of St. Methodius) to compare the results and get a more complex view of relationships among texts. The results show the plausibility of the application of methods on this specific sample of texts.
Journal: Slavia - časopis pro slovanskou filologii
- Issue Year: XCII/2023
- Issue No: 5 (Suppl.)
- Page Range: 625-640
- Page Count: 16
- Language: Czech
- Content File-PDF