METHODOLOGICAL DIFFERENCES IN MODELING TEXTS’ STATISTICAL STRUCTUR (on the example of “The Tale of The Rout of Mamai”) Cover Image

Концептуальные различия подходов к описанию статистической структуры текстов (на примере «Сказания о Мамаевом побоище»)
METHODOLOGICAL DIFFERENCES IN MODELING TEXTS’ STATISTICAL STRUCTUR (on the example of “The Tale of The Rout of Mamai”)

Author(s): Lyubov Kovrigina
Subject(s): Language and Literature Studies
Published by: Петрозаводский государственный университет
Keywords: text variants; text component structure; non-gaussian distributions; H-distribution; concentration and dispersion of elements in linguistic distributions; population heterogeneity; Kudrin’s R-point; Hirsch – Popescu’s h-point (h-index)

Summary/Abstract: Three methods of modeling statistical structure of the text are analyzed. The obtained comparative results were derived by the employment of different statistical models to the same material (“The Tale of The Rout of Mamai”). All compared models are designed to separate autosemantic words from synsemantic words of the plot. The results received during models’ testing are provided . The h-point introduced by Hirsch – Popescu is shown to be the most suitable parameter helping to separate content words from structure-class words. The h-point marks the biggest part of non-thematic words for a certain text

Toggle Accessibility Mode