Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych
Significant Words, Keywords, Wordlists – on Statistical Methods of Searching for Relevant Terms
Author(s): Maciej EderSubject(s): History, Cultural history
Published by: Wydawnictwa Uniwersytetu Warszawskiego
Keywords: quantitative linguistics; stylometry; keywords; Zeta method; topic modelling,;wordlist
Summary/Abstract: This article discusses automatic extraction of relevant words from sets of texts. Theauthor briefly presents three methods aimed to extract the words from the corpus of wordswith regard to their frequency, or words whose occurrence next to each other is not random.First, he focuses on the keyword analysis method, then he discusses the Zeta method developed by John Burrows and Hugh Craig, and the third method covered in the article isthe topic modelling method, which is becoming very popular recently, and consists in findingclusters of words co-occurring in similar contexts. Topic modelling was intended for a quickcontent search in large collections of documents. On the basis of 100 Polish novels, the articlepresents how this method can be used for linguistic studies.
Journal: Przegląd Humanistyczny
- Issue Year: 545/2016
- Issue No: 03
- Page Range: 31-44
- Page Count: 14
- Language: Polish
- Content File-PDF