Software Library for Authorship Identification
Software Library for Authorship Identification
Author(s): Cvetina Hantova, Maria Nisheva-Pavlova, Phillip Ein-Dor, Ivan Ivanov, Peter L. StanchevSubject(s): Essay|Book Review |Scientific Life
Published by: Институт по математика и информатика - Българска академия на науките
Keywords: text authorship identification; compression algorithms; normalized compression distance; n-grams; natural frequency zoned word distribution.
Summary/Abstract: The aim of this paper is to review some methods for text authorship attribution and to discuss the development of a software library with tools for automatic authorship attribution. The presentation is focused on an analysis of two groups of tools oriented to: (1) methods for extraction of features and (2) methods for computing the distance between character strings based on data compression algorithms.
Journal: Digital Presentation and Preservation of Cultural and Scientific Heritage
- Issue Year: 2015
- Issue No: V
- Page Range: 91-97
- Page Count: 7