Automatic Identification of Domain Terms: An Approach for Italian
Author(s): Maria Teresa Artese, Isabella Gagliardi
Subject(s): Language and Literature Studies, Library and Information Science, Information Architecture, Applied Linguistics, Computational Linguistics
Published by: Институт по математика и информатика - Българска академия на науките
Keywords: Classification Methods; Word Embedding Models; Probability; Food; Italian Language
Summary/Abstract: Creating a fully automated domain-specific thesaurus is a topical problem. The paper presents a novel method to address this problem for the Italian language. The main feature of the approach is the integration of different methods: machine learning classification methods working on the semantic representation of candidate terms; word embedding models, able to capture the semantics of words; and a computation of the degree of specialization of a term. The work is in progress, and the results obtained so far are promising.
Journal: Digital Presentation and Preservation of Cultural and Scientific Heritage
- Issue Year: 2020
- Issue No: X
- Page Range: 251-258
- Page Count: 8
- Language: English