PTTB and DiTMAO: The Modularity of Some Tools for Digital Humanities Cover Image

PTTB e DiTMAO: modularità di alcune applicazioni per le Digital Humanities
PTTB and DiTMAO: The Modularity of Some Tools for Digital Humanities

Author(s): Andrea Bozzi
Subject(s): Language and Literature Studies, Theoretical Linguistics
Published by: Wydawnictwo Adam Marszałek
Keywords: Talmud; computational linguistic; digital philology; TM (translation memory); Old Occitan; ontologies

Summary/Abstract: For several years now, computational linguistics has been addressing the problems of and developing technological tools for automatic translation, with its important economic implications. At the same time, projects dedicated to facilitating translations of ancient works, which are often fraught with considerable hermeneutical difficulties, are far rarer. The PTTB system, which was designed and constructed at the Institute for Computational Linguistics (National Research Council) in Pisa, enables a group of about fifty scholars to translate the entire Babylonian Talmud, written in Aramaic and Biblical Hebrew, more quickly and uniformly. While the language and structure of the textual corpus made the development of machine translation algorithms impossible, translation memory and edit distance techniques have produced excellent results. Based on them, the system offers scholars a high percentage of correct translations, accessible through a very intuitive graphic user interface. The results are easily exportable to xml files suitable for the final editing and printing operations. So far, these innovations have made it possible to publish four treatises in six printed volumes with translations, annotations and thematic indexes within a relatively short time. Several other volumes have already been processed and are currently being edited. Various perspectives open up for the use of the digital Talmud in Italian. One of the most interesting options involves using machine learning and named entity recognition techniques to associate semantic or conceptual values (Talmud Ontological Framework) with and make cross-references among portions of the text that report or discuss similar themes. This will help various groups of (general and specialised) users to browse this vast and heterogeneous textual archive on the semantic basis. The strategy adopted here is also aligned with the Dictionnaire des Termes Médico-botaniques de l’Ancien Occitan (DiTMAO), another ongoing lexicographical project. It will enable users to semantically navigate within an extensive medical-pharmaceutical and botanical textual corpus in medieval Occitan. For these reasons, PTTB and DiTMAO can be regarded as two instances of one innovative technological infrastructure for linguistic and philological research in the field of digital humanities.

  • Issue Year: 11/2020
  • Issue No: 1
  • Page Range: 101-121
  • Page Count: 21
  • Language: Italian
Toggle Accessibility Mode