A Digital Edition between Stylometry and OCR: The Klagenfurter Ausgabe of Robert Musil

Rebora, Simone
2019-01-01

Abstract

This article presents the digital edition of Robert Musil’s work (Klagenfurter Ausgabe) and its role in a digital humanities project aimed at reconstructing Musil’s activity in the WWI journal Tiroler Soldaten-Zeitung. First, the article reviews the ways in which the computational methods of stylometry are applied to attribute the anonymous texts published in the Klagenfurter Ausgabe. Second, it explores how optical character recognition (OCR) software is employed to expand the corpus. At the core of this methodology, two machine learning algorithms are trained and revised using the transcriptions of the Klagenfurter Ausgabe, to reach an accuracy of about 99.9% in the digitization of the Tiroler Soldaten-Zeitung texts. The work of this project offers not only the possibility of expanding stylometric analysis to the whole journal, but also of improving the transcriptions of the Klagenfurter Ausgabe.
2019
Robert Musil, Digital Edition, Stylometry, Optical Character Recognition
Use this identifier to cite or link to this document: https://hdl.handle.net/11562/1009327