Deep learning of cuneiform sign detection with weak supervision using transliteration alignment

The cuneiform script provides a glimpse into our ancient history. However, reading age-old clay tablets is time-consuming and requires years of training. To simplify this process, we propose a deep-learning based sign detector that locates and classifies cuneiform signs in images of clay tablets. De...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Dencker, Tobias (VerfasserIn) , Klinkisch, Pablo (VerfasserIn) , Maul, Stefan M. (VerfasserIn) , Ommer, Björn (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: December 16, 2020
In: PLOS ONE
Year: 2020, Jahrgang: 15, Heft: 12, Pages: 1-21
ISSN:1932-6203
DOI:10.1371/journal.pone.0243039
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1371/journal.pone.0243039
Verlag, lizenzpflichtig, Volltext: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0243039
Volltext
Verfasserangaben:Tobias Dencker, Pablo Klinkisch, Stefan M. Maul, Björn Ommer
Beschreibung
Zusammenfassung:The cuneiform script provides a glimpse into our ancient history. However, reading age-old clay tablets is time-consuming and requires years of training. To simplify this process, we propose a deep-learning based sign detector that locates and classifies cuneiform signs in images of clay tablets. Deep learning requires large amounts of training data in the form of bounding boxes around cuneiform signs, which are not readily available and costly to obtain in the case of cuneiform script. To tackle this problem, we make use of existing transliterations, a sign-by-sign representation of the tablet content in Latin script. Since these do not provide sign localization, we propose a weakly supervised approach: We align tablet images with their corresponding transliterations to localize the transliterated signs in the tablet image, before using these localized signs in place of annotations to re-train the sign detector. A better sign detector in turn boosts the quality of the alignments. We combine these steps in an iterative process that enables training a cuneiform sign detector from transliterations only. While our method works weakly supervised, a small number of annotations further boost the performance of the cuneiform sign detector which we evaluate on a large collection of clay tablets from the Neo-Assyrian period. To enable experts to directly apply the sign detector in their study of cuneiform texts, we additionally provide a web application for the analysis of clay tablets with a trained cuneiform sign detector.
Beschreibung:Gesehen am 26.01.2021
Beschreibung:Online Resource
ISSN:1932-6203
DOI:10.1371/journal.pone.0243039