This study presents an interdisciplinary methodology for detecting intertextual references in Latin patristic literature through a novel combination of philological rigor and Natural Language Processing (NLP) techniques. Focusing on Augustine of Hippo’s 'De Genesi ad litteram' and its relationship to Latin biblical texts (specifically Jerome’s Vulgate and pre-Vulgate versions), this research introduces a token-based classification system enriched with semantic annotations, supported by the INCEpTION platform. The classification system accounts for exact matches, lemmatized forms, synonyms, and structural parallels, capturing a wide spectrum of textual similarity. To enhance automatic retrieval of these intertextual links, we fine-tune BERT-based language models for Latin, incorporating contrastive learning and hard negative mining. Experimental results demonstrate that fine-tuned models significantly outperform baselines across varying levels of textual similarity. This work highlights the utility of computational models in bridging explicit citations and implicit allusions, offering a scalable approach for the study of biblical intertextuality in ancient texts.
Mambelli, A., Bigoni, L., Dainese, D., Tutrone, F., Caffagni, D., Cocchi, F., et al. (2026). The Biblical Heritage in Ancient Latin Christian Literature: Advancing Intertextual Mapping Through Sentence Embeddings. UMANISTICA DIGITALE, 22, 157-186 [10.60923/issn.2532-8816/22160].
The Biblical Heritage in Ancient Latin Christian Literature: Advancing Intertextual Mapping Through Sentence Embeddings
Mambelli A
;Tutrone F
;
2026-02-02
Abstract
This study presents an interdisciplinary methodology for detecting intertextual references in Latin patristic literature through a novel combination of philological rigor and Natural Language Processing (NLP) techniques. Focusing on Augustine of Hippo’s 'De Genesi ad litteram' and its relationship to Latin biblical texts (specifically Jerome’s Vulgate and pre-Vulgate versions), this research introduces a token-based classification system enriched with semantic annotations, supported by the INCEpTION platform. The classification system accounts for exact matches, lemmatized forms, synonyms, and structural parallels, capturing a wide spectrum of textual similarity. To enhance automatic retrieval of these intertextual links, we fine-tune BERT-based language models for Latin, incorporating contrastive learning and hard negative mining. Experimental results demonstrate that fine-tuned models significantly outperform baselines across varying levels of textual similarity. This work highlights the utility of computational models in bridging explicit citations and implicit allusions, offering a scalable approach for the study of biblical intertextuality in ancient texts.| File | Dimensione | Formato | |
|---|---|---|---|
|
UD 2025_The Biblical Heritage in Ancient Latin Christian Literature.pdf
accesso aperto
Descrizione: testo completo dell'articolo
Tipologia:
Versione Editoriale
Dimensione
7.15 MB
Formato
Adobe PDF
|
7.15 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


