This research investigates the disclosure of banking institutions by analyzing their annual reports to identify the determinants capable of signaling possible corruption scandals. A textual analysis was conducted on the financial reports of 42 Eurozone banks from the period 2013 to 2022. Drawing on impression management theory, we combine an advanced large language model (LLM) and dictionary approach to extract and analyze the governance-related textual content of the banks in the sample. Machine learning algorithms—including random forests, support vector machines, gradient boosting, and naive Bayes classifiers—and logistic regression have been employed to verify whether disclosure indicators allow for the identification of corruption scandals from a preventive perspective. Our findings show that specific textual measures can be used to analyze the association between disclosure and corruption events and as predictive tools to detect corruption scandals before they become public domain. Our study has several implications, particularly for supervisors and investors who can proactively leverage our findings to identify possible corruption scandals in banks by analyzing their financial disclosures.

Damiano, R., Polizzi, S., Scannella, E., Valenza, G. (2025). Corruption Detection Through Textual Analysis: Evidence From Eurozone Banks. BUSINESS ETHICS, THE ENVIRONMENT & RESPONSIBILITY [10.1111/beer.12824].

Corruption Detection Through Textual Analysis: Evidence From Eurozone Banks

Rodolfo Damiano;Salvatore Polizzi
;
Enzo Scannella;Giuseppe Valenza
2025-01-01

Abstract

This research investigates the disclosure of banking institutions by analyzing their annual reports to identify the determinants capable of signaling possible corruption scandals. A textual analysis was conducted on the financial reports of 42 Eurozone banks from the period 2013 to 2022. Drawing on impression management theory, we combine an advanced large language model (LLM) and dictionary approach to extract and analyze the governance-related textual content of the banks in the sample. Machine learning algorithms—including random forests, support vector machines, gradient boosting, and naive Bayes classifiers—and logistic regression have been employed to verify whether disclosure indicators allow for the identification of corruption scandals from a preventive perspective. Our findings show that specific textual measures can be used to analyze the association between disclosure and corruption events and as predictive tools to detect corruption scandals before they become public domain. Our study has several implications, particularly for supervisors and investors who can proactively leverage our findings to identify possible corruption scandals in banks by analyzing their financial disclosures.
2025
Damiano, R., Polizzi, S., Scannella, E., Valenza, G. (2025). Corruption Detection Through Textual Analysis: Evidence From Eurozone Banks. BUSINESS ETHICS, THE ENVIRONMENT & RESPONSIBILITY [10.1111/beer.12824].
File in questo prodotto:
File Dimensione Formato  
Damiano Polizzi Scannella Valenza 2025.pdf

accesso aperto

Descrizione: This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Tipologia: Versione Editoriale
Dimensione 880.58 kB
Formato Adobe PDF
880.58 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/679203
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact