Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

The widespread diffusion of misinformation through digital platforms has raised significant concerns due to its adverse impacts on society and economy. Nowadays, the adoption of Artificial Intelligence and Machine Learning based mechanisms to automate fact checking processes and distinguish genuine from fake contents is mandatory. However, recent studies reveal vulnerabilities in AI models to adversarial attacks, where slight modifications of the input can deceive the classifiers. Adversarial Machine Learning strategies aim to compromise machine learning algorithms, posing challenges also for fake news detection models. This study focuses on the impact of adversarial attacks on fake news detection systems, utilizing a black-box attack approach against an unknown algorithm used by the online platforms. The research introduces a methodology leveraging a surrogate model to test the validity of malicious samples offline, with the aim of overcoming known limitations such as the high number of queries made to the target model.

Batool, F., Canino, F., Concone, F., Lo Re, G., Morana, M. (2024). A Black-box Adversarial Attack on Fake News Detection Systems. In CEUR Workshop Proceedings. CEUR-WS.

A Black-box Adversarial Attack on Fake News Detection Systems

Batool F.;Canino F.;Concone F.;Lo Re G.;Morana M.

2024-01-01

Abstract

The widespread diffusion of misinformation through digital platforms has raised significant concerns due to its adverse impacts on society and economy. Nowadays, the adoption of Artificial Intelligence and Machine Learning based mechanisms to automate fact checking processes and distinguish genuine from fake contents is mandatory. However, recent studies reveal vulnerabilities in AI models to adversarial attacks, where slight modifications of the input can deceive the classifiers. Adversarial Machine Learning strategies aim to compromise machine learning algorithms, posing challenges also for fake news detection models. This study focuses on the impact of adversarial attacks on fake news detection systems, utilizing a black-box attack approach against an unknown algorithm used by the online platforms. The research introduces a methodology leveraging a surrogate model to test the validity of malicious samples offline, with the aim of overcoming known limitations such as the high number of queries made to the target model.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2024
			
	URL dell'editore (Open access ove possibile)
	
				https://ceur-ws.org/Vol-3731/paper08.pdf
			
	Citazione
	
				Batool, F., Canino, F., Concone, F., Lo Re, G., Morana, M. (2024). A Black-box Adversarial Attack on Fake News Detection Systems. In CEUR Workshop Proceedings. CEUR-WS.
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
_ITASEC_24.pdf accesso aperto Descrizione: This is an open access article under the terms of the Creative Commons Attribution License Tipologia: Versione Editoriale Dimensione 1.37 MB Formato Adobe PDF Visualizza/Apri	1.37 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/707343

Citazioni

ND

2

ND

social impact