Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.

(2025). Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications. (Tesi di dottorato, Università degli Studi di Palermo, 2025).

Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications

CALDERARO, Salvatore

2025-07-01

Abstract

The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di discussione
	
				lug-2025
			
	Parole chiave
	
				Embeddings, Metric Learning, XAI, Explainability, Graph Embeddings, Biomedical data
			
	Citazione
	
				(2025). Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications. (Tesi di dottorato, Università degli Studi di Palermo, 2025).
			
	Appare nelle tipologie:
	
				4.2 Tesi di dottorato

File in questo prodotto:

File	Dimensione	Formato
reduced_PhD_Thesis_Salvatore_Calderaro_9_6_2025.pdf embargo fino al 31/12/2025 Tipologia: Tesi di dottorato Dimensione 10 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	10 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/682704

Citazioni

ND

ND

ND

social impact