The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.

(2025). Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications. (Tesi di dottorato, Università degli Studi di Palermo, 2025).

Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications

CALDERARO, Salvatore
2025-07-01

Abstract

The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.
lug-2025
Embeddings, Metric Learning, XAI, Explainability, Graph Embeddings, Biomedical data
(2025). Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications. (Tesi di dottorato, Università degli Studi di Palermo, 2025).
File in questo prodotto:
File Dimensione Formato  
reduced_PhD_Thesis_Salvatore_Calderaro_9_6_2025.pdf

embargo fino al 31/12/2025

Tipologia: Tesi di dottorato
Dimensione 10 MB
Formato Adobe PDF
10 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/682704
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact