The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.
(2025). Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications. (Tesi di dottorato, Università degli Studi di Palermo, 2025).
Embedding Techniques for Biomedical and Biological Data: Interpretable Methods and Applications
CALDERARO, Salvatore
2025-07-01
Abstract
The exponential growth of biomedical and biological data, such as histopathological images, genomic sequences, and molecular networks, calls for advanced techniques to represent such data in compact, informative, and interpretable ways. Embedding methods offer an effective solution by transforming high-dimensional and complex data into representations that facilitate classification, pattern discovery, and knowledge extraction tasks. This thesis investigates a broad spectrum of biomedical and biological data embedding approaches, including classical dimensionality reduction techniques, metric learning models, granular-based approaches, and graph-based representations. The central goal is to combine high predictive performance with interpretability, crucial in clinical and scientific domains where understanding the rationale behind automated decisions is essential. A key contribution of this work is the development of GECo Graph Explanation by Communities, a novel algorithm designed to improve the explainability of Graph Neural Networks. GECo leverages community detection to identify meaningful substructures within graphs, providing clear and consistent explanations for Graph Neural Networks predictions, particularly in biomedical and biological settings. Through extensive experimentation on both public and real-world datasets, the thesis demonstrates the effectiveness of embedding techniques in addressing practical biomedical challenges while ensuring transparency, robustness, and interpretability. This work contributes to advancing trustworthy and explainable artificial intelligence systems for biomedical and biological data analysis.File | Dimensione | Formato | |
---|---|---|---|
reduced_PhD_Thesis_Salvatore_Calderaro_9_6_2025.pdf
embargo fino al 31/12/2025
Tipologia:
Tesi di dottorato
Dimensione
10 MB
Formato
Adobe PDF
|
10 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.