Medical image segmentation is an important task supporting diagnosis and screening systems in several medical areas including oral cancer recognition. This paper explores the effectiveness of different deep learning (DL) architectures, including U-Net, LinkNet, PAN, and FPN for oral cavity lesion segmentation. Furthermore, we propose an ensemble model incorporating several decision fusion strategies to aggregate individual predictions, to improve the individual model performance. Our study employs a dataset acquired and manually labeled by the clinical subgroup of our team. On this dataset, we address two distinct segmentation problems: binary semantic segmentation to differentiate healthy tissue from diseased regions and multiclass semantic segmentation to identify three oral pathologies: aphthous, traumatic, and neoplastic lesions. We study the ensemble model's effectiveness in improving segmentation accuracy by combining different DL architectures' strengths. The results demonstrate that the ensemble strategy is highly effective for binary semantic segmentation, achieving a Dice score of 76.5%; while, for the multi-class problem of differentiating between multiple diseases, improvements are present but less marked.
Parola, M., Cimino, M.G.C.A., Cantini, I., La Mantia, G., Campisi, G., Di Fede, O. (2025). Oral Cancer Recognition on Photographic Images Via Deep Learning Semantic Segmentation. In 2025 IEEE Symposium on Computational Intelligence in Health and Medicine Companion, CIHM Companion 2025 (pp. 1-5). 345 E 47TH ST, NEW YORK, NY 10017 USA : Institute of Electrical and Electronics Engineers Inc. [10.1109/CIHMCompanion65205.2025.11002690].
Oral Cancer Recognition on Photographic Images Via Deep Learning Semantic Segmentation
La Mantia, Gaetano;Campisi, Giuseppina;Di Fede, Olga
2025-01-01
Abstract
Medical image segmentation is an important task supporting diagnosis and screening systems in several medical areas including oral cancer recognition. This paper explores the effectiveness of different deep learning (DL) architectures, including U-Net, LinkNet, PAN, and FPN for oral cavity lesion segmentation. Furthermore, we propose an ensemble model incorporating several decision fusion strategies to aggregate individual predictions, to improve the individual model performance. Our study employs a dataset acquired and manually labeled by the clinical subgroup of our team. On this dataset, we address two distinct segmentation problems: binary semantic segmentation to differentiate healthy tissue from diseased regions and multiclass semantic segmentation to identify three oral pathologies: aphthous, traumatic, and neoplastic lesions. We study the ensemble model's effectiveness in improving segmentation accuracy by combining different DL architectures' strengths. The results demonstrate that the ensemble strategy is highly effective for binary semantic segmentation, achieving a Dice score of 76.5%; while, for the multi-class problem of differentiating between multiple diseases, improvements are present but less marked.| File | Dimensione | Formato | |
|---|---|---|---|
|
Oral_Cancer_Recognition_on_Photographic_Images_Via_Deep_Learning_Semantic_Segmentation.pdf
Solo gestori archvio
Tipologia:
Versione Editoriale
Dimensione
909.7 kB
Formato
Adobe PDF
|
909.7 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


