Oral cancer is a major health problem requiring accurate healthcare support systems, and Deep learning (DL) based medical imaging has proven to be an effective solution. This work addresses the oral cancer classification task by employing different convolutional architectures. Our goal is to improve the classification tasks by incorporating segmentation information. We propose two segment-driven strategies to strengthen the traditional classification training. The first one involves training a dedicated neural network (NN) to predict masks, which are then used to classify masked images to hide unuseful information. Specifically, we introduce an approach relying on soft-masks to weigh the contribution of each pixel to the final classification against the already proposed hard-mask strategy. The second proposed approach involves training the NN via CrossEntropy-IoU, a loss function consisting of the CrossEntropy for identifying the correct label, and the Intersection over Union measuring the mismatch between the activation map and the mask. Experiments show that implementing segment-driven strategies enhances accuracy and training speed using both convolutional and transformer architectures.

Parola, M., Malaspina, E., Cimino, M.G.C.A., La Mantia, G., Campisi, G., Di Fede, O. (2025). Improving oral cancer classification via segment-driven photographic deep learning imaging. IEEE SIGNAL PROCESSING MAGAZINE, 1-5 [10.1109/CISMCompanion65074.2025.11032552].

Improving oral cancer classification via segment-driven photographic deep learning imaging

La Mantia G.;Campisi G.;Di Fede O.
2025-01-01

Abstract

Oral cancer is a major health problem requiring accurate healthcare support systems, and Deep learning (DL) based medical imaging has proven to be an effective solution. This work addresses the oral cancer classification task by employing different convolutional architectures. Our goal is to improve the classification tasks by incorporating segmentation information. We propose two segment-driven strategies to strengthen the traditional classification training. The first one involves training a dedicated neural network (NN) to predict masks, which are then used to classify masked images to hide unuseful information. Specifically, we introduce an approach relying on soft-masks to weigh the contribution of each pixel to the final classification against the already proposed hard-mask strategy. The second proposed approach involves training the NN via CrossEntropy-IoU, a loss function consisting of the CrossEntropy for identifying the correct label, and the Intersection over Union measuring the mismatch between the activation map and the mask. Experiments show that implementing segment-driven strategies enhances accuracy and training speed using both convolutional and transformer architectures.
2025
2025 IEEE Symposium on Computational Intelligence in Image, Signal Processing and Synthetic Media Companion (CISM Companion)
17-20 March 2025
Parola, M., Malaspina, E., Cimino, M.G.C.A., La Mantia, G., Campisi, G., Di Fede, O. (2025). Improving oral cancer classification via segment-driven photographic deep learning imaging. IEEE SIGNAL PROCESSING MAGAZINE, 1-5 [10.1109/CISMCompanion65074.2025.11032552].
File in questo prodotto:
File Dimensione Formato  
Improving oral cancer classification via segment-driven photographic deep learning imaging.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 604.89 kB
Formato Adobe PDF
604.89 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/691386
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact