Oral cancer is a major health problem requiring accurate healthcare support systems, and deep learning (DL) based medical imaging has proven to be an effective solution. This work addresses the oral cancer classification task by employing different convolutional architectures. Our goal is to improve classification by incorporating segmentation information. We propose two segment-driven strategies to strengthen conventional classification training. The first trains a dedicated neural network (NN) to predict masks, which are then applied to the images so that the classifier receives masked inputs with irrelevant regions hidden. Specifically, we introduce a soft-mask approach that weighs the contribution of each pixel to the final classification, in contrast to the previously proposed hard-mask strategy. The second trains the NN with CrossEntropy-IoU, a loss function that combines the cross-entropy, which drives prediction of the correct label, with the Intersection over Union (IoU), which measures the mismatch between the activation map and the mask. Experiments show that the segment-driven strategies improve accuracy and training speed with both convolutional and transformer architectures.
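The two strategies named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the 0.5 threshold for the hard mask, the soft (product-based) IoU, and the additive combination with a weight `lam` are all assumptions made here for illustration, since the abstract does not specify them.

```python
import numpy as np


def apply_hard_mask(image, mask_prob, threshold=0.5):
    """Keep pixels whose predicted mask probability reaches the threshold, zero the rest.

    image: (H, W, C) array; mask_prob: (H, W) array in [0, 1].
    The threshold value is an assumption, not taken from the paper.
    """
    hard = (mask_prob >= threshold).astype(image.dtype)
    return image * hard[..., None]


def apply_soft_mask(image, mask_prob):
    """Weigh every pixel by its predicted mask probability instead of binarizing."""
    return image * mask_prob[..., None].astype(image.dtype)


def soft_iou(activation, mask, eps=1e-7):
    """IoU between an activation map in [0, 1] and a binary mask (soft formulation)."""
    intersection = np.sum(activation * mask)
    union = np.sum(activation) + np.sum(mask) - intersection
    return (intersection + eps) / (union + eps)


def cross_entropy(logits, label):
    """Cross-entropy of one sample, computed with a numerically stable log-softmax."""
    shifted = logits - np.max(logits)
    log_probs = shifted - np.log(np.sum(np.exp(shifted)))
    return -log_probs[label]


def ce_iou_loss(logits, label, activation, mask, lam=1.0):
    """Hypothetical CrossEntropy-IoU combination: CE plus a weighted (1 - IoU) penalty."""
    return cross_entropy(logits, label) + lam * (1.0 - soft_iou(activation, mask))
```

In this sketch the soft mask keeps low-probability pixels at reduced intensity, while the hard mask discards them entirely; the loss term `1 - IoU` vanishes when the activation map coincides with the mask, leaving only the cross-entropy.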
Parola, M., Malaspina, E., Cimino, M.G.C.A., La Mantia, G., Campisi, G., Di Fede, O. (2025). Improving oral cancer classification via segment-driven photographic deep learning imaging. IEEE SIGNAL PROCESSING MAGAZINE, 1-5 [10.1109/CISMCompanion65074.2025.11032552].
Improving oral cancer classification via segment-driven photographic deep learning imaging
La Mantia G.; Campisi G.; Di Fede O.
2025-01-01
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.


