Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.

Cannata, S., Cicceri, G., Cirrincione, G., Currieri, T., Lovino, M., Militello, C., et al. (2025). ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques. In Advanced Neural Artificial Intelligence: Theories and Applications (pp. 225-233). Springer Science and Business Media Deutschland GmbH [10.1007/978-981-96-0994-9_21].

ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques

Cicceri, Giovanni
;
Currieri, Tiziana;Prinzi, Francesco;Vitabile, Salvatore
2025-05-24

Abstract

Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.
24-mag-2025
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
9789819609932
9789819609949
Cannata, S., Cicceri, G., Cirrincione, G., Currieri, T., Lovino, M., Militello, C., et al. (2025). ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques. In Advanced Neural Artificial Intelligence: Theories and Applications (pp. 225-233). Springer Science and Business Media Deutschland GmbH [10.1007/978-981-96-0994-9_21].
File in questo prodotto:
File Dimensione Formato  
978-981-96-0994-9_21.pdf

Solo gestori archvio

Tipologia: Versione Editoriale
Dimensione 471.69 kB
Formato Adobe PDF
471.69 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/686853
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact