Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.
Cannata, S., Cicceri, G., Cirrincione, G., Currieri, T., Lovino, M., Militello, C., et al. (2025). ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques. In Advanced Neural Artificial Intelligence: Theories and Applications (pp. 225-233). Springer Science and Business Media Deutschland GmbH [10.1007/978-981-96-0994-9_21].
ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques
Cicceri, Giovanni
;Currieri, Tiziana;Prinzi, Francesco;Vitabile, Salvatore
2025-05-24
Abstract
Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.| File | Dimensione | Formato | |
|---|---|---|---|
|
978-981-96-0994-9_21.pdf
Solo gestori archvio
Tipologia:
Versione Editoriale
Dimensione
471.69 kB
Formato
Adobe PDF
|
471.69 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


