The volume and variety of medical data coming from multiple heterogeneous sources can hinder analysis and manual interpretation, and can overwhelm simple data management applications. In the healthcare domain, developing techniques for health data management, analysis, mining and recognition has become important worldwide. In this paper, the use of the best-known dimensionality reduction techniques on a dataset of real mammographic reports is presented. Techniques such as LSI, PCA, and SVD were applied to the extracted TF-IDF matrix, using fewer attributes than the original unprocessed matrix while obtaining comparable results. Owing to their reliability, LSI and PCA can be used efficiently, making computation on the reduced feature data more feasible.
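
The pipeline summarized in the abstract (TF-IDF extraction followed by LSI or PCA) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy corpus, the function names `tfidf_matrix`, `lsi`, and `pca`, the whitespace tokenizer, the particular IDF variant, and the choice of k = 2 are all assumptions made for illustration only.

```python
import numpy as np

# Hypothetical toy corpus standing in for mammographic reports
# (invented for illustration; not data from the chapter).
reports = [
    "no suspicious mass or calcification",
    "benign calcification in left breast",
    "suspicious mass in right breast biopsy recommended",
    "no mass no calcification routine follow up",
]

def tfidf_matrix(docs):
    """Return (matrix, vocabulary): one row per document, one column per term."""
    vocab = sorted({tok for d in docs for tok in d.split()})
    index = {t: j for j, t in enumerate(vocab)}
    tf = np.zeros((len(docs), len(vocab)))
    for i, d in enumerate(docs):
        for tok in d.split():
            tf[i, index[tok]] += 1.0
    df = np.count_nonzero(tf, axis=0)      # number of documents containing each term
    idf = np.log(len(docs) / df) + 1.0     # one common IDF variant (assumed here)
    return tf * idf, vocab

def lsi(X, k):
    """LSI: truncated SVD of the uncentered TF-IDF matrix, keeping k components."""
    U, s, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :k] * s[:k]

def pca(X, k):
    """PCA: SVD of the column-centered matrix, keeping k principal components."""
    Xc = X - X.mean(axis=0)
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :k] * s[:k]

X, vocab = tfidf_matrix(reports)
X_lsi = lsi(X, 2)   # documents projected onto 2 latent dimensions
X_pca = pca(X, 2)   # documents projected onto 2 principal components
print(X.shape, X_lsi.shape, X_pca.shape)
```

In this sketch the only difference between the two reductions is that PCA centers each column before the decomposition, while LSI works on the raw TF-IDF values. The reduced matrices could then be passed to any classifier in place of the full term matrix.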

Luca Agnello, Albert Comelli, Salvatore Vitabile (2016). Feature Dimensionality Reduction for Mammographic Report Classification. In F. Pop, J.K., and B.D.M. (Eds.), Resource Management for Big-Data Platforms: Algorithms, Modelling, and High-Performance Computing Techniques (pp. 311-337). Springer [10.1007/978-3-319-44881-7_15].

Feature Dimensionality Reduction for Mammographic Report Classification

COMELLI, Albert; VITABILE, Salvatore
2016-01-01


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/222031