The amount and the variety of available medical data coming from multiple and heterogeneous sources can inhibit analysis, manual interpretation and use of simple data management applications. In the healthcare domain, the development of techniques for enabling health data management, analysis, mining and recognition has become worldwide important. In this paper, the use of the most-known dimensionality reduction techniques on a dataset composed of real mammographic reports is presented. Techniques such as LSI, PCA, and SVD decomposition have been applied to the extracted TF-IDF matrix using less attributes than the original unprocessed matrix and obtaining comparable results. Due to their reliability, LSI and PCA techniques can be efficiently used, increasing any computation feasibility on reduced feature data.
Luca Agnello, Albert Comelli, Salvatore Vitabile (2016). Feature Dimensionality Reduction for Mammographic Report Classification. In J.K.a.B.D.M. F. Pop (a cura di), Resource Management for Big-Data Platforms: Algorithms, Modelling, and High-Performance Computing Techniques (pp. 311-337). Springer [10.1007/978-3-319-44881-7_15].
Feature Dimensionality Reduction for Mammographic Report Classification
COMELLI, Albert;VITABILE, Salvatore
2016-01-01
Abstract
The amount and the variety of available medical data coming from multiple and heterogeneous sources can inhibit analysis, manual interpretation and use of simple data management applications. In the healthcare domain, the development of techniques for enabling health data management, analysis, mining and recognition has become worldwide important. In this paper, the use of the most-known dimensionality reduction techniques on a dataset composed of real mammographic reports is presented. Techniques such as LSI, PCA, and SVD decomposition have been applied to the extracted TF-IDF matrix using less attributes than the original unprocessed matrix and obtaining comparable results. Due to their reliability, LSI and PCA techniques can be efficiently used, increasing any computation feasibility on reduced feature data.File | Dimensione | Formato | |
---|---|---|---|
chp%3A10.1007%2F978-3-319-44881-7_15.pdf
Solo gestori archvio
Descrizione: Chapter
Dimensione
815.04 kB
Formato
Adobe PDF
|
815.04 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
bfm%3A978-3-319-44881-7%2F1.pdf
Solo gestori archvio
Descrizione: Index, Preface, and Table of Contents
Dimensione
111.64 kB
Formato
Adobe PDF
|
111.64 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Front.pdf
Solo gestori archvio
Descrizione: Front Matter
Dimensione
1.14 MB
Formato
Adobe PDF
|
1.14 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.