We present two methodologies to deal with high-dimensional data with mixed variables, the strongly decomposable graphical model and the regression-type graphical model. The first model is used to infer conditional independence graphs. The latter model is applied to compute the relative importance or contribution of each predictor to the response variables. Recently, penalized likelihood approaches have also been proposed to estimate graph structures. In a simulation study, we compare the performance of the strongly decomposable graphical model and the graphical lasso in terms of graph recovering. Five different graph structures are used to simulate the data: the banded graph, the cluster graph, the random graph, the hub graph and the scale-free graph. We assume the graphs are sparse. Our finding, in the simulation study, is that the strongly decomposable graphical model shows, generally, comparable or better performance both in low and high-dimensional case. Finally, we show an application on mixed data.

Abbruzzo, A., Mineo, A. (2014). Inferring networks from high-dimensional data with mixed variables. In Springer series contributions to statistics for “Advances in complex data modeling and computational methods in statistics".

Inferring networks from high-dimensional data with mixed variables

ABBRUZZO, Antonino;MINEO, Angelo
2014-01-01

Abstract

We present two methodologies to deal with high-dimensional data with mixed variables, the strongly decomposable graphical model and the regression-type graphical model. The first model is used to infer conditional independence graphs. The latter model is applied to compute the relative importance or contribution of each predictor to the response variables. Recently, penalized likelihood approaches have also been proposed to estimate graph structures. In a simulation study, we compare the performance of the strongly decomposable graphical model and the graphical lasso in terms of graph recovering. Five different graph structures are used to simulate the data: the banded graph, the cluster graph, the random graph, the hub graph and the scale-free graph. We assume the graphs are sparse. Our finding, in the simulation study, is that the strongly decomposable graphical model shows, generally, comparable or better performance both in low and high-dimensional case. Finally, we show an application on mixed data.
2014
Settore SECS-S/01 - Statistica
978-3-319-11148-3
Abbruzzo, A., Mineo, A. (2014). Inferring networks from high-dimensional data with mixed variables. In Springer series contributions to statistics for “Advances in complex data modeling and computational methods in statistics".
File in questo prodotto:
File Dimensione Formato  
AbbruzzoAntonino.pdf

accesso aperto

Descrizione: Main article
Dimensione 218.34 kB
Formato Adobe PDF
218.34 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/101468
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact