Graphical lasso is one of the most used estimators for inferring genetic networks. Despite its diffusion, there are several fields in applied research where the limits of detection of modern measurement technologies make the use of this estimator theoretically unfounded, even when the assumption of a multivariate Gaussian distribution is satisfied. Typical examples are data generated by polymerase chain reactions and flow cytometer. The combination of censoring and high-dimensionality make inference of the underlying genetic networks from these data very challenging. In this article, we propose an $ell_1$-penalized Gaussian graphical model for censored data and derive two EM-like algorithms for inference. We evaluate the computational efficiency of the proposed algorithms by an extensive simulation study and show that, when censored data are available, our proposal is superior to existing competitors both in terms of network recovery and parameter estimation. We apply the proposed method to gene expression data generated by microfluidic Reverse Transcription quantitative Polymerase Chain Reaction technology in order to make inference on the regulatory mechanisms of blood development. A software implementation of our method is available on github (https://github.com/LuigiAugugliaro/cglasso).

Augugliaro, L., Abbruzzo, A., Vinciotti, V. (2020). l1-Penalized censored Gaussian graphical model. BIOSTATISTICS, 21(2), 1-16 [10.1093/biostatistics/kxy043].

l1-Penalized censored Gaussian graphical model

Augugliaro, Luigi
;
Abbruzzo, Antonino;Vinciotti, Veronica
2020-01-01

Abstract

Graphical lasso is one of the most used estimators for inferring genetic networks. Despite its diffusion, there are several fields in applied research where the limits of detection of modern measurement technologies make the use of this estimator theoretically unfounded, even when the assumption of a multivariate Gaussian distribution is satisfied. Typical examples are data generated by polymerase chain reactions and flow cytometer. The combination of censoring and high-dimensionality make inference of the underlying genetic networks from these data very challenging. In this article, we propose an $ell_1$-penalized Gaussian graphical model for censored data and derive two EM-like algorithms for inference. We evaluate the computational efficiency of the proposed algorithms by an extensive simulation study and show that, when censored data are available, our proposal is superior to existing competitors both in terms of network recovery and parameter estimation. We apply the proposed method to gene expression data generated by microfluidic Reverse Transcription quantitative Polymerase Chain Reaction technology in order to make inference on the regulatory mechanisms of blood development. A software implementation of our method is available on github (https://github.com/LuigiAugugliaro/cglasso).
2020
Settore SECS-S/01 - Statistica
Augugliaro, L., Abbruzzo, A., Vinciotti, V. (2020). l1-Penalized censored Gaussian graphical model. BIOSTATISTICS, 21(2), 1-16 [10.1093/biostatistics/kxy043].
File in questo prodotto:
File Dimensione Formato  
3.pdf

Solo gestori archvio

Tipologia: Post-print
Dimensione 29.44 MB
Formato Adobe PDF
29.44 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
3_compressed.pdf

Solo gestori archvio

Tipologia: Post-print
Dimensione 8.1 MB
Formato Adobe PDF
8.1 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
l1-Penalized censored Gaussian graphical model.pdf

Solo gestori archvio

Descrizione: Articolo principale
Tipologia: Versione Editoriale
Dimensione 353.09 kB
Formato Adobe PDF
353.09 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/366807
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 4
social impact