Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

We consider estimation of a linear regression model using data where some covariate values are missing but imputations are available to fill in the missing values. This situation generates a tradeoff between bias and precision when estimating the regression parameters of interest. Using only the subsample of complete observations does not cause bias but may imply a substantial loss of precision because the complete cases may be too few. On the other hand, filling in the missing values with imputations may cause bias. We provide the new Stata command gmi, which handles such tradeoff by using either model reduction or Bayesian model averaging techniques in the context of the generalized missing indicator approach recently proposed by Dardanoni, Modica, and Peracchi (2011, Journal of Econometrics 162: 362–368). If multiple imputations are available, gmi can also be combined with the built-in Stata prefix mi estimate to account for extra variability due to imputation. We illustrate the use of gmi with an empirical application in the health domain, where item nonresponse is substantial.

Dardanoni, V., De Luca, G., Modica, S., Peracchi, F. (2012). A Generalized Missing-indicator Approach to Regression with Imputed Covariates. THE STATA JOURNAL, 12(4), 575-604 [10.1177/1536867X1201200402].

A Generalized Missing-indicator Approach to Regression with Imputed Covariates

DARDANONI, Valentino;DE LUCA, Giuseppe;MODICA, Salvatore;Peracchi, F.

2012-01-01

Abstract

We consider estimation of a linear regression model using data where some covariate values are missing but imputations are available to fill in the missing values. This situation generates a tradeoff between bias and precision when estimating the regression parameters of interest. Using only the subsample of complete observations does not cause bias but may imply a substantial loss of precision because the complete cases may be too few. On the other hand, filling in the missing values with imputations may cause bias. We provide the new Stata command gmi, which handles such tradeoff by using either model reduction or Bayesian model averaging techniques in the context of the generalized missing indicator approach recently proposed by Dardanoni, Modica, and Peracchi (2011, Journal of Econometrics 162: 362–368). If multiple imputations are available, gmi can also be combined with the built-in Stata prefix mi estimate to account for extra variability due to imputation. We illustrate the use of gmi with an empirical application in the health domain, where item nonresponse is substantial.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2012
			
	Titolo del periodico 
DATO PREVISTO SU LOGINMIUR
	
				THE STATA JOURNAL
			
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
				https://dx.doi.org/10.1177/1536867X1201200402
			
	Citazione
	
				Dardanoni, V., De Luca, G., Modica, S., Peracchi, F. (2012). A Generalized Missing-indicator Approach to Regression with Imputed Covariates. THE STATA JOURNAL, 12(4), 575-604 [10.1177/1536867X1201200402].
			
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Dardanoni, De Luca, Modica, Peracchi (2012).pdf Solo gestori archvio Descrizione: Pubblicazione Tipologia: Versione Editoriale Dimensione 317.95 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	317.95 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/79348

Citazioni

ND

4

4

social impact