Data consisting in repeated observation on a series of fixed units are very common in different context like biological, environmental and social sciences, and different terminology is often used to indicate this kind of data: panel data, longitudinal data, time series-cross section data (TSCS), spatio-temporal data. Missing information are inevitable in longitudinal studies, and can produce biased estimates and loss of powers. The aim of this paper is to propose a new regression (single) imputation method that, considering the particular structure and characteristics of the data set, creates a “complete” data set that can be analyzed by any researcher on different occasions and using different techniques. Simulated incomplete data from a PM10 dataset recorded in Palermo in 2003 have been generated, in order to evaluate the performance of the imputation method by using suitable performance indicators.

Plaia, A., Bondì, A.L. (2010). Regression imputation for space-time datasets with missing values. In F. Palumbo, C.N. Lauro, M.J. Greenacre (a cura di), Data analysis and classification: proceedings of the 6th Conference of the Classification and Data Analysis Group of the Società Italiana di Statistica (pp. 465-472). Springer.

Regression imputation for space-time datasets with missing values

PLAIA, Antonella;BONDI', Anna Lisa
2010-01-01

Abstract

Data consisting in repeated observation on a series of fixed units are very common in different context like biological, environmental and social sciences, and different terminology is often used to indicate this kind of data: panel data, longitudinal data, time series-cross section data (TSCS), spatio-temporal data. Missing information are inevitable in longitudinal studies, and can produce biased estimates and loss of powers. The aim of this paper is to propose a new regression (single) imputation method that, considering the particular structure and characteristics of the data set, creates a “complete” data set that can be analyzed by any researcher on different occasions and using different techniques. Simulated incomplete data from a PM10 dataset recorded in Palermo in 2003 have been generated, in order to evaluate the performance of the imputation method by using suitable performance indicators.
2010
Settore SECS-S/01 - Statistica
Plaia, A., Bondì, A.L. (2010). Regression imputation for space-time datasets with missing values. In F. Palumbo, C.N. Lauro, M.J. Greenacre (a cura di), Data analysis and classification: proceedings of the 6th Conference of the Classification and Data Analysis Group of the Società Italiana di Statistica (pp. 465-472). Springer.
File in questo prodotto:
File Dimensione Formato  
191015_1_En_52.pdf

Solo gestori archvio

Dimensione 318.53 kB
Formato Adobe PDF
318.53 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/48055
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 2
social impact