Cuntrera, D., Falco, V., Giambalvo, O. (2022). On the Sampling Size for Inverse Sampling. STATS, 5(4), 1130-1144 [10.3390/stats5040067].
On the Sampling Size for Inverse Sampling
Cuntrera, Daniele; Falco, Vincenzo; Giambalvo, Ornella
2022-11-01
Abstract
In the Big Data era, sampling remains a central theme. This paper investigates the characteristics of inverse sampling on two datasets (one real, one simulated) to determine when big data become too small for inverse sampling to be used and to examine the impact of the sampling rate of the subsamples. In both the simulation study and the real-data application, we find that the method, with an appropriate subsample size for both the mean and proportion parameters, performs well on datasets smaller than big data. Different settings for the severity of selection bias are considered in both the simulation study and the real-data application.

File | Size | Format | |
---|---|---|---|
stats-05-00067.pdf (open access; Description: Full article; Type: Publisher's version) | 1.09 MB | Adobe PDF | View/Open |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.