When the data are completely or quasi-completely separated, the maximum likelihood estimates of the logistic regression parameters do not exist. In medical research this problem is of great importance because finite odds ratios are needed. Standard statistical packages do not solve the estimation problem for non-overlapping data sets. We suggest applying the hidden logistic regression model and the MEL estimator of Rousseeuw and Christmann (2003), where a unique solution is obtained graphically by inspection of the ridge trace of the regression parameters (IRT). Alternatively, we introduce a cross-validation (CV) based method to choose the regularization parameter. A real data set on oral candidosis is considered. Our analysis indicates that CV, rather than IRT, leads to ML estimates with the minimum misclassification error rate.
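
The separation problem and the pseudo-observation idea behind hidden logistic regression can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single symmetric constant `delta` (a simplification of the two constants used by Rousseeuw and Christmann), a plain gradient-ascent fitter, and toy data. The names `fit_logistic`, `delta`, and the data values are hypothetical.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(x, targets, steps, lr=0.5):
    """Gradient ascent on the logistic log-likelihood (intercept a, slope b).

    `targets` may be 0/1 responses or fractional pseudo-observations.
    """
    a = b = 0.0
    n = len(x)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for xi, ti in zip(x, targets):
            resid = ti - sigmoid(a + b * xi)
            grad_a += resid
            grad_b += resid * xi
        a += lr * grad_a / n
        b += lr * grad_b / n
    return a, b

# Toy data (illustrative only): completely separated at x = 0.
x = [-2.0, -1.5, -1.0, -0.5, 0.5, 1.0, 1.5, 2.0]
y = [0, 0, 0, 0, 1, 1, 1, 1]

# Assumed symmetric shrinkage constant; not a value from the paper.
delta = 0.05
pseudo = [(1 - yi) * delta + yi * (1 - delta) for yi in y]

# Ordinary ML: the slope keeps growing as the optimizer runs longer.
_, ml_short = fit_logistic(x, y, steps=2000)
_, ml_long = fit_logistic(x, y, steps=20000)

# Pseudo-observations: the slope settles at a finite value.
_, mel_short = fit_logistic(x, pseudo, steps=2000)
_, mel_long = fit_logistic(x, pseudo, steps=20000)

print("ML slope after 2000 / 20000 steps:", ml_short, ml_long)
print("pseudo-observation slope after 2000 / 20000 steps:", mel_short, mel_long)
```

With 0/1 responses the slope, and hence the odds ratio `exp(b)`, grows without bound as iterations increase, whereas the fit to the pseudo-observations stabilizes. In this sketch `delta` plays the role of a regularization constant; how it should be chosen (ridge trace or cross-validation) is the subject of the paper and is not reproduced here.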

Giaimo R, Matranga D, Campisi G (2006). Odds ratio estimation in the presence of complete or quasi-complete separation in data. Statistica Applicata, 18(3), 429-444.

Odds ratio estimation in the presence of complete or quasi-complete separation in data

GIAIMO, Rosa; MATRANGA, Domenica; CAMPISI, Giuseppina
2006-01-01


Use this identifier to cite or link to this document: https://hdl.handle.net/10447/24457