Modelling Count Data with Excess Zeros: An Application to Health Care Utilisation Data


  • Shamzaeffa Samsudin Universiti Utara Malaysia
  • Peter G Moffatt University of East Anglia


Count data, excess zeros, health care utilisation, Poisson model, zero-inflated negative binomial


This study is concerned with the estimation of microeconometric models of health care utilisation. The data set consists of 14,706 individuals from the General Household Survey for Great Britain, and the dependent variable is the number of General Practitioner (GP) consultations over a 2-week period. A clear feature of this count variable is excess zeros, and it is essential to incorporate this feature in the modelling strategy. Accordingly, in addition to standard Poisson and negative binomial models, zero-inflated, two-part, and latent class models were estimated. The zero-inflated negative binomial model (ZINB) proved, on the basis of several selection criteria, to be superior to other models for this dataset. As anticipated, health related variables had significant effects in determining health care utilisation while socioeconomic variables appeared to be less important, according to the results from the preferred model. Some effects differ quite markedly between the different models, underlining the importance of the type of process used to identify the best-fitting one.


Download data is not yet available.

Author Biographies

Shamzaeffa Samsudin, Universiti Utara Malaysia

School of Economics, Finance and Banking, Universiti Utara Malaysia

Peter G Moffatt, University of East Anglia

School of Economics, University of East Anglia