TY - JOUR
T1 - Statistical modelling for falls count data
AU - Ullah, Shahid
AU - Finch, Caroline F.
AU - Day, Lesley
N1 - Funding Information:
Project work supported by a grant from the Australian Government Department of Health and Ageing to undertake falls modelling research provided the impetus for this paper. John Campbell and Clare Robertson, Department of Medical and Surgical Sciences, Dunedin School of Medicine, University of Otago, New Zealand provided the falls data from the New Zealand trial used in this study. Dominique Lord, Zachry Department of Civil Engineering, Texas A&M University and Byung-Jung Park, Texas Transportation Institute Provided R code for finite mixture models.
Funding Information:
Shahid Ullah was supported by an Injury Trauma and Rehabilitation (ITR) Research Fellowship funded through a National Health and Medical Research Council (NHMRC) Capacity Building Grant in Population Health. Caroline Finch was supported by an NHMRC Principal Research Fellowship. Lesley Day was supported by an NHMRC Senior Research Fellowship.
PY - 2010/3
Y1 - 2010/3
N2 - Falls and their injury outcomes have count distributions that are highly skewed toward the right with clumping at zero, posing analytical challenges. Different modelling approaches have been used in the published literature to describe falls count distributions, often without consideration of the underlying statistical and modelling assumptions. This paper compares the use of modified Poisson and negative binomial (NB) models as alternatives to Poisson (P) regression, for the analysis of fall outcome counts. Four different count-based regression models (P, NB, zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB)) were each individually fitted to four separate fall count datasets from Australia, New Zealand and United States. The finite mixtures of P and NB regression models were also compared to the standard NB model. Both analytical (F, Vuong and bootstrap tests) and graphical approaches were used to select and compare models. Simulation studies assessed the size and power of each model fit. This study confirms that falls count distributions are over-dispersed, but not dispersed due to excess zero counts or heterogeneous population. Accordingly, the P model generally provided the poorest fit to all datasets. The fit improved significantly with NB and both zero-inflated models. The fit was also improved with the NB model, compared to finite mixtures of both P and NB regression models. Although there was little difference in fit between NB and ZINB models, in the interests of parsimony it is recommended that future studies involving modelling of falls count data routinely use the NB models in preference to the P or ZINB or finite mixture distribution. The fact that these conclusions apply across four separate datasets from four different samples of older people participating in studies of different methodology, adds strength to this general guiding principle.
AB - Falls and their injury outcomes have count distributions that are highly skewed toward the right with clumping at zero, posing analytical challenges. Different modelling approaches have been used in the published literature to describe falls count distributions, often without consideration of the underlying statistical and modelling assumptions. This paper compares the use of modified Poisson and negative binomial (NB) models as alternatives to Poisson (P) regression, for the analysis of fall outcome counts. Four different count-based regression models (P, NB, zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB)) were each individually fitted to four separate fall count datasets from Australia, New Zealand and United States. The finite mixtures of P and NB regression models were also compared to the standard NB model. Both analytical (F, Vuong and bootstrap tests) and graphical approaches were used to select and compare models. Simulation studies assessed the size and power of each model fit. This study confirms that falls count distributions are over-dispersed, but not dispersed due to excess zero counts or heterogeneous population. Accordingly, the P model generally provided the poorest fit to all datasets. The fit improved significantly with NB and both zero-inflated models. The fit was also improved with the NB model, compared to finite mixtures of both P and NB regression models. Although there was little difference in fit between NB and ZINB models, in the interests of parsimony it is recommended that future studies involving modelling of falls count data routinely use the NB models in preference to the P or ZINB or finite mixture distribution. The fact that these conclusions apply across four separate datasets from four different samples of older people participating in studies of different methodology, adds strength to this general guiding principle.
KW - Fall count data
KW - Finite mixture models
KW - Model fit
KW - Negative binomial
KW - Regression modelling
KW - Simulation study
KW - Zero-inflated models
UR - https://www.scopus.com/pages/publications/76049129219
U2 - 10.1016/j.aap.2009.08.018
DO - 10.1016/j.aap.2009.08.018
M3 - Article
C2 - 20159058
AN - SCOPUS:76049129219
SN - 0001-4575
VL - 42
SP - 384
EP - 392
JO - Accident Analysis and Prevention
JF - Accident Analysis and Prevention
IS - 2
ER -