Original Article

Beta-Geometric Regression for Modeling Count Data on First Antenatal Care Visit (ANC) with Application

Abstract

Introduction: Although geometric distribution, which is a special case of Negative Binomial (NB) distribution, also belongs to the discrete family of distributions, little attention has been paid to modeling count data with the geometric distribution. There are many real-life phenomena that follow the geometric distribution with a constant probability of first success. However, in practice, the probability of the first success may vary from trial to trial, making simple geometric models unsuitable for modeling such data. In this paper, assuming that the probability of the first success follows a Beta distribution, we developed a Beta-geometric distribution and Beta-geometric regression for modeling the count data that follow the geometric distribution and illustrated the suitability of the model through application to the count data on time to first antenatal care (ANC) visit.   

Methods: The statistical properties of the Beta-geometric distribution are discussed. The estimation of the parameters of the distribution using the method of moments, maximum likelihood estimation (MLE) method, and Bayesian estimation approach are provided. Based on the Beta-geometric distribution, we developed a new Beta-geometric regression model for analyzing count data that follow the geometric distribution. The goodness of fit of the derived model has been tested using real data on time to the first ANC visit.

Results: Beta-geometric distribution has a simple form for its probability mass function (pmf), and is flexible in capturing both underdispersion and overdispersion that may present in count data.  It was found that the proposed Beta-geometric regression model fit the count data on the first ANC visit better than simple geometric distribution or Negative Binomial distribution.

Conclusion: Unlike the Poisson or Negative Binomial distribution, Beta-geometric distribution does not need an additional parameter to accommodate underdispersion or overdispersion and thus could be a flexible choice for analyzing any count data.

1. Lewis-Beck, M. Applied regression: An introduction. Newbury Park, CA: Sage, 1989.
2. Hilbe JM. Negative binomial overdispersion. In Negative Binomial Regression (pp. 180-185). Cambridge University Press New York, NY, 2012.
3. Coxe, S, West SG & Aiken LS. (2009) The analysis of count data: A gentle introduction to Poisson regression and its aternatives. Journal of personality assessment.2009;91(2):121-136.
4. Sellers KF & Shmueli G. A flexible regression model for count data. The Annals of Applied Statistics.2010; 4(2): 943-961.
5. Cameron A & Trivedi PK. Econometric models based on count data: Comparisons and applications of some estimators and tests. Journal of Applied Econometrics.1986;1: 29-53.
6. Winkelmann R & Zimmermann KF. Recent developments in count data modelling: theory and application. Journal of economic surveys.1995; 9(1): 1-24.
7. Wang W & Famoye F. Modeling household fertility decisions with generalized Poisson regression. Journal of Population Economics.1997;10(3): 273-283.
8. Mullahy J. Specification and testing in some modified count data models. Journal of Econometrics. 1986; 33: 341-365.
9. Cameron AC & Trivedi PK. Regression Analysis of Count Data. Cambridge UniversityPress, Cambridge, 1998.
10. World Health Organization(WHO). WHO recommendations on antenatal care for a positive
pregnancy experience. Geneva, 2016.
http://apps.who.int/iris/bitstream/10665/250796/1/9789241549912-eng.pdf?ua=1.
11. Lawn JE, Blencowe H, Waiswa P, Amouzou A, Mathers C, Hogan D & Shiekh S. Stillbirths: rates, risk factors, and acceleration towards 2030. The Lancet. 2016; 387(10018): 587-603.
12. Pell C, Meñaca A, Were F, Afrah NA., Chatio S, Manda-Taylor, L, et al. Factors Affecting Antenatal Care Attendance: Results from Qualitative Studies in Ghana, Kenya and Malawi. PLoS ONE.2013; 8(1): e53747.
13. Feleke1 SA, Mulatu MA, & Yesmaw YS. Medication administration error: magnitude and associated factors among nurses in Ethiopia. BMC Nursing.2015; 14: 53. s12912-015-0099-1.pdf
14. Guleman H & Berhane Y. Timing of First Antenatal Care Visit and its Associated Factors among Pregnant Women Attending Public Health Facilities in Addis Ababa, Ethiopia. Ethiop J Health Sci. 2017;27(2), 139–146.
15. Gebresilassie B, Belete T, Tilahun W, Berhane B & Gebresilassie S. Timing of first antenatal care attendance and associated factors among pregnant women in public health institutions of Axum town, Tigray, Ethiopia, 2017: a mixed design study. BMC Pregnancy and Childbirth. 2019; 19: 340.
16. Suissa S & Blais L. Binary regression with continuous outcomes. Stat Med. 1995;14:247–55.
17. Moser BK & Coombs LP. Odds ratios for a continuous outcome variable without dichotomizing. Stat Med.2004; 23:1843–60.
18. Suissa S. Binary methods for continuous outcomes: a parametric alternative. J Clin Epidemiol. 1991;44: 241–8.
19. Peacock JL, Sauzet O, Ewings SM & Kerry SM. Dichotomising continuous data while retaining statistical power using a distributional approach. Stat Med. 2012; 21:3089–103.
20. Preisser JS, Das K, Benecha H & Stamm JW. 2016. Logistic Regression for Dichotomized Counts. Stat Methods Med Res.2016; 25:3038–56.
21. Sroka CJ & Nagaraja HN. Odds ratios from logistic, geometric, Poisson, and negative binomial regression models. BMC Medical Research Methodology.2018; 18: 112
22. Pradhan B & Kundu D. Bayes estimation and prediction of the two-parameter gamma distribution. Journal of Statistical Computation and Simulation, 2011; 81(9): 1187-1198.
23. Al-Balushi ZMD & Islam MM. Geometric regression for modeling count data on the time-to-first antenatal care visit. Journal of Statistics: Advance in Theory and Applications. 2020; 23(1): 35-57.
24. Dobson AJ & Barnett AG. An introduction to generalized linear models. CRC press, 2018.
25. Weinberg CR & Gladen BC. The beta-geometric distribution applied to comparative fecundability studies. Biometrics. 1986; 42(3):547-560.
26. Kemp AW. The q-beta-geometric distribution as a model for fecundability. Communications in Statistics-Theory and Methods.2001; 30(11), 2373-2384.
27. Paul SR. Testing goodness of fit of the geometric distribution: an application to human fecundability data. Journal of Modern Applied Statistical Methods. 2005; 4(2): 8.

28. Miller AJ. A queueing model for road traffic flow, Journal of the Royal Statistical Society, Series B. 1961; 23: 64–90.

29. Pielou EC. Runs of one species with respect to another in transects through plant populations. Biometrics. 1962; 18(4):579-593.

30. Griffiths DA. Maximum likelihood estimation for the beta-binomial distribution and an application to the household distribution of the total number of cases of a disease. Biometrics. 1973; 29:637 – 648.

31. Hughes G & Madden LV. Using the beta-binomial distribution to describe aggregated patterns of disease incidence. Phytopathology. 1993 83:759-763

32. Okorie IE, Akpanta AC, Ohakwe J, Chikezie DC & Onyemach CU. On the Rayleigh-geometric distribution with applications. Heliyon,. 2019; 5: e02200.

33. King M. Data Reconciliation. Statistics for Process Control Engineers: A Practical Approach, John Wiley & Sons, Ltd, Chichester, UK, 2017.
34. Consul PC & Jain GC. A generalization of the Poisson distribution. Technometrics. 1973;15(4): 791-799.
35. Nelder JA & Wedderburn, RW. Generalized linear models. Journal of the Royal Statistical Society: Series A (General). 1972; 135(3): 370-384.
36. McCullagh P & Nelder JA. Generalized Linear Models 2nd Edition Chapman and Hall. London, UK, 1989.
37. Montgomery DC, Peck EA & Vining GG. Introduction to linear regression analysis (Vol. 821). John Wiley & Sons, Inc., New Jersey, USA., 2012.
38. Riyami A, Afifi M, Al-Kharusi H & Morsi M, National Health Survey, Volume 2, Reproductive Health Survey, Ministry of Health, Muscat, Oman, 2000.
Files
IssueVol 9 No 1 (2023) QRcode
SectionOriginal Article(s)
DOI https://doi.org/10.18502/jbe.v9i1.13977
Keywords
Beta-Geometric distribution Beta-geometric regression Count data Modeling Antenatal care visits

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
1.
Al-Balushi Z, Sarr A, Islam MM. Beta-Geometric Regression for Modeling Count Data on First Antenatal Care Visit (ANC) with Application. JBE. 2023;9(1):68-85.