Original Article

Prediction of the breast cancer mortality rate and its effective factors using genetic algorithm and logistic regression

Abstract

Background: Logistic regression is one of the most common models used to predict and classify binary and multiple state responses in medicine. Genetic algorithms search techniques inspired by biology have recently been used successfully as a predictive model.

Objective: The aim of this study was to use the genetic algorithm and logistic regression models in diagnosing and predicting factors affecting breast cancer mortality.

Method: In this study, data of 2836 people with breast cancer during the years 2014-2018 was examined; their information was recorded in the cancer registration system of Kerman University of Medical Sciences. Death status was considered a dependent variable, while age, morphology, tumor differentiation (grad), residence status, and place of residence were considered independent variables. Sensitivity, specificity, accuracy, and area under the ROC curve were used to compare the models.

Results: the logistic regression model determined factors affecting the breast cancer mortality rate, (with sensitivity (0.62), specificity (0.81), area under the ROC curve (0.74), and accuracy (0.84)), and genetic algorithm model (, with sensitivity (0.19), specificity (0.97), area under the ROC curve (0.58) and accuracy (0.87)).

Conclusions: The sensitivity and area under the ROC curve of the logistic regression model were higher than those of the genetic algorithm, but the specificity and accuracy of the genetic algorithm were higher than those of the logistic regression. According to the purpose of the study, two models can be used simultaneously.

1. Chen T-C, Hsu T-C. A GAs based approach for mining breast cancer pattern. Expert Systems with Applications.
2006;30(4):674-81.
2. Al-Maqaleh B, Shahbazkia H. A Genetic Algorithm for Discovering Classification Rules in Data Mining. International Journal of Computer Applications. 2012;41:40-4.
3. Eken C, Bilge U, Kartal M, Eray O. Artificial neural network, genetic algorithm, and logistic regression applications for
predicting renal colic in emergency settings. International journal of emergency medicine.2009;2(2):99-105.
4. Holland JH. Adaptation in natural and artificial systems: MIT Press; 1992.
5. Johnson P, Vandewater L, Wilson W, Maruff P, Savage G, Graham P, et al. Genetic algorithm with logistic regression
for prediction of progression to Alzheimer's disease. BMC Bioinformatics. 2014;15 Suppl 16:S11.
6. Sivanandam SN, Deepa SN. Introduction to Genetic Algorithms: Springer Publishing Company, Incorporated; 2007.
7. Christophides D, Appelt AL, Gusnanto A, Lilley J, Sebag-Montefiore D. Method for Automatic Selection of Parameters in NormalTissue Complication Probability Modeling. International journal of radiation oncology, biology, physics. 2018;101(3):704-12.
8. Celentano DD, Szklo M, Gordis L. Gordis epidemiology2019.
9. Sastry K, Goldberg D, Kendall G. Genetic Algorithms. In: Burke EK, Kendall G, editors. Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques. Boston, MA: Springer US; 2005.p. 97-125.
10. Liou DM, Chang WP. Applying data mining for the analysis of breast cancer data. Methods in molecular biology (Clifton, NJ). 2015;1246:175-89.
11. Kleinbaum DG, Klein M. Introduction to Logistic Regression. In: Kleinbaum DG, Klein M, editors. Logistic Regression: A Self- Learning Text. New York, NY: Springer NewYork; 2010. p. 1-39.
12. Jahani M, Mahdavi M. Comparison of Predictive Models for the Early Diagnosis of Diabetes. Healthcare informatics research. 2016;22(2):95-100.
13. Song HJ, Yang ES, Kim JD, Park CY,Kyung MS, Kim YS. Best serum biomarker combination for ovarian cancer classification. Biomedical engineering online. 2018;17(Suppl 2):152.
14. Sun LM, Chiu HW, Chuang CY, Liu L. A prediction model based on an artificial intelligence system for moderate to severe obstructive sleep apnea. Sleep & breathing = Schlaf & Atmung. 2011;15(3):317-23.
15. Chang C-L, Hsu M-Y. The study that applies artificial intelligence and logistic regression for assistance in differential
diagnostic of pancreatic cancer. Expert Systems with Applications. 2009;36(7):10663-72.
16. Arslanian-Engoren C, Engoren M. Using a Genetic Algorithm to Predict Evaluation of Acute Coronary Syndromes.
Nursing research. 2007;56:82-8.
Files
IssueVol 8 No 1 (2022) QRcode
SectionOriginal Article(s)
DOI https://doi.org/10.18502/jbe.v8i1.10403
Keywords
: Breast cancer cross over mutation Genetic algorithm Logistic regression

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
1.
mirzaie mahdieh, jahani yunes, bahrampour abbas. Prediction of the breast cancer mortality rate and its effective factors using genetic algorithm and logistic regression. JBE. 2022;8(1):37-44.