Predicting Characteristics Associated with Breast Cancer Survival using Multiple Machine Learning Approaches

creativework.keywordsMachine learning, breast cancer,
dc.contributor.advisorMohammad Monirujjaman Khan
dc.contributor.authorMohammad Nazmul Haque
dc.contributor.id1712859042
dc.coverage.departmentElectrical and Computer Engineering
dc.date.accessioned2025-12-04
dc.date.accessioned2025-12-04T09:16:14Z
dc.date.available2025-12-04T09:16:14Z
dc.date.issued2021-12-30
dc.description.abstractBreast cancer is one of the most commonly diagnosed female disorders globally. Numerous studies have been conducted to predict survival markers, although the majority of these analyses were conducted using simple statistical techniques. In lieu of that, this research employed machine learning approaches to develop models for identifying and visualizing relevant prognostic indications of breast cancer survival rates. A comprehensive hospital-based breast cancer dataset was collected from the National Cancer Institute's SEER Program's November 2017 update, which offers population-based cancer statistics. The dataset included female patients diagnosed between 2006 and 2010 with infiltrating duct and lobular carcinoma breast cancer (SEER primary cites recode NOS histology codes 8522/3). The dataset included nine predictor factors and one predictor variable that were linked to the patients' survival status (alive or dead). To identify important prognostic markers associated with breast cancer survival rates, prediction models were constructed using K-nearest neighbor, decision tree, gradient boosting, random forest, adaboost, logistic regression, voting classifier, and support vector machine. All methods yielded close results in terms of model accuracy and calibration measures, with the lowest achieved from Logistic Regression (accuracy = 80.57 percent) and the greatest acquired from random forest (accuracy = 94.64 percent). Notably, the multiple machine learning algorithms utilized in this research achieved high accuracy, suggesting that these approaches might be used as alternative prognostic tools in breast cancer survival studies, especially in the Asian area.
dc.description.degreeUndergraduate
dc.identifier.cd600000274
dc.identifier.urihttps://repository.northsouth.edu/handle/123456789/1522
dc.language.isoen_US
dc.publisherNorth South University
dc.rights@ NSU Library
dc.titlePredicting Characteristics Associated with Breast Cancer Survival using Multiple Machine Learning Approaches
dc.typeProject
oaire.citation.endPage82
oaire.citation.startPage1
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
600000274-abstract.pdf
Size:
366.35 KB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
600000274.pdf
Size:
1.59 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.93 KB
Format:
Item-specific license agreed to upon submission
Description: