Machine learning for predicting breast cancer risk and outcomes: a scoping review
DOI:
https://doi.org/10.13181/mji.rev.268261Keywords:
breast neoplasm, machine learning, predictive value of testsAbstract
Breast cancer (BC) is the most frequently diagnosed malignancy worldwide and a leading cause of death in women. Machine learning (ML) enables data-driven risk stratification and outcome prediction, potentially improving screening, treatment choices, and overall care. This scoping review cataloged ML models used to predict BC risk and post-treatment outcomes. A PubMed search using MeSH terms (“breast neoplasms,” “breast cancer,” “machine learning,” and “predictive value of tests”) retrieved 44 records, with 9 added by manual searching; 24 studies were included. We excluded 29 reports limited to imaging/radiomics or biomolecular/omics, plus other ineligible items. Model usage among included studies: 13 random forest (RF); 13 support vector machine; 7 decision tree; 5 k-nearest neighbors; 4 deep learning; 3 naïve Bayes; 2 artificial neural networks. Average RF accuracy surpassed 90% in most reports, indicating RF is both common and consistently strong for predictive modeling, supporting targeted clinical translation.
Downloads
References
Santoro M, Strolin S, Paolani G, Della Gala G, Bartoloni A, Giacometti C, et al. Recent applications of artificial intelligence in radiotherapy: where we are and beyond. Appl Sci. 2022;12(7):3223. https://doi.org/10.3390/app12073223
Phung MT, Tin Tin S, Elwood JM. Prognostic models for breast cancer: a systematic review. BMC Cancer. 2019;19(1):230. https://doi.org/10.1186/s12885-019-5442-6
Boeri C, Chiappa C, Galli F, De Berardinis V, Bardelli L, Carcano G, et al. Machine learning techniques in breast cancer prognosis prediction: a primary evaluation. Cancer Med. 2020;9(9):3234-43. https://doi.org/10.1002/cam4.2811
Montazeri M, Montazeri M, Montazeri M, Beigzadeh A. Machine learning models in breast cancer survival prediction. Technol Health Care. 2016;24(1):31-42. https://doi.org/10.3233/THC-151071
Moons KG, de Groot JA, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PLoS Med. 2014;11(10):e1001744. https://doi.org/10.1371/journal.pmed.1001744
Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. https://doi.org/10.1136/bmj.g7594
Ding X, Xie S, Chen J, Mo W, Yang H. A support vector machine model for predicting non-sentinel lymph node status in patients with sentinel lymph node positive breast cancer. Tumour Biol. 2013;34(3):1547-52. https://doi.org/10.1007/s13277-013-0683-5
Ganggayah MD, Taib NA, Har YC, Lio P, Dhillon SK. Predicting factors for survival of breast cancer patients using machine learning techniques. BMC Med Inform Decis Mak. 2019;19(1):48. https://doi.org/10.1186/s12911-019-0801-4
Xu Y, Ju L, Tong J, Zhou C, Yang J. Supervised machine learning predictive analytics for triple-negative breast cancer death outcomes. Onco Targets Ther. 2019;12:9059-67. https://doi.org/10.2147/OTT.S223603
Whitney J, Corredor G, Janowczyk A, Ganesan S, Doyle S, Tomaszewski J, et al. Quantitative nuclear histomorphometry predicts oncotype DX risk categories for early stage ER+ breast cancer. BMC Cancer. 2018;18(1):610. https://doi.org/10.1186/s12885-018-4448-9
Kim I, Choi HJ, Ryu JM, Lee SK, Yu JH, Kim SW, et al. A predictive model for high/low risk group according to oncotype DX recurrence score using machine learning. Eur J Surg Oncol. 2019;45(2):134-40. https://doi.org/10.1016/j.ejso.2018.09.011
Zhu X, Wolfgruber TK, Leong L, Jensen M, Scott C, Winham S, et al. Deep learning predicts interval and screening-detected cancer from screening mammograms: a case-case-control study in 6369 women. Radiology. 2021;301(3):550-8. https://doi.org/10.1148/radiol.2021203758
Pawloski KR, Gonen M, Wen HY, Tadros AB, Thompson D, Abbate K, et al. Supervised machine learning model to predict oncotype DX risk category in patients over age 50. Breast Cancer Res Treat. 2022;191(2):423-30. https://doi.org/10.1007/s10549-021-06443-w
Lindsay WD, Ahern CA, Tobias JS, Berlind CG, Chinniah C, Gabriel PE, et al. Automated data extraction and ensemble methods for predictive modeling of breast cancer outcomes after radiation therapy. Med Phys. 2019;46(2):1054-63. https://doi.org/10.1002/mp.13314
Wu M, Zhong X, Peng Q, Xu M, Huang S, Yuan J, et al. Prediction of molecular subtypes of breast cancer using BI-RADS features based on a "white box" machine learning approach in a multi-modal imaging setting. Eur J Radiol. 2019;114:175-84. https://doi.org/10.1016/j.ejrad.2019.03.015
Akselrod-Ballin A, Chorev M, Shoshan Y, Spiro A, Hazan A, Melamed R, et al. Predicting breast cancer by applying deep learning to linked health records and mammograms. Radiology. 2019;292(2):331-42. https://doi.org/10.1148/radiol.2019182622
Kim J, Yu D, Kwon Y, Lee KS, Sim SH, Kong SY, et al. Genomic characteristics of triple-negative breast cancer nominate molecular subtypes that predict chemotherapy response. Mol Cancer Res. 2020;18(2):253-63. https://doi.org/10.1158/1541-7786.MCR-19-0453
Nicolò C, Périer C, Prague M, Bellera C, MacGrogan G, Saut O, et al. Machine learning and mechanistic modeling for prediction of metastatic relapse in early-stage breast cancer. JCO Clin Cancer Inform. 2020;4:259-74. https://doi.org/10.1200/CCI.19.00133
Mosayebi A, Mojaradi B, Bonyadi Naeini A, Khodadad Hosseini SH. Modeling and comparing data mining algorithms for prediction of recurrence of breast cancer. PLoS One. 2020;15(10):e0237658. https://doi.org/10.1371/journal.pone.0237658
Lou SJ, Hou MF, Chang HT, Chiu CC, Lee HH, Yeh SJ, et al. Machine learning algorithms to predict recurrence within 10 years after breast cancer surgery: a prospective cohort study. Cancers. 2020;12(12):3817. https://doi.org/10.3390/cancers12123817
Liu Z, Mi M, Li X, Zheng X, Wu G, Zhang L. A lncRNA prognostic signature associated with immune infiltration and tumour mutation burden in breast cancer. J Cell Mol Med. 2020;24(21):12444-56. https://doi.org/10.1111/jcmm.15762
Chen X, Zheng J, Zhuo ML, Zhang A, You Z. A six-gene-based signature for breast cancer radiotherapy sensitivity estimation. Biosci Rep. 2020;40(12):BSR20202376. https://doi.org/10.1042/BSR20202376
Pellegrini M. Accurate prediction of breast cancer survival through coherent voting networks with gene expression profiling. Sci Rep. 2021;11(1):14645. https://doi.org/10.1038/s41598-021-94243-z
Wu R, Luo J, Wan H, Zhang H, Yuan Y, Hu H, et al. Evaluation of machine learning algorithms for the prognosis of breast cancer from the surveillance, epidemiology, and end results database. PLoS One. 2023;18(1):e0280340. https://doi.org/10.1371/journal.pone.0280340
Hou C, Zhong X, He P, Xu B, Diao S, Yi F, et al. Predicting breast cancer in Chinese women using machine learning techniques: algorithm development. JMIR Med Inform. 2020;8(6):e17364. https://doi.org/10.2196/17364
Naji MA, Filali SE, Aarika K, Benlahmar EL, Abdelouhahid RA, Debauche O. Machine learning algorithms for breast cancer prediction and diagnosis. Procedia Computer Science. 2021;191:487-92. https://doi.org/10.1016/j.procs.2021.07.062
Rabiei R, Ayyoubzadeh SM, Sohrabei S, Esmaeili M, Atashi A. Prediction of breast cancer using machine learning approaches. J Biomed Phys Eng. 2022;12(3):297-308. https://doi.org/10.31661/jbpe.v0i0.2109-1403
Saleh H, Abd-El Ghany SF, Alyami H, Alosaimi W. Predicting breast cancer based on optimized deep learning approach. Comput Intell Neurosci. 2022;2022:1820777. https://doi.org/10.1155/2022/1820777
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Dini Alyani, Soehartati Gondhowiardjo

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Authors who publish with Medical Journal of Indonesia agree to the following terms:
- Authors retain copyright and grant Medical Journal of Indonesia right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial License that allows others to remix, adapt, build upon the work non-commercially with an acknowledgment of the work’s authorship and initial publication in Medical Journal of Indonesia.
- Authors are permitted to copy and redistribute the journal's published version of the work non-commercially (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in Medical Journal of Indonesia.



