Comparison of the Performance of XGBoost, CatBoost, and GBM Algorithms in Cardiovascular Disease Prediction

Authors

  • Panwasto Samosir P Universitas Mercu Buana
  • Umniy Salamah

DOI:

https://doi.org/10.36085/jsai.v8i1.7552

Abstract

Cardiovascular disease remains the primary cause of mortality globally, encompassing conditions affecting the heart and blood vessels, such as hypertension and coronary artery disease. Risk factors include unhealthy lifestyle habits and immutable factors like age and family history. To tackle the challenges in early detection and prediction of cardiovascular disease, machine learning techniques, especially boosting algorithms, have emerged as promising tools. This study evaluates the performance of three prominent boosting algorithms: XGBoost, CatBoost, and Gradient Boosting—using publicly available datasets to predict cardiovascular disease risk. The findings reveal that CatBoost surpasses the other models with an accuracy of 75%, a Precision of 0.83, and a ROC AUC of 0.81, highlighting its exceptional predictive capabilities. Gradient Boosting achieves 70% accuracy with a well-balanced Recall and Precision, whereas XGBoost records the lowest performance with 63.3% accuracy across all metrics. These results position CatBoost as the most effective model for cardiovascular disease risk prediction.

Downloads

Published

2025-01-31

Issue

Section

Articles
Abstract viewed = 0 times