<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root>
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" article-type="research-article" dtd-version="1.2" xml:lang="en"><front><journal-meta><journal-id journal-id-type="publisher-id">RUDN Journal of Engineering Research</journal-id><journal-title-group><journal-title xml:lang="en">RUDN Journal of Engineering Research</journal-title><trans-title-group xml:lang="ru"><trans-title>Вестник Российского университета дружбы народов. Серия: Инженерные исследования</trans-title></trans-title-group></journal-title-group><issn publication-format="print">2312-8143</issn><issn publication-format="electronic">2312-8151</issn><publisher><publisher-name xml:lang="en">Peoples’ Friendship University of Russia named after Patrice Lumumba (RUDN University)</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">45012</article-id><article-id pub-id-type="doi">10.22363/2312-8143-2025-26-2-168-180</article-id><article-id pub-id-type="edn">MBIJVQ</article-id><article-categories><subj-group subj-group-type="toc-heading" xml:lang="en"><subject>Articles</subject></subj-group><subj-group subj-group-type="toc-heading" xml:lang="ru"><subject>Статьи</subject></subj-group><subj-group subj-group-type="article-type"><subject>Research Article</subject></subj-group></article-categories><title-group><article-title xml:lang="en">Machine Learning Methods for Predicting Cardiovascular Diseases: A Comparative Analysis</article-title><trans-title-group xml:lang="ru"><trans-title>Методы машинного обучения для прогнозирования сердечно-сосудистых заболеваний: сравнительный анализ</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0009-0003-6131-2884</contrib-id><name-alternatives><name xml:lang="en"><surname>Temirbayeva</surname><given-names>Aiym B.</given-names></name><name xml:lang="ru"><surname>Темирбаева</surname><given-names>Айым Болатовна</given-names></name></name-alternatives><bio xml:lang="en"><p>MS student in Applied Data Analytics</p></bio><bio xml:lang="ru"><p>магистрант департамента по прикладной аналитике данных</p></bio><email>aiymtemirbaeva@gmail.com</email><xref ref-type="aff" rid="aff1"/></contrib><contrib contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-4939-8876</contrib-id><name-alternatives><name xml:lang="en"><surname>Altybay</surname><given-names>Arshyn</given-names></name><name xml:lang="ru"><surname>Алтыбай</surname><given-names>Аршын</given-names></name></name-alternatives><bio xml:lang="en"><p>PhD of Philosophy, Senior Researcher of the Department of Differential Equations</p></bio><bio xml:lang="ru"><p>доктор философии, старший научный сотрудник департамента дифференциальных уравнений, Институт математики и математического моделирования</p></bio><email>arshyn.altybay@gmail.com</email><xref ref-type="aff" rid="aff1"/></contrib></contrib-group><aff-alternatives id="aff1"><aff><institution xml:lang="en">Astana IT University</institution></aff><aff><institution xml:lang="ru">Астанинский IT-университет</institution></aff></aff-alternatives><pub-date date-type="pub" iso-8601-date="2025-07-03" publication-format="electronic"><day>03</day><month>07</month><year>2025</year></pub-date><volume>26</volume><issue>2</issue><issue-title xml:lang="en"/><issue-title xml:lang="ru"/><fpage>168</fpage><lpage>180</lpage><history><date date-type="received" iso-8601-date="2025-07-14"><day>14</day><month>07</month><year>2025</year></date></history><permissions><copyright-statement xml:lang="en">Copyright ©; 2025, Temirbayeva A.B., Altybay A.</copyright-statement><copyright-statement xml:lang="ru">Copyright ©; 2025, Темирбаева А.Б., Алтыбай А.</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="en">Temirbayeva A.B., Altybay A.</copyright-holder><copyright-holder xml:lang="ru">Темирбаева А.Б., Алтыбай А.</copyright-holder><ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/><license><ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">https://creativecommons.org/licenses/by-nc/4.0</ali:license_ref></license></permissions><self-uri xlink:href="https://journals.rudn.ru/engineering-researches/article/view/45012">https://journals.rudn.ru/engineering-researches/article/view/45012</self-uri><abstract xml:lang="en"><p>The study aims to accurately predict the presence of heart disease using machine learning models. The research evaluates and compares the performance of five algorithms - Logistic Regression, Support Vector Machine (SVM), Decision Tree, Random Forest, and Gradient Boosting - on a dataset containing clinical features of patients. The primary research question is to identify which algorithm demonstrates the best predictive performance for heart disease diagnosis. The study used a dataset of 270 patients with 13 clinical features. The data was preprocessed, and target variables were converted into binary values for classification. The dataset was split into training and test sets in a 70-30 ratio. Five machine learning models were trained and evaluated using metrics such as accuracy, precision, recall, F1-score, and ROC-AUC. Confusion matrices were analyzed to gain additional insights into model performance. Logistic Regression and Random Forest showed the best results among all models, with an accuracy of 86.4 and 80.2%, respectively. The Logistic Regression showed a ROC-AUC score of 0.844, while the Random Forest showed a score of 0.88. The confusion matrices revealed the strengths and weaknesses of each model in terms of forecasting. Logistic Regression and Random Forest were identified as the most reliable models for predicting heart disease in this dataset. Future work will explore hyperparameter tuning and ensemble methods to further enhance model performance, providing valuable insights for early diagnosis and treatment of cardiovascular diseases.</p></abstract><trans-abstract xml:lang="ru"><p>Цель исследования - точное предсказание наличия сердечно-сосудистых заболеваний с помощью моделей машинного обучения. Оценивалась и сравнивалась эффективность пяти алгоритмов: логистической регрессии, машины опорных векторов, дерева решений, случайного леса и градиентного бустинга на наборе данных, содержащем клинические характеристики пациентов. Определялось, какой из алгоритмов демонстрирует наилучшие прогностические характеристики для диагностики заболеваний сердца. Использован набор данных 270 пациентов с 13 клиническими признаками. Данные были предварительно обработаны, а целевые переменные преобразованы в бинарные значения для классификации. Набор данных был разделен на обучающий и тестовый в соотношении 70-30. Пять моделей машинного обучения были обучены и оценены с помощью таких метрик, как точность, recall, precision, F1-score и ROC-AUC. Для получения дополнительной информации о производительности моделей были проанализированы матрицы ошибок. В результате логистическая регрессия и случайный лес показали наилучшие результаты среди всех моделей с точностью 86,4 и 80,2 %, соответственно. Логистическая регрессия продемонстрировала ROC-AUC на уровне 0,844, а случайный лес - 0,88. С помощью матриц путаницы выявлены прогностические достоинства и недостатки каждой модели. Авторами сделаны следующие выводы: логистическая регрессия и случайный лес были определены как наиболее надежные модели для прогнозирования сердечно-сосудистых заболеваний в этом наборе данных. В дальнейшем планируется изучение методов настройки гиперпараметров и ансамбля для повышения эффективности моделей, что позволит получать ценные сведения для ранней диагностики и лечения сердечно-сосудистых заболеваний.</p></trans-abstract><kwd-group xml:lang="en"><kwd>Random Forest</kwd><kwd>support vector machine</kwd><kwd>gradient boosting</kwd><kwd>decision tree</kwd><kwd>Logistic Regression</kwd><kwd>accuracy</kwd></kwd-group><kwd-group xml:lang="ru"><kwd>случайный лес</kwd><kwd>машина опорных векторов</kwd><kwd>градиентное усиление</kwd><kwd>дерево решений</kwd><kwd>логистическая регрессия</kwd><kwd>точность</kwd></kwd-group><funding-group/></article-meta></front><body></body><back><ref-list><ref id="B1"><label>1.</label><mixed-citation>Mendis S, Graham I, Narula J. Addressing the global burden of cardiovascular diseases; need for scalable and sustainable frameworks. Global Heart. 2022;17(1):46. https://doi.org/10.5334/gh.1139 EDN: ALVXJY</mixed-citation></ref><ref id="B2"><label>2.</label><mixed-citation>Mukasheva G, Abenova M, Shaltynov A, Tsigen-gage O, Mussabekova Z, Bulegenov T, Shalgumbaeva G, Semenova Yu. Incidence and mortality of cardiovascular disease in the Republic of Kazakhstan: 2004-2017. Iranian Journal of Public Health. 2022;51(4):821-830. https://doi.org/10.18502/ijph.v51i4.9243 EDN: DHJPUR</mixed-citation></ref><ref id="B3"><label>3.</label><mixed-citation>Abbas S, Ojo S, Hejaili AA, Sampedro GA, Almadhor A, Zaidi M, Kryvinska N. Artificial intelli-gence framework for heart disease classification from audio signals. Scientific Reports. 2024;14(1)3123. https://doi.org/10.1038/s41598-024-53778-7 EDN: UPLLIK</mixed-citation></ref><ref id="B4"><label>4.</label><mixed-citation>Hossain MI, Maruf MH, Khan MAR, Prity FS, Fatema S, Ejaz MS, Khan M. Heart disease prediction using distinct artificial intelligence techniques: performance analysis and comparison. Iran Journal of Computer Science. 2023;6(4):397-417. https://doi.org/10.1007/s42044-023-00148-7 EDN: IKJGNI</mixed-citation></ref><ref id="B5"><label>5.</label><mixed-citation>Zhang H, Zhang P, Wang Z, Chao L, Chen Y, Li Q. Multi-Feature decision fusion network for heart sound abnormality detection and classification. IEEE Journal of Biomedical and Health Informatics. 2024;28(3):1386-1397. https://doi.org/10.1109/jbhi.2023.3307870 EDN: SSTBYM</mixed-citation></ref><ref id="B6"><label>6.</label><mixed-citation>Liu Z, Jiang H, Zhang F, Ouyang W, Li X, Pan X. Heart sound classification based on bispectrum features and Vision Transformer mode. Alexandria Engineering Journal. 2023;85:49-59. https://doi.org/10.1016/j.aej.2023.11.035 EDN: EKYJWK</mixed-citation></ref><ref id="B7"><label>7.</label><mixed-citation>Mahajan RA, Balkhande B, Wanjale K, Chitre A, Jadhav TA, Hundekari SN. Enhancing Heart Disease Risk Prediction Accuracy through Ensemble Classification Techniques. International Journal of Intelligent Systems and Applications in Engineering. 2023;11(10s):701-713. Available from: https://ijisae.org/index.php/IJISAE/article/view/3325 (accessed: 12.09.2024).</mixed-citation></ref><ref id="B8"><label>8.</label><mixed-citation>Rakhimov M, Akhmadjonov R, Javliev S. Artificial intelligence in Medicine for Chronic disease classification using Machine learning. 2022 IEEE 16th International Conference on Application of Information and Communication Technologies (AICT). 2022:1-6 https://doi.org/10.1109/aict55583.2022.10013587</mixed-citation></ref><ref id="B9"><label>9.</label><mixed-citation>Hossain I, Maruf M, Khan MAR, Prity FS, Fatema S, Ejaz MS, Khan M. Heart disease prediction using distinct artificial intelligence techniques: performance analysis and comparison. Iran Journal of Computer Science. 2023;6(4):397-417. https://doi.org/10.1007/s42044-023-00148-7 EDN: IKJGNI</mixed-citation></ref><ref id="B10"><label>10.</label><mixed-citation>Erdem K, Yildiz MB, Yasin ET, Koklu M.A detailed analysis of detecting heart diseases using artificial intelligence methods. Intelligent Methods in Engineering Sciences. 2023;2(4):115-124 https://doi.org/10.58190/imiens.2023.71 EDN: DYZTFY</mixed-citation></ref><ref id="B11"><label>11.</label><mixed-citation>Salman HA, Kalakech A, Steiti A. Random Forest algorithm Overview. Babylonian journal of machine learning. 2024;2024:69-79. https://doi.org/10.58496/bjml/2024/007 EDN: HWNARA</mixed-citation></ref><ref id="B12"><label>12.</label><mixed-citation>Wang Q. Support Vector machine algorithm in machine learning. 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA). 2022:750-756. https://doi.org/10.1109/icaica54878.2022.9844516</mixed-citation></ref><ref id="B13"><label>13.</label><mixed-citation>Berrendero JR, Bueno-Larraz B, Cuevas A. On functional logistic regression: some conceptual issues. Test. 2022;32(1):321-349. https://doi.org/10.1007/s11749-022-00836-9 EDN: XCAHRR</mixed-citation></ref><ref id="B14"><label>14.</label><mixed-citation>Bentéjac C, Csörgő A, Martínez-Muñoz G. A com-parative analysis of gradient boosting algorithms. Artificial Intelligence Review. 2020;54(3):1937-1967. https://doi.org/10.1007/s10462-020-09896-5</mixed-citation></ref><ref id="B15"><label>15.</label><mixed-citation>Levy JJ, O’Malley AJ. Don’t dismiss logistic re-gression: the case for sensible extraction of interactions in the era of machine learning. BMC Medical Research Methodology. 2020;20(1):171. https://doi.org/10.1186/s12874-020-01046-3</mixed-citation></ref><ref id="B16"><label>16.</label><mixed-citation>Liew BXW, Kovacs FM, Rugamer D, Royuela A. Machine learning versus logistic regression for prognostic modelling in individuals with non-specific neck pain. European Spine Journal. 2022;31(8):2082-2091. https://doi.org/10.1007/s00586-022-07188-w EDN: YWKGZQ</mixed-citation></ref><ref id="B17"><label>17.</label><mixed-citation>Becker T, Rousseau A, Geubbelmans M, Burzykowski T, Valkenborg D. Decision trees and random forests. American Journal of Orthodontics and Dentofacial Ortho-pedics. 2023;164(6):894-897. https://doi.org/10.1016/j.ajodo.2023.09.011 EDN: QKTJHR</mixed-citation></ref><ref id="B18"><label>18.</label><mixed-citation>Mahajan RA, Balkhande B, Kirti Wanjale K, Chitre A, Jadhav TA, Hundekar SN. Enhancing Heart Disease Risk Prediction Accuracy through Ensemble Classification Techniques. International Journal of Intelligent Systems and Applications in Engineering. 2023;11(10s):701-713. Available from: https://ijisae.org/index.php/IJISAE/article/view/3325/1911 (accessed: 12.09.2024).</mixed-citation></ref></ref-list></back></article>
