Evaluating Machine Learning and Association Rule Techniques for Health Data Mining: A Comparative Study on KNN, Naïve Bayes, and Apriori Algorithms

Authors

  • Jose C. Agoylo Jr. BSIT Department, Southern Leyte State University – Tomas Oppus Campus, Southern Leyte, Philippines
  • Ejie C. Florida BSIT Department, Southern Leyte State University – Tomas Oppus Campus, Southern Leyte, Philippines
  • Athena Joy B. Campania BSIT Department, Southern Leyte State University – Tomas Oppus Campus, Southern Leyte, Philippines
  • Mary Ann C. Paulin BSIT Department, Southern Leyte State University – Tomas Oppus Campus, Southern Leyte, Philippines

DOI:

https://doi.org/10.54554/jtec.2025.17.04.006

Keywords:

K-Nearest Neighbors, Naïve Bayes, Apriori, Health Data Mining, Medical Diagnosis, Association Rule Mining

Abstract

Accurate disease classification and behavioral pattern mining are crucial for early intervention and preventive healthcare. While machine learning models have been extensively applied in health informatics, research that compares the performance of K-Nearest Neighbors (KNN), Naïve Bayes (NB), and Apriori algorithms across various health datasets remains limited. This study evaluated the performance of KNN and NB classifiers on five Kaggle datasets covering lung cancer, heart disease, depression, diabetes, and cardiac risk. The Apriori algorithm was used for mining association rules in the Depression dataset. Preprocessing included data, scaling, and dimensionality reduction using Principal Component Analysis (PCA) to improve efficiency. KNN demonstrated more reliable performance than NB across datasets, achieving an average accuracy of more than 0.92, especially in numeric-heavy environments. The study also identified lifestyle factors significantly associated with depression risk through Apriori mining. The findings highlight the strong potential of lightweight machine learning models for real-time health monitoring, early diagnosis, and behavioral intervention applications.

Downloads

Download data is not yet available.

Downloads

Published

2025-12-24

How to Cite

Jose C. Agoylo Jr., Ejie C. Florida, Athena Joy B. Campania, & Mary Ann C. Paulin. (2025). Evaluating Machine Learning and Association Rule Techniques for Health Data Mining: A Comparative Study on KNN, Naïve Bayes, and Apriori Algorithms. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 17(4), 43–48. https://doi.org/10.54554/jtec.2025.17.04.006