A SIMPLE AND ACCURATE CLASSIFICATION METHOD BASED ON CLASS ASSOCIATION RULES: ADAPTIVE SUPERVISED DISCRETIZATION WITH RANDOM FOREST ENSEMBLE

Authors

  • Jumaniyazova O.U. Urgench Ranch University, Urgench, Uzbekistan Author

Keywords:

class association rules, associative classification, supervised discretization, random forest, data preprocessing, interpretable machine learning.

Abstract

Class Association Rule (CAR)-based classifiers combine the transparency of rule-based models with the predictive power of supervised learning, yet classical methods such as CBA, CMAR, and CPAR suffer from rule explosion, sensitivity to threshold parameters, and degraded performance on imbalanced datasets. This paper proposes an improved two-phase classification pipeline — Adaptive Supervised Discretization combined with a Random Forest ensemble (ASD-RF) — that addresses these limitations. In the first phase, continuous features are discretized using shallow decision trees trained per feature on the class label, deriving cut points that maximise class homogeneity within each bin. In the second phase, a Random Forest of 100 trees is trained on the resulting one-hot-encoded binary feature matrix. Empirical evaluation on two public benchmark datasets (Pima Indians Diabetes and Banknote Authentication) demonstrates that ASD-RF achieves mean accuracy of 90.14%, outperforming CBA (87.28%), CPAR (88.38%), CMAR (79.03%), and an unprocessed Decision Tree baseline (86.14%). The method is fully reproducible, requires no manual threshold tuning, and produces a compact, interpretable feature representation suitable for high-stakes domains such as healthcare and finance.

References

Liu B., Hsu W., Ma Y. Integrating classification and association rule mining // KDD. — 1998. — P. 80–86.

Li W., Han J., Pei J. CMAR: Accurate and efficient classification based on multiple class-association rules // ICDM. — 2001. — P. 369–376.

Yin X., Han J. CPAR: Classification based on predictive association rules // SDM. — 2003. — P. 331–335.

Agrawal R., Srikant R. Fast algorithms for mining association rules // VLDB. — 1994. — P. 487–499.

Han J., Pei J., Yin Y. Mining frequent patterns without candidate generation // SIGMOD. — 2000. — P. 1–12.

Lohweg V. Banknote authentication [Dataset]. UCI Machine Learning Repository. — 2013. https://doi.org/10.24432/C55P57

Breiman L. Random forests // Machine Learning. — 2001. — Vol. 45, No. 1. — P. 5–32.

Pedregosa F. et al. Scikit-learn: Machine learning in Python // JMLR. — 2011. — Vol. 12. — P. 2825–2830.

Fayyad U., Irani K. Multi-interval discretization of continuous-valued attributes for classification learning // IJCAI. — 1993. — P. 1022–1029.

Downloads

Published

2026-05-05

How to Cite

A SIMPLE AND ACCURATE CLASSIFICATION METHOD BASED ON CLASS ASSOCIATION RULES: ADAPTIVE SUPERVISED DISCRETIZATION WITH RANDOM FOREST ENSEMBLE. (2026). Science and Innovation, 4(33), 7-11. https://www.in-academy.uz/index.php/SI/article/view/39575
Innovative Academy RSC
Article metrics Views and PDF downloads
0 Views
0 Downloads