Skip navigation

SCALA - Scaling algorithm for multi-class imbalanced classification: a novel algorithm specifically designed for multi-class multiple minority imbalanced data problems

SCALA - Scaling algorithm for multi-class imbalanced classification: a novel algorithm specifically designed for multi-class multiple minority imbalanced data problems

Barzinji, Ala Othman, Ma, Jixin and Ma, Chaoying (2023) SCALA - Scaling algorithm for multi-class imbalanced classification: a novel algorithm specifically designed for multi-class multiple minority imbalanced data problems. In: ICMLT '23: Proceedings of the 2023 8th International Conference on Machine Learning Technologies. Association for Computing Machinery (ACM), New York, pp. 68-73. ISBN 978-1450398329 (doi:10.1145/3589883.3589893)

[thumbnail of VoR] PDF (VoR)
51797 BARZINJI_SCALA_Scaling_Algorithm_For_Multi-Class_Imbalanced_Classification_(VoR)_2023.pdf - Published Version
Restricted to Repository staff only

Download (476kB) | Request a copy

Abstract

Most of the existing techniques for solving data imbalance problems are geared towards binary classification problems, hence a novel strategy capable of natively handling multi-class classification problems is required. Existing implementations mainly employ a one-versus-rest approach to support multi-class problems and this generalisation hinders its effectiveness in datasets with multiple minority classes. On the contrary, a one-versus-one approach avoids such generalisation and provides finer control over the balancing strategy. In this paper, we propose a novel SCALA algorithm capable of handling imbalanced data with multiple minority class labels with a multi-class output. We introduce a user-defined set of scaling factors which are then integrated with a one-versus-one balancing strategy. Our results show that SCALA demonstrated a significant improvement compared to ADASYN and SMOTE in model performance metrics used to validate balancing techniques. SCALA can balance these datasets without allowing minority classes to overshadow other minority classes. This preserves the information needed by the training algorithm to distinguish between the classes to a high precision.

Item Type: Conference Proceedings
Title of Proceedings: ICMLT '23: Proceedings of the 2023 8th International Conference on Machine Learning Technologies
Additional Information: Proceedings of the 8th International Conference on Machine Learning Technologies (ICMLT 2023), held in Stockholm, Sweden, March 10–12, 2023. - MP
Uncontrolled Keywords: machine learning, data science, algorithms, imbalanced-data classifications, oversampling, one-versus-one approach
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Faculty / School / Research Centre / Research Group: Faculty of Engineering & Science
Faculty of Engineering & Science > School of Computing & Mathematical Sciences (CMS)
Related URLs:
Last Modified: 25 Nov 2025 12:41
URI: https://gala.gre.ac.uk/id/eprint/51797

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics