Title: Synthetic Resampling Algorithm for Response Class Imbalance in Supervised Learning: Application to Road Accident Severity Prediction
Conference: EWGT2025
Tags: Accident severity, Road accident, Supervised Learning and Unbalanced data
Abstract: Road traffic injuries are a leading cause of death worldwide and are projected to become even more critical by 2030. Understanding the factors that influence crash severity is essential for developing effective safety interventions. However, crash data often suffer from severe class imbalance, especially when distinguishing between fatal and non-fatal accidents. Traditional machine learning algorithms tend to perform poorly under these conditions, favoring the majority class and misclassifying critical minority cases. To address this, we propose a novel resampling algorithm, SONCA (Synthetic Over-sampling for Numerical and Categorical variables), designed to balance datasets containing mixed data types. Unlike existing oversampling methods, SONCA handles numerical, ordinal, nominal, and dichotomous variables. We evaluated SONCA using both a parametric model (Logit) and a non-parametric model (CART) on imbalanced data from the PTW-ISTAT dataset. The models estimated on the original data failed to detect the minority class effectively, while models estimated on SONCA-resampled data showed substantial improvements in True Positive Rate, G-mean, and F-measure. These results demonstrate SONCA's potential as a flexible, model-agnostic preprocessing tool for addressing class imbalance in diverse real-world scenarios.
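The abstract reports True Positive Rate, G-mean, and F-measure. As a minimal sketch of how these are commonly computed from a binary confusion matrix, the following uses the standard textbook definitions, with the minority (fatal) class as the positive class. The function name `imbalance_metrics` and the counts in the usage lines are hypothetical illustrations, not values or code from the paper.

```python
import math


def imbalance_metrics(tp, fn, fp, tn):
    """Return TPR, G-mean and F-measure for a binary confusion matrix.

    tp, fn: positive (minority) cases predicted correctly / missed.
    fp, tn: negative (majority) cases flagged wrongly / predicted correctly.
    Ratios with a zero denominator are reported as 0.0 (a convention
    assumed here, not taken from the paper).
    """
    tpr = tp / (tp + fn) if (tp + fn) else 0.0        # recall on the minority class
    tnr = tn / (tn + fp) if (tn + fp) else 0.0        # recall on the majority class
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    g_mean = math.sqrt(tpr * tnr)                     # geometric mean of both recalls
    denom = precision + tpr
    f_measure = 2 * precision * tpr / denom if denom else 0.0  # F1 score
    return {"tpr": tpr, "g_mean": g_mean, "f_measure": f_measure}


# Hypothetical counts: a classifier that always predicts the majority class.
always_majority = imbalance_metrics(tp=0, fn=10, fp=0, tn=990)

# Hypothetical counts: a classifier that recovers most minority cases.
detects_minority = imbalance_metrics(tp=8, fn=2, fp=90, tn=900)
```

With these illustrative counts, the first classifier is right on 990 of 1000 cases yet scores zero on all three metrics, which is why such metrics are preferred over plain accuracy when the response classes are imbalanced.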
