Multi-Class Machine Learning Models for Predicting Order Cancellations and Returns in E-Commerce


Sari Z. S., Yurdagul H. H., Taskapi B., Dagdas G., AKAY M. F.

9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025, Gaziantep, Türkiye, 27 - 28 Haziran 2025, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/isas66241.2025.11101817
  • Basıldığı Şehir: Gaziantep
  • Basıldığı Ülke: Türkiye
  • Anahtar Kelimeler: E-commerce, Machine Learning, Order Cancellation and Return, Prediction
  • Çukurova Üniversitesi Adresli: Evet

Özet

E-commerce became a central component of sustainable growth strategies for businesses, driven by evolving consumer behavior and accelerating digitalization.In this study, multi-class machine learning models were developed for predicting order cancellations and returns in the e-commerce sector. A real-world dataset containing 1 0, 0 0 0 order records collected from a client of Inveon, covering the period from 2020 to 2024, has been used.The performance of five classification algorithms, including Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), and Deep Neural Network (DNN), was evaluated and compared. Four feature selection techniques were applied to enhance predictive performance, including minimum Redundancy Maximum Relevance (mRMR), Relief-F, FClassification, and a hybrid approach. Model generation has been carried out using 1 0-fold cross-validation and 3-fold grid search for hyperparameter tuning. RF- and XGBoost-based models achieved the highest performance, with F1-scores reaching up to 0,96. Regarding class-specific prediction capability, RF-, XGBoost-, SVM-, and DNN-based models yielded better results in cancellation prediction, while LR-based models performed more effectively in return prediction. Among the feature selection techniques, Relief-F and the hybrid method made the most notable contributions to model performance, especially in RF- and XGBoost-based models.