Prediction of Depression Severity by Applying Machine Learning to Blood Biomarkers


Kavak R., ÖZEL S. A., Yilmaz O.

9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025, Gaziantep, Türkiye, 27 - 28 Haziran 2025, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/isas66241.2025.11101949
  • Basıldığı Şehir: Gaziantep
  • Basıldığı Ülke: Türkiye
  • Anahtar Kelimeler: blood biomarkers, data imbalance handling, depression severity, machine learning, medical data classification
  • Çukurova Üniversitesi Adresli: Evet

Özet

Depression is a complex mental disorder that negatively impacts an individual's general well-being and daily routine. The diagnosis of depression levels generally relies on the patient's self-reports and clinical assessments. Analyses relying on biological data are more dependable and can provide early diagnosis to protect the patient from severe consequences. This study investigates the potential of using blood biomarkers to predict depression severity (mild-severe) by applying various Machine Learning (ML) techniques. Five different machine learning methods Logistic Regression (LR), k-Nearest Neighbor (KNN), Decision Tree (DT), Random Forest (RF), and Multilayer Perceptron (MLP) were applied to a dataset including 7,326 samples and 107 features provided by Adana Dr. Ekrem Tok Mental Health Hospital. Class imbalance in the training set was eliminated by the Synthetic Minority Oversampling Technique (SMOTE) method, and the performance of the models was measured by the Receiver Operator Characteristic (ROC) curve and Area Under the Curve (AUC). Random Forest achieved the best performance with an AUC of 0.82 after SMOTE dataset. Feature importance analysis before and after SMOTE indicated that TSH, HCV Ab, and Chlorine were among the most important indicators, ranking in the top five in both cases. The results demonstrate that blood biomarkers can be used as reliable predictors for estimating depression severity.