Hyperparameter optimization based machine learning approach for early diagnosis of fetal genetic disorders


YALÇIN E., Koç T. K., Aslan S., DEMİR S. C., Aykut S., SUCU M.

Discover Computing, cilt.28, sa.1, 2025 (SCI-Expanded, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 28 Sayı: 1
  • Basım Tarihi: 2025
  • Doi Numarası: 10.1007/s10791-025-09815-8
  • Dergi Adı: Discover Computing
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Anahtar Kelimeler: Down syndrome, First trimester screening, GPT-4o, hyperparameter tuning, Machine learning
  • Çukurova Üniversitesi Adresli: Evet

Özet

Prenatal screening is the process of analyzing various clinical variables to estimate the risk of genetic disorders like Down Syndrome (DS), which is distinguished by intellectual disability, distinct facial features, and developmental delays. The accuracy of these risk assessments is heavily reliant on the suitability of the risk algorithm for the target population. This study proposes an enhanced machine learning (ML) approach for predicting Down Syndrome (DS) risk using first-trimester screening (FTS) data. The dataset includes clinical information from 959 women with singleton pregnancies at the Çukurova University Gynecology and Obstetrics Unit between 2020 and 2024. To address limitations in existing studies, GPT-4 was utilized to generate synthetic minority-class samples, and advanced feature engineering techniques were incorporated to enhance model robustness and interpretability. A predictive ML model was created, and Hyperparameter Tuning (HT) was applied to optimize it for performance. Eight classifiers were tested, and CatBoost performed the best, achieving 97.39% accuracy and a 2.62% false-positive rate, outperforming the second-best classifier (XGBoost) across all primary evaluation metrics. These improvements highlight the novelty of the framework, particularly its integration of GPT-4–based augmentation and engineered biochemical interaction features. The results demonstrate the model’s potential for reliable DS risk prediction, offering a more efficient and less invasive alternative to traditional diagnostic procedures. By enhancing early risk detection, the method could reduce unnecessary referrals for invasive tests like amniocentesis, thereby minimizing patient anxiety and potential complications. Overall, the study contributes to the development of intelligent, data-driven solutions for prenatal care.