Skip to main content

Machine learning-based approach for predicting low birth weight

Abstract

Background

Low birth weight (LBW) has been linked to infant mortality. Predicting LBW is a valuable preventative tool and predictor of newborn health risks. The current study employed a machine learning model to predict LBW.

Methods

This study implemented predictive LBW models based on the data obtained from the “Iranian Maternal and Neonatal Network (IMaN Net)” from January 2020 to January 2022. Women with singleton pregnancies above the gestational age of 24 weeks were included. Exclusion criteria included multiple pregnancies and fetal anomalies. A predictive model was built using eight statistical learning models (logistic regression, decision tree classification, random forest classification, deep learning feedforward, extreme gradient boost model, light gradient boost model, support vector machine, and permutation feature classification with k-nearest neighbors). Expert opinion and prior observational cohorts were used to select candidate LBW predictors for all models. The area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, and F1 score were measured to evaluate their diagnostic performance.

Results

We found 1280 women with a recorded LBW out of 8853 deliveries, for a frequency of 14.5%. Deep learning (AUROC: 0.86), random forest classification (AUROC: 0.79), and extreme gradient boost classification (AUROC: 0.79) all have higher AUROC and perform better than others. When the other performance parameters of the models mentioned above with higher AUROC were compared, the extreme gradient boost model was the best model to predict LBW with an accuracy of 0.79, precision of 0.87, recall of 0.69, and F1 score of 0.77. According to the feature importance rank, gestational age and prior history of LBW were the top critical predictors.

Conclusions

Although this study found that the extreme gradient boost model performed well in predicting LBW, more research is needed to make a better conclusion on the performance of ML models in predicting LBW.

Peer Review reports

Background

Birth weights less than 2500 g are called low birth weight (LBW). LBW has been linked to infant mortality and its consequences [1]. Predicting LBW is thus a valuable preventative tool and predictor of newborn health risks. Previous research has found that maternal demographics, preexisting health conditions, and prenatal care level are all linked to LBW [2, 3]. Thus, pinpointing which pregnant patients are most likely to have a baby with LBW during the preconception or early pregnancy stages is critical for saving neonatal lives and reducing potentially avoidable medical costs through direct clinical and health policy interventions. There are some documented studies on using ML in perinatal care and maternal health. Previous LBW prediction studies achieved good performance in predicting LBW; however, all previous studies recommended more studies due to study limitations such as small sample size or limited feature selection [4,5,6,7]. In this study, we aimed to evaluate the performance of eight different ML algorithms in predicting LBW.

Methods

The findings of this retrospective cohort study are based on birth records obtained from the “Iranian Maternal and Neonatal Network (IMaN Net),” a legitimate national system, from January 2020 to January 2022. IMaN Net is a comprehensive system for registering maternal and newborn information on the outcomes of each delivery, which is completed daily by midwives in all birth centers and hospitals throughout Iran in an integrated manner. All patients’ personal information was deidentified and not disclosed.

Women with singleton pregnancies above the gestational age of 24 weeks who gave birth during a study period were included. The target population in this study was divided into LBW (≤ 2499 g) and not LBW (≥ 2500 g), which is the national standard definition [8]. Exclusion criteria included multiple pregnancies and fetal anomalies.

A predictive model was built using eight statistical learning models, including logistic regression, decision tree classification, random forest classification, deep learning feedforward, extreme gradient boost classification (XGBoost), light gradient boost (LGB), support vector machine (SVM), and permutation feature classification with k-nearest neighbors (KNN). Expert opinion and prior observational cohorts were used to select candidate LBW predictors for all models [9, 10]. Predictor factors included maternal age, educational level, maternal occupation, place of residence, inadequate prenatal care (less than three prenatal care visits), smoking, drug addiction, maternal anemia, cardiovascular disease, chronic hypertension, hepatitis, COVID-19, overt diabetes, gestational diabetes and thyroid dysfunction, parity, preeclampsia, fetal gender, method of childbirth, previous history of LBW, supplementary and vitamins intake were obtained from patient medical records. We used Chi-square test to evaluate the association between predicting factors mentioned above and LBW. Then we performed ML analysis approach. We followed the Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View to report our findings. The programming language Python was chosen to create the machine learning model. Scikit-learn was used to implement the ML algorithm. Scikit-learn is a machine-learning library written in Python. It includes an extensive collection of cutting-edge machine-learning algorithms for both supervised (including the multi-output classification and regression algorithm) and unsupervised problems [11].

Internal validation was carried out with the help of k-fold cross-validation. The cases were randomly assigned to either the “training set” (70%) or the “test set” (30%) using a random number generator. The original dataset kept the rate of LBW and non-LBW groups in the training and test sets constant. Using the training set, we arranged the parameters of the prediction models and evaluated their performance using the “test set”. The average performance was calculated by repeating these ten times.

Metrics, including area under the receiver operating characteristic curve (AUROC), accuracy, precision, recall, and F1 score, were used to assess the predictive power of the models. The accuracy metric calculates how often a model is correctly predicted across the entire dataset. Precision measures how many of the model’s “positive” predictions were correct. The model’s recall estimates how many positive class samples in the dataset were correctly identified. The F1 score combines precision and recall by using their harmonic mean, and maximizing the F1 score implies maximizing both precision and recall simultaneously. As a result, researchers have chosen the F1 score to evaluate their models in conjunction with accuracy. We used AUROC as the primary performance metric because it is a widely used index to describe the ML model’s ability to predict outcomes. The metrics were scaled from 0 to 1, with higher values indicating a better model [12].

Results

Of 8850 eligible cases, we found 1280 women with a recorded LBW, for a frequency of 14.5%. The demographic and clinical characteristics of study population is given in Table 1. As it shown, maternal age, living residency, gestational age, parity, access to prenatal care, maternal anemia, chronic hypertension, preeclampsia, drug addiction, COVID-19, previous LBW, and newborn gender was linked to LBW.

Table 1 Demographic and clinical factors associated with low birth weight

In this study, we attempt to evaluate parameters and feature selection based on performance parameters using various ML algorithms. A plot ROC chart, as shown in Fig. 1, and calculate AUROC as a plot that allows the user to visualize the tradeoff between the classifier’s sensitivity. Deep learning (AUROC: 0.86), random forest classification (AUROC: 0.79), and XGBoost classification (AUROC: 0.79) all have higher ROC_AUC and perform better than others, as shown in Fig. 1.

Fig. 1
figure 1

AUROC of ML models

Other performance parameters for each algorithm are shown in Table 2. Other performance parameters indicate that the XGBoost classification performs more than all. Random forest classification and deep learning feedforward are also very close. When the accuracy, precision, recall, and F1 score of the models mentioned above with higher AUROC were compared, the XGBoost model was the best model to predict LBW with an accuracy of 0.79, precision of 0.87, recall of 0.69, and F1 score of 0.77.

Table 2 Performance parameters of models with the highest AUROC

The confusion matrix of the XGBoost classification model is shown in Fig. 2.

Fig. 2
figure 2

XGBoost classification confusion matrix

Figure 3 presents an analysis of the importance of variables in the XGBoost algorithm. As the feature importance rank was identified, gestational age and previous history of LBW were the top critical predictors.

Fig. 3
figure 3

Feature importance of the XGBoost classification in the prediction of low birth weight

Discussion

With the exponential growth in the quantity and dimension of healthcare data in recent years, ML approaches for dealing with complex and high-dimensional data have been introduced [13,14,15]. In this study, we aimed to evaluate the performance of eight different ML algorithms in predicting LBW. According to our findings, the XGBoost classification model had a more significant diagnostic performance parameter with an AUROC of 0.79, accuracy of 0.79, precision of 0.87, recall of 0.69, and F1 score of 0.77. XGBoost classification is a supervised machine learning algorithm based on a distributed gradient-boosted decision tree [16]. It can produce consistent results while minimizing overfitting by employing a parallel tree-boosting strategy. Furthermore, XGBoost can use the importance score to determine the importance of each feature. Previous studies evaluating different ML machines for predicting LBW will also have promising results. According to Ahmadi et al., the random forest model performed well in terms of diagnostic performance, with an accuracy of 0.95, recall of 0.72, and AUROC of 0.89 [5]. Another study by Desiani et al. found that naive Bayes had excellent diagnostic performance, with an accuracy of 0.85 and a recall of 0.72 [17]. However, both studies were limited by a small sample size (less than 1000 participants).

Recent studies with larger sample sizes also demonstrated good performance. For example, in a survey by Eliyati et al., with a sample size of 12,500 study participants, SVM showed high diagnostic performance in predicting LBW with an accuracy of 0.93 [18]. Ren et al. used a more extensive study in this field, with a sample size of 266,687 birth records over six years. According to their findings, the XGBoost classification model had the highest recall score of 0.85, but the AUROC score was only 0.61 [19].

Although our study did not have the largest sample size of any study in this field, we believe that using hospital records made our feature selection rich enough to make a reasonable conclusion on identifying LBW risk factors. In our study, we surveyed maternal age, educational level, place of residence, inadequate prenatal care (fewer than three prenatal care visits), drug addiction, maternal anemia, cardiovascular disease, chronic hypertension, pyelonephritis, hepatitis, COVID-19, overt diabetes, gestational diabetes and thyroid dysfunction, parity, preeclampsia, and history of LBW. Among all the potential predisposing factors of LBW, gestational age and previous history of LBW were the top critical predictors. In line with previous findings [20, 21], gestational age is the highest predictor of LBW. Being born too soon (premature birth) is the most common cause of LBW. The prior history of LBW was another weighted factor in predicting LBW. It has been reported that the risk of LBW recurs between pregnancies. Women with a previous LBW baby have been identified as potential carriers of the recurrent risk and have a higher recurrence risk of LBW in their subsequent pregnancy than those with a previous normal birth weight baby [22]. Other factors, such as maternal comorbidities, sociodemographic characteristics, and fetal gender, were not among the weighted factors in predicting LBW, in contrast to previous studies. In one study, Bekele et al. found that fetal gender, marriage to birth interval, mother’s occupation, and mother’s age were all weighted factors in predicting LBW [23]. Another study found that maternal pre-pregnancy weight, mother’s age, number of doctor visits during the first trimester, and previous preterm labor were the most significant risk factors for LBW [4]. The differences in findings could be attributed to study design, the type of ML models used, geographical differences, or imbalances in each study’s dataset. It should be noted, however, that clinicians can use the key findings of each study to identify high-risk pregnant patients early in their pregnancy using early prenatal care screening tools.

Although we used a large dataset with a lot of maternal and neonatal information, a significant variable, like body mass index, was missing in most of the birth records, so we couldn’t use this factor in our selection features, which is a significant limitation of the study.

Conclusion

Using ML approaches to predict LBW yielded promising results. As a result, this study might add to the current perinatal care literature. Although this study found that the XGBoost model performed well in predicting LBW, more research is needed to make a better conclusion on the performance of ML models in predicting LBW.

Availability of data and materials

The datasets generated and analyzed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

LBW:

Low birth weight

ML:

Machine learning

XGBoost:

Extreme gradient boost

LGB:

Light gradient boost

SVM:

Support vector machine

KNN:

K-nearest neighbors

References

  1. Chen Y, Li G, Ruan Y, Zou L, Wang X, Zhang W. An epidemiological survey on low birth weight infants in China and analysis of outcomes of full-term low birth weight infants. BMC Pregnancy Childbirth. 2013;13:242. https://doi.org/10.1186/1471-2393-13-242. PMID:24370213;PMCID:PMC3877972.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Heaman MI, Martens PJ, Brownell MD, Chartier MJ, Derksen SA, Helewa ME. The Association of inadequate and intensive prenatal care with maternal, fetal, and infant outcomes: a population-based study in Manitoba, Canada. J Obstet Gynaecol Can. 2019;41(7):947–59. https://doi.org/10.1016/j.jogc.2018.09.006. Epub 2019 Jan 11 PMID: 30639165.

    Article  PubMed  Google Scholar 

  3. Cunningham FG, Leveno KJ, Bloom SL, Hauth JC, Rouse DJ, Spong CY. Williams obstetrics. New York: McGraw-Hill; 2010. p. 804- 831. 23.

    Google Scholar 

  4. Senthilkumar D, Paulraj S. Prediction of low birth weight infants and its risk factors using data mining techniques. Proceedings of the 2015 International Conference on Industrial Engineering and Operations Management; IMEOM '15. Dubai: IEOM Society; 2015.

  5. Ahmadi P, Alavimajd H, Khodakarim S, Tapak L, Kariman N, Amini P, Pazhuheian F. Prediction of low birth weight using random forest: a comparison with logistic regression. Arch Adv Biosci. 2017;8(3):36–43. https://doi.org/10.22037/jps.v8i3.15412.

    Article  Google Scholar 

  6. Borson N, Kabir M, Zamal Z, Rahman R. Correlation analysis of demographic factors on low birth weight and prediction modeling using machine learning techniques. Proceedings of the 4th World Conference on Smart Trends in Systems, Security and Sustainability; WorldS4 '20. London: Institute of Electrical and Electronics Engineers; 2020. p. 169–73.

  7. Faruk A, Cahyono ES. Prediction and classification of low birth weight data using machine learning techniques. Indones J Sci Technol. 2018;3(1):18–28. https://doi.org/10.17509/ijost.v3i1.10799.

    Article  Google Scholar 

  8. International statistical classification of diseases and related health problems, 10th revision. World Health Organization; 2004. Availabe at: https://apps.who.int/iris/bitstream/handle/10665/42980/9241546530_eng.pdf?sequence=1&isAllowed=y.

  9. Schimmel MS, Bromiker R, Hammerman C, Chertman L, Ioscovich A, Granovsky-Grisaru S, Samueloff A, Elstein D. The effects of maternal age and parity on maternal and neonatal outcome. Arch Gynecol Obstet. 2015;291(4):793–8. https://doi.org/10.1007/s00404-014-3469-0.

    Article  CAS  PubMed  Google Scholar 

  10. Sharifi N, Dolatian M, FathNezhadKazemi A, Pakzad R, Yadegari L. The relationship of the structural and intermediate social determinants of health with low birth weight in Iran: a systematic review and meta-analysis. Sci J Kurdistan Univ Medical Sci. 2018;23(2):21–36. https://doi.org/10.29252/sjku.23.2.21.

    Article  Google Scholar 

  11. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay É. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30. https://doi.org/10.1145/2786984.2786995.

    Article  Google Scholar 

  12. Yen SJ, Lee YS. Under-sampling approaches for improving prediction of the minority class in an imbalanced dataset. Proceedings of the 2016 International Conference on Intelligent Computing; ICIC '06. Kunming: Springer; 2006. p. 731–40.

  13. Boujarzadeh B, Ranjbar A, Banihashemi F, Mehrnoush V, Darsareh F, Saffari M. Machine learning approach to predict postpartum haemorrhage: a systematic review protocol. BMJ Open. 2023;13(1):e067661. https://doi.org/10.1136/bmjopen-2022-067661. PMID:36657750;PMCID:PMC9853215.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Mehrnoush V, Ranjbar A, Farashah MV, Darsareh F, Shekari M, Jahromi MS. Prediction of postpartum hemorrhage using traditional statistical analysis and a machine learning approach. AJOG Glob Rep. 2023;3(2):100185. https://doi.org/10.1016/j.xagr.2023.100185. PMID:36935935;PMCID:PMC10020099.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Darsareh F, Ranjbar A, Farashah MV, Mehrnoush V, Shekari M, Jahromi MS. Application of machine learning to identify risk factors of birth asphyxia. BMC Pregnancy Childbirth. 2023;23(1):156. https://doi.org/10.1186/s12884-023-05486-9. PMID:36890453;PMCID:PMC9993370.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Chen T, He T. xgboost: eXtreme gradient boosting. The Comprehensive R Archive Network; 2017. https://cran.microsoft.com/snapshot/2017-12-11/web/packages/xgboost/vignettes/xgboost.pdf.

  17. Desiani A, Primartha R, Arhami M, Orsalan O. Naive Bayes classifier for infant weight prediction of hypertension mother. J Phys Conf Ser. 2019;1282(1):012005. https://doi.org/10.1088/1742-6596/1282/1/012005.

    Article  Google Scholar 

  18. Eliyati N, Faruk A, Kresnawati ES, Arifieni I. Support vector machines for classification of low birth weight in Indonesia. J Phys Conf Ser. 2019;1282(1):012010. https://doi.org/10.1088/1742-6596/1282/1/012010.

    Article  Google Scholar 

  19. Ren Y, Wu D, Tong Y, López-DeFede A, Gareau S. Issue of data imbalance on low birthweight baby outcomes prediction and associated risk factors identification: establishment of benchmarking key machine learning models with data rebalancing strategies. J Med Internet Res. 2023;25:e44081. https://doi.org/10.2196/44081. PMID:37256674;PMCID:PMC10267797.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Loreto P, Peixoto H, Abelha A, Machado J. Predicting low birth weight babies through data mining. Proceedings of the 2019 World Conference on Information Systems and Technologies; WorldCIST '19; March 27–29, 2018. Naples: Springer; 2019. pp. 568–77.

  21. Khan W, Zaki N, Masud MM, Ahmad A, Ali L, Ali N, Ahmed LA. Infant birth weight estimation and low birth weight classification in United Arab Emirates using machine learning algorithms. Sci Rep. 2022;12(1):12110. https://doi.org/10.1038/s41598-022-14393-6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Mvunta MH, Mboya IB, Msuya SE, John B, Obure J, Mahande MJ. Incidence and recurrence risk of low birth weight in Northern Tanzania: a registry based study. PLoS ONE. 2019;14(4):e0215768. https://doi.org/10.1371/journal.pone.0215768.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Bekele WT. Machine learning algorithms for predicting low birth weight in Ethiopia. BMC Med Inform Decis Mak. 2022;22(1):232. https://doi.org/10.1186/s12911-022-01981-9.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

All of the authors acknowledged Hormozgan University of Medical Sciences.

Funding

None.

Author information

Authors and Affiliations

Authors

Contributions

F.D. and N.R. wrote the proposal. F.M. and V.M. contributed significantly to data collection. The findings were analyzed and interpreted by M.F. F.D., the primary contributor to the manuscript’s commenting and editing. A.R. assessed the manuscript’s scientific content critically. The final manuscript for submission was read and approved by all authors.

Corresponding author

Correspondence to Fatemeh Darsareh.

Ethics declarations

Ethics approval and consent to participate

This study complied with the Declaration of Helsinki and was performed according to ethics committee approval. The Ethics and Research Committee of the Hormozgan University of Medical Sciences approved the study. The records of all patients who provided informed consent for using their data for research purposes were analyzed. For those under the age of 18 the informed consent was taken from their gurdian. Statistical analysis was performed with patient anonymity following ethics committee regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ranjbar, A., Montazeri, F., Farashah, M.V. et al. Machine learning-based approach for predicting low birth weight. BMC Pregnancy Childbirth 23, 803 (2023). https://doi.org/10.1186/s12884-023-06128-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12884-023-06128-w

Keywords