Skip to main content

Predicting adverse outcomes in pregnant patients positive for SARS-CoV-2: a machine learning approach- a retrospective cohort study

A Comment to this article was published on 02 August 2023

Abstract

Background

Pregnant people are particularly vulnerable to SARS-CoV-2 infection and to ensuing severe illness. Predicting adverse maternal and perinatal outcomes could aid clinicians in deciding on hospital admission and early initiation of treatment in affected individuals, streamlining the triaging processes.

Methods

An international repository of 1501 SARS-CoV-2-positive cases in pregnancy was created, consisting of demographic variables, patient comorbidities, laboratory markers, respiratory parameters, and COVID-19-related symptoms. Data were filtered, preprocessed, and feature selection methods were used to obtain the optimal feature subset for training a variety of machine learning models to predict maternal or fetal/neonatal death or critical illness.

Results

The Random Forest model demonstrated the best performance among the trained models, correctly identifying 83.3% of the high-risk patients and 92.5% of the low-risk patients, with an overall accuracy of 89.0%, an AUC of 0.90 (95% Confidence Interval 0.83 to 0.95), and a recall, precision, and F1 score of 0.85, 0.94, and 0.89, respectively. This was achieved using a feature subset of 25 features containing patient characteristics, symptoms, clinical signs, and laboratory markers. These included maternal BMI, gravidity, parity, existence of pre-existing conditions, nicotine exposure, anti-hypertensive medication administration, fetal malformations, antenatal corticosteroid administration, presence of dyspnea, sore throat, fever, fatigue, duration of symptom phase, existence of COVID-19-related pneumonia, need for maternal oxygen administration, disease-related inpatient treatment, and lab markers including sFLT-1/PlGF ratio, platelet count, and LDH.

Conclusions

We present the first COVID-19 prognostication pipeline specifically for pregnant patients while utilizing a large SARS-CoV-2 in pregnancy data repository. Our model accurately identifies those at risk of severe illness or clinical deterioration, presenting a promising tool for advancing personalized medicine in pregnant patients with COVID-19.

Peer Review reports

Introduction

As of March 2023, Coronavirus Disease of 2019 (COVID-19) from Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infection has resultedin over 676,000,000 confirmed cases and 6,800,000 confirmed deaths worldwide [1]. Pregnant individuals are more likely to become severely ill compared to non-pregnant people [2,3,4]. Infection with SARS-CoV-2 in pregnancy can result in acute respiratory distress syndrome (ARDS), multiple organ failure, or maternal or fetal death [5]. Pregnant individuals are particularly vulnerable to respiratory viruses due to the physiologic and immunologic changes associated with pregnancy and higher oxygen demands related to fetal-placental circulation [5]. During the SARS-epidemic in 2003, approximately 50% of pregnant patients with SARS-CoV were admitted to an intensive care unit (ICU), 33% required mechanical ventilation, and 25% experienced fatal outcomes [6]. Comorbidities such as obesity, hypertension, pre-existing respiratory illness, and type 2 diabetes are known major risk factors for severe complications of COVID-19 during pregnancy [7, 8]. Other probable predictors in the general population include abnormal laboratory markers, radiological lung features, and respiratory parameters [7]. Despite these known risk factors, it remains challenging to predict the clinical course of pregnant patients with SARS-CoV-2, which can range from asymptomatic or mild disease to the most severe of outcomes [9, 10]. A reliable method for identifying individuals at-risk for detrimental severe outcomes would greatly benefit this vulnerable population by allowing clinicians to begin medical intervention early on in the disease course. As the number of global cases rises, access to large data repositories of COVID-19 cases in pregnant people has provided new opportunities for the development of Artificial Intelligence (AI) models.

AI algorithms use a variety of approaches to imitate human intelligence for automatic decision-making. Since the discovery of SARS-CoV-2, AI and in particular its subfield of machine learning (ML), has been used successfully to support clinical decision-making in COVID-19 cases [11,12,13]. In particular, deep learning methods, a subtype of ML, have found success in applying neural network architectures which vaguely mimic the brain by propagating information through a series of nodes which manipulate the information into useful forms [14]. The propagation process is systematically adjusted through a “training” stage where parameters of the network are optimized to improve performance of the network [14]. Using AI and a comprehensive SARS-CoV-2 in pregnancy data repository, we have developed a robust COVID-19 prognostication algorithm for pregnant patients with COVID-19. The purpose of this algorithm is to conduct higher-level risk-stratification upon obtaining a positive SARS-CoV-2 test such that individual maternal and fetal factors can be considered in clinical decisions for more efficient patient treatment pipelines. Our prognostication algorithm allows for effective triaging of this vulnerable population in resource-strained healthcare systems by determining whether a pregnant person is at risk of severe outcomes. Identifying low-risk individuals could facilitate outpatient management and virtual antenatal care without the need for in-person assessments and laboratory tests, thus decreasing the time and financial load on healthcare systems.

Methods

Study design

We developed and evaluated a novel prognosis-aid for early risk assessment of pregnant patients with SARS-CoV-2 based on retrospective international data from three countries (Canada, Austria, and Germany). The primary objective of our algorithm was to provide high-accuracy prediction of an adverse outcome for pregnant patients. The model was developed through a series of steps including preprocessing, feature selection, and algorithm creation. Multiple machine learning models were trained and compared to find the optimal model. This study was conducted in accordance with the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) Statement [15].

Study population and dataset

Two independent datasets of patients who tested positive for SARS-CoV-2 during pregnancy were used to create a single international data repository of 1501 pregnant patients: (a) The COVID-19 Related Obstetric and Neonatal Outcome Study (CRONOS) dataset from Germany and Austria established by the German Society of Perinatal Medicine (DGPM) consisting of 1402 patients and (b) The Mount Sinai Hospital (MSH) dataset from Canada consisting of 99 patients. The repository comprised multiple common variables, including demographics, comorbidities, symptoms, respiratory parameters, radiological results, and laboratory markers. These data points were documented either at the time of a positive SARS-CoV-2 test or at a time most recent to the positive test. To create a consistent dataset usable for ML, only features available at the time of COVID-19 testing were used to make predictions, and other values available from later points in the pregnancy were not used. Laboratory markers and symptoms would reflect the patients’ immediate health at the time of measurement.

Figure 1 describes the time at which the predictors (selected features) and outcomes occurred. Some predictors occur at a single time point in pregnancy, while others could change during pregnancy. Many predictors and outcomes could arise throughout the entire course of the pregnancy, making temporality an important concern. We will only use predictors available at the time of a positive test to study outcomes that haven’t occurred yet. For example, features in the category ‘patient characteristics’ are determined early in the pregnancy and are not modified during pregnancy (BMI, gravidity, parity, pre-existing comorbidities). These would be applicable to any stage of the pregnancy. Those included under ‘binary outcomes’ (Y/N), would be considered Yes if they occurred at any time prior to the positive test (smoking exposure, use of antihypertensives, diagnosis of fetal malformations on a mid-trimester ultrasound scan, administration of antenatal corticosteroids). The ‘values’ used for symptoms, signs, laboratory tests, and assessment of fetal wellbeing would be those recorded at the time of the positive test, disregarding any prior record of events. Finally, outcomes that will be predicted will only include events that were relevant to the SARS-CoV-2 infection. A study by Gold et al [16] demonstrated their proposed algorithms both for how patient symptoms collected at the time of COVID-19 diagnosis can be used to determine whether pregnant patients require inpatient management, and how clinical signs, laboratory markers, and radiological findings can help identify at-risk patients. Data collection methods such as these would be well-suited for the application of a pregnant patient prognostication model.

Fig. 1
figure 1

Timeline of predictors (features) and adverse pregnancy outcomes

Ethics

Ethics approval was obtained from Ryerson University (REB 2020–310, Aug 13, 2020) and MSH (20–0160-C, July 16, 2020) Research Ethics Boards. Data-use and collaboration agreements were established with CRONOS and MSH to facilitate the transfer of de-identified patient data to Ryerson University to build the repository. Ethics approval for the CRONOS dataset was obtained by the DGPM from the University Hospital of Schleswig–Holstein Campus Kiel (reference number: D 451/20, amended Feb 15, 2021, for this collaboration). The CRONOS Consortium holds individual ethics approvals from committees in each federal state and/or University Hospital for all collaborators within Germany and Austria. All methods described in this study were carried out in accordance with the.guidelines and regulations of all relevant research ethics boards. Informed consent from patients was notrequired by ethics boards due to data de-identification.

Outcomes and definitions

Our primary outcome was an ‘adverse pregnancy event’, which included: (a) maternal-related outcomes, such as admission to ICU, need for mechanical ventilation, critical illness, or death, (b) delivery-related features, such as pregnancy termination, iatrogenic birth, surgical delivery, and stillbirth (all as a result of SARS-CoV-2 infection), and (c) neonatal-related variables, such as estimated fetal birth weight < 10th centile, APGAR score at 5 min < 7, admission to the neonatal intensive care unit (NICU) due to COVID-19 (admissions to the NICU for shorter amounts of time without respiratory support while awaiting test results or for a brief period of observation were not included), and length of NICU stay greater than 24 h where there was also administration of breathing support for the neonate. Table 1 shows the variables and respective criteria used in defining adverse outcomes.

Table 1 Adverse pregnancy outcome composition

Algorithm creation

We used a binary classification task to predict the risk of adverse outcomes in SARS-CoV-2-positive pregnant patients and their neonates. The algorithm defined the possibility of an adverse outcome of a patient as either low risk or high risk. Figure 2 shows the overall architecture of our framework. Model creation consisted of three main components: preprocessing, feature selection, and model development. Full details of the algorithm creation are presented in Additional file 1: Appendix A. The following outlines the steps employed for the overall creation of the model:

  1. 1.

    Preprocessing

    The facilitation of the train-test process by appropriately cleaning and preparing the entire dataset. This was done by analyzing the data for discrepancies, handling missing values, scaling and balancing, and “cleaning” the data. Data cleaning primarily involved the handling of invalid values, categorizing continuous variables, and creating new features by aggregating multiple features representative of one relevant attribute.

  2. 2.

    Feature engineering

    The creation of new features based on knowledge of the relevant biological markers, in other words, creating composite features based on extra features that are pertinent to a single more general feature. For example, pre-existing conditions were provided as individual, binary features. These conditions were used to engineer a new feature to generate the “existence of pre-existing diseases” feature.

  3. 3.

    Feature selection

    The refinement of the larger feature set to generate a smaller more relevant feature subset. The most relevant features were selected based on their importance to the outcome, and achieved the greatest performance as determined through an iterative elimination process called recursive feature elimination.

  4. 4.

    Creation of a balanced dataset through sampling

    Addressing the imbalanced dataset by looking at class distributions of low-risk and high-risk patients in the dataset and randomly eliminating samples from the larger class. As a result, the training dataset becomes more balanced and thus allows the trained model to make more accurate, unbiased decisions.

  5. 5.

    Machine learning model creation

    A series of machine learning models were trained and tested using the training data from the feature selection step. These models were all evaluated separately in order to identify the best performing algorithm to be implemented in the final system.

Fig. 2
figure 2

PROTECT framework

The more complicated steps including management of missing data, feature selection, and model development are elaborated upon in the following sections. For information regarding definitions and descriptions for machine learning-related terms, refer to Table S1 of Additional file 1: Appendix A.

Missing data

Repository data were inspected for invalid data points using the reference attribute ranges according to metadata and domain knowledge, and any invalid observations were excluded. An observation was considered invalid if all attributes for that observation had invalid values (e.g., null, 0, blank, or negatives). The data were then explored for potential discrepancies, including inconsistencies in data representation and data type, and these were manually corrected. After that exploration, the null condition was handled based on possible reasons for the missing values (e.g., in the case of the fetus, missing values for neonate and delivery attributes were filled with a new category code as “Not Applicable”).

Feature selection

To identify whether an attribute impacted the endpoints considered in the feature selection phase, we fitted a random forest model with all 71 clinical attributes of the training set and calculated the model's performance using the stratified tenfold cross-validation and area under the curve (AUC) scoring. An exhaustive list of these clinical attributes prior to feature selection can be found in Additional file 1: Appendix A. Cross-validation was undergone to reduce model over-fitting. Variables were then sequentially eliminated using the wrapper-based backward elimination feature selection method (outlined in Additional file 1: Appendix A), and the model was trained on the remaining variables followed by performance calculation. The process was repeated until the most important attributes and the optimal number of features were identified based on the model’s performance. As shown in Fig. 3, a set of 25 features were selected as the optimal features to train the models. Table 2 shows the most important features to maximize the separability of classes. These features included maternal body mass index (BMI), smoking exposure, pre-existing maternal illness, duration of maternal COVID-19 symptoms (for example, dyspnea, sore throat, fever, fatigue), COVID-19 associated pneumonia, parity, gravidity, laboratory tests (for example, soluble fms-like tyrosine kinase 1 (sFLT-1) and placental growth factor (PlGF), thrombocytopenia, lactate dehydrogenase (LDH), aspartate transaminase (AST), alanine transaminase (ALT) levels), administration of antihypertensive medication, in-patient treatment due to COVID-19, maternal oxygen administration, abnormal amniotic fluid levels (assessed through fetal imaging), signs of fetal distress, presence of fetal respiratory distress syndrome (RDS) and fetal malformations, as outlined in Table 2. Continuous variables were converted to categorical ones based on low, normal, and high ranges during pregnancy.

Fig. 3
figure 3

Number of features used for training versus AUC-ROC score for determining the optimal number of features using the Backward elimination approach

Table 2 Selected features through the feature-selection phase

Model development

The CRONOS and MSH datasets were used to train, validate, and test the designed framework. The models which were trained and evaluated included Support Vector Machine, k-Nearest Neighbor, Decision Tree, Random Forest, XGBoost, and AdaBoost. Both datasets were used for training to make the model more generalizable to demographics in various geographical regions. Additionally, by training on an aggregated dataset with patients from different distributions, we encourage our model to be generalizable. Furthermore, training on one dataset and validating/testing on another is an experimental setup primarily designed to evaluate AI methods specifically aimed at adapting to distribution shifts between datasets. As that task is not the focus of this paper, this approach was not used. The preprocessing step was applied to the data, after which, 262 features from the CRONOS repository and 214 features from the MSH repository were reduced to 71 features and 1501 cases. No patient records were excluded secondary to missing data. As mentioned in Additional file 1: Appendix 1, features included in the feature selection process only involved those that existed in both CRONOS and MSH repositories. These datasets included features associated with both maternal and neonatal health, thereby distinguishing the model from those created to predict COVID-19-related outcomes in the general population. Next, the Near-Miss under-sampling algorithm was used to class-balance the dataset, which resulted in selecting 562 pregnant patients with even distribution among both high and low-risk classes. Finally, the 25-feature subset obtained from the previously described feature selection phase was used to train and evaluate the six ML models. 80% of this data (562 instances with 25 features) were randomly assigned to a training and validation set and the remaining 20% was split into a test set (containing 60 high-risk patients and 52 low-risk patients).

Results

Study population: training and test cohorts

A total of 1501 patients were included in the aggregate data repository, with a mean (SD) age of 30.8 (5.6) years. Table 3 outlines the characteristics of the study population, separated by the existence of an adverse outcome.

Table 3 Repository data analysis for patients with adverse outcomes vs patients no adverse outcomes

Model performance

As shown in Fig. 4, the ensemble Random Forest model resulted in the best overall accuracy on the test set with a score of 0.90 (95% CI 0.83 to 0.95) in predicting adverse outcomes. The test set contained 60 patients with a high risk of developing adverse outcomes and 53 patients with a low risk of developing adverse outcomes. The Random Forest model identified 51/60 (83.3%) of the high-risk patients and 50/53 (92.5%) of the low-risk patients, with an accuracy of 89.0%, a recall score of 0.85, a precision of 0.94, and an F1 score of 0.89. The performance metrics for each model are summarized in Table 4, which shows that the Random Forest model had the highest performance based on AUC, followed by XGBoost, KNN, Decision Tree, AdaBoost, and SVM. Confusion matrices for each of the developed ML models are shown in Fig. 5. Each confusion matrix in this figure demonstrates the number of correctly and incorrectly predicted adverse outcome predictions (bottom right and bottom left, respectively), and correctly and incorrectly-predicted no-adverse outcome predictions (top left and top right, respectively) that were made by each tested ML model.

Fig. 4
figure 4

Area under ROC curve (AUC-ROC) plot for Support Vector Machine (SVM), k-Nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF), XGBoost, and AdaBoost models

Table 4 Model performance
Fig. 5
figure 5

Confusion matrices of developed ML models. For each matrix, the top left box represents true negatives, top right false negatives, bottom left false positives, and bottom right true positives

Conclusions

In order to improve prediction accuracy of adverse outcomes related to SARS-CoV-2 infection in pregnancy, we created an international data repository that consisted of 1501 patients from three countries. These data were then used to train and test a ML algorithm that can predict adverse maternal, fetal, and gestational outcomes with an accuracy of 89%.

While several studies have developed ML-based prognostic models for SARS-CoV-2 infection in the general population, our study is the first to present such an algorithm specifically for pregnant patients while utilizing a large SARS-CoV-2 in pregnancy data repository. This high-performance model specific to pregnancy provides a tool for clinicians to better identify at-risk pregnant patients with SARS-CoV-2 infection, allowing early hospital admission, enhanced monitoring, and proactive intervention. This algorithm has high clinical relevance as a SARS-CoV-2 infection poses significant risk to a small proportion of pregnant people, including an increased risk for critical disease requiring ventilation support as well as other maternal and fetal complications [17]. A 2020 meta-analysis including 192 international studies on the clinical manifestations, risk factors, and maternal perinatal outcomes of COVID-19 in pregnancy [2] found that the most common symptoms for pregnant or recently-pregnant women with suspected or confirmed COVID-19 were fever (40%) and cough (41%). Raised white blood cell count (26%), lymphopaenia (33%), and raised C reactive protein levels (49%) were the most common laboratory findings. They also found that pregnant women were more likely to be asymptomatic than non-pregnant women of reproductive age. Furthermore, they found that increased maternal age, high BMI, non-white ethnicity, and preexisting comorbidities were all associated with severe disease. Pregnant women with COVID-19 were found to be at increased risk of death, admission to ICU, delivering preterm, and having their neonates admitted to NICU. The determined factors related to adverse outcomes from this meta-analysis draw parallels with many of the features determined through feature selection and used to train our model. Differences include the inclusion of ethnicity as a factor to determine adverse outcome, and specific laboratory markers, such as C reactive protein and lymphopaenia. More recent studies assessing maternal vulnerability as a result of SARS-CoV-2 infection during pregnancy have consistently determined pregnant individuals to be at higher risk of hospitalization [18]. Additionally, a 2021 multinational INTERCOVID study [19] which assessed the relationship between maternal factors such as gestational diabetes mellitus, diabetes mellitus, and maternal obesity related to COVID-19 diagnosis. Interestingly, it was found that these conditions were associated with higher risk of COVID-19 disease severity, both of which were included within the features selected to train our prognostication model.

Our algorithm significantly improves upon previously published AI techniques for COVID-19 prognosis. For example, Schöning et al [20] created a clinical score and ML model that predicted the likelihood of severe disease courses from SARS-CoV-2, where a severe outcome was defined as admission to ICU and/or death. The measured metrics in the validation cohort included an AUC of 0.85, positive predictive value (PPV) of 0.91, and negative predictive value (NPV) of 0.81. However, this study was limited by poor censoring of outcomes (given there was no explicit follow-up), a small sample size (n = 657) and no laboratory data available for included subjects. Similarly, Rechtman et al [21] developed an XGBoost model for the prediction of COVID-19 mortality using 8770 confirmed SARS-CoV-2 cases, including demographic, clinical, and comorbidity variables taken from all Mount Sinai Health System facilities in New York, USA. Cross-validation results yielded an AUC of 0.86 when combining effective predictors. However, this study was limited by poor generalizability due to the absence of widespread testing at the time of algorithm development, with only severe cases of COVID-19 included in the model. Moreover, there was limited access to ICU admission and oxygen support information, thereby limiting the number of patients with severe outcomes that were included in the model. Our algorithm leverages a large, varied dataset with detailed follow-up results and obtains competitive performance or outperforms the existing systems. By using a Random Forest model, a type of Ensemble Learning approach, we were able to overcome the challenges of high disease course variability to accomplish high COVID-19 prognostication accuracy [22]. This problem of generalizability is common in studies attempting to create mortality prediction algorithms [21, 23, 24] and is often caused by data sparsity, leading to a lack of validation from external healthcare systems [25]. The use of Random Forest for the wrapper-based feature selection technique allows for better model building with a large number of clinical features, however also serves as a limitation due to the fact that using the same algorithm at both feature selection and learning stages can inflate performance results [26]. Although studies may find promising performance results in the training phase using COVID-19 repositories, this does not guarantee the generalizability of the algorithm. As such, including diverse care settings or geographic regions can considerably improve the generalizability of the algorithm [27]. Therefore, our study set out to retrieve data from separate sources of different geographic locations to allow for evaluation on more diverse representative data. Of these sources, the CRONOS repository contained data from centers throughout Germany and Austria, thereby furthering the geographical variability of our repository. The large size and diversity of our international repository allowed for the development of a trained ensemble algorithm capable of providing accurate outcomes using a combination of data input parameters.

Nevertheless, this study has several limitations. Firstly, the repository included patient data only for individuals who either sought medical care because of SARS-CoV-2 infection or were found positive on testing prior to giving birth. Our data may not be a real reflection of the general population of pregnant women affected by COVID-19, since health and lifestyle factors may differ in demographics without access to medical care. Canada, Germany, and Austria are predominantly white countries, which may result in the presence of racial bias in the model [28] Therefore, we do not recommend widespread use of this algorithm before future advancements allow for it to be validated in diverse populations. Future advancements of this model aim to diversify and expand the patient demographics in newly obtained data, including racial diversification. Next, as shown in Table 3, the percentage of participants with adverse outcomes differed between the two data sources. There could be several reasons for differences in outcomes between the two datasets. First, CRONOS is a large registry with a broad range of patients from Germany and Austria. Mount Sinai Hospital is a tertiary level hospital in Toronto, Canada which serves as a referral centre for the Greater Toronto Area. The patient population here is a higher-risk population than in the CRONOS registry. The protocols for each registry differ greatly, as CRONOS is a prospective database from over 100 sites, including those admitted and those not requiring admission. MSH, on the other hand, is a list of very sick people admitted to the hospital, not necessarily including everyone who screened positive. CRONOS was a universal screening, whereas MSH was a symptom-based screening and, therefore, was not a comprehensive list of everyone that may have tested positive. By including these vastly different data sources, we were able to obtain a good balance between “population-based” and “hospital-based” screening, thus providing a rich dataset of cases with varying severity to inform the model. As such, our algorithm is based on data obtained from both a high and a low-risk population. Furthermore, one of the adverse outcome features used was admission of the neonate to the NICU due to maternal SARS-CoV-2 infection. However, the reasoning for this admission may vary across different hospitals and health centers, depending on local regulations. For example, NICU admission may be cautionary only at some centres, while in others it may be due to a medical need. Such variable definition limits the accuracy of the model, which could potentially incorrectly-label patients with adverse outcomes, depending on one’s interpretation. Similarly, another variable defining adverse outcome is described as the patient is “extremely critically ill”. This variable is also subject to inter-subject variability and differing interpretations, as there is no objective or quantifiable threshold to help define it. It should also be noted that, as with most machine learning models, the exclusion of certain feature inputs when calculating a patient’s risk may result in a deterioration of the model’s performance. Therefore, collection of all input features (as outlined in the Study and Population section) should be undertaken to ensure maximal model performance. The features we utilise with this model are routine in obstetrics and, thus, should be readily available in most non-emergency cases, making our algorithm usable in a typical pregnancy. Our algorithm is not intended to be used in emergency cases, where these features are most likely to be missing, as the emergency will merit immediate medical intervention regardless of the predicted future risk.

The absence of vaccination status as an input feature to the model serves as an additional limitation. mRNA vaccination has shown to be effective and safe for reducing the severity of COVID-19 in pregnancy [29]. Recent observational studies have evaluated COVID-19 vaccine effectiveness in pregnant patients during the first 6 months of omicron as the variant of concern [30]. Administration of complete or boosted vaccines were found to reduce the risk for severe symptoms, complications, and death [30]. Another 2023 study determined vaccination to be highly effective against delta variants when infected during gestation, and moderately effective against omicron [31]. This being said, the inclusion of vaccination status as a feature for model training using more recent patient data would likely be indispensable for the model, and serve as an important factor in prognostication. Unfortunately, the data collected for the training of this model was done so prior to the availability of any vaccines. Additional data were provided for the CRONOS repository later in the course of this project, which included 1013 patients with known vaccination status. This vaccine variable contained ambiguity as to whether the patients received one or multiple doses. After this additional data were preprocessed, the vaccination feature was deemed unnecessary by the feature selection algorithm and thus was eliminated. The ambiguity of the feature may attest to its elimination, and provision of a more concise feature (indicating number of doses received) may be more impactful for our future model advancements. Therefore, the inclusion of vaccination status as an input feature for this prognostication model does not necessarily allow for a higher-accuracy model. However, future studies will attempt to advance this model by expanding the data repository to include populations from broader global regions. The expanded data repository will also aim at acquiring vaccination status as a multi-class variable reflecting the number of doses received. With this improved dataset, the vaccination feature may have a greater impact on adverse outcome prediction. While we attempted to increase the heterogeneity of our dataset by including patient data from three different countries, we hope to train our algorithm with data from additional countries to further broaden the generalizability of our algorithm. Ultimately, we plan to develop an accessible user interface platform for healthcare providers and patients to input relevant pregnancy parameters that will provide a meaningful prediction of risk.

In conclusion, we demonstrated that clinical data can effectively predict disease severity in pregnant patients with SARS-CoV-2 infection. The intention is to use the algorithm at the point of a first positive test. However, knowing that risk is dynamic, the algorithm should be used each time the clinical condition changes, for e.g., if there are worsening respiratory symptoms or a new pregnancy-related clinical finding. Implementation of this platform and continued validation through international use could significantly improve the ability of healthcare providers to identify, manage, and treat the disease in pregnant people worldwide.

Availability of data and materials

The data repository generated and used in this study is not publicly available due to the collaborating hospitals’ restrictions on data sharing. However, a summary of the CRONOS data is regularly updated and is available here: https://www.dgpm-online.org/gesellschaft/covid-19/. Additionally, the trained algorithm is available on Github:https://github.com/MFI-Lab/PROTECT. For further queries, please contact the corresponding author.

Abbreviations

COVID-19:

Coronavirus disease of 2019

SARS-CoV-2:

Severe acute respiratory syndrome coronavirus 2

ARDS:

Acute respiratory distress syndrome

ICU:

Intensive care unit

AI:

Artificial intelligence

ML:

Machine learning

TRIPOD:

Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis

CRONOS:

COVID-19 related obstetric and neonatal outcome study

DGPM:

German society of perinatal medicine

MSH:

Mount Sinai hospital

NICU:

Neonatal intensive care unit

AUC:

Area under the curve

BMI:

Body mass index

sFLT-1:

Soluble fms-like tyrosine kinase 1

PlGF:

Placental growth factor

LDH:

Lactate dehydrogenase

AST:

Aspartate transaminase

ALT:

Alanine transaminase

RDS:

Respiratory distress syndrome

SD:

Standard deviation

KNN:

K-Nearest neighbour

RF:

Random forest

DT:

Decision tree

COVID-19:

Support vector machine

PPV:

Positive predictive value

NPV:

Negative predictive value

References

  1. COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU). Johns Hopkins Coronavirus Resource Center. Cited 2023 Apr7. Available from: https://coronavirus.jhu.edu/map.html

  2. Allotey J, Stallings E, Bonet M, Yap M, Chatterjee S, Kew T, et al. Clinical manifestations, risk factors, and maternal and perinatal outcomes of Coronavirus Disease 2019 in pregnancy: Living Systematic Review and meta-analysis. Obstet Anesth Dig. 2021;41(2):81–2.

    Article  Google Scholar 

  3. Galang RR, Newton SM, Woodworth KR, Griffin I, Oduyebo T, Sancken CL, et al. Risk factors for illness severity among pregnant women with confirmed SARS-COV-2 infection – surveillance for emerging threats to mothers and babies network, 20 state, local, and territorial health departments, March 29, 2020 -January 8, 2021. Clinical Infectious Diseases. 2021.

  4. Ko JY, DeSisto CL, Simeone RM, Ellington S, Galang RR, Oduyebo T, et al. Adverse pregnancy outcomes, maternal complications, and severe illness among us delivery hospitalizations with and without a coronavirus disease 2019 (COVID-19) diagnosis. Clin Infect Dis. 2021;73(Supplement_1):24–31.

  5. Kumar R, Yeni CM, Utami NA, Masand R, Asrani RK, Patel SK, et al. SARS-COV-2 infection during pregnancy and pregnancy-related conditions: concerns, challenges, management and mitigation strategies–a narrative review. J Infect Public Health. 2021;14(7):863–75.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Schwartz DA, Graham AL. Potential maternal and infant outcomes from coronavirus 2019-ncov (SARS-COV-2) infecting pregnant women: Lessons from SARS, Mers, and other human coronavirus infections. Viruses. 2020;12(2):194.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Li J, Gong X, Wang Z, Chen R, Li T, Zeng D, et al. Clinical features of familial clustering in patients infected with 2019 novel coronavirus in Wuhan. China Virus Res. 2020;286: 198043.

    Article  CAS  PubMed  Google Scholar 

  8. Guan W-jie, Liang W-hua, Zhao Y, Liang H-rui, Chen Z-sheng, Li Y-min, et al. Comorbidity and its impact on 1590 patients with covid-19 in China: a nationwide analysis. Eur Respir J. 2020;55(5):2000547.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Etti M, Sekikubo M, Nankabirwa V, Sommerfelt H, Freyne B, Kawaza K, et al. SARS-COV-2 infection in pregnant women and their newborns. Annals of Global Health. 2020;86(1).

  10. Moore KM, Suthar MS. Comprehensive analysis of covid-19 during pregnancy. Biochem Biophys Res Commun. 2021;538:180–6.

    Article  CAS  PubMed  Google Scholar 

  11. Tayarani NM-H. Applications of artificial intelligence in battling against covid-19: a literature review. Chaos Solitons Fractals. 2021;142: 110338.

    Article  Google Scholar 

  12. Alballa N, Al-Turaiki I. Machine learning approaches in covid-19 diagnosis, mortality, and severity risk prediction: A Review. Inform Med Unlocked. 2021;24: 100564.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Verma AA, Murray J, Greiner R, Cohen JP, Shojania KG, Ghassemi M, et al. Implementing machine learning in medicine. Can Med Assoc J. 2021;193(34):E1351–7.

  14. Zhu JS, Ge P, Jiang C, Zhang Y, Li X, Zhao Z, et al. Deep-learning artificial intelligence analysis of clinical variables predicts mortality in COVID-19 patients. J Am Coll Emerg Phys Open. 2020;1(6):1364–73.

    Google Scholar 

  15. Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (tripod): explanation and elaboration. Ann Intern Med. 2015;162(1):W1–73.

  16. Gold S, Clarfield L, Johnstone J, Diambomba Y, Shah PS, Whittle W, et al. Adapting Obstetric and neonatal services during the COVID-19 pandemic: a scoping review. BMC Pregnancy Childbirth. 2022;22(1):119.

  17. Hammad WA, Al Beloushi M, Ahmed B, Konje JC. Severe acute respiratory syndrome (SARS) coronavirus-2 infection (COVID-19) in pregnancy – an overview. Eur J Obstet Gynecol Reprod Biol. 2021;263:106–16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. McClymont E, Albert AY, Alton GD, Boucoiran I, Castillo E, Fell DB, et al. Association of SARS-COV-2 infection during pregnancy with maternal and perinatal outcomes. JAMA. 2022;327(20):1983.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Eskenazi B, Rauch S, Iurlaro E, Gunier RB, Rego A, Gravett MG, et al. Diabetes mellitus, maternal adiposity, and insulin-dependent gestational diabetes are associated with covid-19 in pregnancy: the intercovid study. Am J Obstet Gynecol. 2022;227(1):e1–74.

  20. Schöning V, Liakoni E, Baumgartner C, Exadaktylos AK, Hautz WE, Atkinson A, et al. Development and validation of a prognostic COVID-19 severity assessment (COSA) score and machine learning models for patient triage at a tertiary hospital. J Transl Med. 2021;19(1):56.

  21. Rechtman E, Curtin P, Navarro E, Nirenberg S, Horton MK. Vital signs assessed in initial clinical encounters predict COVID-19 mortality in an NYC hospital system. Sci Rep. 2020;10(1):21545.

  22. Aljame M, Ahmad I, Imtiaz A, Mohammed A. Ensemble learning model for diagnosing COVID-19 from routine blood tests. Inform Med Unlocked. 2020;21: 100449.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Wang K, Zuo P, Liu Y, Zhang M, Zhao X, Xie S, et al. Clinical and laboratory predictors of in-hospital mortality in patients with coronavirus disease-2019: A cohort study in Wuhan. China Clin Infect Dis. 2020;71(16):2079–88.

    Article  CAS  PubMed  Google Scholar 

  24. Booth AL, Abels E, McCaffrey P. Development of a prognostic model for mortality in COVID-19 infection using machine learning. Mod Pathol. 2020;34(3):522–31.

    Article  PubMed  Google Scholar 

  25. Rose S. Machine learning for prediction in electronic health data. JAMA Network Open. 2018;1(4):e181404.

  26. John GH, Kohavi R, Pfleger K. Irrelevant features and the subset selection problem. Machine LearnProc. 1994;1994:121–9.

    Google Scholar 

  27. Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med. 2019;380(14):1347–58.

    Article  PubMed  Google Scholar 

  28. Richardson A. Biased data lead to biased algorithms. Can Med Assoc J. 2022;194(9):E341.

  29. Leik NK, Ahmedy F, Guad RM, Baharuddin DM. Covid-19 vaccine and its consequences in pregnancy: Brief review. Ann Med Surg. 2021;72: 103103.

    Article  Google Scholar 

  30. Villar J, Soto Conti CP, Gunier RB, Ariff S, Craik R, Cavoretto PI, et al. Pregnancy outcomes and vaccine effectiveness during the period of Omicron as the variant of Concern, Intercovid-2022: a multinational observational study. Lancet. 2023;401(10375):447–57.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Jorgensen SC, Hernandez A, Fell DB, Austin PC, D’Souza R, Guttmann A, et al. Maternal mrna covid-19 vaccination during pregnancy and delta or omicron infection or hospital admission in infants: test negative design study. BMJ. 2023;380(1):e074035.

  32. Lin JH, Haug PJ. Data preparation framework for preprocessing clinical data in data mining. InAMIA annual symposium proceedings 2006 (Vol. 2006, p. 489). American Medical Informatics Association.

  33. Tang S, Davarmanesh P, Song Y, Koutra D, Sjoding MW, Wiens J. Democratizing Ehr Analyses with Fiddle: A flexible data-driven preprocessing pipeline for Structured Clinical Data. J Am Med Inform Assoc. 2020;27(12):1921–34.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Ang Q, Liu Z, Wang W, Li K. Explored research on data preprocessing and mining technology for Clinical Data Applications. 2010 2nd IEEE International Conference on Information Management and Engineering. 2010;

  35. Abd Elrahman SM, Abraham A. A review of class imbalance problem. J Network Innov Comput. 2013;2013(1):332–40.

    Google Scholar 

  36. Xu-Ying Liu, Jianxin Wu, Zhi-Hua Zhou. xploratory undersampling for class-imbalance learning. IEEE Transact Systems Man Cybernetics Part B (Cybernetics). 2009;39(2):539–50.

    Article  Google Scholar 

  37. Mani I, Zhang I. kNN approach to unbalanced data distributions: a case study involving information extraction. InProceedings of workshop on learning from imbalanced datasets 2003 Aug 21 (Vol. 126). United States: ICML.

  38. Guyon I, Weston J, Barnhill S, Vapnik V. Mach Learn. 2002;46(1/3):389–422.

    Article  Google Scholar 

  39. Biau G, Scornet E. A Random Forest Guided Tour. TEST. 2016;25(2):197–227.

    Article  Google Scholar 

  40. Vapnik V, Golowich SE, Smola A. Support vector method for function approximation, regression estimation, and signal processing. Adv Neural Inf Process Syst. 1997;2:281–7.

    Google Scholar 

  41. Zhang Z. Introduction to machine learning: K-Nearest Neighbors. Ann Transl Med. 2016;4(11):218.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Safavian SR, Landgrebe D. A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern. 1991;21(3):660–74.

    Article  Google Scholar 

  43. Breiman L. Mach Learn. 2001;45(1):5–32.

    Article  Google Scholar 

  44. Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H. Xgboost: extreme gradient boosting. R package version 0.4–2. 2015;1(4):1–4.

  45. Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016. p. 785–94.

    Chapter  Google Scholar 

  46. Friedman J, Hastie T, Tibshirani R. Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). Ann Stat. 2000;28(2):337–407.

Download references

Acknowledgements

Computations were performed on the SOSCIP Consortium’s Cloud Analytics and GPU-Accelerated computing platforms. SOSCIP is funded by FedDev Ontario, IBM Canada Ltd. and Ontario academic member institutions.

The authors would like to thank Jillian Cardinell for her assistance with proofreading and editing the manuscript.

Funding

This study received funding from the Natural Sciences and Engineering Research Council (NSERC)—Alliance COVID-19 grant (Sussman).

Author information

Authors and Affiliations

Authors

Contributions

DY contributed to data analysis, data cleaning, model development, and manuscript writing; BH contributed to data cleaning, model development, and manuscript writing; CT contributed to project administration and clinical feature analysis; AK contributed to manuscript revision; AP and JD contributed to data curation; WW, MS, SH, contributed to the conceptualization of this study and data acquisition; LG, MR, and PO contributed to data acquisition; JR, RD, RK, UP, DS, and JS contributed to data acquisition and critical manuscript revision; DS was the corresponding author and contributed to study conceptualization, funding acquisition, project administration, and supervision. All authors agree with the final version and agree to be accountable to the integrity of the data published.

Corresponding author

Correspondence to Dafna Sussman.

Ethics declarations

Ethics approval and consent to participate

Ethics approval was obtained from Toronto Metropolitan University Research Ethics Board, Mount Sinai Hospital Research Ethics Boards, and DGPM Research Ethics Board. All methods were carried out in accordance with relevant guidelines and regulations set by the respective institutions and ethics boards. The need for Informed Consent was waived by the Toronto Metropolitan University Research Ethics Board, Mount Sinai Hospital Research Ethics Boards, and DGPM Research Ethics Board.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Appendix A. (Methods). Table S1. Machine learning term definitions. Table S2. Full feature list prior to selection of feature subset using gridsearch [32,33,34,35,36,37,38,39,40,41,42,43,44,45,46].

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Young, D., Houshmand, B., Tan, C.C. et al. Predicting adverse outcomes in pregnant patients positive for SARS-CoV-2: a machine learning approach- a retrospective cohort study. BMC Pregnancy Childbirth 23, 553 (2023). https://doi.org/10.1186/s12884-023-05679-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12884-023-05679-2

Keywords