Skip to main content

Automated prediction of early spontaneous miscarriage based on the analyzing ultrasonographic gestational sac imaging by the convolutional neural network: a case-control and cohort study



It is challenging to predict the outcome of the pregnancy when fetal heart activity is detected in early pregnancy. However, an accurate prediction is of importance for obstetricians as it helps to provide appropriate consultancy and determine the frequency of ultrasound examinations. The purpose of this study was to investigate the role of the convolutional neural network (CNN) in the prediction of spontaneous miscarriage risk through the analysis of early ultrasound gestational sac images.


A total of 2196 ultrasound images from 1098 women with early singleton pregnancies of gestational age between 6 and 8 weeks were used for training a CNN for the prediction of the miscarriage in the retrospective study. The patients who had positive fetal cardiac activity on their first ultrasound but then experienced a miscarriage were enrolled. The control group was randomly selected in the same database from the fetuses confirmed to be normal during follow-up. Diagnostic performance of the algorithm was validated and tested in two separate test sets of 136 patients with 272 images, respectively. Performance in prediction of the miscarriage was compared between the CNN and the manual measurement of ultrasound characteristics in the prospective study.


The accuracy of the predictive model was 80.32% and 78.1% in the retrospective and prospective study, respectively. The area under the receiver operating characteristic curve (AUC) for classification was 0.857 (95% confidence interval [CI], 0.793–0.922) in the retrospective study and 0.885 (95%CI, 0.846–0.925) in the prospective study, respectively. Correspondingly, the predictive power of the CNN was higher compared with manual ultrasound characteristics, for which the AUCs of the crown-rump length combined with fetal heart rate was 0.687 (95%CI, 0.587–0.775).


The CNN model showed high accuracy for predicting miscarriage through the analysis of early pregnancy ultrasound images and achieved better performance than that of manual measurement.

Peer Review reports


Miscarriage is the most common early pregnancy complication affecting about 30% of pregnancies following assisted reproduction and 10% of spontaneously conceived pregnancies [1,2,3]. Even after the embryonic cardiac activity is detected, the subsequent rate of miscarriage remains 5.2 to 10.4% [4, 5]. Miscarriages very often have a significant social and psychological effect on the mother, especially for pregnant women who have received assisted reproduction or experienced repeated abortion [6, 7]. Therefore, for early pregnancies with fetal heart activity, the development of a prediction model for miscarriage can not only help obstetricians decide the frequency of subsequent ultrasound examinations but also alleviate the great psychological pressure and anxiety of the pregnant women due to fear of miscarriage.

Several studies have demonstrated that sonographic findings in early pregnancy have prognostic value in predicting pregnancy outcomes. Various signs have been described, such as mean gestational sac diameter [8], crown-rump length [9, 10], fetal heart rate [11,12,13], abnormal sonographic appearance of the yolk sac [14], etc. The examination of multiple sonographic features may improve the ability to predict subsequent pregnancy loss, however, their calculations are somewhat complicated, which limited ease of clinical implementation.

Deep learning algorithms have been widely applied in the field of image diagnosis and prediction owing to their advantages of being fast, accurate, and reproducible [15]. One of the most powerful deep learning approaches related to images is convolutional neural networks (CNNs). Recent research on medical data has shown that deep CNNs can be successfully used to extract and analyze information obtained from medical images [16,17,18]. In fetal ultrasonography (US), CNNs have shown potential for standard plane recognition, detection, and localization [19]. Training a CNN algorithm to analyze early pregnancy US images and predict the spontaneous miscarriage risk, would be a more convenient alternative for assessing prognosis. However, to our knowledge, there have been no reports to date on the prediction of miscarriage by deep learning.

In this study, we presented a novel procedure using the deep learning algorithms to predict the spontaneous miscarriage risk after the appearance of embryonic cardiac activity in 6 to 8 weeks through US images of the gestational sac.


Study design and patient population

This study was approved by the ethics committee of the Shengjing Hospital of China Medical University (2016PS243K) and was performed in accordance with the Declaration of Helsinki. The fully anonymized ultrasound images in the database were used in the study, and a CNN was constructed by training, validation, and test data sets (Fig. 1).

Fig. 1
figure 1

Data enrollment in the present study. This figure is a flowchart showing the strategies by which we searched for the database to enroll the study population. GW: gestational week; CNN: convolutional neural networks; FHR: fetal heart rate; CRL: crown-rump length; US: ultrasound

The retrospective review was conducted from September 2013 to June 2019, using information from the database of Shengjing Hospital of China Medical University. A total of 71,372 patients with early singleton pregnancies of gestational age between 6 and 8 weeks showing embryonic cardiac activity were retrospectively analyzed. There were 721 patients identified who had positive fetal cardiac activity on their first ultrasound but then experienced a miscarriage. Besides, 721 confirmed normal fetuses were randomly selected from the same database to serve as normal controls. A total of 72 cases were excluded because of measurement caliper overlays which affected the effectiveness of machine learning (34 cases of miscarriage, 38 normal fetuses). In order to better verify the predictive ability of the model, we also conducted a prospective study from January 2019 to June 2019, of 108 women with singleton pregnancies (6 of whom experienced miscarriages and 102 of whom carried the pregnancy to term). Seven patients were excluded because of missed follow-up. The relevant clinical information of the enrolled patients is summarized in Table 1. The gestational age of the enrolled patients at the time of the ultrasound was calculated based on the difference between the date of the ultrasound and the last menstrual period.

Table 1 Demographic and pregnancy characteristics of women included in the study

Data preparation for automated image analysis

US images in the retrospective study were extracted from the hospital ultrasound workstation. The ultrasound examination was conducted using Voluson 730 Expert (GE Medical Systems, Zipf, Austria), Affiniti 70 ultrasound systems (Philips Medical Systems, Amsterdam, the Netherlands), and Aixplorer® (SuperSonic Imagine, France). Images were taken using a transvaginal transducer with a frequency range from 3 to 9 MHz, and the center frequency of the transvaginal transducer was 6.5 MHz. All scans were performed by accredited sonographers with 3–15 years (mean: 8 years) experience. In the prospective study, ultrasound examination was conducted using Affiniti 70 ultrasound systems (Philips Medical Systems, Amsterdam, the Netherlands) equipped with a 3–9 MHz transvaginal transducer. All scans were performed by accredited sonographers with 5 to 15 years’ experience. Crown-rump length and fetal heart rate were measured.

In all the cases, the images were acquired by an experienced sonographer according to a unified acquisition process in the routine ultrasound examination. The specific process was as follows: on the sagittal section of the uterus, display the largest sagittal section of the gestational sac clearly, and save the image; on the transverse section of the uterus, the largest transverse section of the gestational sac was clearly displayed, and the image was preserved. The images in the median sagittal plane and the perpendicular transverse plane were selected for further analysis. All US images included were in JPEG format and 768 × 1024 pixels.

Gestational sac region segmentation

In order to avoid the interference of the grayscale information of the myometrium around the gestational sac to the CNN learning results, we first segmented the gestational sac region [20]. For all ultrasound images, the region of interest containing the complete gestational sac was selected for preprocessing (Fig. 2A). A level set was used to segment the region of interest, and then the smallest convex polygon with the largest connected region was found and filled (Fig. 2B, C, D). The complete edge of the gestational sac was extracted and marked on the original image (Fig. 2E).

Fig. 2
figure 2

The segmentation process of the gestational sac. A Firstly, the region of interests (ROI) was selected which contained the complete gestational sac. B Secondly, the region of gestational sac was segmented using level set method. C Thirdly, the largest connected area was found. D Fill the largest connected area. E Finally, the complete edge of the gestational sac was extracted and marked on the original image

To quantitatively assess the accuracy of the segmentation, we defined the segmentation area as “S”, and two experienced sonographers were trained and asked to manually trace along the inner edges of the trophoblast together, which we used as the “gold standard” (GT). The intersection area between S and GT served as the true positive predicted area (TP); S minus the TP served as the false positive predicted area (FP), and GT minus TP equaled the false-negative predicted area (FN). We calculated areas by counting the number of pixels, as illustrated in Fig. 3.

Fig. 3
figure 3

Illustration of the quantitative accuracy assessment of gestational sac region segmentation. The yellow ellipse is an area (G) manually labeled by a doctor before the test, while the green ellipse indicates the segmentation area (S) predicted by the algorithm. The intersection area between S and G is the true positive predicted area (TP). FP (false positive predicted area) = S-TP; FN (false negative predicted area) = G-TP

Performance of segmentation was assessed using precision, recall, and the Dice coefficient (DICE), which were calculated to measure the extent of overlap between human-labeled and machine-segmented regions. The values of these metrics all ranged from 0 to 1, with larger values representing better performance [19].

They are defined as follows:

  • Precision = TP / (TP + FP),

  • Recall = TP / (TP + FN),

  • DICE = 2 × TP / (S + GT).

Convolutional neural network

Segmented pregnancy sacs were used for CNN learning. Data augmentation was used to increase training data size and to help the trained model cope with variabilities that were not in the original training set. In this case, data augmentation was performed through random geometric image transformations with random parameters, including rotation (from − 40° to 40°), scaling (±20% of the original image sizes), horizontal and vertical shifting (smaller than 20% of the input image sizes for each direction), projection and mirror flipping (on horizontal and vertical directions) (Fig. 4). When there are blank areas after transformations such as shifting and rotation, we randomly used one type of filling methods (namely ‘constant’, ‘nearest’, ‘reflect’ and ‘wrap’). The data augmentation strategy has proved to help prevent network overfitting and memorization of the exact training image details [21]. The training group images were input into the CNN-based VGG19 model [22]. The programs of VGG19 used in our experiments can be downloaded from a public resource on a much popular program platform named ‘github’. The website is More composition details of the above architecture are shown in And the programs of how to use VGG19 model by Keras API and the environment setting are shown as supplemental materials.

Fig. 4
figure 4

Data augmentation was performed by flipping, rotation, scaling, shifting and projection

VGG19 is one type of VGG models containing the most convolutional layers. ‘19’ means 16 convolutional layers and 3 fully-connected layers (Fig. S1). The convolutional layers are responsible for extracting high-order features. And the fully-connected layers are used for features projection and classification [22]. Workflow of the VGG19 models for automated prediction of miscarriage is shown in Fig. 5. The classifier was used to classify the gestational sac features in the ultrasound images captured by the CNN for the final classification of miscarriage and ongoing pregnancy.

Fig. 5
figure 5

A schematic illustration of the convolutional neural network of VGG19

In the evaluation step, 10-fold cross-validation was used. A total of 1/10 of the positive and negative samples (136 cases) before the expansion was selected as the test set, 1/10 of the samples (136 cases) was chosen as the validation set, and the remaining 4/5 samples (1098 cases) were used as the training set. Ultrasound images from the prospective study were used as test sets to verify the predictive performance of the model. The whole testing procedure and computer programming environment are similar to our prior work on lung cancer prediction shown in [23].

The manual ultrasound characteristics

As prenatal US has become a standard element of routine prenatal care [24], many studies have focused on the ability of early ultrasound characteristics, such as fetal heart rate and crown-rump length to predict miscarriage. In order to compare the predictive power of different methods, we analyzed the fetal heart rate and crown-rump length according to the research of DeVilbiss et al. [5] using our prospective data. Low fetal heart rate (≤122, 123, and 158 bpm) and small crown-rump length (≤6.0, 8.5, and 10.9 mm) for gestational weeks 6, 7, and 8, respectively, were used as independent predictors of clinical pregnancy loss.

Statistical analyses

The Mann-Whitney U test or χ2 test was used for comparing maternal and embryonic characteristics between the group of pregnant women with spontaneous miscarriage and the group of pregnant women who carried their pregnancies to full term. The diagnostic indicators of CNN (sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV)) were calculated. The receiver operating characteristic (ROC) analysis was performed using MedCalc, version 18.11 (MedCalc Software Ltd., Ostend, Belgium), and the diagnostic index was calculated using SPSS, version 20.0 (IBM Corp, Armonk, NY, United States). A P-value < .05 was considered statistically significant.


Gestational sac region segmentation

The total set of 2942 images (the transverse and sagittal images of the 1370 retrospective cases and 101 prospective cases) of normal and abnormal cases were used for training, validation, and testing of the gestational sac region segmentation. The average DICE index was 90.69%, the recall was 94.56%, and the precision was 87.86%, indicating that the gestational sac could be well segmented automatically.


Diagnostic efficiency of the deep learning model

In the retrospective study, the deep learning model achieved good performance in predicting miscarriage with the use of early pregnancy ultrasound images. The overall accuracy was 80.32%, and the overall sensitivity, specificity, PPV, and NPV for prediction of miscarriage were 80.73, 80.91, 81.18, and 80.02%, respectively (Table 2).

Table 2 Classification results of different gestational weeks and different planes

Comparison of different gestational weeks and different planes

To be analyzed by different gestational weeks, the sixth week had a better classification effect, as the AUC for the sixth week was 0.904 (95% confidence interval [CI], 0.781–0.950), which was higher than the other 2 groups (seventh week, 0.832 (95%CI, 0.702–0.865); eighth week, 0.858 (95%CI, 0.729–0.829)). We also analyzed different ultrasound planes; the AUC of the sagittal combined with transverse planes was 0.857 (95%CI, 0.793–0.922), which showed better diagnostic performance compared with the sagittal (AUC, 0.843 (95%CI, 0.801–0.884)) or transverse plane (AUC, 0.834 (95%CI, 0.776–0.893)) alone. The ROC curves were shown in Fig. 6.

Fig. 6
figure 6

Receiver operating characteristic curves of the deep convolutional neural network model for prediction of miscarriage in the retrospective study. A The transverse plane of different gestational weeks; B The sagittal plane of different gestational weeks; C The sagittal combined transverse plane of different gestational weeks. D The overall gestational weeks of different planes. Tran: Transverse plane; Sag: Sagittal plane

Prospective study

In order to verify the actual diagnostic ability of our model, we used prospective data as the test set, based on the model trained by the previous retrospective data. Because the sagittal combined with transverse planes showed better diagnostic performance compared with the sagittal or transverse plane alone, we used the sagittal combined with transverse planes for analysis in the prospective study. It had good predictive ability in predicting early pregnancy miscarriage, with the accuracy, sensitivity, specificity, PPV, and NPV of 78.10, 80.39, 94.52, 94.89, and 77%, respectively, and an AUC of 0.885(95%CI, 0.846–0.925) (Table 3).

Table 3 Classification results of the prospective study

Comparison with the manual ultrasound characteristics

We analyzed the fetal heart rate and crown-rump length according to the research of DeVilbiss et al. [5] using our prospective data, and we obtained various indicators of diagnosis (Table 3). The predictive power of the CNN for reviewing ultrasound characteristics was higher compared with performing manual measurements of ultrasound features (the AUC was 0.885vs 0.687, respectively) (Fig. 7).

Fig. 7
figure 7

Receiver operating characteristic curves of the CNN model and Ultrasound indexes in the prospective study. CRL: crown-rump length; HR: heart rate; CNN: convolutional neural network. VGG19: a model of the convolutional neural network


In the retrospective study, we evaluated the feasibility of CNN-based deep learning algorithms for the prediction analysis of miscarriage in early pregnancy ultrasound images of the gestational sac. The best-performing model yielded satisfactory predictions on the test set, with an AUC of 0.857, a sensitivity of 80.73%, and a specificity of 80.91%. For the prospective study, the sensitivity and specificity of the classification were 80.39 and 94.52%, respectively, and the AUC was 0.885. This work suggests an improved approach to the assessment of the early pregnancy sac. Our research focused on pregnancies with a detectable fetal heart rate, instead of pregnancies without fetal cardiac activity, the clinical prediction of low-risk pregnancies was more practical. According to the prediction model, clinicians can provide better consultation and service for pregnant women, make a more reasonable and humanized follow-up plan, shorten the examination cycle and pay attention to the embryonic development status of the pregnant women at risk of miscarriage.

Most existing studies investigate the individual components of the first-trimester ultrasound for predicting miscarriage [9, 25, 26]. Abnormal yolk sac size and appearance have been reported to be useful markers for miscarriage prediction before the demonstration of fetal viability [27]. However, in presence of an established viable intrauterine pregnancy, its usefulness is limited. Pillai et al reported that the crown-rump length has a lower predictive value than the fetal heart rate [28]. This could be due to the fact that the measurement of the crown-rump length in early pregnancy is affected by inter-observer variation [29]. Some studies used 3D ultrasound to observe the volume of the gestational sac or trophoblast to predict early pregnancy miscarriage [30,31,32]. However, the post-processing of volume data is a lengthy process, which limits its clinical application. Gestational age also needs to be considered when using these sonographic markers, because they change with the gestational age throughout the first trimester. However, for spontaneous conception, the exact gestational age cannot be determined in women whose last menstrual period was uncertain or whose menstrual cycle was irregular [33]. Even in those with known last menstrual period and regular cycles, there is a discrepancy of more than 5 days in gestation calculated from the menstrual history and by ultrasound in about 25% of cases [34]. The miscarriage prediction mode we had made by CNNs focused on the morphologic characteristics of the gestational sac, gestational age was not considered as an analysis parameter. Although the ultrasound images of the sixth week performed better at predicting miscarriage, which may have been related to the morphologic changes of the gestational sac during embryonic development, the seventh and eighth weeks also have good prediction results. Recently, DeVilbiss et al. [5] found that low fetal heart rate and small crown-rump length were independent predictors of clinical pregnancy loss, with the greatest risks observed for pregnancies having both characteristics. In order to better illustrate the potential predictive power of our model, we also compared with the manual ultrasound measurements in a prospective study. The results suggest that our CNN method is fast, convenient, and provides more accurate results.

To our knowledge, this is the first study to analyze ultrasound image features using CNN to predict the subsequent risk of pregnancy loss during early pregnancy. In our research, machine learning makes up for the shortcomings of intra-observer and inter-observer variation of manual ultrasound measurements in the prediction of early pregnancy miscarriage. Because machine learning is automatic from extraction to calculation, it evaluates ultrasound images more quickly and objectively. At the same time, it can quantitatively calculate the characteristics of ultrasound images, which is most beneficial for the classification of data. We have not only established a predictive model through a large number of cases in a retrospective study, but we also conducted prospective experiments to verify the predictive ability of the model, so that it can be better applied to clinical practice in the future.

Pregnant women, particularly those who have recurrent miscarriages, have anxiety and fear regarding the potential for miscarriage during early pregnancy. Antenatal depression and anxiety impact fetal development and seem to have a long-term detrimental effect on children’s mental health [7, 35]. It suggests that in addition to the medical value of the ultrasound, it also has an important psychological value that has to be considered in order to guarantee integral care of the pregnant women, especially in the first trimester [36]. Our study tried to predict the possibility of future miscarriage at the time of the first appearance of the fetal heartbeat, it would be useful to obstetricians for counseling their patients, relieving their tension and anxiety, and guiding the short-term management of these pregnancies. A further clinical application of the predictive model would be to equip ultrasound machines with machine-learning capabilities to provide the probability of miscarriage by one click, assist clinicians in patient consultation.

Our research surpasses the limitations of manual measurement of clinical ultrasound image parameters and calculates the image features quantitatively and automatically. In addition, we found that the prediction ability of the transverse plane combined with the sagittal plane was better than that of a single plane, indicating that the scanning of multiple planes provides more comprehensive information. Our method is simple, fast, and more suitable for clinical application, which can make clinicians more confident in their diagnosis and consultation.

However, there are still many limitations in our study. The performance of CNNs is dependent on the images that are fed to the algorithm. The higher resolution ultrasound images performed by experienced sonographers will aid in the improved predictions. In the retrospective study, most of the images we included had irremovable measurement marks, which may explain why the deep learning algorithm cannot achieve a better learning performance. But on the other hand, it also shows that our method is more robust, has lower requirements on input prediction images, and is more suitable for clinical diagnosis. In addition, the number of prospective cases is small, more cases will be used to train the model, and a multicenter, large-scale prospective study, such as a large birth cohort study, will be conducted in the future. In addition, our model only considers the characteristics of the ultrasound image in two-dimensional. In the future, we would try to analyze the 3D ultrasound volume data of the gestational sac combined with other ultrasound and biochemical markers to optimize a better prediction model.


The CNN is feasible to predict miscarriage by identifying the morphologic characteristics of the gestational sac in early pregnancy, and it can be used clinically in the future to provide assistance during clinician-patient consultations.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.



Area under the receiver operating characteristic curve


Confidence interval


Convolutional neural network


False negative


False positive


Gold standard


Negative predictive value


Positive predictive value


Receiver operating characteristic


True positive




  1. Orvieto R, Ben-Rafael Z, Ashkenazi J, Yoeli R, Messing B, Perri T, et al. Outcome of pregnancies derived from assisted reproductive technologies: IVF versus ICSI. J Assist Reprod Genet. 2000;17:385–7.

    CAS  Article  Google Scholar 

  2. Maconochie N, Doyle P, Prior S, Simmons R. Risk factors for first trimester miscarriage--results from a UK-population-based case-control study. BJOG. 2007;114:170–86.

    CAS  Article  Google Scholar 

  3. Wilcox AJ, Weinberg CR, O'Connor JF, Baird DD, Schlatterer JP, Canfield RE, et al. Incidence of early loss of pregnancy. N Engl J Med. 1988;319:189–94.

    CAS  Article  Google Scholar 

  4. Stamatopoulos N, Lu C, Casikar I, Reid S, Mongelli M, Hardy N, et al. Prediction of subsequent miscarriage risk in women who present with a viable pregnancy at the first early pregnancy scan. Aust N Z J Obstet Gynaecol. 2015;55:464–72.

    Article  Google Scholar 

  5. DeVilbiss EA, Mumford SL, Sjaarda LA, Connell MT, Plowden TC, Andriessen VC, et al. Prediction of pregnancy loss by early first trimester ultrasound characteristics. Am J Obstet Gynecol. 2020;223:242 e241–22.

    Google Scholar 

  6. Farren J, Mitchell-Jones N, Verbakel JY, Timmerman D, Jalmbrant M, Bourne T. The psychological impact of early pregnancy loss. Hum Reprod Update. 2018;24:731–49.

    Article  Google Scholar 

  7. Farren J, Jalmbrant M, Falconieri N, Mitchell-Jones N, Bobdiwala S, Al-Memar M, et al. Posttraumatic stress, anxiety and depression following miscarriage and ectopic pregnancy: a multicenter, prospective, cohort study. Am J Obstet Gynecol. 2020;222:367 e361–22.

    Article  Google Scholar 

  8. Oh JS, Wright G, Coulam CB. Gestational sac diameter in very early pregnancy as a predictor of fetal outcome. Ultrasound Obstet Gynecol. 2002;20:267–9.

    CAS  Article  Google Scholar 

  9. Taylor TJ, Quinton AE, de Vries BS, Hyett JA. First-trimester ultrasound features associated with subsequent miscarriage: a prospective study. Aust N Z J Obstet Gynaecol. 2019;59:641–8.

    Article  Google Scholar 

  10. Tadmor OP, Achiron R, Rabinowiz R, Aboulafia Y, Mashiach S, Diamant YZ. Predicting first-trimester spontaneous abortion. Ratio of mean sac diameter to crown-rump length compared to embryonic heart rate. J Reprod Med. 1994;39:459–62.

    CAS  PubMed  Google Scholar 

  11. Bae S, Karnitis J. Triple ultrasound markers including fetal cardiac activity are related to miscarriage risk. Fertil Steril. 2011;96:1145–8.

    Article  Google Scholar 

  12. Chittacharoen A, Herabutya Y. Slow fetal heart rate may predict pregnancy outcome in first-trimester threatened abortion. Fertil Steril. 2004;82:227–9.

    Article  Google Scholar 

  13. Rauch ER, Schattman GL, Christos PJ, Chicketano T, Rosenwaks Z. Embryonic heart rate as a predictor of first-trimester pregnancy loss in infertility patients after in vitro fertilization. Fertil Steril. 2009;91:2451–4.

    Article  Google Scholar 

  14. Tan S, Gulden Tangal N, Kanat-Pektas M, Sirin Ozcan A, Levent Keskin H, Akgunduz G, et al. Abnormal sonographic appearances of the yolk sac: which can be associated with adverse perinatal outcome? Med Ultrason. 2014;16:15–20.

    Article  Google Scholar 

  15. Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat Rev Cancer. 2018;18:500–10.

    CAS  Article  Google Scholar 

  16. Litjens G, Sanchez CI, Timofeeva N, Hermsen M, Nagtegaal I, Kovacs I, et al. Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci Rep. 2016;6:26286.

    CAS  Article  Google Scholar 

  17. Lee JH, Joo I, Kang TW, Paik YH, Sinn DH, Ha SY, et al. Deep learning with ultrasonography: automated classification of liver fibrosis using a deep convolutional neural network. Eur Radiol. 2019.

  18. Park SH. Artificial intelligence for ultrasonography: unique opportunities and challenges. Ultrasonography. 2021;40:3–6.

    Article  Google Scholar 

  19. Xie H, Wang N, He M, Zhang L, Cai H, Xian J, et al. Using deep learning algorithms to classify fetal brain ultrasound images as normal or abnormal. Ultrasound Obstet Gynecol. 2020.

  20. Yin C, Wang Y, Zhang Q, Han F, Yuan Z, Yao Y. An Accurate Segmentation Framework for Static Ultrasound Images of the Gestational Sac. J Med Biol Eng. 2022;42(1):49–62.

  21. Zhou LQ, Wu XL, Huang SY, Wu GG, Ye HR, Wei Q, et al. Lymph node metastasis prediction from primary breast Cancer US images using deep learning. Radiology. 2020;294:19–28.

    Article  Google Scholar 

  22. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 [cs.CV]; 2014.

    Book  Google Scholar 

  23. Han FF, Yan LK, Chen JX, Teng YY, Chen S, Qi SL, et al. Predicting unnecessary nodule biopsies from a small, unbalanced, and pathologically proven dataset by transfer learning. J Digit Imaging. 2020;33:685–96.

    Article  Google Scholar 

  24. Rodgers SK, Chang C, DeBardeleben JT, Horrow MM. Normal and abnormal US findings in early first-trimester pregnancy: review of the Society of Radiologists in ultrasound 2012 consensus panel recommendations. Radiographics. 2015;35:2135–48.

    Article  Google Scholar 

  25. Abuelghar WM, Fathi HM, Ellaithy MI, Anwar MA. Can a smaller than expected crown-rump length reliably predict the occurrence of subsequent miscarriage in a viable first trimester pregnancy? J Obstet Gynaecol Res. 2013;39:1449–55.

    Article  Google Scholar 

  26. Reljic M. The significance of crown-rump length measurement for predicting adverse pregnancy outcome of threatened abortion. Ultrasound Obstet Gynecol. 2001;17:510–2.

    CAS  Article  Google Scholar 

  27. Suguna B, Sukanya K. Yolk sac size & shape as predictors of first trimester pregnancy outcome: a prospective observational study. J Gynecol Obstet Hum Reprod. 2019;48:159–64.

    CAS  Article  Google Scholar 

  28. Pillai RN, Konje JC, Richardson M, Tincello DG, Potdar N. Prediction of miscarriage in women with viable intrauterine pregnancy-a systematic review and diagnostic accuracy meta-analysis. Eur J Obstet Gynecol Reprod Biol. 2018;220:122–31.

    Article  Google Scholar 

  29. Pexsters A, Luts J, Van Schoubroeck D, Bottomley C, Van Calster B, Van Huffel S, et al. Clinical implications of intra- and interobserver reproducibility of transvaginal sonographic measurement of gestational sac and crown-rump length at 6-9 weeks' gestation. Ultrasound Obstet Gynecol. 2011;38:510–5.

    CAS  Article  Google Scholar 

  30. Odeh M, Ophir E, Grinin V, Tendler R, Kais M, Bornstein J. Prediction of abortion using three-dimensional ultrasound volumetry of the gestational sac and the amniotic sac in threatened abortion. J Clin Ultrasound. 2012;40:389–93.

    Article  Google Scholar 

  31. Reus AD, El-Harbachi H, Rousian M, Willemsen SP, Steegers-Theunissen RP, Steegers EA, et al. Early first-trimester trophoblast volume in pregnancies that result in live birth or miscarriage. Ultrasound Obstet Gynecol. 2013;42:577–84.

    CAS  Article  Google Scholar 

  32. Wie JH, Choe S, Kim SJ, Shin JC, Kwon JY, Park IY. Sonographic parameters for prediction of miscarriage: role of 3-dimensional volume measurement. J Ultrasound Med. 2015;34:1777–84.

    Article  Google Scholar 

  33. Yi Y, Lu G, Ouyang Y, Lin G, Gong F, Li X. A logistic model to predict early pregnancy loss following in vitro fertilization based on 2601 infertility patients. Reprod Biol Endocrinol. 2016;14:15.

    Article  Google Scholar 

  34. Papaioannou GI, Syngelaki A, Maiz N, Ross JA, Nicolaides KH. Ultrasonographic prediction of early miscarriage. Hum Reprod. 2011;26:1685–92.

    Article  Google Scholar 

  35. Stein A, Pearson RM, Goodman SH, Rapa E, Rahman A, McCallum M, et al. Effects of perinatal mental disorders on the fetus and child. Lancet. 2014;384:1800–19.

    Article  Google Scholar 

  36. Simo S, Zuniga L, Izquierdo MT, Rodrigo MF. Effects of ultrasound on anxiety and psychosocial adaptation to pregnancy. Arch Womens Ment Health. 2019;22:511–8.

    Article  Google Scholar 

Download references




This work was supported by the National Key Research and Development Program (2021YFC2701003, 2021YFC2701104), the National Natural Science Foundation of China (Grant numbers: 82171649, 81871219, 81671469), the LiaoNing Revitalization Talents Program (XLYC1902099) and 345 Talent Project of Shengjing Hospital. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations



YW, QXZ, CHY, FFH, and ZWY devised the study plan; LZC, ZYY, and XS helped with data acquisition; QXZ and CHY built the CNN model and analyzed the data; YZB, FFH and ZWY supervised the research. SSJ processed the data. YW, FFH, and ZWY wrote the draft manuscript; all authors read the draft manuscript and made important intellectual contributions to the final version. The author(s) read and approved the final manuscript.

Corresponding authors

Correspondence to Fangfang Han or Zhengwei Yuan.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the ethics committee of the Shengjing Hospital of China Medical University (2016PS243K) and was performed in accordance with the Declaration of Helsinki. The informed consent was obtained from all subject.

Consent for publication

Not applicable.

Competing interests

The authors declare that there is no competing interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

The architecture diagram of VGG19 model.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Zhang, Q., Yin, C. et al. Automated prediction of early spontaneous miscarriage based on the analyzing ultrasonographic gestational sac imaging by the convolutional neural network: a case-control and cohort study. BMC Pregnancy Childbirth 22, 621 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Early pregnancy
  • Spontaneous abortion
  • Sonographic
  • Deep learning