Skip to main content

Use of metabolomics for the identification and validation of clinical biomarkers for preterm birth: Preterm SAMBA



Spontaneous preterm birth is a complex syndrome with multiple pathways interactions determining its occurrence, including genetic, immunological, physiologic, biochemical and environmental factors. Despite great worldwide efforts in preterm birth prevention, there are no recent effective therapeutic strategies able to decrease spontaneous preterm birth rates or their consequent neonatal morbidity/mortality. The Preterm SAMBA study will associate metabolomics technologies to identify clinical and metabolite predictors for preterm birth. These innovative and unbiased techniques might be a strategic key to advance spontaneous preterm birth prediction.


Preterm SAMBA study consists of a discovery phase to identify biophysical and untargeted metabolomics from blood and hair samples associated with preterm birth, plus a validation phase to evaluate the performance of the predictive modelling. The first phase, a case–control study, will randomly select 100 women who had a spontaneous preterm birth (before 37 weeks) and 100 women who had term birth in the Cork Ireland and Auckland New Zealand cohorts within the SCOPE study, an international consortium aimed to identify potential metabolomic predictors using biophysical data and blood samples collected at 20 weeks of gestation. The validation phase will recruit 1150 Brazilian pregnant women from five participant centres and will collect blood and hair samples at 20 weeks of gestation to evaluate the performance of the algorithm model (sensitivity, specificity, predictive values and likelihood ratios) in predicting spontaneous preterm birth (before 34 weeks, with a secondary analysis of delivery before 37 weeks).


The Preterm SAMBA study intends to step forward on preterm birth prediction using metabolomics techniques, and accurate protocols for sample collection among multi-ethnic populations. The use of metabolomics in medical science research is innovative and promises to provide solutions for disorders with multiple complex underlying determinants such as spontaneous preterm birth.

Peer Review reports


Despite improvements in antenatal and neonatal care, the number of premature newborns each year has not significantly decreased since the 1960s. Preterm birth (PTB) is the leading cause of neonatal morbidity and mortality and a major contributor to loss of life, long-term disability, and health care costs [14]. The associated morbidity, mortality and high health costs have been well documented with premature infants facing life-threatening short and long term complications [58].

Despite the enormity of the health economic burden of preterm birth, and many years of focused research, a common aetiology and/or predictive test have not yet been identified. Spontaneous preterm birth (sPTB) is considered one of the “Great Obstetrical Syndromes”, which are conditions resulting from complex interactions between the maternal and fetal genome and the environment and which have a long preclinical period, fetal involvement and adaptive functioning in nature [9]. This reflects the multifactorial nature of this condition and the need to apply strategies that are capable of identifying multiple markers simultaneously in parallel with the assessment of clinical and biophysical risk factors.

There are many clinical [1013] and biochemical risk factors [1416] associated with sPTB and it is likely that these biochemical markers are present in the maternal blood long before the onset of a preterm labour. However, although certain candidate-driven approaches to studying these changes show promise, this has not resulted in effective predictive biomarkers for the general pregnant population. Due to these complex and dynamic characteristics of sPTB syndrome, it remains a difficult task to identify women and babies at risk.

Currently, the selection of women likely to deliver prematurely from clinical risk factors alone lacks the sensitivity required to effectively identify the majority of patients at risk of idiopathic sPTB [14]. Furthermore, parameters derived from previous obstetric history cannot be applied to nulliparous women. The association of biophysical predictors such as cervical length and/or vaginal biomarkers (fibronectin and phosphorylated insulin-like growth factor binding protein-1) enhances accuracy for prediction and enables more effective interventions for selected women. There are therapeutic interventions available for the prevention of sPTB, such as the use of progesterone [10, 17, 18] and cervical pessary [19]. Despite advances in selection of eligible women for such therapeutic interventions, the efficacy of cervical length or fetal fibronectin levels in asymptomatic women are still limited and seem to be more capable of discriminating women at lower risk than those at higher risk [2022]. Owen et al. showed that almost 50 % of women with cervical length between 15 and 25 mm did not deliver before 35 weeks, as well as approximately 70 % with cervical length between 25–30 mm [23].

The development of a predictive test for spontaneous preterm birth would help to accurately identify a high-risk population. To be effective, therapies need to be commenced at a gestational age in which they are likely to be of benefit. A sensitive early pregnancy-screening test would facilitate the timely administration of prophylactic treatments to those women at highest risk. The development of physics, biology and medicine translational research can provide a comprehensive approach for biological processes with complex pathways and regulations. Metabolomics offers an unbiased hypothesis generating approach to identify and validate potential candidate metabolomic biomarkers [24, 25].

We propose a multi-strategy approach to biomarker discovery and validation through the establishment of a large early pregnancy biobank of appropriate samples, in conjunction with the application of analytical methods capable of quantifying multiple blood-borne species simultaneously, and using some clinical and epidemiological markers to identify women at highest risk of spontaneous preterm birth.

The development of predictive tests that translate into clinical care can be divided into two distinct phases; (i) hypothesis generation after acquisition of data, a non-biased process where no or limited biological knowledge is required and (ii) validation of generated hypotheses [26]. The Preterm SAMBA study goal spans both phases and aims to identify a clinically useful early pregnancy-screening test to ascertain which pregnancies are at risk of developing sPTB. Discovery-based methods will be applied to blood and hair samples taken from carefully matched phenotypes in both cohorts (preterm and term deliveries) to develop a predictive algorithm to identify those women at increased risk of sPTB and test the effectiveness of such an algorithm in a prospective cohort.


Preterm SAMBA, an international collaborative multicentre study for the development of predictive tests that translate into clinical care, can be divided into two distinct phases: The first component (Discovery phase) is a case–control study that aims to identify clinical and metabolomics biomarkers related to spontaneous preterm birth. For this initial phase, untargeted metabolomics techniques will be employed to identify and quantify potential predictor’s metabolites that can be associated to potential clinical predictors. The second component (Validation phase) is a cohort study developed to validate the algorithm of prediction using the clinical and metabolomics biomarkers discovered in the first component of the study. Thus, to evaluate the performance of the prediction model developed at the first phase, targeted metabolomics techniques will be employed to analyse participants’ blood and hair samples to quantify those specific metabolites identified as potential predictors of preterm birth.

Discovery phase

The initial phase of the project consists of a case–control study utilizing data and samples collected for the SCOPE study (Screening for Pregnancy Endpoints study). The SCOPE consortium was an international effort to determine the causes and potential predictors for pregnancy complications and its methodology had already been previously published [2729]. Briefly, the cohort comprised 5690 healthy pregnant women recruited between November 2004 and August 2008 in New Zealand, Australia, Ireland and United Kingdom. Inclusion and exclusion criteria for the SCOPE study are described in Tables 1 and 2, respectively. Exclusion criteria include major fetal anomaly, chronic hypertension, diabetes, renal disease, systemic lupus erythematous, and antiphospholipid syndrome. These will therefore be the same criteria for the current study.

Table 1 Inclusion criteria of Preterm SAMBA validation phase – Brazilian cohort
Table 2 Exclusion criteria of Preterm SAMBA validation phase – Brazilian cohort

Extensive sociodemographic and physical data will be collected including age, ethnicity, socio-economic status, dietary and lifestyle questionnaire, parity, BMI (body mass index) and cigarette smoking.

Plasma and serum samples will be collected at 20 weeks of gestation using stringent standard operating procedures designed for metabolomics studies, barcoded and stored at −80 °C within 2–4 h; the timing between collection and freezing will be known for all specimens.

Several Standard Operating Procedures (SOP) for sample preparation by removal of proteins via ultrafiltration were developed and validated. The analysis of deproteinized plasma samples will be performed employing gas chromatography and liquid chromatography mass spectrometry (GC-MS and LC-MS). GC-MS and LC-MS techniques will be performed as described previously [30, 31]. Quality control samples (acquired by pooling plasma from all subjects) will be interspersed in every 5th run to assess reproducibility and validity. It is envisaged that the socioeconomic/physical/biomarker discovery phase of the Preterm SAMBA study will identify several candidate markers and predictive multivariate models. Discriminatory metabolites will be translated to a targeted triple quad MS (QQQ-MS) platform, to be used in the validation phase.

The proportion of pregnancies complicated by any preterm birth is approximately 10 %. Preterm SAMBA Discovery Phase will randomly select 100 women (n = 100) whose pregnancies reached term as compared to fifty randomly sampled pregnancies (n = 50) complicated by spontaneous preterm birth prior to 37 weeks gestation, in each of the Cork Ireland and Auckland New Zealand SCOPE cohorts (Fig. 1). The 20th week samples and data will be analysed to identify sPTB potential predictors. Considering there were no previous studies on this topic for preterm birth, these numbers were empirically estimated using a similar study performed for preeclampsia [31]. Using a type I error of 0.01, type II error of 0.10, a ratio between controls and cases of 1:1, an AUC of 0.9 and an OR of 10, the estimated sample of preterm birth is 49. We then anticipated around 50 preterm birth for each of the two centers.

Fig. 1
figure 1

Flowchart of Preterm SAMBA Discovery Phase participants (Case-control component)

Validation phase – the Brazilian multicentre cohort study

The Preterm SAMBA validation phase consists of a Brazilian multicentre cohort study with 1150 low-risk pregnant nulliparous women. Five of the 27 members of the Brazilian Network for Studies on Reproductive and Perinatal Health (BNSRPH), were chosen to participate in the Brazilian cohort (Table 3). Previous excellence performance in epidemiological and translational studies and diversity of cultural, ethnical and sociodemographic population characteristics were criteria for centre selection. Therefore, there are participating centres in three of the five regions of Brazil, which are the three most populated regions of the country: Northeast, Southeast and South.

Table 3 Participating centres in the Preterm-SAMBA study validation phase – Brazilian cohort

Assuming a type I error rate, α, of 5 % and an estimated area under ROC curve of at least 0.68, then in order to test hypotheses to a suitable level of power (80 % power, β = 0.2), the sample size sufficient should approximate to 80 cases of spontaneous preterm birth (<34 weeks gestation), calculated using MEDCALC®. Based on a minimum expected preterm birth rate of 7 %, the total cohort size should therefore be of approximately 1150 subjects, around 230 women at each participating centres.

Recruitment and data collection

The recruitment strategies include approaching existing pregnant women in participating facilities during prenatal care visits and with website/internet, flyers and local community advertisings. After the identification of potential participants, the research assistant will invite women and obtain an informed consent form of those who meet the inclusion criteria and agree to participate. Maternal age and ethnicity will be recorded from all approached women to facilitate a comparison of those who are recruited and those who decline.

All collected data will preferably be entered directly into the database, but printed forms will also be available in case of inability to access the internet-based database. In such cases, the data will be then entered later and completed printed forms will be stored, according to the required ethical principles.

Sociodemographic, physical data and pregnancy outcomes

First Visit (19–21 weeks): similarly to the SCOPE study, detailed information of sociodemographic characteristics (age, socioeconomic status, education, ethnicity, occupation and type of maternity care), maternal medical and obstetric history, infertility history, drugs and medications use, family medical and obstetric history and current pregnancy (occurrence and details of infection, vaginal bleeding, dipstick proteinuria, intercourse and hospital admission) will be collected.

Anthropometric measurements of maternal body mass index, height, weight, head circumference, arm circumference and triceps, biceps subscapular and suprailiac skinfolds will be performed according to standardized techniques. Height and weight of lightly clothed women will be measured to the nearest 0.1 mm and 0.1 kg respectively. Head and arm circumferences will be measured with an inelastic tape and skinfold thicknesses will be measured on the same side of the body to the nearest 0.2 mm using Harpenden (and/or Lange) skinfold calliper. The calliper is placed 1 cm distal to the firmly grasped skinfold, using the thumb and the index finger, at 90° to the skin. A single measurement is taken after 2 s.

Dietary intake will be assessed using a 24-h dietary recall administered by a trained professional who will query participants about food and beverage consumption in the previous 24 h. A trained nutritionist will then estimate calories, macro and micronutrient intake using computer-based standard tables allowing for appropriate ethnic, social and regional variations.

Furthermore, three consecutive manual blood pressure measurements will be recorded, using an appropriate cuff size for different arm circumferences and using Korotkoff phase V for diastolic blood pressure.

Second and third Visit (27–29 weeks and 37–39 weeks; both optional): three consecutive manual blood pressure measurements, anthropometric parameters (weight, height, head and arm circumference and triceps, biceps, subscapular and suprailiac skinfolds) and occurrence and characteristics of infection, vaginal bleeding, dipstick proteinuria, intercourse and hospital admissions will be recorded.

Postpartum data: data will be collected from the participant’s medical record, the prenatal chart and/or from a personal interview with the participant during hospital admission to minimize missing information. The main outcome is spontaneous preterm birth, defined as a birth before 34 weeks of gestational age due to preterm labour or premature rupture of membranes. Secondary outcomes will also be evaluated: spontaneous preterm birth alternatively defined as a birth before 37 weeks of gestational age due to preterm labour or premature rupture of membranes, provider-initiated preterm birth, defined as preterm birth due to medical indication on account of maternal or fetal conditions; pre-eclampsia, defined as having systolic blood pressure ≥ 140 or systolic blood pressure ≥ 90 mmHg after 20 weeks gestation on at least two occasions apart of 20 min, and/or proteinuria (24-h urinary protein ≥ 300 mg or urine dipstick ≥ ++) and/or severe maternal complications [32]; gestational diabetes mellitus according to ADA guidelines [33]; fetal growth restriction (FGR) defined as having birthweight below 10th percentile based on GROW customised birthweight centiles [34]. Clinical data will also be collected regarding the occurrence of preterm labour, cervical cerclage, deep vein thrombosis, infection, vaginal bleeding, dipstick proteinuria, intercourse, hospital admission, deep vein thrombosis during pregnancy and puerperium, and maternal mortality and the use of progesterone and/or pessary, tocolytic, antibiotic for preterm labour or pPROM, corticosteroids for fetal maturation, magnesium sulphate for neuroprotection during pregnancy. The occurrence of severe maternal morbidity and near miss will also be reported according to WHO guidelines [35]. Neonatal outcomes related to neonatal morbidity and mortality will be recorded until newborn discharge or death.

Sample collection, processing and storage

Non-fasting blood samples will be collected at 20 (between 19 and 21) weeks of gestational age (Fig. 2). All research assistants will be trained according to specific and detailed Standard Operation Protocols (SOPs) developed for sample collection, processing and storage. One of the study coordinators was trained by the SCOPE team, to guarantee adequate understanding of all necessary procedures. A maximum of 20 mL of blood will be collected to provide serum and plasma specimens. The blood specimens will be stored in 250 μL 2-D barcoded cryovials after one centrifugation cycle for plasma specimens (2000×g for 10 min at 4 °C) and two centrifugation cycles for serum (2000×g for 10 min and 2400×g for 10 min at 4 °C). The blood samples will be processed and frozen at −80 °C within 2–4 h. The time interval between collection and freezing will be recorded for all specimens.

Fig. 2
figure 2

Visits of Preterm-SAMBA Validation Phase

Although the Preterm SAMBA strategy and workflow is focussed around the analysis of blood samples, we recently reported a proof-of-concept study, which highlighted the potential use of the hair metabolome in the prediction of pregnancy complications [36]. Hair samples will also be collected at 20 weeks’ gestation. Samples (20–30 hair strands for each participant) will be collected from the occipital area, 0.5 cm away from the scalp, using blunt scissors. Then, hair will be packed in aluminium foil and stored at room temperature [36]. A unique linear barcode will be pasted on each hair package. All specimens and quality control information will be registered in the database.


A specific database for the Preterm SAMBA Brazilian Cohort was developed together with MedSciNet, a Swedish based company specialized in the design and development of online database systems linked with biobanks management systems, similarly to the database previously used for the SCOPE. The preterm SAMBA database will be centralized, secure, internet-based and FDA (United States Food and Drug Administration) and HIPAA (Health Insurance and Accountability Act of 1996, United States Security and Privacy Rules) compliant, which allows continuous data entry and monitoring of study progress. Completeness of clinical data and specimen collection will be constantly monitored, with incomplete fields ‘flagged’ for attention. The database allows several monitoring procedures with hierarchical access licenses and tracking system for all specimen aliquots stored. To comply with biobank regulatory issues, patients will only be identified by a unique study number. Pseudo-anonymised metadata and interim data will be stored using our laboratory information management system. The identifying information about participants will be kept in a separate and secure local database.

Data and sample quality

Several procedures to enhance and assure data and sample quality will be adopted. All entered data will be prospectively and retrospectively monitored. During data entry, internal consistency of variables is performed and error messages are automatically flagged. After completing the collection of data from a participant, all information needs to be reviewed by a local monitor. Then, the final form has to be signed by the local principal investigator (PI) in order to be incorporated in the final database. The coordinating centre (Campinas, Brazil) will also perform a centralized monitoring of data and samples. An initial meeting with all researchers from Brazilian participating centres has been held to discuss the final protocol, procedures to be implemented, their particular characteristics and necessary approaches to be used to guarantee the implementation of the study. Another general meeting at the end of study is planned in order to discuss results, strategies for manuscripts’ writing and submission and other related topics.

The coordinating centre will randomly select approximately 10 % of printed completed forms to carry out a check and validation of data from the forms and database entry during the first and second half of the study. This double-check procedure enhances data quality and decreases typing errors.

The record of information regarding sample collection, processing (precentrifugation and centrifugation) and storage processes will follow the Standard Preanalytical Coding for biospecimens (SPRECs) protocol, developed and recommended by the International Society for Biological and Environmental Repositories Biospecimen (ISBER) Science Working Group [37]. This protocol enables standardization of preanalytical information, using standard codes to refer to the techniques and conditions to which the samples were submitted.

Metabolomics analysis

The precise methodology to be used in the validation phase will depend on the ongoing discovery studies. As detailed above we anticipate that it will be based on a targeted triple quad MS (QQQ-MS) platform, as previously described [38]. We will subsequently describe details relating to metabolomics analysis techniques and metabolomics statistical analyses. We anticipate that data analysis will be integrated into the relational database such that decision rules may combine both clinical and spectrometric data.

The performance of the final algorithm developed in the discovery phase will be evaluated by its capacity to predict spontaneous preterm birth occurrence in women from the Brazilian cohort. The validation will be performed using the average squared difference between predicted and observed outcome (R2), adjusted R2 (same as R2, but penalizes for the number of predictors), sensitivity, specificity, positive and negative predictive values, likelihood ratio and the area under the ROC curve.

Ancillary studies

The Preterm SAMBA Brazilian cohort study will collect additional data regarding other relevant maternal and fetal obstetric complications. Detailed clinical data related to the occurrence and severity of pre-eclampsia, fetal growth restriction and gestational diabetes mellitus will be recorded. Fetal growth restriction will be diagnosed if birthweight is below 10th customised percentile. The occurrence of severe maternal morbidity, maternal near miss and maternal mortality during pregnancy or up to discharge after delivery will also be recorded, according to WHO definitions [35].

Ethical aspects

The SCOPE study, whose data and samples will be analysed for the Preterm SAMBA discovery phase, was approved by local ethics committees in New Zealand and Ireland and registered in the Australian and New Zealand Clinical Trial Registry (ACTRN12607000551493) [28]. All women who participated in the SCOPE study provided written informed consent and agreed to have their data and samples used in other studies. The Preterm SAMBA study has been reviewed and approved by the National Committee for Ethics in Research of Brazil (CONEP) and by the Institutional Review Board (IRB) of the coordinating centre (Letter of approval 1.048.565 issued on 28th April 2015) and of all other Brazilian participating centres. All women who will be enrolled in the Preterm SAMBA Brazilian cohort (Validation phase) will sign an informed consent form, also allowing for future additional studies with their biological samples without any additional consent.

The ethical principles stated in the Brazilian National Heath Council (Resolution CNS 466/12) will be respected in every stage of this study. The anonymity of the source of information will be guaranteed and the care for the women will be provided independent of her agreement to participate in the study. All ethical principles related to biobank storage and transport will be followed according to national and international rules related to research with human beings. The study also complies with the Declaration of Helsinki amended in Hong Kong in 1989. The methodological and ethical aspects of Preterm SAMBA study protocol were developed following STROBE guidelines [39].


The “Omics” Science comprises genomics, transcriptomics, proteomics and metabolomics technologies, which each provide valuable translational surveys in biological processes. A metabolomics approach enables the evaluation of metabolic pathways and the correlation of biochemical changes related to pathophysiology of disease, providing a downstream result of gene expression and higher sensitivity to phenotype of disease [4045]. Underlying conditions and factors related to the occurrence, severity or prognosis of diseases with complex determinants may be assessed, bringing to light the final product of organism metabolism: the metabolome [42].

The development of a two-phase metabolomics research program that includes two large cohorts of nulliparous women is not an easy task. The network collaboration is essential to develop, implement and analyse such complex data and, more importantly, to achieve reliable results. Precise protocols for sample collection, processing, storage and biobank management will be essential to assure high quality data and results.

Metabolomics profiling requires different techniques to address the detection and quantification of different classes of metabolites once there is no current method capable to identify all of them. Preterm SAMBA study will employ different untargeted techniques that require very carefully and standardized protocols for sample preparation [24]. Studying the metabolome in blood samples requires invasive collection and immediate processing. As an alternative, hair samples are non-invasive, do not need processing methods and can be stored at room temperature. Hair can, theoretically, reflect endogenous compounds and environmental exposures from many days/weeks ago. The determination of the hair metabolome is a possible approach to identify biomarkers for spontaneous preterm birth. It has already been explored in gestational diabetes and fetal growth restriction, revealing potential endogenous mechanisms involved in those pathologic conditions [36, 45].

The identification of spontaneous preterm birth predictors using multi-ethnic data/samples and the evaluation of performance in a culturally and ethnically different population is desirable and meaningful for external validation. The use of quality control records and SPREC protocol is another important recommendation for metabolomics studies due to the necessity to evaluate confounders for analytical measures such as the time between sample collection, storage and processing conditions and the occurrence of haemolysis, lipaemia and metabolic degradation on account of inadequate temperature or solar exposure [37].

In the context of translational research, metabolomics may enhance understanding of the underlying pathways, which lead to obstetric complications. Preterm SAMBA aims to identify and validate a predictive model for spontaneous preterm birth, but will also develop a biobank and database that will enable research on pre-eclampsia (PE), fetal growth restriction (FGR) and gestational diabetes mellitus (GDM). The possibility to combine biochemical, genetics and clinical information that can be large-scale and replicable empowers the development of knowledge for clinical practice in preterm birth prevention. This would be especially worthwhile and helpful for countries with a high proportion and high absolute number of preterm births as is the case of Brazil where around 12 % of all births occur prematurely [46].

A recent clustered designed study showed that 30 % of all spontaneous preterm births do not have any maternal, fetal or placental conditions identified that could be related to its occurrence [47]. The application of metabolomics techniques could be a promising approach for spontaneous preterm birth prediction, all the more in those cases of silent phenotype in which there are no known predictors. Metabolomics have been already described in other obstetric conditions as pre-eclampsia, gestational diabetes mellitus and fetal growth restriction [31, 40, 41, 45, 48, 49], showing excellent performance in terms of a discriminatory algorithm. Therefore, we believe metabolomics is a powerful and strategic key not only for preterm birth prediction, but hopefully also for its prevention. The detection of metabolic pathways related to PTB syndrome may enable the development of more accurate therapies for primary or secondary prevention of pregnant women identified as at high-risk.

At the end of the study, if we are successful in the identification of such an effective algorithm, certainly several other topics should be carefully considered. Can this knowledge be really translated into a commercially available kit for screening purposes? Would the costs derived from this process be acceptable for low and middle-income countries? How will this be made available for populations in public sector? For discussing a future implementation of such a screening strategy, the following necessary points to be covered are to know if a concrete package of interventions to reduce preterm birth among those women identified as high-risk is available, and if it is cost-effective to be supported by the public health system. Finally, in this study we are planning to transfer the technology developed for the algorithm from New Zealand to Brazil, including lab technologies for assessing the biomarkers identified by metabolomics for preterm birth. Hopefully, if this is proved to be feasible, we believe that an important step for reducing the burden of preterm birth will have been achieved.


ADA, American Diabetes Association; BMI, body mass index; CONEP, Brazilian National Committee for Ethics in Research; FAME, fatty acid methyl esterification; FDA, Food and Drug Administration; FGR, fetal growth restriction; GC, gas chromatography; GDM, gestational diabetes mellitus; LC, liquid chromatography; MS, mass spectrometry; PE, pre-eclampsia; PI, principal investigator; pPROM, preterm premature rupture of membranes; PTB, preterm birth; SAMBA, Screening and Metabolomics in Brazil and Auckland; SCOPE, Screening for Pregnancy Endpoints study; SOP, standard operation procedures; sPTB, spontaneous preterm birth; WHO, World Health Organization


  1. Liu L, Johnson HL, Cousens S, Perin J, Scott S, Lawn JE, et al. Global, regional, and national causes of child mortality: an updated systematic analysis for 2010 with time trends since 2000. Lancet. 2012;379:2151–61.

    Article  PubMed  Google Scholar 

  2. Blencowe H, Lee ACC, Cousens S, Bahalim A, Narwal R, Zhong N, et al. Preterm birth-associated neurodevelopmental impairment estimates at regional and global levels for 2010. Pediatr Res. 2013;74 Suppl 1:17–34.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Blencowe H, Vos T, Lee ACC, Philips R, Lozano R, Alvarado MR, et al. Estimates of neonatal morbidities and disabilities at regional and global levels for 2010: introduction, methods overview, and relevant findings from the Global Burden of Disease study. Pediatr Res. 2013;74 Suppl 1:4–16.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Beck S, Wojdyla D, Say L, Betran AP, Merialdi M, Requejo JH, et al. The worldwide incidence of preterm birth: a systematic review of maternal mortality and morbidity. Bull World Health Organ. 2010;88:31–8.

    Article  PubMed  Google Scholar 

  5. Copper RL, Goldenberg RL, Creasy RK, DuBard MB, Davis RO, Entman SS, et al. A multicenter study of preterm birth weight and gestational age-specific neonatal mortality. Am J Obstet Gynecol. 1993;168(1 Pt 1):78–84.

    Article  CAS  PubMed  Google Scholar 

  6. Costeloe K, Hennessy E, Gibson AT, Marlow N, Wilkinson AR. The EPICure study: outcomes to discharge from hospital for infants born at the threshold of viability. Pediatrics. 2000;106:659–71.

    Article  CAS  PubMed  Google Scholar 

  7. Petrou S. The economic consequences of preterm birth during the first 10 years of life. BJOG. 2005;112(Suppl):10–5.

    Article  PubMed  Google Scholar 

  8. Wood NS, Costeloe K, Gibson AT, Hennessy EM, Marlow N, Wilkinson AR. The EPICure study: associations and antecedents of neurological and developmental disability at 30 months of age following extremely preterm birth. Arch Dis Child Fetal Neonatal Ed. 2005;90:F134–40.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Di Renzo GC. The great obstetrical syndromes. J Matern Neonatal Med. 2009;22:633–5.

    Article  Google Scholar 

  10. Da Fonseca EB, Bittar RE, Carvalho MH, Zugaib M. Prophylactic administration of progesterone by vaginal suppository to reduce the incidence of spontaneous preterm birth in women at increased risk: A randomized placebo-controlled double-blind study. Am J Obstet Gynecol. 2003;188:419–24.

    Article  PubMed  Google Scholar 

  11. Newnham JP, Dickinson JE, Hart RJ, Pennell CE, Arrese CA, Keelan JA. Strategies to Prevent Preterm Birth. Front Immunol. 2014;5:584.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Requejo J, Merialdi M, Althabe F, Keller M, Katz J, Menon R. Born Too Soon: Care during pregnancy and childbirth to reduce preterm deliveries and improve health outcomes of the preterm baby. Reprod Health. 2013;10 Suppl 1:S4.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Romero R, Nicolaides K, Conde-Agudelo A, Tabor A, O’Brien JM, Cetingoz E, et al. Vaginal progesterone in women with an asymptomatic sonographic short cervix in the midtrimester decreases preterm delivery and neonatal morbidity: a systematic review and metaanalysis of individual patient data. Am J Obstet Gynecol. 2012;206(124):e1–e19.

    PubMed  Google Scholar 

  14. Goldenberg RL, Iams JD, Mercer BM, Meis PJ, Moawad A, Das A, et al. The Preterm Prediction Study: toward a multiple-marker test for spontaneous preterm birth. Am J Obstet Gynecol. 2001;185:643–51.

    Article  CAS  PubMed  Google Scholar 

  15. Goldenberg RL, Goepfert AR, Ramsey PS. Biochemical markers for the prediction of preterm birth. Am J Obstet Gynecol. 2005;192(5 Suppl):S36–46.

    Article  CAS  PubMed  Google Scholar 

  16. Torbé A, Czajka R. Proinflammatory cytokines and other indications of inflammation in cervico-vaginal secretions and preterm delivery. Int J Gynaecol Obstet. 2004;87:125–30.

    Article  PubMed  Google Scholar 

  17. Meis PJ, Klebanoff M, Thom E, Dombrowski MP, Sibai B, Moawad AH, et al. Prevention of recurrent preterm delivery by 17 alpha-hydroxyprogesterone caproate. N Engl J Med. 2003;348:2379–85.

    Article  CAS  PubMed  Google Scholar 

  18. Romero R. Progesterone to prevent preterm birth in twin gestations: what is the next step forward? BJOG. 2013;120:1–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Goya M, Pratcorona L, Merced C, Rodó C, Valle L, Romero A, et al. Cervical pessary in pregnant women with a short cervix (PECEP): an open-label randomised controlled trial. Lancet. 2012;379:1800–6.

    Article  PubMed  Google Scholar 

  20. Abbott DS, Hezelgrave NL, Seed PT, Norman JE, David AL, Bennett PR, et al. Quantitative fetal fibronectin to predict preterm birth in asymptomatic women at high risk. Obstet Gynecol. 2015;125:1168–76.

    Article  CAS  PubMed  Google Scholar 

  21. Crane JM, Hutchens D. Transvaginal sonographic measurement of cervical length to predict preterm birth in asymptomatic women at increased risk: a systematic review. Ultrasound Obstet Gynecol. 2008;31:579–87.

    Article  CAS  PubMed  Google Scholar 

  22. Honest H, Forbes CA, Durée KH, Norman G, Duffy SB, Tsourapas A, et al. Screening to prevent spontaneous preterm birth: systematic reviews of accuracy and effectiveness literature with economic modelling. Health Technol Assess. 2009;13:1–627.

    Article  CAS  PubMed  Google Scholar 

  23. Owen J, Szychowski JM, Hankins G, Iams JD, Sheffield JS, Perez-Delboy A, et al. Does midtrimester cervical length ≥25 mm predict preterm birth in high-risk women? Am J Obstet Gynecol. 2010;203(393):e1–5.

    Google Scholar 

  24. Horgan RP, Clancy OH, Myers JE, Baker PN. An overview of proteomic and metabolomic technologies and their application to pregnancy research. BJOG. 2009;116:173–81.

    Article  CAS  PubMed  Google Scholar 

  25. Romero R, Espinoza J, Gotsch F, Kusanovic J, Friel L, Erez O, et al. The use of high-dimensional biology (genomics, transcriptomics, proteomics, and metabolomics) to understand the preterm parturition syndrome. BJOG. 2006;113:118–35.

    Article  CAS  PubMed  Google Scholar 

  26. Kell DB, Oliver SG. Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era. Bioessays. 2004;26:99–105.

    Article  PubMed  Google Scholar 

  27. North RA, McCowan LM, Dekker GA, Poston L, Chan EHY, Stewart AW, et al. Clinical risk prediction for pre-eclampsia in nulliparous women: development of model in international prospective cohort. BMJ. 2011;342:d1875.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Australian New Zealand Clinical Trials Registry: ACTRN12607000551493. Accessed 6 Aug 2016.

  29. McCowan LM, Roberts CT, Dekker GA, Taylor RS, Chan EHY, Kenny LC, et al. Risk factors for small-for-gestational-age infants by customised birthweight centiles: data from an international prospective cohort study. BJOG. 2010;117:1599–607.

    Article  CAS  PubMed  Google Scholar 

  30. Wishart DS, Lewis MJ, Morrissey JA, Flegel MD, Jeroncic K, Xiong Y, et al. The human cerebrospinal fluid metabolome. J Chromatogr B Anal Technol Biomed Life Sci. 2008;871:164–73.

    Article  CAS  Google Scholar 

  31. Kenny LC, Broadhurst DI, Dunn W, Brown M, North RA, McCowan L, et al. Robust early pregnancy prediction of later preeclampsia using metabolomic biomarkers. Hypertension. 2010;56:741–9.

    Article  CAS  PubMed  Google Scholar 

  32. Brown MA, Lindheimer MD, de Swiet M, Van Assche A, Moutquin JM. The classification and diagnosis of the hypertensive disorders of pregnancy: Statement from the International Society for the Study of Hypertension in Pregnancy (ISSHP). Hypertens Pregnancy. 2001;20:ix–xiv.

    Article  CAS  PubMed  Google Scholar 

  33. American Diabetes Association: 2. Classification and Diagnosis of Diabetes. Diabetes Care. 2015;38:S8–S16.

    Article  Google Scholar 

  34. GROW Birthweight Centiles. Bulk Centile Calculator. Accessed 6 Aug 2016.

  35. Say L, Souza JP, Pattinson RC. Maternal near miss—towards a standard tool for monitoring quality of maternal health care. Best Pract Res Clin Obstet Gynaecol. 2009;23:287–96.

    Article  PubMed  Google Scholar 

  36. Sulek K, Han TL, Villas-Boas SG, Wishart DS, Soh SE, Kwek K, et al. Hair metabolomics: identification of fetal compromise provides proof of concept for biomarker discovery. Theranostics. 2014;4(9):953–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Betsou F, Lehmann S, Ashton G, Barnes M, Benson EE, Coppola D, et al. Standard preanalytical coding for biospecimens: defining the sample PREanalytical code. Cancer Epidemiol Biomarkers Prev. 2010;19:1004–11.

    Article  CAS  PubMed  Google Scholar 

  38. Bond L, Thomas G, Nolan C, Christensen P, Baker PN, Kenny LC, et al. Preeclampsia risk stratification early in pregnancy: First results of a new LC-MS based multiplex metabolite assay. 2nd Annual European Congress on Clinical Mass Spectrometry. 2015.

    Google Scholar 

  39. Von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. PLoS Med. 2007;4:e296.

    Article  Google Scholar 

  40. Dessì A, Marincola FC, Fanos V. Metabolomics and the great obstetrical syndromes – GDM, PET, and IUGR. Best Pract Res Clin Obstet Gynaecol. 2014;29:156–64.

    Article  PubMed  Google Scholar 

  41. Fanos V, Atzori L, Makarenko K, Melis GB, Ferrazzi E. Metabolomics application in maternal-fetal medicine. Biomed Res Int. 2013;2013:720514.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Putri SP, Nakayama Y, Matsuda F, Uchikata T, Kobayashi S, Matsubara A, et al. Current metabolomics: practical applications. J Biosci Bioeng. 2013;115:579–89.

    Article  CAS  PubMed  Google Scholar 

  43. Dharuri H, Demirkan A, van Klinken JB, Mook-Kanamori DO, van Duijn CM, ’t Hoen PA, et al. Genetics of the human metabolome, what is next? Biochim Biophys Acta. 2014;1842:1923–31.

    Article  CAS  PubMed  Google Scholar 

  44. Romero R, Mazaki-Tovi S, Vaisbuch E, Kusanovic JP, Chaiworapongsa T, Gomez R, et al. Metabolomics in premature labor: a novel approach to identify patients at risk for preterm delivery. J Matern Fetal Neonatal Med. 2010;23:1344–59.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. He X, de Seymour JV, Sulek K, Qi H, Zhang H, Han T-L, et al. Maternal hair metabolome analysis identifies a potential marker of lipid peroxidation in gestational diabetes mellitus. Acta Diabetol. 2016;53(1):119–22.

    Article  PubMed  Google Scholar 

  46. Passini Jr R, Cecatti JG, Lajos GJ, Tedesco RP, Nomura ML, Dias TZ, for the Brazilian Multicentre Study on Preterm Birth study group, et al. Brazilian Multicentre Study on Preterm Birth (EMIP): prevalence and factors associated with spontaneous preterm birth. PLoS One. 2014;9(10):e109069.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Barros FC, Papageorghiou AT, Victora CG, Noble JA, Pang R, Iams J, et al. The Distribution of Clinical Phenotypes of Preterm Birth Syndrome: Implications for Prevention. JAMA Pediatr. 2015;169:220–9.

    Article  PubMed  Google Scholar 

  48. Kenny LC, Black MA, Poston L, Taylor R, Myers JE, Baker PN, et al. Early pregnancy prediction of preeclampsia in nulliparous women, combining clinical risk and biomarkers: the Screening for Pregnancy Endpoints (SCOPE) international cohort study. Hypertension. 2014;64:644–52.

    Article  CAS  PubMed  Google Scholar 

  49. Huynh J, Xiong G, Bentley-Lewis R. A systematic review of metabolite profiling in gestational diabetes mellitus. Diabetologia. 2014;57:2453–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


The Preterm SAMBA study group also included: Mary A. Parpinelli, Jussara Mayrink, School of Medical Sciences, University of Campinas, Brazil; Kleber G. Franchini, LNBio, Campinas, Brazil; Iracema M. Calderon, Bianca F. Cassettari, School of Medicine of Botucatu, UNESP, Brazil; Janete Vetorazzi, Lucia Pfitscher, School of Medicine, Federal University of Rio Grande do Sul, Porto Alegre, Brazil; Edilberto P. Rocha Filho, Débora F Leite, School of Medicine, Federal University of Pernambuco, Recife, Brazil; Francisco E. Feitosa, Carolina L. Costa e Silva, School of Medicine, Federal University of Ceará, Fortaleza, Brazil.

The SCOPE study group also included: Lucilla Poston, King’s College London and King’s Health Partners, London, UK; Jenny E. Myers, Maternal & Fetal Health Research Centre, University of Manchester, UK; Nigel A.B. Simpson and James J Walker, University of Leeds, UK; Gus A. Dekker and Claire T. Roberts, University of Adelaide, South Australia.


This was one of the two big studies selected for sponsoring from the research call “Grand Challenges Brazil: Reducing the burden of preterm birth” number 05/2013 jointly issued by the Brazilian National Research Council (CNPq) and the Bill and Melinda Gates Foundation (Award 401636/2013-5). The funders played no role at all in the study design, writing the manuscript nor in the decision to submit the manuscript for publication. LC Kenny is supported by a Science Foundation Ireland Program Grant for INFANT (12/RC/2272). The SCOPE database is provided and maintained by MedSciNet AB ( The New Zealand SCOPE study was funded by the New Enterprise Research Fund, Foundation for Research Science and Technology; Health Research Council (04/198); Evelyn Bond Fund, Auckland District Health Board Charitable Trust. The Irish SCOPE study was funded by the Health Research Board of Ireland (CSA/2007/2;

Authors’ contributions

All authors contributed to the overall study design and specific methodologies. JGC, PNB, MLC and RCP conceived the study design. RTS, JGC, RCP and RP planned the implementation of the study. RTS and JGC drafted the manuscript. KS, LCK and SVB participated in the design of the metabolomics methods for essays. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations



Corresponding author

Correspondence to Jose G. Cecatti.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cecatti, J.G., Souza, R.T., Sulek, K. et al. Use of metabolomics for the identification and validation of clinical biomarkers for preterm birth: Preterm SAMBA. BMC Pregnancy Childbirth 16, 212 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: