- Open Access
Comparison of alternative gestational age assessment methods in a low resource setting: a retrospective study
BMC Pregnancy and Childbirth volume 22, Article number: 585 (2022)
Accurate gestational age (GA) determination allows correct management of high-risk, complicated or post-date pregnancies and prevention or anticipation of prematurity related complications. Ultrasound measurement in the first trimester is the gold standard for GA determination. In low- and middle-income countries elevated costs, lack of skills and poor maternal access to health service limit the availability of prenatal ultrasonography, making it necessary to use alternative methods. This study compared three methods of GA determination: Last Normal Menstrual Period recall (LNMP), New Ballard Score (NBS) and New Ballard Score corrected for Birth Weight (NBS + BW) with the locally available standard (Ultrasound measurement in the third trimester) in a low-resource setting (Tosamaganga Council Designated Hospital, Iringa, Tanzania).
All data were retrospectively collected from hospital charts. Comparisons were performed using Bland Altman method.
The analysis included 70 mother-newborn pairs. Median gestational age was 38 weeks (IQR 37–39) according to US. The mean difference between LNMP vs. US was 2.1 weeks (95% agreement limits − 3.5 to 7.7 weeks); NBS vs. US was 0.2 weeks (95% agreement limits − 3.7 to 4.1 weeks); NBS + BW vs. US was 1.2 weeks (95% agreement limits − 1.8 to 4.2 weeks).
In our setting, NBS + BW was the least biased method for GA determination as compared with the locally available standard. However, wide agreement bands suggested low accuracy for all three alternative methods. New evidence in the use of second/third trimester ultrasound suggests concentrating efforts and resources in further validating and implementing the use of late pregnancy biometry for gestational age dating in low and middle-income countries.
Accurate determination of gestational age (GA) is of great importance in clinical practice, allowing correct management of high-risk, complicated or post-date pregnancies and prevention or anticipation of prematurity-related complications . The ultrasound (US) measurement in the first trimester (up to and including 13 6/7 weeks of gestation) is considered the gold standard for GA determination and is followed in accuracy by ultrasound in second and third trimester . In low- and middle-income countries, the availability of prenatal ultrasonography is limited by elevated costs, lack of skills and poor maternal access to health service, making it necessary to use alternative methods . The Last Normal Menstrual Period recall (LNMP) and the neonatal physical and neurological maturity assessments with the New Ballard Score (NBS), constitute reasonable measurements for gestational age when compared to ultrasound, and acceptable methods when assessing gestational age in low-resource settings [3,4,5,6,7]. Compared with first trimester ultrasound, LNMP and NBS have a mean bias of 0.2 and 2.8 days, respectively, with 95% limits of agreement of ± 26 days [2, 8]. LNMP shows high sensitivity (84.7%) and specificity (90.5%) for identifying preterm newborns (< 37 weeks) , while NBS has moderate sensitivity (64%) but high specificity (95%) . A modified birthweight-sensitive Ballard method (NBS + BW) seems to improve, in routine clinical practice, the assessment of gestational age and correct for errors caused by low birthweight .
This study compared three methods of GA determination (LNMP, NBS and NBS + BW) with the locally available standard (US measurement in the third trimester) in a low-resource setting, under real field conditions. The purpose was to identify the best method for GA determination in a low-resource setting.
Materials and methods
This study was carried out at the St. John of the Cross Hospital of Tosamaganga (Iringa, Tanzania), the only Comprehensive Emergency Obstetric and Newborn Care Center in Iringa Rural District. Designated as referral hospital of Iringa Rural District Council, it serves an estimated population of 265 000 inhabitants, handling approximately 2300 deliveries per year. The hospital has a total of 165 beds, 48 of which are in the maternity department, including 12 obstetrics, 18 in vaginal postpartum and 18 in CS postpartum. A labour room, a neonatal resuscitation room and a Neonatal Special Care Unit are also present .
All the mother-newborn pairs with complete data on the three different methods of determining GA were included in the study.
The agreement in GA estimation between different methods.
All data were retrospectively and anonymously collected from hospital charts and did not contain any information that might be used to identify individual patients. Maternal data included: age, weight, BMI, number of pregnancies, mode of delivery, GA by LNMP recall, GA by ultrasound measurement in the third trimester. Neonatal data included: sex, birth weight, APGAR score, GA by NBS and NBS + BW.
The GA refers to the duration of time between conception and delivery. The LNMP recall is the difference between the first day of the last menstrual period and the delivery date. A US is defined as of the third trimester when executed at 28 0/7 weeks of gestation and beyond . Late ultrasound GA determination was performed using the INTERGROWTH-21st project estimation method . The NBS consists in a procedure, performed postnatally up to 96 h after birth, that asses physical and neuromuscular maturity of the neonate to determine its gestational age . NBS + BW refers to the NBS adjusted considering birth weight in the score calculation .
The US measurement in the third trimester was separately compared with LNMP recall, NBS and NBS + BW.
The sample size calculation was based on information from available literature . Assuming a mean difference of 0 weeks with a standard deviation of 3 weeks, a minimum of 64 subjects were required to have an 80% chance of detecting, as significant at the 5% level, an agreement interval of 8 weeks in the Bland-Altman plot. The final sample size was rounded up to 70 subjects (reaching an estimated power of 85%). Sample size calculation was performed using R 4.1 (R Foundation for Statistical Computing, Vienna, Austria) .
Categorical variables were summarized as frequency and percentage. Continuous variables were summarized as mean and standard deviation (SD) or median and interquartile range (IQR). The agreement in GA estimation between different methods was assessed using Bland Altman plot (showing mean difference and 95% agreement limits). The correlation between continuous variables was assessed using Pearson correlation coefficient. Inter-rater reliability between the clinicians was evaluated using intra-class correlation coefficient (ICC) in a subsample of 30 newborns with double assessments. All tests were two-sided and a p-value less than 0.05 was considered statistically significant. Statistical analysis was performed using R 4.1 (R Foundation for Statistical Computing, Vienna, Austria) .
The analysis included 70 mother-newborn pairs. Maternal and neonatal characteristics are reported in Table 1. Data on GA by LNMP recall, GA by US, GA by NBS and GA by NBS + BW were available for all newborns (n = 70).
The agreement in GA estimation between LNMP and US is shown in Fig. 1. Mean difference between LNMP and US was 2.1 weeks (95% agreement limits − 3.5 to 7.7 weeks). There was a mild correlation between difference and average GA value (Pearson correlation coefficient 0.36, p = 0.002), which suggested an increasing overestimation of LNMP over US in late GAs. The difference between LNMP and US was not associated with maternal age (Pearson correlation coefficient 0.01, p = 0.98), maternal BMI (Pearson correlation coefficient − 0.24, p = 0.11) or number of pregnancies (Pearson correlation coefficient 0.01, p = 0.93).
The agreement in GA estimation between NBS and US is shown in Fig. 2. Mean difference between NBS and US was 0.2 weeks (95% agreement limits − 3.7 to 4.1 weeks), without any correlation between difference and average value (Pearson correlation coefficient − 0.11, p = 0.38).
The agreement in GA estimation between NBS + BW and US is shown in Fig. 3. Mean difference between NBS + BW and US was 1.2 weeks (95% agreement limits − 1.8 to 4.2 weeks), without any correlation between difference and average value (Pearson correlation coefficient − 0.15, p = 0.20).
In a subsample of 30 newborns, ICC showed good inter-rater reliability for NBS score (ICC = 0.99), NBS neuromuscular subscore (ICC = 0.96) and NBS physical subscore (ICC = 0.95).
In our low-income setting, the modified NBS (NBS + BW) was the less biased method for GA determination as compared to NBS alone and LNMP. In addition, the good inter-rater reliability of NBS suggested that it could be consistently used by the health care staff thank to the low subjectivity. On the other hand, our data indicated low agreement between the alternative methods (LMNP, NBS, NBS + BW) and the locally available standard (US measurement in the third trimester).
In low-income countries, the lack of accessible or accurate data on GA is a critical barrier to the correct management of high-risk pregnancies and preterm births. The limited availability of prenatal US suggested the opportunity of investigating alternative methods of GA determination in low-resource settings [2,3,4,5,6,7]. Unfortunately, we found low accuracy for some alternative methods (LMNP, NBS, NBS + BW) compared with the locally available standard. Overall, our deviations were in broad agreement with previous data reported in available literature [2, 8], thus supporting the unreliability of such alternative methods for GA determination. The deviations of the alternative methods indicated possible underestimation of GA up to 3.7 weeks and overestimation up to 7.7 weeks. As accurate determination of GA is crucial for prevention and anticipation of prematurity-related complications, such magnitude implies the impossibility of discriminating between term and preterm newborns (and among degrees of prematurity) by the health care provider. Given such results, a Reviewer suggested the intriguing idea of combining these methods to improve accuracy of estimated GA. However, NBS and NBS + BW could not be jointly used due to high multicollinearity (NBS + BW values are based on NBS values), hence leading to two options: (i) combining LMNP and NBS, or (ii) combining LMNP and NBS + BW. In both cases, the contribution of LMNP was negligible (data not shown), thus the estimated values were based on NBS or NBS + BW, respectively. Of note, we acknowledge that US measurement in the third trimester may represent a suboptimal reference standard, as dedicated literature suggests US measurement in the first trimester or accurate LMNP recall as the preferred reference standards for testing the validity of alternative methods of GA determination [2, 8]. However, the unavailability of such preferred reference standards forced the use of US measurement in the third trimester as the only viable option in our setting.
Recent evidence suggested that using US in second/third trimester with a novel parsimonious formula might narrow accuracy to ± 10.5 days (between 24 0/7 weeks and 29 6/7 weeks of gestation) and of ± 15.1 days (between 24 0/7 weeks and 29 6/7 weeks of gestation) . These results suggest that concentrating efforts and resources in further validating and implementing the use of late pregnancy biometry for gestational age assessment may be valuable in settings where the preferred reference standards are unavailable.
To our knowledge, this is the first study evaluating the accuracy of the modified birthweight-sensitive Ballard method (NBS + BW) elaborated by Feresu et al. , under real field conditions.
Our study has also some limitations that should be considered by the reader. First, US measurement in the third trimester may represent a suboptimal reference standard. Second, the retrospective nature of the study limited data availability. Third, the generalizability of the findings should be limited to similar settings. In addition, our study included few preterm newborns hence caution is suggested in the interpretation of our findings in such subpopulation.
In a low-income setting, NBS + BW was the least biased method for GA determination as compared with the locally available standard (US measurement in the third trimester). However, wide agreement bands suggested low accuracy for all three alternative methods (LNMP, NBS, NBS + BW. New evidence in the use of second/third trimester ultrasound suggests concentrating efforts and resources in further validating and implementing the use of late pregnancy biometry for gestational age dating in low and middle-income countries.
Availability of data and materials
The dataset analyzed during the current study is available from the corresponding author on reasonable request.
Last Normal Menstrual Period
New Ballard Score
- NBS + BW:
NBS + Birth Weight
Committee on Obstetric Practice, the American Institute of Ultrasound in Medicine, and the Society for Maternal-Fetal Medicine. Committee Opinion No 700: Methods for Estimating the Due Date. Obstet Gynecol. 2017;129(5):e150–4.
Macaulay S, Buchmann EJ, Dunger DB, Norris SA. Reliability and validity of last menstrual period for gestational age estimation in a low-to-middle-income setting. J Obstet Gynaecol Res. 2019;45(1):217–25.
Rosenberg RE, Ahmed AS, Ahmed S, Saha SK, Chowdhury MA, Black RE, et al. Determining gestational age in a low-resource setting: validity of last menstrual period. J Health Popul Nutr. 2009; 27 (3):332–8.
White LJ, Lee SJ, Stepniewska K, Simpson JA, Dwell SL, Arunjerdja R, et al. Estimation of gestational age from fundal height: a solution for resource-poor settings. J R Soc Interface. 2012; 9(68):503–10.
Ballard JL, Khoury JC, Wedig K, Wang L, Eilers-Walsman BL, Lipp R. New Ballard Score, expanded to include extremely premature infants. J Pediatr. 1991; 119(3):417–23.
Feresu SA. Does the modified Ballard method of assessing gestational age perform well in a Zimba- bwean population? Cent Afr J Med. 2003; 49(9–10):97–103.
Jehan I, Zaidi S, Rizvi S, Mobeen N, McClure EM, Munoz B, et al. Dating gestational age by last men-strual period, symphysis-fundal height, and ultrasound in urban Pakistan. Int J Gynaecol Obstet. 2010; 110(3):231–4.
Lee AC, Panchal P, Folger L, Whelan H, Whelan R, Rosner B, Blencowe H, Lawn JE. Diagnostic Accuracy of Neonatal Assessment for Gestational Age Determination: A Systematic Review. Pediatrics. 2017;140(6):e20171423.
Feresu SA, Gillespie BW, Sowers MF, Johnson TR, Welch K, Harlow SD. Improving the assessment of gestational age in a Zimbabwean population. Int J Gynaecol Obstet. 2002;78(1):7–18.
Tognon F, Borghero A, Putoto G, Maziku D, Torelli GF, Azzimonti G, Betran AP. Analysis of caesarean section and neonatal outcome using the Robson classification in a rural district hospital in Tanzania: an observational retrospective study. BMJ Open. 2019;9(12)
Papageorghiou AT, Kemp B, Stones W, Ohuma EO, Kennedy SH, Purwar M et al. Ultrasound based gestational age estimation in late pregnancy. Ultrasound Obstet Gynecol 2016. 48(6):719–26
Ballard JL, Khoury JC, Wedig K, Wang L, Eilers-Walsman BL, Lipp R (1991) New Ballard Score, expanded to include extremely premature infants. J Pediatr 119: 417–423.
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2021. URL https://www.R-project.org/
WHO Alliance for Maternal and Newborn Health Improvement Late Pregnancy Dating Study Group. Performance of late pregnancy biometry for gestational age dating in low-income and middle-income countries: a prospective, multicountry, population-based cohort study from the WHO Alliance for Maternal and Newborn Health Improvement (AMANHI) Study Group. Lancet Glob Health. 2020;8(4):e545–54.
We are very grateful to the nurses and medical staff of the St. John of the Cross, Tosamaganga Council Designated Hospital for the passion they show in their daily work.
This research received no external funding.
Ethics approval and consent to participate
All methods were performed in accordance with the relevant guidelines and regulations.
The study was approved by the Institutional Review Board of Tosamaganga Hospital (protocol number DOIRA/TCDH/VOL.016/5, date 08/09/2021), which waived the need for written informed consent given the retrospective nature of the study and the use of anonymized data from hospital records.
Consent for publication
NA (Not applicable).
The authors declare no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Pietravalle, A., Spolverato, S., Brasili, L. et al. Comparison of alternative gestational age assessment methods in a low resource setting: a retrospective study. BMC Pregnancy Childbirth 22, 585 (2022). https://doi.org/10.1186/s12884-022-04914-6
- Gestational age
- Last menstrual period
- Neonatal examination