- Research article
- Open Access
Reliability and validity of the Edinburgh Postnatal Depression Scale (EPDS) for detecting perinatal common mental disorders (PCMDs) among women in low-and lower-middle-income countries: a systematic review
BMC Pregnancy and Childbirth volume 16, Article number: 72 (2016)
The Edinburgh Postnatal Depression Scale (EPDS), originally developed in Britain, is one of the most widely used screening instruments for assessing symptoms of the Perinatal Common Mental Disorders (PCMDs) of depression and anxiety. However, its potential to detect PCMDs in culturally diverse low- and lower-middle income countries (LALMICs) is unclear. This systematic review aimed to appraise formally validated local language versions of the EPDS from these resource-constrained settings.
Following the PRISMA protocol, we searched MEDLINE-OVID, CINAHL-Plus and PUBMED to identify studies reporting translation, cultural adaptation and formal validation of the EPDS to detect PCMDs among women in LALMICs. The quality of the studies meeting inclusion criteria was assessed using standard criteria and a new process-based criteria; which was developed specifically for this study.
We identified 1281 records among which 16 met inclusion criteria; three further papers were identified by hand-searching reference lists. The publications reported findings from 12 LALMICs in14 native languages. Most of these local language versions of the EPDS (LLV-EPDS) had lower precision for identifying true cases of PCMDs among women in the general perinatal population compared to the original English version. Only one study met all criteria for culturally sensitive translation, the others had not established the comprehensibility of the local version amongst representative groups of women in pre-testing. Many studies tested the LLV-EPDS only amongst convenience samples recruited at single health facilities. Diagnostic interviews for confirmation of mental disorders could have been influenced by the mental health professionals’ lack of blinding to the initial screening results. Additionally, even when diagnostic-interviews were carried out in the local language, questions might not have been understood as most studies followed standard diagnostic protocol which had not been culturally adapted.
Most of the LLV-EPDS from non-English speaking low- and middle-income-countries did not meet all criteria for formal validation of a screening instrument. Psychometric properties of LLV-EPDS could be enhanced by adopting the new process-based criteria for translation, adaptation and validation.
There is growing recognition of mental health problems among women living in resource-constrained, World Bank defined low- and lower-middle-income countries (LALMICs) who are pregnant or have recently given birth. In order to address persistently high maternal and child morbidity and mortality and promote the survival, health and development of infants in these settings, national governments have expressed increasing interest in improving maternal mental health [1, 2]. To optimize detection of Perinatal Common Mental Disorders (PCMDs) among women in primary healthcare, locally adapted and validated screening instruments are needed.
The Edinburgh Postnatal Depression Scale (EPDS) is one of the most widely used screening instruments for assessing symptoms of perinatal depression and anxiety [3, 4]. It assesses emotional experiences over the past seven days using ten Likert-scale items (See Additional file 1). This self-reporting instrument was originally developed in the United Kingdom (U.K.) by Cox, Holden and Sagovsky in 1987 . Its use has now extended far beyond the U.K. to other high-income English-speaking and non-English speaking countries, and increasingly to non-Anglophone LALMICs. The popularity of this brief instrument reflects the original British validation study , in which nine out of ten women who were diagnosed by a psychiatrist as being depressed after giving birth were correctly identified in a blinded comparison with scores above a cut-off on the EPDS. The psychometric properties of the EPDS in primary health care were: 86 % sensitivity (correctly identifying true cases), 78 % specificity (correctly identifying people without the condition) and 73 % positive predictive value (proportion of respondents scoring positive in the test who had a mental disorder diagnosed by clinical interview) .
To improve early detection and treatment of PCMDs, the local language versions of the EPDS (LLV-EPDS) needs to accurately identify people with a PCMDs. However, it has been found that the LLV-EPDS had relatively lower discriminant validity for correctly identifying cases of PCMDs [3, 4] than the original English version . Many reasons have been proposed for why LLV-EPDS did not perform well. These include lack of local cultural sensitivity [3, 7–9] due to compromises made during translation and adaptation process  and recruitment of participants who did not represent general perinatal populations . Finally, the questions asked during diagnostic interviews (the standard comparator) might not have been meaningful or comprehensible in these local settings. Development of a LLV-EPDS with optimal psychometric properties is fundamental for identification of perinatal mental disorders, for assisting nations to assess the overall burden of PCMDs  and enabling aggregation of global prevalence data . However, there is no internationally approved, standard technique for translation and validation of the English EPDS into a non-English local language version appropriate for use in resource-constrained countries.
There are three systematic reviews on the validity of non-English versions of the EPDS. Two of these studies were focused on publications from high and middle-income countries, and included few studies from low-income countries [3, 4], while the other review included only data from African countries . There is no systematic review specific to low-and lower-middle-income countries (LALMICs) . The objectives of this review were: (1) to appraise systematically the formally validated LLV-EPDS from LALMICs, and (2) to establish potentially modifiable reasons for their lower validity by using new specific process-based criteria.
We used the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) protocol for identifying, screening and eligibility of studies  (Additional file 2). Three indexed electronic international databases (MEDLINE-OVID, CINAHL-Plus and PUBMED) were searched up to 20 April 2015, using the strategy described in Additional file 3.
There were four inclusion criteria: studies on translation and/or cultural adaptation and/or validation of the EPDS; that enrolled women who were pregnant and/or had recently given birth; which were conducted in World Bank defined LALMICs, and with reports published in English language, peer-reviewed journals.
Selection of studies
In addition to implementing the search strategy, the reference lists of articles meeting inclusion criteria were searched to identify any studies that had not been found. In order to obtain copies of studies published in non-indexed or local journals, we corresponded with authors via email, if we did not receive a response we wrote to the editor of the journal. By learning the journal was no more published, we sought a copy of the publications through interlibrary loan.
We used two approaches to assess the methodological quality of the selected publications. First, overall quality was assessed using the criteria recommended by Mirza and Jenkins . As recommended by Fisher et al.  we added a criterion about whether approval from a formally constituted ethics committee had been obtained. Thus, the 9 criteria were: (1) clear study aim; (2) sufficient sample size or justification; (3) representativeness of the sample or justification; (4) explicit inclusion and exclusion criteria; (5) response rate and explanation of losses; (6) clear description of data; (7) appropriate statistical analyses; (8) ethics approval; (9) and obtained informed consent. One point was given for meeting each of these points (1for Yes and 0 for No), to yield a maximum total possible score of 9.
The Mirza & Jenkins  and Fisher et al.  assessment scheme did not include specific criteria for assessing quality of a screening instrument like the EPDS. We assessed quality of the translation, cultural adaptation and local validation of the LLV-EPDS by developing a new set of process-based criteria (shown in Fig. 1, and defined in Additional file 4). We derived 33 criteria from diverse sources: which were recommended points for translation of other psychometric instruments [10, 16] and self-reporting questionnaires (SRQ) , measures used and suggested for translation and validation of the EPDS by earlier studies [3, 7, 11, 18–22]. Additionally, we incorporated some criteria based on our experience in international public health.
Collection of data and analysis
We extracted data from selected studies using data-extraction format (Table 1).
Adherence to the 33 process-based criteria were organised into two sections: Culturally Sensitive Translation and Empirical Validation using three response options: Yes/Not mentioned/Not needed (Tables 3 and 4). In line to aim of this study, data (evidence) about adherence to the process-based criteria summarised as narratives by three key aspects of LLV-EPDS development process: Culturally Sensitive Translation, Empirical Validation and Psychometric Properties (meta-analysis was not done as was beyond our study objectives). The data on process-based criteria and methodological quality were extracted by the first author and then rechecked by other authors; differences were resolved by consensus.
In total 1281 records were identified using the search strategy, after removal of duplicates and studies which did not meet inclusion criteria, including six articles published in languages other than English (French, Lithuanian, Polish, Turkish, Hebrew and German), but not conducted in LALMICs,19 studies, all quantitative, remained (Fig. 2 and Table 1).
Methodological assessment of these 19 selected publications indicated that five studies had enrolled sufficient participants to achieve adequate power. Overall, all had a clearly stated aim, 16 studies used clearly defined criteria for selection of the participants. Nine reported representativeness of samples with justification, 12 provided the recruitment rate, 14 publications included a synopsis of participants’ characteristics, 14 acquired ethical approvals, three studies described how participants’ consent had been acquired, but the remaining studies did not. Almost all studies had used appropriate statistical analyses for deriving psychometric properties (Table 2). All studies were included in this review despite of their modest quality. Since, this study aimed to identify reasons behind formally validated LLV-EPDS having lower validity than the original English version.
Of these 19 publications, two studies described translations of the original English EPDS into local languages, one entirely on the topic and another dealt briefly [19, 23]; 13 studies discussed both translation and psychometric properties of the LLV-EPDS; and four studies [24–27] focused on establishing psychometric properties for LLV-EPDS that were already translated. In total 17 studies described psychometric properties of the LLV-EPDS by recruiting a total of 4029 women (sample sizes ranged from 100 to 601) from 12 LALMICs [24–40].
Culturally Sensitive Translation
We found 15 publications reporting translations and adaptation of 14 LLV-EPDS from 12 countries; there were two studies from each of three countries. India [28, 29] and Nigeria [30, 31] each had two native language versions of the EPDS. Both studies from Ethiopia [33, 35] were related to the development of an Amharic version of the EPDS.
In these 15 studies there was variable application of the six key steps we proposed for the culturally sensitive translation of the EPDS: forward translation, backward translation, resolution of difficulties and differences in translations by committee approach, pretesting, amendments and test of conceptual and operational equivalence (Table 3). Seven studies described the translation process and reported forward and backward translation by native speakers who were fluent in English as well. There was a predominance of health professionals in translation panels, which typically had 1–4 members. Two forward-translation panels [19, 32] had included professional translators. In eight studies there were separate panels [19, 24, 29–34] for forward and backward translations. Only one study reported review of the back-translated version by native English speakers .
Nine studies resolved differences in translation by convening a large group discussion; however in most of these studies (8/9), such discussions were held amongst diverse health professionals and in two studies professional translators were also included [29, 30]. Only one study used a committee approach, which included lay-people in addition to multi-disciplinary health professionals, and there were a number of meetings during the translation process .
Nine studies reported pre-testing the preliminary versions of the LLV-EPDS, however only four of these studies (involving the Amharic version from Ethiopia, along with the Bangla and Vietnamese versions) [19, 23, 33, 35] probed understanding and comprehension among women who had recently given birth. One type of amendment in these three LLV-EPDS translations was in the format of the instrument. For example, in the Bangla version to enable administration by an interviewer for respondents who were illiterate, women were addressed in the second person (‘you’). To reduce repetition, the short reply statements were replaced by numerals . In the Amharic version, all ten items were changed into a question. The mode of response was changed into two stages: first by asking a fixed choice Yes/No question, then in the next step probing the frequency and severity of the reported symptoms. Additionally, the short reply statements were rephrased for clarity. Still more, to remind study participants about the timeframe the phrase “last week” was added at the end of the each item .
The other type of amendment aimed to establish semantic equivalence of the LLV-EPDS in the local context. Altogether four studies [19, 23, 33, 35] altered 8/10 EPDS items in this manner. Among three LLV-EPDS (Amharic, Bangla and Vietnamese) versions, we found an overlap among items modified in the different versions, but there were also items that were modified in one study but not in others. In a study conducted in Ethiopia, Hanlon et al.  modified six items in the Amharic version (1–5 & 9), and to ease understanding by rural women including illustrative examples for items 1–3. Despite these modifications, in a re-validation of this version with urban-dwelling women who had recently given birth, Tesfaye et al.  found that items 1 and 2 were still not understood and that provision of examples did not help respondents to understand these question. Connotative translations using local expression were reported for item 6 for the Vietnamese  and the Bangla  versions. Similarly, item 10 (relating to suicidal ideation) was rephrased in the Vietnamese version as the content was ambiguous in translation . In Ethiopia mixed responses were found; women in rural areas were open to the question about ideas of self harm , while women in urban areas without this symptom were embarrassed to be asked it, but women who reported had suicidal ideas reported relief at the interviewer asking this question .
The tests of conceptual and operational equivalence of these LLV-EPDS with the original English version was performed only for the Bangla version EPDS. The test showed a higher conceptual equivalence, with the original English EPDS (correlation coefficient 0.981; p < 0.01). The operational equivalence test, which is to investigate whether the local version can achieve similar outcomes when administered by self-report as by an interviewer was slightly lower (correlation coefficient 0.752; p = 0.01) .
There were 17 studies on psychometric validations of 14 LLV-EPDS from 12 countries; due to inclusion of two studies from each of five countries: Ethiopia [33, 35], India [28, 29], Nepal [24, 40], Nigeria [30, 31] and Pakistan [26, 39] (Table 4).
Generation of data on performance of these 14 LLV-EPDS was mainly carried out by recruiting study participants from ante- or postnatal or immunisation clinics (13/17 studies) [24, 25, 28–32, 34, 36–38, 40]. More than half of the studies (10/17) recruited women who had recently given birth (0 to 36 months ago) [24–26, 28, 30, 33, 35, 36, 39, 40]; while 4/17 studies included women who were currently pregnant [29, 31, 34, 38]; there were both pregnant women and those who had recently given birth in two studies [27, 37] and in one study all participants were women of reproductive age (18–40 years) . Of the 11 publications [24, 25, 27–29, 31, 32, 34–36, 39] that mentioned participants’ literacy status, almost all had a predominance of literate participants (67 to 89 %), except in a rural community-based investigation from Pakistan, in which only a quarter of women could read . In 10/17 studies, interviewers administered the LLV-EPDS [25, 26, 28, 29, 31, 33, 35, 36, 38, 39].
Diagnostic interviews were carried out to establish clinical cut-off points. Of the 17 studies, in 11studies all participants who were screened using LLV-EPDS were also recruited for diagnostic interviews. In three studies, all those scoring higher than certain scores (≥6, ≥9 and ≥13) and a randomly selected sample of woman who scored below the cut-offs were included. In the remaining two studies, about half the participants were included (Table 4).
Psychiatrists or psychologists carried out the diagnostic interviews in 12/17 studies, but in eight of these publications, it is not clear if they were blinded to the initial screening results [24, 28–30, 32, 33, 37, 40]. In addition, in eight studies [24, 28–31, 33, 37, 40] it is not clear whether the diagnostic interviews were carried out on the same day as the screening. Furthermore, although interviews were conducted in the local language, the standard diagnostic protocol (SDP) were translated into respective local languages in only three studies [29, 32, 39], while only two of the SDP had been culturally adapted [32, 39]. In 6/17 studies, there was more than one diagnostic interviewer; but only one of these investigated inter-rater reliability and reported excellent reliability between psychiatrists (kappa = 0.82) .
The 17 studies of the 14 LLV-EPDS revealed wide variation in their psychometric properties. The range of cut-off scores selected for detecting any common mental disorder was 3/4 to 11/12; and where specific conditions were listed, the range of cut-off scores was 3/4  to 13/14  for depression (mild and moderate) and 4/5 to 12/13 for severe depression . The scores for sensitivity (identifying true cases) ranged from 69.7 %  up to 100 % [29, 40]. The scores for specificity (identifying those without PCMDs) ranged from 36.1 %  up to 97 % . The positive predictive value (PPV, which is the proportion of respondents who scoring positive in the screening, who were confirmed to have a common mental disorder by clinical interview) ranged from 22 %  up to 82 % ; the negative predictive value (NPV, which is the proportion of the respondents scoring negative in the screening who were confirmed as having no mental disorders) ranged from 70 %  to 100 % . The range of cut-off points for detecting depression among pregnant women was slightly lower 4/5  to 12/13  than among women who had recently given birth 5/6  to 13/14 .
To our knowledge, this systematic review is the first assessment of EPDS versions translated and adapted for use in low- and lower-middle-income countries (LALMICs). Additionally, this is the first study to use a new set of process-based criteria (Fig. 1) to assess the reliability and validity of LLV-EPDS comprehensively. We acknowledge the possible limitation of this review, that some studies on LLV-EPDS may have been missed, if they had been published in languages other than English in non-indexed journals. However, we think this is unlikely, because we followed rigorous strategies to identify pertinent studies published in non-indexed journals.
We found that of the 82 countries classified by the World Bank (in 2015) as LALMICs, the Edinburgh Postnatal Depression Scale (EPDS) had been formally validated in 12 (14.6 %) countries in 14 native languages. We found psychometric properties of these 14 LLV-EPDS from low-and middle-income countries were lower than that for the original English EPDS . Our finding is consistent with findings from earlier three systematic reviews that included studies from high-, middle- and low-income countries [3, 4] and African countries .
A central finding of our systematic review is that utility of these 14 local versions EPDS for screening PCMDs is questionable, as none met the recommended validation standard of ≥80 % in the three key parameters: sensitivity, specificity and positive predictive value . The process-based appraisal indicated that lower psychometric property of these LLV-EPDS might be related to compromises made during translation, cultural adaptation and empirical validation (Tables 3 and 4). The psychometric properties of the LLV-EPDS were found to be better when the process-based criteria were followed. In Ethiopia, Tesfaye et al.  achieved an almost two fold improvement in specificity (75.3 % vs. 36.1 %), by inclusion of more suitable local expressions than in the earlier Amharic version EPDS . The Bangla EPDS, was the only one which met all steps and criteria for culturally sensitive translation, cultural adaptation and empirical validation and was the one study that demonstrated high sensitivity (88.9 %) and specificity (86.8 %)  (Table 4).
Sub–optimal sensitivity and specificity in these LLV-EPDS might also have been attributable to the recruitment of women who did not represent the wider perinatal population during empirical validation (Fig. 1, criteria: 15–21). For instance, more than half of the studies (10/17) recruited participants from health facilities, most usually a single, urban health institution, using convenience sampling. This is especially problematic in settings where many women give birth in primary care facilities or at home. Selection bias is apparent as there was a predominance of well educated women in these study samples. While they might understand direct, literally translated LLV-EPDS, it is much less likely that women of low literacy or education will understand them. The link between non-random selection of participants from health institutions and a poorly translated LLV-EPDS is shown by the study from Nigeria. In that country, while about half of the female population (50.6 %) are illiterate , more than two thirds of the study participants who were recruited from antenatal clinics were highly educated (bankers, teachers/lecturers, big business owners and civil servants). In this study, the LLV-EPDS had high sensitivity (86.7 %) and specificity (91.5 %), even though it didn’t meet most of our recommended steps for culturally sensitive translation processes . This suggests that these highly educated participants having greater emotional literacy [3, 23] and familiarity with test-taking , but does not provide evidence that this LLV-EPDS will be useful for the majority who have not had opportunities for education and social participation.
Further areas of inconsistency and suboptimal practice appeared to have occurred during the process of formal validation against a diagnostic interview (Fig. 1, criteria: 22–28). In 11/17 studies the interviews were carried out by psychiatrists or psychologists. However, in 8 of these studies, diagnosis might have been influenced by changes in the interviewees’ psychological state as the screening by LLV-EPDS and diagnostic interviews were not conducted on the same day [24, 28, 30, 31, 33, 34, 37, 40]. Further, the diagnosis might have been biased by the psychiatrists or psychologists being aware of the screening results, as only 8 studies reported that they were blinded to the initial screening results. Moreover, even though interviews were carried out in local languages, the accuracy of both screening and diagnosis may be influenced by participants’ limited understanding of English colloquialisms. Only four studies made amendments to the LLV-EPDS so that it was suitable for administration to participants with low literacy, and tried to attain semantic equivalence to the local context by using local expressions [19, 23, 33, 35]. For instance, the statement in the original EPDS “Life is getting on top of me” is intended to detect an experience in which a woman feels that the demands imposed on her, exceed her capacity to manage them. However, it was interpreted by some women in Vietnam as meaning literally that things were being placed on top of them as might occur during a flood or other natural disaster . Although all diagnostic interviews were presumed to be conducted in local languages, only two reported that standardised diagnostic protocols had been culturally adapted. It is possible that in the other studies participants might have not understood and or mis-understood questions asked by clinicians [32, 39].
Conceptual equivalence was generally not established between the LLV-EPDS and the original English EPDS . In half of these studies, the back-translations to English might have been influenced by knowledge of the original version, as both forward and back-translation were carried out by the same panels. Additionally, the back-translated English version was reviewed by native-English-speakers in only one study . Conceptual and operational disparities between the original English version and the LLV-EPDS were not investigated and tests of equivalence were not performed.
There is considerable variation in cut-off points to detect clinically significant symptoms among the 14 LLV-EPDS from non-English speaking low- and lower-middle-income countries. In general lower cut-off scores compared to the English version were found. This probably reflects differences in cultural norms about emotional expression and emotional literacy [7, 9, 23]. The EPDS, developed in Britain, reflects the psychiatric paradigm that experiences of low mood are episodic and represent change from a usual state. It is inaccurate to presume that this is a universal situation. In resource constrained settings, where many women experience chronic social and economic adversity it is probable that they might not experience change from a usual state, but rather could be chronically distressed, so answers to questions about change would be negative . There is also potential confusion, and perhaps linguistic limitations which mean that subtle emotional distinctions, for example being anxious or being scared might not be available or in widespread use and therefore lead to responses that do not reflect reality.
It would appear overall that formal validation of the EPDS following proposed process-based criteria is more likely to derive precise cut-off points appropriate to the local setting and /or population. Having a LLV-EPDS with an imprecise cut-off point has potentially serious implications. On one hand, an inaccurately high cut-off point imported from a high-income Anglophone setting might lead to under-detection of women with PCMDs. This means women’s needs might go unrecognised and unassisted and lead to under-estimation of PCMDs burden for a particular country or population. On the other hand, if a cut-off point is too low, women might be classified as having a clinically significant condition of mental disorders which may lead to unnecessary treatment and potentially, stigma and discrimination, in particular in societies where experiences of human suffering are poorly understood. Inaccurate classification of ‘cases’ may further strain the already over-burdened health systems of low- and lower-middle-income countries .
It is commendable that researchers and clinicians in several resource-constrained countries have made great efforts to improve early detection and timely management of PCMDs. However, this review indicated that currently available local language versions of the Edinburgh Postnatal Depression scales (LLV-EPDS) from low-and lower- middle income countries are of some value, but most of them had deficiencies in translation, cultural adaptation and validation processes. Screening instruments with poor psychometric properties might have far-reaching implications for clinical practice, public policy and research. We recommend a systematic approach to the translation, cultural adaptation and empirical validation of local language versions of the EPDS that adheres to the steps outlined in Fig. 3. This approach will facilitate the development of more precise and validated screening tools for detection and management of PCMDs among women in resource constrained settings.
Fisher J, Cabral de Mello M, Patel V, Rahman A, Tran T, Holton S, Holmes W. Prevalence and determinants of common perinatal mental disorders in women in low-and lower-middle-income countries: A systematic review. Bull World Health Organ. 2012;90(2):139–49.
Rahman A, Fisher J, Bower P, Luchters S, Tran T, Yasamy MT, Saxena S, Waheed W. Interventions for common perinatal mental disorders in women in low-and middle-income countries: A systematic review and meta-analysis. Bull World Health Organ. 2013;91(8):593–601I.
Gibson J, McKenzie MK, Shakespeare J, Price J, Gray R. A systematic review of studies validating the Edinburgh Postnatal Depression Scale in antepartum and postpartum women. Acta Psychiatr Scand. 2009;119(5):350–64.
Kozinszky Z, Dudas RB. Validation studies of the Edinburgh Postnatal Depression Scale for the antenatal period. J Affect Disord. 2015;176:95–105.
Cox J, Holden J. Perinatal Psychiatry: Use and misuse of the Edinburgh Postnatal Depression Scale. Britain: The Royal College of Psychiatrists; 1994.
Cox JL, Holden JM, Sagovsky R. Detection of postnatal depression. Development of the 10-item Edinburgh Postnatal Depression Scale. Br J Psychiatry. 1987;150(6):782–6.
Laungani P. Postnatal depression across cultures: conceptual and methodological considerations. Int J Health Promot Educ. 2000;38(3):86–94.
Shrestha SD. Quality of life of HIV-positive people undertaking antiretroviral therapy in Nepal: A mixed method study. Melbourne: La Trobe University; 2013.
Patel V, Abas M, Broadhead J, Todd C, Reeler A. Depression in developing countries: lessons from Zimbabwe. Br Med J. 2001;322(7284):482–4.
Wild D, Grove A, Martin M, Eremenco S, McElroy S, Verjee‐Lorenz A, Erikson P. Principles of Good Practice for the Translation and Cultural Adaptation Process for Patient‐Reported Outcomes (PRO) Measures: Report of the ISPOR Task Force for Translation and Cultural Adaptation. Value Health. 2005;8(2):94–104.
Matthey S, Henshaw C, Elliott S, Barnett B. Variability in use of cut-off scores and formats on the Edinburgh Postnatal Depression Scale–implications for clinical and research practice. Arch Womens Ment Health. 2006;9(6):309–15.
Tsai AC, Scott JA, Hung KJ, Zhu JQ, Matthews LT, Psaros C, Tomlinson M. Reliability and validity of instruments for assessing perinatal depression in African settings: Systematic review and meta-analysis. PLoS One. 2013;8(12):e82521.
Country and Lending Groups. [http://data.worldbank.org/about/country-and-lending-groups]. Accessed April 2015.
Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systmatic reviews and meta-analyses: The PRISMA statement. Br Med J. 2009;339:2535.
Mirza I, Jenkins R. Risk factors, prevalence, and treatment of anxiety and depressive disorders in Pakistan: systematic review. Br Med J. 2004;328(7443):794.
Bowden A, Fox-Rushby JA. A systematic and critical review of the process of translation and adaptation of generic health-related quality of life measures in Africa, Asia, Eastern Europe, the Middle East. South Am Soc Sci Med. 2003;57(7):1289–306.
Beusenberg M. A User’s guide to the Self reporting questionnaire (SRQ). Geneva: Division of Mental Health, World Health Organization; 1994.
Beck CT, Bernal H, Froman RD. Methods to document semantic equivalence of a translated scale. Res Nurs Health. 2003;26(1):64–73.
Gausia K, Hamadani JD, Islam MM, Ali M, Algin S, Yunus M, Fisher C, Oosthuizen J. Bangla translation, adaptation and piloting of Edinburgh Postnatal Depression Scale. Bangladesh Med Res Counc Bull. 2007;33(3):81–7.
Department of Health Government of Western Australia. Using the Edinburgh Postnatal epression scale (EPDS): Translated versions - validated. Perth: State Perinatal mental Health Reference Group; 2006.
Brislin RW. Back-translation for cross-cultural research. J Cross-Cultural Psychol. 1970;1(3):185–216.
Hanlon C, Medhin G, Alem A, Araya M, Abdulahi A, Tesfaye M, Wondimagegn D,Taha H, Anbesse B, Baheretibeb Y. Measuring common mental disorders in women in Ethiopia. Soc Psychiatry Psychiatr Epidemiol. 2008;43(8):653–9.
Fisher J, Morrow M, Nhu NN, Hoang AL. Prevalence, nature, severity and correlates of postpartum depressive symptoms in Vietnam. Int J Obstetr Gynaecol. 2004;111(12):1353–60.
Nepal MK, Sharma VD, Koirala NR, Regmi SK, Khalid A, Shresta P. Validation of the Nepalese version of Edinburg Postnatal Depression Scale in the tertiary health care facilities in Nepal. Nepalese J Psychiatry. 1999;1:34–40.
Gausia K, Fisher C, Algin S, Oosthuizen J. Validation of the Bangla version of the Edinburgh Postnatal Depression Scale for a Bangladeshi sample. J Reprod Infant Psychol. 2007;25(4):308–15.
Husain N, Kiran T, Sumra A, Zafar SN, Rahman RU, Jafri F, Ansari S, Husain M, Adelekan ML, Chaudhry IB. Detecting maternal depression in a low-income country: Comparison of the Self-Reporting Questionnaire and the Edinburgh Postnatal Depression Scale. J Trop Pediatr. 2013;60(2):129–33.
Tran TD, Tran T, La B, Lee D, Rosenthal D, Fisher J. Screening for perinatal common mental disorders in women in the north of Vietnam: A comparison of three psychometric instruments. J Affect Disord. 2011;133(1):281–93.
Patel V, Rodrigues M, DeSouza N. Gender, poverty, and postnatal depression: A study of mothers in Goa, India. Am J Psychiatry. 2002;159(1):43–7.
Fernandes M, Srinivasan C, Krishnamachari S, Stein A, Menezes G, Sumithra R, Ramchandani P. Assessing prenatal depression in the rural developing world: A comparison of two screening measures. Arch Womens Ment Health. 2011;14(3):209–16.
Uwakwe R. Affective (depressive) morbidity in puerperal Nigerian women: Validation of the Edinburgh Postnatal Depression Scale. Acta Psychiatr Scand. 2003;107(4):251–9.
Adewuya AO, Ola BA, Dada AO, Fasoto OO. Validation of the Edinburgh Postnatal Depression Scale as a screening tool for depression in late pregnancy among Nigerian women. J Psychosom Obstetr Gynecol. 2006;27(4):267–72.
Pollock JI, Manaseki-Holland S, Patel V. Detection of depression in women of child-bearing age in non-western cultures: A comparison of the Edinburgh Postnatal Depression Scale and the Self-Reporting Questionnaire-20 in Mongolia. J Affect Disord. 2006;92(2–3):267–71.
Hanlon C, Medhin G, Alem A, Araya M, Abdulahi A, Hughes M, Tesfaye M, Wondimagegn D, Patel V, Prince M. Detecting perinatal common mental disorders in Ethiopia: validation of the self-reporting questionnaire and Edinburgh Postnatal Depression Scale. J Affect Disord. 2008;108(3):251–62.
Stewart RC, Umar E, Tomenson B, Creed F. Validation of screening tools for antenatal depression in Malawi—A comparison of the Edinburgh Postnatal Depression Scale and Self Reporting Questionnaire. J Affect Disord. 2013;150(3):1041–7.
Tesfaye M, Hanlon C, Wondimagegn D, Alem A. Detecting postnatal common mental disorders in Addis Ababa, Ethiopia: Validation of the Edinburgh postnatal depression scale and kessler scales. J Affect Disord. 2010;122(1):102–8.
Chibanda D, Mangezi W, Tshimanga M, Woelk G, Rusakaniko P, Stranix-Chibanda L, Midzi S, Maldonado Y, Shetty AK. Validation of the Edinburgh Postnatal Depression Scale among women in a high HIV prevalence area in urban Zimbabwe. Arch Womens Ment Health. 2010;13(3):201–6.
Rowel D, Jayawardena P, Fernando N. Validation of the Sinhala translation of Edinburgh Postnatal Depression Scale. Ceylon Med J. 2008;53(1):10–3.
Weobong B, Akpalu B, Doku V, Owusu-Agyei S, Hurt L, Kirkwood B, Prince M. The comparative validity of screening scales for postnatal common mental disorder in Kintampo, Ghana. J Affect Disord. 2009;113(1):109–17.
Rahman A, Iqbal Z, Lovel H, Shah MA. Screening for Postnatal depression in the Developing world: A comparision of the WHO self reporting questionnaire (SRQ) and the Edinburg postnatal depression screen (EPDS). J Pakistan Psychiatr Soc. 2005;2(2):69.
Regmi S, Sligl W, Carter D, Grut W, Seear M. A controlled study of postpartum depression among Nepalese women: Validation of the Edinburgh Postpartum Depression Scale in Kathmandu. Trop Med Int Health. 2002;7(4):378–82.
Glover TA, Albers CA. Considerations for evaluating universal screening assessments. J Sch Psychol. 2007;45(2):117–35.
High level International round table on literacy “Reaching the 2015 Literacy Target: Delivering on the promise” UNESCO, Paris, 6–7 September 2012: Action Plan
Small R. An Australian Study of Vietnamese, Turkish and Filipino Women’s Experiences of Maternity Care and of Maternal Depression After Childbirth [Doctor of Philosophy]. Melbourne: La Trobe University, 2000.
The first author (SDS) acknowledges extension of her Adjunct Research Fellowship (after her injury in 2013) by Professor JF. She expresses her gratitude for the continuous encouragement and support provided by JF along with the Jean Hailes Research Unit towards regaining her academic ability, which made it possible to complete this manuscript.
SDS is an Adjunct Research Fellow in the Jean Hailes Research Unit, Monash University School of Public Health and Preventive Medicine. JF is supported by a Monash University Professional fellowship and the Jean Hailes Professional Fellowship, which receives funding from the L and H Hecht trust which is administered by Perpetual Trustees Pty Ltd. TDT is supported by a Bridging Postdoctoral Research Fellowship from Monash University and an Early Career Fellowship from the Australian National Health and Medical Research Council.
The authors declare that they have no competing interests.
SDS and JF designed the study. SDS conducted the systematic search, developed the novel process-based criteria and undertook the quality appraisal. JF and RG independently undertook the quality appraisal and analysis. SDS wrote the first draft of the manuscript. All authors contributed to revisions, read and approved the final manuscript.
PRISMA (2009) checklist for studies included for this systematic review. (DOCX 25 kb)
Literature search strategy for systematic review on the reliability and validity of the EPDS in low and lower-middle income countries (LALMICs). (DOCX 14 kb)
Definitions of the Process-based criteria. (DOCX 20 kb)
About this article
Cite this article
Shrestha, S.D., Pradhan, R., Tran, T.D. et al. Reliability and validity of the Edinburgh Postnatal Depression Scale (EPDS) for detecting perinatal common mental disorders (PCMDs) among women in low-and lower-middle-income countries: a systematic review. BMC Pregnancy Childbirth 16, 72 (2016). https://doi.org/10.1186/s12884-016-0859-2
- Edinburgh postnatal depression scale (EPDS)
- Cultural equivalence
- Local language versions of the EPDS (LLV-EPDS)
- Perinatal common mental disorders (PCMDs)
- Low- and-lower-middle-income countries (LALMICs)