Medication use during pregnancy, gestational age and date of delivery: agreement between maternal self-reports and health database information in a cohort

Background Health databases are a promising resource for epidemiological studies on medications safety during pregnancy. The reliability of information on medications exposure and pregnancy timing is a key methodological issue. This study (a) compared maternal self-reports and database information on medication use, gestational age, date of delivery; (b) quantified the degree of agreement between sources; (c) assessed predictors of agreement. Methods Pregnant women recruited in a prenatal clinic in Friuli Venezia Giulia (FVG) region, Italy, from 2007 to 2009, completed a questionnaire inquiring on medication use during pregnancy, gestational age and date of delivery. Redeemed prescriptions and birth certificate records were extracted from regional databases through record linkage. Percent agreement, Kappa coefficient, prevalence and bias-adjusted Kappa (PABAK) were calculated. Odds Ratio (OR), with 95 % confidence interval (95 % CI), of ≥1 agreement was calculated through unconditional logistic regression. Results The cohort included 767 women, 39.8 % reported medication use, and 70.5 % were dispensed at least one medication. Kappa and PABAK indicated almost perfect to substantial agreement for antihypertensive medications (Kappa 0.86, PABAK 0.99), thyroid hormones (0.88, 0.98), antiepileptic medications (1.00, 1.00), antithrombotic agents (0.70, 0.96). PABAK value was greater than Kappa for medications such as insulin (Kappa 0.50, PABAK 0.99), antihistamines for systemic use (0.50, 0.99), progestogens (0.28, 0.79), and antibiotics (0.12, 0.63). Adjusted OR was 0.48 (95 % CI 0.26; 0.90) in ex- vs. never smokers, 0.64 (0.38; 1.08) in < high school vs. university, 1.55 (1.01; 2.37) in women with comorbidities, 2.25 (1.19; 4.26) in those aged 40+ vs. 30–34 years. Gestational age matched exactly in 85.2 % and date of delivery in 99.5 %. Conclusions For selected medications used for chronic conditions, the agreement between self-reports and dispensing data was high. For medications with low to very low prevalence of use, PABAK provides a more reliable measure of agreement. Maternal reports and dispensing data are complementary to each other to increase the reliability of information on the use of medications during pregnancy. Birth certificates provide reliable data on the timing of pregnancy. FVG health databases are a valuable source of data for pregnancy research. Electronic supplementary material The online version of this article (doi:10.1186/s12884-015-0745-3) contains supplementary material, which is available to authorized users.

(Continued on next page) (Continued from previous page) Conclusions: For selected medications used for chronic conditions, the agreement between self-reports and dispensing data was high. For medications with low to very low prevalence of use, PABAK provides a more reliable measure of agreement. Maternal reports and dispensing data are complementary to each other to increase the reliability of information on the use of medications during pregnancy. Birth certificates provide reliable data on the timing of pregnancy. FVG health databases are a valuable source of data for pregnancy research.
Keywords: Pregnancy, Medication use, Health database, Dispensing claims, Birth certificate, Agreement, Kappa, Prevalence and bias-adjusted Kappa, Pharmacoepidemiology, Questionnaires Background Maternal use of prescription medications during pregnancy is common, with prevalence ranging from 27 to 99 % in developed countries [1]. In Italy, a prevalence of about 50 % has been reported [2].
Pregnant women are generally not included in preauthorization studies, thus the risk-benefit profile of medicines used in pregnancy is assessed mostly through post-authorization studies. The assessment of the association between maternal use of medications during pregnancy and pregnancy or infant outcomes often rely on pregnancy medication exposure registries [3,4] and on studies using administrative databases [5][6][7], registering prescriptions at the general physician prescription level or at pharmacy dispensing level.
Pregnancy registries can provide timely ascertainment of exposure and outcomes, and good quality information on their temporal association when data are collected prospectively. Limitations include: (a) potential for selection bias, as registration is spontaneous, (b) insufficient power for some outcomes, (c) problems in identifying an appropriate comparison group, (d) quality and completeness of information depends on healthcare providers and/or maternal reporting [8].
Administrative databases represent an efficient and cost-effective source of data on large populations, and they allow researchers to identify exposure, regardless of information on outcome [9]. However their use in assessing medication exposure has some limitations. In particular, prescription filling or redemption is a proxy for medication consumption. Noncompliance and medication borrowing or sharing [10] may lead to overestimation of use and exposure misclassification. It has been estimated that 6 % of dispensed medications were not used [11]. Moreover, information on the use of nonprescription and over-the-counter (OTC) medications, herbal preparations and medications taken in the hospital, is not captured.
Other approaches include case-control studies/surveillance, and cohort studies. In ad hoc studies, maternal self-reports have often been used to measure medication use in pregnancy. Inaccurate recall, susceptibility to bias and under-reporting are among the limitations of this tool. The accuracy of reporting has been shown to vary by therapeutic class [12], type of use (chronic vs. occasional) [13] and to depend on data collection methods and questionnaire design [14][15][16].
Due to the limitations of maternal self-reports and prescription databases, neither of these sources can be considered the 'gold standard' to assess the use of medications.
Few studies have been conducted in pregnant women, comparing maternal reports of medication use during pregnancy and database information [12,14,[27][28][29][30]. In general, the results showed that medications taken for long courses or chronically, such as antidiabetic agents, medications for thyroid conditions and for asthma, antiepileptics and antihypertensives, had generally higher agreement than medications taken occasionally.
Another key methodological issue is the accuracy of information on the use of medications during pregnancy and on pregnancy timing. The latter is needed to assign etiologically relevant 'time windows' of exposure to medication at the exact gestational age.
In a cohort of 767 women, resident of Friuli Venezia Giulia (FVG) region, Northeast Italy, and recruited from 2007 to 2009 at the first visit in a prenatal clinic, we compared (a) self-reported information on medication use during pregnancy with data from the regional outpatient dispensing database; and (b) self-reported information on gestational age at birth and on date of delivery with data from the birth certificate database. Moreover, we assessed the effect of women characteristics on the likelihood of agreement.

Data sources
The sources of data were selected FVG health databases, recording computerized information on the use of health services for the residents of the region. All residents are registered with the Regional Health System, providing universal access to health care. A unique personal identifier links anonymized individual records. For this study, the outpatient dispensing and birth certificate databases were used.
The database used in this study records prescriptions at pharmacy redemption level. The database captures all redeemed prescriptions for reimbursed medications dispensed to residents of the region. Prescription medications are reimbursed to residents, including pregnant women.
For each redeemed prescription, the following information is recorded: date of redemption, active substance (description and Anatomical Therapeutic and Chemical ATC classification code [31]), brand, quantity, strength, dispensed form, number of units and number of refills. Information on the indication and the prescribed dosage regimen are not recorded.
The birth certificate database records data on all births in FVG since 1989. For each birth, the information recorded includes: gestational age at the first prenatal visit, at the first ultrasound examination and at delivery, date of delivery, number of prenatal visits and ultrasound examinations, gestational hypertension.
The Direzione Centrale Salute, Integrazione Socio Sanitaria e Politiche Sociali, Regione Friuli Venezia Giulia granted permission to access all above mentioned anonymized databases.

Study cohort
Pregnant women attending their first prenatal visit (between 20 and 22 weeks of gestation) at the Institute for Maternal and Child Health -IRCCS "Burlo Garofolo", in Trieste, FVG, from April 3, 2007 to March 3, 2009 were eligible to be included in this prospective cohort. Eligible women had to be resident in FVG for at least 2 years, in order to be covered by the regional health databases for a period of time before pregnancy, as another objective of this study was to assess the effect of maternal medication and behavioral exposures before pregnancy on the health of the mother and child. Moreover, women had to be fluent in Italian and at least 18 years old. Women with complicated or twin pregnancies were excluded.
Complicated pregnancies were defined as those with maternal abnormalities of the reproductive tract, uterine fibroids, pre-existing chronic illness such as cancer, AIDS, severe heart disease, severe kidney disease, severe Crohn's disease or ulcerative colitis, and those with foetal congenital defects. A complicated pregnancy was determined at the time of recruitment. According to protocol, when a complication emerged in a prenatal examination (e.g. a prenatal tests indicated that the foetus had congenital defects), the woman was excluded from the study. However, no women were excluded for a complicated pregnancy, or any other reason, after recruitment. All eligible women recruited in the study were included in statistical analysis.
During the recruitment period, about 1800 live births per year were recorded in Trieste and 9000 in FVG [32].

Data collection
Women who agreed to participate filled in a selfadministered questionnaire between the 28th week of estimated gestational age and 1 month after delivery. The questionnaire inquired on the use of medications during the pregnancy. Women answering 'Yes' to the question 'Have you ever taken medicationson a regular basisduring pregnancy?' were asked to indicate the brand name and/or the name of the active substance and the indication. ('Which medications have you used during pregnancy? Please list the commercial name of each medication, active substance, if known, and its indication'). In the instructions for completing the questions, 'regular basis' was defined as 'the assumption of a medication for 4 or more times per week or for more than two weeks'.
Data on brand name, active substance and indication of up to six medications were collected. The questions were open-ended.
The questionnaire collected also information on women social and demographic characteristics (country of origin, age, level of education, marital status and profession), health behaviours and conditions (smoking, comorbidities before or during pregnancy, such as diabetes, asthma, allergy, epilepsy, hypertension, vomit, hypothyroidism, hyperthyroidism, lupus, rheumatic diseases, urinary infections, infections, fever, seizures, anemia, cardiovascular diseases, neurological diseases), prior pregnancies (gravidity), gestational age at birth and date of delivery. The date of questionnaire completion was also recorded. The questionnaire is provided as an Additional file 1.
For each woman, through record linkage using an individual identifier, we extracted from health databases the records of (a) prescriptions redeemed from 2006 to 2012 and (b) birth certificate. All prescriptions redeemed from the estimated date of conception to the date of delivery were considered during the pregnancy. The estimated date of conception was obtained by subtracting gestational age at birth from the date of delivery.
To help the interpretation of the Kappa values, we also calculated sensitivity, specificity, positive and negative predictive value, with 95 % confidence interval (95 % CI). The prescription database was the reference standard. Confidence intervals were calculated according to the method by Wilson [37] to avoid aberrations.
For women who completed the questionnaire before the delivery, the prescriptions dispensed from the estimated date of conception to the date of questionnaire completion were considered for the assessment of agreement.
The same statistics were calculated to assess the agreement between hypertension during pregnancy reported in questionnaire and recorded in the birth certificate database. Hypertension during pregnancy, both reported in questionnaire and recorded in the birth certificate database, was also compared with the use antihypertensive medications, both reported and recorded in the dispensing database.
The Odds Ratio (OR), with 95 % CI, of having at least one agreement between questionnaire and prescription database was calculated through unconditional logistic regression. The following variables were evaluated through uni-and multi-variable analysis: age at delivery, level of education, prior pregnancies, smoking status, comorbidities during pregnancy, country of origin, time of completion, marital status, number of visits and of ultrasound imaging during pregnancy, number of medications reported in questionnaire. The manual process of multivariate model building included entering individual terms and evaluating the likelihood ratio test for inclusion of each variable in the model. Only variables that explained the variability or modified the regression coefficient estimators were retained. The final model included age at delivery, level of education, prior pregnancies, smoking status, and comorbidities during pregnancy and country of origin. Women who did not report any medication use and without any prescription, were excluded from this analysis.
The percentage of women matching exactly or with ±1 and ±2 days of difference, on the date of delivery and gestational age at birth was calculated.
The statistical analysis was performed with SAS© software, version 9.3 (SAS, Cary, NC, USA).

Ethics committee review
The study protocol was approved by the Ethics Committees at the University Hospital of Udine and at the Institute for Maternal and Child Health of Trieste.
Written informed consent for participation in the study was obtained.
Overall, 70.5 % of women (N = 541) redeemed at least one prescription during the pregnancy. Only 2 women were dispensed more than 6 different medications (one 7 and one 9). The median number of dispensing was 2 (25°; 75°percentile: 1; 2), the mean was 1.8 (standard deviation 1.01). Folic acid (36.0 % of women reported the use and 29.0 % had at least one dispensing) and iron (26.2 % and 28.6 %) were the most frequently used medications (Table 2). A total of 146 women (19.2 %) were dispensed antibiotics and 96 (12.6 %) progestogens, but only 20 (2.6 %) and 19 (2.5 %), respectively, reported their use. Of note, 5 women were dispensed antidepressants and one methadone. The use of these medications was not reported.
Except for folic acid (Kappa 0.11 and PABAK 0.22), PABAK was higher than Kappa when this latter indicated slight agreement, such as for antibiotics (0.12 and 0.63), labour repressants (0.18 and 0.98) and medications for acid related disorders (0.17 and 0.81). When Kappa indicated poor agreement, e.g. for nonsteroidal anti-inflammatory drugs, non-opioid analgesics or selective serotonin agonists, PABAK was >0.80. The results did not vary when Kappa and PABAK were calculated separately according to the time of questionnaire completion (i.e. before or after the delivery) (Additional file 2: Table S1). For all medications, the sensitivity of questionnaire vs. database was lower than specificity, and the negative predictive value was >0.90 with the exceptions of iron (0.84), folic acid (0.75), progestogens (0.89), antibiotics (0.82) ( Table 3).
When simultaneously adjusted for age at delivery, level of education, prior pregnancies, smoking status, comorbidities during pregnancy and country of origin, the OR of ≥1 agreement was 0.88 (95 % CI 0. 46

Discussion
About 40 % of women reported the use of medications and about 70 % redeemed at least one prescription during pregnancy. The agreement between self-reported data and database information varied greatly by therapeutic class. It was almost perfect to substantial for medications taken for chronic conditions, such as antihypertensive medications, thyroid hormones, antiepileptic and antithrombotic medications, while it was moderate to slight for OTC medications, such as iron and folic acid. These results are consistent with prior studies [12,[28][29][30]. Medications such as antibiotics or antivirals, taken occasionally, showed slight to fair Kappa-based agreement but, when prevalence and bias were taken into account, the agreement was higher. A prior study found high agreement for antibiotics [38]. The Kappa coefficient is influenced by the prevalence of the condition and by bias. Its value, therefore, was interpreted in the light of additional indices of agreement, such as PABAK. For several medications showing moderate to poor agreement, such as agents for obstructive airways disease and for acid related disorders, progestogens, labour repressants, non-opioid analgesics and antidepressants, the value of these indices suggested    [22] c Anatomical Therapeutic and Chemical classification code [20]  that the low value of Kappa was influenced by the low to very low prevalence of use in the population. Several reasons may explain the level of agreement between self-reported data and prescription redemption records. The type of use affects the accuracy of recall, thus women may recall more accurately medications taken chronically or over longer periods than those taken occasionally. Questionnaire design and question structure influence recall [15,16,39]. Questions specific for individual medications/therapeutic classes or for indication, increase the percentage of affirmative answers [39]; memory aids increase the accuracy of reporting. In this study, the questionnaire was self-administered, questions were open and no memory aid was used. This limitation may have contributed to decrease the positive agreement, in particular for medications taken occasionally.
Antidepressants and methadone were prescribed but not reported. In a recent study, use of antidepressants was not reported by 22 % of users during the first trimester and by 38 % during the second and third [40].
Noncompliance and prescription medication borrowing or sharing [10] may also partially explain disagreement.
We did not consider the prescriptions redeemed before the conception date. However, women may have redeemed a prescription before and taken the medication also after conception, partly explaining the discrepancies between sources.
The database does not capture information on the use of OTC, non-prescription or non-reimbursed medications and herbal preparations. However, their use may have been reported in questionnaires, thus contributing to discrepancy between sources. The estimated prevalence of OTC use by pregnant women is not negligible. In the Netherlands, 12.5 % of pregnant women used OTC medications [41]. In the USA, OTC acetaminophen, ibuprofen, and pseudoephedrine were used by at least 65 %, 18 %, and 15 % of pregnant women, respectively [42].
Moreover, women may report medications taken in the hospital setting, not captured by prescription databases.  The result for progestogens can be partly explained by inhospital use, e.g. for the risk of abortion.
In prior studies, the recall of medications taken during the pregnancy was lower when assessed post-delivery vs. pre-delivery [43,44]. The recall time span was several months to eight years. In our cohort, the agreement did not vary according to the time of questionnaire completion. The recall time span in our study was much shorter, median 30 days (25°-75°percentile 21-45 days). This result confirms that the recall of medication use during pregnancy is higher when data is collected shortly after the delivery.
Women sociodemographic characteristics, health behaviours and conditions influenced the probability of agreement. The agreement was less likely in immigrant women, those with less than university education and current or ex-smokers. Recall accuracy has been positively associated to maternal education [13,44] and negatively to smoking during pregnancy [13,29]. Smoking during pregnancy has been positively associated with poor attention for health, for instance women smokers more frequently do not take folic supplementation [45] and have low adherence to psychotropic medications [46]. Smokers may therefore have a less accurate recall of medications assumption in pregnancy.
In our study, agreement was more likely in primiparae, women experiencing comorbidities during pregnancy and those in the extreme age classes. Women at their first pregnancy, with poorer health condition or aged 40 years or older, may be more concerned on the pregnancy outcome and have a more accurate recall. Another study found that the recall certainty of dates of analgesic use in pregnancy was positively associated with maternal age [13].
The use of medications outside the coverage of the dispensing database, such as herbal medications or vitamin supplements, may be more frequent in subgroups of immigrant and young adult women. This differential use of medications not covered by the database may partially explain the lower likelihood of agreement in immigrant women and in those aged 25 to 29 years.
The likelihood of agreement increases with increasing number of medications reported. Women who use more medications may be those with health problems in pregnancy; therefore, they may recall better the medications used during it. The total number of medications has previously been positively associated with the recall accuracy of prescription analgesics use [13].
Databases do not always capture information on gestational age and date of delivery. Thus, the timing of exposure relative to pregnancy cannot be evaluated. This limitation hinders the use of databases for pregnancy research. The accurate timing of pregnancy is of great relevance for epidemiologic studies of exposures during pregnancy, including medications, and maternal or foetal and infant outcomes. We found a very high agreement  for gestational age and date of delivery between questionnaire data and birth certificate records. This result is in line with a prior study, reporting a high agreement, with positive predictive value >90 %, between birth certificate and medical record data for gestational age [47].
The additional value of this study to the existing literature on the agreement between self-reports and database information on medication use during pregnancy includes the followings: (a) it was performed in a cohort with information on women demographic and socioeconomic characteristics, and it could, therefore, assess the factors associated with agreement; (b) in measuring agreement, the prevalence of the medication use was taken into account through the PABAK calculation; (c) the study evaluated in the same cohort both the agreement for medication use and for gestational age and date of delivery -the latter being crucial for evaluating the reliability of data on the timing of pregnancy.
A strength of this study is that all the women in the cohort were linked to dispensing and birth certificate records, without omissions of specific population subgroups (e.g. low socioeconomic level or immigrant status), confirming the high quality of FVG databases.

Conclusions
The agreement between self-reports and prescription redemption data was high to very high for medications used for chronic conditions. Our findings confirm that maternal reports and prescription redemption data are complementary to each other to increase the reliability of information on the use of medications during pregnancy. Future studies using large administrative data should be considered to assess exposure also with a selfreported questionnaire in a subsample as an internal validation study. The results of this validation study could be used, e.g. in sensitivity analysis, to take into account the impact of possible exposure misclassification, on the association with the outcome.
To assess the use of medications not captured by database, such as OTC, herbal preparations, medications not reimbursed or used in the hospital setting, other sources should be considered, such as primary care or hospital electronic medical records.
The method choice of interview and questionnaire design should account for maternal factors affecting recall, such as sociodemographic and health behaviours, in the target population.
We found a very high agreement for gestational age and date of delivery between maternal reports and birth certificate database. This result suggests that birth certificates provide reliable data on the timing of pregnancy.
Our results show that FVG health databases are a valuable source of data for pregnancy research and for studies on the safety of medications during pregnancy.

Additional files
Additional file 1: Study questionnaire. (DOCX 24 kb) Additional file 2: Supplemental tables. Table S1. Agreement between questionnaire and prescription redemption database for selected therapeutic classes by time of questionnaire completion. Table S2. Number of women with information on hypertension during pregnancy and agreement between questionnaire and birth certificate database. Table S3. Number of women with information on hypertension during pregnancy in questionnaire and in birth certificate database and use of antihypertensive medications according to questionnaire and prescription database. Positive Predictive Value and Negative Predictive Value of prescriptions for antihypertensive medications recorded in questionnaire and in birth certificate database. (DOCX 23 kb)