Development and validation of the prenatal activity restriction stress questionnaire: a Rasch rating scale analysis

Background Antepartum activity restriction (AAR) is a common recommendation given to women at risk for preterm delivery. However, such treatment has been shown to cause heavy emotional burdens on the women receiving it since it requires them to face many challenges derived from the intervention. Nevertheless, current existing scales lack effective items that can reflect the distress of these women. The aim of this study was to develop a reliable instrument to assess the distress of women experiencing AAR. Method The Prenatal Activity Restriction Stress Questionnaire (PARSQ) was developed according to comprehensive literature review, women’s interviews, and existing pregnancy-special stress scales from August 2016 to July 2017 in southern Taiwan. Six experts evaluated its content validity; the Rasch rating scale model (RSM) was used to examine its item-fit, dimensionality, and reliability with 200 women with AAR experience. Furthermore, the concurrent validity was assessed through computing the correlation of AAR women’s scores on the PARSQ and Perceived Stress Scale (PSS), and discriminant validity of the PARSQ was assessed to compare the scores’ differences between the AAR women and the healthy pregnant women. Results The PARSQ was constructed with 23 items in the 4-dimensional scale: Role function changes (8 items), Fetal safety and health (5 items), Physical and psychological care issues (5 items), and Socioeconomic and medical issues (5 items). It was confirmed to have satisfactory content vitality (CVI = 0.78 to 1.0), reasonable item-fit (0.77 to 1.51), and good reliability in RSM model, as well as adequate concurrent validity (p = 0.005) and discriminant validity (p < 0.001). Conclusions Understanding the distress of women undergoing AAR is necessary for developing appropriate prenatal care to assist women in coping with their situation to alleviate their emotional burdens. The developed PARSQ with satisfied psychometric properties can be an informative instrument for clinicians/researchers to assess the specific stress of pregnant women with AAR. Supplementary Information The online version contains supplementary material available at 10.1186/s12884-020-03347-3.


Background
Preterm birth has been known to seriously threaten the health of newborns. Therefore, finding solutions to strengthen the prenatal care of women to avoid preterm birth is a global endeavor [1].
Antepartum activity restriction (AAR) is a common recommendation prescribed for women with threatened preterm labor (TPTL) [2], which is the main precursor and the most common cause of treatment or hospitalization during pregnancy [3]. Whether they are in the hospital or at home, women are advised to implement AAR to avoid the worsening of their symptoms [4] since AAR is thought to calm uterine activity and increase blood flow of the uterus, which may benefit the health of the mother and the fetus [5].
Although AAR is intended to alleviate women's TPTL symptoms to decrease their risk of preterm birth, previous qualitative studies have shown that women were required to cope with many challenges derived from it. To respond to activity restriction, women must suspend their work, redistribute their roles in the family, learn to monitor the subtle changes in their bodies, and adapt to the side effects of tocolytics. Such living contexts change a woman's self-identity and cause heavy emotional burdens [6,7]. Mounting evidence has shown that woman's negative prenatal emotions can lead to adverse pregnancy outcomes, like preterm birth and low infant birth weight [8,9], and negative consequences of maternal and offspring health [10]. Therefore, the prenatal emotional status of women undergoing AAR requires greater attention.
To date, several types of questionnaires have been applied to measure women's prenatal emotional status. The first type measures stressful life events, such as unemployment or divorce, that occur during pregnancy (e.g., the Life Events Inventory [11]); the second assesses the general emotional status of pregnant women (e.g., the Perceived Stress Scale [12] and Edinburgh Depression Scale [13]); and the third type evaluates the pregnancy-specific stress that arises from the women's concerns about physical condition and changes, baby health, and upcoming labor (e.g., the Prenatal Distress Questionnaire [14] and Pregnancy-Related Anxiety Scale [15]). A recent meta-analysis has shown that pregnancyspecific stress measures had greater predictability for women's adverse pregnancy outcomes than other kinds of psychological stress measures [8].
Although 15 pregnancy-specific stress questionnaires have been developed [16], most were based on the context of healthy pregnant women [17,18]. Accordingly, those scales seem more suitable for healthy pregnant women than high-risk pregnant women. Although one scale has been developed based on the life experiences of medically high-risk pregnant women [19], it lacks certain items reflecting the distress of women undergoing AAR due to their TPTL symptoms, such as issues of drug treatment and finding substitutes for their family and workplace functions [6,20]. Consequently, the stress of the women undergoing AAR cannot be reflected fully, which impedes the development of appropriate care measures for assisting these women in coping with their situation to alleviate their stress. In response, this study aimed to develop an instrument that effectively measures the distress or concerns of women undergoing AAR.

Methods
Following an instrument-development procedure [21], three-stage development of the instrument was conducted in southern Taiwan from August 2016 to July 2017 with approval by the Institutional Review Board (A-ER-105-001). In the first stage, a comprehensive literature review and interviews of Taiwanese women with AAR were conducted to conceptualize "Prenatal Activity Restriction Stress," and then items were generated and modified according to this definition to develop the initial Prenatal Activity Restriction Stress Questionnaire (PARSQ). The second stage involved expert panels examining the content validity of the initial PARSQ. In the third stage, the psychometric properties of the modified PARSQ were tested.

Initial PARSQ development
Firstly, a series of literature findings exploring the experience of women undergoing AAR due to their TPTL were reviewed. These studies revealed that women were burdened by both their TPTL and activity restrictions. At the moment of being diagnosed with TPTL, a woman's joyful and healthy pregnancy may suddenly change into an abnormal pregnancy filled with fears and worries. It also raises the woman's concerns about fetal health and their own physical and psychological discomfort. Activity restrictions prescribed for preventing preterm birth force these women to alter their original lives drastically, like being suspended from work and not being able to complete their original roles in the family. Moreover, they have had to monitor all the subtle changes in their bodies. The huge lifestyle changes can cause heavy emotional burdens for these women [6,7,20,[22][23][24].
The literature review findings were then used as the fundamental knowledge to develop the interview guide to interview individually 10 Taiwanese women with AAR due to their TPTL. Most of them were under age 40 (n = 9), and half had experienced hospitalization for treating their TPTL symptoms (n = 5). These women reported that their AAR at home lasted for at least 14 days, and all had experience of taking tocolytic drugs for controlling their TPTL. The interviewed women reported that they too had faced the challenges and conflicts between their inactivity, family life and career similar to the findings of the literature review.
Findings from the literature review and the Taiwanese women's interviews were then integrated to define the construct of "Prenatal Activity Restriction Stress" as 4 dimensions of pregnant women's distress or concerns due to their activity restrictions and pregnancy complications: (1) Role and function changes; (2) Physical and psychological care issues; (3) Fetal safety and health; and (4) Socioeconomic and medical issues. Afterwards, the items were generated according to this definition and other existing stress scales addressing women with highrisk pregnancies [14,15,19,25]. The initial version of the PARSQ containing 29 items with a 5-point scale ranging from 1 (never) to 5 (always) was developed.

Content validity of the initial PARSQ
The content validity of the initial PARSQ was evaluated by six experts with specialties in maternal health, research, and questionnaire development. An item-review document containing a cover letter with an explanation of the PARSQ's purpose and an item-scoring form with instructions was mailed to the panel of experts who were asked to evaluate the relevance, correctness, and comprehensibility of the PARSQ items with a 4-point scale ranging from 1 (not appropriate) to 4 (very appropriate), and to provide suggestions for revising the items. The relevance of items means the degree to which the items relate to a particular construct; the correctness of items means the degree to which they guide respondents toward providing genuine answers, and the comprehensibility refers to the degree to which the item's wording is easy to understand. Subsequently, the content validity index for each item (I-CVI) and the content validity index for all scales (S-CVI) were computed. The I-CVI is calculated as the percentage of experts giving either 3 or 4 on a 4-point scale, while the S-CVI is the average of the total I-CVI in the scale. I-CVI values equal to or greater than 0.78 and an S-CVI greater than 0.8 are considered appropriate when there are six or more experts rating the items [26]. The ranges of the CVI of relevance, correctness, and comprehensibility for each item on PARSQ were 0.67-1.0, 0.67-1.0, 0.5-1.0, respectively, with four items (items 4, 5, and [11][12][13][14] demonstrating unacceptable CVI (Additional file 1) due to unclear and redundant content commanded by the experts. After the four items were modified, the remaining 25 items were then processed for further psychometric properties testing.

Psychometric properties examination of the modified PARSQ
A total of 200 women with AAR experience and 96 healthy pregnant women participated in examining the psychometric properties of the modified PARSQ.

Participants
The target group of this study was women with AAR experience either in the hospital or at home. The eligibilities of the AAR women were being at least 20 years of age, having AAR prescribed or recommended by their doctors due to their TPTL symptoms, such as preterm uterine contractions or vaginal bleeding, before 37 weeks of pregnancy within 1 year. Their responses on the modified PARSQ were analyzed for item quality, dimensionality, concurrent validity, and reliability of the instrument. The healthy pregnant women were at least 20 years old and with no pregnancy complications and AAR experience. Their responses on the modified PARS Q were used to examine the instrument's discriminate validity. Women in both groups who had ever been diagnosed with depression or other affective disorders were excluded.
Purposive sampling was used to recruit the participants in a medical center in southern Taiwan. The participants were recruited mainly via doctor and case manager referrals when attending their regular antenatal or postpartum check-ups at an outpatient clinic. Since few studies lay down substantive rules for determining the sample size calculations for the Rasch model and how to calculate the power of a test power, a rule-ofthumb guide of at least 10 observations per category of scales for reaching the precision of the estimates was used [27]. The responses in the PARSQ category in this study meet this criterion.

Prenatal activity restriction stress questionnaire (PARSQ)
The PARSQ is a self-reported scale designed to detect four domains of specific distresses of activity restriction, including (1) Role function changes; (2) Fetal safety and health; (3) Physical and psychological care issues; and (4) Socioeconomic and medical issues. The initial PARSQ contained 29 questions, which were reduced to 25 after the expert review. Moreover, to account for variations in the situation of each woman, six filter questions (items 3, 5, 15, 18, 23, 24) that attempted to explore the issues of childcare and employment were included. If a respondent had not experienced what the filter question described, they could skip the question. Items in the PARSQ were rated on a 5-point scale from 1 (never) to 5 (always), with higher scores indicating higher distress.

Perceived stress scale (PSS)
The Chinese version of the 10-item PSS with adequate reliability and validity [28] was adopted in this study. It is a self-reported questionnaire originally developed by Cohen and Williamson (1988) [29] for assessing the general perceived stress status of people in a given situation within the past month. The items on the PSS-10 were rated on a 5-point scale ranging from 0 (never) to 4 (very often), with higher scores indicating higher perceived stress.

Data analyses
Although test models of Classic Test Theory (CTT) have been applied for decades in developing measuring instruments due to its simple assumptions, the main shortcomings of CTT are both a person's true scores and item parameters (e.g., item difficulty and item discrimination) that are test-and sample-dependent [30,31]. To address the disadvantages of CTT, many IRT test models, including the rating scale model (RSM), were developed [32]. The RSM, a Rasch-family model, identifies responses in scales with a fixed set of rating points by using complex mathematical equations to estimate the probability of a person endorsing a specific response category of a given item. This method can not only overcome the deficiencies of CTT but also can determine the problematic items through analyzing people's aberrant responses to them [33,34]. Due to the advantages of the RSM, the RSM combined with CTT was applied in examining the psychometric properties of the PARSQ. Several methods, including item analysis, dimensionality, reliability, interpretability, differential item functioning (DIF), and concurrent and discriminant validity, were adopted.

Item analysis
Descriptive statistical analysis, extreme groups' analysis, and RSM, was performed to evaluate the quality of each item on the modified PARSQ. In descriptive statistical analysis, the mean scores and standard deviation of each item are analyzed. If an item has an obviously high or low mean score or low standard deviations (≤ 1.0), it indicates that modification or deletion of the item should be considered because it may be too hard or too easy for the respondents or has low discrimination [35].
Extreme groups' analysis was used to determine whether an item can differentiate respondents with high scores from those with low scores [36]. During the computation of this analysis, the upper and lower 27% of the scorers of each item were identified and then sorted into high-and low-score groups. Thereafter, independent ttests were used to test for significant differences between these two groups, where significance indicates acceptable discrimination.
RSM with two mean-square fit statistics (MnSq) weighted (Infit) and unweighted (Outfit) were adopted to determine undesired response patterns and problematic items [34]. The Infit is an indicator used to detect aberrant response patterns on items near the ability level of the respondents, while Outfit is sensitive to unexpected response patterns on items far exceeds respondents' ability.
The ideal values of Infit and outfit are 1.0, which indicates little distortion in the instrument. Generally, the acceptable range of an item's Infit and Outfit is 0.5-1.5 [37]. After poor-quality items were deleted based on the itemanalysis results, the PARSQ's validity and reliability were then evaluated.
Dimensionality, reliability, interpretability, and DIF evidence Unidimensional RSM and multidimensional RSM were applied to verify the structure of the PARSQ. The deviance difference between these two models was approximated by a Chi-square distribution with degrees of freedom, which is the difference in the estimated number of the two models, to determine which hypothesized model was the best-fit [38]. After confirming the PARSQ's structure, the item-separation reliability and person reliability (expected a posteriori/plausible value, EAP/PV) were obtained to support the PARSQ's reliability. Item-separation reliability reflects whether the questionnaire items can effectively separate respondents with different levels of latent ability; whereas person reliability is the reproducibility of respondents' abilities when they respond to a parallel test, which is equivalent to the concept of consistent reliability (Cronbach's alpha) in CTT. The acceptable value of item and person reliability is 0.6-0.8 [39]. Finally, a visual graph representing the distribution of item and person locations was illustrated to support the PARSQ's interpretability validity [40]. Moreover, to examine the potential response bias between women with AAR experience during their past pregnancy within 1 year and those with a current pregnancy, DIF is a statistical approach used to identify unfair items for which different groups of respondents perform differently. The Chi-square parameter equality test has been calculated and if the p value is < 0.05, the presence of DIF has been verified [41].

Concurrent and discriminate validity
Pearson correlation coefficients (r) between participant scores on the PARSQ and PSS were computed to provide evidence for the PARSQ's concurrent validity. An r value of 0 indicates no association between two scores, a value greater than 0 implies a positive relation, and a value smaller than 0 indicates a negative association. The value of r between 0.1-0.3 represents a low-level relation between two scores; values between 0.3-0.5 indicate a medium-level relation, and values greater than 0.5 indicate a high relation [42]. Evidence for the PARSQ's discrimination was supported by comparing the significant differences in the scores between the women with AAR and the healthy pregnant women. Independent ttest and Mann-Whitney U-test for small samples of a subgroup were adopted, and when the p-value is < 0.05, a significant difference exists between the two groups.
All statistical analyses were conducted with IBM SPSS 17.0 and ConQuest 3.0.1 [41]. Since the Rasch model does not require complete data to make estimates, and estimates still can be computed from the set of nonmissing responses on the scale without imputing estimates or deleting any portion of the data [43], the responses of the filter items of the PARSQ were treated as missing-response data and left as a period (.) when conducting analysis using ConQuest.

Characteristics of participants
Two hundred women with AAR experience (135 women with AAR experience during their past pregnancy within 1 year and 65 women with AAR experience during their current pregnancy) and 96 healthy women were included in this study. Since no unfair items of the PARS Q were indicated by the result of DIF (p = 0.267), we treated these two kinds of women with AAR experience as a whole group to test the psychometric properties of PARSQ.
As shown in Table 1, the majority of the women with AAR experience were younger than age 35 and had earned a bachelor's degree. Most had full-time jobs, and approximately 80% had a family income of at least 40, 000 Taiwan dollars per month (Table 1). Nearly 50% of the women's AAR experience occurred at their first pregnancy, and 37% had hospitalization experience. Most of these women had the experience of taking tocolytic drugs for alleviating their TPTL symptoms, and Ritodrine (Yutopar) was the drug most commonly reported. Women's AAR experience has been reported to occur at an average of 27.2 weeks of gestation. Aside from lacking activity restriction experience and treatments for pregnancy complications, there was no significant difference in the characteristics of the healthy pregnant women and the AAR women (Table 1).

Item analysis
The mean score range for all items was between 1.64 and 3.93, with the standard deviation (SD) ranging between 0.85 and 1.61. The mean scores of all items were within the mean of the whole scale ±1 SD (mean score of total scale = 3.04, SD = 1.41) ( Table 2). In the extreme groups' comparison, the differences between the top and bottom 27% of each item was between − 15.39 and − 3.91, and all reached significance (p < 0.001), indicating the PARSQ items were with good discrimination. Moreover, the Infit  (Table 2). After items 6 and 23 were deleted due to undesirable fit parameters, the remaining 23 items underwent further analysis.

Dimensionality, reliability evidence of the PARSQ
To confirm the PARSQ's structure, the two hypothesized models, namely the unidimensional and 4-dimensional models, were compared. The results showed that the 4dimensional model was the better model, since its deviance was 113.73 less than that of the unidimensional model, and this value reached statistical significance in the Chi-square distribution with nine degrees of freedom ( Table 3). The correlations among these 4 dimensions ranged from 0.638-0.845; meanwhile, the person-separation reliability of these dimensions ranged from 0.744-0.865, and the item-separation reliability was 0.993. These values indicate that the items in the 4 dimensions have adequate internal consistency and can separate respondents with different levels of stress of activity restriction. The Infit and Outfit MnSq of the remaining 23 items also demonstrated acceptable fit parameters under the 4-dimensional model ( Table 4). The item difficulties are shown in Table 4. In dimension 1, the easiest item was item 1 (Feel troubled by not being able to go out to run errands), and the hardest was item 19 (Worry about relationships with other family members deteriorating). In dimension 2, the easiest item was item 8 (Worry about baby's development and health), and the hardest was item 10 (Worry that the labor process may harm the baby). In dimension 3, the easiest item was item 12 (Feel annoyed by physical discomfort), and the hardest was item 16 (Feel troubled by necessary physical activity). And in dimension 4, the easiest item was item 24 (Feel distressed with having to ask for leave from work for bed rest), and the hardest was item 20 (Feel annoyed with the unclear way of obtaining pertinent information on management of physical symptoms and coping with AAR).  Interpretability evidence of the PARSQ The item-person maps by dimension were illustrated, and the item difficulties and person ability were estimated in the dimensions of PARSQ to assess whether the PARSQ is appropriate for the respondents (Figs. 1  and 2). For the item-person maps, the right side of the map showed the locations of item difficulty, while the left side demonstrated the distribution of respondent ability, which is marked with "X." Items at the top displayed a higher difficulty than those at the bottom of the map, while higher locations for respondents indicated they had higher latent ability. Ideally, the difficulties of the items on the scale were expected to be spread across the ability range of all respondents; thus respondents with different abilities had items with corresponding difficulties that could accurately reflect their abilities. As illustrated in Figs. 1 and 2, item difficulties were generally distributed among ±3 logits and the ability range of the respondents ranged from − 2.26-2.24 logits, 10 which indicated that the item difficulty was very close to the respondent ability and the items of the PARSQ can cover the distribution of most respondents' ability. However, it should be noted that there was a lack of some equivalent-level items in dimension 3 to reflect the status of respondents with higher latent abilities. In addition, some items with similar difficulty also existed, such as the difficulty of item 3 being close to that of item 4 in dimension 1, and item 13 being close to that of item 16 in dimension 3. Moreover, the test information curves (Fig. 3) for the total and subscale of the PARSQ also demonstrated that the PARSQ provided the greatest amount of information for respondents who had abilities between ±3 logits. Above findings indicated the PARSQ had satisfied interpretive validity for women with AAR stress levels of ±3 logits.

Concurrent and discriminant validity of the PARSQ
For verifying the concurrent validity of the PARSQ, the scores for the PARSQ and PSS of 65 pregnant women with activity restriction experience were analyzed. Results showed that women's PARSQ scores positively correlated with their PSS scores (r = 0.348, p = 0.005), especially for dimensions 3 (r = 0.461, p < 0.001) and 4 (r = 0.382, p = 0.002). For examining the PARSQ's discrimination, the total scale and subscale scores of the women with AAR experience and healthy pregnant women were compared. Since the women's total scores on the PARS Q may be different due to their parity and employment status, subgroup analysis was conducted according to parity to confirm whether any difference in the PARSQ scores existed between the AAR women and the healthy pregnant women. The results indicated that the AAR women's scores on the PARSQ total scale and subscales were all significantly higher than those of the healthy women regardless of parity and employment status (Table 5).

Discussion
Pregnancy stress has been proven to affect the health of the offspring [8]; it has therefore garnered much attention in recent years. As such, the stress of women undergoing AAR to prevent preterm birth should not be ignored since they have to face many challenges and must learn to live with the threat of preterm birth. To fill this need, this study developed and evaluated the psychometric properties of the PARSQ to assess the distress of women undergoing activity restriction during pregnancy. Finally, the PARSQ with 23 items measuring the 4 dimensions of distress (Role function changes, Fetal safety and health, Physical and psychological care issues, and Socioeconomic and medical issues) using 5point Likert-type response scale was developed and confirmed to have appropriate validity and reliability.
Unlike previous scales, which were mostly developed based on the life situation of healthy pregnant women [16], the PARSQ was developed based on the life experience of medically high-risk women with AAR due to their TPTL symptoms. Thus, although there are other scales developed based on the context of medically highrisk women, like the High-Risk Pregnancy Stress Scale (HRPSS) [19], PARSQ has the preterm labor contextspecial items, such as issues of tocolytics treatment, relationship with important others, and employment. Furthermore, different from the previous 15 pregnancyspecial stress instruments, all of whose psychometric properties were tested based on classical measurement theory [16], the item quality and item difficulty of the PARSQ were confirmed through the RSM. Therefore, each item of the PARSQ was fashioned with particular function and difficulty to detect the specific distresses of women with AAR, the features of which increase the convenience of its use in a time-limited clinical setting. For example, the simpler questions on each PARSQ dimension (like item 1 on dimension 1 and item 8 on Fig. 3 Test information curve for the full-scale and subscales of the PARSQ dimension 2) can be performed first with the women, and then the more difficult questions can be given as needed. Moreover, since the equal interval measurement properties of the PARSQ were also verified by RSM, the total scores of the PARSQ are reasonable to compare differences in the degree of distress between groups. However, since the parity, treatment and employment status would affect women's responses on the filter items and total scores on dimensions 1, 3, and 4, we suggest that subgroup analysis should be done according to the women's parity, treatment, and employment status when conducting difference comparison.
Although advantages of the PARSQ had been demonstrated, some points and improvements should be of concern. First, although the PARSQ was developed based on comprehensive literature review and interviews with AAR women, apart from 4 items with unclear content that were deleted in content validity analysis, another 2 items (item 6: "losing baby" and item 23: "losing my job"), which unexpectedly showed misfit in the RSM, were also deleted. One possible reason for the misfit of item 6 may be related to the good accessibility of maternal healthcare under National Health Insurance in Taiwan [44], which could reduce a woman's fear of losing her baby. Another reason could be the death taboo in Taiwanese culture, in which people are discouraged from talking about death as it may bring bad luck [45]. The content of item 6 could make the women associate it with stillbirth or fetus death; therefore, they would tend to conceal their true feelings and answer the question positively. The reason for the misfit of item 23 may be related to Taiwan's Act of Gender Equality in Employment and Labor Standards Act, which gives women legal protection on their right to work during pregnancy [46]. Therefore, women who undergo activity restriction might not perceive much distress on item 23: "losing my job." Second, to complete the PARSQ's effectiveness, generating items with higher levels of stress difficulty in dimension 3 and modifying items with similar difficulties should be made in a subsequent study. Third, the PARSQ's validity and reliability were primarily tested on a population of women with activity restriction due to TPTL. To expand the PARSQ's usefulness, its validity and reliability could be further evaluated among women who are advised to restrict their activity to alleviate their pregnancy complications or improve the success of implantation [47]. Fourth, to understand the effect of stress of activity restriction on women's pregnancy outcomes, the PARSQ's predictive validity should be tested in the future. Finally, associations between the PARSQ and other biological measures could also be tested in future work to explore the influence of stress of activity restriction on pregnant women's physiological health. A main limitation of this study should be noted: nonpregnant women who had experienced AAR within the past year were included in this study. Even though no study revealed the different experiences with AAR between women who are pregnant and those who have already given birth, we still cannot exclude the impact of retrospective recall on the psychometric properties of the PARSQ. Retrospective recall can be influenced by a woman's pregnancy outcome and may conflict with accurate recall [48]. Therefore, even though no DIF existed among items of the PARSQ between pregnant and nonpregnant AAR women, showing no difference in responses of these two groups on the PARSQ's items, we cannot completely exclude the impact of retrospective recall on the psychometric properties of the PARSQ.

Conclusion
Understanding the distress of women with should be a key component for developing appropriate prenatal care to assist women in coping with their situation to avoid the adverse consequences of the emotional burden on both maternal and offspring health. The developed PARSQ with 23 items and 4 dimensions was confirmed to have satisfied validity and reliability. Since the PARSQ is a content-based scale, it allows clinicians or researchers not only to characterize the stress levels experienced by women undergoing activity restriction, but it also offers a more tailored understanding of their distress during this specific period. Therefore, the PARSQ is a valuable measure that can be used as a foundation for healthcare providers to develop appropriate care for meeting women's individual needs.