Rationale The response characteristics of the 6 minute walk test (6MWT) in studies of idiopathic pulmonary fibrosis (IPF) are only poorly understood, and the change in walk distance that constitutes the minimum important difference (MID) over time is unknown.
Objectives To examine changes over time in distance walked (ie, 6MWD) during the 6MWT and to estimate the change in distance that constitutes the MID in patients with IPF.
Methods Data from a recently completed trial that included subjects with IPF who completed the 6MWT, Saint George's Respiratory Questionnaire (SGRQ) and forced vital capacity (FVC) at 6 and 12 months were used to examine longitudinal changes in 6MWD. Both anchor- and distribution-based approaches as well as linear regression analyses were used to determine the MID for 6MWD. The SGRQ Total score and FVC were used as clinical anchors.
Main results Among 123 subjects alive and able to complete the 6MWT at both follow-up time points, 6MWD did not change significantly over time (378.1 m at baseline vs 376.8 m at 6 months vs 361.3 m at 12 months, p=0.5). The point estimate for the 6MWD MID was 28 m with a range of 10.8–58.5 m.
Conclusion In a group of patients with IPF with moderate physiological impairment, for those alive and able to complete a 6MWT, 6MWD does not change over 12 months. At the population level, the MID for 6MWD appears to be ∼28 m. Further investigation using other anchors and derivation methods is required to refine estimates of the MID for 6MWD in this patient population.
- Interstitial lung disease
- idiopathic pulmonary fibrosis
- 6-minute walk test
- minimum important difference
- interstitial fibrosis
Statistics from Altmetric.com
- Interstitial lung disease
- idiopathic pulmonary fibrosis
- 6-minute walk test
- minimum important difference
- interstitial fibrosis
Idiopathic pulmonary fibrosis (IPF) is a progressive, fatal fibrosing interstitial lung disease that impairs functional status and quality of life. Although recent advances in our understanding of the underlying pathogenesis of IPF have led to the investigation of numerous profibrotic pathways,1 no medical treatment has been shown to improve respiratory symptoms or functional status, or to prolong survival.
The 6 minute walk test (6MWT) is a simple and easily performed assessment of a subject's submaximal functional capacity. It measures the distance a patient can walk on a flat surface over a period of 6 min and is used to measure the response to medical interventions in patients with moderate to severe cardiac or pulmonary disease.2 Its reproducibility (with a coefficient of variation of ∼8%) appears to be better than the 1 s forced expiratory volume in patients with chronic obstructive pulmonary disease (COPD),3–6 and it has less short-term variability than questionnaire indices of functional status (22–33%).7
For IPF, the 6 minute walk distance (6MWD) appears highly reproducible (test–retest reliability 0.98) over short time intervals (eg, 1–2 weeks), and is highly correlated (r = 0.78) with maximal oxygen uptake derived from a maximal cardiopulmonary exercise test.8 In two relatively small studies, investigators have examined changes in distance walked during the 6MWT over longer time periods in patients with confidently defined IPF. Tomioka and colleagues9 found that 6MWD declined to a non-significant degree (37.3 ± 120.9 m, p = 0.2) from baseline among 32 subjects with IPF who completed a 6MWT after a median 14 months of follow-up. In a recently completed placebo-controlled study investigating the efficacy of etanercept in the treatment of IPF, the 6MWT was used as a secondary endpoint. In terms of 6MWD, there was no statistically significant difference between the two treatment arms, and subjects in the placebo arm (n = 41) experienced a mean decline of 14.7 (±112.5) m (p = 0.5) at week 48, with changes of −2.2 (±65.80), 2.1 (±71.7) and −22.3 (±106.8) m at weeks 12, 24 and 36, respectively.10
What is not known about the 6MWT in patients with IPF is what distance constitutes the minimum important difference (MID)—the smallest change in distance that patients can perceive as different from the previous test and that would mandate, in the absence of troublesome side effects and excessive costs, a change in management.11 Several methods can be used to determine the MID for an outcome measure, but there is much controversy over which ones are best.12 A common approach is to use multiple methods to derive MID estimates. One—the anchor-based method—calls for a different but related clinical variable to be a so-called anchor; the mean change in the outcome (eg, 6MWD) calculated for subjects who change “minimally” according to that anchor is the MID estimate. There are no restrictions on what variables can serve as anchors; however, any anchor must be clinically relevant and cut-off values for defining minimal change should be sensible. Distribution-based methods, including the effect size, standardised response mean and standard error of measurement, use statistical calculations based solely on study sample data for the outcome variable to derive the MID.
The goals of this study were to determine the changes in 6MWD over time in a large, well-defined sample of subjects with IPF with moderate physiological impairment and to use multiple approaches to determine the MID for 6MWD in this population.
Data from the international, prospective, double-blinded, randomised, placebo-controlled, parallel group study investigating the use of bosentan in the treatment of patients with IPF (BUILD-1) were analysed.13 Its study design, inclusion and exclusion criteria, and primary results have been previously published.13 Briefly, patients were enrolled in BUILD-1 if their IPF diagnosis met American Thoracic Society (ATS)/European Respiratory Society (ERS) consensus guidelines,14 if they were diagnosed with IPF between 3 and 36 months prior to enrolment, and if a screening 6MWD was between 150 and 499 m. Patients were excluded if they had a forced vital capacity (FVC) of <50% or >90% predicted,15 a diffusing capacity for carbon monoxide (DLCO) corrected for haemoglobin level <30% predicted,16 a resting arterial oxygen pressure (PaO2) <55 mm Hg (sea level) or 50 mm Hg (>1400 m), echocardiographic evidence of severe pulmonary hypertension (systolic pulmonary pressure >50 mm Hg or tricuspid regurgitation velocity >3.2 m/s) or severe congestive heart failure. The study was approved by the appropriate independent ethics committees or institutional review boards and conducted in accordance with the principles of the Declaration of Helsinki, local laws and guidelines for good clinical practice. All patients provided written, informed consent. Our study sample was comprised of subjects with baseline, 6 and 12 month 6MWD data.
The 6MWD was measured without supplemental oxygen using a modified 6MWT protocol, conducted in accordance with ATS guidelines,2 but terminated prematurely if SpO2 fell below 80% (pulse oximeter; Nellcor N-595; Pleasanton, California, USA). Technicians conducting the 6MWT were blinded to all questionnaire data.
The Saint George's Respiratory Questionnaire (SGRQ) is a self-administered, obstructive lung disease-specific questionnaire with 50 items comprising three domains (Symptoms, Activity and Impact) and a Total score. Domain and Total scores range from 0 to 100, with higher scores corresponding to worse health-related quality of life.
Descriptive statistics were used for baseline, 6 month and 12 month data. Change over time in 6MWD was examined by using longitudinal data analytic methods. We used SAS PROC MIXED (SAS Institute, Cary, North Carolina, USA) to model mean 6MWD at each study time point (baseline, 6 and 12 months) while handling the within-subject correlation of repeated measures of 6MWD. Thus, the model yielded least squared means estimates for 6MWD at each of the three time points, and the null hypothesis of equality among the three mean values for a given variable was tested. Even with mild deviations from normality and particularly for sample sizes >100—the data from this sample of 123 subjects were mildly skewed—PROC MIXED will yield robust estimates.
Next, we stratified subjects according to anchor score changes (SGRQ Total score and FVC) over the two time intervals—from baseline to 6 months as well as 6–12 months. The SGRQ Total was chosen as an anchor because it captures patients' perceptions about their own health status; thus it assesses certain effects of IPF on patient well-being. In a previous study, we determined the MID for the SGRQ Total to be 7 points, with a range of 5–10 points. Thus, for this analysis, we categorised subjects as “unchanged” if the difference in SGRQ Total was within 5 points (exclusive) of the value at the previous time point, as “changed minimally” if the difference in SGRQ Total was 5–10 points (exclusive) different from the previous time point, and as “changed more than minimally” if the difference was ≥10 points. For each time interval, the mean change in 6MWD was calculated for subjects stratified according to categorical changes (ie, unchanged, changed minimally or changed more than minimally) in SGRQ Total score. To show the ability of SGRQ Total to discriminate between subjects whose 6MWD changed over time, we used repeated measures analysis of variance (ANOVA) to compare linear contrasts in mean changes in 6MWD across categories of change in SGRQ Total score. We used this same approach for the FVC anchor: first, stratifying the sample on change in FVC, then calculating mean changes in 6MWD for categories of change in FVC, and finally using repeated measures ANOVA to compare changes in 6MWD across FVC categories. For FVC, we categorised subjects as “unchanged” if the difference in the raw FVC was within 7% (inclusive) of the value at the previous time point, as “changed minimally” if the difference in the raw FVC value was between 7% and 12% (exclusive) of the value at the previous time point, and as “changed more than minimally” if the difference was ≥12%. We considered a 7–12% change in raw FVC as a minimally important change because this range covers 7% (a so-called marginal change that has been shown to carry prognostic significance in IPF17 18) as well as 10% (a commonly used endpoint in trials and in clinical practice14).
To estimate the MID for 6MWD, we calculated weighted averages for 6MWD for subjects with minimal change in the SGRQ Total (5–10 points) or FVC anchor (7–12% from the baseline raw value). We next used the within-patient anchor-based method19 to derive another MID estimate for 6MWD. For this method, linear regression is used to examine the relationship between changes in 6MWD (dependent variable) and changes in the anchor—SGRQ Total score or FVC% (independent variables). Lastly, we used distribution-based methods to generate MID estimates. The effect size (referred to as ES and calculated as the change in 6MWD divided by the SD of the baseline 6MWD) is one such method.20 For ES, values of 0.2 are considered small effects, 0.5 medium effects, and 0.8 large effects. Although there is no consensus about how or even whether12 ES should be used in the estimation of MIDs, some investigators consider 0.5 to correspond to the MID.21 22 Thus, we derived the change in 6MWD that would correspond to an ES of 0.5 and used that as an MID estimate. All analyses were performed with SAS version 9.1.3 (SAS Institute), and p values <0.05 were considered statistically significant.
The mean age of the sample was 65 years, and most subjects were men (table 1). Among all subjects able to complete a 6MWT at baseline and at both 6 and 12 month follow-up (n=123), there was no statistical difference in mean 6MWD between any time point (figure 1). Further, table 2 shows that baseline walk distance appeared not to influence subsequent measures of 6MWD.
Statistically significant relationships were observed between change in 6MWD and change in SGRQ Total score or FVC (table 3). Table 4 gives MID estimates from distribution- and anchor-based approaches. The distribution-based approach (ES=0.5) yielded a higher estimate than the anchor-based approaches. The mean of all MID point estimates yielded a value of 28 m.
For patients with IPF, the 6MWT appears to be a valid reflection of global functional capacity8; it is frequently used clinically to assess changes in IPF disease status over time but with few data to support this practice. More importantly, the 6MWT—either the 6MWT itself, variations thereof or data collected during the 6MWT (eg, measures of oxygenation)—has been used as an outcome measure in trials enrolling subjects with IPF.23–25 However, there are large knowledge gaps regarding certain important aspects of this test in IPF.
In this study, for a select group of well-defined subjects with IPF who were able to walk no less than 150 m and no more than 499 m during a baseline 6MWT, and who performed follow-up testing, we found no significant change from baseline in 6MWD at 6 or 12 months. These data are supported by studies from Tomioka and Raghu who also found no significant change over time frames ranging from about 12 to 14 months.9 10
Our study estimates the MID for 6MWD in patients with IPF, which we found to be in the range of 28 m. To our knowledge, only two other groups of investigators have assessed the MID for the 6MWD in patients with IPF. In a study published only in abstract form, Mathai and colleagues divided into two subgroups 20 patients with IPF who were participating in a 1 day support group. Each patient performed a 6MWT, and then each subgroup of 10 patients spent the day together at a support group meeting (details not provided). At the end of the day, each patient was asked to rate his ability to walk relative to the other members in his subgroup. These investigators found that 6MWD needed to differ by a mean 17.9±103.6 m for patients to stop rating themselves as “the same” as other members in their group. Redelmeier and colleagues developed this method, and used it to show that the MID for 6MWD in patients with COPD is 54 m (95% CI 37 to 71 m); that is, the mean difference in 6MWD between subjects who rated themselves as being able to walk either “a little bit better” or “a little bit worse”, compared with those who rated themselves as being able to walk “about the same” as other patients, was 54 m. In that study, patients who rated themselves as “a little worse” on average walked 80 m less than other patients in their group, whereas subjects who rated themselves as “a little better” walked only 30 m more than other patients in their group—yielding a weighted average of 54 m among subjects who rated themselves as minimally (or “a little…”) different from other patients.
Another method commonly used to determine the MID for an outcome measure involves asking subjects to provide a so-called transition assessment at the time of follow-up testing. For example, at 6 months, subjects (blinded to baseline and follow-up data) might be asked to report whether they perceived their 6MWD to be “the same as”, “a little bit less”, “a lot less”, “a little bit greater” or “a lot greater” than their baseline walk. The mean 6MWD for subjects reporting a minimal change in perceived 6MWD (ie, “a little bit less” or “a little bit greater”) would be the estimate of the MID. Singh and co-investigators used this method to estimate the MID for the incremental shuttle walk test to be 47.5 m in patients with COPD.26 In the only study of the 6MWD MID in IPF published in manuscript format, Holland and her colleagues used global change ratings, receiver operating characteristic (ROC) curves and a distribution-based approach to derive their estimate.27 Among 24 subjects with IPF who completed a 6MWT before and after an 8 week exercise programme, the MID was found to be between 29 and 34 m.
No transition assessment was used in the BUILD-1 trial, but we employed a related (and perhaps the most common) method of determining the MID—an anchor-based method. Although any variable can be an anchor, each should meet three criteria: (1) it should be related to the outcome variable; (2) it must possess face validity; and (3) it must be able to be divided into at least three categories: no change, minimal and other.12 As one anchor, we selected the SGRQ Total for several reasons: (1) it is a patient-oriented or patient-assessed outcome measure and, as such, it asks specifically about patients' perceptions28—a notion viewed by many investigators as paramount to deriving the MID for an outcome variable12; (2) IPF impairs—and patients value—the quality of their lives, so the SGRQ and other health status questionnaires provide meaningful data in this population29 30; and (3) we have shown previously that a 7-point change in SGRQ Total score is its MID among patients with IPF.31 As the second anchor we chose the FVC because it is perhaps the most widely used physiological measurement to assess IPF severity, and cut-off values for its clinical and prognostic meaningfulness have been established in a number of studies.32–36 We elected to use FVC rather than DLCO because of the greater intrinsic variability in DLCO measures as well as our inability to define confidently the clinically significant range for its minimum change.
There are a number of limitations to this study. As is often the case, the distribution-based method yielded higher estimates for the MID than the anchor-based methods.37 The selected population included only patients with IPF who could walk >150 m but <499 m during a 6MWT at baseline—and the overwhelming majority could walk at least 350 m. Furthermore, data from subjects who died, or who were unable to complete the 6MWT for other reasons, were not included in our analyses; this could introduce bias. Thus, inferences drawn here may not be applicable to extremely debilitated patients with IPF in the latter stages of the disease or to more fit patients with IPF in the earlier stages of disease. However, the results of this study may have important implications for both IPF investigators who plan to power future IPF studies for change in 6MWD as well as clinicians who prognosticate future changes in functional status. While the pattern of decline in 6MWD over time is unclear, if a linear decline were assumed, one might expect a 30 m decline in 6MWD to occur, on average, at 31 months of follow-up (data not shown). It has been recommended that multiple anchors (in multiple studies) be used to generate a range for the MID for any outcome variable.12 Although this is but one study, we derived the MID for 6MWD by using two clinically meaningful anchors as well as by employing distribution-based methods. Furthermore, our estimate is nearly identical to the estimate from the only other published study to examine the 6MWD MID in IPF. Finally, it must be recognised that the MID estimate here is to be used at the population level; that is, the mean change in 6MWD that is considered clinically important in a population is “often much less” than the change in 6MWD that would allow a practitioner to be confident that a change within an individual patient is outside of the inter-test variability.37
In conclusion, data from this study provide the first systematic examination of 12 month longitudinal changes in 6MWD as a prospectively acquired, primary outcome variable in a well-defined subset of patients with IPF, as well as an estimate of the MID for 6MWD in patients with this disease. It appears as though the MID for 6MWD is smaller for IPF than for COPD, but future confirmatory studies should be performed to estimate the MID for 6MWD in IPF.
The authors would like to acknowledge all of the investigators involved in the BUILD-1 trial: Ishaar Ben-Dov, Charles Chan, Jean-Francois Cordier, James Dauber, Joao De Andrade, Adaani Frost, Thomas Geiser, Marilyn Glassberg, Jeffrey Golden, Gary Hunninghake, Sanjay Kalra, Lisa Lancaster, Robert Levy, Keith Meyer, Joachim Mueller-Quernheim, Paul Noble, Christophe Pison, Charles Poirier, Milton Rossman, Paola Rottoli, Gerd Staehler, Athol Wells, Gordon Yung and David Zisman. The authors also wish to acknowledge the study coordinators and nurses for their hard work and all of the study subjects for their generous contributions.
Funding Actelion Pharmaceuticals funded the performance of the underlying BUILD-1 trial that investigated the efficacy of bosentan in the treatment of idiopathic pulmonary fibrosis. No additional external funding was obtained. All authors had full access to all the data in this study. The corresponding author had final responsibility for the decision to submit the paper for publication.
Competing interests JB, RMduB, TEK, GR and KKB served on the steering committee for the BUILD-1 trial, sponsored by Actelion Pharmaceuticals. JJS has served as a paid consultant to Actelion Pharmaceuticals. FSW has no competing interests to declare.
Ethics approval This study was approved by the appropriate independent ethics committees or institutional review boards.
Provenance and peer review Not commissioned; externally peer reviewed.