Abstract
PURPOSE The way in which spirometry is interpreted can lead to misdiagnosis of chronic obstructive pulmonary disease (COPD) resulting in inappropriate treatment. We compared the clinical relevance of 2 criteria for defining a low ratio of forced expiratory volume in 1 second to forced vital capacity (FEV1/FVC): the fixed ratio and the lower limit of normal.
METHODS We analyzed data from the cross-sectional phase of the population-based Canadian Cohort of Obstructive Lung Disease (CanCOLD) study. We determined associations of the spirometric criteria for airflow limitation with patient-reported adverse outcomes, including respiratory symptoms, disability, health status, exacerbations, and cardiovascular disease. Sensitivity analyses were used to explore the impact of age and severity of airflow limitation on these associations.
RESULTS We analyzed data from 4,882 patients aged 40 years and older. The prevalence of airflow limitation was 17% by fixed ratio and 11% by lower limit of normal. Patients classified as having airflow limitation by fixed ratio only had generally small, nonsignificant increases in the odds of adverse outcomes. Patients having airflow limitation based on both fixed ratio and lower limit of normal had larger, significant increases in odds. But strongest associations were seen for patients who had airflow limitation by both fixed ratio and lower limit of normal and also had a low FEV1, defined as one less than 80% of the predicted value.
CONCLUSIONS Our results suggest that use of the fixed ratio alone may lead to misdiagnosis of COPD. A diagnosis established by both a low FEV1/FVC (according to fixed ratio and/or lower limit of normal) and a low FEV1 is strongly associated with clinical outcomes. Guidelines should be reconsidered to require both spirometry abnormalities so as to reduce overdiagnosis of COPD.
INTRODUCTION
Chronic obstructive pulmonary disease (COPD) is one of the most prevalent chronic diseases and the cause of much morbidity and mortality worldwide.1 COPD has been estimated to occur in up to 25% of the population aged 40 years and older1–3; however, both overdiagnosis and underdiagnosis pose challenges in daily practice, resulting in inappropriate patient management.2–10 COPD guidelines recommend the use of a low ratio of forced expiratory volume in 1 second to forced vital capacity (FEV1/FVC) to establish the diagnosis in patients with chronic respiratory symptoms or those at risk.1,11,12
There is currently no consensus on the best criteria to be used for the spirometric confirmation of a clinical diagnosis of COPD. A lively debate revolves around 2 main measures. One measure is the FEV1/FVC fixed ratio of less than 0.7, alone or in combination with a low FEV1 set at a value less than 80% of predicted; the combination comprises Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage 2 or worse disease. The other measure is the FEV1/FVC ratio below the 5th percentile, in other words, below the lower limit of normal (LLN).1,11–17 Prevalence studies have shown discordance between the fixed ratio and LLN, suggesting potential overdiagnosis by the former or underdiagnosis by the latter.3,4,8,18,19
This controversy persists because of the absence of a reference standard for the diagnosis of COPD.19 The associations of the 2 criteria with clinical outcomes have been assessed in a few studies, with unclear results on the relative superiority of 1 criterion over the other when it comes to their relationship with different clinical outcomes.20 The cross-sectional phase of the Canadian Cohort of Obstructive Lung Disease (CanCOLD) study provided us with a large population-based database to further explore the clinical relevance of these criteria by examining their associations with multiple patient-reported outcomes such as symptoms, health status, dyspnea-related disability, exacerbations, and cardiovascular comorbidity, with adjustments for age, sex, and smoking exposure.
METHODS
We analyzed data from 5,176 people from the general population aged 40 years and older to assess the clinical relevance of differing diagnostic criteria for COPD. Data were collected between August 2005 and May 2009 in a large, cross-sectional, multisite, population-based study on lung health, which constituted the first phase of the longitudinal CanCOLD study. The sampling strategy and study protocol of the baseline cross-sectional part of the study were the same as those used in the international Burden of Obstructive Lung Disease (BOLD) initiative, the full details of which have been published elsewhere.21 Briefly, random samples of noninstitutionalized adults from 9 Canadian urban sites were drawn from census data of Statistics Canada (Survey and Analysis Section), and recruitment was conducted by the NRG Research group by random telephone digit dialing to identify eligible individuals21 who were invited to attend a clinic visit to complete interviewer-administered respiratory questionnaires and to perform pre- and post-bronchodilator spirometry. The only exclusion criterion was inability or refusal to perform spirometry. The mean clinic visit participation rate was 74% (range, 63% to 87%).21 All participants gave written informed consent, and the study was approved by the respective institutional ethical review boards. CanCOLD is registered at http://www.clinicaltrials.gov (NCT00920348).
Measurements
Spirometry was performed according to the acceptability and reproducibility criteria of the American Thoracic Society and European Respiratory Society guidelines.22 Measurements included FEV1 and FVC. Data in administered questionnaires provided information on age, sex, body mass index, educational level, and race; smoking status, including number of pack-years; and the patient report of physician-diagnosed COPD, emphysema, and chronic bronchitis.
Additional questionnaire data included patient-reported outcomes: (1) respiratory symptoms including chronic cough and chronic phlegm for most days during 3 months in the last year; chronic bronchitis (defined as chronic cough and chronic phlegm for more than 2 years); and wheeze in the last year (not only during colds); (2) disability by the Medical Research Council (MRC) dyspnea scale; (3) general health status as assessed by the Short Form 12-item health survey (SF-12); (4) self-reported exacerbations of COPD; and (5) self-reported physician-diagnosed cardiovascular disease.
We compared 2 spirometric criteria for determining airflow limitation—a postbronchodilator FEV1/FVC ratio of less than 0.70 (fixed ratio) and a postbronchodilator FEV1/FVC ratio of less than the 5th percentile (LLN)—using the reference equations from the Third National Health and Nutrition Examination Survey (NHANES III).23 We assessed diagnosis according to presence of either or both criteria.
Outcomes and Modifying Factors
We defined clinically relevant outcomes as follows: respiratory symptoms including chronic cough, chronic phlegm, chronic bronchitis, and wheeze; health status, as measured by SF-12; disability, as measured by MRC dyspnea scale; exacerbations of COPD as measured by ever having breathing problems that interfered with usual daily activities or caused one to miss work; and cardiovascular disease (heart disease and stroke).
We assessed a variety of factors for their potential modification of the relationship between the 2 diagnostic criteria and clinically relevant outcomes, including sex, age-group (younger or older than 60 years), cardiovascular disease, and smoking status. In particular, we studied the impact of the severity of airflow limitation, assessed by FEV1 as the percentage of the predicted value, on the association of fixed ratio or LLN with the outcomes.
Statistical Analysis
We analyzed the associations of the 2 main diagnostic criteria with clinically relevant outcomes. In addition, we compared 6 subgroups of individuals who satisfied various diagnostic criteria consisting of single spirometric measures or combinations of measures. Univariate analysis of variance (ANOVA) or the Kruskal-Wallis test for continuous variables, and the χ2 square test for dichotomous variables were performed to compare the subgroups on baseline characteristics. We analyzed the clinical relevance of the diagnostic criteria according to the subgroups by a multivariate regression model, controlling for confounders at baseline. Logistic models were used for binary data, and linear models were used for continuous data. In addition, we performed interaction analyses between the spirometric criteria and modifying factors to determine whether any of these factors significantly modified the association with clinical outcomes. Finally, sensitivity analyses were performed by including 2 additional subgroups with a low FEV1, defined as a value less than 80% of predicted: a subgroup meeting the fixed ratio criterion and having low FEV1 (ie, GOLD stage 2 or higher disease) and subgroup meeting the LLN criterion and having low FEV1. A P value of less than .05 was considered to be statistically significant.
RESULTS
Analyses were based on 4,882 CanCOLD study participants who had complete data on all study parameters (Table 1). Their mean age (±SD) was 57 (±11) years, with a range of 40 to 93 years; 146 (3%) were older than 80 years. Forty-three percent were male. Their mean FEV1 (±SD) was 95.0% (±17%) of the normal predicted value.
The prevalence of spirometric airflow limitation was 17% by fixed ratio and 11% by lower limit of normal (Table 1). The majority of patients with airflow limitation satisfied both spirometric criteria as there was considerable overlap (concordance) between individuals meeting the fixed ratio and LLN cutoffs for diagnosis as shown in the Venn diagram depicting the proportions satisfying various criteria (Figure 1). The number of patients with airflow limitation by LLN but not by fixed ratio (discordance) was small, at just 15 (0.3%); these patients were on average younger and almost always female (Table 1). The prevalence of airflow limitation by fixed ratio was greater than that by LLN, except among young women; the greatest differences were seen among older adults (Figures 2 and 3).
Associations With Outcomes
Compared with peers who met neither criterion for airflow limitation, patients having airflow limitation based on fixed ratio only were significantly more likely to have physician-diagnosed wheeze (odds ratio [OR] = 1.54) and cardiovascular disease (OR = 1.52); associations with other outcomes showed only nonsignificant trends (Table 2). In contrast, patients who met both the fixed ratio and LLN criteria for airflow limitation were at significantly increased risk for 7 of the outcomes studied, with particularly strong associations seen for wheeze (OR = 3.25), chronic bronchitis (OR = 3.14), and chronic phlegm (OR = 3.04). Additionally, these patients meeting both criteria also had elevated risk when compared with peers meeting the fixed ratio criterion alone (Table 3).
Modifying Factors and Sensitivity Analysis
Analyses did not reveal any significant interaction between the spirometric criteria for airflow limitation and potential modifying factors such as sex, age, and smoking status. Furthermore, excluding patients with cardiovascular disease did not affect any of the observed associations.
Table 4 shows the impact of the severity of airflow on the associations with patient-reported outcomes in 6 groups of patients stratified by the FEV1/FVC criteria—fixed ratio or LLN—and further by FEV1 status (normal vs low). The results indicated that patients meeting either FEV1/FVC criterion had higher risks of some clinical outcomes, but the addition of a low FEV1 to either or both criteria greatly strengthened the associations; patients having such airflow limitation had 2 to 5 times higher odds of most adverse outcomes relative to counterparts having no airflow limitation. Notably, only individuals satisfying either the fixed ratio or LLN criterion (or both) who also had a low FEV1, indicating moderate to severe airflow limitation, had significantly elevated odds of cardiovascular disease (ORs = 1.51–1.56).
DISCUSSION
Key Findings
In this study, we found that airflow limitation defined solely by either of the basic criteria for FEV1/FVC, namely, the fixed ratio or the LLN, was weakly and variably associated with patient-reported poor outcomes such as symptoms, disability, impaired health status, and exacerbations, with stronger associations seen for those meeting the latter criterion. Patients meeting both of these criteria had more consistent and greater increases in risk. But patients meeting either criterion and in addition having a low FEV1 were the most likely to experience poor outcomes. Cardiovascular disease risk was most elevated for patients who had moderate airflow limitation as determined by either FEV1/FVC cutoff criterion plus a low FEV1.
Interpretation
Overall, our results concur with findings from previous studies on patient-reported outcomes. These studies showed that patients with airflow limitation determined by an unqualified fixed ratio (meaning irrespective of FEV1 level) had higher risks of several patient-reported outcomes relative to unaffected peers. When compared with patients having limited airflow defined similarly using unqualified LLN, these associations were somewhat weaker.20
Our results differ from those of previous studies in several aspects. First, the results indicated that overdiagnosis could be a problem when using a single criterion, either fixed ratio or LLN, as patients with overdiagnosis, having a normal FEV1, (one of at least 80% of predicted), appear to experience few patient-reported outcomes; that is, they do not appear to have clinically relevant disease. This lack of clinical relevance may be related to the fact that COPD found in the population may be milder than that reported in patient cohorts. The diagnosis of COPD in patients with mild disease has been debated before.14,24,25 Second, we found that the associations were modified by the level of lung function. Excluding patients with mild airflow limitation as evidenced by a normal FEV1, the use of either fixed ratio or LLN showed clear and firm associations with symptoms and exacerbations and worsening health status and disability. Third, the significant association between cardiovascular disease and airflow limitation determined by an unqualified fixed ratio criterion may suggest a misdiagnosis of COPD in these patients. Heart failure could potentially mimic airflow obstruction in such patients and raises the question as to whether clinicians should suspect undiagnosed cardiovascular disease in those patients with no or mild airflow limitation (FEV1 ≥80%), especially in the elderly, as determined by fixed ratio only.
Additionally, falsely applying the COPD label to patients with dyspnea who do not have undisputed evidence of airflow obstruction can lead to unnecessary and inappropriate treatments. It is conceivable that a high mortality for Group B in the new GOLD classification reflected the circumstance wherein patients had heart disease but were receiving care only for COPD.26 Finally, our results from this study support the use of either fixed ratio or LLN, further qualified by a low FEV1 in the diagnostic confirmation of COPD.
Strengths and Limitations
Although the relationship between diagnostic criteria for COPD and patient-reported outcomes has been previously reported,20 we believe that this is the first study that also includes the evaluation of health status as measured by the SF-12 and examines potential factors that could modify the associations with outcomes. In particular, we systematically assessed the effect of airflow limitation. Although GOLD stage 2 or higher classification has been previously compared with fixed ratio and LLN criteria, our study is the only one to use postbronchodilator spirometry data.27,28 Another strength lies in the derivation of data from the general population and not from a convenience sample of patients as in other published studies. The data come from a large multicenter study that used strict protocols and central data management to ensure that similar measurements across study centers.
There are also several limitations to this study. First, the questionnaire responses in this large epidemiologic study were not clinically confirmed. Second, as the analyzed data were cross-sectional, the diagnostic criteria could not predict outcomes. Last, the subgroup of patients with airflow limitation determined by LLN only was too small for analysis.
Future Directions
Underdiagnosis and overdiagnosis remain challenges to the adequate management of COPD and to the optimal use of health care resources. Given recommendations that diagnosis of COPD be based on the triple criteria of presence of risk factors, symptoms, and spirometric evidence of airflow limitation (a reduced FEV1/FVC) and that the main rationale of pharmacologic treatment of COPD is the reducing symptoms, it could be argued that overdiagnosis rather than underdiagnosis is a more pressing issue in clinical practice. The results of this large cross-sectional study have suggested that a clinically relevant diagnosis of COPD is best based on the use of either cutoff criterion for a low FEV1/FVC coupled with a low FEV1 as percentage of predicted. This impression awaits definitive clarification from longitudinal studies such as the follow-up phase of the ongoing CanCOLD study, in which clinical outcomes in patients having COPD diagnosed using different criteria can be further evaluated.
Conclusions
There is persistent controversy regarding the most appropriate cutoff values for FEV1/FVC in the diagnosis of chronic airflow limitation in COPD. The results from this population-based Canadian study on clinically relevant patient-reported outcomes showed that the use of a single criterion alone—FEV1/FVC either less than 0.7 or less than the LLN—was inadequate and may misdiagnose patients with COPD (in particular those with cardiovascular symptoms), putting them at risk for inappropriate or unnecessary treatments. Our results indicated that a low FEV1/FVC ratio by either the fixed ratio or LLN criterion coupled with a low FEV1 (<80% of predicted) is the most clinically relevant diagnostic criterion for COPD. Future guidelines may reconsider amending the current diagnostic spirometric cutoff to include this additional specification.
Footnotes
-
↵* Joint co–principal investigators.
-
↵† Group members listed at end of article.
-
Conflicts of interest: See funding support. The authors report no other conflicts of interest.
-
CanCOLD Study Group: Shawn Aaron, Kenneth Chapman, Mark J. FitzGerald, Roger Goldstein, Paul Hernandez, François Maltais, Darcy Marciniuk, Dennis O’Donnell, Donald Sin, Robert Cowie, Harvey Coxson, Jonathon Leipsic, Cameron Hague, Jeremy Road, Juli Atherton, Carolyn Baglole, Styliani Daskalopoulou, J. Pierre Després, Andrea Gershon, Kim Lavoie, David Miedinger, Remi Rabasa, Machelle Wilchesky, Ross Andersen, MarieEve Doucet, Jessica Evans, Carlo Marra, John Kimoff, Peter Pare, James C. Hogg, Hélène Perrault, Bryna Shatenstein, Bill Shell, Tanja Taivassalo, Teresa To, Carole Jabet, and Maria Sedeno.
-
Funding support: The Canadian Cohort Obstructive Lung Disease (CanCOLD) study is funded by the Canadian Institute of Heath Research (CIHR/Rx&D Collaborative Research Program Operating Grants- 93326); industry partners AstraZeneca Canada Ltd, Boehringer-Ingelheim Canada Ltd, GlaxoSmithKline Canada Ltd, Merck, Novartis Pharma Canada Inc, Nycomed Canada Inc, Pfizer Canada Ltd; the Respiratory Health Network of the FRQS; and the Research Institute of the McGill University Health Centre.
-
Disclaimer: The funding sponsors had no role in the design of the study; the collection, analysis, or interpretation of the data; or the writing of the article.
- Received for publication April 15, 2014.
- Revision received August 26, 2014.
- Accepted for publication September 5, 2014.
- © 2015 Annals of Family Medicine, Inc.