Background: International guidelines recommend that pulmonary reference populations consist of never-smokers without respiratory diseases or symptoms, but the diseases and symptoms are not clearly specified. The present study aimed to identify simple exclusion criteria for defining pulmonary reference populations.
Methods: Based on a random sample from a general population (the parent population), 2358 subjects aged 26–82 years performed spirometric tests. From this sample, subjects were stepwise excluded according to self-reported obstructive lung diseases, symptoms and smoking history. Four increasingly more healthy respiratory reference populations were formed. Prediction equations for the median and lower limit of normal lung function were derived using quantile regression analysis.
Results: Subjects without self-reported obstructive lung diseases or the cardinal respiratory symptoms of breathlessness, cough or wheeze (population B), never-smokers without cardinal symptoms (population C) and never-smokers without any respiratory symptoms (population D) constituted 50% (n = 1184), 23% (n = 539) and 14% (n = 331) of the parent population (population A), respectively. The largest discrepancy between prediction equations was found between the parent population and the population without cardinal respiratory symptoms (population B) (p<0.05). Minor changes in the reference equations were also seen when excluding ever-smokers (population C). There was no additional change with exclusion of other respiratory symptoms (population D). Age-related decline in lung function was steepest in the parent population.
Conclusions: Obstructive lung diseases, smoking history, breathlessness, cough and wheeze are optimal exclusion criteria for a pulmonary reference population. Further validation of the exclusion criteria identified in this study is recommended with identical wording in other and larger multinational populations.
- FEV1, forced expiratory volume in 1 s
- FVC, forced vital capacity
- LLN, lower limit of normal
Statistics from Altmetric.com
According to recommendations from the American Thoracic Society (ATS), reference values for normal lung function should be derived from lifetime non-smokers free of respiratory diagnoses and symptoms.1 The newly released joint guidelines from the ATS and the European Respiratory Society (ERS) also support these recommendations.2 The general nature of such recommendations results in various interpretations regarding exactly which respiratory symptoms should be excluded from a reference population.
During the last 30 years, numerous reference values for normal lung function have been published.3,4,5,6,7,8,9,10,11,12,13,14,15,16,17 Although they have all been estimated on the basis of healthy reference populations, the definition of a healthy population has varied from study to study. There is therefore considerable variation in the relative size and characteristics of reference populations. No previous study has assessed the implications of different definitions of reference populations for pulmonary prediction equations in a general population.
The existing literature shows an association between reduced forced expiratory volume in 1 s (FEV1) and the respiratory symptoms of wheeze, breathlessness and cough.18–21 Chronic phlegm, on the other hand, has in some surveys not been associated with airflow obstruction or reduced FEV1 levels,19 suggesting that perhaps not all respiratory symptoms need to be accounted for in a healthy reference population used for derivation of reference values.
The main objective of our study was to identify simple exclusion criteria for defining pulmonary reference populations. We compared pre-bronchodilator reference values derived from population samples representing increasingly healthier and narrower reference population definitions with regard to respiratory status.
The present study was based on the second phase of the Hordaland County Cohort Study. Information on sampling procedures and data collection in this longitudinal epidemiological study has been reported previously.22,23 Briefly, the baseline survey population was a random sample of the general adult population in Western Norway in 1985 within the age range 15–70 years. All adults in the study area were eligible for inclusion. A random sample of the population was drawn by Statistics Norway and invited to participate in the first phase of the Hordaland County Cohort Study. Based on the first phase in 1985, 2358 subjects of those eligible for follow-up (74%) participated in a second phase in 1996–7 comprising both questionnaires and spirometry. The questionnaires contained detailed questions on disease history, respiratory symptoms, occupational exposure to airborne agents, smoking history and educational level. Standing height without shoes was measured at the spirometric examination to the nearest centimetre. Weight was measured with light indoor clothing to the nearest 0.5 kg.
Respiratory disorders and smoking habits
The participants reported the presence or absence of 15 respiratory diseases and symptoms in a self-administered questionnaire (see Appendix in online supplement available at http://thorax.bmj.com/supplemental). The items in the questionnaire originated from the British Medical Research Council questionnaire (BMRC), the Norwegian Respiratory Questionnaire (NRQ) and from the European Community Respiratory Health Survey questionnaire (ECRHS).24–28
Subjects were classified as current smokers, ex-smokers and never-smokers according to self-reported daily smoking habits. Self-reported smoking habits in this population have been previously validated by carboxyhaemoglobin measurements of venous blood samples with an OSM3 Hemoximeter (Radiometer, Copenhagen, Denmark).29,30 Occupational exposure to airborne agents was defined as an affirmative answer to the question “Have you ever had a work-place with much dust or fumes in the air?”.
Pre-bronchodilator forced vital capacity (FVC) and FEV1 were measured with a dry wedge spirometer (Vitalograph S-model) according to the ATS criteria.31 The spirometer was calibrated each morning and afternoon with a 1 litre syringe. The participants were seated and wearing nose clips, and received standardised instructions from the laboratory technician. At least three measurements of FVC and FEV1 were obtained from each subject. The spirometric test was satisfactory when the two highest FVC measurements differed from each other by <300 ml. The highest FVC and FEV1 values were used in the FEV1/FVC ratio regardless of whether or not they came from the same trial. Room temperatures ranged from 19 to 24°C with a mean (SD) of 22 (0.5)°C.
Starting with the general study population (the parent population), four increasingly healthy respiratory reference populations were formed. These population types were obtained stepwise by excluding subjects based on the presence of numerous respiratory characteristics (obstructive lung diseases, respiratory symptoms and smoking history). The initial exclusion order of these characteristics was based on both clinical and methodological considerations. For each step, t tests were performed to assess whether mean lung function differed between subjects who had this characteristic and the remaining population who did not have this characteristic. If the t test was significant (p⩽0.05), subjects with the characteristic in question were excluded from the population before moving on to analysis of the next respiratory characteristic.
After identifying population samples at four delimitation levels, the direct standardisation method32 was used to standardise mean lung function in population types B, C and D, respectively, according to the sex, age and height distribution of population type A (the parent population).23
Sex-specific reference values for the median and lower limit of normal (LLN, 5th percentile) lung function were derived from all four populations using quantile regression analysis with age and height as predictors.
Interaction effects between age and height and the four population categories were examined. When plotting the modelled reference values for FEV1 and FVC against age, we standardised height using mean values. All statistical analyses were performed using Stata SE 9.1, Release 9.0 (Stata Statistical Software, StataCorp, 2005).
Four reference population types
We formed four populations (types) using four delimitation levels (table 1): (A) the parent population with none excluded; (B) a reference population excluding subjects with self-reported obstructive lung diseases and cardinal respiratory symptoms that led to a statistically significant decrease in lung function; (C) a reference population of never-smokers without obstructive lung diseases or respiratory cardinal symptoms; and (D) a reference population of never-smokers without obstructive lung diseases or any respiratory symptoms (a strict interpretation of the ERS/ATS recommendations). Cardinal respiratory symptoms were breathlessness (grades 1–4), cough (chronic cough and morning cough) and wheeze (table 1). Exclusion of subjects with these symptoms increased standardised mean FEV1 by 0.13 l, FVC by 0.12 l, and FEV1/FVC by 0.011. Exclusion of ever-smokers led to a further increase in mean lung function. However, exclusion of subjects with minor respiratory symptoms (phlegm, attacks of breathlessness, day cough, infancy lung disease, cough with cold, woken by breathlessness, breathlessness and wheeze) did not influence lung function.
The parent population (population A) consisted of 2358 subjects aged 26–82 years, of which 48% were men. Almost two-thirds of the men (63%) and one-third of the women (32%) reported occupational exposure to dust or gas (table 2); 25% of men and 22% of women had attained university education.
The reference population without obstructive lung diseases or the cardinal respiratory symptoms of breathlessness, cough or wheeze (population type B) consisted of 1184 subjects (50% of the parent population). They were slightly younger than the parent population and had generally higher lung function. Fewer reported occupational exposure to dust or gas while more had higher education.
Population types C and D (never-smokers without cardinal respiratory symptoms and never-smokers without any respiratory symptoms) consisted of 539 (23%) and 331 (14%) subjects, respectively. The tendency for less occupational exposure and higher education observed in the transition from the parent population to the population without cardinal respiratory symptoms remained stable. The mean age of the men was lower in population types C and D while the mean age of the women was higher (table 3). Height and weight were similar in all four population types.
Prediction equations derived from the four population types
After performing curve-estimation analysis to test for linear, quadratic, cubic, logarithmic and exponential associations, we found that simple linear models with age and height as predictors for FEV1 and FVC gave the best fit for all four population types (see tables E1–E4 in the online data supplement available at http://thorax.bmj.com/supplemental). FEV1/FVC also had a linear association with age, although the tendency was not statistically significant for men from population types C and D. The ratio was not associated with height in any of the populations.
For men, reference values for median and LLN lung function had a steeper decline with age for the parent population than for the other three populations (p<0.05, fig 1). Age-related equation difference in FEV1/FVC was also significant between population types B and C (p<0.05, fig 1). For women, LLN had a steeper decline with age in the parent population (p<0.05), while the tendency was less clear for median lung function. The population without cardinal respiratory symptoms, the never-smoking population without cardinal respiratory symptoms and the never-smoking population without any respiratory symptoms all produced similar equations for FEV1, FVC and FEV1/FVC (fig 1).
As expected, the confidence intervals of the regression coefficients for FEV1, FVC and FEV1/FVC were larger for LLN than for median since the SD for the 5th percentile is larger than for the 50th percentile (figs E1–E3 in the online supplement available at http://thorax.bmj.com/supplemental).32
Prediction coefficients for FEV1
Intercept coefficients from the equations for median and LLN FEV1 did not differ markedly between the four population types, indicating that differences between them are related to age or height associations rather than being constant (fig E1 in the online supplement available at http://thorax.bmj.com/supplemental). Height coefficients were higher for men than for women in all four populations. However, they did not differ between the four gender-specific population types (p>0.05).
The decrease in lung function with age was steeper for men than for women in the parent population and in population type B without cardinal respiratory symptoms (fig E1 in the online supplement available at http://thorax.bmj.com/supplemental). A similar sex difference was not observed in population types C and D, indicating that decline in FEV1 with age is similar for men and women in these two population types. Equations for median FEV1 in men (not women) and for LLN FEV1 in both men and women from the parent population had more negative age coefficients than the remaining three population types (p<0.05).
Prediction coefficients for FVC
Height coefficients for median FVC were higher overall for men than for women in all the four population types, suggesting that lung function increases more with increasing height in men than in women (fig E2 in the online supplement available at http://thorax.bmj.com/supplemental). The height coefficient was lower for predicted LLN FVC among women in population type D than among women in the other three populations (p<0.05).
Age coefficients for FVC were, on the whole, more negative for men than for women, suggesting a steeper decline in FVC with age for men than women. Equations derived for men in the parent population gave more negative age coefficients than equations derived for the other three population types (p<0.05), suggesting that FVC declines more steeply with age in a general male population than in a healthy male population.
Prediction coefficients for FEV1/FVC
Height was not a predictor of FEV1/FVC for any of the four population types. Age was a significant predictor for median FEV1/FVC among both men and women from population types A and B, and also among women from population types C and D. With regard to LLN FEV1/FVC, age was a significant predictor in men and women in the parent population (type A) and in women in population type B.
There were no significant differences between intercepts from one population type to another.
Age coefficients for median FEV1/FVC in men and LLN FEV1/FVC in women were more negative in the parent population than in the population without cardinal respiratory symptoms, suggesting a steeper age-related decline in FEV1/FVC in the parent population (p<0.05, fig E3 in the online supplement available at http://thorax.bmj.com/supplemental).
Based on a general population sample of the second phase of the Norwegian Hordaland County Cohort Study, we identified four delimitation levels defining four increasingly healthier and narrower pulmonary reference population types. We derived and compared prediction equations for all four population types and found that self-reported obstructive lung diseases, smoking history, breathlessness, cough and wheeze were optimal exclusion criteria when defining a pulmonary reference population. As expected, predicted lung function values derived from the parent population (population type A) were lower and had a much steeper age-related decline than lung function values derived from the other increasingly healthier population types. Some minor differences were also found between predicted values for the population without the cardinal respiratory symptoms breathlessness, cough and wheeze (population type B) and predicted values for the never-smoking population without the cardinal respiratory symptoms (population type C). The never-smoking population without cardinal respiratory symptoms and the never-smoking population without any respiratory symptoms did not differ from each other with regard to lung function equations.
This is the first study to use analysis of the associations between self-reported respiratory symptoms and lung function as a tool to identify simple exclusion criteria for a pulmonary reference population. It is also the first study to compare various definitions of pulmonary reference populations with a general population sample. Such comparison enabled us to examine associations between lung function and the predictors age, height and sex, both in a general population and in increasingly respiratory healthier and narrower reference populations.
A limitation of this study is the age range. The study did not include children and there were only a limited number of elderly subjects, especially men. Furthermore, the study population consisted of Caucasian subjects from an affluent Western country. In clinical practice it is important that reference values are derived from a population that shares such basic characteristics with the patient population.33 There is a need for other studies, preferably with a larger proportion of elderly subjects, to assess whether the exclusion criteria identified here are also optimal in other types of populations.
It is possible that the study population was healthier than a general population since all analyses were based on a follow-up study rather than a cross-sectional study. However, a previous report from the Hordaland County Cohort Study showed overall small differences between responders and non-responders in the follow-up survey in 1996–7, and there were no differences in the associations between risk factors such as age and smoking, and respiratory disorders.34
Several factors potentially affecting lung function were not examined in the present study—for example, abnormal chest radiographs, diabetes, cardiovascular disease, a history of tuberculosis and malnutrition. It is possible that the presence of such characteristics and diseases might have influenced lung function values in the reference population types. However, a previous study of the same population cohort in 1987 assessed that only 15 of 540 subjects (3%) had diseases that might affect pulmonary function,14 suggesting that such factors are not widespread in a healthy reference population.
The lack of detail in the international recommendations has led to differences in reference population exclusion criteria between studies. The number of exclusion criteria in various published studies presenting spirometric prediction equations varies from 3 to 20 and heavily influences the relative size and health status of the reference populations.5,8,10,16,35,36 Furthermore, the wording of questions has been shown to influence prevalence rates.26 In the present study we have used a questionnaire with questions originating from three previously published and validated respiratory questionnaires.24,26,27 The NRQ has been used extensively for more than 35 years in Norway.37,38 The other two questionnaires have been mostly used in Europe, but parts of them have also been applied in US studies.5,39 To enable comparable lung function prediction equations across studies, it is important to promote uniform exclusion criteria for reference population, with standardised wording of questions. To ensure standardised wording across languages, the methodology of translation and back-translation is recommended to enhance questionnaire reliability and validity.
Prediction equations for LLN lung function are used in clinical practice to determine abnormal spirometric rates. If airways obstruction is defined as a FEV1/FVC ratio below the LLN, the prevalence would be 5.0% in the parent population based on the type A reference equations. When implementing LLN reference equations from population types B, C and D, however, the percentage of subjects with airway obstruction in the parent population increased to 9.5%, 11.4% and 10.1%, respectively. Following a stricter definition of airway obstruction as the presence of both FEV1/FVC and FEV1 below the LLN, the prevalence would be 2.4% with type A reference equations, 5.3% with type B equations, 6.7% with type C equations and 6.1% with type D equations (results not shown). In any case, the most important difference between abnormal spirometric rates occurs when the cardinal respiratory symptoms of breathlessness, cough and wheeze were excluded from the general population. Some difference was also observed when never-smokers were excluded. However, results from the present study suggest that maintaining reference populations without any respiratory symptoms at all would have no further clinical implications.
In the present study, exclusion of ever-smokers increased observed lung function in the male reference population but decreased lung function in the female reference population. This is probably due to a sex difference in the association between smoking habits and age. While exclusion of ever-smokers made the male reference population considerably younger, it made the female reference population older. More young women than elderly women were smokers or ex-smokers. Even with the absence of respiratory symptoms, the ageing of the female population when excluding ever-smokers led to a natural decrease in lung function. When adjusting for age in the prediction models, expected lung function was higher after exclusion of never-smokers among both men and women. Furthermore, a previous study from the same population has shown that smoking was a strong predictor for the incidence of chronic obstructive pulmonary disease during a 9 year period for subjects with normal lung function at baseline,40 rendering support to the notion that ever-smokers should be excluded from reference populations even if they do not report respiratory symptoms. On the other hand, one could argue that exclusion of ever-smokers from reference populations is not possible in all parts of the world. In some developing countries the proportion of adult never-smoking men may be so small that it would be necessary also to include healthy ever-smokers in reference populations in order to estimate predicted lung function. Future studies should further explore similarities and differences between healthy ever-smokers and healthy never-smokers with regard to respiratory status.
Reference population criteria in the present study entailed self-reported respiratory symptoms rather than information on respiratory symptoms obtained by a physician in a clinical examination. Self-administered questionnaires have several advantages over physician-administered interviews. Observational bias affecting answers is not a problem, and the reproducibility for research purposes is better in the absence of a physician’s subjective interpretation.41,42 A disadvantage with the use of self-administered questionnaires, however, is that it depends on a literate study population.
Never-smokers without cardinal respiratory symptoms (population type C) and never-smokers without any respiratory symptoms (population type D) constituted 23% and 14% of the parent population (population type A), respectively. Whether we used population type C or D to derive prediction equations had no implications on the resulting reference values. There are important methodological advantages to keeping the reference population as large as possible, relatively speaking. Narrow reference populations will result in higher statistical uncertainty concerning the reference value estimates than will broader reference populations owing to fewer observations, especially in the older age groups. Rigid exclusion criteria will lead to selection bias and skewed age distributions. More elderly subjects than younger subjects suffer from respiratory symptoms, so older age groups will be under-represented in a reference population based on rigid exclusion criteria. This, in turn, may influence the association between age and lung function as observed in the present study where the association between age and FEV1/FVC did not reach statistical significance in the healthiest population types. There were only 13 never-smoking men in the 70–82 year age group without cardinal symptoms (population type C) in the present study. A larger proportion of elderly subjects in the reference population would perhaps lead to a significant reduction in FEV1/FVC with age, more in line with what has been observed in other studies.1
A similar selection bias was also observed with occupational exposure to dust or gas. After exclusion of cardinal respiratory symptoms, additional sex-stratified analyses showed no difference in lung function between persons who had been occupationally exposed to dust or gas and those who had never been subjected to such exposure (p>0.05, results not shown). However, the size of the reference populations would have decreased by 57% for men and 27% for women if all occupationally exposed subjects were excluded.
Subjects with respiratory symptoms were excluded in a stepwise order based on both existing clinical and methodological considerations.19–21 The symptom whose exclusion led to the largest increase in mean lung function was excluded first. It could be argued that the order of exclusion influenced the resulting reference population exclusion criteria. However, additional analyses in which the exclusion order was changed among the seven cardinal respiratory symptoms (breathlessness grades 1–4, morning cough, chronic cough and wheeze) gave the same overall results regarding statistical significance and change in lung function. Breathlessness grades 1–4 and morning cough also remained significant if ever-smokers were excluded first (p<0.05, results not shown).
Exclusion of all ever-smokers with obstructive lung diseases or any respiratory symptoms in the present study left only 14% of the total population. However, excluding only ever-smokers with obstructive lung diseases or the cardinal respiratory symptoms of breathlessness (grades 1–4), cough (morning cough and chronic cough) and wheeze resulted in a substantially larger and valid reference population. We believe that these exclusion criteria will be feasible and sufficient to enable derivation of prediction equations comparable across studies. We recommend testing the exclusion criteria with identical wording in both existing and future larger populations for further validation. The results from the present study should be included in the future discussion of a more detailed and standardised international definition of pulmonary reference populations.
The authors are indebted to the Centre for Clinical Research at Haukeland University Hospital, to respiratory laboratory technician Lene Svendsen, statistician Roy Miodini Nilsen, and to Professor Paul Enright for valuable comments.
Published Online First 27 March 2007
The Hordaland County Cohort Study was funded from the Royal Norwegian Council for Scientific and Industrial Research and the Norwegian Research Council.
Competing interests: None declared.