Article Text


Profiling serum biomarkers in patients with COPD: associations with clinical parameters
  1. Victor Pinto-Plata1,
  2. John Toso2,
  3. Kwan Lee2,
  4. Daniel Park2,
  5. John Bilello2,
  6. Hana Mullerova2,
  7. Mary M De Souza2,
  8. Rupert Vessey2,
  9. Bartolome Celli1
  1. 1Pulmonary, Critical Care and Sleep Division, Caritas St Elizabeth’s Medical Center, Tufts University, Boston, Massachusetts, USA
  2. 2Discovery Medicine, High Throughput Biology and Biomedical Data Sciences, GlaxoSmithKline R&D, USA
  1. Correspondence to:
    Dr Bartolome R Celli
    Caritas St Elizabeth’s Medical Center, 736 Cambridge Street, Boston, Massachusetts 02135, USA; bcelli{at}


Background: Chronic obstructive pulmonary disease (COPD) is an inflammatory lung disease associated with significant systemic consequences. Recognition of the systemic manifestations has stimulated interest in identifying circulating biomarkers in these patients. A systematic analysis was undertaken of multiple protein analytes in the serum of well characterised patients with COPD and matched controls using novel protein microarray platform (PMP) technology.

Methods: Forty-eight patients (65% men) with COPD (forced expiratory volume in 1 s <55%) and 48 matched controls were studied. Anthropometric parameters, pulmonary function tests, 6-minute walk distance, the BODE index and the number of exacerbations were measured and the association of these outcomes with the baseline levels of 143 serum biomarkers measured by PMP was explored.

Results: Thirty biomarker clusters were identified and ranked by computing the predictive value of each cluster for COPD (partial least squares discriminant analysis). From the 19 best predictive clusters, 2–3 biomarkers were selected based on their pathophysiological profile (chemoattractants, inflammation, tissue destruction and repair) and the statistical significance of their relationship with clinically important end points was tested. The selected panel of 24 biomarkers correlated (p<0.01) with forced expiratory volume in 1 s, carbon monoxide transfer factor, 6-minute walk distance, BODE index and exacerbation frequency.

Conclusion: PMP technology can be useful in identifying potential biomarkers in patients with COPD. Panels of selected serum markers are associated with important clinical predictors of outcome in these patients.

Statistics from

Chronic obstructive pulmonary disease (COPD) is projected to be the third leading cause of death in the world by the year 2020.1,2 Despite the well-documented role of cigarette smoking in the genesis of COPD, it is unclear what steps are involved in its pathogenesis.3 Most, if not all, patients with COPD develop a combination of lung emphysema with its characteristic pattern of alveolar destruction and abnormal repair as well as small airway inflammation that persists even years after smoking cessation.4

The current pathogenetic theories for the development of COPD include an imbalance between the protease and antiprotease system, dysregulation of oxidant-antioxidant activity and chronic airway inflammation, processes that lead to the progressive destruction and abnormal repair of the lung connective tissue matrix.5 Recent studies have suggested that increased apoptosis of the alveolar wall accounts in part for the loss of lung tissue that characterises emphysema.6,7 Transgenic and null mutant mouse studies have identified a number of genes and pathways that, when altered, result in the morphological changes of emphysema.8,9,10

Although COPD primarily affects the lungs, it is associated with important systemic consequences which include malnutrition with a low body mass index (BMI)11 and impaired peripheral muscle function.12 These clinically relevant expressions of the disease have been associated with detectable systemic changes including evidence of increased oxidative stress, activation of circulating inflammatory cells and increased levels of proinflammatory cytokines.13,14 The multidimensional expression of COPD can be expressed by a clinical score including BMI, degree of obstruction (O), perception of dyspnoea (D) and exercise capacity (E) by the 6-minute walk distance known as the BODE index.15 This index predicts mortality better than the forced expiratory volume in 1 s (FEV1).

We reasoned that the pathobiological processes that occur in the lungs and possibly in systemic tissues such as the peripheral muscles of patients with COPD could be associated with systemic biomarker levels detectable in the systemic circulation. Despite the many studies aimed at identifying the pathogenesis of COPD, to our knowledge only one study16 has explored the potential value of high-density microarray technology to systematically define the serum protein expression profile in patients with COPD. Using a novel protein microarray platform (PMP) technology, we compared the serum proteomic profile of 143 serum biomarkers in patients with COPD with that of age and sex-matched controls. We also explored the relationship between a selected subset of 24 biomarkers with clinically important outcome variables in COPD including lung function, the BODE index and its components and the frequency of exacerbations.


Patient recruitment

This is a matched case-control study of 48 patients with severe COPD (FEV1 <55% predicted), 8 of whom were current smokers. We then matched 8 control smokers and 40 subjects who had smoked <5 pack years and had stopped at least 20 years previously or who had always been non-smokers. All controls had a ratio of FEV1 to forced vital capacity (FVC) of >0.7 and FEV1 >70% predicted. Participants were >35 years of age and patients with COPD had to be clinically stable and without exacerbations for at least 3 months. Subjects with a history of asthma or atopy, conditions precluding performance of the tests, and a systemic infection or an inflammatory process that could be associated with abnormal biomarker profile were excluded. All patients were followed for 1 year and were stratified according to smoking history into ex-smokers (never smoked or ex-smokers for >15 years and <20 pack-years) and active smokers. The controls were frequency matched according to sex, age and smoking history (table 1).

Table 1

 Cases and controls stratified by exacerbation frequency and smoking status

The pulmonary function tests were measured according to ATS standards17 and the BODE index was calculated as previously reported.15 Exacerbations were defined as episodes of increased dyspnoea, sputum or cough lasting >24 h and requiring treatment with antibiotics and/or corticosteroids.18 After follow-up for 1 year, patients were stratified into no exacerbations (n = 12), <2 exacerbations (n = 12) and ⩾2 exacerbations (n = 15).

Specimen collection

Blood samples were drawn, centrifuged and the serum frozen at –80°C. Rolling cell amplification (RCA) immunoassay was performed by Molecular Staging Inc (MSI, New Haven, Connecticut, USA) using a protein microarray platform that measured levels of 143 analytes (see table S1 available online at on five separate arrays.19,20 After incubating and washing the serum samples on microarrays, the captured proteins were detected by specific biotinylated second antibodies and a universal anti-biotin antibody was bound to the secondary antibodies. The anti-biotin antibody contained an oligonucleotide DNA primer used for amplification. During the process, a circular DNA hybridises to the oligonucleotide DNA primer in the presence of DNA polymerase and fluorescent nucleotides to generate a signal. Following RCA, the slides were scanned (L200 scan, TECAN, Durham, North Carolina, USA) using a proprietary software. The fluorescence intensity of microarray spots was analysed and the resulting mean intensity values were measured. Dose-response curves for the biomarkers were determined with increasing intensity indicating increasing analyte concentration.

Data analysis

A more complete discussion of the analysis used in this study is available in the online supplement at In summary, two independent statistical approaches were used: (1) we tested the distribution of biomarkers for an association with COPD by univariate analysis adjusting for multiple comparisons using false discovery rate analysis;21 and (2) we used a variable clustering (VARCLUS) tool which divides the biomarkers into non-overlapping unidimensional groups or clusters,22 a process similar to factor analysis. Each cluster’s predictive value was determined by computing the partial regression coefficient of individual cluster centroids with COPD using partial least squares discriminant analysis (PLS-DA). After the initial analysis, we selected a group of 24 biomarkers from those clusters that showed a significant association with the diagnosis of COPD (clinical history and presence of airflow limitation). The biomarkers were chosen to reflect a variety of pathobiological mechanisms relevant to the disease process. The resultant panel of biomarkers was then tested for strength of association with variables known to predict outcome in COPD, including transfer factor for carbon monoxide (Tlco),23 6-minute walk distance (6MWD), the BODE index and exacerbation frequency.


Study population

The characteristics of the patients and the controls are summarised in table 2. As expected, the patients had higher smoking exposure (pack-years), significant airflow limitation, higher lung volumes, worse BODE scores and health-related quality of life than controls. Patients and controls were of similar age, sex and BMI.

Table 2

 Basic characteristics of patients with COPD and controls*

Biomarkers that distinguish between patients with COPD and controls

In the univariate analysis, 43 biomarkers were identified that differed between patients and controls. To adjust for multiple analysis, these were filtered by false discovery rate adjusted p value (FDR_p) of <0.015 (table 3).

Table 3

 Statistical comparison of biomarkers in patients with COPD and controls

The second approach (variable cluster analysis) resulted in 30 different clusters, 19 of which correlated significantly with the diagnosis of COPD. We selected biomarkers from among these 19 clusters to reflect a variety of pathophysiological mechanisms considered relevant to COPD. In order to enrich the exploratory value of the panel, two biomarkers—prolactin and plasminogen activator inhibitor type 2 (PAI-II)—were included despite lack of an obvious disease association. The selected panel biomarkers are shown in table 4 and their full description is given in the online supplement available at

Table 4

 Biomarkers selected for analysis

Associations of the biomarker panel with FEV1, Tlco, 6MWD, BODE index, BMI and exacerbation rate in patients with COPD

In the patients with COPD, the selected biomarkers tested in the panel correlated significantly with FEV1 (fig 1). The findings were replicated for the Tlco (fig 2), the BODE index (fig 3) and the exacerbation rate (fig 4).

Figure 1

 Correlation of the selected biomarker panel with forced expiratory volume in 1 s (FEV1) in patients with COPD. The size of the bar in the graph indicates the magnitude of the regression coefficients and the 95% confidence interval is also indicated for each bar. If the confidence interval includes zero, the associated biomarker is “not significant”. The overall regression model was significant by a permutation test (p<0.01). The standardised coefficients for this and for figs 2, 3 and 4 are for scaled and centred markers and scaled response. These coefficients can be used to interpret the influence of the markers on the clinical response. The standardised regression coefficient for each marker measures the effect of the marker on the clinical response adjusted for all other markers in the regression (partial correlation). The coefficients can also be compared across the clinical responses since they are scaled. Definitions of the biomarkers are given in the footnote to table 4.

Figure 2

 Correlation of the selected biomarker panel with carbon monoxide transfer factor (Tlco) in patients with COPD. The size of the bar in the graph indicates the magnitude of the regression coefficients and the 95% confidence interval is also indicated for each bar. If the confidence interval includes zero, the associated biomarker is “not significant”. The overall regression model was significant by a permutation test (p<0.01). Definitions of the biomarkers are given in the footnote to table 4.

Figure 3

 Correlation of the selected biomarker panel with the BODE index in patients with COPD. The size of the bar in the graph indicates the magnitude of the regression coefficients and the 95% confidence interval is also indicated for each bar. If the confidence interval includes zero, the associated biomarker is “not significant”. The overall regression model was significant by a permutation test (p<0.01). Definitions of the biomarkers are given in the footnote to table 4.

Figure 4

 Correlation of the selected biomarker panel with the exacerbation rate in patients with COPD. The size of the bar in the graph indicates the magnitude of the regression coefficients and the 95% confidence interval is also indicated for each bar. If the confidence interval includes zero, the associated biomarker is “not significant”. The overall regression model was significant by a permutation test (p<0.01). Definitions of the biomarkers are given in the footnote to table 4.

We also observed a correlation with the 6MWD while there was no correlation with BMI (not shown). The same selected biomarkers are shown for each analysis. Most of the markers were associated with all of the physiological indicators of disease, but the strength of the association differed from outcome to outcome as did the rank order of each biomarker.


This study had two important findings: (1) that PMP technology can be useful in identifying potential biomarkers in patients with COPD; and (2) that a pattern of systemic biomarkers identified in these patients can be associated with different clinical variables known to predict disease outcome including degree of airflow limitation, lung transfer factor, functional capacity, the BODE index and exacerbation frequency.

Several groups have shown an increase in a number of circulating inflammatory biomarkers in COPD,24–26 suggesting that it might be possible to characterise patients with COPD using systemic biomarkers. To address this question we used a novel technology that simultaneously evaluated analytes covering diverse potential processes including inflammation, chemo-attraction, cell activation, tissue destruction and repair. Based on a collaborative effort of statistical results and scientific plausibility, a subset of 24 biomarkers was identified and selected for subsequent testing against a variety of clinically important parameters. Many studies have been published on the association between a specific marker and COPD disease status, with both positive and negative results being reported.24–31 The disagreement in results can be attributed to different factors, including the heterogeneity of COPD phenotypes, low sample size, or the use of different methodologies and assays. The development of a panel of biomarkers addressing preconceived multiple pathophysiological pathways may provide a more specific tool to serve as an intermediate end point reflecting the natural history of the disease.

One obvious limitation of this preliminary dataset is that the biomarkers identified are limited by the pool of analytes that were available for the primary assessment. Clearly, use of an “open” proteomic platform would give information about a much broader range of proteins and might provide additional insights into biomarker selection and disease processes. However, a recent report16 using a panel developed from the one reported here shows that the panel as developed is valid and capable of reflecting changes induced by exacerbations. Recognising the fact that the discussion is valid only for the analytes explored, our findings may help to shed light on the underlying pathogenetic processes involved in this disease.

It has been proposed that various proteases break down lung connective tissue components to cause emphysema,5,6 leading to aberrant remodelling and/or degradation of the extracellular matrix. In our study, several proteins (table 3) related to the protease-antiprotease mechanism were clearly different between patients with COPD and controls. Thus, metalloproteinases 7, 8, 9 and 10 (MMP-7, MMP-8, MMP-9 and MMP-10) were among the proteins with large differences between groups. Of these, MMP-9 showed the strongest association with FEV1 and Tlco, which is interesting because MMP-9 has been implicated in the experimental genesis of emphysema.32,33 The tissue inhibitor of metalloproteinase 1 (TIMP-1), a collagenase inhibitor, was also different between patients and controls, providing evidence that the final expression of the disease may rest upon the appropriate balance of the system.33

Differences were also found in enzymes other than the metalloproteinases that are related to tissue destruction, as well as proteins related to repair, that deserve some comments. While the fold increase of neutrophil elastase in COPD was not as great as that found for the metalloproteinases, the difference was still statistically significant. Previous studies of experimental emphysema produced by pancreatic or neutrophil elastase showed that increased levels of elastase enzymes lead to the degradation of connective tissue components and, thus, enlargement of distal airspaces.34 While both elastin and collagen are rapidly re-synthesised in these animal models and mRNA levels for both are increased, the connective tissue remodelling process is ineffective and lung mechanical properties remain abnormal.35

The differences in tissue growth factor alpha (TGFα), amphiregulin (AR), brain-derived neurotropic factor (BDNF) and nerve growth factor β (βNGF) and their association with low FEV1 and Tlco (figs 1 and 2) suggest that connective tissue remodelling continues even in severe advanced COPD in humans, but the process fails effectively to restore the mechanical properties of the diseased lung. The role of TGFα is something of a mystery. Mice genetically manipulated to overexpress TGFα develop emphysema postnatally,36 yet an in vitro model of alveolar re-epithelialisation showed that TGFα induced faster wound repair.37 The presence of significant associations between BDNF and lung function and the BODE index (fig 3) is particularly interesting. Recent evidence indicates that BDNF decreases conversion from oxygen to hydrogen peroxide in experimental cell cultures,38 suggesting a role in the modulation of oxidative stress, and makes this an interesting marker to study. Furthermore, similar to results seen with AR, exogenous BDNF can protect cells from serum deprivation-induced cell death.39

It has been suggested that angiogenesis and apoptosis of the alveolar wall may have a role in emphysema. While little is known about the role of the EGF family member AR in the aetiology of COPD, one study has found that AR can inhibit apoptosis of non-small cell lung cancer cell line.40 Blockade of vascular endothelial growth factor R2 (VEGF-R2) receptor in rats induces apoptosis of the alveolar cell wall and results in an emphysema-like pathology.41,42 Several studies have found decreased expression of VEGF in induced sputum or bronchoalveolar lavage (BAL) fluid from patients with obstructive lung disease in comparison with normal subjects.43,44 These studies have also shown a direct association between the reduction in VEGF and FEV1. While our study showed an increase in VEGF serum content that was inversely associated with FEV1, this difference could be due to differential expression of VEGF in lung tissue and serum. Studies of VEGF expression in human lung tissue by immunohistochemistry have shown increased VEGF in pulmonary and airway smooth muscle in subjects with COPD that correlated with decreased FEV1.45 Furthermore, patients with cystic fibrosis show an inverse relationship in the level of VEGF in serum and BAL fluid compartments. These patients had a higher level of VEGF in serum and a lower level of VEGF in BAL fluid compared with controls.46 The role of apoptosis and its relationship to inflammation and repair seem supported by our findings.

Current thinking places inflammation at the centre of the pathogenetic mechanisms of COPD. The inflammation is characterised by increased numbers of alveolar macrophages, neutrophils and T lymphocytes, together with the release of multiple inflammatory mediators that result in a high level of oxidative stress. Multiple proteins related to inflammation were detected in the serum of patients with COPD (table 4). These included interleukin (IL)-12, IL-15, IL-17, IL-1 receptor antagonist (IL-1ra), tumour necrosis factor α (TNFα), tumour necrosis factor receptor 1 (TNF R1), interferon γ (IFNγ), IL-12p40 and IL-2Rγ. There is experimental evidence for the participation of all of these proteins in the inflammation that characterises COPD, and raises the possibility that the systemic manifestations of COPD may be intimately related to this process. Indeed, the association between inflammatory markers and exacerbation rate (fig 4) suggests that this manifestation of the disease could be modulated by amplification of the inflammatory cascade. In this regard, eotaxin-2—which had one of the strongest associations with the exacerbation rate in our patients—is a strong chemotactic cytokine for eosinophils,47 cells that have been found to be increased in airway biopsy tissue from patients with COPD exacerbations.48 Indeed, although the inflammatory pathways of COPD appear to be more related to lymphocytes expressing a T helper 1 (Th1) bias,49 a high level of Th2 chemokines have been reported in experimental models of emphysema induced by cigarette smoking.10

There were several novel proteins that differed between patients with COPD and controls. We selected two of them—plasminogen activator inhibitor type 2 (PAI-II) and prolactin—because of their presence in one of the eight clusters with the strongest association with COPD. PAI-II belongs to the serpine class of protease inhibitors and is involved in the thrombogenic cascade. Known to be produced by activated monocytes in the peripheral blood,50 this protein (together with PAI-I) may have a role in tissue remodelling in airways disease.51 These data warrant further investigation to explore the possible role of serpines in COPD.52 Prolactin upregulation presents an enigma. Prolactin receptor has recently been reported to be upregulated in the lungs of mice exposed to lipopolysaccharide,53 and prolactin can activate the inflammatory natural killer (NF)-κβ cascade in pulmonary fibroblasts.54 It is therefore plausible that prolactin may play a role in the inflammatory environment in COPD.

There are a number of important limitations to our study. Not all of the possible proteins that participate in the complex mechanism of COPD were tested. Absent were some with a known relationship to COPD such as C-reactive protein and fibrinogen, and some of potential importance such as MMP-12. The reason for their omission was not any preconceived mechanistic bias. Our study was designed as a proof of principle rather than a totally comprehensive evaluation of all of the markers that could potentially be explored. Many complex diseases have components related to inflammation, tissue remodelling, apoptosis and chemoattraction of specific cell types. This observation suggests that a panel of analytes might provide insight into the pathobiology of the disease under study in the absence of, or in conjunction with, novel “disease-specific” biomarkers. We also acknowledge that not all phenotypic expressions of COPD were analysed; for example, it would have been interesting to have related the biomarkers to changes in the CT scan of patients with emphysema, but unfortunately the technique needed to quantitatively express CT changes was not available. However, the Tlco does relate to the phenotypic expression of emphysema. We believe that this study represents a proof of concept and opens a window for hypothesis testing and perhaps the discovery of yet to be described pathway interactions and targets.

For the correlation analyses we attempted to address the issue of many proteins representing the same pathophysiological mechanism by empirically grouping them according to their statistical strength and their presumed pathobiological role. We acknowledge the latter to be empirical, but it is based on the data currently available and aimed at simplifying the prospective testing. Furthermore, the inclusion of too many proteins may be intellectually desirable but may cause important cross-correlative noise that may actually cloud the interpretation of the results. We also acknowledge that the patients included in the study do not represent the large population of patients with COPD since all of them had severe disease. However, the patients included represent those likely to be seen by clinicians and to benefit from new therapeutic strategies. On the other hand, this study is unique in that patients and controls were phenotypically well characterised and matched by age, sex and—very importantly—by smoking habits to minimise the hypothetical influence of these confounding factors. Indeed, the inclusion or exclusion of smokers in each of the groups did not affect the results. In addition, the evaluation of important associations of the panel markers with clinical markers of COPD such as the BODE index and its individual components offers a more comprehensive picture of the value of the technique. The association with exacerbation frequency is particularly interesting because exacerbations constitute an extremely important outcome and one where elucidation of the factors that may help prevent their occurrence would prove extremely useful. Finally, we also acknowledge that the stability of biomarker levels in serum samples is not well characterised and that we did not repeat the tests at different times. However, the recent report by Hurat and colleagues16 using the panel derived from this study independently validated our findings.

In summary, using a serum PMP, we have identified a biomarker profile whose expression levels can distinguish patients with COPD from smokers and non-smokers without COPD. We have also found an association between the level of selected biomarkers and lung function, the degree of airflow limitation and Tlco, a marker of lung tissue destruction. Furthermore, we documented an association between the expression of the serum biomarkers and the integrated local and systemic manifestations of the disease as represented by the functional capacity and the BODE index. The expression of biomarkers was also associated with the exacerbation rate, crucial events in the natural course of the disease. The ease of sampling of peripheral blood and the continuing improvement and availability of multiplexed immunoassay technology should provide us with a new tool for research in this deadly disease.

View Abstract


  • Published Online First 13 March 2007

  • This work was supported by an unrestricted grant from GlaxoSmithKline and the Thoracic and Overholt Foundation. No funds were received from the tobacco industry.

  • Competing interests: JT, KL, DP, JB, HM, MMDS and RV are all full time employees of GlaxoSmithKline.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.