The Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE) study was a large 3-year observational controlled multicentre international study aimed at defining clinically relevant subtypes of chronic obstructive pulmonary disease (COPD) and identifying novel biomarkers and genetic factors. So far, the ECLIPSE study has produced more than 50 original publications and 75 communications to international meetings, many of which have significantly influenced our understanding of COPD. However, because there is not one paper reporting the biomarker results of the ECLIPSE study that may serve as a reference for practising clinicians, researchers and healthcare providers from academia, industry and government agencies interested in COPD, we decided to write a review summarising the main biomarker findings in ECLIPSE.
Statistics from Altmetric.com
The Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE) study was a large 3-year observational controlled multicentre international study (Clinicaltrials.gov identifier NCT00292552; GSK study code SCO104960)1 aimed at defining clinically relevant subtypes of chronic obstructive pulmonary disease (COPD) and identifying novel biomarkers and genetic factors.1 It was a joint venture between academic researchers and GlaxoSmithKline, which fully funded it. So far, the ECLIPSE study has produced more than 50 original publications and 75 communications to international meetings (http://www.eclipse-copd.com), many of which have significantly influenced our understanding of COPD. This paper reviews the main biomarker findings in ECLIPSE (tables 1 and 2) in order to serve as a unified reference for practising clinicians, researchers and healthcare providers from academia, industry and government agencies interested in the field. Details of the design of the ECLIPSE study have been published elsewhere1 and will not be reviewed here.
Many previous studies have shown that sputum neutrophils are increased in smokers and in patients with COPD.2 In a subset of patients included in ECLIPSE, we examined its long-term variability at 1 year follow-up and its relationship with a number of clinically relevant characteristics of COPD.3 In 168 subjects who provided valid induced sputum samples at baseline and 1 year later, the mean change over 1 year in neutrophils was an increase of 3.5%; however, most of the change was in patients with a low proportion of neutrophils at the baseline visit.3 On the other hand, in 488 patients with COPD we found that the proportion of neutrophils in sputum at baseline increased with GOLD stage. There was a weak but statistically significant association between percentage sputum neutrophils and both forced expiratory volume in 1 s (FEV1) percentage predicted and health status (St George's Respiratory Questionnaire). By contrast, there were no associations between neutrophils and exacerbation rates or emphysema. Associations between sputum neutrophils and systemic biomarkers were non-significant or similarly weak.3 In summary, these observations suggest that sputum neutrophilia: (1) can be quantified reliably in multicentre trials using a standardised methodology; (2) is a relatively stable biomarker in COPD; and, (3) does not appear to be a major surrogate of clinical or pathophysiological abnormalities in COPD which limits its application in a clinical setting.
Circulating white blood cells
High circulating white blood cell (WBC) counts were weakly associated with persistent systemic inflammation,4 frequent exacerbations5 and mortality6 in ECLIPSE. A recent study in the general population has reported similar observations in relation to exacerbations.7 Although elevations in blood WBC were stable over time (at least in a subgroup of patients),4 numerical differences between patients with COPD and controls are small and often within the range of normal laboratory values. Overall, these results support the value of a high circulating WBC count as a relevant biomarker in COPD. Of note, similar results were obtained when neutrophil counts were assessed instead of total WBC.4–6 The potential role of other cellular biomarkers, such as circulating eosinophils, is currently being analysed.
We initially explored 34 protein biomarkers in peripheral blood selected on the basis of previously published work8 and/or potential association with biological mechanisms believed to be relevant in the pathogenesis of COPD, including chemoattractants, tissue destruction/repair/remodelling and other inflammatory markers. Prior to the assessment of samples in the full ECLIPSE cohort, each biomarker assay was validated in terms of its sensitivity, accuracy, precision and reproducibility because, except for plasma fibrinogen and C-reactive protein (CRP), there was a lack of validated clinical laboratory assays. Repeatability was assessed by calculating the proportion of values at 3 months that were within 25% of the baseline value, as this reflects the typical total error associated with ‘research-grade’ immunoassays. We did this in what we called ‘the ECLIPSE biomarker cohort’, which consisted of 201 former smokers with COPD and 74 controls (37 ex-smokers with normal lung function and 37 healthy non-smokers) selected as a representative sample of the full ECLIPSE cohort.9 Biomarkers fulfilling these validation criteria included plasma CRP and fibrinogen, serum interleukin (IL)-8 and the predominantly lung-derived biomarkers (so-called ‘pneumoproteins’) surfactant protein-D (SP-D), club cell protein (formerly Clara cell secretory protein-16 (CC16)) and CCL18 (also known as pulmonary and activation-regulated chemokine (PARC)).9
These validated biomarkers were then explored in the entire ECLIPSE population as a replication of the initial findings and to investigate their relationships with age, gender, smoking status, clinical characteristics and outcomes. The main results were: (1) plasma fibrinogen is the most robust (in terms of relative longitudinal stability) biomarker investigated so far. It is significantly associated with symptoms, exercise capacity, exacerbation rate, the BODE index and mortality.6 ,10 Plasma fibrinogen is currently being considered a potential candidate for regulatory qualification as a prognostic biomarker11; however, more research is needed to fully assess its utility as an outcome measure in clinical trials and as a personalised risk factor in clinical practice; (2) CC1612 was weakly associated with lung function decline,13 emphysema14 and depression15; (3) SP-D showed some weak association with COPD exacerbations5 ,16 and appears to be sensitive to treatment with oral and inhaled corticosteroids16; and (4) in a joint analysis of the ECLIPSE and Lung Health Study cohorts,17 serum CCL18 (PARC) was associated with an increased risk of cardiovascular hospitalisation or mortality.
We used a combined panel of six inflammatory markers in serum (WBC, CRP, IL-6, IL-8, tumour necrosis factor α (TNFα) and fibrinogen) to define a systemic ‘inflammome’ and we observed that the systemic ‘inflammome’ of smokers with normal lung function (basically characterised by increased circulating levels of WBC, IL-8 and TNFα) was different from that of smokers with COPD (characterised by a further increase in WBC plus abnormal serum levels of CRP, IL-6 and fibrinogen).4 We also found that 30% of patients with COPD did not have evidence of systemic inflammation (neither at baseline nor after 1-year follow-up), whereas 16% of them had persistent systemic inflammation (defined by the presence of two or more abnormal inflammatory markers, both at baseline and after 1-year follow-up). Finally, and importantly, we observed that mortality in patients with persistent systemic inflammation was six times higher than in patients without inflammation and their rate of exacerbations was double.4
Other studies have directly investigated the relationship between multi-morbidity and systemic inflammation in COPD and found that systemic inflammation was associated with the presence of heart disease, hypertension and diabetes.18 Likewise, since obesity and cachexia are common and relevant clinical problems in COPD,19 we compared adipokine metabolism (and markers of systemic inflammation) in 136 patients with COPD and 113 controls from the ECLIPSE cohort matched for age, gender and body composition.20 The main results showed that: (1) CRP, IL-6, fibrinogen and adiponectin serum levels were higher in patients with COPD; (2) CRP levels were positively related to leptin and inversely related to adiponectin; and (3) body mass index (BMI) and gender were the strongest determinants for both leptin and adiponectin levels.20
Vitamin D has been related to lung function in several previous studies of COPD, and a relationship between low levels of vitamin D in blood and emphysema, 6 min walk distance, airways reactivity and blood CC-16 levels was confirmed in a subset of 498 patients in ECLIPSE.21
Finally, in a very recent combined analysis we assessed the relationships of soluble receptor for advanced glycation end products (sRAGE) and CT-defined emphysema and found that lower circulating sRAGE levels are associated with emphysema severity and genetic polymorphisms in the AGER locus (in the gene coding for RAGE) were associated with circulating sRAGE levels.22
Overall, these studies have identified several systemic biomarkers that alone or in combination have potential relevance for the enrichment of clinical trials aimed at validating future therapeutic interventions.
The large number of patients with COPD included in the ECLIPSE study allowed the performance of genetic studies. In order to validate the findings from ECLIPSE or to reveal additional common variants that contribute to COPD susceptibility, analyses were often performed in collaboration with other large cohorts of patients with COPD and/or controls, including the GenKOLS cohort (Bergen, Norway),23 the International COPD Genetics Network (ICGN),24 the National Emphysema Treatment Trial (NETT),25 the Normative Aging Study (NAS),26 the Lung Health Study (LHS)27 and the COPDGene cohort.28 The number of patients and controls included in each of the following studies varied accordingly. Below we summarise the results of these genetic studies in relation to: (1) the smoking history; (2) the susceptibility to develop COPD among smokers; and/or (3) the occurrence of different COPD characteristics in those smokers who have already developed the disease. We refer to ‘genome-wide significant’ results as those associations with p values <5×10−8, in order to adjust for the multiple comparisons involved in testing a genome-wide single nucleotide polymorphism (SNP) panel.
Genes associated with smoking history
Tobacco smoking is the main risk factor for COPD.29 To identify SNPs associated with smoking intensity and behaviour, genome-wide association studies (GWAS) were conducted in four independent cohorts encompassing 3441 ever-smoking patients with COPD (GOLD stage ≥2).30 No genomic regions were identified that reached genome-wide significance for any of the smoking-related phenotypes in this population of smokers and ex-smokers with COPD. However, suggestive association results were found for several loci associated with age at smoking initiation (SNPs in an intergenic region on chromosome 2q21 and near the HLA region on chromosome 6p21), lifetime mean number of cigarettes per day (SNPs in CHRNA3/CHRNA5 and cytochrome P450, family 2, subfamily A, polypeptide 6 (CYP2A6)), current number of cigarettes smoked per day (CYP2A6) and smoking cessation (SNP rs3025343 in dopamine β-hydroxylase (DBH) locus).30 These results strongly support a genetic basis for smoking initiation, maintenance and cessation.
Genes associated with COPD susceptibility
Because not all smokers develop COPD29 and the disease clusters in families,31 it has long been proposed that there are specific genetic abnormalities that increase the susceptibility of some smokers to develop the disease. Several analyses explore this hypothesis in ECLIPSE.
An initial GWAS meta-analysis (including ECLIPSE) in 2940 patients with COPD and 1380 current or former smokers with normal lung function identified a new susceptibility locus at 4q22.1 in FAM13A and replicated this association in one case-control group (n=1006) and two family-based cohorts (n=3808) (rs7671167).32 Two previously reported genome-wide significant COPD GWAS regions near hedgehog interacting protein (HHIP) and CHRNA3/CHNRA5/IREB2 also showed evidence for association. Because a larger sample size in GWAS may identify additional loci associated with COPD susceptibility, we extended our GWAS to 3499 cases and 1922 control subjects from four different cohorts (ECLIPSE, NAS and NETT, GenKOLS and COPDGene) pooled together.33 The results identified a new genome-wide significant locus on chromosome 19q13 (rs7937). Genotyping this SNP and another nearby SNP in linkage disequilibrium (rs2604894) in 2859 subjects from the ICGN demonstrated supportive evidence of their association with COPD, pre-bronchodilator FEV1 and severe COPD (GOLD stages 3 and 4). This region includes RAB4B, EGLN2, MIA and CYP2A6, and has previously been identified in association with cigarette smoking behaviour.34 Finally, previous GWAS meta-analyses of lung function in general population samples35 ,36 identified genome-wide significant evidence for association of multiple novel loci with two key spirometric variables describing airflow limitation in COPD (FEV1 and FEV1/forced vital capacity (FVC). To investigate if a subset of these markers could also be associated with COPD susceptibility, 32 SNPs in or near 17 genes in 11 previously identified GWAS spirometric genomic regions were tested for association with COPD status in four COPD case–control study samples (NETT/NAS, GenKOLS, ECLIPSE and the first 1000 subjects in COPDGene; the total sample thus consisted of 3456 cases and 1906 controls).37 Three loci harboured SNPs with suggestive evidence for an association with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2 and the 5q33 locus including ADAM19.
Because SP-D is an immunomodulatory pneumoprotein essential to host defence and because we had identified it as a potentially relevant biomarker in COPD5 ,16 (see above), we hypothesised that polymorphisms in SP-D could influence the susceptibility to COPD. Indeed, we found that four SP-D SNPs (rs2245121, rs911887, rs6413520 and rs721917) showed suggestive associations with susceptibility to COPD (but not at genome-wide levels of significance), and that multiple SP-D SNPs were strongly associated with serum SP-D levels.38
Homozygosity haplotype analysis is a very efficient and effective methodology for identifying potential disease-linked regions.39 Using this approach, we identified 2318 regions of conserved homozygosity haplotype, of which 576 were significantly (p<0.05) over-represented in patients with COPD.40 After applying the weights constructed from these regions in the abovementioned GWAS of COPD,32 we identified two SNPs (rs12591300 and rs4480740) in a novel gene (fibroblast growth factor-7 (FGF7)) with suggestive evidence for an association with COPD susceptibility.40
Finally, we used ‘mediation analysis’ in 3424 COPD cases and 1872 controls to estimate the direct (ie, independent from smoking) and indirect (ie, mediated by smoking) effects of three loci previously associated with COPD development.41 The results showed that the AGPHD1/CHRNA3, IREB2, FAM13A and HHIP loci had direct effects on COPD development, although the association of the AGPHD1/CHRNA3 locus is significantly mediated by cumulative exposure to tobacco smoke.41
Overall, these studies have contributed to the identification of several genomic regions that are associated with COPD at genome-wide significance, including FAM13A, HHIP, CHRNA3/CHRNA5/IREB2 and a region on chromosome 19. Several other genes and gene regions including ADAM19, FGF7 and SP-D showed evidence for an association with the development of COPD in smokers but will need to be replicated in additional populations.
Genes associated with different COPD subtypes
COPD is a complex and heterogeneous disease. The potential genetic contributions to this heterogeneity are unknown. To investigate it, we explored the association of a number of genetic loci with different COPD-related characteristics.
We tested the association of several key SNPs within the COPD GWAS regions identified above, such as the cholinergic nicotinic acetylcholine receptor (CHRNA3/5) and FAM13A genes and variants near HHIP, with several clinically relevant characteristics of COPD in the ECLIPSE cohort and then validated the results in the ICGN cohort.42 We found that CHRNA3/5 was significantly associated with cumulative smoking exposure (pack-years), emphysema and airflow limitation in both populations.42 By contrast, HHIP was not associated with pack-years but it was related to the FEV1/FVC ratio in both populations and with lean body mass and COPD exacerbations in ECLIPSE.42
To explore the genetic basis of emphysema (as determined by chest CT), we used GWAS in 2380 patients with COPD from ECLIPSE, the GenKOLS cohort and NETT, and identified a borderline genome-wide significant association of BICD1 SNPs with the presence of emphysema as assessed by radiologist scores.43 Since variants in BICD1 are associated with telomere length,43 this observation suggests accelerated ageing as a potential mechanism involved in the development of emphysema.44
The occurrence of cachexia in some patients with COPD is associated with increased mortality and increased emphysema.45 To identify genetic susceptibility loci potentially related to cachexia in COPD (assessed by BMI or fat-free mass index (FFMI)), we performed a GWAS in patients with COPD pooled from the ECLIPSE study (n=1734), the GenKOLS cohort (n=851) and the NETT study (n=365).46 We identified a suggestive association between an SNP (rs8050136) located in the first intron of the fat mass and obesity-associated (FTO) gene and both BMI and FFMI; this observation was replicated in 502 patients with COPD from the COPDGene cohort.46 Interestingly, we also found a significant relationship between FEV1 and FTO genotype.46 All in all, these observations suggest a role for the FTO locus in the determination of body composition in COPD.
Finally, to investigate potential genetic determinants of the circulating levels of the protein biomarkers discussed above, we performed GWAS for two pneumoproteins (CC16 and SP-D) and five inflammatory markers (CRP, fibrinogen, IL-6, IL-8 and TNFα) in 1951 COPD subjects from ECLIPSE.47 Genome-wide significant associations were identified only for the bloodstream levels of CC16 and SP-D. For CC16, two discrete genetic loci were identified; one was near the CC16 coding gene (SCGB1A1) on chromosome 11 while the other was located more than 20 Mb away on the same chromosome. Multiple SNPs near the coding gene (SFTPD) were associated with SP-D levels at genome-wide significance. In addition, SNPs on chromosomes 6 and 16 also demonstrated genome-wide significant associations with SP-D serum levels.47
Overall, these studies have identified a number of novel genes associated with different COPD-related characteristics including: (1) airflow limitation (eg, CHRNA3/5, IREB2, HHIP, FTO); (2) emphysema (eg, CHRNA3/5, BICD1); (3) exacerbation frequency (eg, HHIP); (4) BMI (eg, HHIP, FTO); and (5) serum levels of two COPD-related pneumoproteins, CC16 and SP-D. Many of these associations were suggestive rather than genome-wide significant and will require replication in additional studies.
Sputum transcriptomic studies
ECLIPSE performed the largest sputum transcriptomic analysis ever reported in COPD. This has produced novel data (discussed below) but also highlights the feasibility of this type of analysis in large multicentre trials.
To identify candidate genes associated with the degree of airflow limitation and the extent of emphysema, sputum gene expression profiling was assessed in 148 former smokers with COPD (GOLD stages 2–4) from ECLIPSE and the findings were replicated in a separate population of 176 patients using real-time PCR.48 The results identified significant changes in 277 genes associated with the severity of airflow limitation (GOLD stage) and 198 genes with emphysema. Twelve candidate genes were selected from the microarray data set (based on a twofold change in expression between GOLD stage 2 vs GOLD stages 3 and 4) and 11 of them were validated by PCR in the replication cohort.48 To illustrate the potential of these findings, one selected gene (IL-18R) was further analysed using immunohistochemistry in lung tissue, which demonstrated increased expression of IL-18R in COPD airway macrophages.48 These results therefore have potential functional implications, given the role of IL-18 in neutrophil and macrophage activation as well as T cell development.49 ,50
We also explored the relationship between sputum gene transcriptomics and several SNPs affecting circulating levels of several protein biomarkers (see above) including CC16, SP-D, CRP, fibrinogen, IL-6, IL-8 and TNFα. We found that several SNPs affecting circulating CC16 protein levels were significantly associated with sputum mRNA expression of SCGB1A1, the CC16 coding gene on chromosome 11.47 This supports a coordinated regulation of CC16 expression, both systemically and in the lungs.47
To identify potential functional effects of known COPD susceptibility genes and to find novel disease gene candidates, we used an integrative genomics approach that combined analysis of GWAS (from ECLIPSE, Bergen, NETT and NAS) and gene expression data from relevant tissue samples (sputum) from 131 patients with COPD.51 This strategy located potential functional variants in two genes located within a COPD GWAS locus on chromosome 15 (CHRNA5 and IREB2) and has provided suggestive evidence for a novel COPD susceptibility locus in the HLA-C region on chromosome 6.51
Taken together, the above reviewed results on sputum RNA biomarkers show that: (1) sputum transcriptomics studies may be feasible in multicentre trials; (2) sputum gene expression profiling identified 277 genes associated with airflow limitation and 198 with emphysema48; (3) several SNPs affecting circulating CC16 levels were associated with sputum mRNA expression of the CC16 gene, suggesting coordinated CC16 regulation systemically and in the lungs47; and (4) an integrative genomics approach can identify potential COPD susceptibility loci.51
Serum metabolomic biomarkers
Several ECLIPSE studies investigated the serum metabolomic profile of patients with COPD. We used proton nuclear magnetic resonance (1H NMR) to compare the metabolomic serum profile of 1678 patients with COPD and 566 healthy smokers.52 The results showed: (1) decreased lipoproteins, N,N-dimethylglycine and increased glutamine, phenylalanine, 3-methylhistidine and ketone bodies in patients with COPD, with decreased branched chain amino acids (BCAAs) observed in patients with GOLD stage 4; (2) BCAAs and their degradation products (3-methylhistidine, ketone bodies and triglycerides) correlated negatively with body weight and positively with systemic inflammation; and (3) patients with emphysema displayed decreased serum creatine, glycine and N,N-dimethylglycine.52 Liquid chromatography with tandem mass spectrometry (LC-MS/ MS) confirmed most of the 1H NMR findings.52
In a follow-up study53 we used quantitative LC-MS/MS to measure 34 amino acids and dipeptides in different subgroups of patients with COPD classified according to: (1) the severity of airflow limitation (GOLD stage 4 (n=30) vs controls (n=30)); (2) presence (n=38) or absence (n=21) of emphysema; and (3) cachexia (n=30) vs normal BMI (n=30). Targeted LC-MS/MS distinguished groups in all three categories.53 In particular, glutamine, aspartate and arginine were significantly increased in patients with GOLD stage 4, emphysema or cachexia whereas aminoadipate was decreased.53
These results indicate that: (1) there is increased protein turnover in all patients with COPD, with increased protein degradation in individuals with emphysema and cachexia; and (2) while there are some promising metabolomic signals detected in the serum of certain subtypes of patients with COPD, replication in other cohorts is required.
Exhaled breath condensate
Several ECLIPSE studies explored exhaled breath condensate (EBC) as a potential source of valid COPD biomarkers. Using conventional methodology, we were not able reliably to measure protein biomarkers in EBC. However, we found that the pH of the EBC was consistently lower in COPD (but also in smoking controls) than in non-smokers, although it was not related to FEV1 or sputum leucocyte counts, and that it was unresponsive to oral steroid treatment.54 Using mass spectrometry, we found that the relative concentrations of adenosine and AMP were elevated in patients with COPD, and the former correlated with FEV1.55 Finally, EBC contains a complex mixture of volatile organic compounds (VOCs), some of which could potentially represent biomarkers for lung diseases.56 We developed a special sampling methodology for collecting concentrated samples of exhaled air from participants with impaired respiratory function. By using two-stage thermal desorption gas chromatography-differential mobility spectrometry (GC-DMS) analysis, we then showed that it is possible to discriminate between healthy smokers and patients with COPD.56 Mass spectrometry is then required to identify and quantify any discriminative biomarker. In summary, so far these studies do not support EBC as a useful source of biomarkers in COPD, although novel methodologies may help to explore potential new ones if care is taken to assess reproducibility with time and to validate the results in different cohorts.
Other lessons from the ECLIPSE biomarker expedition
The results discussed above clearly identify novel cellular, proteomic, genetic, transcriptomic and metabolomic biomarkers in COPD. Needless to say, many of them require validation in appropriately designed studies before routine use in clinical practice and as drug development tools. However, the ‘ECLIPSE biomarker expedition’ has also provided other equally important lessons for conducting biomarker analyses in large multicentre clinical studies. We believe that the following are worth considering:
Standardised methodology and strict quality control are critical for the assessment of biomarkers in a large multicentre trial.
Replication of data in different cohorts is imperative, highlighting the need for open collaborations. Plasma fibrinogen, serum CCL18 and sRAGE, as well as GWAS data on FAM13A, are specific examples of the power of a partnership approach in the data reviewed above.
ECLIPSE identified a panel of biomarkers associated with risks of poor clinical outcome. For newly emerging biomarkers, it is imperative to develop robust and cost-effective diagnostic tests that can be qualified and that will allow comparison between studies as a replacement to research grade laboratory assays.
Transcriptomics and metabolomics pave the future by identifying pathways of interest, but data require replication and, very importantly, the use of cluster and network analysis for comprehensive systems biomedicine.57 There are still several pending manuscripts of the ECLIPSE saga, but two of them address precisely this field of knowledge.58 ,59
The authors thank all the subjects, investigators and study site staff who participated in ECLIPSE and our many collaborators on the manuscripts cited herein.
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Files in this Data Supplement:
- Data supplement 1 - Online supplement
RF and RT-S contributed equally
Collaborators See online supplementary appendix for full list of ECLIPSE study investigators.
Contributors RF, RT-S, JV and AA planned the review, wrote the first draft, collated comments from the co-authors and are responsible for the overall content as guarantors. All the other co-authors read the draft, made comments and suggestions and approved the final manuscript. All authors (except RF) participate in the Steering Committee of ECLIPSE.
Funding The ECLIPSE study was funded by GlaxoSmithKline. Disclosures for each author are found in supplementary material.
Competing interests None.
Ethics approval Local Ethics Boards.
Provenance and peer review Not commissioned; internally peer reviewed.