Objective: The aim of this study was to estimate the contribution of polymorphisms in the positionally cloned asthma candidate genes ADAM33, PHF11, DPP10, GPRA and PTGDR to the risk of asthma, total and specific immunoglobulin E level, lung function and wheezing in a large, nationally representative, population.
Methods: An association analysis was undertaken using genotype data for tagging and previously associated single nucleotide polymorphisms (SNPs) in regions of these genes and longitudinal phenotype data from singletons of white ethnicity in the British 1958 Birth Cohort DNA archive (n = 7703). Population-attributable risk fractions for SNPs showing association were calculated.
Results: Polymorphisms producing small but statistically significant increases in asthma risk (OR 1.1 per allele) were identified in DPP10 and ADAM33, with the strongest evidence being for SNPs tagging the DPP10 gene. No individual SNP in any gene under study markedly increased risk for any of the phenotypes in the population studied.
Conclusions: These data suggest that DPP10 and ADAM33 influence asthma risk in the UK population. However, the effects driven by any given locus are small, and genotyping of multiple polymorphisms in many genes will be needed to define a full genetic profile for disease risk.
Statistics from Altmetric.com
Over the past 10 years, major efforts have been made using positional cloning approaches to identify genes predicting the risk of developing asthma and related phenotypes. At least 14 genome-wide linkage studies have been undertaken in a range of populations: this has resulted in the identification of a number of potential disease risk genes including ADAM33, PHF11, DPP10 and GPRA.1–4 A fifth gene, for the prostanoid D receptor (PTGDR), was identified using a combination of linkage and candidate gene approaches.5 The initial studies on these genes suggested potentially major contributions to disease risk. However, subsequent attempts at replication in small populations have given conflicting results (see Supplementary material). We have therefore utilised the longitudinal phenotypic data from the British 1958 Birth Cohort in order to assess the contribution of these genes to the risk of asthma and related phenotypes in the UK population. We previously used this approach in a similar candidate gene study to investigate the potential role of ADRB2 polymorphism in asthma incidence and prognosis.6 The specific aim of this study was to define population-attributable risk for single nucleotide polymorphisms (SNPs) in ADAM33, PHF11, DPP10, GPRA and PTGDR.
Study population and procedures
The DNA archive established from the British 1958 Birth Cohort (also known as the National Child Development Study) was used. Details of this study population and phenotyping procedures have been reported previously.6–11 In brief, 18 558 individuals born during 1 week in March 1958 were followed up at ages 7, 11 and 16 years by parental interviews and examinations by school medical officers, and at ages 23, 33 and 42 years by means of interviews. The presence of diagnosed asthma or a history of wheezy bronchitis were ascertained by parental interviews in childhood. In adulthood, questionnaires were used to define asthma ever, wheezing ever and wheezing episodes in the previous 12 months. During 2002–2004 all cohort members still in contact with the study team (12 069) and who had not needed a proxy interview were invited to be examined by a trained research nurse at home: 9377 were visited and adequate blood samples for DNA extraction obtained from 8018. Of these, 7703 were singletons of white ethnicity and were included in the present study. The lifetime histories of asthma and wheezing illness for these individuals who were included in the analyses in this study are similar to those who are not included: at age 42 years, the lifetime prevalence of asthma or wheeze in the full cohort was 49.9% and in the cohort with DNA was 48.8%. Total circulating immunoglobulin E (IgE) and specific IgE to house dust mite, mixed grass pollen and cat fur were measured with a Hytec enzyme immunoassay (Hycor Biomedical, Irvine, CA, USA12) at age 45.
Spirometry at age 44–45 years was done in the standing position without nose clips, using a Vitalograph hand-held spirometer as described in full elsewhere.13 In the analysis, all readings with a best-test variation >10% were excluded. Measures of lung function were adjusted for height and sex, and values with standardised residuals >3 SD units were excluded from analyses.
Protocols for the 2002–2004 biomedical examination were approved by the South East MultiCentre Research Ethics Committee. All participants gave informed written consent to participate in genetic association studies and the present study was approved by the oversight committee for the biomedical examination of the British 1958 Birth Cohort.
For each of the genes examined, the SNPs for genotyping were chosen either based on previously published studies identifying putative risk alleles or, where this information was not available in the literature, by choosing tagging SNPs which covered the major haplotype blocks across each gene of interest. A total of four SNPs in ADAM33 (based on our previous meta-analysis,14), five SNPs in GPRA, six SNPs in DPP10, three SNPs in PHF11 and three SNPs in PTGDR were genotyped (table 1) following SNP selection and quality control in assays using test samples from an in-house DNA archive (see Supplementary information for full details on SNP selection and quality control).
Genotyping was performed using Taqman technology (Applied Biosystems, Foster City, California, USA) by Geneservice (Cambridge, UK). Genotyping call rates for each SNP included in this study were ⩾98%. A random selection of 384-well plates were re-run and concordance on genotype calling was >99%. We have previously reported high levels of genotyping concordance with a different technology using separately aliquoted DNA from the central archive for rs1042713 and rs1042714 in a subset of 856 individuals.6 For one SNP, rs528557, a double heterozygote cluster was seen; these were combined for the present analysis. Only rs6561533 deviated from Hardy–Weinberg equilibrium (Pearson χ2 p = 0.02), though examination of genotype clustering revealed no apparent problems with allele calling.
Stata version 8.0 was used for cross-tabulations and regression modelling as described in a previous publication.6
Asthma prevalence was studied as described previously.6 In brief, the major outcomes were: risk of asthma or wheeze by age 42; atopy (defined as the presence of specific IgE to the allergens tested) and total IgE. Secondary analyses were done to study clinical asthma, maturity onset (age >17) asthma or wheeze, and wheezing in the absence of a diagnosis of asthma.
Haplotype analyses were performed by deriving haplotypes by the EM algorithm implemented in SNPHAP (http://www-gene.cimr.cam.ac.uk/clayton/software/snphap.txt). For all genes except DPP10, all genotyped SNPs were included and haplotypes with frequency >10% used for analyses. For DPP10 there were only two haplotypes with a frequency >10%; therefore, we derived simpler haplotypes using the loci showing association in the initial SNP analyses. Haplotype analyses were only performed where main effects were seen with at least one SNP in the relevant gene.
The population-attributable risk fraction associated with haplotype variation within each candidate gene was estimated as (Σf(R−1)/(Σf(R−1)+1)), where f is the frequency of each haplotype and R is the haplotype per copy odds ratio (OR)).
Primary end points
Tables 1 and 2 show the allelic frequencies of the SNPs studied for the major outcome of this study (asthma/wheeze ever; atopy). Significant associations with asthma or wheeze were seen for rs13011555 and rs13392783 (table 1), in both DPP10 and rs528557 (ADAM33, also known as S2). A borderline significant association was also seen with rs323922 in GPRA but only under an additive model. However, the highest OR seen for any individual SNP was only 1.1. No significant associations were seen with atopy as measured by specific IgE (table 2).
None of the SNPs studied showed significant associations with log total IgE despite the known higher heritability of this trait (data not shown).
Secondary end points
None of the SNPs studied predicted the incidence of asthma or wheezing from birth to age 16 (data not shown); however, rs13392783 (DPP10) predicted risk of the development of asthma in later life; a weaker association was also seen with this end point for rs574174 (ADAM33 ST+7) (OR 1.09, p = 0.03). rs1430091 in DPP10 predicted risk of clinical asthma (as opposed to wheezing disorders) in the cohort. In contrast, three SNPs in DPP10, rs13011555, rs13392783 and rs17048359, all showed association with lifetime risk of adult wheezing not labelled as asthma. rs13011555 also showed borderline association with lung function, in keeping with the risk allele for lower lung function being the same allele showing association with adult wheezing. Tables 3–6 show data for the main risk SNPs for these secondary end points.
Given the association seen with the range of SNPs in DPP10, ADAM33 and wheezing/asthma outcomes, we undertook a haplotype-based analysis to identify the major risk haplotype(s) in the Caucasian population. Statistically significant heterogeneity in incidence was evident only for DPP10; this was mainly attributable to variations in adult-onset disease (p = 0.0002) and wheezing not labelled as asthma (p = 0.005). A simple haplotype analysis using rs13011555 and rs13392783 (see the Methods section for details) showed that the combination T.G had the highest risk for developing phenotypes related to wheezing illness (see table 7): the major risk appears to be for wheezing rather than diagnosed asthma. The proportion of asthma or wheezing illness statistically attributable to common variation in DPP10 was 7%. Asthma risk was not associated with any haplotype in ADAM33 based upon rs511898, rs528557, rs574174 and rs612709.
In the light of previous published data, we looked specifically at an extended haplotype across GPRA for SNPs rs323917, rs323922, rs324377, rs324396 and rs740347 which had previously been suggested to be associated with asthma risk, C.C.A.C.C, but saw no significant increase in risk of asthma or wheezing phenotypes (OR for asthma/wheeze ever 1.06, 95% CI 0.95 to 1.19).
Extensive efforts have been invested in attempting to identify genes predicting risk of asthma and related traits using positional cloning approaches. In this study we have assessed the contribution of polymorphisms in the genes identified to date to estimate population-attributable risks. We found evidence supporting a role for DPP10 in the development of wheezing phenotypes, particularly in later life, and also saw weaker associations of SNPs in ADAM33 with progression of wheezing illness. Borderline association with DPP10 and lung function was also seen. Perhaps surprisingly, given the known heritability of the relevant traits, we did not see association of any individual SNP with either atopy or log total IgE, despite reasonable data from linkage studies suggesting a contribution of both DPP10 and PHF11 to these phenotypes.2 3 The maximum OR seen with risk alleles for wheezing phenotypes was around 1.1. However, as the major allele of these SNPs is often the risk allele for these common complex traits this is perhaps not surprising. Because the major alleles were usually the risk alleles, calculations of the population-attributable risk suggest that as much as 6–7% of disease risk in the population is potentially attributable to the SNPs studied in DPP10 and ADAM33. These values are in keeping with estimates derived from a meta-analysis of ADAM3314 and are likely to be typical for risk alleles for asthma. In all we are potentially able to explain a maximum of <13% of the total population risk of developing asthma by the polymorphisms studied in DPP10 and ADAM33, implying that other genes must contribute overall to the risk of disease development. These effects were not due to geographical stratification, which for the risk SNPs identified was not statistically significant when analysed by UK regions (data available at www.b58cgene.sgul.ac.uk).
There are several important implications of these findings. First it is clear that studies attempting accurately to define population-attributable risks for specific polymorphisms in risk genes for common polygenic disorders such as asthma are going to require very large study populations. There have been a number of studies in smaller populations, typically of a few hundred individuals, looking at the genes we investigated; most of these studies were thus underpowered to determine risk accurately. During the analysis of this study, the first genome-wide association study in childhood asthma was reported15; interestingly, whilst one new region was identified with strong evidence for an asthma-related gene, a number of suggestive regions were identified which would be in keeping with our conclusions at least for this phenotype, suggesting that multiple genes of small effect are involved. It is clear that studies will require many thousands of cases and controls to determine adequately the contribution of specific risk alleles to disease status overall.
Secondly, despite the weak effects of individual SNPs, our data provide support for a role for DPP10 in the development of wheezing disorders, particularly later in life. DPP10 codes for a homologue of dipeptidyl peptidase which cleaves terminal dipeptides from cytokines and chemokines, hence one might think polymorphisms in this gene would be more likely to show association with atopy-related phenotypes. However, in the original study which identified this gene as a potential candidate, two loci were identified: one showing association with IgE levels and one (in a separate LD block) showing association with asthma.2 One SNP in each of these regions showed significant association with wheezing phenotypes in this study, but we did not find evidence supporting a contribution of this gene to total IgE. Because of the LD structure of DPP10, the large size of the gene and the lack of knowledge regarding functional polymorphisms, it is conceivable we may have missed an association with atopy by our choice of tagging SNPs. DPP10 was initially identified in families with children with asthma,2 and our primary observation of an association with asthma or wheeze by age 42 is consistent with this. It is interesting, therefore, that a secondary analysis suggests that the effect in our study appears to be driven by a relationship with asthma or wheeze of onset after 17. It is unclear whether the signal for asthma in children in this study is attenuated by noise from factors contributing to non-asthmatic childhood wheezing as we use a composite end point. Additionally, it is uncertain whether the parents in the initial linkage study had a significant degree of asthma of later onset.
The association seen between rs528557 in ADAM33 and wheezing phenotypes is consistent with those previously reported in the literature1 14 16–18 in terms of the risk allele. Although a previous meta-analysis14 identified this polymorphism as a risk factor for asthma, it is worth emphasising that the total numbers of cases and controls combined in the meta-analysis (2964) is substantially smaller than the population used for the current study. Furthermore, this polymorphism has also been associated with decline in lung function16 which may explain its association with wheezing but not a diagnosis of asthma. However, the size of effect we report is smaller than that previously seen, and no significant effect was seen on lung function.
In contrast to ADAM33 and DPP10, we were unable to identify robust associations between any relevant end point and the SNPs chosen in PHF11, PGDR or GPRA. We extended our analysis of GPRA to look at the possible association between the major risk haplotype previously identified in the PARSIFAL population at this locus but did not identify a significant association. This suggests that previously defined risk alleles/haplotypes within these three genes do not appear to predict risk of asthma or allergic disease in the UK population. These data contrast with a recent study where association was seen with SNPs upstream of GPRA.19 That study differs from the data presented here in several key aspects: it included data from both American non-Hispanic white and Costa Rican populations, all cases were children, measurements were taken at a single time point and family-based analysis techniques were used. The study also genotyped approximately four times as many SNPs but each population was <500 individuals, raising the possibility of false-positive association explaining the findings.
The data we present have not been corrected for multiple testing. It is difficult to know the best way to do this, given that there were a priori reasons for suspecting the genes studied would be linked to asthma risk, and that effects would probably be additive between different genes. The SNPs and phenotypes are not truly independent variables, and hence comprehensive correction for multiple testing would be overly conservative. Hence we have chosen to present the data uncorrected. If one were to apply a Bonferroni correction in order to control the family-wise error rate at the level of the number of genes examined, then the associations with DPP10 would remain significant. Had the extremely conservative approach of correcting at the level of the effective number of independent SNPs (following the method of Nyholt20) been used, rs13392783 would still have shown significant association with adult-onset asthma or wheeze.
In conclusion, this study suggests that a genetic risk profile for the development of asthma and/or allergic disease in general is likely to require information on multiple SNPs in a large number of genes, and raises the question of how, even if such a profile could be derived, it would be feasible to evaluate prospectively the predictive value of a complex profile without access to extremely large populations probably involving tens of thousands of cases and controls.
This study was funded in part by Asthma UK grant no: 05/55. The 2002–4 biomedical survey and DNA collection were funded by Medical Research Council grant no, G0000934. We are especially grateful to all members of the British 1958 Birth Cohort who participated in the field studies and who gave consent for the use of their DNA for genetic epidemiological analyses, and to the research nurses who contributed to the completion of field studies. We are additionally grateful for helpful discussions with WOCM Cookson on study design.
Review history and Supplementary material
Additional figures are published online only at http://thorax.bmj.com/content/vol64/issue5
Competing interests: None.
Ethics approval: Protocols for the 2002–2004 biomedical examination were approved by the South East MultiCentre Research Ethics Committee. The present study was approved by the oversight committee for the biomedical examination of the British 1958 Birth Cohort.
Patient consent: All participants gave informed written consent to participate in genetic association studies.
Contributors: JB, IPH and DPS designed this study. DPS and SMR were responsible for the management of the 1958 birth cohort genetic resource. SNP selection was performed by JB and IS. JB was responsible for genetic data handling. Statistical analysis was performed by DPS with input from IPH. The manuscript was drafted by IPH and JB: all authors reviewed and revised the manuscript and approved the final version.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.