Background: Numerous areas of the human genome have previously been associated with asthma and asthma-related phenotypes, but few positive findings have been successfully replicated in independent populations. Initial studies have reported strong associations of variants in the plant homeodomain zinc finger protein 11 (PHF11) gene with serum IgE levels, asthma, airway hyper-responsiveness and childhood atopic dermatitis.
Objectives: To investigate the association of variants in the PHF11 gene with asthma and associated intermediate phenotypes in two independent Western Australian population-based samples.
Methods: A linkage-disequilibrium (LD)-tagging set of 20 single nucleotide polymorphisms (SNPs) was genotyped in PHF11 in two separate populations (total n = 2315), a family-based twin study consisting of 230 families (n = 992 subjects) and a population-based nested case-control study consisting of 617 asthma cases and 706 controls. Information regarding asthma, respiratory physiology, atopy and environmental exposures was collected. Transmission disequilibrium tests, variance components models and generalised linear models were used to test for association between PHF11 SNPs and selected asthma outcomes (including longitudinal change in lung function).
Results: After correction for multiple testing, no statistically significant (p<0.05) associations were found between PHF11 and either asthma or total serum IgE levels in either population. No statistically significant associations were found with any other asthma-associated phenotypes in either population.
Conclusions: Previously reported associations of PHF11 with asthma outcomes were not replicated in this study. This study suggests that PHF11 is unlikely to contain polymorphic loci that have a major impact on asthma susceptibility in our populations.
Statistics from Altmetric.com
Asthma is a chronic inflammatory disease of the airways which results in variable airway obstruction and episodes of wheezing and breathlessness due to widespread narrowing of the airways in the lung.1 The prevalence of asthma in the general population of Australia varies depending on the definition of asthma diagnosis, but is generally accepted to be around 10–12% of adults and 12–15% of children in Australia,2 both of which are high by international standards.3
Asthma is a complex disease caused by intricate interactions among genetic and environmental influences.4 The heterogeneous and complex nature of the aetiology of asthma has made it difficult to pinpoint precisely the extent of attribution of individual genetic and environmental influences, and can be further complicated by incomplete penetrance, epistasis and phenotypic pleiotrophy.5 Difficulties in defining asthma, and the complexities of the pathogenic mechanisms underlying the disease, mean that intermediate asthma phenotypes are commonly used in genetic studies.6 Commonly measured intermediate phenotypes for asthma include raised total serum IgE levels, increased exhaled nitric oxide (eNO), decreased spirometric indices (eg, forced expiratory volume in 1 s, FEV1) and airway hyper-responsiveness (AHR) to inhaled spasmogens. All of these phenotypes are highly heritable.7
There have been many genome-wide linkage scans for asthma, and several genes have been claimed as positionally cloned susceptibility loci.8–12 The 13q14 region, in which the plant homeodomain zinc finger protein 11 (PHF11) gene resides, has demonstrated significant linkage (p<0.05) with asthma, atopy and IgE levels in a number of studies.13–17 The genome-wide linkage scan of Zhang et al18 narrowed down this region of association with IgE levels to the PHF11 gene and two flanking genes on either side, with three SNPs identified as having independent associations with total serum IgE levels. In an independent study in children, Hersh et al19 reported marginal associations between PHF11 and asthma, as well as AHR, but found limited association with IgE levels.
The current interest in PHF11 as a potential candidate gene has arisen from positional approaches. Until recently there was no clearly defined biological mode of action for PHF11 in relation to asthma, although it has been suggested to play a role in chromatin-mediated transcriptional regulation and immunoregulation, especially of B lymphocytes.18 20 A recent study by Clark et al21 found a higher expression of PHF11 in Th1 cells than Th2 cells, as well as a marked increase in PHF11 expression in basophils and mature dendritic cells, although there was low expression in mast cells and eosinophils.
Given the complex nature of the aetiology of asthma and the frequent inability of secondary studies to replicate a positive result in a primary study, comprehensive replication of initial positive findings in gene-disease association studies has become critical for determining the validity of a candidate gene for a specific trait.22 23 To investigate the association of PHF11 with asthma phenotypes, we identified a linkage-disequilibrium (LD)-tagging set of SNPs in the gene and conducted genetic association studies in two independent comprehensively phenotyped Western Australian (WA) samples. The potential pathway(s) through which the PHF11 gene might act to influence asthma susceptibility remain unclear. Therefore, in addition to doctor-diagnosed asthma, we also investigated a number of quantitative asthma-associated phenotypes.
Subjects were recruited from two independent Western Australian (WA) populations, the WA Twin Register (WATR) and the Busselton Health Study (BHS).
The WATR recruited families of multiple birth children to the WA Twin Child Health (WATCH) study; recruitment is described in detail elsewhere.24 A subset of WATCH study families whose twins were born between 1990 and 1995 was then recruited to the “WATCH for asthma” study. The 992 individuals from 230 different families provided the data described in this paper.
The BHS includes a series of seven cross-sectional population health surveys of adult residents in the Shire of Busselton since 1966.25 In 1994/5 a cross-sectional community follow-up study was undertaken of all available attendees of previous surveys. A case-control sample of unrelated subjects with (n = 617) and without (n = 706) doctor-diagnosed asthma was selected from the 1994/5 BHS cohort. For the analysis between longitudinal lung function and PHF11, data collected from the 1994/5 survey and up to seven previous surveys were used (mean number of surveys per subject 3.8, range 2–7).
WATCH and BHS subjects were assessed for a variety of demographic, physiological and genetic variables.25 26 Physiological parameters within both populations included the dichotomous variables asthma and atopy, eosinophil count and respiratory measures. WATCH subjects were also assessed for total serum IgE levels and eNO. Dose-response slopes were also collected from the BHS population only. Participants were considered asthmatic if they responded positively to the question: “Has your doctor ever told you that you had asthma/bronchial asthma?” Total IgE concentrations were calculated using the Immulite 2000 immunoassay analyser (Diagnostic Products Corporation, Los Angeles, USA). Atopy was determined by skin prick testing and was defined as the development of a weal >3 mm to at least one allergen via a skin prick test.
Selection of SNPs
A set of LD-tagging SNPs was selected to tag PHF11 and 10 000 base pairs in each direction (n = 20) using the HapMap Phase II database.27 SNPs identified from HapMap were run through Haploview (V4.0).28 The minor allele frequency (MAF) was set at 0.05, Hardy-Weinberg Equilibrium (HWE) p value was set at 0.05 and the r2 threshold was set at 0.8. The three SNPs identified by Zhang et al18 as having an independent effect on total serum IgE levels were also genotyped.
SNPs were genotyped using tetra-primer amplification refractory mutation system polymerase chain reaction, generated by a primer design program accessible through the internet29 or by the TaqMan SNP Genotyping System (Applied Biosystems, ABI).
HWE was tested at each SNP locus through the use of a Markov chain random walk algorithm30 using unrelated individuals. Pairwise LD was analysed by a likelihood ratio test.31 Lewontin disequilibrium coefficient D′ and r2 were calculated for each pairwise comparison.
Generalised linear models (GLMs) were used to model the effects of multiple covariates and PHF11 genotypes on case-control status or normally distributed continuous outcomes in the BHS sample. The FEV1/FVC ratio, dose-response slope and eosinophil count were not normally distributed and were analysed using the Kruskal-Wallis test. Longitudinal genetic association analyses were conducted on FEV1 and FEV1/FVC ratio25 using Linear Mixed Effects (LME) models.32 Transmission disequilibrium tests were used in the WATCH families to investigate the associations between qualitative traits and PHF11 genotypes. Within the WATCH analysis, if twins were monozygous, one twin was deleted from the analysis as monozygous twins are non-informative in a study of this nature. A general variance components model to partition observed phenotypic variance into genetic and non-genetic components was used to model the effects of multiple covariates and PHF11 genotypes on quantitative outcomes.33 All BHS analyses were undertaken in the SimHap V.1.0.0 program (http://www.genepi.org.au/simhap.html) or the R statistical language.34 WATCH analyses were undertaken using the QTDT v2.6.0 program.33
Sex, age, smoking status, height, weight and body mass index (BMI) were included as potential covariates in the multivariate models in both populations. Multiple testing was corrected for using the false discovery rate method.35 Statistical significance was defined at the 0.05 level.
The characteristics of the subjects in the WATCH and BHS populations are shown in table 1. As expected, the WATCH population was significantly younger than the BHS population, reflecting the different recruitment criteria. Age-related parameters such as BMI were also significantly lower in the WATCH population.
A total of 20 SNPs were identified through HapMap as capturing common genetic variation within PHF11 with an r2 value ⩾0.8.28 These SNPs were genotyped and examined in the WATCH and BHS samples. The final set of 20 tag SNPs were found to provide 91% coverage for all 58 common HapMap-CEU SNPs within the target region (mean r2 = 0.94). Both populations included the three SNPs previously reported by Zhang et al18 as having independent effects on IgE levels. The distribution of genotypes was consistent with HWE (p>0.05) in all SNPs within the WATCH population. Deviation from HWE was seen in only one SNP (rs7332573) within the BHS population (p = 2.2×10−7). This SNP was excluded from further analysis within the BHS and subsequently within WATCH.
The PHF11 SNPs genotyped were in only weak/modest LD (fig 1), reflecting the use of LD-tagging SNPs. Genotype and allele frequencies are given in table E1 in the online supplement.
Univariate analyses within the WATCH population did not show any statistically significant associations between any SNP and any phenotype when accounting for multiple testing (data not shown). Multivariate modelling within WATCH, including BMI, age, sex and smoking status as covariates, also failed to show any statistically significant association between the genotyped SNPs and any quantitative asthma phenotype, including total serum IgE levels, eosinophil count, eNO, FEV1 or the FEV1/FVC ratio (see table E2 in the online supplement). Multivariate outcomes of asthma for both WATCH and BHS can be seen in table 2.
Univariate analyses within the BHS population suggested marginal associations between rs9568221 and FEV1 (dominant model: p = 0.02) and rs9535259 and asthma (dominant model: p = 0.04). Multivariate GLM analysis indicated that the association between rs9568221 and FEV1 remained statistically significant under a dominant model (β = 111.1, SE = 42.3, p = 0.01), although this result was not statistically significant once multiple testing was accounted for (q = 0.21). After adjusting for other covariates, the association between asthma and rs9535259 was no longer statistically significant (dominant model: odds ratio 0.81, 95% CI 0.63 to 1.03, p = 0.09). All multivariate data for the Busselton population, excluding asthma, are presented in table E3 in the online supplement.
A significant association was also observed between a longitudinal decline in the FEV1/FVC ratio and rs9568222 (dominant model: coefficient = −0.0009, SE = 0.0004, p = 0.02). Again, this association became statistically non-significant when multiple testing was taken into account (q = 0.10). No other SNP within PHF11 was significantly associated with the longitudinal lung function measures.
Our study was designed to select a set of SNPs capturing common genetic variation within the PHF11 gene, and to investigate a possible association between these selected variants and asthma-associated phenotypes in two well-characterised population-based samples. This study comprises a replication study in both child and adult populations of the associations reported by Zhang et al18 and Hersh et al19 with total serum IgE levels, asthma and AHR, and has examined the same SNPs identified as having an independent effect on IgE levels in the study by Zhang et al.18 The current study is both relatively large and the only study, to the authors’ knowledge, to investigate the association between PHF11 and the asthma-associated phenotypes of spirometric indices, eNO levels, eosinophil counts and longitudinal changes in lung function. We observed a statistically significant association between rs9568221 and FEV1 (p = 0.01) and rs9568222 and a decline in FEV1/FVC function (p = 0.02) after adjustments for covariates. Both SNPs are located in introns within SETDB2 which were tagged to account for possible genetic effects either side of the PHF11 gene (see table E1 in the online supplement for SNP locations).
After accounting for multiple testing, no statistically significant association was found between any SNP within PHF11 and doctor-diagnosed asthma, atopy, eNO level, eosinophil count, FEV1, FEV1/FVC ratio, dose-response slope to methacholine or longitudinal change in lung function in either children or adults. In particular, there was no association between PHF11 and total serum IgE levels, the quantitative trait first used to map the gene18 and the reason that PHF11 is often quoted as a potential asthma gene.
The potential role of PHF11 in the pathogenesis of asthma is uncertain, but recent research is shedding light on the possible role of PHF11 and atopy. PHF11 encodes the gene NY-REN-34, originally identified in patients with renal cell carcinoma, and contains two plant homeodomain zinc finger proteins. Evidence so far suggests a role in protein binding and/or homodimerisation.18 Expression of PHF11 was found to be higher in Th1 cells than in Th2 cells, and it is theorised that PHF11 may act in concert with the NF-κB pathway as a transcriptional activator.21 In atopic individuals there tends to be an imbalance in Th1 and Th2 cells, favouring Th2.21 The evidence suggests that the PHF11 gene may regulate factors which encourage the formation of Th1 cells and that mutations within PHF11 will result in the preferential activation of Th2 cells and thus will be more common in individuals displaying atopic tendencies. Despite these links with the pathophysiology of atopy, PHF11 was identified as a potential asthma candidate gene through a positional approach rather than because of a known biological mode of function.
To date, one study has reported statistically significant associations between PHF11 and IgE serum levels and asthma,18 a subsequent study reported an association between PHF11 and asthma,19 and one study has reported an association between PHF11 and atopic dermatitis.36 The observed association between PHF11 and IgE, first reported by Zhang et al,18 was not replicated by Hersh et al.19 Hersh et al19 demonstrated an association with only one SNP, which displayed a statistically significant association to total serum IgE levels (rs9568232; p = 0.03 in one study population only). The level of marginal (0.05⩾p⩾0.03) replication demonstrated by Hersh et al was only noted at the gene level (ie, no single SNP associations were replicated across the two populations studied for PHF11).19 A difference in results may also be attributed to differences in phenotypic definition between the studies.
It is of interest to note that the population used by Zhang et al18 for the primary mapping of PHF11 and the two flanking genes came from 80 nuclear families recruited in 1992 from the town of Busselton, the same town from which participants in the BHS were drawn. The population was selected to be informative for atopy and had to include families with atopic and non-atopic individuals. To the best of our knowledge, there is no overlap between the population in our study (derived from the 1994/5 BHS follow-up study) and that of Zhang et al.18 For the current study, participants were derived from an unselected general population sample. In contrast, the sample of families from the 1992 study reported by Zhang et al18 were selected to be informative for asthma; this may in part account for the different results observed.
A summary of genetic association analysis of SNPs examined in the four studies published to date (including this study) is shown in table 3.
The results of association with atopic dermatitis have been included as asthma and atopic diseases may share similar genetic pathways.37 The SNP rs1046295 looks to be the most promising SNP as it has previously been significantly (p<0.05) associated with asthma,18 IgE levels18 and atopic dermatitis36 in three separate populations. However, no statistically significant association was demonstrated in the current study.
Owing to the marked heterogeneity of the asthma phenotype, additional phenotypic parameters were analysed in our two populations and it was thus necessary to adjust for multiple testing. All the chosen phonotypes are related to asthma and may potentially allow further characterisation of genetic effects. Multiple testing in this study was adjusted for using the false discovery rate method and adjusts for the number of SNPs within each phenotype, thus the number of phenotypic end points examined will not affect the overall multiple adjustment. Multiple analyses were not taken into account by Hersh et al.19
Allele frequencies were very similar in the WATCH and BSH populations (see table E1 in the online supplement). Comparison of MAFs in our two populations and other reported populations18 19 presents greater difficulty, as the SNPs used for analysis have not been consistent across studies.
A post hoc power calculation indicated that, assuming an α value of 0.05 under a dominant model, our study had 80% power to detect a true odds ratio of ⩾1.46 for asthma between cases and controls for an SNP with a MAF⩾5%. The lowest MAF for an SNP in our study was 3.0% (rs9596127). Assuming a MAF of 3.0%, our study had 68% power to detect an odds ratio of 1.4 and 85% power to detect an odds ratio of 1.5. In comparison, Hersh et al reported 80% power to detect an odds ratio of 1.66 for a MAF of ⩾10% within the Costa Rica population and an ability to detect a lower odds ratio (not specified) within the Childhood Asthma Management Program (CAMP) population.19 Our study therefore suggests that PHF11 is unlikely to contain a major locus modulating asthma risk in these study populations of predominantly Western European origin.
Failure of secondary studies to replicate positive results of an initial study is not uncommon in the genetic association literature, and there are numerous reasons cited as to why replication may fail.22 23 38 39 These include (but are not limited to) small sample sizes resulting in insufficient power to detect minor genetic influences, poor study design, the analysis of different variants in replication studies, publication bias, known or unknown differences between populations, the heterogeneous nature of complex diseases and, more specifically, the differing definitions of what constitutes a positive outcome of asthma and atopy.22 40 There are estimates that, of the total number of studies which initially report a positive finding, 70–95% cannot be confirmed in subsequent studies.41 This highlights the need for careful confirmatory studies in independent populations of sufficient sample size. Positive publication bias is also likely to be a factor in the reporting of candidate gene studies22 and may be a factor in the reported positive association with PHF11. Associations between variations in susceptibility genes and asthma or asthma phenotypes will be difficult to identify as each individual gene will probably be associated with a modest effect on phenotype.42
This study fulfils the major criteria for a meaningful replication of genetic association studies23 and has the following strengths: (1) PHF11 was comprehensively tagged to capture common genetic variation across the gene; (2) this is a large internally replicated genetic association study of samples totalling 2315 subjects of European ancestry genotyped for 20 PHF11 SNPs; and (3) the subjects were homogeneous and well characterised. Asthma was defined similarly in the current study and that of Zhang et al (questionnaire-based). However, the criteria used by Hersh et al19 to define asthma was more comprehensive and included methacholine hyper-responsivness (hyper-responsivness was defined as a provocative concentration of methacholine causing a 20% fall in FEV1 of ⩽12.5 mg/ml).19
In conclusion, this replication study suggests that PHF11 is not a major susceptibility locus for asthma in our populations. Further attempted replication in different populations will be important, especially in populations of non-European ethnicity. The current study does not preclude the possibility that PHF11 variants interact with environmental factors or act epistatically with genetic variation at other loci to affect disease or trait outcomes.
The authors thank the people of the Busselton community for their participation in this study and the participants involved in the WATCH for Asthma study; the Busselton Population Medical Research Foundation and the many colleagues who assisted in the collection of these data; the generous support for the 1994/5 follow-up study from Healthway, Western Australia. The Busselton population studies are supported by The Great Wine Estates of the Margaret River region of Western Australia.
▸ Additional details are published online only at http://thorax.bmj.com/content/vol64/issue7
Funding: The authors acknowledge the assistance of the Western Australian DNA Bank (NHMRC Enabling Facility) with DNA samples for this study and the support provided by the Western Australian Genetic Epidemiology Resource (NHMRC Enabling Facility). They also gratefully acknowledge the support of the WA Health Promotion Foundation (Healthway) for providing the initial funds, allowing the establishment of the WATR, the Wind-Over-Water Foundation and the Australian National Health and Medical Research Council (NHMRC) for providing ongoing funding.
Competing interests: None.