Article Text

Original research
Genome-wide association study of non-tuberculous mycobacterial pulmonary disease
Free
  1. Jaeyoung Cho1,
  2. Kyungtaek Park2,
  3. Sun Mi Choi1,
  4. Jinwoo Lee1,3,
  5. Chang-Hoon Lee1,
  6. Jung-Kyu Lee4,
  7. Eun Young Heo4,
  8. Deog Kyeom Kim3,4,
  9. Yeon Joo Lee5,
  10. Jong Sun Park3,5,
  11. Young-Jae Cho5,
  12. Ho Il Yoon3,5,
  13. Jae Ho Lee3,5,
  14. Choon-Taek Lee3,5,
  15. Nayoung Kim3,6,
  16. Kyu Yeong Choi7,
  17. Kun Ho Lee7,8,9,
  18. Joohon Sung10,11,
  19. Sungho Won2,10,11,
  20. Jae-Joon Yim1,3
  1. 1 Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Seoul National University Hospital, Jongno-gu, Seoul, Korea (the Republic of)
  2. 2 Interdisciplinary Program of Bioinformatics, Seoul National University College of Natural Sciences, Seoul, Korea (the Republic of)
  3. 3 Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea (the Republic of)
  4. 4 Division of Pulmonary and Critical Care Medicine, Seoul Metropolitan Government–Seoul National University Boramae Medical Center, Dongjak-gu, Seoul, Korea (the Republic of)
  5. 5 Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea (the Republic of)
  6. 6 Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea (the Republic of)
  7. 7 Gwangju Alzheimer’s disease and Related Dementia Cohort Research Center, Chosun University, Gwangju, Korea (the Republic of)
  8. 8 Department of Biomedical Science, Chosun University, Gwangju, Korea (the Republic of)
  9. 9 Aging Neuroscience Research Group, Korea Brain Research Institute, Daegu, Korea (the Republic of)
  10. 10 Department of Public Health Sciences, Seoul National University Graduate School of Public Health, Seoul, Korea (the Republic of)
  11. 11 Seoul National University Institute of Health and Environment, Seoul, Korea (the Republic of)
  1. Correspondence to Dr Jae-Joon Yim, Department of Internal Medicine, Seoul National University College of Medicine, Seoul 03080, Korea (the Republic of); yimjj{at}snu.ac.kr; Professor Sungho Won, Department of Public Health Sciences, Seoul National University Graduate School of Public Health, Seoul, Korea (the Republic of); won1{at}snu.ac.kr

Abstract

Background The prevalence of non-tuberculous mycobacterial pulmonary disease (NTM-PD) is increasing in South Korea and many parts of the world. However, the genetic factors underlying susceptibility to this disease remain elusive.

Methods To identify genetic variants in patients with NTM-PD, we performed a genome-wide association study with 403 Korean patients with NTM-PD and 306 healthy controls from the Healthy Twin Study, Korea cohort. Candidate variants from the discovery cohort were subsequently validated in an independent cohort. The Genotype-Tissue Expression (GTEx) database was used to identify expression quantitative trait loci (eQTL) and to conduct Mendelian randomisation (MR).

Results We identified a putatively significant locus on chromosome 7p13, rs849177 (OR, 2.34; 95% CI, 1.71 to 3.21; p=1.36×10−7), as the candidate genetic variant associated with NTM-PD susceptibility. Its association was subsequently replicated and the combined p value was 4.92×10−8. The eQTL analysis showed that a risk allele at rs849177 was associated with lower expression levels of STK17A, a proapoptotic gene. In the MR analysis, a causal effect of STK17A on NTM-PD development was identified (β, −4.627; 95% CI, −8.768 to −0.486; p=0.029).

Conclusions The 7p13 genetic variant might be associated with susceptibility to NTM-PD in the Korean population by altering the expression level of STK17A.

  • atypical mycobacterial infection

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Key messages

What is the key question?

  • Is there genetic predisposition to non-tuberculous mycobacterial pulmonary disease (NTM-PD)?

What is the bottom line?

  • Through a genome-wide association study (GWAS) we identified a putatively significant locus on chromosome 7p13, rs849177, which is possibly associated with susceptibility to NTM-PD in Korean populations by altering the expression levels of the proapoptotic STK17A gene.

Why read on?

  • This first GWAS of NTM-PD with functional annotations helps offer insights into the pathogenesis of this disease.

Introduction

Non-tuberculous mycobacteria (NTM) are ubiquitous environmental organisms with a broad virulence spectrum. The incidence and prevalence of NTM pulmonary disease (NTM-PD) are increasing in many parts of the world,1 and especially in South Korea where the annual prevalence of NTM infections has increased by more than fivefold over the last decade, reaching almost 40 cases/100 000 of the population in 2016.2

Disseminated NTM infections occur in patients with primary immunodeficiencies3 as well as acquired immunocompromised states.4 Previous studies have reported that Mendelian susceptibility to disseminated NTM infections is associated with gene mutations in the interleukin-12 dependent interferon-γ pathway.3 5 However, NTM-PD develops in individuals without known immune defects, but typically in thin, postmenopausal women, sometimes with pectus excavatum, scoliosis or mitral valve prolapse.6 As only a small subset of people get NTM-PD despite widespread exposure to NTM, the presence of the distinct phenotype,6 familial clustering7 and the ethnic disparity8 of NTM-PD strongly suggest the presence of genetic predispositions to NTM-PD.

Unfortunately, genetic factors related to susceptibility to isolated NTM-PD remain elusive.5 Although a whole-exome sequencing (WES) study of Caucasian patients with NTM-PD proposed that NTM-PD is a multigenic disease resulting from protein-affecting variants in immune, cystic fibrosis transmembrane conductance regulator, cilia and connective tissue genes in combination, it did not identify any specific underlying mutations.9 Recently, two different WES studies identified macrophage stimulating 1 receptor (MST1R)10 and TTK protein kinase (TTK)11 as genetic risk factors for NTM-PD. However, only a small proportion of patients with NTM-PD could be explained by these variants, and these associations have not been replicated in other populations.5

Here, we carried out a genome-wide association study (GWAS) to elucidate genetic susceptibility to NTM-PD in the Korean population where NTM-PD is rapidly increasing.2

Methods

Discovery cohort

Altogether, 412 patients with NTM-PD who participated in the ongoing prospective NTM cohort12 from 1 July 2011 to 28 February 2017 at Seoul National University Hospital (SNUH) were included in this study. Diagnosis of NTM-PD was made based on the criteria of the American Thoracic Society/Infectious Diseases Society of America guidelines (online supplemental text).13 Controls were selected from the Healthy Twin Study, Korea (HTK) cohort consisting of adult twin pairs and their relatives.14 In the HTK cohort, 1056 participants from 364 families were genotyped using the same platform as that used for the discovery cohort cases. Among them, we randomly selected 494 participants who were one twin from each twin pair or their relatives.

Supplemental material

Replication cohort

To validate genome-wide significant single-nucleotide polymorphisms (SNPs) in the discovery cohort, 187 patients with NTM-PD were enrolled at the SNUH, Seoul National University Bundang Hospital (SNUBH) and Seoul Metropolitan Government-Seoul National University Boramae Medical Center from 16 January 2017 to 20 November 2017. Controls were 500 healthy family members of patients with gastric cancer from SNUBH and 1195 healthy and cognitively normal participants from the Gwangju Alzheimer’s disease and Related Dementia Cohort Research Center.15

All participants provided written informed consent and the study was conducted in accordance with the tenets of the Declaration of Helsinki.

GWAS with the discovery cohort

Both cases and controls in the discovery cohort were genotyped by the Illumina HumanCoreExome-12 V.1.0A on 298 930 variants. The first quality control (QC) stage was performed on cases and controls separately (online supplemental text and figure E1). After merging the cases and controls, the second QC stage was performed as suggested by Anderson et al.16 The second QC stage involved two steps. First, participants were removed where (1) the call rates were less than 97%, (2) the differences in the heterozygosity rates from the means were larger than three times the SD and (3) the estimates of identity-by-descent with other participants were larger than 0.185, with the highest missing rates to remove participants with first-degree or second-degree relationships. Second, variants were removed when (1) the missing rates differed significantly between the cases and controls (p<0.00001), (2) the minor allele frequencies (MAFs) were smaller than 5%, (3) the call rates were less than 97% and (4) the p values for the Hardy-Weinberg equilibrium (HWE) exact test were smaller than 0.001.

After the second QC stage, we imputed the remaining 211 744 variants from the 709 participants using the Michigan Imputation Server (V.1.0.2) and its established pipeline. Haplotype Reference Consortium (HRC, r1.1 2016) was selected as the reference panel, Eagle (V.2.3) as the phasing programme, and Asian as the QC population.17 The HRC panel consisted of 64 976 haplotypes and 32 611 samples from 20 different studies including the UK10K and 1000 Genomes Project.18 Only variants with MAFs larger than 0.05, p values for the HWE exact test larger than 0.001, and R2 values larger than 0.3 were analysed. Unknown body mass index (BMI) values for two patients with NTM-PD were imputed using the missForest method (R-package: missForest, V.1.4) for the following analyses.19

We conducted a logistic regression adjusting for age, age2, sex and BMI. To adjust for population substructures, we calculated principal component (PC) scores with pruned variants. The 10 PC scores corresponding to the 10 largest eigenvalues were then included as covariates.20 PLINK (V.1.90b3.44)21 or R (V.3.4.4) was used for all steps in the second QC stage and association tests.

Validation with the replication cohort

In the replication study, genotyping was done with the Affymetrix Axiom Korean Chip (K-CHIP) available through the K-CHIP consortium. K-CHIP was designed by the Center for Genome Science, Korea, National Institute of Health, Korea (4845–301, 3000–3031), and the K-CHIP consists of 827 783 variants. We conducted the first QC stage on cases and controls separately (online supplemental text and figure E1). After combining the cases and controls, the same steps used in the discovery cohort were performed in the second QC stage, except that participants were removed when the differences in the heterozygosity rates from the means were larger than four times the SD. After QC, 282 302 variants from 1864 participants were imputed, and some variants were removed using the same process as that used for the discovery cohort (online supplemental figure E1). Logistic regression was conducted by the same process with all covariates, except BMI, used in discovery analysis. BMI was not available for the controls in the replication cohort.

Genome-wide complex trait analysis

SNP-heritability for NTM-PD was estimated using Genome-wide complex trait analysis (GCTA) (V.1.26.0) with genome-based restricted maximum likelihood.22 Only pruned variants with MAF larger than 0.1 were used for the estimation. We incorporated an estimate of the lifetime prevalence of NTM-PD in Korea as well as all covariates used for the GWAS except for the top 10 PC scores. The estimate of lifetime NTM-PD prevalence was 0.028, and was obtained by multiplying the point prevalence (39.6 cases/100 000 population in 2016)2 by the average life span (70 years).23

Expression quantitative trait loci and PrediXcan analyses

Genes associated with the variants of interest and their expression levels were analysed using the Genotype-Tissue Expression (GTEx) Portal V.7 (https://www.gtexportal.org/). PrediXcan was also used with GTEx V.7 PredictDB to predict the gene expression levels in lung tissue using our imputed genotype data.24 Linear regression was then conducted to compare the gene expression levels between cases and controls adjusting for the same covariates used in the GWAS.

Two-sample Mendelian randomisation analysis

We conducted a two-sample Mendelian randomisation (MR) analysis to identify the causal effect associated with the expression level of a candidate gene on the risk of NTM-PD development (online supplemental figure E2).25 We selected genetic variants that were significantly associated with the expression level of a candidate gene in lung tissue from the GTEx V.7 data set. Then, we pruned out variants if their pairwise correlations with others were larger than 0.2 in our discovery cohort. Using the pruned-in variants as instrument variables (IVs) along with summary statistics from analyses of the GTEx data set and the discovery cohort, we applied the MR-Egger regression adjusting for correlations between the IVs.25 The variant–NTM-PD and variant–candidate gene association estimates were obtained from our discovery cohort and the GTEx data set, respectively. To detect weak IVs explaining a relatively small proportion of variance in the expression level of a candidate gene,26 we also applied the MR-pleiotropy residual sum and outlier (MR-PRESSO) test using the same IVs and summary statistics.27 To adjust for correlations between the IVs, the CI and p value of the causal effect were estimated from 100 000 bootstrap replicates created by random sampling with replacement from the set of the selected variants before pruning. The same number of IVs were used for each MR-PRESSO test.

Transcriptome profiling analysis of NTM-infected macrophages

We conducted a transcriptome profiling analysis using GSE7282128 and E-MTAB-1101,29 two publicly available data sets. GSE72821 is an messenger RNA sequencing data set showing transcriptional events in THP-1-derived macrophages at 1, 4 or 24 hours post-infection with Smooth and Rough morphotypes of Mycobacterium abscessus. 28 We considered all 18 samples of macrophages infected with M. abscessus as the case group and 9 samples of uninfected macrophages as the control group. After genes whose expression level sums were smaller than one-tenth of the total sample size were removed, DESeq2 (V.1.14.1)30 was used to identify genes with differential expression between the case and control groups.

E-MTAB-1101, an expression microarray data set, can decipher transcriptional responses in human macrophages after infection with Mycobacterium avium. 29 The CD14+ cells obtained from six donors were allowed to mature into macrophages by incubation for 5 days before they were infected with two M. avium subsp hominissuis isolates and one M. avium subsp avium isolate for 24 hours or left uninfected. We considered all nine samples of infected and three samples of uninfected macrophages as cases and controls, respectively. After normalisation, we analysed the expression data with limma (V.3.36.2).31

Gene Ontology enrichment analysis

Differentially expressed genes (DEGs) were defined based on a false discovery rate (FDR) of 0.05 from analyses of two data sets (GSE72821 and E-MTAB-1101). We imported those DEGs into the Gene Ontology (GO) consortium website (https://geneontology.org/) and applied Fisher’s exact test.32 The results were based on the database as of 21 February 2020.

Common variant and gene set analyses of previously identified genes

To examine whether genes identified in previous studies were associated with NTM-PD in our discovery cohort, we tested 43 genes previously reported to be associated with NTM-PD.5 9–11 33 34 Their transcripts and 0.1 MB flanking regions were identified using the University of California Santa Cruz table browser (https://genome.ucsc.edu/cgi-bin/hgTables). We reported the common variant whose association with NTM-PD was strongest for each gene. In addition, we performed gene set analysis using the optimal sequence kernel association test.35

Results

Putatively significant genome-wide association

After imputation, a total of 4 877 242 variants from 403 cases and 306 controls were considered for a GWAS to identify the disease susceptibility loci of NTM-PD. The clinical characteristics of these individuals are shown in table 1 and online supplemental table E1. The multidimensional scaling (MDS) analysis showed the presence of genetic heterogeneity between the discovery cohort and other populations in the 1000 Genomes Phase 3 data (online supplemental text and figure E3A), but no overall genetic heterogeneity between cases and controls was identified in the discovery cohort (online supplemental figure E3B). Only one peak around a significance level of 1×10−7 was observed, ranging from 43.6 to 43.8 MB of chromosome 7 (figure 1A). Among variants located in this region, rs849177 (chromosome 7p13; chr7:43783788) was most significantly associated with NTM-PD (OR, 2.34; 95% CI, 1.71 to 3.21; p=1.36×10−7; table 2). This association was consistently observed without adjustment for BMI (p=2.85×10−7; online supplemental figure E4). The frequencies of the rs849177 risk allele (C) were 0.70 for the cases and 0.57 for the controls. The genotype frequencies for rs849177 were 0.49 (risk allele homozygotes (CC); 197 participants), 0.42 (risk allele heterozygotes (CT); 168 participants) and 0.09 (protective allele homozygotes (TT); 38 participants) for the cases, and 0.33 (CC; 100 participants), 0.49 (CT; 149 participants) and 0.19 (TT; 57 participants) for the controls (table 2). The quantile-quantile plot shows that at 1.03 the value of the genomic inflation factor is very close to 1; therefore, type-1 analysis errors were well controlled (figure 1B). Figure 1C shows that the other variants near rs849177 are highly correlated with rs849177. However, the conditional analysis on rs849177 did not identify any other significant SNPs (online supplemental figure E5). Furthermore, the estimated heritability of NTM-PD explained by the pruned variants was 0.225 (p=0.266).

Table 1

Participant characteristics and the number of variants in the discovery and replication cohorts

Figure 1

Plots showing putatively significant loci in the discovery cohort. (A) Manhattan plot. Red and blue lines indicate significance levels of 1.00×10−7 and 1.00×10−5, respectively. (B) Quantile-quantile plot. CIs for the estimated p values are 0.95 and are coloured in grey. (C) Regional plot showing genes near to rs849177. Purple indicates rs849177 and other colours represent the degree of correlation between the variants and rs849177. Chr, chromosome; GIF, genomic inflation factor.

Table 2

GWAS results for rs849177 in the discovery and replication cohorts

The association with rs849177 was consistent for patients with M. avium complex pulmonary disease (OR, 2.40; 95% CI, 1.71 to 3.37; p=4.30×10−7) and for those with M. abscessus complex pulmonary disease (OR, 2.14; 95% CI, 1.27 to 3.61; p=4.21×10−3; table 3). Although differences in the p values were primarily related to differences in the sample sizes, there were no significant differences in the associations of rs849177 with the two groups (p=0.769).

Table 3

Association of rs849177 with Mycobacterium avium complex pulmonary disease and Mycobacterium abscessus complex pulmonary disease in the discovery cohort

Validation of the candidate SNP

We next validated the association study’s findings using a replication cohort of 184 patients with NTM-PD and 1680 controls. The clinical characteristics of these individuals are shown in table 1 and online supplemental table E1. The MDS analysis showed that there was no overall genetic heterogeneity between the cases and controls in the replication cohort (online supplemental figure E3B). The nominal p value for rs849177 was 0.017 (OR, 1.29; 95% CI, 1.02 to 1.62; table 2). Moreover, as shown in table 2, the direction of effect of the variant was the same in both the discovery and replication cohorts. Using the Fisher’s method, the combined p value for the discovery and replication cohorts was 4.92×10−8. There were no significant variants other than rs849177 under the Fisher’s method (online supplemental table E2).

Three-dimensional structural analysis

Serine/threonine kinase 17a (STK17A), cytochrome c oxidase assembly factor 1 homolog (COA1) and biliverdin reductase A (BLVRA) were found in the same topologically associating domain as rs849177 (online supplemental text and figure E6).

Candidate gene associated with NTM-PD rs849177 significantly affects STK17A expression in GTEx lung tissue (p=7.1×10−19; online supplemental table E3). In GTEx tissues other than lung tissue, the expression levels of STK17A, COA1 and BLVRA were affected by rs849177 (online supplemental table E3). The expression quantitative trait loci analysis showed that individuals with C alleles at rs849177 tended to have lower STK17A expression levels in GTEx lung tissue (figure 2).

Figure 2

STK17A expression levels relating to the rs849177 genotypes in lung tissue from the Genotype-Tissue Expression database. The risk and protective alleles for rs849177 are C and T, respectively. The violin plot’s width indicates the sample density. Overlaid boxes indicate IQR and centre-lines are the median. CC, risk allele homozygotes; CT, risk allele heterozygotes; TT, protective allele homozygotes; STK17A, serine/threonine kinase 17a.

Next, lung tissue-specific gene expression levels were predicted using PrediXcan with our imputed genotype data. Consequently, STK17A was the most significantly differentially expressed gene between patients with NTM-PD and controls in the discovery cohort (p=5.00×10−7 and FDR=2.26×10−4; figure 3A), a result validated in the replication cohort (p=0.048; figure 3B). Additionally, the expression levels of STK17A did not differ between patients with M. avium complex pulmonary disease or M. abscessus complex pulmonary disease in the discovery cohort (p=0.829; figure 3C) and the replication cohort (p=0.634; figure 3D).

Figure 3

Expression levels of STK17A predicted by PrediXcan in patients with non-tuberculous mycobacterial pulmonary disease and controls in (A) the discovery and (B) replication cohorts, and in patients with Mycobacterium avium complex pulmonary disease and Mycobacterium abscessus complex pulmonary disease in (C) the discovery and (D) replication cohorts. The violin plot’s width indicates the sample density. Overlaid black dots and vertical lines represent the means and one SD intervals, respectively. FDR, false discovery rate; NTM-PD, non-tuberculous mycobacterial pulmonary disease; STK17A, serine/threoninekinase 17a.

Causal effect of candidate genes on NTM-PD development

In the MR analysis, five variants associated with the expression levels of STK17A in lung tissue from the GTEx data set were used as IVs: rs7790685 (chr7:43597398), rs1100407 (chr7:43756898), rs757859 (chr7:43885849), rs929380 (chr7:43982139) and rs59505066 (chr7:44108863). The effects of these variants on the expression level of STK17A were significantly negatively associated with the effects of the variants on NTM-PD development as shown using both the MR-Egger test ( β , −4.627; 95% CI, −8.768 to −0.486; p=0.029; figure 4) and the MR-PRESSO test ( β , −1.780; 95% CI, −2.974 to −1.618; p=0.049).

Figure 4

Plot of the variant–non-tuberculous mycobacterial pulmonary disease (NTM-PD) versus variant–STK17A expression regression coefficient for a Mendelian randomisation (MR) analysis with five variants. X and Y axes represent the effects of the variants on the expression level of STK17A and NTM-PD development, respectively. The MR-Egger regression estimate (Embedded Image ) is shown by a slope of red line. STK17A, serine/threonine kinase 17a.

Differential expression of the candidate gene in NTM-infected macrophages

We performed transcriptome profiling using two publicly available data sets to examine the expression levels of STK17A, BLVRA and COA1 in macrophages to determine whether their levels change on infection with NTM. The expression levels of STK17A increased significantly in both M. abscessus-infected macrophages (p=4.24×10−5 and FDR=7.94×10−4; figure 5A) and M. avium-infected macrophages (p=9.50×10−3 and FDR=3.76×10−1; figure 5B) compared with the uninfected macrophages at a nominal significance level of 0.05. However, neither BLVRA in the two data sets (figure 5C,D), nor COA1 in the RNA-seq data set (figure 5E) was differentially expressed between the infected and uninfected macrophages. The differential expression level of COA1 in the expression microarray data could not be determined because COA1 was not included.

Figure 5

Expression levels of STK17A (A and B), BLVRA (C and D) and COA1 (E) in Mycobacterium abscessus-infected or Mycobacterium avium-infected macrophages. A, C and E represent the results for GSE72821 (M. abscessus); B and D represent those for E-MTAB-1101 (M. avium). X and Y axes represent infected or uninfected macrophages and the expression levels of genes with log counts per million (logcpm) values or log2 expression levels, respectively. BLVRA, biliverdin reductase A; FC, fold change; FDR, falsediscovery rate; STK17A, serine/threonine kinase 17a.

Enrichment of apoptotic processes in NTM-infected macrophages

The number of significant DEGs was 2939 and 143 in GSE72821 and E-MTAB-1101, respectively. GO analysis showed marked enrichment of apoptotic signalling pathways in NTM-infected macrophages (online supplemental table E4).

Association study of genes identified in previous studies

In the common variant analysis, rs3111229 (chr1:211363628) of repressor element-1-silencing transcription factor (REST) corepressor 3 (RCOR3) was significantly associated with NTM-PD as shown by the family-wise error rate of 0.05. In the gene set analysis, nuclear factor of activated T cells 2 (NFATC2) was significantly associated with NTM-PD with a FDR of 0.05 (online supplemental table E5).

Discussion

Through GWAS, we identified rs849177 on chromosome 7p13 as a genetic variant associated with susceptibility to NTM-PD in a Korean population. The association was subsequently replicated in an independent cohort. In GTEx lung tissue, the rs849177 genotype was strongly correlated with the expression level of STK17A, a proapoptotic gene. Patients with NTM-PD were more likely to carry the risk allele at rs849177 associated with lower STK17A expression levels. Moreover, our findings using a MR approach supported the hypothesis that STK17A was causally related to risk of NTM-PD. Considering the transcriptome data showing upregulation of STK17A in macrophages on infection with NTM, our findings suggest that the 7p13 variant associated with lower expression of STK17A could increase susceptibility to NTM-PD by impairing apoptosis of NTM-infected macrophages.

Activated macrophages can kill intracellular mycobacteria by enhancing phagosomal maturation,36 induction of autophagy37 and apoptosis.38 Virulence of mycobacteria is largely attributed to their ability to survive inside the host’s immune cells, particularly alveolar macrophages.39 Apoptotic cell death, as part of the innate immune response, reduces mycobacterial viability. Host cell apoptosis is also important for the acquired immune response as apoptotic bodies containing mycobacterial antigens are phagocytosed by dendritic cells, which can lead to the presentation of such antigens and subsequent T cell activation.40 It has been observed that virulent but not avirulent strains of Mycobacterium tuberculosis inhibit apoptosis of human alveolar macrophages.41 This ability of virulent M. tuberculosis to inhibit apoptotic processes has been widely studied.41 42

STK17A encodes serine/threonine-protein kinase 17A, which is also known as the death-associated protein kinase-related apoptosis-inducing protein kinase (DRAK) 1.43 DRAK1 is located in the nucleus and acts as a positive regulator of apoptosis.44 Previous gene expression microarray data have shown that STK17A is upregulated in both M. tuberculosis-infected human macrophages45 and M. avium-infected bovine macrophages.46 The present study has also shown that STK17A is upregulated in human macrophages on infection with M. avium or M. abscessus by analysing the publicly available transcriptome data. Our data revealed that patients with NTM-PD had higher frequencies of the risk allele (C at rs849177) associated with lower STK17A expression. Given that apoptosis of alveolar macrophages infected with mycobacteria is crucial for human immunity, decreased STK17A gene expression might lead to reduced apoptosis of infected alveolar macrophages, allowing NTM to persist and replicate within the macrophages.

Cytochrome c oxidase assembly factor 1 homolog47 and biliverdin reductase A48 also play roles in apoptosis. However, unlike STK17A, the expression levels of COA1 and BLVRA in lung tissue were not correlated with the rs849177 genotype. Furthermore, our analysis of recent transcriptome data showed that COA1 and BLVRA were not differentially expressed in macrophages on infection with NTM, suggesting that they are not candidate NTM susceptibility genes associated with genetic variants on 7p13.

RCOR3, a gene associated with NTM-PD in gene-level analyses,11 was also significantly associated with NTM-PD in the present study. RCOR3 is a member of the REST corepressors and its role in the pathogenesis of NTM-PD requires further exploration. In addition, NFATC2 was significantly associated with NTM-PD in our gene set analysis. NFATc2 expression was downregulated in patients with NTM-PD.33 As NFATc2 regulates the expression of interferon-γ49 and tumour necrosis factor-α,50 repression of NFATc2 might be associated with the development of NTM-PD.

A previous study reported that the SNP-heritability of pulmonary tuberculosis was 0.15.23 As expected, the estimated heritability of NTM-PD calculated in our study was higher than that of pulmonary tuberculosis when the same method was used for estimating lifetime prevalence.

To appreciate the results of our study appropriately, its limitations have to be recognised. First, this study is moderately sized and could be underpowered, which might lead to failure in reaching the conventional genome‐wide significance level of 5×10−8. However, the combined p value from the Fisher method was 4.92×10−8, which satisfies the conventional significance level. Second, the controls for the discovery and replication cohorts appear to have different allele frequencies at rs849177. However, as shown in online supplemental figure E3B, there was no overall genetic heterogeneity between the cases and controls within each cohort. Third, although we incorporated the data from the transcriptome analysis, the actual role for rs849177 and STK17A in human immunity against NTM infection awaits direct determination. The roles played by epithelial cells and immune cells other than macrophages should also be considered.

In conclusion, we identified a genetic variant on 7p13 that alters the expression of a proapoptotic gene and might be associated with susceptibility to NTM-PD in a Korean population. These findings offer insights into the pathogenesis of NTM-PD and may help with the search for novel therapeutics.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • JC and KP contributed equally.

  • Contributors JJY and SW had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. JC, KP, JJY and SW contributed to the study concept and design, analysis and interpretation of data, and writing and revision of the manuscript. SMC, JL, CHL, JKL, EYH, DKK, YJL, JSP, YJC, HIY, JHL, CTL, NK, KYC, KHL and JS contributed to patient enrolment and data acquisition. All authors read and approved the final manuscript.

  • Funding This work was supported by the Seoul National University College of Medicine Research Fund (Grant number 0320160080).

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Ethics approval This study was approved by Institutional Review Board of Seoul National University Hospital (H-1605-111-763), Seoul National University Bundang Hospital (B-1610/366–303) and Chosun University (CHOSUN 2013-12-018-066).

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement Data are available upon reasonable request. Contact details: Jae-Joon Yim, MD, yimjj@snu.ac.kr.