Introduction

Mutations in genes involved in mitochondrial protein synthesis have been found to associate with a diabetic state. The 3243A>G mutation in the mitochondrial tRNA (Leu, UUR) gene, which impairs mitochondrial protein synthesis, is strongly diabetogenic, as are some additional mutations in mtDNA-encoded tRNA genes.1, 2 Recently, we found that an H324Q variant in the nuclear-encoded mitochondrial leucyl tRNA synthetase (LARS2) results in an increased type II diabetes mellitus (T2DM) susceptibility.3 It seems that a change in the activity of mitochondrial protein synthesis makes the organism more vulnerable to develop glucose intolerance.

A change in mitochondrial protein synthesis is expected to result in an unbalance in the stoichiometry of the proteins composing the respiratory chain. Some proteins of the respiratory chain are synthesized through the mitochondrial protein synthesis machinery, whereas the others are synthesized in the cytosol and imported into the mitochondrion. Therefore, impaired activity of the mitochondrial protein synthesis machinery is expected to result in an unbalance between the nuclear DNA- and mitochondrial DNA-encoded proteins of the respiratory chain. A similar situation of an unbalance may arise when the rate of mitochondrial biogenesis is altered.

In T2DM, a reduced activity of the respiratory chain is seen, which is also present in the first degree relatives.4, 5 The biochemical basis of this phenomenon may arise through an unbalanced expression of the various proteins. On the basis of these considerations, we hypothesized that polymorphisms in genes involved in mitochondrial protein synthesis and mitochondrial biogenesis may alter the balanced expression of proteins of the respiratory chain. As a result, these polymorphisms are expected to modify the risk for T2DM. It is estimated that the complete mitochondrial proteome consists of approximately 1500 proteins, which are mostly encoded by the nuclear genome.6 Common variations in genes encoded by the mitochondrial genome are not associated with the onset of T2DM.7, 8 Based on a literature study, we selected 13 candidate genes divided in several clusters. These candidates are all nuclear genes encoding for mitochondrial proteins.

Owing to the previous association of LARS2 with T2DM, the first cluster consisted of the mitochondrial aminoacyl-tRNA synthetases DARS2, TARS2 and IARS2. DARS2 and TARS2 are located on a widely replicated T2DM linkage locus on chromosome 1q.9 Furthermore, variants in the tRNAIle gene were shown to be associated with metabolic disorders.10

The second cluster consisted of the two mitochondrial translation initiation factors, MTIF2 and MTIF3. These two genes are involved in initiation of the mitochondrial protein synthesis and their dysfunction could lead to altered expression of mitochondrial proteins, and subsequently to oxidative stress and late-onset disease.11, 12, 13 We have selected them above other important translation factors, like TUFM, TSFM and GFM1, because mutations in the latter have already been associated with severe early-onset disease.14

The third cluster consisted of MTO1, TRMU and GTPBP3, which encode tRNA modifying enzymes. Variation in these genes could lead to dysfunctional enzymes and subsequently result in impaired tRNA modification and mitochondrial disease.15, 16, 17

In our fourth cluster, we included TFAM and PPRC1, which are involved in transcription and replication of the mitochondrial genome. It has been shown earlier that a low mitochondrial copy number is associated with T2DM, although this is not confirmed in all studies.18, 19 TFAM is one of the main regulators of mitochondrial transcription and replication, and thus of the mitochondrial copy number.20, 21 PPRC1 is a PPARGC1A-related gene and involved in mitochondrial biogenesis.22

In addition, we selected three genes, which did not fit in a specific cluster but may affect mitochondrial biogenesis and function. SIRT3 and SIRT4, which are Sir2 homologs, are involved in mitochondrial function and insulin secretion.23, 24 Moreover, SIRT3 has been associated with longevity.25 GPAM is the final candidate gene, which is involved in triacylglycerol and phospholipid synthesis. GPAM knockout mice show hyperinsulinaemia and reduced glucose tolerance.26, 27 Therefore, we hypothesized that defects in GPAM could lead to insulin resistance and subsequently to T2DM.

Dysfunction of these 13 candidate genes potentially affects mitochondrial function and may therefore increase T2DM susceptibility. Therefore, the aim of this study was to analyze the association of common variants in these gene loci with T2DM, using a tagging single nucleotide polymorphism (SNP) approach.

Materials and methods

Single nucleotide polymorphism selection

Tagging SNPs were selected using the HapMap database (www.HAPMAP.org) and Tagger.28, 29 The threshold for the minor allele frequency (MAF) was set at 0.05. Using Tagger, 62 SNPs were selected, which cover all common variation in the selected genes according to HapMap data (phase 2, April 2007 (Build 36), population CEU, r2>0.8). Aggressive tagging with two- and three-marker tests was used. This resulted in four multimarker tests. The remaining tests were all single marker tests. Gene boundaries include 5′ and 3′-UTR regions. Details about the SNP selection for the individual genes, including gene boundaries, are summarized in Table 1.

Table 1 Gene boundaries and number of SNPs in the selected candidate genes

Genotyping and quality control

A two-stage design was used. For first-stage genotyping, the Sequenom platform (Sequenom, San Diego, CA, USA) was used. First, the allelic discrimination plots were visually observed for good clustering. Assays with bad clustering were not further analyzed. SNPs with a success rate below 95% or not obeying Hardy–Weinberg Equilibrium (P<0.01) were excluded from analysis. SNPs, which did not fit in the Sequenom assay or failed quality control (QC), were genotyped with Taqman SNP genotyping assays (Applied Biosystems, Foster City, CA, USA) and evaluated with the same QC guidelines. Duplicate samples (5%) showed identical genotypes. SNPs showing evidence for association with T2DM (P<0.05) were selected for second-stage genotyping. Taqman SNP genotyping assays were used for this.

Study cohorts

The first-stage genotyping was performed in the Hoorn study.30 This population-based study consisted of 519 normal glucose tolerant (NGT) participants (aged 65±8 years, 55% male) and 480 T2DM participants (aged 67±8 years, 52% male). Glucose tolerance was tested using a fasting oral glucose tolerance test (OGTT), according to World Health Organization (WHO) criteria.31

For the second-stage genotyping, three cohorts from the Netherlands were pooled. The first cohort was the New Hoorn Study (NHS), which is an ongoing, second non-overlapping population-based study in Hoorn.32 From this study, we randomly selected 1517 NGT and 147 T2DM participants. All participants underwent an OGTT according to WHO criteria.31 To increase power, we included 674 participants with T2DM from the diabetes clinics of the Leiden University Medical Center (LUMC, Leiden, The Netherlands) and from the Vrije Universiteit medical center (VUmc, Amsterdam, The Netherlands). All participants were Dutch caucasians. In total, the NHS sample included 1517 controls (aged 53±7 years, 44% male) and 821 cases (aged 61±11 years, 50% male).

The second cohort was the Breda study.33, 34 This case–control study consisted of 920 healthy controls (according to self report) from the Dutch blood bank (aged 48±13 years, 61% male). 501 cases (aged 71±10 years, 46% male) were T2DM patients according to WHO criteria.31

The third cohort was the ERGO study from Rotterdam.35 This population-based study consisted of 5183 NGT participants (aged 69±9 years, 41% male) and 1222 (73±9 years, 39% male) T2DM patients.

In total, for the second-stage genotyping, we used 7620 controls and 2544 T2DM cases. The characteristics of both stages are summarized in Table 2 (Characteristics of the independent cohorts are summarized in Supplementary Table S1.). For additional replication of our strongest signal, we extended the second stage with samples from the Danish Steno study.36 This case–control study consisted of 514 NGT controls (aged 57±10, 46% male) randomly selected from public registers at the Steno Diabetes Center and the Research Center for Prevention and Health. The 706 cases (aged 59±10 years, 48% male) were recruited from the Steno Diabetes Center.

Table 2 Descriptive statistics of the first and second stage

Statistical analysis

Differences in genotype distribution and allele frequency were analyzed using a χ2 test. An additive model was the best fit for most of the SNPs. rs1049432 (TFAM) and rs4917960 (PPRC1) might better fit a recessive model, and were therefore also analyzed with this model. A common allelic odds ratio (OR) covering both stages was calculated in a meta-analysis using a Mantel–Haenszel test. Homogeneity of ORs between the different cohorts was tested using a Tarone's test. Results from OGTT (only NGT participants) were analyzed with univariate analysis of variance, using additive, dominant and recessive models, and correction for age, BMI and gender, as possible confounders. Power calculations were performed using Quanto.37 Power to detect modest effects in the first stage was limited (80% power to detect, OR 1.45, MAF=0.1). Therefore, we did not correct for multiple testing but rather performed extensive replication of all signals from the first stage with P<0.05. All statistics were calculated using SPSS 14.0 (SPSS Inc., Chicago, IL, USA).

Results

Of the 62 SNPs tested in the first-stage genotyping, 58 SNPs passed QC guidelines. Of these SNPs, we selected the top seven SNPs for second-stage genotyping (P<0.05). Data from genome-wide association studies (GWAS), which became available after completion of our study, resulted in the inclusion of two additional SNPs, rs4397793 in TFAM and rs2792751 in GPAM. Compared with our data, both showed a similar effect in at least one of the publicly available databases of GWAS.38, 39 In total, nine SNPs were selected for genotyping in the second stage (Table 3).

Table 3 Results of first and second-stage genotyping

We analyzed the second-stage results for rs1049432 in TFAM and rs4917960 in PPRC1 first with a recessive model, because that was the best fit for the first-stage data (Prec=0.002 and Prec=0.07, respectively). However, second-stage results showed no support for this model; therefore, the additive model was used for further analyses. There was no evidence of heterogeneity of ORs between the independent cohorts; therefore, we pooled all the second-stage data. A meta-analysis of both stages was performed to calculate a common OR for the nine selected SNPs (Table 3). Only the G allele of rs2522138 in SIRT4 remained significant (P=0.01), but this was mainly caused by first-stage results. To confirm the observed association, we expanded the second stage with the Danish Steno study resulting in a common OR of 0.92 (0.85–1.00), P=0.06.

The C allele of rs4917960 in PPRC1 was borderline significant after the meta-analysis (OR=1.06 (1.00–1.13) P=0.06). This association was nominal and mainly caused by first-stage data. The ORs in all independent cohorts are shown in Supplementary Table S2. Correction for age, gender and BMI (when available) did not influence the results.

Finally, we analyzed OGTT data for association of the selected genes with differences in clinical variables like glucose and insulin. Furthermore, association with BMI was assessed when the data were available. No differences were observed using different models (data not shown).

After completion of our study, a meta-analysis of three T2DM GWAS appeared.40 The coverage of common variation in the 13 selected genes in the GWAS was between 80 and 100% except GTPBP3, which was covered for 50%. We compared our results with the GWAS data, using the OR-based analysis (Table 4). The G allele of rs2522138 in SIRT4 was not associated with T2DM susceptibility in the GWAS data (OR=1.02 (0.93–1.11), P=0.72). The C allele of rs4917960 in PPRC1 showed an OR of 1.04 (0.97–1.12), (P=0.23) in the GWAS data, which is in the same direction as observed in our study (OR=1.06 (1.00–1.13), P=0.06). Two SNPs located in IARS2 (the C alleles of rs17007135 and rs2289191) showed evidence of nominal association in the GWAS meta-analysis (P=6 × 10−4 and P=0.003, respectively). Unfortunately, the assays for these two SNPs failed in our study. None of the other SNPs in our 13 genes showed evidence of association in the T2DM GWAS meta-analysis.

Table 4 Comparison of the results of the T2DM GWAS meta-analysis and our replication SNPs

Discussion

We analyzed 58 tagging SNPs in 13 genes involved in mitochondrial function for association with T2DM susceptibility using a two-stage design. After meta-analysis of the data of both stages, only rs2522138 in SIRT4 remained borderline significant. However, after extension of the second stage with samples from the Steno study, this association did not remain significant.

For several gene variants measured in our study, there is substantial heterogeneity of ORs between the first and second stage, but not within each stage (Table 3 and Supplementary Table S2). Heterogeneity between the independent cohorts might be caused by the differences in the age, gender or selection criteria used, and might obscure true association (Supplementary Table S1). However, we were unable to identify the reasons for this heterogeneity in our study. Differences in LD between the measured and causal SNP are also an unlikely cause, because the study participants in our cohorts are from similar ancestry.41 In the context of testing multiple SNPs, our nominal significant results in SIRT4 and PPRC1 should be interpreted as consistent with statistical noise.

One of the limitations of our study is that it is statistically underpowered to detect SNPs with a very small impact or low allele frequency. Although power in the first stage alone was low, we had at least a power of 80% at an observed MAF0.13 to detect ORs comparable to those recently reported in GWAS (1.12–1.37) when first and second stage are combined.40 Our negative results for these 13 genes are confirmed by the DIAGRAM GWAS meta-analysis, which has a much higher power to detect also more modest effects.38, 39, 40, 42 Taken together, we can exclude that common variants in the selected genes have a major impact on T2DM susceptibility. Rare variants are not covered by our approach; therefore, we cannot exclude a role for such variants. Deep sequencing and analysis of novel rare variants in large well-phenotyped cohorts should provide more insight into these important issues.

Other nuclear-encoded mitochondrial genes are also not among the top hits of reported T2DM GWAS.38, 39, 40, 42, 43 Nevertheless, earlier studies suggested that they may be involved in T2DM susceptibility.3, 44, 45, 46, 47 Further combined and detailed analysis of the available GWAS datasets, and analysis of copy number and so-called low-frequency intermediate-penetrance variants are necessary to fully explore the role of nuclear-encoded mitochondrial genes in the pathogenesis of T2DM.

In conclusion, we were unable to detect association of common SNPs in 13 nuclear-encoded mitochondrial candidate genes with T2DM or with related continuous traits.