Trans-Ethnic Fine-Mapping of Lipid Loci Identifies Population-Specific Signals and Allelic Heterogeneity That Increases the Trait Variance Explained

Download PDF České info

Genome-wide association studies (GWAS) have identified ∼100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1×10⁻⁴ in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3 -⁠ to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies.

Published in the journal: . PLoS Genet 9(3): e32767. doi:10.1371/journal.pgen.1003379
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1003379

Summary

Introduction

Genome-wide association studies (GWAS) have identified many common genetic variants associated with human diseases and complex traits (www.genome.gov/gwastudies), including ∼100 loci associated with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), or total cholesterol [1]–[5]. A majority of the lead SNPs at these loci have shown small effect sizes, leaving much of the trait heritability unexplained. Some of this missing heritability may be due to the incomplete coverage of functional common or rare variants and the poor representation of appropriate proxies on commercial genotyping arrays [6], [7]. Other missing heritability may result from a failure to detect the full spectrum of causative variants present at GWAS-identified loci.

Fine-mapping of GWAS signals should increase the power to detect variants that influence trait variability. Genotyping of additional variants at GWAS loci can identify SNPs with stronger evidence of association than the reported GWAS index SNPs and may help detect or further localize the underlying causal variants [7], [8]. The Metabochip is a high-density custom genotyping array designed to replicate and fine-map known GWAS signals for metabolic and atherosclerotic/cardiovascular endpoints, and more extensively, to identify all signals around the index SNPs [9], [10]. The fine-mapping SNPs spanned a wide range of allele frequencies including rare (minor allele frequency (MAF)<0.005) and less common (0.005≤MAF<0.05) SNPs selected from the catalogs of the International HapMap Project and the August 2009 release of the 1000 Genomes Project. SNPs annotated as nonsynonymous, essential splice site or stop codon were included regardless of MAF, design score, or the presence of nearby SNPs [10]. The Metabochip contains densely spaced SNPs at 18, 22, and 18 loci previously reported for TG, HDL-C, and LDL-C, respectively.

Allelic heterogeneity, in which different variants at the same gene/locus affect the same phenotype, is a frequent characteristic of both single-gene and complex disorders. Recently GWAS have identified more than one independent signal at loci associated with coronary artery disease [11] and type 2 diabetes [12], [13]. Among a set of 30 lipid loci reported through GWAS, secondary SNPs that exhibited weak to moderate LD with the corresponding index SNPs and displayed little change of association in conditional analyses were detected at seven loci including CETP, LIPC, APOA5, APOE, LDLR, ABCG8, and LPL [4]. More than one association signal also was detected at 26 of 95 lipid loci reported by the Global Lipids Genetics Consortium [5]. However, allelic heterogeneity has not been comprehensively evaluated for common traits including lipid traits across ethnically diverse populations, especially in non-European populations such as African Americans and East Asians.

Due to divergent evolutionary and migratory histories, patterns of linkage disequilibrium (LD) vary across ancestry groups [14]. Greater haplotype diversity in some ancestry groups, especially in African ancestry populations, may facilitate the localization of functional variants that show association signals delimited in part due to weaker LD with neighboring SNPs [14], [15]. A recent multi-ethnic analysis of lipid associated loci demonstrated that genetic determinants at many lipid loci differed between European Americans and African Americans [16]. For example, in African Americans from the PAGE consortium [9], [17], a reported regulatory variant rs12740374 at CELSR2/PSRC1/SORT1 locus [18] was more strongly associated with LDL-C compared to many nearby variants demonstrating similar strength of association in European ancestry individuals [5]. High-density genotyping enables trans-ethnic fine-mapping studies to narrow the set of plausible candidate functional variants at GWAS loci without introducing uncertainty through imputation [19].

In this study, we analyzed high-density genotyped SNPs on the Metabochip for their associations with TG, HDL-C, and LDL-C in 6,832 African Americans, 9,449 East Asians, and 10,829 Europeans at 58 known lipid loci. We sought to (i) identify the variants with the strongest evidence of association at each locus in populations with different ancestries and in the combined trans-ethnic samples; (ii) investigate allelic heterogeneity and population-specific signals at the established lipid loci; (iii) explore whether high-density genotyping in diverse ethnic populations would narrow the sets of plausible candidate functional variants for further study; and (iv) assess whether the variants reported to have functional effects on gene expression or protein function during the past 30 years of biological study exhibited the strongest evidence of association at the corresponding GWAS signals.

Results

Loci with evidence of association in diverse populations and in the combined trans-ethnic samples

Descriptions of the collection, phenotyping, and genotyping of study samples for each study site are provided in Table S1. Given that all 58 loci have a priori genome-wide significant evidence of association with one or more of these three lipid traits, we used a P value threshold of 1×10⁻⁴ as an approximate correction for the mean of 451 SNPs tested at each locus in African Americans (Table S2). An average of 273 SNPs per locus was tested in East Asians and an average of 291 in Europeans, but we applied the same, more conservative, P value threshold of 1×10⁻⁴ to these two groups as well.

A total of 33 loci (nine for TG, 14 for HDL-C, and 10 for LDL-C) exhibited evidence of association at P<1×10⁻⁴ in at least one of the three ancestry groups, including 22 loci in African Americans, 17 in East Asians, and 31 in Europeans (Table S3A–S3C). The variants that reached this threshold of significance were common (MAF≥0.05), except at three loci (PCSK9 and ABO for LDL-C, and APOA5 for HDL-C) in African Americans and two loci (PCSK9 and TOP1, both for LDL-C) in European ancestry individuals. When individuals of diverse ancestry groups were combined, 11, 15, and 12 loci showed evidence of significant association with TG, HDL-C, and LDL-C, respectively (Table S4A–S4C). Among these 38 loci, six loci had not reached the P value threshold of 10⁻⁴ within any individual ancestry group, including CETP and NAT for TG, GALNT2 and MMAB for HDL-C, and TRIB1 and TIMD4 for LDL-C. One locus, COBLL1, was significantly associated with HDL-C in Europeans alone (P = 8.5×10⁻⁵), but displayed less evidence of association in the combined trans-ethnic samples (P = 1.6×10⁻⁴).

Loci with evidence of multiple signals at a locus, and often population-specific signals

To assess the presence of two or more signals at each locus that exhibited evidence of association in at least one ancestry group, we performed sequential conditional analyses by adding the most strongly associated SNP to the regression model as a covariate and testing the association with each of the remaining regional SNPs independently. A set of sequential conditional analyses were followed by inclusion of the strongest SNP in each conditional model until the most strongly associated SNP showed a conditional P value>10⁻⁴ and was not annotated as a nonsense or nonsynonymous substitution. We also investigated whether association signals were population-specific, which we defined as association signals with variants that are not variable in the samples from the other two ancestry groups in this study or in the 1000 Genomes Project populations that represent those groups among total European ancestry (EUR), total East Asian ancestry (ASN), or total west African ancestry (AFR).

In African Americans, sequential conditional analyses revealed that 10 of the 22 loci with evidence of association exhibited two or more signals at P<10⁻⁴ (Table 1). Two loci (PCSK9 and the TOMM40-APOE-APOC4 cluster; both for LDL-C) each had seven signals, four loci (APOB for LDL-C, LDLR for LDL-C, LCAT for HDL-C, and CETP for HDL-C) had three signals, and another four loci (APOB, APOC1, APOA5, and LPL; all for TG) had two signals. Among the 10 loci with two or more signals, all these signals led to an average 1.8-fold increase in the amount of phenotypic variance (R²) compared to that explained by the strongest signals alone (See Method) in African Americans. Among these 34 signals, 15 were represented by less common (0.005≤MAF<0.05, n = 11) or rare (MAF<0.005, n = 4) variants. In addition, 15 signals at eight loci were African American-specific. If we only include SNPs that meet a locus-specific P-value threshold based on the number of genotyped SNPs (Table S2), LPL for TG and APOB for both TG and LDL each had one signal, and the seven loci with multiple signals still showed an average of 1.8-fold increase in the explained phenotypic variance.

**Tab. 1. Lipid loci with multiple and population-specific signals in African Americans.**

The seven signals at PCSK9 in African Americans included six nonsense or nonsynonymous variants previously shown to associate with LDL-C levels and to affect PCSK9 expression or function [20]–[22], along with an unreported intronic variant (Table 1). The strongest signals were a nonsense variant rs28362286 (C679X, Figure 1A) and a nonsynonymous variant rs28362263 (A443T, Figure 1B), which showed no reduction of association evidence when conditioned on C679X. Conditional analysis on both C679X and A443T yielded a third signal at rs28362261 (N425S, Figure 1C); and further conditional analyses successively implicated rs67608943 (Y142X, Figure 1D), rs72646508 (L253F, Figure 1E), and an intronic variant rs11800243 (Figure 1F). The seventh signal, which did not reach the P_conditional<10⁻⁴ threshold, was represented by the nonsynonymous variant rs11591147 (R46L, Figure 1G) that exhibited the strongest and directionally consistent evidence of association with LDL-C in Europeans (P_initial = 2.8×10⁻³⁰, Table 2). The seven signals were weakly correlated with each other in African American individuals, and all pairwise LD r² values were less than 0.02. Among the seven PCSK9 signals, the top five were African American-specific, and six were either less common or rare in African Americans. The lead SNP C679X accounted for 1.3% of the explained LDL-C phenotypic variance and the seven signals together explained 3.6% of the phenotypic variance in African Americans. PCSK9 exhibited two signals in Europeans (R46L and rs2495477, Table 2), but no SNP reached P_initial<10⁻⁴ in East Asians.

LDL-C locus <i>PCSK9</i> exhibited seven signals in African Americans. — **Fig. 1. LDL-C locus *PCSK9* exhibited seven signals in African Americans.**

**Tab. 2. Lipid loci with multiple signals in Europeans.**

At the TOMM40-APOE-APOC4 cluster, the seven signals in African Americans explained 6.6% of the LDL-C phenotypic variance compared to 4.1% explained by the strongest signal R176C, which had reported functional effects [23] (Table 1, Figure S1). These seven signals were not entirely independent of one another. The fourth signal, rs157588, showed association with LDL-C (P = 2.0×10⁻⁷) only after conditioning on the top three signals, but not in the original unconditioned association analysis (P = 0.72). The trait-decreasing allele (G allele: freq = 0.176) of rs157588 was present on haplotypes containing the trait-increasing allele of the third signal rs1038026 (A allele: freq = 0.351), thus the association of the fourth signal increased in significance after accounting for linkage disequilibrium (r²/D′ = 0.35/0.92) with the third signal at the same locus. Haplotype analysis revealed that compared to the reference A-A (increasing-increasing) haplotype, the G-G (decreasing-decreasing) haplotype only displayed modest association with LDL-C (P = 7.5×10⁻³), but the A–G (rs1038026 increasing -⁠ rs157588 decreasing) haplotype showed significant association with decreased level of LDL-C (P = 1.5×10⁻¹⁰) (Table S5). In Europeans (Table 2) and East Asians (Table 3), three and two signals were identified at TOMM40-APOE-APOC4, respectively. The known functional variant R176C exhibited the strongest evidence of association across the three ancestry groups, with effect sizes of −0.536, −0.505, and −0.411 mmol/L in individuals of African American, European, and East Asian ancestry, respectively (Table 1). However, another APOE variant rs429358 (C130R), that together with R176C, defines the three major isoforms of APOE (ε2, ε3, and ε4) [7], [24], was not successfully genotyped, therefore the LDL-C association with either C130R or the APOE haplotype was unavailable in this study.

**Tab. 3. Lipid loci with multiple signals in East Asians.**

In Europeans, 21 signals at nine of the 31 loci exhibited multiple signals for at least one of the three lipid traits at P<10⁻⁴ (Table 2). Three loci (APOA5 for TG, TOMM40-APOE-APOC4 cluster for LDL-C, and CETP for HDL-C) each had three signals while another six loci (PCSK9 for LDL-C, GCKR for TG, LIPC for HDL-C, APOB for LDL-C, and LPL for both TG and HDL-C) each had two signals. At the nine loci that had two or more signals, all association signals resulted in an average of 1.3-fold increase in the explained phenotypic variance compared to the strongest signals alone across loci. At PCSK9, rs11591147 (R46L) exhibited the strongest evidence of association in Europeans. As reported above, R46L also represented the seventh signal in African Americans. R46L accounted for 1.2% of the total variation in LDL-C levels in Europeans compared the 0.16% in African Americans. This SNP was not variable in the 1000 Genomes Project ASN samples (East Asian ancestry) and the >9,000 East Asian individuals in this study.

In East Asians, we observed three signals at the TG locus APOA5, and two signals at three loci including TOMM40-APOE-APOC4 cluster for LDL-C, CETP for HDL-C, and ABO for LDL-C (Table 3). At the four loci that exhibited multiple signals, all the association signals increased the explained phenotypic variance by an average of 1.3-fold compared to the strongest signal across loci. The second signal at APOA5 was the nonsynonymous variant G185C previously reported to affect the protein function [25]. Although G185C was not unique to East Asians, the frequency was very low in African Americans (MAF = 0.002, P = 0.028) and Europeans (MAF = 0.0003, P = 0.23), and the low allele frequency meant that this study had less than 5% statistical power to detect the association in these groups.

At APOA5, which exhibited multiple signals in all three populations (Table 1, Table 2, Table 3), the strongest TG-associated SNPs differed and were not in high LD (r²<0.8) with each other in any of the ancestry groups. In African Americans, the two signals S19W (MAF = 0.058, P = 8.4×10⁻¹⁵) and rs79624460 (MAF = 0.083, P = 4.8×10⁻¹²), showed no evidence of significant association in East Asians (Table 1), likely due to the low allele frequency and the limited power (∼10%) to detect the association. The three signals at APOA5 in East Asians were only modestly associated with TG in African Americans (all P>10⁻³, Table 3). The SNP LD r² values between the African American and East Asian signals were less than 0.02 in both populations, suggesting that they represent distinct APOA5 signals in the two ancestry groups. In addition, the APOA5 signal rs3741298 (P = 9.7×10⁻⁴⁴, MAF = 0.222) in Europeans exhibited evidence of association with TG in African Americans (P = 9.8×10⁻⁵, MAF = 0.327) and East Asians (P = 1.2×10⁻²⁰, MAF = 0.357), but the significance levels of the association with rs3741298 were substantially attenuated by conditioning on the strongest signals S19W in African Americans (P = 0.10) and rs651821 in East Asians (P = 0.88). In Europeans, the associations with rs3741298 were partially removed when conditioning on S19W and rs651821 (P_conditional = 1.7×10⁻²⁸ and 3.1×10⁻¹⁷, respectively). The European signal rs3741298 was moderately correlated with the African American signal S19W (LD r² = 0.21 and 0.10 in the 1000 Genomes Project EUR samples (European ancestry) and in PAGE African American samples, respectively), and with the East Asian signal rs651821 (LD r² = 0.31 and 0.28 in 1000 Genomes Project EUR and ASN samples, respectively). Notably, the effect sizes of the two reported functional variants S19W [26] and G185C [25] at APOA5 were similar across the three groups (S19W, African American: 0.136; East Asian: 0.136; European: 0.121 and G185C, African American: 0.204; East Asian: 0.201; European: 0.269 mmol/L in log_e scale) despite the limited power to detect significant evidence of association at low allele frequencies. These findings support the hypothesis that causative variants may have a similar genetic impact on trait variation across populations if not influenced by hidden gene-gene or gene-environment interactions [27]. We also observed that the second European signal rs75919952 exhibited nominal evidence of association (P _initial = 0.018, MAF = 0.041), but was not associated with TG in the other two groups (Table 2). The lack of association may be due to insufficient power (15% and 55% in African Americans and East Asians, respectively; assuming α = 0.05) corresponding to the lower allele frequency (MAF = 0.012) in African Americans, the smaller sample sizes in both populations, or underlying interactions.

Trans-ethnic high-density genotyping narrowed the region of association signals

We next examined whether trans-ethnic meta-analysis or comparison across ancestries would refine the association signals by narrowing the genomic regions where functional variants might be expected to reside. The trans-ethnic analysis allowed the refinement of association signals at loci of GCKR, PPP1R3B, ABO, LCAT, and ABCA1 (Table 4, Table S3A–S3C). The signal at GCKR was localized to the reported functional variant P446L [28] due to the limited LD in African Americans (Figure S2A–S2D). Notably, there were seven and six variants in high LD (r²>0.8) with P446L in the 1000 Genomes Project ASN and EUR samples, but no SNP with LD r²>0.8 in African American individuals. At the signal ∼200 kb from the PPP1R3B gene for which no functional regulatory variant(s) have been reported, the association signal was narrowed from 4 SNPs spanning 36 kb (P<10⁻⁴) in Europeans to two highly correlated SNPs located 1 kb apart in African Americans (rs6601299, P = 8.0×10⁻⁸ and rs4841132, P = 2.9×10⁻⁷; LD r²>0.94) (Figure 2). The lead SNP rs6601299 was in high LD with 11 variants in the 1000 Genomes Project EUR samples but only highly correlated with two and one variant in the 1000 Genomes Project AFR samples (West African ancestry) and PAGE African American individuals, respectively. At the ABO locus, trans-ethnic meta-analysis revealed six SNPs exhibiting stronger evidence of association (P<1.1×10⁻¹¹) with LDL-C compared to other variants in the same region (P>2.3×10⁻⁷) (Figure S3A–S3D). At the locus LCAT for HDL-C, the association signals spanned ∼800 kb, ∼360 kb, and ∼360 kb in Europeans, East Asians, and African Americans, with a ∼50 kb overlapping region. Trans-ethnic meta-analysis of all samples localized the signal to four variants spanning this 50 kb region (Figure S4A–S4D). At HDL-C locus ABCA1, the reported GWAS index SNP rs1883025 consistently showed the strongest association within each of the three ancestry groups that we examined, but the significance level of the association was similar to those of the nearby SNPs. Trans-ethnic meta-analysis refined the signal by revealing that rs1883025 (P = 4.3×10⁻¹⁷) and rs2575876 (P = 1.8×10⁻¹⁵) displayed much stronger association than the neighboring SNPs (P>8.4×10⁻¹⁰) (Figure S5A–S5D).

Trans-ethnic high-density genotyping narrows the association signal at the HDL-C locus <i>PPP1R3B</i>. — **Fig. 2. Trans-ethnic high-density genotyping narrows the association signal at the HDL-C locus *PPP1R3B*.**

**Tab. 4. Trans-ethnic fine-mapping narrowed the association signals.**

Reported functional variants were frequently the most strongly associated ones at a signal

Among loci associated with at least one lipid trait (P<10⁻⁴), at least 27 variants at 15 loci have been previously reported [18], [22], [23], [25], [26], [28]–[47] to functionally influence gene expression or protein function in vitro (Table 5). Among the 27 variants, 17 are present on the Metabochip and two are well-represented by perfect proxies in complete LD (r² = 1) based on the 1000 Genomes Project EUR data. Of the 19 reported functional variants, 14 (74%) exhibited the strongest association P-value among all SNPs at that signal in at least one population. In addition, two more reported functional variants (APOB-rs7575840, P = 7.0×10⁻¹⁷ and LPL-rs328, P = 2.3×10⁻¹¹) were in high LD (r²>0.95) with the most strongly associated variants and showed similar evidence of association (APOB-rs934198, P = 3.7×10⁻¹⁷; LPL-rs1803924, P = 1.1×10⁻¹¹). If we include these two variants, then 16 of the 19 (84%) reported functional variants displayed the strongest association P-value at the primary, secondary, or successive signals. The remaining three reported functional variants: LDLR-rs688 (N591N), LPL-rs1801177 (D9N), and HMGCR-rs3761740 (911C>A), were poorly tagged (LD r²<0.2) by the strongest variants in our data. Additional functional variants may exist at these loci that have not yet been reported to change gene expression/protein function or that were not identified in our literature search. For example, P2739L and P145S that represented the two signals at APOB (Table 1) were predicted by PolyPhen [48] to be ‘probably damaging’ with a score of ‘1’, although their functional roles were unclear.

Reported functional variants exhibited the strongest association at a signal (<i>P</i><10<sup>−4</sup>). — **Tab. 5. Reported functional variants exhibited the strongest association at a signal (P<10⁻⁴).**

Among the 16 reported functional variants and proxies that exhibited the strongest association P-value at a signal (Table 5), R176C at APOE was strongest in all three populations and GCKR L446P was identified in both African Americans and Europeans. The remaining 14 variants showed the strongest associations in only one of the populations, including 10 in African Americans, three in East Asians, and one in Europeans. Five of the 10 variants in African Americans were at the PCSK9 locus. Furthermore, nine of the 16 variants represented the strongest signal at a given locus, three for a 2nd signal, and four for the 3rd or additional signals. These functional variants covered a wide allele frequency spectrum (MAF: 0.003–0.481), including five less common or rare variants observed only in African Americans.

Discussion

This study evaluated densely spaced SNPs at 58 lipid loci across three ancestrally diverse populations. The results support evidence that allelic heterogeneity is a frequent feature of polygenic traits [5], [49] and extend the findings to non-European populations, especially to African ancestry populations that have high levels of haplotype diversity. The results also provide strong evidence that fine mapping at GWAS loci can identify population-specific signals. Despite comparable sample sizes, we identified more signals per locus and more signals overall in African Americans (34 signals at 10 loci) compared to Europeans (21 signals at nine loci) and East Asians (nine signals at four loci), and 15 of the 34 signals identified in African Americans were population-specific (Table 1, Table 2, Table 3). These observations may reflect the larger number of SNPs genotyped in African Americans (Table S2), variation across populations subject to natural selection during human evolution [14], or genetic drift [50]. Due to the varied number of signals per locus, different associated markers, and different effect sizes, the phenotypic variance explained differs across populations [51]–[53]. Sampling variability, epistasis, and gene-environment interactions may cause over -⁠ or under-estimation of the proportion of explained phenotypic variance. In this study, we also observed that many population-specific signals, including those at PCSK9 and APOA5, are largely confirmatory [20], [22], [54]; however, the association evidence at other signals, in particular the additional signals at APOE, LDLR, and APOC1 identified by the conditional analyses, requires replication in future studies.

At PCSK9, the strongest signal C679X identified in African Americans is population-specific and showed substantially stronger evidence of association with LDL-C (P = 4.1×10⁻²²) compared to the GWAS index SNP rs2479409 [5] (P = 0.12) and the most strongly associated SNP R46L identified via fine-mapping [7] (P = 2.3×10⁻³), both of which were previously reported in Europeans. The proportion of phenotypic variance explained in African Americans increased from 0.16% by the GWAS index SNP to 1.3% by the Metabochip signal C679X, and all variants at the locus together explained 3.6% of the total variation in LDL-C, providing evidence that heritability at identified loci may be underestimated by GWAS [7]. A limitation of these variance estimates is that calculations included the SNPs based simply on their significant association P values rather than the variants with biological function, which could over-estimate effects due to the winner's curse.

Results across the genotyped loci demonstrated that the majority of signals were represented by common variants, yet high-density genotyping also identified less common and rare variants associated with lipid traits. At PCSK9, the MAFs of six out of the seven signals were <0.05 in African Americans. These signals, along with other low frequency variants identified at APOE, LDLR, LCAT, APOB, APOC1, and LPL provide evidence of the substantial contribution of low frequency genetic variants to the variance of lipid traits [6]. Other variants, some with very low allele frequency, may exist at these loci, suggesting that future sequencing studies may identify additional functional variants that influence lipid variation.

Sequential conditional analyses provided further insight into the genetic architecture of the established lipid loci by explaining additional phenotypic variation and revealing complex patterns of association. We observed loci at which signals were not independent of each other, but partially correlated based on moderate LD estimates and changes of association statistics before and after accounting for other signals. For these dependent signals, such as those at TOMM4-APOE-APOC4, the significance of residual association would increase when trait-increasing alleles were present on opposite haplotypes and decrease when trait-increasing alleles were on the same haplotype. Other signals that appeared to be independent on the basis of low pairwise LD and unchanged association evidence after conditional analysis may still be partially tagging an un-typed, yet influential, variant [55]–[57]. Therefore, deeper sequencing that identifies all variants at a locus will be required to characterize more fully the allelic heterogeneity and the patterns of association.

One of the major goals of high-density genotyping is to aid in identification of the functional variants by recognizing the most compelling candidate variants for experimental study. Because of the diverse LD structure across populations, particularly in terms of the limited LD extent in African ancestry populations, trans-ethnic fine-mapping of GWAS loci can narrow the region where functional variants are most likely to reside. This study was able to narrow the association signals at five lipid loci, based on the much smaller subsets of most strongly associated variants located in smaller regions. One signal was localized to a reported causal variant (GCKR-P446L) [28] and another to an uncharacterized nonsynonymous variant (SLC12A4-E4G near LCAT). These findings demonstrate that trans-ethnic association analyses can increase the resolution of fine-mapping by enlarging the haplotypic diversity of samples with different ancestries and consequently, narrowing the sets of candidate functional variants [58], [59]. The previously described functional variants at LCAT [44] and ABCA1 [42], [43], which are not present on the Metabochip, were physically located 22 kb and >43 kb away from the narrowed association signals observed in this study (Table 4).

Refining signals by trans-ethnic meta-analysis largely relies not only on the existence of distinct LD patterns across ancestry groups but also on shared functional variants. If functional variants are shared across populations, as observed with GCKR-P446L, performing trans-ethnic meta-analysis and integrating LD information across different populations may refine the signal. On the contrary, if trait variation is influenced by distinct functional variants across populations, as our data suggest for APOA5 (Figure S6A–S6D), the lead SNPs produced by meta-analysis would be influenced by the sample size, magnitude of genetic effects, and allele frequencies. Similarly, in the case of population-specific functional variants, such as those at PCSK9, the results from meta-analysis would reflect the association in one particular population rather than the combined effect across populations if signals unique to this population drive the results. Therefore, accurate assessment of allelic variability is needed on a population-by-population and locus-by-locus basis.

Although genotype imputation has become a standard practice to increase genome coverage in GWAS by predicting the genotypes at SNPs that are not directly genotyped, imputation accuracy tends to be lower for rare variants owing to the lower degree of LD and the more challenging haplotype reconstruction [60]. In addition, African American samples pose a challenge for imputation due to their varying degree of admixture [61]. A major strength of our study is that all variants we tested for association were directly genotyped using the Metabochip, which was designed to provide a high-density coverage for both overall SNPs and low frequency variants concentrated around GWAS-identified loci and/or signals [9], [10]. This approach increases the reliability of our association results overall, but in particular the variants with low allele frequencies.

In conclusion, we performed a large-scale trans-ethnic fine-mapping study to investigate the established lipid loci using the Metabochip high-density genotyping array and focusing on diverse groups including African Americans, East Asians, and Europeans. Our results highlight the value of high-density genotyping in diverse populations to identify a wider spectrum of susceptibility variants at established loci, both in terms of additional signals and in terms of population-specific and/or potentially functional variants. The additional signals revealed through the sequential conditional analyses lead to a 1.3 -⁠ to 1.8-fold increase in the explained phenotypic variance across the different populations. In addition, integrating diverse LD patterns across diverse ancestry groups allows for the refinement of association signals. Lastly, our findings that 74% of the reported functional variants exhibited the strongest association at these densely typed signals suggest that at loci and signals where functional variants are unknown, the variants with strongest association may be good candidates for functional assessment.

Materials and Methods

Study populations and phenotypes

The 6,832 African Americans studied are comprised of individuals from the Atherosclerosis Risk in Communities Study (ARIC) [62], the Multiethnic Cohort Study (MEC) [63], and the Women's Health Initiative (WHI) [64], [65] that are part of Population Architecture using Genomics and Epidemiology (PAGE) consortium [66] and from Hypertensive Genetic Epidemiology Network (HyperGEN) [67]. The 9,449 East Asian samples are comprised of 1,716 Filipinos from the Cebu Longitudinal Health and Nutrition Survey (CLHNS) [68] and 7,733 Chinese from Taiwan-Metabochip Study for Cardiovascular Disease (TAICHI). The 10,829 European samples are comprised of Finnish and Norwegian individuals; the Finns are from the Finland-United States Investigation of NIDDM Genetics (FUSION), Dehko 2D 2007 (D2D2007), Diabetes Prevention Study (DPS), Dose-Responses to Exercise Training (DR's EXTRA), and Metabolic Syndrome in Men (METSIM) [69], [70], and the Norwegians were from the cohorts of Nord-Trøndelag Health Study (HUNT 2) and the Tromsø Study (TROMSO) [71], [72].

All study protocols were approved by Institutional Review Boards at their respective sites. Brief descriptions of the studies are provided in the Text S1. General characteristics and measurements of TG, HDL-C, and LDL-C in each cohort are summarized in Table S1. Values of triglycerides were natural log transformed to approximate normality in each study sample separately.

Genotyping

We genotyped all study samples with the Metabochip according to the manufacturer's protocol (Illumina, San Diego, CA, USA). Table S1 summarizes the quality control criteria of genotyping, including call rate, sample success rate, Hardy-Weinberg equilibrium, and MAF that varied across studies.

Statistical analyses

We applied multiple linear regression models and assumed an additive mode of inheritance to test for association between genotypes and HDL-C, LDL-C, or log-transformed triglycerides. We performed each test of association separately in each of the 11 groups (Table S1) prior to meta-analysis. We constructed principal components (PCs) using the software EIGENSOFT. We used age and sex as covariates in each individual cohort; other cohort-specific covariates including age², enrollment site, socioeconomic status, and principal components varied across studies (Table S1). The European samples include type 2 diabetes (T2D) cases and unaffected controls; to avoid confounding due to T2D status, samples were analyzed separately as Finnish T2D patients, Finnish unaffected individuals, Norwegian T2D patients, and Norwegian unaffected individuals.

We first conducted the meta-analysis within the African Americans, East Asians, and Europeans separately. We then performed combined trans-ethnic meta-analyses by combining the statistics of each the 11 participating groups to assess the association with the SNPs at the 58 lipids loci.

At loci that exhibited evidence of association at P<10⁻⁴, we next performed a series of sequential conditional analyses by adding the most strongly associated SNP into the regression model as a covariate and testing all remaining regional SNPs for association. We conducted a set of sequential conditional analyses until the strongest SNP showed a conditional P value>10⁻⁴ and had no annotation or literature evidence that suggested a functional role.

For single SNP analyses, we applied PLINK (http://pngu.mgh.harvard.edu/~purcell/plink/) [73] for population-based studies. We used the R package GWAF [74] for the family-based study of HyperGEN. We applied an inverse variance-weighted fixed-effect meta-analysis implemented in METAL [75].

Unless otherwise noted, linkage disequilibrium estimates were obtained from the 1000 Genomes Project November 2010 release. SNP positions correspond to hg18.

We performed haplotype analysis at LDL-C locus TOMM40-APOE-APOC4 in 5,593 unrelated African Americans from the PAGE consortium, using the ‘haplo.stat’ R package. Haplotypes and haplotype frequencies were estimated using the R function ‘haplo.em’. The association between haplotypes and LDL-C was assessed using the R function ‘haplo.glm’. An additive model was assumed, in which the regression coefficient β represents the expected change in LDL-C level with each additional copy of the specific haplotype compared with the reference haplotype, which was set as the A-A (trait increasing-increasing) haplotype.

We created the regional association plots using LocusZoom [76]. To plot the association results in Europeans and East Asians, we used the LocusZoom-implemented LD estimates from the 1000 Genomes Project (June 2010) CEU and CHB+JPT samples, whose LD structures are similar to our samples with European and East Asian ancestries. We applied the user-supplied LD calculated from the genotype data of the PAGE African American samples to plot the regional association in African Americans [9], because the LD patterns may vary from any pre-computed LD sources implemented in LocusZoom.

We evaluated the proportion of variance explained by a single SNP or any given locus by including the SNP or a set of SNPs into a linear regression model with all covariates used in association analysis and calculating the R² for the full model. We subtracted the variance explained by a basic model in which only covariates were included from the variance we obtained from the full model. We performed these analyses using SAS version 9.2 (SAS Institute, Cary, NC, USA).

Supporting Information

Zdroje

1. KathiresanS, MelanderO, GuiducciC, SurtiA, BurttNP, et al. (2008) Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nat Genet 40 : 189–197.

2. WillerCJ, SannaS, JacksonAU, ScuteriA, BonnycastleLL, et al. (2008) Newly identified loci that influence lipid concentrations and risk of coronary artery disease. Nat Genet 40 : 161–169.

3. AulchenkoYS, RipattiS, LindqvistI, BoomsmaD, HeidIM, et al. (2009) Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts. Nat Genet 41 : 47–55.

4. KathiresanS, WillerCJ, PelosoGM, DemissieS, MusunuruK, et al. (2009) Common variants at 30 loci contribute to polygenic dyslipidemia. Nat Genet 41 : 56–65.

5. TeslovichTM, MusunuruK, SmithAV, EdmondsonAC, StylianouIM, et al. (2010) Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466 : 707–713.

6. McCarthyMI, AbecasisGR, CardonLR, GoldsteinDB, LittleJ, et al. (2008) Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet 9 : 356–369.

7. SannaS, LiB, MulasA, SidoreC, KangHM, et al. (2011) Fine mapping of five Loci associated with low-density lipoprotein cholesterol detects variants that double the explained heritability. PLoS Genet 7: e1002198 doi:10.1371/journal.pgen.1002198

8. HarituniansT, JonesMR, McGovernDP, ShihDQ, BarrettRJ, et al. (2011) Variants in ZNF365 isoform D are associated with Crohn's disease. Gut 60 : 1060–1067.

9. BuyskeS, WuY, CartyCL, ChengI, AssimesTL, et al. (2012) Evaluation of the Metabochip Genotyping Array in African Americans and Implications for Fine Mapping of GWAS-Identified Loci: The PAGE Study. PLoS ONE 7: e35651 doi:10.1371/journal.pone.0035651

10. VoightBF, KangHM, DingJ, PalmerCD, SidoreC, et al. (2012) The Metabochip, a Custom Genotyping Array for Genetic Studies of Metabolic, Cardiovascular, and Anthropometric Traits. PLoS Genet 8: e1002793 doi:10.1371/journal.pgen.1002793

11. PedenJF, FarrallM (2011) Thirty-five common variants for coronary artery disease: the fruits of much collaborative labour. Hum Mol Genet 20: R198–205.

12. VoightBF, ScottLJ, SteinthorsdottirV, MorrisAP, DinaC, et al. (2010) Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis. Nat Genet 42 : 579–589.

13. SimX, OngRT, SuoC, TayWT, LiuJ, et al. (2011) Transferability of type 2 diabetes implicated Loci in multi-ethnic cohorts from southeast Asia. PLoS Genet 7: e1001363 doi:10.1371/journal.pgen.1001363

14. The International HapMap Consortium (2005) A haplotype map of the human genome. Nature 437 : 1299–1320.

15. HelgasonA, PalssonS, ThorleifssonG, GrantSF, EmilssonV, et al. (2007) Refining the impact of TCF7L2 gene variants on type 2 diabetes and adaptive evolution. Nat Genet 39 : 218–225.

16. MusunuruK, RomaineSP, LettreG, WilsonJG, VolcikKA, et al. (2012) Multi-ethnic analysis of lipid-associated loci: the NHLBI CARe project. PLoS ONE 7: e36473 doi:10.1371/journal.pone.0036473

17. DumitrescuL, CartyCL, TaylorK, SchumacherFR, HindorffLA, et al. (2011) Genetic Determinants of Lipid Traits in Diverse Populations from the Population Architecture using Genomics and Epidemiology (PAGE) Study. PLoS Genet 7: e1002138 doi:10.1371/journal.pgen.1002138

18. MusunuruK, StrongA, Frank-KamenetskyM, LeeNE, AhfeldtT, et al. (2010) From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature 466 : 714–719.

19. TeoYY, SmallKS, KwiatkowskiDP (2010) Methodological challenges of genome-wide association analysis in Africa. Nat Rev Genet 11 : 149–160.

20. CohenJ, PertsemlidisA, KotowskiIK, GrahamR, GarciaCK, et al. (2005) Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9. Nat Genet 37 : 161–165.

21. KotowskiIK, PertsemlidisA, LukeA, CooperRS, VegaGL, et al. (2006) A spectrum of PCSK9 alleles contributes to plasma levels of low-density lipoprotein cholesterol. Am J Hum Genet 78 : 410–422.

22. ZhaoZ, Tuakli-WosornuY, LagaceTA, KinchL, GrishinNV, et al. (2006) Molecular characterization of loss-of-function mutations in PCSK9 and identification of a compound heterozygote. Am J Hum Genet 79 : 514–523.

23. RallSCJr, WeisgraberKH, InnerarityTL, MahleyRW (1982) Structural basis for receptor binding heterogeneity of apolipoprotein E from type III hyperlipoproteinemic subjects. Proc Natl Acad Sci U S A 79 : 4696–4700.

24. WardH, MitrouPN, BowmanR, LubenR, WarehamNJ, et al. (2009) APOE genotype, lipids, and coronary heart disease risk: a prospective population study. Arch Intern Med 169 : 1424–1429.

25. HuangYJ, LinYL, ChiangCI, YenCT, LinSW, et al. (2012) Functional importance of apolipoprotein A5 185G in the activation of lipoprotein lipase. Clin Chim Acta 413 : 246–250.

26. TalmudPJ, PalmenJ, PuttW, LinsL, HumphriesSE (2005) Determination of the functionality of common APOA5 polymorphisms. J Biol Chem 280 : 28215–28220.

27. McCarthyMI (2008) Casting a wider net for diabetes susceptibility genes. Nat Genet 40 : 1039–1040.

28. ReesMG, WincovitchS, SchultzJ, WaterstradtR, BeerNL, et al. (2012) Cellular characterisation of the GCKR P446L variant associated with type 2 diabetes risk. Diabetologia 55 : 114–122.

29. BenjannetS, RhaindsD, HamelinJ, NassouryN, SeidahNG (2006) The proprotein convertase (PC) PCSK9 is inactivated by furin and/or PC5/6A: functional consequences of natural mutations and post-translational modifications. J Biol Chem 281 : 30561–30572.

30. FasanoT, SunXM, PatelDD, SoutarAK (2009) Degradation of LDLR protein mediated by ‘gain of function’ PCSK9 mutants in normal and ARH cells. Atherosclerosis 203 : 166–171.

31. SullivanPM, MezdourH, QuarfordtSH, MaedaN (1998) Type III hyperlipoproteinemia and spontaneous atherosclerosis in mice resulting from gene replacement of mouse Apoe with human Apoe*2. J Clin Invest 102 : 130–135.

32. PalmenJ, SmithAJ, DorfmeisterB, PuttW, HumphriesSE, et al. (2008) The functional interaction on in vitro gene expression of APOA5 SNPs, defining haplotype APOA52, and their paradoxical association with plasma triglyceride but not plasma apoAV levels. Biochim Biophys Acta 1782 : 447–452.

33. ThompsonJF, LloydDB, LiraME, MilosPM (2004) Cholesteryl ester transfer protein promoter single-nucleotide polymorphisms in Sp1-binding sites affect transcription and are associated with high-density lipoprotein cholesterol. Clin Genet 66 : 223–228.

34. ZambonA, DeebSS, PaulettoP, CrepaldiG, BrunzellJD (2003) Hepatic lipase: a marker for cardiovascular disease risk and response to therapy. Curr Opin Lipidol 14 : 179–189.

35. HaasBE, Weissglas-VolkovD, Aguilar-SalinasCA, NikkolaE, VergnesL, et al. (2011) Evidence of how rs7575840 influences apolipoprotein B-containing lipid particles. Arterioscler Thromb Vasc Biol 31 : 1201–1207.

36. NiermanMC, RipJ, KuivenhovenJA, SakaiN, KasteleinJJ, et al. (2007) Enhanced apoB48 metabolism in lipoprotein lipase X447 homozygotes. Atherosclerosis 194 : 446–451.

37. ZhuH, TuckerHM, GrearKE, SimpsonJF, ManningAK, et al. (2007) A common polymorphism decreases low-density lipoprotein receptor exon 12 splicing efficiency and associates with increased cholesterol. Hum Mol Genet 16 : 1765–1772.

38. MaillyF, TugrulY, ReymerPW, BruinT, SeedM, et al. (1995) A common variant in the gene for lipoprotein lipase (Asp9→Asn). Functional implications and prevalence in normal and hyperlipidemic subjects. Arterioscler Thromb Vasc Biol 15 : 468–478.

39. KellerL, MurphyC, WangHX, FratiglioniL, OlinM, et al. (2010) A functional polymorphism in the HMGCR promoter affects transcriptional activity but not the risk for Alzheimer disease in Swedish populations. Brain Res 1344 : 185–191.

40. SmithAJ, AhmedF, NairD, WhittallR, WangD, et al. (2007) A functional mutation in the LDLR promoter (-139C>G) in a patient with familial hypercholesterolemia. Eur J Hum Genet 15 : 1186–1189.

41. ReymerPW, GagneE, GroenemeyerBE, ZhangH, ForsythI, et al. (1995) A lipoprotein lipase mutation (Asn291Ser) is associated with reduced HDL cholesterol levels in premature atherosclerosis. Nat Genet 10 : 28–34.

42. Acuna-AlonzoV, Flores-DorantesT, KruitJK, Villarreal-MolinaT, Arellano-CamposO, et al. (2010) A functional ABCA1 gene variant is associated with low HDL-cholesterol levels and shows evidence of positive selection in Native Americans. Hum Mol Genet 19 : 2877–2885.

43. KyriakouT, PontefractDE, ViturroE, HodgkinsonCP, LaxtonRC, et al. (2007) Functional polymorphism in ABCA1 influences age of symptom onset in coronary artery disease patients. Hum Mol Genet 16 : 1412–1422.

44. TaramelliR, PontoglioM, CandianiG, OttolenghiS, DieplingerH, et al. (1990) Lecithin cholesterol acyl transferase deficiency: molecular analysis of a mutated allele. Hum Genet 85 : 195–199.

45. AouizeratBE, EnglerMB, NatanzonY, KulkarniM, SongJ, et al. (2006) Genetic variation of PLTP modulates lipoprotein profiles in hypoalphalipoproteinemia. J Lipid Res 47 : 787–793.

46. EdmondsonAC, BrownRJ, KathiresanS, CupplesLA, DemissieS, et al. (2009) Loss-of-function variants in endothelial lipase are a cause of elevated HDL cholesterol in humans. J Clin Invest 119 : 1042–1050.

47. KhetarpalSA, EdmondsonAC, RaghavanA, NeeliH, JinW, et al. (2011) Mining the LIPG allelic spectrum reveals the contribution of rare and common regulatory variants to HDL cholesterol. PLoS Genet 7: e1002393 doi:10.1371/journal.pgen.1002393

48. AdzhubeiIA, SchmidtS, PeshkinL, RamenskyVE, GerasimovaA, et al. (2010) A method and server for predicting damaging missense mutations. Nat Methods 7 : 248–249.

49. Lango AllenH, EstradaK, LettreG, BerndtSI, WeedonMN, et al. (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467 : 832–838.

50. HuangL, JakobssonM, PembertonTJ, IbrahimM, NyamboT, et al. (2011) Haplotype variation and genotype imputation in African populations. Genet Epidemiol 35 : 766–780.

51. FriedlanderY, KarkJD, SteinY (1986) Heterogeneity in multifactorial inheritance of plasma lipids and lipoproteins in ethnically diverse families in Jerusalem. Genet Epidemiol 3 : 95–112.

52. BeekmanM, HeijmansBT, MartinNG, PedersenNL, WhitfieldJB, et al. (2002) Heritabilities of apolipoprotein and lipid levels in three countries. Twin Res 5 : 87–97.

53. IliadouA, SniederH, WangX, TreiberFA, DavisCL (2005) Heritabilities of lipids in young European American and African American twins. Twin Res Hum Genet 8 : 492–498.

54. KaoJT, WenHC, ChienKL, HsuHC, LinSW (2003) A novel genetic variant in the apolipoprotein A5 gene is associated with hypertriglyceridemia. Hum Mol Genet 12 : 2533–2539.

55. WoodAR, HernandezDG, NallsMA, YaghootkarH, GibbsJR, et al. (2011) Allelic heterogeneity and more detailed analyses of known loci explain additional phenotypic variation and reveal complex patterns of association. Hum Mol Genet 20 : 4082–4092.

56. TrynkaG, HuntKA, BockettNA, RomanosJ, MistryV, et al. (2011) Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease. Nat Genet 43 : 1193–1201.

57. SpencerC, HechterE, VukcevicD, DonnellyP (2011) Quantifying the underestimation of relative risks from genome-wide association studies. PLoS Genet 7: e1001337 doi:10.1371/journal.pgen.1001337

58. TeoYY, OngRT, SimX, TaiES, ChiaKS (2010) Identifying candidate causal variants via trans-population fine-mapping. Genet Epidemiol 34 : 653–664.

59. TeoYY, SimX (2010) Patterns of linkage disequilibrium in different populations: implications and opportunities for lipid-associated loci identified from genome-wide association studies. Curr Opin Lipidol 21 : 104–115.

60. LiuEY, BuyskeS, AragakiAK, PetersU, BoerwinkleE, et al. (2012) Genotype Imputation of MetabochipSNPs Using a Study-Specific Reference Panel of ∼4,000 Haplotypes in African Americans From the Women's Health Initiative. Genetic Epidemiology 36 : 107–117.

61. RosenbergNA, HuangL, JewettEM, SzpiechZA, JankovicI, et al. (2010) Genome-wide association studies in diverse populations. Nat Rev Genet 11 : 356–366.

62. The ARIC investigators (1989) The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. Am J Epidemiol 129 : 687–702.

63. KolonelLN, AltshulerD, HendersonBE (2004) The multiethnic cohort study: exploring genes, lifestyle and cancer risk. Nat Rev Cancer 4 : 519–527.

64. The Woman's Health Initiative Study Group (1998) Design of the Women's Health Initiative clinical trial and observational study. The Women's Health Initiative Study Group. Control Clin Trials 19 : 61–109.

65. AndersonGL, MansonJ, WallaceR, LundB, HallD, et al. (2003) Implementation of the Women's Health Initiative study design. Ann Epidemiol 13: S5–17.

66. MatiseTC, AmbiteJL, BuyskeS, CarlsonCS, ColeSA, et al. (2011) The Next PAGE in understanding complex traits: design for the analysis of Population Architecture Using Genetics and Epidemiology (PAGE) Study. Am J Epidemiol 174 : 849–859.

67. WilliamsRR, RaoDC, EllisonRC, ArnettDK, HeissG, et al. (2000) NHLBI family blood pressure program: methodology and recruitment in the HyperGEN network. Hypertension genetic epidemiology network. Ann Epidemiol 10 : 389–400.

68. AdairLS, PopkinBM, AkinJS, GuilkeyDK, GultianoS, et al. (2011) Cohort profile: the Cebu longitudinal health and nutrition survey. Int J Epidemiol 40 : 619–625.

69. ScottLJ, MohlkeKL, BonnycastleLL, WillerCJ, LiY, et al. (2007) A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316 : 1341–1345.

70. StancakovaA, JavorskyM, KuulasmaaT, HaffnerSM, KuusistoJ, et al. (2009) Changes in insulin sensitivity and insulin release in relation to glycemia and glucose tolerance in 6,414 Finnish men. Diabetes 58 : 1212–1221.

71. MidthjellK, KrugerO, HolmenJ, TverdalA, ClaudiT, et al. (1999) Rapid changes in the prevalence of obesity and known diabetes in an adult Norwegian population. The Nord-Trondelag Health Surveys: 1984–1986 and 1995–1997. Diabetes Care 22 : 1813–1820.

72. JosephJ, SvartbergJ, NjolstadI, SchirmerH (2010) Incidence of and risk factors for type-2 diabetes in a general population: the Tromso Study. Scand J Public Health 38 : 768–775.

73. PurcellS, NealeB, Todd-BrownK, ThomasL, FerreiraMA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81 : 559–575.

74. ChenMH, YangQ (2010) GWAF: an R package for genome-wide association analyses with family data. Bioinformatics 26 : 580–581.

75. WillerCJ, LiY, AbecasisGR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26 : 2190–2191.

76. PruimRJ, WelchRP, SannaS, TeslovichTM, ChinesPS, et al. (2010) LocusZoom: Regional visualization of genome-wide association scan results. Bioinformatics 26 : 2336–2337.