Variant level heritability estimates of type 2 diabetes in african americans

Play all audios:

ABSTRACT Type 2 diabetes (T2D) is caused by both genetic and environmental factors and is associated with an increased risk of cardiorenal complications and mortality. Though

disproportionately affected by the condition, African Americans (AA) are largely underrepresented in genetic studies of T2D, and few estimates of heritability have been calculated in this

race group. Using genome-wide association study (GWAS) data paired with phenotypic data from ~ 19,300 AA participants of the Reasons for Geographic and Racial Differences in Stroke (REGARDS)

study, Genetics of Hypertension Associated Treatments (GenHAT) study, and the Electronic Medical Records and Genomics (eMERGE) network, we estimated narrow-sense heritability using two

methods: Linkage-Disequilibrium Adjusted Kinships (LDAK) and Genome-Wide Complex Trait Analysis (GCTA). Study-level heritability estimates adjusting for age, sex, and genetic ancestry ranged

from 18% to 34% across both methods. Overall, the current study narrows the expected range for T2D heritability in this race group compared to prior estimates, while providing new insight

into the genetic basis of T2D in AAs for ongoing genetic discovery efforts. SIMILAR CONTENT BEING VIEWED BY OTHERS MULTI-ANCESTRY GENETIC STUDY OF TYPE 2 DIABETES HIGHLIGHTS THE POWER OF

DIVERSE POPULATIONS FOR DISCOVERY AND TRANSLATION Article 12 May 2022 VALIDITY OF EUROPEAN-CENTRIC CARDIOMETABOLIC POLYGENIC SCORES IN MULTI-ANCESTRY POPULATIONS Article Open access 05

January 2024 POPULATION-SPECIFIC AND TRANS-ANCESTRY GENOME-WIDE ANALYSES IDENTIFY DISTINCT AND SHARED GENETIC RISK LOCI FOR CORONARY ARTERY DISEASE Article 05 October 2020 INTRODUCTION

Diabetes mellitus is a heterogeneous group of metabolic and health conditions characterized by glucose dysregulation and defects in insulin secretion and/or insulin action1. Chronic

hyperglycemia has been associated with long-term damage and dysfunction of the kidneys, heart, and blood vessels2. Diabetes is a major risk factor for cardiovascular disease (CVD),

particularly coronary heart disease and stroke3. In the United States, type 2 diabetes (T2D) is the most common form of diabetes in adults, constituting > 90% of cases4 and is growing in

prevalence among adolescents5. T2D is more often associated with increased age6; however, the T2D epidemic can largely be attributed to a worldwide increase in obesity7. Since lifestyle

intervention focused on obesogenic behaviors is effective at preventing T2D8, identifying populations with increased genetic susceptibility could lower disease morbidity. It is

well-established that T2D is a complex disease, and the risk of developing T2D depends on environmental and genetic factors. Further supporting the genetic background of the disease is the

high concordance in monozygotic twins compared to dizygotic twins in familial studies9,10,11,12. While traditional twin studies have been considered the “gold standard” for measuring the

heritability of a trait, more recent literature has suggested that twin studies may overestimate heritability due to shared environment and non-additive genetic effects creating “phantom

heritability”13. While broad sense heritability of a trait is the proportion of phenotypic variation attributed to genetics, the narrow-sense heritability (h2) is the proportion attributable

to the additive gene effects14. These additive effects of variants underlying a trait is of particular importance because they constitute the genetic variation component transferred from

parent to offspring 15. With advances in genotyping technologies, the generation of genome-wide common genetic variant data has enabled approaches that provide an alternative to twin or

family heritability studies16,17. These methods estimate heritability in unrelated individuals via correlation between genetic and phenotypic sharing, similar to family-based studies.

However, instead of using the theoretic estimates of genetic sharing (i.e., Mendel’s Laws), single nucleotide polymorphism (SNP)-based heritability analyses allow for empirical estimates of

genetic sharing to be directly obtained from the genotype data17. Thus, by extending the models to utilize the contributions from all common genetic variants, these methods can detect

considerable shares of the additive genetic effect13. While previous h2 estimates of T2D and related clinical traits (e.g., fasting glucose, fasting insulin) have varied between 25% and 80%,

they have either excluded or underrepresented individuals of non-European ancestry, particularly African Americans (AAs)2,18,19,20,21,22. Importantly, known disparities exist in T2D, where

AAs have a higher prevalence of T2D, increased mortality rates, and an increased risk of T2D complications compared to individuals of European descent23,24. Further, there is a concern that

AAs and African ancestry groups may benefit less from genetic research, potentially exacerbating health disparities of common chronic conditions, such as T2D25. Given these trends,

additional heritability studies are warranted for these populations. For example, accurate heritability estimates are needed to understand the utility of polygenic risk scores (PRS) in

multi-ancestral populations with regard to the maximum trait variance expected to be explained by a PRS. In the present study, we seek to capture the T2D additive genetic variation (i.e.,

h2) from ~ 19,300 AA participants in the Reasons for Geographic and Racial Differences in Stroke (REGARDS) study, the Genetics of Hypertension Associated Treatments (GenHAT) study, and the

Electronic Medical Records and Genomics (eMERGE) network. Each study used an overlapping subset of 8.2 million imputed genetic variants to estimate the h2 using two common approaches,

Genome-Wide Complex Trait Analysis (GCTA) and Linkage-Disequilibrium Adjusted Kinships (LDAK), making this one of the most extensive studies to estimate T2D heritability in AAs. RESULTS

Descriptive statistics for 7957 AA T2D cases and 11,378 AA controls are presented in Table 1. On average, cases were slightly older (65 years versus 63 years for controls, 66 years versus 66

years for controls, and 68 years versus 67 years in controls for REGARDS, GenHAT, and eMERGE, respectively). Furthermore, T2D cases were more likely to be men in REGARDS (59%), but women in

GenHAT (61%) and eMERGE (65%). Study-level heritability estimates are provided in Table 2. The base model (Model 1) h2 estimates using LDAK were 19% in REGARDS, 19% in GenHAT, and 33% in

eMERGE. Similar trends were observed when estimates were calculated using GCTA, ranging from 21% in GenHAT to 31% in eMERGE. Upon age, sex, and genetic ancestry adjustment (Model 3), h2

estimates from LDAK were similar to Model 1 (18% in GenHAT, 18% in REGARDS, and 33% for eMERGE). In the fully-adjusted (Model 3) using GCTA, h2 estimates for REGARDS, GenHAT, and eMERGE were

24%, 21%, and 32%, respectively. In a sensitivity analysis, the study site was included as a covariate for the eMERGE cohort and results remained similar with and without adjustment.

Further, when adjusting for the AA-specific population prevalence of T2D (heritability liability, h2liab), estimates increased marginally across all models and methods (Supplemental Table

1). DISCUSSION African American populations continue to be underrepresented in genomic research. In the era of genomic medicine, this could unintentionally create new disparities in

prevention and prediction of T2D. By leveraging genome-wide SNP array data from unrelated participants belonging to well-characterized cohort studies the current study provides insight into

the additive genetic factors that contribute to the phenotypic variance of T2D in this non-European population. This type of common genetic variant heritability study is less prone to the

confounding effects of shared environment observed in family studies. Therefore, the results presented may give a more precise estimation of the genetic contribution to T2D which may inform

how genome-wide data is used in the future for the treatment and prevention of the condition. We estimated the h2 of T2D using two well-characterized estimation approaches, LDAK and GCTA.

Upon covariate adjustment, we observed h2 estimates from GCTA ranging from 21 to 32%. LDAK provided more conservative fully-adjusted estimates, ranging from 18 to 33%. This is most likely

due to the inclusion of LD and allele frequency patterns into the estimation, correcting uneven LD patterns across the genome26. Subsequent h2liab estimates, aimed to measure heritability

independent of disease prevalence, ranged from 19% to 34%. As described, we observed variable estimates across studies. This leads us to believe that age and sex were confounders, justifying

the need for a fully adjusted model and age-matching cases and controls in eMERGE to be representative of what was present in GenHAT and REGARDS. The h2liab estimates also showed slight

inflation in REGARDS across both methods and we speculate that could be a result of the smaller case proportion (~ 30%) compared to eMERGE and GenHAT (~ 50%). It is also important to note

that h2liab could potentially be biased, as previously reported in Golan et al., as it is a function of the prevalence in a given sample population27. Prior reports of heritability of T2D

and T2D-related traits (ranging 25–80%) have been described largely in populations or families of European descent. A 2005 familial aggregation study in ~ 2400 non-diabetic AA participants,

described heritability of 29% ± 9% for fasting glucose and 28% ± 8% for fasting insulin measures19, similar to our findings. Importantly, over 700 unique genetic loci associated with T2D

have been reported by GWAS28,29,30,31,32, explaining almost 20% of T2D heritability in a multi-ancestry sample31. The majority of these loci were identified in European or Asian

populations33, though > 25 loci were discovered in AAs31,34,35,36. Given the wide-range of reported heritability estimates it’s difficult to decipher how much missing heritability remains

after accounting for discovered significant loci. Therefore, more precise estimates of heritability such as that presented in our study are needed to inform future GWAS discovery and

precision medicine applications of GWAS data. Our study is subjected to several limitations. First, heterogeneity in the estimates could result from heterogeneity in the T2D phenotype,

heterogeneity in study designs, and population admixture. To account for these limitations, we adjusted for genetic principal components in each study, as well as the study site in

sensitivity models in eMERGE. We followed the same validated T2D phenotyping as a previously published report37, attempting to harmonize the phenotype between the population-based

longitudinal study (REGARDS), randomized clinical trial (GenHAT/ALLHAT), and electronic medical records (EMR) from eMERGE. While attempting to harmonize the genetic variants, it is important

to note the differences in the imputation reference panel between REGARDS and GenHAT (TOPMed release 2) versus eMERGE (HRC), which did result in the exclusion of nearly 7 million variants

from REGARDS and GenHAT. However, this number of variants is still compatible with other publications that have used these approaches38. Lastly, while the use of the contemporary arrays and

methodologies (e.g., Illumina MEGA array and TOPMed reference panel) allows for interrogation of more African-specific variants, the potential genetic contribution of rare minor allele

frequency (MAF) < 1% or structural variants was not investigated in this study. This is important since rare variants are thought to harbor more deleterious effects than common variants,

as well as playing an important role in accurately calculating the heritability of complex diseases39. In conclusion, we conducted one of the largest heritability estimates of T2D to date

utilizing genetic array data and to the best of our knowledge, the largest study of h2 in AAs, comprising three independent studies with sizable AA populations and T2D prevalence. All three

studies have well-described and extensive phenotyping, allowing for appropriate covariate adjustment and previously validated T2D case and control definitions. The use of imputed data allows

for more consistency across studies, with the inclusion of more than 8.2 million overlapping genetic variants. Though we observed similar trends using LDAK and GCTA, the considerations of

LD resulted in a more conservative h2 range across studies using LDAK. Our h2 estimates are lower with smaller confidence intervals than most familial studies on T2D that have been

predominately described in European populations; however, they are similar to a family study on fasting insulin and glucose levels in AAs. Future work in determining the heritability of

complex diseases, including T2D, is warranted in order to advance the availability of genomic resources for non-European populations. METHODS STUDY POPULATIONS Three cohorts, REGARDS,

GenHAT, and eMERGE contributed genetic data for this study, comprising 7,957 T2D cases and 11,378 controls of African descent (Fig. 1). Descriptive statistics for each study are described in

Table 1. Signed informed consent was collected from all participants in each study. All studies were reviewed and approved by the institutional review boards of the participating

institutions and study sites. REASONS FOR GEOGRAPHIC AND RACIAL DIFFERENCES IN STROKE (REGARDS) STUDY REGARDS is a national, longitudinal study of incident stroke and associated risk

factors, enrolling over 30,000 Black and white adults aged 45 years or older from all 48 contiguous US states and the District of Columbia40. Participants completed a computer-assisted

telephone interview (CATI) and an in-home visit where blood and urine were collected, as well as blood pressure measurements and a medicine review. Participants are contacted at 6 month

intervals to obtain information regarding incident stroke or secondary outcomes. A subset of 8916 Black REGARDS participants underwent genotyping using Illumina Infinium AMR/AFR (MEGA)

BeadChip arrays. Quality control procedures have been previously described41, but briefly, participants were excluded based on sex mismatches, internal duplicates, or HapMap controls.

Variants were excluded if they were located on sex chromosomes, had ambiguous strands, were not bi-allelic, were in violation of Hardy Weinberg (p < 1.00e-12 for REGARDS), had MAF <

5%, and/or had a missing rate > 10%. Imputation was performed using the Trans-omics for Precision Medicine (TOPMed) release 2 (Freeze 8) reference panel42. GENETICS OF HYPERTENSION

ASSOCIATED TREATMENTS (GENHAT) STUDY GenHAT is an ancillary study to the Antihypertensive and Lipid Lowering Treatment to Prevent Heart Attack Trial (ALLHAT). ALLHAT was a randomized,

double-blind multicenter clinical trial that enrolled over 42,000 high-risk individuals 55 years or older with hypertension and at least one additional risk factor for cardiovascular

disease43,44. The GenHAT study (N = 39,114) evaluated the interaction between candidate hypertensive genetic variants and antihypertensive treatments to modify the risk of CVD outcomes. A

subset of 7711 Black adults with hypertension was genotyped on Illumina MEGA BeadChip arrays. Similar to GenHAT, QC procedures have been previously described41. Participants were excluded

based on sex mismatches, internal duplicates, or HapMap controls, while variants were excluded if they were located on sex chromosomes, had ambiguous strands, were not bi-allelic, were in

violation of Hardy Weinberg (p < 1.00e-05), had MAF < 5%, and/or had a missing rate > 10%. Imputation was performed using the Trans-omics for Precision Medicine (TOPMed) release 2

(Freeze 8) reference panel42. ELECTRONIC MEDICAL RECORDS AND GENOMICS (EMERGE) NETWORK The eMERGE network combines DNA biorepositories with electronic medical records (EMR) for the purpose

of research focused on advancing efforts in genomic medicine. The eMERGE III cohort was initiated in 2015 and culminated in over 100,000 GWAS samples across the network, while eMERGE IV

aimed to develop and disseminate methodologies for genetic risk assessment, integrate genetics into routine medical practice to identify individuals at high risk for common disease, and

recommend interventions37,45. For the current study, genetic data from eight sites (Cincinnati Children’s Hospital Medical Center, Children’s Hospital of Philadelphia, Columbia University,

Mass General Brigham, Mayo Clinic, Icahn School of Medicine at Mount Sinai, Northwestern University, and Vanderbilt University Medical Center)46 was imputed against the Haplotype Reference

Consortium (HRC) panel47. To age-match cases and controls from eMERGE to the REGARDS and GenHAT studies, samples were selected from the age range 30–100. Individuals were divided into

deciles and numbers of cases and controls were matched in each decile (Supplemental Fig. 1). DEFINITION OF TYPE-2 DIABETES STATUS T2D status was defined independently across all three

studies. In the REGARDS study, T2D cases were classified based on fasting glucose ≥ 126 mg/dL (7 mmol/L), non-fasting glucose ≥ 200 mg/dL (11.1 mmol/L), or the use of diabetes medications

(e.g., oral hypoglycemic pills or insulin). The ALLHAT/GenHAT definition of T2D was described as a fasting glucose ≥ 140 mg/dL or use of diabetes medication43,44. Therefore, in the current

study we excluded controls that had baseline fasting glucose ≥ 126 mg/dL or missing a fasting glucose measure. In eMERGE, a revised EMR-based phenotyping algorithm based on ICD9/ICD10 codes

was applied across the participants37. Further information regarding all T2D definitions has been previously described in detail37. STATISTICAL ANALYSIS To estimate the h2 for T2D, we

employed two widely used methodologies, GCTA and LDAK, using an overlapping subset of 8,240,835 imputed genetic variants (Fig. 1). Three statistical models were fit: a base model without

covariates (Model 1), a model adjusting for genetic ancestry through principal component analysis48 (Model 2), and a model adjusting for genetic ancestry, age, and sex (Model 3). In addition

to the stated Model 3, a sensitivity analysis further adjusting for the study site to account for potential heterogeneity in sample characteristics across sites in the eMERGE cohort was

performed, and the results remained consistent. GENOME-WIDE COMPLEX TRAIT ANALYSIS (GCTA) METHOD Both genotypes and imputed variants that passed quality control (QC) filters were used to

construct a genomic relationship matrix (GRM) using the GCTA tool, as previously described using the genome-based restricted maximum likelihood (GREML)49,50. The GRM reflects allele sharing

(${A}_{ij}$) between two individuals (_i_ and _j_) across variants with entries $$A_{ij} = \frac{1}{m}\mathop \sum \limits_{k = 1}^{k = m} \frac{{\left( {x_{ik} - 2p_{k} } \right)\left(

{x_{jk} - 2p_{k} } \right)}}{{2p_{k} \left( {1 - p_{k} } \right)}},$$ (1) where _m_ is the number of variants, _x__ik_ and _x__jk_ are the genotypes coded as 0, 1, or 2 of individuals i and

_j,_ respectively, at the _k__th_ locus, and _p__k_ is the MAF of the _k__th_ locus. The variance of T2D was calculated as $$var\left( {T2D} \right) = A\sigma_{v}^{2} + I\sigma_{e}^{2} ,$$

(2) where the variance explained by the genetic variants (${\sigma }_{v}^{2}$) corresponding to GRM and residual error variance (${\sigma }_{e}^{2}$) were estimated using restricted

maximum likelihood (REML), _A_ is an _n_ × _n_ matrix with elements _A__ij,_ and I is an _n_ × _n_ identity matrix. The proportion of the variance of T2D explained by all the genetic

variants (h2) on the observed scale was then calculated as: $$h^{2} = \frac{{\sigma_{v}^{2} }}{{\left( {\sigma_{v}^{2} + \sigma_{e}^{2} } \right)}}$$ (3) We removed one individual from

relative pairs with estimated genetic relatedness greater than 0.025 to ensure no closely-related individuals were included in heritability estimates (e.g., parent-offspring, siblings,

cousins). LINKAGE-DISEQUILIBRIUM ADJUSTED KINSHIPS (LDAK) METHOD Knowing that African ancestral populations have greater haplotype diversity and, in turn, shorter segments of linked

alleles51, we utilized LDAK, which incorporates linkage disequilibrium (LD), as an additional approach. LDAK can be used as an alternative method of generating a GRM by weighting genetic

variants based on local LD patterns52. As previously described, the genetic variance of variants in high LD with a causal variant is typically overestimated in GCTA, while the genetic

variance is underestimated in lower LD regions52, thus demonstrating the importance of accounting for LD in the construction of the LD-weighted GRM. Therefore LD-weighting eliminates the

overestimation of heritability in high LD regions and underestimation of heritability in low LD regions by giving smaller weights to markers in the high-LD regions and large weights to

markers in low LD regions53. The GRM for LDAK is constructed as follows: $$GRM_{LDAK} = \frac{XWX^{\prime}}{m}$$ (4) where W is the diagonal matrix with elements representing the LD-weight

for each variant, N is the total number of variants, and X is a matrix with the general term $$x_{ij} = \left( {m_{ij} - 2_{pj} } \right) / \left( {\sqrt {2_{pj} \left( {1 - p_{j} } \right)}

} \right)$$ (5) with ${p}_{j}$ being the frequency of a given allele at variant _j_ and ${m}_{ij}$ being the genotype for the _j-th_ variant in the _i-th_ individual (represented by 0,

1, or 2). When estimating heritability, LDAK assumes: $$E\left[ {h_{j}^{2} } \right] \propto \left[ {f_{i} \left( {1 - f_{i} } \right)} \right]^{1 + \alpha } \times \varpi_{j} \times r_{j}$$

(6) where $E\left[{h}_{j}^{2}\right]$ is the expected heritability contribution of genetic variant _j_ and ${f}_{i}$ is its observed MAF. The parameter α determines the assumed

relationship between heritability and MAF. The genetic variant weighs ( ${\varpi }_{j}$) are based on the local level of LD and tend to be higher for variants in low-LD regions; thus, LDAK

assumes that these variants contribute more than those in the high-LD areas. ${r}_{j}$ $\epsilon \left[\text{0,1}\right]$ is an information score measuring genotype certainty, where

LDAK assumes higher-quality variants contribute more than lower-quality ones54. LIABILITY SCALE In order to account for the inflated proportion of cases in case–control designs, the

heritability estimation on the observed scale was transformed to that on the liability as $$h_{liab}^{2} = h^{2} \frac{{K\left( {1 - K} \right)}}{{z^{2} }} \frac{{K\left( {1 - K}

\right)}}{{P\left( {1 - P} \right)}},$$ (7) where $K$ is the population prevalence of the T2D in AAs,_ P_ is the sample prevalence of T2D, and z is the height of the standard normal

probability density function at the truncation threshold t, as previously described55. The AA-specific T2D prevalence of 12.5% was extracted from recent literature56. DATA AVAILABILITY The

REGARDS (Study Accession: phs002719.v1.p1), GenHAT (Study Accession: phs002716.v1.p1) and eMERGE (Study Accession: phs001584.v2.p2) phenotypic and genetic data are available on dbGaP.

REFERENCES * Virani, S. S. _et al._ Heart disease and stroke statistics-2021 update: A report from the american heart association. _Circulation_ 143, e254–e743.

https://doi.org/10.1161/CIR.0000000000000950 (2021). Article PubMed Google Scholar * Prasad, R. B. & Groop, L. Genetics of type 2 diabetes-pitfalls and possibilities. _Genes_ 6,

87–123. https://doi.org/10.3390/genes6010087 (2015). Article CAS PubMed PubMed Central Google Scholar * Factors, E. R. _et al._ Diabetes mellitus, fasting blood glucose concentration,

and risk of vascular disease: A collaborative meta-analysis of 102 prospective studies. _Lancet_ 375, 2215–2222. https://doi.org/10.1016/S0140-6736(10)60484-9 (2010). Article CAS Google

Scholar * Mayer-Davis, E. J. _et al._ Incidence trends of type 1 and type 2 diabetes among youths, 2002–2012. _N. Engl. J. Med._ 376, 1419–1429. https://doi.org/10.1056/NEJMoa1610187

(2017). Article PubMed PubMed Central Google Scholar * Lawrence, J. M. _et al._ Trends in prevalence of type 1 and type 2 diabetes in children and adolescents in the US, 2001–2017.

_JAMA_ 326, 717–727. https://doi.org/10.1001/jama.2021.11165 (2021). Article PubMed PubMed Central Google Scholar * Stankov, K., Benc, D. & Draskovic, D. Genetic and epigenetic

factors in etiology of diabetes mellitus type 1. _Pediatrics_ 132, 1112–1122. https://doi.org/10.1542/peds.2013-1652 (2013). Article PubMed Google Scholar * Lyssenko, V. _et al._ Clinical

risk factors, DNA variants, and the development of type 2 diabetes. _N. Engl. J. Med._ 359, 2220–2232. https://doi.org/10.1056/NEJMoa0801869 (2008). Article CAS PubMed Google Scholar *

Galaviz, K. I., Narayan, K. M. V., Lobelo, F. & Weber, M. B. Lifestyle and the prevention of type 2 diabetes: A status report. _Am. J. Lifestyle Med._ 12, 4–20.

https://doi.org/10.1177/1559827615619159 (2018). Article PubMed Google Scholar * Kaprio, J. _et al._ Concordance for type 1 (insulin-dependent) and type 2 (non-insulin-dependent) diabetes

mellitus in a population-based cohort of twins in Finland. _Diabetologia_ 35, 1060–1067. https://doi.org/10.1007/BF02221682 (1992). Article CAS PubMed Google Scholar * Newman, B. _et

al._ Concordance for type 2 (non-insulin-dependent) diabetes mellitus in male twins. _Diabetologia_ 30, 763–768. https://doi.org/10.1007/BF00275741 (1987). Article CAS PubMed Google

Scholar * Poulsen, P., Kyvik, K. O., Vaag, A. & Beck-Nielsen, H. Heritability of type II (non-insulin-dependent) diabetes mellitus and abnormal glucose tolerance–a population-based twin

study. _Diabetologia_ 42, 139–145. https://doi.org/10.1007/s001250051131 (1999). Article CAS PubMed Google Scholar * Medici, F., Hawa, M., Ianari, A., Pyke, D. A. & Leslie, R. D.

Concordance rate for type II diabetes mellitus in monozygotic twins: Actuarial analysis. _Diabetologia_ 42, 146–150. https://doi.org/10.1007/s001250051132 (1999). Article CAS PubMed

Google Scholar * Chen, X. _et al._ Dominant genetic variation and missing heritability for human complex traits: Insights from twin versus genome-wide common SNP models. _Am. J. Hum.

Genet._ 97, 708–714. https://doi.org/10.1016/j.ajhg.2015.10.004 (2015). Article CAS PubMed PubMed Central Google Scholar * Wang, Y., Vik, J. O., Omholt, S. W. & Gjuvsland, A. B.

Effect of regulatory architecture on broad versus narrow sense heritability. _PLoS Comput. Biol._ 9, e1003053. https://doi.org/10.1371/journal.pcbi.1003053 (2013). Article ADS CAS PubMed

PubMed Central Google Scholar * Karavolias, N. G. _et al._ Low additive genetic variation in a trait under selection in domesticated rice. _G3 (Bethesda)_ 10, 2435–2443.

https://doi.org/10.1534/g3.120.401194 (2020). Article CAS PubMed Google Scholar * Genomes Project, C. _et al._ 2012 An integrated map of genetic variation from 1,092 human genomes.

_Nature_ 491, 56-65, https://doi.org/10.1038/nature11632 (2012). * Hall, J. B. & Bush, W. S. Analysis of heritability using genome-wide data. _Curr. Protoc. Hum. Genet._

https://doi.org/10.1002/cphg.25 (2016). Article PubMed PubMed Central Google Scholar * Miljkovic-Gacic, I. _et al._ Genetic determination of adiponectin and its relationship with body

fat topography in multigenerational families of African heritage. _Metabolism_ 56, 234–238. https://doi.org/10.1016/j.metabol.2006.09.019 (2007). Article CAS PubMed PubMed Central Google

Scholar * Freedman, B. I. _et al._ Genome-wide scans for heritability of fasting serum insulin and glucose concentrations in hypertensive families. _Diabetologia_ 48, 661–668.

https://doi.org/10.1007/s00125-005-1679-5 (2005). Article CAS PubMed Google Scholar * Poveda, A. _et al._ The heritable basis of gene-environment interactions in cardiometabolic traits.

_Diabetologia_ 60, 442–452. https://doi.org/10.1007/s00125-016-4184-0 (2017). Article CAS PubMed Google Scholar * Vattikuti, S., Guo, J. & Chow, C. C. Heritability and genetic

correlations explained by common SNPs for metabolic syndrome traits. _PLoS Genet._ 8, e1002637. https://doi.org/10.1371/journal.pgen.1002637 (2012). Article CAS PubMed PubMed Central

Google Scholar * Almgren, P. _et al._ Heritability and familiality of type 2 diabetes and related quantitative traits in the Botnia Study. _Diabetologia_ 54, 2811–2819.

https://doi.org/10.1007/s00125-011-2267-5 (2011). Article CAS PubMed Google Scholar * Mansour, O., Golden, S. H. & Yeh, H. C. Disparities in mortality among adults with and without

diabetes by sex and race. _J. Diabetes Complicat._ 34, 107496. https://doi.org/10.1016/j.jdiacomp.2019.107496 (2020). Article Google Scholar * Lanting, L. C., Joung, I. M., Mackenbach, J.

P., Lamberts, S. W. & Bootsma, A. H. Ethnic differences in mortality, end-stage complications, and quality of care among diabetic patients: A review. _Diabetes Care_ 28, 2280–2288.

https://doi.org/10.2337/diacare.28.9.2280 (2005). Article PubMed Google Scholar * Horowitz, C. R. _et al._ Race, genomics and chronic disease: what patients with African ancestry have to

say. _J. Health Care Poor Underserved_ 28, 248–260. https://doi.org/10.1353/hpu.2017.0020 (2017). Article PubMed PubMed Central Google Scholar * Srivastava, A. K., Williams, S. M. &

Zhang, G. Heritability estimation approaches utilizing genome-wide data. _Curr. Protoc._ 3, e734. https://doi.org/10.1002/cpz1.734 (2023). Article CAS PubMed PubMed Central Google

Scholar * Golan, D., Lander, E. S. & Rosset, S. Measuring missing heritability: Inferring the contribution of common variants. _Proc. Natl. Acad. Sci. USA_ 111, E5272-5281.

https://doi.org/10.1073/pnas.1419064111 (2014). Article ADS CAS PubMed PubMed Central Google Scholar * Goodarzi, M. O. & Rotter, J. I. Genetics insights in the relationship between

type 2 diabetes and coronary heart disease. _Circ. Res._ 126, 1526–1548. https://doi.org/10.1161/CIRCRESAHA.119.316065 (2020). Article CAS PubMed PubMed Central Google Scholar *

Morris, A. P. Progress in defining the genetic contribution to type 2 diabetes susceptibility. _Curr. Opin. Genet. Dev._ 50, 41–51. https://doi.org/10.1016/j.gde.2018.02.003 (2018). Article

CAS PubMed Google Scholar * Mahajan, A. _et al._ Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. _Nat.

Genet._ 50, 1505–1513. https://doi.org/10.1038/s41588-018-0241-6 (2018). Article CAS PubMed PubMed Central Google Scholar * Vujkovic, M. _et al._ Discovery of 318 new risk loci for type

2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis. _Nat. Genet._ 52, 680–691. https://doi.org/10.1038/s41588-020-0637-y (2020).

Article CAS PubMed PubMed Central Google Scholar * Mahajan, A. _et al._ Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and

translation. _Nat. Genet._ 54, 560–572. https://doi.org/10.1038/s41588-022-01058-3 (2022). Article CAS PubMed PubMed Central Google Scholar * DeForest, N. & Majithia, A. R. Genetics

of type 2 diabetes: Implications from large-scale studies. _Curr. Diab. Rep._ 22, 227–235. https://doi.org/10.1007/s11892-022-01462-3 (2022). Article CAS PubMed PubMed Central Google

Scholar * Ng, M. C. _et al._ Meta-analysis of genome-wide association studies in African Americans provides insights into the genetic architecture of type 2 diabetes. _PLoS Genet._ 10,

e1004517. https://doi.org/10.1371/journal.pgen.1004517 (2014). Article CAS PubMed PubMed Central Google Scholar * Palmer, N. D. _et al._ A genome-wide association search for type 2

diabetes genes in African Americans. _PLoS One_ 7, e29202. https://doi.org/10.1371/journal.pone.0029202 (2012). Article ADS CAS PubMed PubMed Central Google Scholar * Chen, J. _et al._

Genome-wide association study of type 2 diabetes in Africa. _Diabetologia_ 62, 1204–1211. https://doi.org/10.1007/s00125-019-4880-7 (2019). Article PubMed PubMed Central Google Scholar

* Ge, T. _et al._ Development and validation of a trans-ancestry polygenic risk score for type 2 diabetes in diverse populations. _Genome Med._ 14, 70.

https://doi.org/10.1186/s13073-022-01074-2 (2022). Article PubMed PubMed Central Google Scholar * Jung, H. U. _et al._ Gene-environment interaction explains a part of missing

heritability in human body mass index. _Commun. Biol._ 6, 324. https://doi.org/10.1038/s42003-023-04679-4 (2023). Article PubMed PubMed Central Google Scholar * Manolio, T. A. Genomewide

association studies and assessment of the risk of disease. _N. Engl. J. Med._ 363, 166–176. https://doi.org/10.1056/NEJMra0905980 (2010). Article CAS PubMed Google Scholar * Howard, V.

J. _et al._ The reasons for geographic and racial differences in stroke study: Objectives and design. _Neuroepidemiology_ 25, 135–143. https://doi.org/10.1159/000086678 (2005). Article

PubMed Google Scholar * Armstrong, N. D. _et al._ Genetic contributors of incident stroke in 10,700 African Americans With hypertension: A meta-analysis from the genetics of hypertension

associated treatments and reasons for geographic and racial differences in stroke studies. _Front. Genet._ 12, 781451. https://doi.org/10.3389/fgene.2021.781451 (2021). Article CAS PubMed

PubMed Central Google Scholar * Das, S. _et al._ Next-generation genotype imputation service and methods. _Nat. Genet._ 48, 1284–1287. https://doi.org/10.1038/ng.3656 (2016). Article

CAS PubMed PubMed Central Google Scholar * Major cardiovascular events in hypertensive patients randomized to doxazosin vs chlorthalidone: the antihypertensive and lipid-lowering

treatment to prevent heart attack trial (ALLHAT). ALLHAT Collaborative Research Group. _JAMA_ 283, 1967–1975 (2000). * Arnett, D. K. _et al._ Pharmacogenetic approaches to hypertension

therapy: Design and rationale for the genetics of hypertension associated treatment (GenHAT) study. _Pharmacogenomics J._ 2, 309–317. https://doi.org/10.1038/sj.tpj.6500113 (2002). Article

CAS PubMed Google Scholar * Consortium, e. Lessons learned from the eMERGE Network: balancing genomics in discovery and practice. _HGG Adv_ 2, 100018,

https://doi.org/10.1016/j.xhgg.2020.100018 (2021). * Stanaway, I. B. _et al._ The eMERGE genotype set of 83,717 subjects imputed to ~40 million variants genome wide and association with the

herpes zoster medical record phenotype. _Genet. Epidemiol._ 43, 63–81. https://doi.org/10.1002/gepi.22167 (2019). Article PubMed Google Scholar * McCarthy, S. _et al._ A reference panel

of 64,976 haplotypes for genotype imputation. _Nat. Genet._ 48, 1279–1283. https://doi.org/10.1038/ng.3643 (2016). Article CAS PubMed PubMed Central Google Scholar * Price, A. L. _et

al._ Principal components analysis corrects for stratification in genome-wide association studies. _Nat. Genet._ 38, 904–909. https://doi.org/10.1038/ng1847 (2006). Article CAS PubMed

Google Scholar * Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: A tool for genome-wide complex trait analysis. _Am. J. Hum. Genet._ 88, 76–82.

https://doi.org/10.1016/j.ajhg.2010.11.011 (2011). Article CAS PubMed PubMed Central Google Scholar * Evans, L. M. _et al._ Narrow-sense heritability estimation of complex traits using

identity-by-descent information. _Heredity (Edinb)_ 121, 616–630. https://doi.org/10.1038/s41437-018-0067-0 (2018). Article PubMed Google Scholar * Charles, B. A., Shriner, D. &

Rotimi, C. N. Accounting for linkage disequilibrium in association analysis of diverse populations. _Genet. Epidemiol._ 38, 265–273. https://doi.org/10.1002/gepi.21788 (2014). Article

PubMed Google Scholar * Speed, D., Hemani, G., Johnson, M. R. & Balding, D. J. Improved heritability estimation from genome-wide SNPs. _Am. J. Hum. Genet._ 91, 1011–1021.

https://doi.org/10.1016/j.ajhg.2012.10.010 (2012). Article CAS PubMed PubMed Central Google Scholar * Ren, D. _et al._ Impact of linkage disequilibrium heterogeneity along the genome on

genomic prediction and heritability estimation. _Genet. Sel. Evol._ 54, 47. https://doi.org/10.1186/s12711-022-00737-3 (2022). Article CAS PubMed PubMed Central Google Scholar * Ma, Y.

_et al._ Excess heritability contribution of alcohol consumption variants in the “Missing Heritability” of type 2 diabetes mellitus. _Int. J. Mol. Sci._

https://doi.org/10.3390/ijms222212318 (2021). Article PubMed PubMed Central Google Scholar * Lee, S. H., Wray, N. R., Goddard, M. E. & Visscher, P. M. Estimating missing heritability

for disease from genome-wide association studies. _Am. J. Hum. Genet._ 88, 294–305. https://doi.org/10.1016/j.ajhg.2011.02.002 (2011). Article CAS PubMed PubMed Central Google Scholar

* Wang, L. _et al._ Trends in prevalence of diabetes and control of risk factors in diabetes among US adults, 1999–2018. _JAMA_ https://doi.org/10.1001/jama.2021.9883 (2021). Article PubMed

PubMed Central Google Scholar Download references ACKNOWLEDGEMENTS The authors thank all the eMERGE sites for providing genomic and health information. Additionally, the authors thank

the other investigators, the staff, and the participants of the REGARDS study for their valuable contributions. A full list of participating REGARDS investigators and institutions can be

found at: https://www.uab.edu/soph/regardsstudy/. FUNDING The eMERGE Network was funded by the National Human Genome Research Institute (NHGRI) through the following grants: U01HG006828

(Cincinnati Children's Hospital Medical Center and Boston Children’s Hospital); U01HG006830 (Children’s Hospital of Philadelphia); U01HG006389 (Essentia Institute of Rural Health,

Marshfield Clinic Research Foundation, and Pennsylvania State University); U01HG006382 (Geisinger Clinic); U01HG006375 (Group Health Cooperative and the University of Washington);

U01HG006379 (Mayo Clinic); U01HG006380 (Icahn School of Medicine at Mount Sinai); U01HG006388 (Northwestern University); U01HG006378 (Vanderbilt University Medical Center); and U01HG006385

(Vanderbilt University Medical Center serving as the Coordinating Center). The eMERGE IV Mass General Brigham site was funded by the NHGRI through U01HG008685, the Columbia University site

was funded through U01HG008680, and the University of Alabama at Birmingham site was funded through U01HG011167. The REGARDS (R01HL136666, MRI, LAL) and GenHAT (R01HL123782, MRI) genetic

studies were supported by the National Heart, Lung, and Blood Institute (NHLBI). The parent REGARDS study was supported by cooperative agreement U01 NS041588, co-funded by the National

Institute of Neurological Disorders and Stroke (NINDS) and the National Institute on Aging (NIA).The content is solely the responsibility of the authors and does not necessarily represent

the official views of the NINDS or the NIA. Representatives of the NINDS were involved in the review of the manuscript but were not directly involved in the collection, management, analysis,

or interpretation of the data. Other funding sources include NHLBI T32HL072757 (N.D.A.), UM1 DK078616 (J.B.M.) R01HL151855 (J.B.M.), R01HL092173 (N.A.L.), and R00AG054573 (T.G.). AUTHOR

INFORMATION AUTHORS AND AFFILIATIONS * Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AL, USA Nicole D. Armstrong & Marguerite R. Irvin * Department of

Biostatistics, University of Alabama at Birmingham, Birmingham, AL, USA Amit Patki, Vinodh Srinivasasainagendra & Hemant K. Tiwari * Center for Genomic Medicine, Massachusetts General

Hospital, Boston, MA, USA Tian Ge * Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Tian Ge * Division of Biomedical Informatics and Personalized Medicine,

Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA Leslie A. Lange * Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical

Center, Cincinnati, OH, USA Leah Kottyan & Bahram Namjou * Department of Pediatrics, Cincinnati Children’s Hospital Medical Center &, The University of Cincinnati, Cincinnati, OH,

USA Amy S. Shah * Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA Laura J. Rasmussen-Torvik * Division of Medical Genetics,

Department of Medicine, University of Washington, Seattle, WA, USA Gail P. Jarvik * Division of General Internal Medicine, Department of Medicine, Massachusetts General Hospital, Boston, MA,

USA James B. Meigs * Department of Medicine, Harvard Medical School, Boston, MA, USA James B. Meigs * Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA James

B. Meigs * Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA Elizabeth W. Karlson * Mass General Brigham Personalized Medicine, Boston, MA, USA Elizabeth W. Karlson *

Department of Neurology, University of Alabama at Birmingham, Birmingham, AL, USA Nita A. Limdi Authors * Nicole D. Armstrong View author publications You can also search for this author

inPubMed Google Scholar * Amit Patki View author publications You can also search for this author inPubMed Google Scholar * Vinodh Srinivasasainagendra View author publications You can also

search for this author inPubMed Google Scholar * Tian Ge View author publications You can also search for this author inPubMed Google Scholar * Leslie A. Lange View author publications You

can also search for this author inPubMed Google Scholar * Leah Kottyan View author publications You can also search for this author inPubMed Google Scholar * Bahram Namjou View author

publications You can also search for this author inPubMed Google Scholar * Amy S. Shah View author publications You can also search for this author inPubMed Google Scholar * Laura J.

Rasmussen-Torvik View author publications You can also search for this author inPubMed Google Scholar * Gail P. Jarvik View author publications You can also search for this author inPubMed

Google Scholar * James B. Meigs View author publications You can also search for this author inPubMed Google Scholar * Elizabeth W. Karlson View author publications You can also search for

this author inPubMed Google Scholar * Nita A. Limdi View author publications You can also search for this author inPubMed Google Scholar * Marguerite R. Irvin View author publications You

can also search for this author inPubMed Google Scholar * Hemant K. Tiwari View author publications You can also search for this author inPubMed Google Scholar CONTRIBUTIONS NDA, NAL, MRI,

and HKT contributed to the concept and design of the study. AP and VS performed quality control of the genomics data. AP performed the heritability analysis. TG, EWK, MRI, and HKT provided

insight into methodology. LAL, LK, BN, ASS, LJR-T, GPJ, JBM, provided interpretation of the results. MRI and HKT supervised the study. NDA, MRI, and HKW drafted the manuscript. All authors

provided critical edits on subsequent versions of the manuscript and approved the final version. CORRESPONDING AUTHOR Correspondence to Nicole D. Armstrong. ETHICS DECLARATIONS COMPETING

INTERESTS JBM is an Academic Associate with Quest Diagnostics R&D. The remaining authors declare that they have no competing interests. ADDITIONAL INFORMATION PUBLISHER'S NOTE

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION. RIGHTS AND

PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any

medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The

images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is

not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission

directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE

Armstrong, N.D., Patki, A., Srinivasasainagendra, V. _et al._ Variant level heritability estimates of type 2 diabetes in African Americans. _Sci Rep_ 14, 14009 (2024).

https://doi.org/10.1038/s41598-024-64711-3 Download citation * Received: 13 July 2023 * Accepted: 12 June 2024 * Published: 18 June 2024 * DOI: https://doi.org/10.1038/s41598-024-64711-3

SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a shareable link is not currently available for this article. Copy to

clipboard Provided by the Springer Nature SharedIt content-sharing initiative KEYWORDS * Heritable quantitative trait * Type 2 diabetes mellitus * Genomics * Genetic polymorphisms *

Disparities