Genetics & Longevity Variants
22 terms
- ACE I/D polymorphism
The ACE insertion/deletion (I/D) polymorphism (rs4646994) involves a 287-bp Alu repeat in intron 16 of the angiotensin-converting enzyme gene; the D allele is associated with approximately 1.5–2 times the serum ACE activity of the II genotype, influencing the renin-angiotensin-aldosterone system and thereby blood pressure regulation and cardiovascular tone. Early studies reported associations between the D allele and myocardial infarction risk, and between the I allele and elite endurance performance, though many of these findings have not replicated consistently in larger studies or meta-analyses. In the context of longevity, associations have been reported in several centenarian cohorts but the direction and magnitude are inconsistent across populations. The ACE I/D locus is best understood as a modest modulator of ACE enzyme levels with population-specific effects, rather than a robust longevity locus.
- APOE ε4 allele (mechanism)
The APOE ε4 allele encodes apolipoprotein E isoform E4, which differs from the ε3 isoform at residue 112 (cysteine→arginine), altering lipoprotein binding preferences and reducing efficient clearance of triglyceride-rich remnants and LDL from circulation. In the brain, E4 impairs amyloid-β clearance via the blood-brain barrier and through astrocytic and microglial processing, promotes tau pathology, and potentiates neuroinflammation via microglial activation — effects partly independent of amyloid. The allele confers dose-dependent Alzheimer's disease risk: one ε4 copy increases risk approximately 3–4-fold, two copies approximately 8–12-fold in European populations, with risk magnitudes varying across ancestries. Despite its disease associations, the ε4 allele has been maintained at roughly 14% average global allele frequency (with substantial regional variation, ~8–10% in East Asia to >30% in some African populations), likely reflecting ancient trade-offs involving immune function, fertility, and early cognitive performance.
- ATM (DNA-damage-response gene)
ATM (ataxia-telangiectasia mutated) encodes a serine/threonine protein kinase that is the master activator of the DNA damage response to double-strand breaks; upon activation it phosphorylates hundreds of substrates including H2AX, CHK2, p53, and BRCA1 to coordinate cell-cycle arrest, DNA repair, and apoptosis. Biallelic loss-of-function mutations cause ataxia-telangiectasia, a recessive syndrome characterized by cerebellar neurodegeneration, immunodeficiency, radiation sensitivity, and a cancer risk exceeding 30%. Heterozygous ATM carriers (~1% of the population) have intermediate cancer risk — particularly for breast and colorectal cancer — and recent data suggest moderately elevated cardiovascular disease risk, placing ATM among the clinically actionable cancer-predisposition genes. In longevity biology, compromised ATM function exemplifies how defects in the DNA damage response accelerate hallmarks of aging including genomic instability and inflammation.
- CETP I405V variant
Cholesteryl ester transfer protein (CETP) mediates the exchange of cholesteryl esters from HDL for triglycerides in VLDL and LDL, effectively lowering HDL-cholesterol. The I405V variant (rs5882) in the CETP gene reduces CETP activity and is enriched in Ashkenazi Jewish centenarians and their offspring relative to controls in the Longevity Genes Project at Albert Einstein College of Medicine (Barzilai et al., 2003), accompanying notably high HDL and large HDL particle size. Large HDL particles are more effective in reverse cholesterol transport and are associated with reduced cardiovascular and cognitive disease risk. The variant illustrates how a naturally occurring loss-of-function polymorphism can phenocopy the cardiovascular benefit sought by pharmacological CETP inhibitors, several of which failed in trials despite HDL elevation — suggesting particle quality over quantity matters.
- DNA repair pathways (NER, BER, HR, NHEJ)
DNA repair pathways are conserved mechanisms by which cells detect and correct genomic lesions — up to 100,000 per cell per day from endogenous sources (reactive oxygen species, replication errors, spontaneous hydrolysis). Four major pathways handle distinct damage types: nucleotide excision repair (NER) removes bulky adducts including UV-induced pyrimidine dimers; base excision repair (BER) corrects small oxidative lesions such as 8-oxoguanine; homologous recombination (HR) restores double-strand breaks with high fidelity via the sister chromatid in S/G2 phase; non-homologous end joining (NHEJ) ligates broken ends rapidly but less accurately, dominating DSB repair in non-dividing cells. Repair capacity declines with age — NER efficiency drops measurably in fibroblasts from older donors — and genomic instability is a recognized hallmark of aging (Schumacher et al., Nature 2021). Progeroid syndromes offer causal evidence: NER defects cause xeroderma pigmentosum (1,000-fold elevated cancer risk) and Cockayne syndrome; DSB-repair defects underlie Bloom and Werner syndromes. Oldest-old genomics (Kim et al. 2018) shows enrichment for variants in ERCC2, RAD52, and XRCC5. Repair enhancement as an aging therapy is unresolved; SIRT6 stimulates HR and NHEJ as an active preclinical target.
- Epigenome-wide association study (EWAS)
An epigenome-wide association study (EWAS) is a hypothesis-free scan that tests associations between DNA methylation levels at hundreds of thousands of CpG sites (cytosine–guanine dinucleotides where the cytosine can acquire a methyl group) across the genome and a phenotype of interest — such as chronological age, a disease, or an environmental exposure. Methylation is typically quantified using microarrays (most commonly the Illumina HumanMethylation450 or EPIC/850K BeadChip) or whole-genome bisulfite sequencing, yielding beta-values between 0 and 1 for each site; a linear or logistic regression is then run at every CpG, with correction for multiple testing and for confounders including estimated blood-cell-type proportions. In aging research, EWAS generated many of the CpG training sets that underlie first-generation epigenetic clocks: Hannum et al. (2013) used methylation data from 656 blood samples spanning ages 19–101 to identify 71 age-associated CpGs that predicted biological age with high accuracy in independent cohorts, and similar approaches led to Horvath's 353-CpG pan-tissue clock. The EWAS Catalog (Battram et al., 2022) aggregates over 1.7 million associations from more than 2,600 EWAS — including both peer-reviewed publications and unpublished scans — enabling lookup of CpG-phenotype links across cohorts. A persistent methodological challenge is reverse causation: a disease or age-related process can itself reshape the methylome, so an observed association does not establish that the CpG change precedes or drives the outcome; Mendelian randomisation and longitudinal designs are increasingly used alongside EWAS to triangulate causal direction.
- FOXO3 longevity variant
FOXO3 encodes a forkhead-box transcription factor that integrates signals from the insulin/IGF-1 and AMPK pathways to regulate stress resistance, autophagy, apoptosis, and antioxidant gene expression. A cluster of intronic single-nucleotide polymorphisms — most prominently rs2802292 — was first associated with exceptional longevity in Hawaiian men of Japanese ancestry (Willcox et al., 2008) and has since been replicated across multiple independent cohorts in Europe, East Asia, and Ashkenazi populations. The protective allele is thought to promote nuclear retention of FOXO3, enhancing expression of downstream targets including GADD45, SOD2, and autophagy regulators. Because FOXO3 sits at the convergence of multiple conserved longevity pathways, it remains one of the most consistently replicated genetic associations with human lifespan.
- GWAS (Genome-wide association study)
A genome-wide association study (GWAS) is an agnostic scan of common single-nucleotide polymorphisms (SNPs, typically minor allele frequency >1–5%) across the genome to identify loci statistically associated with a trait or disease, using a stringent significance threshold of p<5×10⁻⁸ to control for multiple testing of ~1 million tag SNPs in linkage disequilibrium (LD) with surrounding variants. GWAS operates on the common-disease/common-variant hypothesis and is optimized for polygenic traits; most discovered variants have modest individual effect sizes (OR 1.05–1.3), requiring very large sample sizes (tens to hundreds of thousands) to detect reliably. In longevity, GWAS findings are relatively sparse: the APOE locus (particularly ε2 protection and ε4 risk) is by far the strongest and most replicated hit for exceptional longevity; other candidates including FOXO3, TOMM40/APOC1, and CDKN2B-AS1 are supported by some studies but lack universal replication. The modest GWAS yield for longevity likely reflects its heterogeneous, polygenic, and late-acting genetic architecture.
- Klotho KL-VS variant
The KL-VS haplotype of the klotho gene (six linked variants in complete linkage disequilibrium, two of which produce the amino-acid substitutions F352V and C370S in exon 2) increases serum klotho protein levels and is associated with longevity in heterozygous but not homozygous carriers — a pattern consistent with heterozygote advantage. Heterozygous KL-VS carriers show elevated circulating klotho and, in several studies, improved cognitive function and reduced dementia risk, with structural MRI showing greater right dorsolateral prefrontal cortex volume in heterozygotes (Yokoyama et al., 2015) and later intrinsic-connectivity work linking serum klotho to prefrontal network strength (Yokoyama et al., 2017); the Dubal et al. (2014) study reported cognition outcomes rather than fMRI findings. The variant also associates with favorable cardiovascular and bone mineral density profiles in some cohorts. Mechanistically, higher serum klotho is thought to enhance FGF23 co-receptor signaling, modulate Wnt and IGF-1 pathways, and exert neuroprotective effects independently of its endocrine roles.
- LMNA (Lamin A/C gene; HGPS)
LMNA encodes the nuclear lamina proteins Lamin A and Lamin C through alternative splicing; the lamins form a filamentous meshwork underlying the inner nuclear membrane that provides mechanical support and organizes peripheral chromatin, influencing gene expression, DNA repair, and nuclear shape. A de novo C→T transition at position 1824 (c.1824C>T; G608G) in exon 11 activates a cryptic splice site, producing a truncated and permanently farnesylated Lamin A isoform called progerin, which causes Hutchinson-Gilford Progeria Syndrome (HGPS) — a devastating childhood progeroid syndrome with accelerated cardiovascular disease and a median survival of approximately 14 years. Importantly, low levels of the same aberrant splice product accumulate in normal aging cells even without the HGPS mutation, and nuclear lamina integrity declines broadly with age, suggesting LMNA biology has relevance beyond the rare syndrome. Lonafarnib (Zokinvy), a farnesyltransferase inhibitor that blocks progerin farnesylation, received FDA approval in November 2020 for HGPS and extends median survival by approximately 2.5 years; additional progerin-targeting approaches are in early investigation.
- Mitochondrial haplogroups
Mitochondrial haplogroups are clusters of maternally inherited mitochondrial DNA (mtDNA) haplotypes defined by shared polymorphisms, reflecting ancient migration patterns and geographic lineages. Because mtDNA encodes 13 essential respiratory-chain subunits and 22 tRNAs, haplogroup-defining variants can subtly alter oxidative phosphorylation efficiency, reactive oxygen species production, and mitochondrial morphology. Several studies have reported associations between specific haplogroups and longevity, most notably sub-haplogroups D4a and D5 among Japanese centenarians (Tanaka et al.), and haplogroup J in some European centenarian cohorts; however, replication across populations is inconsistent, sample sizes in the original studies were modest, and population stratification is a persistent confound. Mitochondrial haplogroups therefore represent plausible but not firmly established modulators of aging trajectory.
- mtDNA heteroplasmy
mtDNA heteroplasmy is the coexistence of two or more distinct mitochondrial DNA sequences in a cell, tissue, or individual — wild-type mixed with mutant molecules. Heteroplasmy level (percentage of mutant copies) is quantified by next-generation sequencing or droplet digital PCR. A threshold effect governs disease expression: mitochondrial disorders typically require mutant load above 60–90 % before oxidative phosphorylation fails enough to produce symptoms. Sub-clinical levels carry measurable consequences: a 2018 Scientific Reports analysis of 789 Health ABC participants found m.3243A>G heteroplasmy at 0–19 % associated with reduced grip strength, cognition, and arterial stiffness, with a 96 % higher dementia-mortality hazard in the highest versus lowest tertile (HR = 1.96; Tranah et al.). Somatic mutations accumulate with age; a 2026 Nature analysis of ~750,000 whole-genome sequences (Gupta et al.) found a sharp post-60 rise in heteroplasmy with a spectrum consistent with replication errors rather than oxidative damage — revising a long-held assumption. Accumulation is tissue-specific: post-mitotic tissues retain clonal expansions that blood cells dilute more readily (Sanchez-Contreras et al. 2023, eLife). A germline bottleneck during oogenesis (~30–35 segregating units) causes mutant fractions to shift between generations; maternal age independently raises heteroplasmies in offspring (Rebolledo-Jaramillo et al. 2014). Whether somatic accumulation is causal in aging or primarily a biomarker of replication stress remains under investigation.
- MTHFR C677T variant
The C677T polymorphism (rs1801133) in methylenetetrahydrofolate reductase (MTHFR) encodes a thermolabile enzyme with reduced activity (~70% reduced activity / ~30% residual in TT homozygotes and ~35% reduced / ~65% residual in CT heterozygotes, particularly under low-folate conditions) and causes modest elevation of plasma homocysteine. Elevated homocysteine has been epidemiologically associated with cardiovascular disease and neural tube defects, and the variant is correspondingly associated with those outcomes, though the causal role of homocysteine itself remains debated. The clinical relevance of MTHFR C677T genotyping is contested: major laboratory and genetics societies advise against routine population testing, noting that the association is modest, diet-modifiable, and that homocysteine-lowering B-vitamin supplementation has not consistently reduced cardiovascular events in trials. Despite its limited clinical actionability, it remains one of the most over-ordered genetic tests in functional medicine contexts.
- PCSK9 (gene and therapeutic target)
PCSK9 (proprotein convertase subtilisin/kexin type 9) is a serine protease secreted by hepatocytes that binds to the LDL receptor on the cell surface and directs it to lysosomal degradation rather than recycling, thereby reducing LDL uptake and raising circulating LDL-cholesterol. Rare gain-of-function PCSK9 mutations cause familial hypercholesterolaemia, while loss-of-function variants — particularly prevalent in African-American cohorts (e.g., Y142X, C679X) — produce lifelong very low LDL levels and markedly reduced coronary heart disease risk without adverse phenotypes, validating the target. Monoclonal antibodies against PCSK9 (alirocumab, evolocumab) reduce LDL by 50–60% on top of statin therapy and lower cardiovascular event rates in high-risk patients; inclisiran, a small-interfering RNA targeting PCSK9 mRNA in hepatocytes, achieves similar LDL lowering with twice-yearly dosing. Oral PCSK9 inhibitors such as MK-0616 (enlicitide; Merck) reached late-stage trials with Phase 3 LDL-C lowering readouts during 2024–2025; cardiovascular-outcomes trials (CORALreef Outcomes) remain pending.
- Pharmacogenomics
Pharmacogenomics studies how genetic variation — primarily in drug-metabolizing enzymes, transporters, and drug targets — influences individual drug response in terms of efficacy and toxicity. CYP2C9 and VKORC1 variants are the canonical example for warfarin dosing: poor metabolizers at CYP2C9 combined with VKORC1 low-expression haplotypes require markedly lower doses to achieve therapeutic anticoagulation, and dosing algorithms incorporating genotype reduce bleeding events. For statins, a non-synonymous variant in SLCO1B1 (rs4149056, Val174Ala) reduces hepatic uptake of simvastatin and atorvastatin, raising plasma drug levels and increasing myopathy risk several-fold in CC homozygotes. Pharmacogenomics is particularly relevant to older adults with polypharmacy because drug-drug-gene interactions compound with age-related changes in kidney and liver function; clinical implementation through pre-emptive panel genotyping is expanding, with CPIC guidelines providing evidence-based dose recommendations for over 40 drug-gene pairs.
- Polygenic risk score (PRS)
A polygenic risk score (PRS) is a weighted sum of an individual's risk-allele dosages across many SNPs associated with a trait, with weights typically derived from GWAS summary statistics using methods such as LD-pruning and thresholding (P+T) or Bayesian shrinkage (LDpred, PRSice). For common complex diseases — including coronary artery disease, type 2 diabetes, and breast cancer — high PRS identifies individuals with lifetime risk comparable to monogenic mutation carriers, suggesting clinical utility for stratified prevention. However, PRS performance degrades substantially when applied across ancestry groups because GWAS discovery cohorts have been disproportionately European, and LD patterns and allele frequencies differ between populations, limiting equitable deployment. Additional challenges include calibration drift over time, limited ability to capture rare variants and gene-environment interactions, and conceptual questions about whether PRS for longevity traits represents a useful clinical endpoint given their composite, late-acting nature.
- Single-nucleotide polymorphism (SNP)
A single-nucleotide polymorphism (SNP, pronounced "snip") is a germline variation at a single base-pair position where two or more nucleotide alleles occur at a population frequency above 1%. The human genome contains roughly 4–5 million common SNPs — on average one per 500–1,000 base pairs — genotyped using microarray chips querying hundreds of thousands of positions. SNPs are the primary markers in genome-wide association studies (GWAS), testing hundreds of thousands of variants against a trait or disease across tens or hundreds of thousands of participants — a statistical scan rather than mechanistic proof of causation. Because nearby SNPs are inherited together in blocks of high linkage disequilibrium (LD), a single tag SNP can proxy an entire haplotype, reducing genotyping cost without sacrificing discovery power, as demonstrated by the Phase II HapMap (Frazer et al., 2007). In aging genetics, common SNPs with individually small effects aggregate into polygenic risk scores explaining a meaningful fraction of lifespan variance — Yashin et al. (2012) identified 27 SNPs with consistent additive effects on life span across independent cohorts, implicating cell-growth, apoptosis, and other biological pathways. SNPs differ from rare variants (minor allele frequency below 1%), which GWAS arrays mostly miss and are better captured by whole-genome or whole-exome sequencing. Whether rare variants account for the heritability unexplained by common SNPs ("missing heritability") remains an open question.
- SIRT1 / SIRT3 / SIRT6 isoforms
Sirtuins are NAD⁺-dependent deacylases and ADP-ribosyltransferases; the three most studied longevity-relevant isoforms differ sharply in subcellular compartment and substrate specificity. SIRT1 is predominantly nuclear and cytosolic, deacetylating transcriptional regulators including p53, NF-κB, PGC-1α, and FOXO proteins to coordinate metabolism, stress response, and genome maintenance. SIRT3 localizes to the mitochondrial matrix, where its best-characterized substrates include SOD2 (K68; activating antioxidant defence) and components of the electron transport chain, directly linking NAD⁺ status to mitochondrial redox homeostasis. SIRT6 is a nuclear chromatin-associated sirtuin that removes H3K9ac and H3K56ac marks at sites of DNA damage and telomeres, and promotes genomic stability; overexpression of SIRT6 extends lifespan in male mice, and it was later shown to modulate IGF signalling and inflammation.
- Somatic mutations and mosaicism
Somatic mutations are DNA changes arising post-zygotically in body cells rather than the germline, affecting only descendants of the cell where they occur. Every cell division carries a small replication-error probability, and exogenous mutagens (UV radiation, tobacco carcinogens, reactive oxygen species) further damage DNA throughout life; mutational burden thus rises roughly linearly with age — ~40 single-nucleotide substitutions per year in intestinal and hepatic stem cells (Blokzijl et al., 2016, Nature). When the resulting patchwork of genetically distinct lineages becomes detectable, the state is called somatic mosaicism. If a mutation confers even a modest replicative advantage, the clone can expand beyond random drift — clonal expansion. The best-characterised example is clonal hematopoiesis (CH): mutations in DNMT3A, TET2, and ASXL1 drive replacement of normal blood cells by one expanded clone. Jaiswal et al. (2014, NEJM) found CH in fewer than 1% of individuals under 40, rising to ~9–18% after age 70, associated with 11-fold elevated haematological-malignancy risk, 2-fold increased coronary heart disease risk, and 40% higher all-cause mortality. This link is associational in humans; mouse models support a causal inflammatory mechanism through which mutant leukocytes exacerbate atherosclerosis. Somatic mutations contribute to genomic instability, listed among the primary hallmarks of aging by López-Otín et al. (2023, Cell), though whether they directly drive organismal aging versus co-occurring with it remains under investigation.
- TERT / TERC variants
TERT (telomerase reverse transcriptase) and TERC (telomerase RNA component) together constitute the catalytic core of telomerase; TERT provides reverse-transcriptase activity while TERC is the RNA template used to extend telomeric TTAGGG repeats. Common single-nucleotide variants in both loci are among the strongest GWAS hits for leukocyte telomere length and modestly influence disease risk for cancer, cardiovascular disease, and pulmonary fibrosis in proportion to their effect on telomere length. Rare heterozygous loss-of-function mutations in TERT or TERC cause autosomal dominant telomere biology disorders (TBDs) — a spectrum including dyskeratosis congenita, familial idiopathic pulmonary fibrosis, aplastic anemia, and hepatic cirrhosis — via telomere-mediated replicative failure in high-turnover tissues and anticipation across generations. The contrast between the modest effects of common variants and the severe phenotypes of rare pathogenic mutations illustrates the quantitative sensitivity of telomere homeostasis.
- Whole-genome sequencing in aging research
Whole-genome sequencing (WGS) generates complete base-pair-resolution data across nuclear and mitochondrial DNA, enabling discovery of rare coding and non-coding variants, structural variants, and copy-number changes that are invisible to SNP arrays. In aging research, WGS has several distinct applications: it identifies rare longevity-associated variants in centenarian families and cohorts (e.g., protective mutations in PCSK9, APOC3, or DNA repair genes) that require deep sequencing to detect; it quantifies somatic mutation burden and clonal hematopoiesis of indeterminate potential (CHIP) in aging tissues, linking somatic evolution to cardiovascular and cancer risk; and it maps mitochondrial heteroplasmy dynamics that accumulate with age. Sequencing costs have fallen from ~$3,000/Gb in 2008 toward approximately $1–5/Gb by the mid-2020s depending on platform and throughput, making population-scale WGS studies feasible; however, interpreting variants of uncertain significance (VUS) and managing incidental findings remain major clinical and ethical challenges, especially as WGS enters preventive medicine for healthy aging populations.
- WRN (Werner syndrome gene)
WRN encodes a member of the RecQ DNA helicase family with both helicase and exonuclease activities, involved in multiple DNA repair pathways including base excision repair, non-homologous end joining, and replication fork restart at sites of stalled polymerases. Biallelic loss-of-function mutations cause Werner syndrome, a segmental progeroid syndrome in which features of aging — including cataracts, atherosclerosis, type 2 diabetes, osteoporosis, and malignancies — appear in the third and fourth decades; mean age at death historically was reported as approximately 54 years (Huang et al. 2006, with similar values from Goto's Japanese Werner registry analyses), and a 2022 retrospective study by Kato, Koshizaka and colleagues found mean age at death for patients dying between 2011 and 2020 had risen to approximately 59 years, likely reflecting improved management of malignancy and vascular disease. At the cellular level, WRN-deficient cells accumulate replication stress, telomere dysfunction, and genomic instability disproportionately rapidly. Werner syndrome is extensively studied as a model of accelerated aging, particularly to distinguish aging-driver from aging-bystander mechanisms, though the segmental nature of the phenotype means it does not recapitulate normal aging comprehensively.
