Listen: Building a Global Genetic Map of Parkinson’s with Andy Singleton and Ignacio Mata
On genomics podcast Mendelspod, Andy Singleton, co-lead of the Global Parkinson's Genetics Program (GP2), and Nacho Mata, Cleveland Clinic geneticist and LARGE-PD founder, discuss how GP2 is accelerating discovery and expanding global representation in Parkinson’s research.
CMUL Commissions Parkinson’s Disease Research Lab
Reports that the College of Medicine, University of Lagos (CMUL) commissioned the GP2–CMUL Molecular Laboratory to strengthen Parkinson’s disease research, genomics capacity, and training opportunities across Africa in collaboration with Aligning Science Across Parkinson’s (ASAP) and the Global Parkinson’s Genetics Program (GP2).
Advancing Parkinson’s Disease Research in Africa: A Strategic Training Framework of the Global Parkinson’s Genetics Program
By onAfrica is the second most populous continent and is expected to host 26% of the global population by 2050.5 Despite exhibiting the highest genetic variation and complex admixture, African populations are significantly underrepresented in PD research, with only a fraction of their extensive genetic diversity being surveyed, primarily focusing on Mendelian genes associated with monogenic PD. Genetic studies have characterized a limited number of Africa's 2000 ethnolinguistic groups, mainly using genotyping arrays with variants common in Europeans, leaving the distribution of novel, rare, and medically relevant variations largely unknown. For instance, although the LRRK2 p.G2019S variant is present in 1% to 2% of Europeans PD patients, 29.7% of familial Ashkenazi Jewish PD patients, and 40% of North African Arabs, it has not been identified in Black Africans to date. Given Africa's ethnic and genetic diversity, including these populations is crucial for understanding novel genetic determinants underlying PD risk, onset, and progression. Research capacity and research infrastructure in Africa remain limited, with PD genetic research facing challenges, including political and economic instability, a predominant focus on infectious diseases, limited medical personnel, and insufficient funds and infrastructure.
Defining the causes of sporadic Parkinson’s disease in the global Parkinson’s genetics program (GP2)
By onThe Global Parkinson’s Genetics Program (GP2) will genotype over 150,000 participants from around the world, and integrate genetic and clinical data for use in large-scale analyses to dramatically expand our understanding of the genetic architecture of PD. This report details the workflow for cohort integration into the complex arm of GP2, and together with our outline of the monogenic hub in a companion paper, provides a generalizable blueprint for establishing large scale collaborative research consortia.
Identification of genetic risk loci and causal insights associated with Parkinson’s disease in African and African admixed populations: a genome-wide association study
By onOur study identified a novel genetic risk factor in GBA1 in people of African ancestry, which has not been seen in European populations, and it could be a major mechanistic basis of Parkinson's disease in African populations. This population-specific variant exerts substantial risk on Parkinson's disease as compared with common variation identified through GWAS and it was found to be present in 39% of the cases assessed in this study. This finding highlights the importance of understanding ancestry-specific genetic risk in complex diseases, a particularly crucial point as the Parkinson's disease field moves towards targeted treatments in clinical trials. The distinctive genetics of African populations highlights the need for equitable inclusion of ancestrally diverse groups in future trials, which will be a valuable step towards gaining insights into novel genetic determinants underlying the causes of Parkinson's disease. This finding opens new avenues towards RNA-based and other therapeutic strategies aimed at reducing lifetime risk of Parkinson's disease.
Insights into ancestral diversity in Parkinson’s disease risk: a comparative assessment of polygenic risk scores
By onRisk prediction models play a crucial role in advancing healthcare by enabling early detection and supporting personalized medicine. Nonetheless, polygenic risk scores (PRS) for Parkinson’s disease (PD) have not been extensively studied across diverse populations, contributing to health disparities. In this study, we constructed 105 PRS using individual-level data from seven ancestries and compared two different models. Model 1 was based on the cumulative effect of 90 known European PD risk variants, weighted by summary statistics from four independent ancestries (European, East Asian, Latino/Admixed American, and African/Admixed). Model 2 leveraged multi-ancestry summary statistics using a p-value thresholding approach to improve prediction across diverse populations. Our findings provide a comprehensive assessment of PRS performance across ancestries and highlight the limitations of a “one-size-fits-all” approach to genetic risk prediction. We observed variability in predictive performance between models, underscoring the need for larger sample sizes and ancestry-specific approaches to enhance accuracy. These results establish a foundation for future research aimed at improving generalizability in genetic risk prediction for PD.
Global Perspectives on Returning Genetic Research Results in Parkinson Disease
By onAiming to develop recommendations on return of research results (RoR) practice within the Global Parkinson's Genetics Program (GP2), we conducted a global survey to gain insight on GP2 members' perceptions, practice, readiness, and needs surrounding RoR. GP2 members (n = 191), representing 147 institutions and 60 countries across 6 continents, completed the survey. Access to clinical genetic testing services was significantly higher in high-income countries compared with low- and middle-income countries (96.6% vs 58.4%), where funding was predominantly covered by patients themselves. While 92.7% of the respondents agreed that genetic research results should be returned, levels of agreement were higher for clinically relevant results relating to pathogenic or likely pathogenic variants in genes known to cause PD or other neurodegenerative diseases. Less than 10% offered separate clinically accredited genetic testing before returning genetic research results. A total of 48.7% reported having a specific statement on RoR policy in their ethics consent form, while 53.9% collected data on participants' preferences on RoR prospectively. 24.1% had formal genetic counselling training. Notably, the comfort level in returning incidental genetic findings or returning results to unaffected individuals remains low.
Team Science Approaches to Unravel Monogenic Parkinson’s Disease on a Global Scale
By onIn this article, we describe combining both efforts in a merger project resulting in a global monogenic PD cohort with the buildup of a sustainable infrastructure to identify the multi-ancestry spectrum of monogenic PD and enable studies of factors modifying penetrance and expressivity of monogenic PD.
Investigating the Protective Role of the Mitochondrial 2158 T > C Variant in Parkinson’s Disease
By onHudson et al. proposed a protective role of two mitochondrial DNA variants in PD etiology. In an array-based genotyping study, the authors showed that the m.2158 T > C (p.Lys4Arg, rs41349444) variant in SHLP2 is associated with reduced risk for PD (P-value = 2 × 10−2, OR = 0.32). A follow-up functional study by Kim et al. demonstrated that the mutated protein was protective against mitochondrial dysfunction in both in vitro and in vivo models of PD. Nevertheless, the association of this variant with reduced risk of PD has not been confirmed in large-scale sequencing datasets. To further investigate the association between m.2158 T > C and PD, we conducted an extensive genetic characterization utilizing large-scale genome sequencing (GS) datasets, totaling 4358 PD cases and 16,609 controls.
Parkinson’s families project: a UK-wide study of early onset and familial Parkinson’s disease
By onThe Parkinson’s Families Project is a UK-wide study aimed at identifying genetic variation associated with familial and early-onset Parkinson’s disease (PD). We recruited individuals with a clinical diagnosis of PD and age at motor symptom onset ≤45 years and/or a family history of PD in up to third-degree relatives. Where possible, we also recruited affected and unaffected relatives. We analysed DNA samples with a combination of single nucleotide polymorphism (SNP) array genotyping, multiplex ligation-dependent probe amplification (MLPA), and whole-genome sequencing (WGS). We investigated the association between identified pathogenic mutations and demographic and clinical factors such as age at motor symptom onset, family history, motor symptoms (MDS-UPDRS) and cognitive performance (MoCA). We performed baseline genetic analysis in 718 families, of which 205 had sporadic early-onset PD (sEOPD), 113 had familial early-onset PD (fEOPD), and 400 had late-onset familial PD (fLOPD). 69 (9.6%) of these families carried pathogenic variants in known monogenic PD-related genes. The rate of a molecular diagnosis increased to 28.1% in PD with motor onset ≤35 years. We identified pathogenic variants in LRRK2 in 4.2% of families, and biallelic pathogenic variants in PRKN in 3.6% of families. We also identified two families with SNCA duplications and three families with a pathogenic repeat expansion in ATXN2, as well as single families with pathogenic variants in VCP, PINK1, PNPLA6, PLA2G6, SPG7, GCH1, and RAB32. An additional 73 (10.2%) families were carriers of at least one pathogenic or risk GBA1 variant. Most early-onset and familial PD cases do not have a known genetic cause, indicating that there are likely to be further monogenic causes for PD.
The LRRK2 p.L1795F variant causes Parkinson’s disease in the European population
By onLRRK2-PD represents the most common form of autosomal dominant Parkinson’s disease. We identified the LRRK2 p.L1795F variant in three families and six additional unrelated cases using genetic data from over 50,000 individuals. Carriers with available genotyping data shared a common haplotype. The clinical presentation resembles other LRRK2-PD forms. Combined with published functional evidence showing strongly enhanced LRRK2 kinase activity, we provide evidence that LRRK2 p.L1795F is pathogenic.
African ancestry neurodegeneration risk variant disrupts an intronic branchpoint in GBA1
By onRecently, an African ancestry-specific Parkinson disease (PD) risk signal was identified at the gene encoding glucocerebrosidase (GBA1). This variant (rs3115534-G) is carried by ~50% of West African PD cases and imparts a dose-dependent increase in risk for disease. The risk variant has varied frequencies across African ancestry groups but is almost absent in European and Asian ancestry populations. GBA1 is a gene of high clinical and therapeutic interest. Damaging biallelic protein-coding variants cause Gaucher disease and monoallelic variants confer risk for PD and dementia with Lewy bodies, likely by reducing the function of glucocerebrosidase. Interestingly, the African ancestry-specific GBA1 risk variant is a noncoding variant, suggesting a different mechanism of action. Using full-length RNA transcript sequencing, we identified partial intron 8 expression in risk variant carriers (G) but not in nonvariant carriers (T). Antibodies targeting the N terminus of glucocerebrosidase showed that this intron-retained isoform is likely not protein coding and subsequent proteomics did not identify a shorter protein isoform, suggesting that the disease mechanism is RNA based. Clustered regularly interspaced short palindromic repeats editing of the reported index variant (rs3115534) revealed that this is the sequence alteration responsible for driving the production of these transcripts containing intron 8. Follow-up analysis of this variant showed that it is in a key intronic branchpoint sequence and, therefore, has important implications in splicing and disease. In addition, when measuring glucocerebrosidase activity, we identified a dose-dependent reduction in risk variant carriers. Overall, we report the functional effect of a GBA1 noncoding risk variant, which acts by interfering with the splicing of functional GBA1 transcripts, resulting in reduced protein levels and reduced glucocerebrosidase activity. This understanding reveals a potential therapeutic target in an underserved and underrepresented population.
Assessment of common genetic variation in Alzheimer’s and Parkinson’s diseases reveals global distinction in population attributable risk
By onEmerging evidence suggests that the genetic architecture of Alzheimer’s (AD) and Parkinson’s diseases (PD) risk varies across ancestries. This study seeks to explore distinct and universal genetic targets across individuals of Latino, African/African Admixed, East Asian, and European populations by implementing Population Attributable Risk (PAR) comparisons on summary statistics from genome-wide association studies (GWAS). PAR was calculated for the most significant disease variants using summary statistics derived from select multi-ancestry GWAS meta-analyses, followed by fine-mapping analysis to validate genetic contribution of disease variants to European, African/African Admixed, East Asian, and Latino individuals. For both AD, APOE4 PAR estimates were universally high across all ancestries, with TSPAN14 and PICALM emerging as other common targets. Attributable risk varied across PD-related major risk loci including variation nearby GBA1 and LRRK2. In contrast, SNCA, MCCC1, VPS13C, and MAPT loci demonstrated comparable attributable risk across ancestries. This cross-ancestry evaluation of PAR reinforces the genetic heterogeneity of AD and PD. In consideration of the complex etiology of these diseases, these findings may inform the strategic prioritization of therapeutic targets and improve global health outcomes.
Genome-wide assessment identifies novel runs of homozygosity linked to Parkinson’s disease etiology across diverse ancestral populations
By onWe conducted the first large-scale, multi-ancestral investigation of Parkinson’s disease (PD) to examine the impact of genome-wide homozygosity on disease risk and age at onset. Using genotyping, imputed, and whole-genome sequencing (WGS) data from 16,599 PD cases and 13,585 controls across nine ancestral populations from the Global Parkinson’s Genetics Program, we aimed to identify novel regions of homozygosity contributing to PD heritability. Our findings suggest that ROH regions contribute to PD heritability in a global context, with a portion attributed to recessive allelic architecture. We developed an open-science framework for unbiased homozygosity mapping. Future studies should use larger, diverse cohorts and WGS data to uncover rare recessive variants linked to PD susceptibility.
CNV-Finder: Streamlining Copy Number Variation Discovery
By onCopy Number Variations (CNVs) play pivotal roles in the etiology of complex diseases and are variable across diverse populations. Understanding the association between CNVs and disease susceptibility is significant in disease genetics research and often requires analysis of large sample sizes. One of the most cost-effective and scalable methods for detecting CNVs is based on normalized signal intensity values, such as Log R Ratio (LRR) and B Allele Frequency (BAF), from Illumina genotyping arrays. In this study, we present CNV-Finder, a novel pipeline integrating deep learning techniques on array data, specifically a Long Short-Term Memory (LSTM) network, to expedite the large-scale identification of CNVs within predefined genomic regions. This facilitates efficient prioritization of samples for time-consuming or costly subsequent analyses such as Multiplex Ligation-dependent Probe Amplification (MLPA), short-read, and long-read whole genome sequencing. We incorporate four genes to establish our methods—Parkin (PRKN), Leucine Rich Repeat And Ig Domain Containing 2 (LINGO2), Microtubule Associated Protein Tau (MAPT), and alpha-Synuclein (SNCA)—which may be relevant to neurological diseases such as Alzheimer’s disease (AD), Parkinson’s disease (PD), Progressive Supranuclear Palsy (PSP), or related disorders such as essential tremor (ET). By training our models on expert-annotated samples and validating them across diverse cohorts, including those from the Global Parkinson’s Genetics Program (GP2) and additional dementia-specific databases, we demonstrate the efficacy of CNV-Finder in accurately detecting deletions and duplications. Our pipeline outputs app-compatible files for visualization within CNV-Finder’s interactive web application. This interface enables researchers to review predictions and filter displayed samples by model prediction values, LRR range, and variant count in order to explore or confirm results. Our pipeline integrates this human feedback to enhance model performance and reduce false positive rates. Through a series of comprehensive analyses and validations using visual inspection, MLPA, short-read, and long-read sequencing data, we demonstrate the robustness and adaptability of CNV-Finder in identifying CNVs with regions of varied size, probe density, and noise. Our findings highlight the significance of contextual understanding and human expertise in enhancing the precision of CNV identification, particularly in complex genomic regions like 17q21.31. The CNV-Finder pipeline is a scalable, publicly available resource for the scientific community, available on GitHub (https://github.com/GP2code/CNV-Finder; DOI 10.5281/zenodo.14182563). CNV-Finder not only expedites accurate candidate identification but also significantly reduces the manual workload for researchers, enabling future targeted validation and downstream analyses in regions or phenotypes of interest.
Parkinson’s Disease Pathogenic Variants: Cross-Ancestry Analysis and Microarray Data Validation
By onThis study evaluated genotyping success of the NeuroBooster array (NBA) and determined the frequencies of pathogenic variants across ancestries. We analyzed the presence and allele frequency of 34 pathogenic variants in 28,710 PD cases, 9,614 other neurodegenerative disorder cases, and 15,821 controls across 11 ancestries within the Global Parkinson’s Genetics Program dataset. Of these, 25 were genotyped on NBA and cluster plots were used to assess their quality. Genes previously predicted to have high or very high confidence of causing PD tend to have more pathogenic variants and are present across ancestry groups. Twenty-five of the 34 pathogenic variants were typed by the NBA array and classified “good” (n=12), “medium” (n=4), and “bad” (n=9) variants. Our results confirm the likelihood that established PD genes are pathogenic and highlight the importance of ancestrally diverse research in PD. We also show the usefulness of the NBA as a reliable tool for genotyping of rare variants for PD.
Large-scale genetic characterization of Parkinson’s disease in the African and African admixed populations
By onElucidating the genetic contributions to Parkinson’s disease (PD) etiology across diverse ancestries is a critical priority for the development of targeted therapies in a global context. We conducted the largest sequencing characterization of potentially disease-causing, protein-altering and splicing mutations in 710 cases and 11,827 controls from genetically predicted African or African admixed ancestries. We explored copy number variants (CNVs) and runs of homozygosity (ROHs) in prioritized early onset and familial cases. Our study identified rare GBA1 coding variants to be the most frequent mutations among PD patients, with a frequency of 4% in our case cohort. Out of the 18 GBA1 variants identified, ten were previously classified as pathogenic or likely pathogenic, four were novel, and four were reported as of uncertain clinical significance. The most common known disease-associated GBA1 variants in the Ashkenazi Jewish and European populations, p.Asn409Ser, p.Leu483Pro, p.Thr408Met, and p.Glu365Lys, were not identified among the screened PD cases of African and African admixed ancestry. Similarly, the European and Asian LRRK2 disease-causing mutational spectrum, including LRRK2 p.Gly2019Ser and p.Gly2385Arg genetic risk factors, did not appear to play a major role in PD etiology among West African-ancestry populations. However, we found three heterozygous novel missense LRRK2 variants of uncertain significance overrepresented in cases, two of which — p.Glu268Ala and p.Arg1538Cys — had a higher prevalence in the African ancestry population reference datasets. Structural variant analyses revealed the presence of PRKN CNVs with a frequency of 0.7% in African and African admixed cases, with 66% of CNVs detected being compound heterozygous or homozygous in early-onset cases, providing further insights into the genetic underpinnings in early-onset juvenile PD in these populations. Novel genetic variation overrepresented in cases versus controls among screened genes warrants further replication and functional prioritization to unravel their pathogenic potential. Here, we created the most comprehensive genetic catalog of both known and novel coding and splicing variants potentially linked to PD etiology in an underserved population. Our study has the potential to guide the development of targeted therapies in the emerging era of precision medicine. By expanding genetics research to involve underrepresented populations, we hope that future PD treatments are not only effective but also inclusive, addressing the needs of diverse ancestral groups.
Does COMT Play a Role in Parkinson’s Disease Susceptibility Across Diverse Ancestral Populations?
By onThe catechol-O-methyltransferase (COMT) gene is involved in brain catecholamine metabolism, but its association with Parkinson’s disease (PD) risk remains unclear. To investigate the relationship between COMT genetic variants and PD risk across diverse ancestries. We analyzed COMT variants in 2,251 PD patients and 2,835 controls of European descent using whole-genome sequencing from the Accelerating Medicines Partnership-Parkinson Disease (AMP-PD), along with 20,427 PD patients and 11,837 controls from 10 ancestries using genotyping data from the Global Parkinson’s Genetics Program (GP2). Utilizing the largest case-control datasets to date, no significant enrichment of COMT risk alleles in PD patients was observed across any ancestry group after correcting for multiple testing. Among Europeans, no correlations with cognitive decline, motor function, motor complications, or time to LID onset were observed. These findings emphasize the need for larger, diverse cohorts to confirm the role of COMT in PD development and progression.
Novel Parkinson’s Disease Genetic Risk Factors Within and Across European Populations
By onWe conducted a meta-analysis of Parkinson’s disease genome-wide association study summary statistics, stratified by source (clinically-recruited case-control cohorts versus population biobanks) and by general European versus European isolate ancestries. This study included 63,555 cases, 17,700 proxy cases with a family history of Parkinson’s disease, and 1,746,386 controls, making it the largest investigation of Parkinson’s disease genetic risk to date. The final combined cross-European meta-analysis identified 134 risk loci (59 novel), with a total of 157 independent signals, significantly expanding our understanding of Parkinson’s disease risk. Multi-omic data integration revealed that expression of the nominated risk genes are highly enriched in brain tissues, particularly in neuronal and astrocyte cell types. Additionally, we prioritized 33 high-confidence genes across these 134 loci for future follow-up studies.
Validation of a Mitochondrial Polygenic Score for Parkinson’s Disease
By onMitochondrial dysfunction is a key player in Parkinson’s disease (PD) pathogenesis. Mitochondrial polygenic scores (MGS) may be associated with PD but require validation across diverse populations. To validate the association between the MGS, PD status and age-at-onset (AAO) in idiopathic and LRRK2-PD across various ancestries. We analyzed data from 17,129 PD patients and 13,872 healthy individuals across 10 ancestries within the Global Parkinson’s Disease Genetic Program. We used regression models to assess the association between MGS, PD status and AAO. The MGS was associated with iPD in Europeans (β=0.19, SE=0.02, p<2.0×10−16) and Ashkenazi Jews (β=0.26, p=3.7×10-4) but not in other populations. Additionally, the MGS was strongly associated with LRRK2-PD status (β=0.82, p=2.0×10−16). No associations with AAO were observed. The MGS is robustly associated with iPD status in Europeans and Ashkenazi Jews and with LRRK2-PD status. Population-specific MGS are needed to improve accuracy in other ancestries.