WGS data related to “Is Gauchian genotyping of GBA1 variants reliable’

Output Details

Gaucher disease (GD) is an inherited lysosomal storage disorder caused by biallelic pathogenic variants in [GBA1](https://www.ncbi.nlm.nih.gov/gene/2629), which encodes glucocerebrosidase. Additionally, GBA1 variants are a well-established genetic risk factor for Parkinson's disease (PD), leading some PD research centers to incorporate GBA1 screening to identify patients who may benefit from targeted therapeutics. However, detecting GBA1 variants remains challenging due to the gene's high sequence homology with its pseudogene, which can give rise to complex recombinant alleles. This study evaluates the performance of Gauchian, a recently developed software tool for identifying GBA1 variants from whole genome sequencing (WGS) data. The assessment was conducted in a cohort of 90 individuals with GD and five GBA1 heterozygotes, all previously characterized through Sanger sequencing. While Gauchian successfully identified the most common genotypes, it demonstrated limitations in detecting rare or de novo variants due to its restricted internal database and heavy reliance on intergenic structural variants. These limitations led to misclassified homozygosity, incomplete genotyping, and undetected recombination events, ultimately reducing the tool's utility for comprehensive variant screening and precluding its application in diagnostic settings. The study dataset includes WGS from the 90 individuals used in the Gauchian pipeline. This data will be available through dbGaP to facilitate further research on GBA1 variant detection, genotype-phenotype correlations in GD and PD, and the development of improved bioinformatics tools for complex variant identification.
Tags
  • Antemortem