Bioinformatics-Based Identification of Expanded Repeats: A Non-reference Intronic Pentamer Expansion in RFC1 Causes CANVAS.

Bioinformatics-Based Identification of Expanded Repeats: A Non-reference Intronic Pentamer Expansion in RFC1 Causes CANVAS.

Rafehi, Haloom;Szmulewicz, David J;Bennett, Mark F;Sobreira, Nara L M;Pope, Kate;Smith, Katherine R;Gillies, Greta;Diakumis, Peter;Dolzhenko, Egor;Eberle, Michael A;Barcina, María García;Breen, David P;Chancellor, Andrew M;Cremer, Phillip D;Delatycki, Martin B;Fogel, Brent L;Hackett, Anna;Halmagyi, G Michael;Kapetanovic, Solange;Lang, Anthony;Mossman, Stuart;Mu, Weiyi;Patrikios, Peter;Perlman, Susan L;Rosemergy, Ian;Storey, Elsdon;Watson, Shaun R D;Wilson, Michael A;Zee, David S;Valle, David;Amor, David J;Bahlo, Melanie;Lockhart, Paul J;
American journal of human genetics 2019 Vol. 105 pp. 151-165
255
rafehi2019bioinformaticsbasedamerican

Abstract

Genomic technologies such as next-generation sequencing (NGS) are revolutionizing molecular diagnostics and clinical medicine. However, these approaches have proven inefficient at identifying pathogenic repeat expansions. Here, we apply a collection of bioinformatics tools that can be utilized to identify either known or novel expanded repeat sequences in NGS data. We performed genetic studies of a cohort of 35 individuals from 22 families with a clinical diagnosis of cerebellar ataxia with neuropathy and bilateral vestibular areflexia syndrome (CANVAS). Analysis of whole-genome sequence (WGS) data with five independent algorithms identified a recessively inherited intronic repeat expansion [(AAGGG)] in the gene encoding Replication Factor C1 (RFC1). This motif, not reported in the reference sequence, localized to an Alu element and replaced the reference (AAAAG) short tandem repeat. Genetic analyses confirmed the pathogenic expansion in 18 of 22 CANVAS-affected families and identified a core ancestral haplotype, estimated to have arisen in Europe more than twenty-five thousand years ago. WGS of the four RFC1-negative CANVAS-affected families identified plausible variants in three, with genomic re-diagnosis of SCA3, spastic ataxia of the Charlevoix-Saguenay type, and SCA45. This study identified the genetic basis of CANVAS and demonstrated that these improved bioinformatics tools increase the diagnostic utility of WGS to determine the genetic basis of a heterogeneous group of clinically overlapping neurogenetic disorders.

Citation

ID: 79027
Ref Key: rafehi2019bioinformaticsbasedamerican
Use this key to autocite in SciMatic or Thesis Manager

References

Blockchain Verification

Account:
NFT Contract Address:
0x95644003c57E6F55A65596E3D9Eac6813e3566dA
Article ID:
79027
Unique Identifier:
S0002-9297(19)30203-4
Network:
Scimatic Chain (ID: 481)
Loading...
Blockchain Readiness Checklist
Authors
Abstract
Journal Name
Year
Title
5/5
Creates 1,000,000 NFT tokens for this article
Token Features:
  • ERC-1155 Standard NFT
  • 1 Million Supply per Article
  • Transferable via MetaMask
  • Permanent Blockchain Record
Blockchain QR Code
Scan with Saymatik Web3.0 Wallet

Saymatik Web3.0 Wallet