Extensive identification and analysis of conserved small ORFs in animals

Extensive identification and analysis of conserved small ORFs in animals

Sebastian D. Mackowiak;Henrik Zauber;Chris Bielow;Denise Thiel;Kamila Kutz;Lorenzo Calviello;Guido Mastrobuoni;Nikolaus Rajewsky;Stefan Kempa;Matthias Selbach;Benedikt Obermayer;Sebastian D. Mackowiak;Henrik Zauber;Chris Bielow;Denise Thiel;Kamila Kutz;Lorenzo Calviello;Guido Mastrobuoni;Nikolaus Rajewsky;Stefan Kempa;Matthias Selbach;Benedikt Obermayer;
Genome biology 2015 Vol. 16 pp. 1-21
313
mackowiak2015genomeextensive

Abstract

There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptides), but genome-wide experimental or computational identification of functional sORFs remains challenging. Here, we expand our previously developed method and present results of an integrated computational pipeline for the identification of conserved sORFs in human, mouse, zebrafish, fruit fly, and the nematode C. elegans. Isolating specific conservation signatures indicative of purifying selection on amino acid (rather than nucleotide) sequence, we identify about 2,000 novel small ORFs located in the untranslated regions of canonical mRNAs or on transcripts annotated as non-coding. Predicted sORFs show stronger conservation signatures than those identified in previous studies and are sometimes conserved over large evolutionary distances. The encoded peptides have little homology to known proteins and are enriched in disordered regions and short linear interaction motifs. Published ribosome profiling data indicate translation of more than 100 novel sORFs, and mass spectrometry data provide evidence for more than 70 novel candidates. Taken together, we identify hundreds of previously unknown conserved sORFs in major model organisms. Our computational analyses and integration with experimental data show that these sORFs are expressed, often translated, and sometimes widely conserved, in some cases even between vertebrates and invertebrates. We thus provide an integrated resource of putatively functional micropeptides for functional validation in vivo.

Citation

ID: 113032
Ref Key: mackowiak2015genomeextensive
Use this key to autocite in SciMatic or Thesis Manager

References

Blockchain Verification

Account:
NFT Contract Address:
0x95644003c57E6F55A65596E3D9Eac6813e3566dA
Article ID:
113032
Unique Identifier:
doi:10.1186/s13059-015-0742-x
Network:
Scimatic Chain (ID: 481)
Loading...
Blockchain Readiness Checklist
Authors
Abstract
Journal Name
Year
Title
5/5
Creates 1,000,000 NFT tokens for this article
Token Features:
  • ERC-1155 Standard NFT
  • 1 Million Supply per Article
  • Transferable via MetaMask
  • Permanent Blockchain Record
Blockchain QR Code
Scan with Saymatik Web3.0 Wallet

Saymatik Web3.0 Wallet