triannot: a versatile and high performance pipeline for the automated annotation of plant genomes

triannot: a versatile and high performance pipeline for the automated annotation of plant genomes

;Philippe eLeroy;Nicolas eGuilhot;Hiroaki eSakai;Aurélien eBernard;Frédéric eChoulet;Sébastien eTheil;Sébastien eReboux;Naoki eAmano;Timothée eFlutre;Céline ePelegrin;Hajime eOhyanagi;Michael eSeidel;Franck eGiacomoni;Matthieu eReichstadt;Michael eAlaux;Emmanuelle eGicquello;Fabrice eLegeai;Lorenzo eCerutti;Hisataka eNuma;Tsuyoshi eTanaka;Klaus eMayer;Takeshi eItoh;Hadi eQuesneville;Catherine eFeuillet
phytochemistry letters 2012 Vol. 3 pp. -
218
eleroy2012frontierstriannot:

Abstract

In support of the international effort to obtain a reference sequence of the bread wheat genome and to provide plant communities dealing with large and complex genomes with a versatile, easy-to-use online automated tool for annotation, we have developed the TriAnnot pipeline. Its modular architecture allows for the annotation and masking of transposable elements, the structural and functional annotation of protein-coding genes with an evidence-based quality indexing, and the identification of conserved non-coding sequences and molecular markers. The TriAnnot pipeline is parallelized on a 712 CPU computing cluster that can run a 1 Gb sequence annotation in less than five days. It is accessible through a web interface for small scale analyses or through a server for large scale annotations. The performance of TriAnnot was evaluated in terms of sensitivity, specificity, and general fitness using curated reference sequence sets from rice and wheat. In less than 8 hours, TriAnnot was able to predict more than 83% of the 3,748 CDS from rice chromosome 1 with a fitness of 67.4%. On a set of 12 reference Mb-sized contigs from wheat chromosome 3B, TriAnnot predicted and annotated 93.3% of the genes among which 54% were perfectly identified in accordance with the reference annotation. It also allowed the curation of 12 genes based on new biological evidences, increasing the percentage of perfect gene prediction to 63%. TriAnnot systematically showed a higher fitness than other annotation pipelines that are not improved for wheat. As it is easily adaptable to the annotation of other plant genomes, TriAnnot should become a useful resource for the annotation of large and complex genomes in the future.

Citation

ID: 147328
Ref Key: eleroy2012frontierstriannot:
Use this key to autocite in SciMatic or Thesis Manager

References

Blockchain Verification

Account:
NFT Contract Address:
0x95644003c57E6F55A65596E3D9Eac6813e3566dA
Article ID:
147328
Unique Identifier:
10.3389/fpls.2012.00005
Network:
Scimatic Chain (ID: 481)
Loading...
Blockchain Readiness Checklist
Authors
Abstract
Journal Name
Year
Title
5/5
Creates 1,000,000 NFT tokens for this article
Token Features:
  • ERC-1155 Standard NFT
  • 1 Million Supply per Article
  • Transferable via MetaMask
  • Permanent Blockchain Record
Blockchain QR Code
Scan with Saymatik Web3.0 Wallet

Saymatik Web3.0 Wallet