TACT: Transcriptome Auto-annotation Conducting Tool of H-InvDB

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W345-9. doi: 10.1093/nar/gkl283.

Abstract

Transcriptome Auto-annotation Conducting Tool (TACT) is a newly developed web-based automated tool for functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. We developed the TACT system by integrating two kinds of similarity searches, FASTY and BLASTX, against the protein sequence databases UniProtKB (Swiss-Prot/TrEMBL) and RefSeq, and a unified motif prediction program, InterProScan, into the ORF-prediction pipeline originally designed for the 'H-Invitational' human transcriptome annotation project. The system applies these constituent programs successively to an mRNA sequence in order to predict the most plausible ORF and the function of the encoded protein. In this study, we applied the TACT system to 19 574 non-redundant human transcripts registered in H-InvDB and evaluated its predictive power by the degree of agreement with the human-curated functional annotation in H-InvDB. The TACT system assigned functional descriptions to 12 559 transcripts (64.2%), the remainder being annotated as hypothetical proteins. Furthermore, the overall agreement of functional annotation with H-InvDB, including those transcripts annotated as hypothetical proteins, was 83.9% (16 432/19 574). These results show that the TACT system is useful for functional annotation and that its predictions of ORFs and protein functions are highly accurate and close to the results of human curation. TACT is freely available at http://www.jbirc.aist.go.jp/tact/.
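
The two percentages reported above follow directly from the transcript counts given in the abstract. The short sketch below only re-derives them as a consistency check; the helper name `percent` and the variable names are illustrative assumptions and are not part of TACT or H-InvDB.

```python
# Re-derive the proportions quoted in the abstract from its own counts.
# All names here are illustrative; only the three counts come from the text.

def percent(part: int, total: int) -> float:
    """Return part/total as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * part / total

TOTAL_TRANSCRIPTS = 19_574  # non-redundant human transcripts in H-InvDB
WITH_FUNCTION = 12_559      # transcripts given a functional description
AGREEING = 16_432           # annotations agreeing with human curation

coverage = percent(WITH_FUNCTION, TOTAL_TRANSCRIPTS)
agreement = percent(AGREEING, TOTAL_TRANSCRIPTS)
# The abstract calls the remainder hypothetical proteins.
hypothetical = TOTAL_TRANSCRIPTS - WITH_FUNCTION

print(f"functional description assigned: {coverage:.1f}%")
print(f"agreement with H-InvDB:          {agreement:.1f}%")
print(f"hypothetical proteins:           {hypothetical}")
```

Note that the agreement figure uses all 19 574 transcripts as its denominator, so it counts cases where both TACT and the curators labelled a transcript a hypothetical protein.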

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Computational Biology / methods
  • DNA, Complementary / chemistry
  • Databases, Protein
  • Expressed Sequence Tags / chemistry
  • Humans
  • Internet
  • Open Reading Frames
  • Proteins / genetics
  • Proteins / physiology
  • RNA, Messenger / chemistry*
  • Sequence Analysis / methods*
  • Sequence Analysis, DNA
  • Sequence Analysis, RNA
  • Software*
  • Systems Integration
  • User-Computer Interface

Substances

  • DNA, Complementary
  • Proteins
  • RNA, Messenger