Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis

Takeya Kasukawa, Shintaro Katayama, Hideya Kawaji, Harukazu Suzuki, David A Hume, Yoshihide Hayashizaki

Research output: Contribution to journalArticlepeer-review

Abstract

The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at .
Original languageEnglish
Pages (from-to)913-21
Number of pages9
JournalGenomics
Volume84
Issue number6
DOIs
Publication statusPublished - Dec 2004

Keywords

  • Animals
  • Cluster Analysis
  • Computational Biology
  • DNA, Complementary
  • Gene Expression Profiling
  • Genomics
  • Humans
  • Mice
  • Proteome
  • Rats
  • Reference Standards

Fingerprint

Dive into the research topics of 'Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis'. Together they form a unique fingerprint.

Cite this