Systematic identification of novel protein domain families associated with nuclear functions

Tobias Doerks, Richard R Copley, Jörg Schultz, Chris P Ponting, Peer Bork

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

A systematic computational analysis of protein sequences containing known nuclear domains led to the identification of 28 novel domain families. This represents a 26% increase in the starting set of 107 known nuclear domain families used for the analysis. Most of the novel domains are present in all major eukaryotic lineages, but 3 are species specific. For about 500 of the 1200 proteins that contain these new domains, nuclear localization could be inferred, and for 700, additional features could be predicted. For example, we identified a new domain, likely to have a role downstream of the unfolded protein response; a nematode-specific signalling domain; and a widespread domain, likely to be a noncatalytic homolog of ubiquitin-conjugating enzymes.

Original languageEnglish
Pages (from-to)47-56
Number of pages10
JournalGenome Research
Issue number1
Publication statusPublished - Jan 2002

Keywords / Materials (for Non-textual outputs)

  • Amidohydrolases
  • Amino Acid Motifs
  • Amino Acid Sequence
  • Animals
  • Caenorhabditis elegans
  • Caenorhabditis elegans Proteins
  • Cell Nucleus
  • Databases, Protein
  • Humans
  • Molecular Sequence Data
  • Multigene Family
  • Nuclear Proteins
  • Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase
  • Phylogeny
  • Protein Structure, Tertiary
  • Species Specificity


Dive into the research topics of 'Systematic identification of novel protein domain families associated with nuclear functions'. Together they form a unique fingerprint.

Cite this