Predicting protein cellular localization using a domain projection method

Richard Mott, Jörg Schultz, Peer Bork, Chris P Ponting

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

We investigate the co-occurrence of domain families in eukaryotic proteins to predict protein cellular localization. Approximately half (300) of SMART domains form a "small-world network", linked by no more than seven degrees of separation. Projection of the domains onto two-dimensional space reveals three clusters that correspond to cellular compartments containing secreted, cytoplasmic, and nuclear proteins. The projection method takes into account the existence of "bridging" domains, that is, instances where two domains might not occur with each other but frequently co-occur with a third domain; in such circumstances the domains are neighbors in the projection. While the majority of domains are specific to a compartment ("locale"), and hence may be used to localize any protein that contains such a domain, a small subset of domains either are present in multiple locales or occur in transmembrane proteins. Comparison with previously annotated proteins shows that SMART domain data used with this approach can predict, with 92% accuracy, the localizations of 23% of eukaryotic proteins. The coverage and accuracy will increase with improvements in domain database coverage. This method is complementary to approaches that use amino-acid composition or identify sorting sequences; these methods may be combined to further enhance prediction accuracy.

Original languageEnglish
Pages (from-to)1168-74
Number of pages7
JournalGenome Research
Issue number8
Publication statusPublished - Aug 2002

Keywords / Materials (for Non-textual outputs)

  • Animals
  • Benchmarking
  • Cell Compartmentation
  • Cluster Analysis
  • Computational Biology
  • Cytoplasm
  • Databases, Protein
  • Humans
  • Internet
  • Membrane Proteins
  • Nuclear Proteins
  • Probability
  • Protein Sorting Signals
  • Protein Structure, Tertiary
  • Proteins
  • Software


Dive into the research topics of 'Predicting protein cellular localization using a domain projection method'. Together they form a unique fingerprint.

Cite this