Mining housekeeping genes with a Naive Bayes classifier

Luna De Ferrari, Stuart Aitken

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

BACKGROUND: Traditionally, housekeeping and tissue specific genes have been classified using direct assay of mRNA presence across different tissues, but these experiments are costly and the results not easy to compare and reproduce.

RESULTS: In this work, a Naive Bayes classifier based only on physical and functional characteristics of genes already available in databases, like exon length and measures of chromatin compactness, has achieved a 97% success rate in classification of human housekeeping genes (93% for mouse and 90% for fruit fly).

CONCLUSION: The newly obtained lists of housekeeping and tissue specific genes adhere to the expected functions and tissue expression patterns for the two classes. Overall, the classifier shows promise, and in the future additional attributes might be included to improve its discriminating power.

Original languageEnglish
Pages (from-to)277
Number of pages14
JournalBMC Genomics
Volume7
Issue number277
DOIs
Publication statusPublished - 2006

Keywords / Materials (for Non-textual outputs)

  • Animals
  • Base Sequence
  • Bayes Theorem
  • Chromatin
  • Computational Biology
  • DNA, Complementary
  • Databases, Genetic
  • Drosophila
  • Exons
  • Humans
  • Mice
  • Organ Specificity
  • RNA, Messenger
  • ROC Curve

Fingerprint

Dive into the research topics of 'Mining housekeeping genes with a Naive Bayes classifier'. Together they form a unique fingerprint.

Cite this