Edinburgh Research Explorer

A review of bioinformatics tools for bio-prospecting from metagenomic sequence data

Research output: Contribution to journalLiterature review

  • Despoina Roumpeka
  • R. John Wallace
  • Frank Escalettes
  • Ian Fotheringham
  • Michael Watson

Related Edinburgh Organisations

Open Access permissions



  • Download as Adobe PDF

    Rights statement: Copyright © 2017 Roumpeka, Wallace, Escalettes, Fotheringham and Watson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    Final published version, 952 KB, PDF document

    Licence: Creative Commons: Attribution (CC-BY)

Original languageEnglish
JournalFrontiers in genetics
Issue number23
Early online date16 Feb 2017
Publication statusE-pub ahead of print - 16 Feb 2017


The microbiome can be defined as the community of microorganisms that live in a particular environment. Metagenomics is the practice of sequencing DNA from the genomes of all organisms present in a particular sample, and has become a common method for the study of microbiome population structure and function. Increasingly, researchers are finding novel genes encoded within metagenomes, many of which may be of interest to the biotechnology and pharmaceutical industries. However, such "bioprospecting" requires a suite of sophisticated bioinformatics tools to make sense of the data. This review summarizes the most commonly used bioinformatics tools for the assembly and annotation of metagenomic sequence data with the aim of discovering novel genes.

    Research areas

  • metagenomics, bioinformatics, next generation sequencing, assembly, gene prediction, bioprospecting

Download statistics

No data available

ID: 31234325