DRL/BlobTools_manuscript

  • Dominik Laetsch (Creator)

Dataset

Description

Contents of folders:
figures/: figures -tables/: tables
supplementary_tables/: supplementary tables
scripts/: additional scripts used for analysis
supplementary_material/: Output files of the analyses

Abstract

The goal of many genome sequencing projects is to provide a complete representation of a target genome (or genomes) as underpinning data for further analyses. However, it can be problematic to identify which sequences in an assembly truly derive from the target genome(s) and which are derived from associated microbiome or contaminant organisms.
We present BlobTools, a modular command-line solution for visualisation, quality control and taxonomic partitioning of genome datasets. Using guanine+cytosine content of sequences, read coverage in sequencing libraries and taxonomy of sequence similarity matches, BlobTools can assist in primary partitioning of data, leading to improved assemblies, and screening of final assemblies for potential contaminants.
Through simulated paired-end read dataset,s containing a mixture of metazoan and bacterial taxa, we illustrate the main BlobTools workflow and suggest useful parameters for taxonomic partitioning of low-complexity metagenome assemblies.

Data Citation

Dominik R Laetsch, Georgios Koutsovoulos, Tim Booth, Jason Stajich, & Sujai Kumar. (2017, July 22). DRL/blobtools: BlobTools v1.0 (Version v1.0.0). Zenodo. http://doi.org/10.5281/zenodo.833879
Date made available24 Jul 2017
PublisherGitHub

Cite this