Open prediction of polysaccharide utilisation loci (PUL) in 5414 public Bacteroidetes genomes using PULpy

Dataset

Abstract

Polysaccharide utilisation loci (PUL) are regions within the genomes of Bacteroidetes that encode all the necessary machinery for the cleavage of particular carbohydrates. Prediction of PUL from genomic data alone involves the identification of carbohydrate-active enzymes (CAZymes) co-localised with susCD gene pairs. Here we present the open prediction of PUL in 5414 public Bacteroidetes genomes, together with an open-source pipeline, PULpy, to reproduce or extend the results. The PULpy code ("Open prediction of Polysaccharide Utilisation Loci (PUL)") is available on GitHub, as documented in the attached README.txt file.
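The co-localisation idea described in the abstract can be illustrated with a minimal Python sketch. It is not the PULpy implementation: the `Gene` record, the `kind` labels, the `find_candidate_loci` function and the `window` size are all hypothetical, chosen only to show what "CAZymes co-localised with susCD gene pairs" might look like once genes have already been annotated.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Gene:
    contig: str   # contig or scaffold identifier
    index: int    # order of the gene along the contig (0-based)
    kind: str     # hypothetical label: "susC", "susD", "CAZyme" or "other"


def find_candidate_loci(genes: List[Gene], window: int = 5) -> List[dict]:
    """Report adjacent susC/susD pairs that have at least one CAZyme
    within `window` genes on either side (window size is an assumption)."""
    by_pos: Dict[Tuple[str, int], Gene] = {(g.contig, g.index): g for g in genes}
    loci = []
    for gene in genes:
        neighbour = by_pos.get((gene.contig, gene.index + 1))
        if neighbour is None:
            continue
        # Require one susC and one susD, in either order, side by side.
        if {gene.kind, neighbour.kind} != {"susC", "susD"}:
            continue
        low, high = gene.index - window, neighbour.index + window
        cazymes = sorted(
            g.index
            for g in genes
            if g.contig == gene.contig
            and g.kind == "CAZyme"
            and low <= g.index <= high
        )
        if cazymes:
            loci.append(
                {
                    "contig": gene.contig,
                    "sus_pair": (gene.index, neighbour.index),
                    "cazymes": cazymes,
                }
            )
    return loci


# Toy input: contig c1 has a susCD pair flanked by CAZymes, c2 has a pair alone.
kinds_c1 = ["other", "CAZyme", "susC", "susD", "other", "CAZyme", "other"]
kinds_c2 = ["susC", "susD", "other"]
example = [Gene("c1", i, k) for i, k in enumerate(kinds_c1)]
example += [Gene("c2", i, k) for i, k in enumerate(kinds_c2)]
candidates = find_candidate_loci(example, window=2)
```

In this toy input only the pair on `c1` is reported, because the pair on `c2` has no CAZyme nearby. A real pipeline would first need to annotate susC, susD and CAZyme genes from sequence data, which this sketch takes as given.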

Data Citation

Watson, Mick. (2018). Open prediction of polysaccharide utilisation loci (PUL) in 5414 public Bacteroidetes genomes using PULpy, [dataset]. The Roslin Institute and Royal (Dick) School of Veterinary Studies. University of Edinburgh. https://doi.org/10.7488/ds/2438.
Date made available: 18 Sept 2018
Publisher: Edinburgh DataShare