Edinburgh Research Explorer

A Data Transformation System for Biological Data Sources

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Related Edinburgh Organisations

Open Access permissions

Open

Documents

  • Download as Adobe PDF

    Rights statement: Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.

    Final published version, 1 MB, PDF document

http://www.vldb.org/conf/1995/P158.PDF
Original languageEnglish
Title of host publicationProceedings of 21st VLDB Conference
PublisherMorgan Kaufmann
Pages158-169
Number of pages12
ISBN (Print)1-55860-379-4
Publication statusPublished - 1 Sep 1995

Abstract

Scientific data of importance to biologists in the Human Genome Project resides not only in conventional databases, but in structured files maintained in a number of different formats (e.g. ASN.1 and ACE) as well as sequence analysis packages (e.g. BLAST and FASTA). These formats and packages contain a number of data types not found in conventional databases, such as lists and variants, and may be deeply nested. We present in this paper techniques for querying and transforming such data, and illustrate their use in a prototype system developed in conjunction with the Human Genome Center for Chromosome 22. We also describe optimizations performed by the system, a crucial issue for bulk data.

Download statistics

No data available

ID: 10624740