A user-defined data type for the storage of time series data allowing efficient similarity screening

Anatoly Sorokin, Gene Selkov, Igor Goryanin, Karen Halliday (Group Leader)

Research output: Contribution to journalArticlepeer-review

Abstract

The volume of the experimentally measured time series data is rapidly growing, while storage solutions offering better data types than simple arrays of numbers or opaque blobs for keeping series data are sorely lacking. A number of indexing methods have been proposed to provide efficient access to time series data, but none has so far been integrated into a tried-and-proven database system. To explore the possibility of such integration, we have developed a data type for time series storage in PostgreSQL, an object-relational database system, and equipped it with an access method based on SAX (Symbolic Aggregate approXimation). This new data type has been successfully tested in a database supporting a large-scale plant gene expression experiment, and it was additionally tested on a very large set of simulated time series data.
Original languageEnglish
Pages (from-to)272-4
Number of pages3
JournalEuropean Journal of Pharmaceutical Sciences
Volume46
Issue number4
DOIs
Publication statusPublished - 16 Jul 2012

Keywords

  • algorithms
  • animals
  • computer simulation
  • data mining
  • database management systems
  • gene expression regulation
  • humans
  • information storage and retrieval
  • models
  • systems biology
  • systems integration
  • time factors

Fingerprint Dive into the research topics of 'A user-defined data type for the storage of time series data allowing efficient similarity screening'. Together they form a unique fingerprint.

Cite this