A user-defined data type for the storage of time series data allowing efficient similarity screening

Eur J Pharm Sci. 2012 Jul 16;46(4):272-4. doi: 10.1016/j.ejps.2011.12.008. Epub 2011 Dec 9.

Abstract

The volume of the experimentally measured time series data is rapidly growing, while storage solutions offering better data types than simple arrays of numbers or opaque blobs for keeping series data are sorely lacking. A number of indexing methods have been proposed to provide efficient access to time series data, but none has so far been integrated into a tried-and-proven database system. To explore the possibility of such integration, we have developed a data type for time series storage in PostgreSQL, an object-relational database system, and equipped it with an access method based on SAX (Symbolic Aggregate approXimation). This new data type has been successfully tested in a database supporting a large-scale plant gene expression experiment, and it was additionally tested on a very large set of simulated time series data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computer Simulation
  • Data Mining
  • Database Management Systems*
  • Gene Expression Regulation, Plant
  • Humans
  • Information Storage and Retrieval*
  • Models, Biological
  • Systems Biology*
  • Systems Integration*
  • Time Factors