timescaledb

The Postgres client (psql)
A standard PostgreSQL installation with development environment (header files), or
Docker (see separate build and run instructions)

Build and install with local PostgreSQL

# To build the extension
make 

# To install 
make install

# To run tests (needs running Postgres server with preloaded extension)
make installcheck

Build and run in Docker

# To build a Docker image
make -f docker.mk build-image

# To run a container 
make -f docker.mk run

# To run tests
make -f docker test

Setting up a local single node database

After starting the Docker image or local PostgreSQL server, you can initiate a local single node database:

psql -U postgres -h localhost < scripts/sql/setup_single_node_db.psql

This will set up a database named iobeamdb which can be accessed with:

psql -U postgres -h localhost -d iobeamdb

Creating a table

Our specialized time-series table, which is the main abstraction for querying all your space-time partitions, is called a hypertable.

To create a hypertable, you start with a regular SQL table, and then convert it into a hypertable via the function create_hypertable()(API Definition) (which is loaded when you install the extension).

For example, let's create a hypertable for tracking temperature and humidity across a collection of devices over time.

First, create a regular SQL table:

CREATE TABLE conditions (
  time TIMESTAMP WITH TIME ZONE NOT NULL,
  device_id TEXT NOT NULL,
  temperature DOUBLE PRECISION NULL,
  humidity DOUBLE PRECISION NULL
);

Next, convert it into a hypertable using the provided function create_hypertable():

SELECT create_hypertable('"conditions"', 'time', 'device_id');

Now, a hypertable that is partitioned on time (using the values in the time column) and on device_id has been created.

Note:

You can also run the following command from inside the repo to create the above table for you:

./scripts/run_sql.sh setup_sample_hypertable.psql

Inserting and Querying

Inserting data into the hypertable is done via normal SQL INSERT commands, e.g. using millisecond timestamps:

INSERT INTO conditions(time,device_id,temperature,humidity)
VALUES(NOW(), 'office', 70.0, 50.0);

Similarly, querying data is done via normal SQL SELECT commands. SQL UPDATE and DELETE commands also work as expected.

Indexing data

Data is indexed using normal SQL CREATE INDEX commands. For instance,

CREATE INDEX ON conditions (device_id, time DESC);

This can be done before or after converting the table to a hypertable.

Indexing suggestions:

Our experience has shown that different types of indexes are most-useful for time-series data, depending on your data.

For indexing columns with discrete (limited-cardinality) values (e.g., where you are most likely to use an "equals" or "not equals" comparator) we suggest using an index like this (using our hypertable conditions for the example):

CREATE INDEX ON conditions (device_id, time DESC);

For all other types of columns, i.e., columns with continuous values (e.g., where you are most likely to use a "less than" or "greater than" comparator) the index should be in the form:

CREATE INDEX ON conditions (time DESC, temperature);

Having a time DESC column specification in the index allows for efficient queries by column-value and time. For example, the index defined above would optimize the following query:

SELECT * FROM conditions WHERE device_id = 'dev_1' ORDER BY time DESC LIMIT 10

For sparse data where a column is often NULL, we suggest adding a WHERE column IS NOT NULL clause to the index (unless you are often searching for missing data). For example,

CREATE INDEX ON conditions (time DESC, humidity) WHERE humidity IS NOT NULL;

this creates a more compact, and thus efficient, index.

Examples

TODO PROVIDE EXAMPLES

Languages

C 67.7%

PLpgSQL 25.6%

CMake 1.8%

Ruby 1.7%

Python 1.3%

Other 1.9%