timescaledb

mirror of https://github.com/timescale/timescaledb.git synced 2025-05-15 18:13:18 +08:00

TimescaleDB is an open-source database designed to make SQL scalable for time-series data. It is engineered up from PostgreSQL, providing automatic partitioning across time and space (partitioning key), as well as full SQL support.

TimescaleDB is packaged as a PostgreSQL extension and a set of scripts.

For a more detailed description of our architecture, please read the technical paper. Additionally, more documentation can be found on our docs website.

There are several ways to install TimescaleDB:

Homebrew (for MacOS)
apt (Ubuntu 16.04 & 17.04)
Docker
From source

Installation

NOTE: Currently, upgrading to new versions requires a fresh install.

Prerequisite

The Postgres client (psql) is required for all of the following installation methods.

Option 1 - Homebrew (MacOS)

This will install PostgreSQL 9.6 via Homebrew as well. If you have another installation (e.g. Postgres.app), this will cause problems. We recommend removing other installations before using this method.

Prerequisites

Homebrew

Build and install

# Add our tap
brew tap timescale/tap
# To install
brew install timescaledb

You'll need to update your PostgreSQL configuration and restart as well.

Option 2 - `apt` (Ubuntu)

Prerequisites

Ubuntu 16.04 or 17.04

Build and install

# Add our PPA
sudo add-apt-repository ppa:timescale/timescaledb-ppa
sudo apt-get update
# To install
sudo apt install timescaledb

You'll need to update your PostgreSQL configuration and restart as well.

Option 3 - Docker Hub

You can pull our Docker images from Docker Hub.

docker pull timescale/timescaledb:latest

To run, you'll need to specify a directory on the host machine where the data should be stored. For example, if you want to store the data in /your/data/dir on the host machine:

docker run -d \
  --name timescaledb \
  -v /your/data/dir:/var/lib/postgresql/data \
  -p 5432:5432 \
  -e PGDATA=/var/lib/postgresql/data/timescaledb \
  timescale/timescaledb postgres \
  -cshared_preload_libraries=timescaledb

In particular, the -v flag sets where the data is stored. If not set, the data lives only inside the container and is lost when the container is removed.

You can write the above command to a shell script for easy use, or use our docker-run.sh in scripts/, which saves the data to $PWD/data. There you can also see additional -c flags we recommend for memory settings, etc.
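As a rough sketch of such a wrapper script (this is not the project's docker-run.sh), the function below stores the data in $PWD/data by default. The names run_timescaledb, DATA_DIR and DRY_RUN are assumptions made up for this example; the docker run arguments are the ones shown above.

```shell
# Sketch of a wrapper around the `docker run` command shown above.
# DATA_DIR and DRY_RUN are hypothetical variables used only in this example.
run_timescaledb() {
  data_dir=${DATA_DIR:-$PWD/data}

  # Build the command as the function's argument list so it can be
  # either printed or executed unchanged.
  set -- docker run -d \
    --name timescaledb \
    -v "$data_dir:/var/lib/postgresql/data" \
    -p 5432:5432 \
    -e PGDATA=/var/lib/postgresql/data/timescaledb \
    timescale/timescaledb postgres \
    -cshared_preload_libraries=timescaledb

  if [ -n "${DRY_RUN:-}" ]; then
    printf '%s\n' "$*"                 # print the command instead of running it
  else
    mkdir -p "$data_dir" && "$@"       # create the data directory, then run
  fi
}
```

Calling `DRY_RUN=1 run_timescaledb` prints the command without starting a container, which is a cheap way to check the mount path first.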

Option 4 - From source

We have only tested our build process on MacOS and Linux. We do not support building on Windows yet. Windows users may be able to use our Docker image on Docker Hub (see above).

Prerequisites

A standard PostgreSQL 9.6 installation with development environment (header files) (e.g., Postgres.app for MacOS)

Build and install with local PostgreSQL

make
make install

You'll need to update your PostgreSQL configuration and restart as well.

Updating `postgresql.conf`

For every installation method except Docker, you'll need to add our extension as a preloaded library to your postgresql.conf file. To locate that file:

# Create postgres superuser if not already done:
createuser postgres -s
psql -U postgres -c 'SHOW config_file;'

               config_file
------------------------------------------
/path/to/your/postgresql.conf
(1 row)

Using your favorite editor, open the file and modify it to add TimescaleDB's library:

# Modify postgresql.conf to uncomment this line and add required libraries.
shared_preload_libraries = 'timescaledb'
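If you would rather script this edit, the following is a minimal sketch rather than an official tool. The function name enable_timescaledb_preload is made up for this example. It assumes the setting is either commented out or absent, and it refuses to touch a file that already has an active shared_preload_libraries line listing other libraries, since those need to be merged by hand.

```shell
# Hypothetical helper: make sure postgresql.conf preloads timescaledb.
# Usage (assumes a 'postgres' superuser, as above):
#   enable_timescaledb_preload "$(psql -U postgres -Atc 'SHOW config_file;')"
enable_timescaledb_preload() {
  conf=$1
  # Matches an active (uncommented) shared_preload_libraries line.
  active='^[[:space:]]*shared_preload_libraries[[:space:]]*='

  if grep -Eq "${active}.*timescaledb" "$conf"; then
    return 0    # already configured, nothing to do
  elif grep -Eq "$active" "$conf"; then
    echo "shared_preload_libraries is already set in $conf; add timescaledb to it by hand" >&2
    return 1
  fi

  # No active setting: append one. Commented-out defaults are left as they are.
  printf '%s\n' "shared_preload_libraries = 'timescaledb'" >> "$conf"
}
```

The function is safe to run more than once: a second call finds the line it appended and exits without changing the file. PostgreSQL still has to be restarted afterwards.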

Now restart PostgreSQL:

# Homebrew
brew services restart postgresql
# apt
sudo service postgresql restart
# Other platforms may have different methods

Setting up your initial database

Now, we'll install our extension and create an initial database. Below you'll find instructions for creating a new, empty database.

To help you quickly get started, we have also created some sample datasets. Once you complete the initial setup below you can then easily import this data to play around with TimescaleDB functionality. See our Sample Datasets for further instructions.

Setting up an empty database

When creating a new database, you need to install the extension in it.

# Connect to Postgres, using a superuser named 'postgres'
psql -U postgres -h localhost

-- Create the database, connect to it, and install the extension
CREATE DATABASE tutorial;
\c tutorial
CREATE EXTENSION IF NOT EXISTS timescaledb CASCADE;

For convenience, this can also be done in one step by running a script from the command-line:

DB_NAME=tutorial ./scripts/setup-db.sh

Accessing your new database

You should now have a brand new time-series database running in Postgres.

# To access your new database
psql -U postgres -h localhost -d tutorial

Next let's load some data.

Working with time-series data

One of the core ideas of our time-series database is the hypertable: a data table optimized for time-series data.

Creating a (hyper)table

To create a hypertable, you start with a regular SQL table, and then convert it into a hypertable via the function create_hypertable() (API definition).

The following example creates a hypertable for tracking temperature and humidity across a collection of devices over time.

-- We start by creating a regular SQL table
CREATE TABLE conditions (
  time        TIMESTAMPTZ       NOT NULL,
  location    TEXT              NOT NULL,
  temperature DOUBLE PRECISION  NULL,
  humidity    DOUBLE PRECISION  NULL
);

Next, transform it into a hypertable using the provided function create_hypertable():

-- This creates a hypertable that is partitioned by time
--   using the values in the `time` column.
SELECT create_hypertable('conditions', 'time');

-- OR you can additionally partition the data on another dimension
--   (what we call 'space') such as `location`.
-- For example, to partition `location` into 2 partitions:
SELECT create_hypertable('conditions', 'time', 'location', 2);

Inserting and querying

Inserting data into the hypertable is done via normal SQL INSERT commands, e.g., using the current time as the timestamp:

INSERT INTO conditions(time, location, temperature, humidity)
VALUES(NOW(), 'office', 70.0, 50.0);

Similarly, querying data is done via normal SQL SELECT commands. SQL UPDATE and DELETE commands also work as expected.

Indexing data

Data is indexed using normal SQL CREATE INDEX commands. For instance,

CREATE INDEX ON conditions (location, time DESC);

This can be done before or after converting the table to a hypertable.

Indexing suggestions:

Our experience has shown that different types of indexes are most useful for time-series data, depending on your data.

For indexing columns with discrete (limited-cardinality) values (e.g., where you are most likely to use an "equals" or "not equals" comparator) we suggest using an index like this (using our hypertable conditions for the example):

CREATE INDEX ON conditions (location, time DESC);

For all other types of columns, i.e., columns with continuous values (e.g., where you are most likely to use a "less than" or "greater than" comparator) the index should be in the form:

CREATE INDEX ON conditions (time DESC, temperature);

Having a time DESC column specification in the index allows for efficient queries by column value and time. For example, the (location, time DESC) index defined above would optimize the following query:

SELECT * FROM conditions WHERE location = 'garage' ORDER BY time DESC LIMIT 10;

For sparse data where a column is often NULL, we suggest adding a WHERE column IS NOT NULL clause to the index (unless you are often searching for missing data). For example,

CREATE INDEX ON conditions (time DESC, humidity) WHERE humidity IS NOT NULL;

This creates a more compact, and thus more efficient, index.

Current limitations

Below are a few current limitations of our database, which we are actively working to resolve:

Any user has full read/write access to the metadata tables for hypertables.
Permission changes on hypertables are not correctly propagated.
create_hypertable() can only be run on an empty table.
Custom user-created triggers on hypertables are currently not allowed.
drop_chunks() (see our API Reference) is currently only supported for hypertables that are not partitioned by space.

Restoring a database from backup

A database with the timescaledb extension can be backed up using normal backup procedures (e.g., pg_dump). However, when restoring the database the following procedure must be used.

CREATE DATABASE db_for_restore;
ALTER DATABASE db_for_restore SET timescaledb.restoring='on';

-- Execute the restore into the new database:
\! pg_restore -h localhost -U postgres -d db_for_restore dump/single.sql

-- Connect to the restored database and finish the restore:
\c db_for_restore
SELECT restore_timescaledb();
ALTER DATABASE db_for_restore SET timescaledb.restoring='off';

Note: You must use pg_dump and pg_restore version 9.6.2 or above.
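To check this requirement before a backup or restore, something like the sketch below may help. The function names tool_version, version_ge and check_pg_tool are made up for this example, and it assumes the tools report their version in the usual form, e.g. `pg_restore (PostgreSQL) 9.6.2`, with purely numeric dotted components.

```shell
# Print the first dotted version number found in a `--version` line.
tool_version() {
  printf '%s\n' "$1" | sed -n 's/^[^0-9]*\([0-9][0-9.]*\).*/\1/p'
}

# Succeed when dotted version $1 is greater than or equal to $2.
# Missing components count as 0, so 9.6 is treated as 9.6.0.
version_ge() {
  v1=$1 v2=$2
  while [ -n "$v1" ] || [ -n "$v2" ]; do
    a=${v1%%.*}; b=${v2%%.*}
    [ "${a:-0}" -gt "${b:-0}" ] && return 0
    [ "${a:-0}" -lt "${b:-0}" ] && return 1
    # Drop the component just compared, or empty the string if it was the last.
    case $v1 in *.*) v1=${v1#*.} ;; *) v1= ;; esac
    case $v2 in *.*) v2=${v2#*.} ;; *) v2= ;; esac
  done
  return 0
}

# Hypothetical check: fail with a message when a tool is older than 9.6.2.
# Usage: check_pg_tool pg_dump && check_pg_tool pg_restore
check_pg_tool() {
  v=$(tool_version "$("$1" --version)")
  version_ge "$v" 9.6.2 || { echo "$1 $v is older than 9.6.2" >&2; return 1; }
}
```

The comparison is done component by component rather than as a string so that, for instance, 10.1 is correctly treated as newer than 9.6.2.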

More APIs

For more information on TimescaleDB's APIs, check out our API Reference.

Testing

If you want to contribute, please make sure to run the test suite before submitting a PR.

If you are running locally:

make installcheck

If you are using Docker:

make -f docker.mk test
