timescaledb

mirror of https://github.com/timescale/timescaledb.git synced 2025-05-17 11:03:36 +08:00

Go to file

Markos Fountoulakis 221437e8ef Continuous aggregates for distributed hypertables

Add support for continuous aggregates for distributed hypertables by
allowing a continuous aggregate to read from a distributed hypertable
so that the continuous aggregate is on the access node while the
hypertable data is on the data nodes.

For distributed hypertables, both the hypertable and continuous
aggregate invalidation log are kept on the data nodes and the refresh
window is computed at refresh time on each data node. Since the
continuous aggregate materialization hypertable is not present on the
data nodes, the invalidation log was extended to allow using a
non-local hypertable id on the data nodes. This means that you cannot
create continuous aggregates on the data nodes since those could clash
with continuous aggregates on the access node.

Some utility statements added entries to the invalidation logs
directly (truncating chunks and hypertables, as well as dropping
individual chunks), so to handle this case, internal functions were
added to allow logging invalidation on the data nodes from the access
node.

The commit also includes some fixes to memory context usage that
caused crashes for invalidation triggers and also disable per data
node queries during refresh since that would otherwise generate an
exception.

Fixes #3435

Co-authored-by: Mats Kindahl <mats@timescale.com>

2021-10-25 18:20:11 +03:00

.github

Support transparent decompression on individual chunks

2021-10-20 20:42:21 +02:00

cmake

Continuous aggregates for distributed hypertables

2021-10-25 18:20:11 +03:00

codecov

Add workflow and CMake support for formatting

2021-06-17 22:52:29 +02:00

docs

Remove support for PG11

2021-06-01 20:21:06 +02:00

scripts

Continuous aggregates for distributed hypertables

2021-10-25 18:20:11 +03:00

sql

Continuous aggregates for distributed hypertables

2021-10-25 18:20:11 +03:00

src

Continuous aggregates for distributed hypertables

2021-10-25 18:20:11 +03:00

test

Fix GRANT/REVOKE ALL IN SCHEMA handling

2021-10-22 16:48:16 +05:30

tsl

Continuous aggregates for distributed hypertables

2021-10-25 18:20:11 +03:00

.clang-format

Add support for PG13 List implementation

2021-01-04 11:18:33 +01:00

.clang-tidy

Enable clang-tidy

2020-04-21 11:47:47 +02:00

.codecov.yml

Run all tests on cron triggered runs

2019-08-04 22:16:01 +02:00

.dir-locals.el

Add PostgreSQL's .dir-locals.el style file

2017-04-28 13:43:32 +02:00

.editorconfig

Add .editorconfig for better github display

2019-02-14 16:09:18 -05:00

.gitignore

Add common patterns to .gitignore

2020-09-07 17:44:53 +02:00

.travis.yml

Remove PG11 from CI

2021-06-01 20:21:06 +02:00

appveyor.yml

Switch appveyor to preview image

2021-10-04 12:57:45 +02:00

bootstrap

Replace hardcoded bash path in shell scripts

2018-10-10 17:55:39 +02:00

bootstrap.bat

Refactor telemetry and fixes

2018-09-10 13:29:59 -04:00

CHANGELOG.md

Fix SkipScan with varchar column

2021-10-25 12:02:03 +02:00

CMakeLists.txt

No longer require -DEXPERIMENTAL for PG14

2021-10-14 02:14:37 +02:00

CONTRIBUTING.md

Fix contributors documentation

2021-10-18 17:51:27 -03:00

LICENSE

Add new top-level licensing information

2018-12-29 00:12:45 -10:00

LICENSE-APACHE

Add new top-level licensing information

2018-12-29 00:12:45 -10:00

NOTICE

Update copyright year to 2021

2021-01-13 17:01:32 -05:00

README.md

Fix a typo in README.md

2021-10-06 13:56:46 +02:00

timescaledb.control.in

Mark timescaledb as trusted extension

2021-03-08 18:24:38 +01:00

version.config

Post 2.4.2 release

2021-09-22 18:20:30 +02:00

README.md

Linux/macOS	Linux i386	Windows	Coverity	Code Coverage

TimescaleDB

TimescaleDB is an open-source database designed to make SQL scalable for time-series data. It is engineered up from PostgreSQL and packaged as a PostgreSQL extension, providing automatic partitioning across time and space (partitioning key), as well as full SQL support.

If you prefer not to install or administer your instance of TimescaleDB, hosted versions of TimescaleDB are available in the cloud of your choice (pay-as-you-go, with a free trial to start).

To determine which option is best for you, see Timescale Products for more information about our Apache-2 version, TimescaleDB Community (self-hosted), and Timescale Cloud (hosted), including: feature comparisons, FAQ, documentation, and support.

Below is an introduction to TimescaleDB. For more information, please check out these other resources:

For reference and clarity, all code files in this repository reference licensing in their header (either the Apache-2-open-source license or Timescale License (TSL) ). Apache-2 licensed binaries can be built by passing -DAPACHE_ONLY=1 to bootstrap.

Contributors welcome.

(To build TimescaleDB from source, see instructions in Building from source.)

Using TimescaleDB

TimescaleDB scales PostgreSQL for time-series data via automatic partitioning across time and space (partitioning key), yet retains the standard PostgreSQL interface.

In other words, TimescaleDB exposes what look like regular tables, but are actually only an abstraction (or a virtual view) of many individual tables comprising the actual data. This single-table view, which we call a hypertable, is comprised of many chunks, which are created by partitioning the hypertable's data in either one or two dimensions: by a time interval, and by an (optional) "partition key" such as device id, location, user id, etc. (Architecture discussion)

Virtually all user interactions with TimescaleDB are with hypertables. Creating tables and indexes, altering tables, inserting data, selecting data, etc., can (and should) all be executed on the hypertable.

From the perspective of both use and management, TimescaleDB just looks and feels like PostgreSQL, and can be managed and queried as such.

Before you start

PostgreSQL's out-of-the-box settings are typically too conservative for modern servers and TimescaleDB. You should make sure your postgresql.conf settings are tuned, either by using timescaledb-tune or doing it manually.

Creating a hypertable

-- Do not forget to create timescaledb extension
CREATE EXTENSION timescaledb;

-- We start by creating a regular SQL table
CREATE TABLE conditions (
  time        TIMESTAMPTZ       NOT NULL,
  location    TEXT              NOT NULL,
  temperature DOUBLE PRECISION  NULL,
  humidity    DOUBLE PRECISION  NULL
);

-- Then we convert it into a hypertable that is partitioned by time
SELECT create_hypertable('conditions', 'time');

Inserting and querying data

Inserting data into the hypertable is done via normal SQL commands:

INSERT INTO conditions(time, location, temperature, humidity)
  VALUES (NOW(), 'office', 70.0, 50.0);

SELECT * FROM conditions ORDER BY time DESC LIMIT 100;

SELECT time_bucket('15 minutes', time) AS fifteen_min,
    location, COUNT(*),
    MAX(temperature) AS max_temp,
    MAX(humidity) AS max_hum
  FROM conditions
  WHERE time > NOW() - interval '3 hours'
  GROUP BY fifteen_min, location
  ORDER BY fifteen_min DESC, max_temp DESC;

In addition, TimescaleDB includes additional functions for time-series analysis that are not present in vanilla PostgreSQL. (For example, the time_bucket function above.)

Installation

TimescaleDB is available pre-packaged for several platforms:

Timescale Cloud (cloud-hosted and managed TimescaleDB) is available via free trial. You create database instances in the cloud of your choice and use TimescaleDB to power your queries, automating common operational tasks and reducing management overhead.

We recommend following our detailed installation instructions.

To build from source, see instructions here.

Resources

Useful tools

timescaledb-tune: Helps set your PostgreSQL configuration settings based on your system's resources.
timescaledb-parallel-copy: Parallelize your initial bulk loading by using PostgreSQL's COPY across multiple workers.

Additional documentation

Community & help

Slack Channel
Github Issues
Timescale Support: see support options (community & subscription)

Releases & updates

Timescale Release Notes & Future Plans: see planned and in-progress updates and detailed information about current and past releases.
Subscribe to Timescale Release Notes to get notified about new releases, fixes, and early access/beta programs.

Contributing

Languages

C 67.7%

PLpgSQL 25.6%

CMake 1.8%

Ruby 1.7%

Python 1.3%

Other 1.9%