Add support for continuous aggregates for distributed hypertables by
allowing a continuous aggregate to read from a distributed hypertable
so that the continuous aggregate is on the access node while the
hypertable data is on the data nodes.
For distributed hypertables, both the hypertable and continuous
aggregate invalidation logs are kept on the data nodes, and the refresh
window is computed at refresh time on each data node. Since the
continuous aggregate materialization hypertable is not present on the
data nodes, the invalidation log was extended to allow using a
non-local hypertable id on the data nodes. This means that you cannot
create continuous aggregates on the data nodes since those could clash
with continuous aggregates on the access node.
Some utility statements add entries to the invalidation logs
directly (truncating chunks and hypertables, as well as dropping
individual chunks), so to handle this case, internal functions were
added to allow logging invalidations on the data nodes from the access
node.
The commit also includes fixes to memory context usage that caused
crashes for invalidation triggers, and it disables per-data node
queries during refresh since those would otherwise generate an
exception.
Fixes #3435
Co-authored-by: Mats Kindahl <mats@timescale.com>
This PR removes the C code that executes the compression
policy. Instead we use a PL/pgSQL procedure to execute
the policy.
PG13.4 and PG12.8 introduced changes that require PortalContexts
while executing transactions. The compression policy procedure
compresses chunks in multiple transactions. We have seen issues with
snapshots and portal management in the policy code due to the PG13.4
changes. The SPI API has transaction and portal management code, but
the compression policy code does not use SPI interfaces. It is fairly
easy to convert the policy into a PL/pgSQL procedure (which calls SPI)
rather than replicating portal management code in C to manage multiple
transactions in the compression policy.
This PR also disallows decompress_chunk, compress_chunk and
recompress_chunk in read-only transaction mode.
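A minimal PL/pgSQL sketch of the approach (not the actual policy
procedure shipped with TimescaleDB; the procedure name and arguments are
illustrative):

```sql
-- Illustrative only: compress eligible chunks, one chunk per transaction,
-- from a PL/pgSQL procedure so that SPI handles portal/transaction management.
CREATE OR REPLACE PROCEDURE compress_old_chunks(hypertable REGCLASS,
                                                older_than INTERVAL)
LANGUAGE plpgsql AS $$
DECLARE
    chunk REGCLASS;
BEGIN
    FOR chunk IN
        SELECT ch FROM show_chunks(hypertable, older_than => older_than) ch
    LOOP
        PERFORM compress_chunk(chunk, if_not_compressed => true);
        COMMIT;  -- each chunk is compressed in its own transaction
    END LOOP;
END
$$;
```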
Fixes #3656
Simplify the CTE to recursively inspect all partitions of a relation
and calculate the sum of `pg_class.reltuples`, taking into account the
differences introduced by PG14.
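A sketch of the idea, not the exact shipped query (the relation name is
an assumption):

```sql
-- Walk the inheritance/partition tree of a relation and sum reltuples,
-- skipping partitioned parents since PG14 rolls children's counts up into them.
WITH RECURSIVE relations(oid) AS (
    SELECT 'my_table'::regclass::oid        -- relation name is an assumption
    UNION ALL
    SELECT i.inhrelid
    FROM pg_inherits i
    JOIN relations r ON i.inhparent = r.oid
)
SELECT sum(c.reltuples)::bigint AS approximate_row_count
FROM relations r
JOIN pg_class c ON c.oid = r.oid
WHERE c.relkind <> 'p';                     -- avoid double counting on PG14
```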
Update version numbers and add 2.4.2 to update tests.
We have to put the DROP FUNCTION back in latest-dev because
2.4.2 did not include the commit which removed the function
definitions.
This release contains bug fixes since the 2.4.1 release.
We deem it high priority to upgrade.
**Bugfixes**
* #3437 Rename on all continuous aggregate objects
* #3469 Use signal-safe functions in signal handler
* #3520 Modify compression job processing logic
* #3527 Fix time_bucket_ng behaviour with origin argument
* #3532 Fix bootstrap with regresschecks disabled
* #3574 Fix failure on job execution by background worker
* #3590 Call cleanup functions on backend exit
**Thanks**
* @jankatins for reporting a crash with background workers
* @LutzWeischerFujitsu for reporting an issue with bootstrap
When renaming a column on a continuous aggregate, only the user view
column was renamed. This commit changes this by applying the rename on
the materialized table, the user view, the direct view, and the partial
view, as well as the column name in the dimension table.
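For illustration, the user-facing statement whose effect now propagates
to all of these objects (view and column names are assumptions):

```sql
-- Renaming a column on the continuous aggregate now also renames it on the
-- materialization table, direct view, partial view, and dimension metadata.
ALTER MATERIALIZED VIEW conditions_hourly RENAME COLUMN device TO device_id;
```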
Since this also changes some of the table definitions, we need to
perform the same rename in the update scripts as well, which is done by
comparing the direct view and the user view to decide which columns
require a rename and then executing that rename on the direct view,
partial view, and materialization table, as well as updating the column
name in the dimension table.
When renaming columns in tables with indexes, the column in the table
is renamed but not the column in the index, which keeps the same name.
When restoring from a dump, however, the column name of the table is
used, which creates a diff in the update test. For that reason, we
change the update tests to not list index definitions as part of the
comparison. The existence of the indexes is still tracked and compared
since the indexes for a hypertable are listed as part of the output.
If a downgrade does not revert some changes, this causes a diff in the
downgrade test. Since the rename is benign and not easy to revert, it
would cause a test failure. Instead, we add a file with extra actions
to perform during a clean rerun to prevent these diffs; in this case,
applying the same renames as the update script.
Fixes #3405
Rewrite approximate_row_count in SQL instead of PL/pgSQL and remove
superfluous JOINs against pg_namespace. Adjust the tuple calculation
for PG14, since in PG14 reltuples for partitioned tables is the sum of
the children's reltuples, so we need to exclude those from the
calculation to avoid double counting.
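Usage is unchanged by the rewrite (the table name is an assumption):

```sql
-- Returns the estimated row count based on catalog statistics.
SELECT approximate_row_count('conditions');
```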
This patch marks time_bucket_ng() as IMMUTABLE. Two exceptions are:
- time_bucket_ng(interval, timestamptz) timestamptz
- time_bucket_ng(interval, timestamptz, timestamptz) timestamptz
... due to their implementation, see the comments. These two overloaded
versions were introduced only for backward compatibility with time_bucket()
and are not needed for building continuous aggregates.
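One way to inspect the resulting volatility markings in the catalog
('i' = IMMUTABLE, 's' = STABLE):

```sql
SELECT proname,
       pg_get_function_identity_arguments(oid) AS arguments,
       provolatile
FROM pg_proc
WHERE proname = 'time_bucket_ng';
```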
This patch adds a version of time_bucket_ng() with an 'origin' argument.
It doesn't address any other known issues; e.g., the volatility of the
function will be changed in another patch, and the error messages will be
improved when the feature gets a little more stable.
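A hedged usage sketch, assuming the DATE overload accepts the origin as
the third argument (values are illustrative):

```sql
-- Monthly buckets aligned to a custom origin instead of the default.
SELECT timescaledb_experimental.time_bucket_ng('1 month',
                                               date '2021-08-15',
                                               date '2021-06-01');
```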
This patch adds support for timezones in time_bucket_ng(). The 'origin'
argument can't be used with timezones yet; this will be implemented in
a separate pull request.
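A hedged example of the timezone variant (the explicit text cast and
values are illustrative; the exact parameter name is an assumption):

```sql
-- Bucket a timestamptz according to a specific timezone.
SELECT timescaledb_experimental.time_bucket_ng('1 month',
                                               timestamptz '2021-08-15 12:00:00+00',
                                               'Europe/Moscow'::text);
```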
This release contains bug fixes since the 2.4.0 release. We deem it
high priority to upgrade.
The release fixes continuous aggregate refresh for PostgreSQL 12.8 and
13.4, a crash with ALTER TABLE commands, and a crash with continuous
aggregates with a HAVING clause.
**Bugfixes**
* #3430 Fix havingqual processing for continuous aggregates
* #3468 Disable tests by default if tools are not found
* #3462 Fix crash while tracking alter table commands
* #3489 Fix continuous agg bgw job failure for PG 12.8 and 13.4
* #3494 Improve error message when adding data nodes
**Thanks**
* @brianbenns for reporting a segfault with continuous aggregates
* @usego for reporting an issue with continuous aggregate refresh on PG 13.4
This release adds new experimental features since the 2.3.1 release.
The experimental features in this release are:
* APIs for chunk manipulation across data nodes in a distributed
hypertable setup. This includes the ability to add a data node and move
chunks to the new data node for cluster rebalancing.
* The `time_bucket_ng` function, a newer version of `time_bucket`. This
function supports years, months, days, hours, minutes, and seconds.
We’re committed to developing these experiments, giving the community
a chance to provide early feedback and influence the direction of
TimescaleDB’s development. We’ll travel faster with your input!
Please create your feedback as a GitHub issue (using the
experimental-schema label), describe what you found, and tell us the
steps or share the code snippet to recreate it.
This release also includes several bug fixes.
PostgreSQL 11 deprecation announcement
Timescale is working hard on our next exciting features. To make that
possible, we require functionality that is available in Postgres 12 and
above. Postgres 11 is not supported with TimescaleDB 2.4.
**Experimental Features**
* #3293 Add timescaledb_experimental schema
* #3302 Add block_new_chunks and allow_new_chunks API to experimental
schema. Add chunk based refresh_continuous_aggregate.
* #3211 Introduce experimental time_bucket_ng function
* #3366 Allow use of experimental time_bucket_ng function in continuous aggregates
* #3408 Support for seconds, minutes and hours in time_bucket_ng
* #3446 Implement cleanup for chunk copy/move.
**Bugfixes**
* #3401 Fix segfault for RelOptInfo without fdw_private
* #3411 Verify compressed chunk validity for compressed path
* #3416 Fix targetlist names for continuous aggregate views
* #3434 Remove extension check from relcache invalidation callback
* #3440 Fix remote_tx_heal_data_node to work with only current database
**Thanks**
* @fvannee for reporting an issue with hypertable expansion in functions
* @amalek215 for reporting an issue with cache invalidation during pg_class vacuum full
* @hardikm10 for reporting an issue with inserting into compressed chunks
* @dberardo-com and @iancmcc for reporting an issue with extension updates after renaming columns of continuous aggregates.
A chunk copy/move operation is carried out in stages and can fail in
any of them. We track the last completed stage in the
"chunk_copy_operation" catalog table. In case of failure, a
"chunk_copy_cleanup" function can be invoked to bring the chunk back
to its original state on the source data node; all transient objects
like the replication slot, publication, subscription, empty chunk,
metadata updates, etc., are cleaned up.
Includes test case changes for failures induced at each and every stage.
To avoid confusion between chunk copy activity and chunk copy operation,
this patch also consistently uses "operation" everywhere instead of
"activity".
The internal function `copy_chunk_data` was removed as part of
refactoring and no longer needs to be removed in the downgrade script
since the function was never part of a release.
A new view in the experimental schema shows information related to
chunk replication. The view can be used to learn the replication
status of a chunk while also providing a way to easily find nodes to
move or copy chunks between in order to ensure a fully replicated
multi-node cluster.
Tests have been added to illustrate the potential usage.
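A hedged example, assuming the view is exposed as
`timescaledb_experimental.chunk_replication_status`:

```sql
-- Shows, per chunk, the desired and actual number of replicas and the nodes
-- that hold (or could receive) a replica.
SELECT * FROM timescaledb_experimental.chunk_replication_status;
```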
Remove copy_chunk_data() function and code needed to support it,
such as the 'transactional' argument.
Rework copy chunk logic using separate stages.
Introduce the copy_chunk() API function as an internal wrapper for
move_chunk().
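Hedged examples of the copy/move entry points (chunk and node names are
assumptions):

```sql
-- Copy a chunk replica to another data node.
CALL timescaledb_experimental.copy_chunk(
    chunk => '_timescaledb_internal._dist_hyper_1_1_chunk',
    source_node => 'data_node_1',
    destination_node => 'data_node_2');

-- Move a chunk, i.e., copy it and then drop it from the source data node.
CALL timescaledb_experimental.move_chunk(
    chunk => '_timescaledb_internal._dist_hyper_1_1_chunk',
    source_node => 'data_node_1',
    destination_node => 'data_node_2');
```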
The building blocks required for implementing end-to-end copy/move
chunk functionality have now been wrapped in a procedure.
A procedure is required because multiple transactions are needed to
carry out the activity across the access node and the two involved data
nodes.
The following steps are encapsulated in this procedure:
1) Create an empty chunk table on the destination data node
2) Copy the data from the src data node chunk to this newly created
destination node chunk. This is done via inbuilt PostgreSQL logical
replication functionality
3) Attach this chunk to the hypertable on the dst data node
4) Remove this chunk from the src data node to complete the move if
requested
A new catalog table "chunk_copy_activity" has been added to track
the progress of the above stages. A unique id gets assigned to each
activity and it is updated with the completed stages as things
progress.
Add internal copy_chunk_data() function which implements a way
to copy chunk data between data nodes using logical
replication.
This patch was prepared together with @nikkhils.
The `create_chunk` API has been extended to allow creating a chunk
from an existing relational table. The table is turned into a chunk by
attaching it to the root hypertable via inheritance.
The purpose of this functionality is to allow copying a chunk to
another node. First, the chunk table and data are copied. After that,
the `create_chunk` can be executed to make the new table part of the
hypertable.
Currently, the relational table used to create the chunk has to match
the hypertable in terms of constraints, triggers, etc. PostgreSQL
itself enforces the existence of same-named CHECK constraints, but no
enforcement currently exists for other objects, including triggers and
UNIQUE, PRIMARY KEY, or FOREIGN KEY constraints. Such enforcement can
be implemented in the future, if deemed necessary. Another option is
to automatically add all the required objects (triggers, constraints)
based on the hypertable equivalents. However, that might also lead to
duplicate objects in case some of them exist on the table prior to
creating the chunk.
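For illustration only, a hedged sketch of attaching a pre-created table
as a chunk; the exact internal signature and slice format are
assumptions:

```sql
-- Attach an existing, schema-compatible table as a chunk of a hypertable.
SELECT _timescaledb_internal.create_chunk(
    'conditions',                                      -- hypertable (assumption)
    '{"time": [1627776000000000, 1628380800000000]}',  -- dimension slices (assumed format)
    chunk_table => 'conditions_chunk_copy');           -- pre-created table (assumption)
```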
This function drops a chunk on a specified data node. It then removes
the metadata about the data node/chunk association on the access node.
This function is meant for internal use as part of the "move chunk"
functionality.
If only one chunk replica remains then this function refuses to drop it
to avoid data loss.
Creates a table for a chunk replica on the given data node. The table
gets the same schema and name as the chunk. The created chunk replica
table is not added to the metadata on the access node or data node.
The primary goal is to use it during chunk copy/move.
Adds an internal API function to create an empty chunk table according
to the given hypertable for the given chunk table name and dimension
slices. This function creates a chunk table inheriting from the
hypertable, so it guarantees the same schema. No TimescaleDB
metadata is updated.
To be able to create the chunk table in a tablespace attached to the
hypertable, this commit allows calculating the tablespace id without
the dimension slice existing in the catalog.
If there is already a chunk that collides on the dimension slices, the
function fails to create the chunk table.
The function will be used internally in multi-node to be able to
replicate a chunk from one data node to another.
As a future replacement for time_bucket(), time_bucket_ng()
should support seconds, minutes, and hours. This patch adds
this support. The implementation is the same as for
time_bucket(). Timezones are not yet supported.
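The sub-day units behave like time_bucket() (value is illustrative):

```sql
-- Truncates the timestamp to the containing 15-minute bucket,
-- e.g. 2021-07-12 12:30:00.
SELECT timescaledb_experimental.time_bucket_ng('15 minutes',
                                               timestamp '2021-07-12 12:34:56');
```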
Add 2.3.1 to the update tests and update the downgrade target for the
downgrade version. This commit also includes two fixes that were made
on the release branch:
1. Drop `_timescaledb_internal.refresh_continuous_aggregate`
2. Update the changelog to match the release branch.
**Bugfixes**
* #3279 Add some more randomness to chunk assignment
* #3288 Fix failed update with parallel workers
* #3300 Improve trigger handling on distributed hypertables
* #3304 Remove paths that reference parent relids for compressed chunks
* #3305 Fix pull_varnos miscomputation of relids set
* #3310 Generate downgrade script
* #3314 Fix heap buffer overflow in hypertable expansion
* #3317 Fix heap buffer overflow in remote connection cache.
* #3327 Make aggregates in caggs fully qualified
* #3336 Fix pg_init_privs objsubid handling
* #3345 Fix SkipScan distinct column identification
* #3355 Fix heap buffer overflow when renaming compressed hypertable columns.
* #3367 Improve DecompressChunk qual pushdown
* #3377 Fix bad use of repalloc
**Thanks**
* @db-adrian for reporting an issue when accessing cagg view through postgres_fdw
* @fncaldas and @pgwhalen for reporting an issue accessing caggs when public is not in search_path
* @fvannee, @mglonnro and @ebreijo for reporting an issue with the upgrade script
* @fvannee for reporting a performance regression with SkipScan
During an update, it is not possible to run the downgrade scripts until
the release has been tagged, but the update scripts can be run. This
means that we need to split the previous version into two different
fields: one for running the update tests and one for running the
downgrade tests.
From PG12 on, CREATE OR REPLACE is supported for aggregates;
therefore, since we have dropped support for PG11, we can avoid
going through the rigamarole of having our aggregates in a separate
file from the functions we define to support them. Nor do we need to
handle aggregates separately from other functions, as their creation
is now idempotent.
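A generic illustration of the idempotent creation PG12 enables (the
names are placeholders, not TimescaleDB's actual aggregates):

```sql
-- Re-running both statements simply replaces the objects instead of failing.
CREATE OR REPLACE FUNCTION example_sum_sfunc(state bigint, val integer)
RETURNS bigint LANGUAGE sql IMMUTABLE AS $$ SELECT state + val $$;

CREATE OR REPLACE AGGREGATE example_sum(integer) (
    SFUNC = example_sum_sfunc,
    STYPE = bigint,
    INITCOND = '0'
);
```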
This function is IMMUTABLE when it doesn't accept timestamptz arguments,
and STABLE otherwise. See the comments in sql/time_bucket_ng.sql for
more details.
This commit adds functions and code to generate a downgrade script from
the current version to the previous version. This requires execution
from a Git repository since it retrieves the prolog and epilog for the
"downgrade" file from the version given by `update_from_version` in the
`version.config` file.
The commit adds several CMake functions that simplify the composition
of script files, but these are not used to generate the update scripts.
A potential improvement is to use the scripts to also generate the
update scripts.
This commit supports generating a downgrade script from the
current version to the previous version. Other versions are handled
using a variable containing the file names of reverse update
scripts; the source and target versions are extracted from the file
names, which are assumed to be of the form
`<source-version>--<target-version>.sql`.
In addition to adding support for generating downgrade scripts, the
commit adds a downgrade test file that tests a release in a similar way
to the update script and adds it as a workflow.
This patch adds the time_bucket_ng() function to the experimental
schema. The "ng" part stands for "next generation". Unlike the
current time_bucket() implementation, the _ng version will support
months, years, and timezones.
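For illustration, the monthly buckets that time_bucket() cannot produce
(value is illustrative):

```sql
-- Returns the first day of the containing month, e.g. 2021-06-01.
SELECT timescaledb_experimental.time_bucket_ng('1 month', date '2021-06-23');
```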
The current implementation doesn't claim to be complete. For instance,
it doesn't support timezones yet. The reasons to commit it in its
current state are 1) to shorten the feedback loop, 2) to start
experimenting with monthly buckets as soon as possible,
3) to reduce the unnecessary work of rebasing and resolving
conflicts, and 4) to make the work easier for the reviewers.
The post-update script already handled preserving initprivs for newly
added catalog tables and views. However, newly added catalog sequences
need separate handling, otherwise the update tests start failing. We
also now grant privileges for all future sequences in the update tests.
In passing, default PG_VERSION in the update tests to 12 since we
don't work with PG11 anymore.
Add a workflow to check that CMake files are correctly formatted as
well as a custom target to format CMake files in the repository. This
commit also runs the formatting on all CMake files in the repository.
Since we are only interested in entries with classoid pg_class,
our queries should reflect that. Without these restrictions,
objects that have entries for multiple classoids can cause the
extension update to fail.
Executing "ALTER EXTENSION timescaledb UPDATE TO ..." will fail
if parallel workers spawn for the update itself. Disable parallel
execution during the update.
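For illustration, one session-level way to prevent parallel query plans
(not necessarily the exact mechanism used by the update script):

```sql
-- Disallow parallel workers for queries in the current session.
SET max_parallel_workers_per_gather = 0;
```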
pg_init_privs can have multiple entries per relation if the relation
has per-column privileges. An objsubid different from 0 means that
the entry is for a column privilege. Since we do not currently
restore column privileges, we have to ignore those rows, otherwise
the update script will fail when tables with column privileges are
present.
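An illustrative query for the entries of interest, restricted to
pg_class objects and table-level (objsubid = 0) privileges:

```sql
SELECT objoid::regclass, initprivs
FROM pg_init_privs
WHERE classoid = 'pg_class'::regclass
  AND objsubid = 0;
```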
When querying continuous aggregate views with a search_path not
including public, the query will fail because the function reference
in the finalize call is not fully qualified.
This can surface when querying caggs through postgres_fdw, which
resets search_path to contain only pg_catalog.
Fixes #1919
Fixes #3326
This release adds major new features since the 2.2.1 release. We deem
it moderate priority for upgrading.
This release adds support for inserting data into compressed chunks
and improves performance when inserting data into distributed
hypertables. Distributed hypertables now also support triggers and
compression policies.
The bug fixes in this release address issues related to the handling
of privileges on compressed hypertables, locking, and triggers with
transition tables.
**Features**
* #3116 Add distributed hypertable compression policies
* #3162 Use COPY when executing distributed INSERTs
* #3199 Add GENERATED column support on distributed hypertables
* #3210 Add trigger support on distributed hypertables
* #3230 Support for inserts into compressed chunks
**Bugfixes**
* #3213 Propagate grants to compressed hypertables
* #3229 Use correct lock mode when updating chunk
* #3243 Fix assertion failure in decompress_chunk_plan_create
* #3250 Fix constraint triggers on hypertables
* #3251 Fix segmentation fault due to incorrect call to chunk_scan_internal
* #3252 Fix blocking triggers with transition tables
**Thanks**
* @yyjdelete for reporting a crash with decompress_chunk and identifying the bug in the code
* @fabriziomello for documenting the prerequisites when compiling against PostgreSQL 13