Calling the `ts_dist_cmd_invoke_on_data_nodes_using_search_path()` function
without an active transaction allows a connection invalidation event to
happen between applying `search_path` and executing the actual command,
which leads to an error.
This change introduces a way to ignore connection cache invalidations
using the `remote_connection_cache_invalidation_ignore()` function.
This work is based on @nikkhils' original fix and problem research.
Fixes#4022
The constify code that constifies TIMESTAMPTZ expressions during chunk
exclusion did not account for daylight saving time switches, which can
lead to different calculation outcomes when the timezone changes.
This patch adds a 4 hour safety buffer to any such calculations.
The code added to support VIEWs did not account for the fact that the
varno could come from a different nesting level and therefore not be
present in the current range table.
Allow planner chunk exclusion in subqueries. When we decide whether a
query may benefit from constifying now() and encounter a subquery, peek
into the subquery and check whether the constraint references a
hypertable partitioning column.
Fixes#4524
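A sketch of a query shape that now qualifies (names are illustrative):
-- the constraint inside the subquery can now drive plan-time chunk exclusion
select * from (
  select * from metrics where time > now() - '5m'::interval
) m;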
This patch adjusts the operator logic for valid space dimension
constraints to no longer look for an exact match on both sides
of the operator but instead allow mismatched datatypes.
Previously a constraint like `col = value` required `col` and `value` to
have matching datatypes. With this change `col` and `value` can have
different datatypes as long as they have an equality operator in the same
btree operator family.
Mismatched datatypes commonly happen when using int8 columns and
comparing them with integer literals. Integer literals default to int4,
so the datatypes would not match unless special care had been taken in
writing the constraints, and the optimization would therefore never apply
in those cases.
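A sketch of the now-supported case (names are illustrative):
-- device_id is int8 while the literal 1 defaults to int4; the int8 = int4
-- operator is in the same btree family, so the constraint now qualifies
select * from metrics where device_id = 1;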
Since we do not use our own hypertable expansion for SELECT FOR UPDATE
queries, we need to make sure to add the extra information necessary to
make hashed space partitions work with the native postgres inheritance
expansion.
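A sketch of the affected query shape (names are illustrative):
-- native postgres inheritance expansion is used here, so the hash space
-- partition metadata has to be attached for chunk exclusion to work
select * from metrics where device_id = 1 for update;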
This patch adds a new time_bucket_gapfill function that
allows bucketing in a specific timezone.
You can gapfill with an explicit timezone like so:
`SELECT time_bucket_gapfill('1 day', time, 'Europe/Berlin') ...`
Unfortunately this introduces an ambiguity with some previous call
variations when an untyped start/finish argument was passed to the
function. Some queries might need to be adjusted to either explicitly
name the positional arguments or resolve the type ambiguity by casting to
the intended type.
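A sketch of resolving the ambiguity (values are illustrative):
-- ambiguous: the untyped literals could be the timezone or start/finish
-- SELECT time_bucket_gapfill('1 day', time, '2022-01-01', '2022-02-01') ...
-- resolved by casting to the intended type
SELECT time_bucket_gapfill('1 day', time, '2022-01-01'::timestamptz, '2022-02-01'::timestamptz) ...
-- or by naming the arguments
SELECT time_bucket_gapfill('1 day', time, start => '2022-01-01', finish => '2022-02-01') ...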
This patch changes get_git_commit to always return the full hash. Since
different git versions do not agree on the length of the abbreviated
hash, the returned length was flaky; returning the full hash makes it
consistent.
When a query references multiple distributed hypertables the row-by-row
fetcher cannot be used. This patch changes the fetcher selection logic to
throw a better error message in those situations.
Previously the following error would be produced:
unexpected PQresult status 7 when starting COPY mode
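A sketch of the situation, assuming the fetcher is selected via the
timescaledb.remote_data_fetcher setting (the setting name is an
assumption; table names are illustrative):
set timescaledb.remote_data_fetcher = 'rowbyrow';
-- two distributed hypertables in one query: row-by-row cannot be used,
-- so a descriptive error is now raised instead of the PQresult one
select * from metrics_dist m join events_dist e on m.time = e.time;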
The gapfill mechanism for detecting an aggregation group change used
datumIsEqual to compare the group values. datumIsEqual does not detoast
values, so when one value is toasted and the other is not, it does not
return the correct result. This patch changes the gapfill code to use the
equality operator for the type of the group column instead of
datumIsEqual.
This patch fixes the param handling in prepared statements for generic
plans in ChunkAppend, making those params usable in chunk exclusion.
Previously those params would not be resolved and therefore not used for
chunk exclusion.
Fixes#3719
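A sketch of the fixed scenario (names are illustrative):
prepare stmt as select * from metrics where time > $1;
-- once PostgreSQL switches to a generic plan, the runtime value of $1
-- is now resolved and used by ChunkAppend for chunk exclusion
execute stmt(now() - '1 day'::interval);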
When executing multinode queries that initialize the row-by-row fetcher
but never execute it, the node cleanup code would hit an assertion
checking the state of the fetcher. Found by sqlsmith.
An "empty" bytea value in a column of a distributed table was returned as
"null" when selected. The actual value was stored appropriately on the
data nodes, but the return code path converted it into "null" on the
access node. This is now handled via the PQgetisnull() function.
Fixes#3455
This patch transforms constraints on hash-based space partitions to make
them usable by postgres constraint exclusion.
If we have an equality condition on a space partitioning column, we add
a corresponding condition on get_partition_hash on this column. These
conditions match the constraints on chunks, so postgres' constraint
exclusion is able to use them and exclude the chunks.
The following transformations are done:
device_id = 1
becomes
((device_id = 1) AND (_timescaledb_internal.get_partition_hash(device_id) = 242423622))
s1 = ANY ('{s1_2,s1_2}'::text[])
becomes
((s1 = ANY ('{s1_2,s1_2}'::text[])) AND
(_timescaledb_internal.get_partition_hash(s1) = ANY ('{1583420735,1583420735}'::integer[])))
These transformations are not visible in EXPLAIN output as we remove
them again after hypertable expansion is done.
For certain inserts on a distributed hypertable, e.g., involving CTEs
and upserts, plans can be generated that weren't properly handled by
the DataNodeCopy and DataNodeDispatch execution nodes. In particular,
the nodes expect ChunkDispatch as a child node, but PostgreSQL can
sometimes insert a Result node above ChunkDispatch, causing a crash.
Further, behavioral changes in PG14 also caused the DataNodeCopy node
to sometimes wrongly believe a RETURNING clause was present. The check
for returning clauses has been updated to fix this issue.
Fixes#4339
Postgres knows whether a given aggregate is parallel-safe, and creates
parallel aggregation plans based on that. The `partialize_agg` is a
wrapper we use to perform partial aggregation on data nodes. It is a
pure function that produces serialized aggregation state as a result.
Being pure, it doesn't influence parallel safety. This means we don't
need to mark it parallel-unsafe to artificially disable the parallel
plans for partial aggregation. They will be chosen as usual based on
the parallel-safety of the underlying aggregate function.
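A sketch of the wrapper's use (the schema-qualified name is an
assumption):
-- produces serialized partial aggregation state; the plan below it can
-- now be parallel if avg(value) itself is parallel-safe
select _timescaledb_internal.partialize_agg(avg(value)) from metrics;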
When dealing with intervals with a month component, timezone changes can
result in multiple-day differences in the outcome of these calculations
due to different month lengths. When dealing with months we add a 7 day
safety buffer.
For all these calculations it is fine if we exclude fewer chunks than
strictly required; additional exclusion with exact values will happen in
the executor. But under no circumstances must we exclude too much,
because there would be no way for the executor to get those chunks back.
The initial patch to use now() expressions during planner hypertable
expansion only supported intervals with no day or month component.
This patch adds support for intervals with day component.
If the interval has a day component then the calculation needs to take
daylight saving time switches into account, because a day is then not
always exactly 24 hours. We mitigate this by adding a safety buffer for
these dst switches when dealing with intervals with a day component.
These calculations will be repeated with exact values during execution.
Since dst switches seem to range between -1 and 2 hours we set
the safety buffer to 4 hours.
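Conceptually, for an interval with a day component (names are
illustrative):
select * from metrics where time > now() - '3 days'::interval;
-- for plan-time exclusion the constified bound is widened by the buffer:
--   time > 'plan-time now()' - '3 days'::interval - '4 hours'::interval
-- the exact now() comparison is still re-checked during execution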
This patch also refactors the tests since the previous tests made it hard
to tell the feature was working after the constified values had been
removed from the plans.
Commit 35ea80ff added an optimization to enable expressions with
now() to be used during plan-time chunk exclusion by constifying
the now() expression. The added constified constraints were left
in the plan even though they were only required during the
hypertable expansion. This patch marks those constified constraints
and removes them once they are no longer required.
The table metrics_dist1 was only used by a single test and therefore
should not be part of shared_setup but instead be created in the test
that actually uses it. This reduces the execution time of
regresscheck-shared when that test is not run.
This patch fixes the constify_now optimization to ignore Vars of a
different level. Previously this could lead to an assertion failure,
because the varno of such a Var might be bigger than the number of
entries in the range table. Found by sqlsmith.
Commit 3b35da76 changed the setup script for regresscheck-shared so it
was no longer usable directly by the sqlsmith workflow. This patch sets
TEST_DBNAME at the top of the script so it is easier to use the script
outside of the regression check environment.
This implements an optimization to allow now() expressions to be used
during plan-time chunk exclusion. Since now() is stable it would not
normally be considered for plan-time chunk exclusion.
To enable this behaviour we convert `column > now()` expressions
into `column > const AND column > now()`. Assuming that time
always moves forward this is safe even for prepared statements.
This optimization works for SELECT, UPDATE and DELETE.
On hypertables with many chunks this can lead to a considerable
speedup for certain queries.
The following expressions are supported:
- column > now()
- column >= now()
- column > now() - Interval
- column > now() + Interval
- column >= now() - Interval
- column >= now() + Interval
Interval must not have a day or month component as those depend
on timezone settings.
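Conceptually, the transformation looks like this (the constant stands for
the plan-time value of now()):
select * from metrics where time > now() - '5m'::interval;
-- is planned as if written with an additional constified constraint:
--   where time > 'plan-time now()' - '5m'::interval
--     and time > now() - '5m'::interval
The constified constraint enables plan-time chunk exclusion while the
original now() condition still filters exactly at execution time.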
Some microbenchmarks to show the improvement; each timing is the best of
five runs.
-- hypertable with 1k chunks
-- with optimization
select * from metrics1k where time > now() - '5m'::interval;
Time: 3.090 ms
-- without optimization
select * from metrics1k where time > now() - '5m'::interval;
Time: 145.640 ms
-- hypertable with 5k chunks
-- with optimization
select * from metrics5k where time > now() - '5m'::interval;
Time: 4.317 ms
-- without optimization
select * from metrics5k where time > now() - '5m'::interval;
Time: 775.259 ms
-- hypertable with 10k chunks
-- with optimization
select * from metrics10k where time > now() - '5m'::interval;
Time: 4.853 ms
-- without optimization
select * from metrics10k where time > now() - '5m'::interval;
Time: 1766.319 ms (00:01.766)
-- hypertable with 20k chunks
-- with optimization
select * from metrics20k where time > now() - '5m'::interval;
Time: 6.141 ms
-- without optimization
select * from metrics20k where time > now() - '5m'::interval;
Time: 3321.968 ms (00:03.322)
Speedup with 1k chunks: 47x
Speedup with 5k chunks: 179x
Speedup with 10k chunks: 363x
Speedup with 20k chunks: 540x
The general idea is to have two types of fetcher: "fast" and "general
purpose". We use the row-by-row fetcher as the "fast" one. This commit
removes support for the text protocol in this fetcher, because it is only
needed for some niche types that have no binary serialization, and it is
also slower than the binary protocol. Because the row-by-row fetcher now
only understands the binary protocol, we must check that binary
serialization is actually available for the participating data types. If
not, we have to fall back to the cursor fetcher unless row-by-row was
explicitly requested by the user. This happens at execution time, more
precisely at the creation of the TupleFactory, because that is when we
look up the conversion functions.
The rest of the commit removes the text protocol support from row-by-row,
plus EXPLAIN changes (we no longer know the fetcher type at the planning
stage, so it is not shown).
Allow calls to time_bucket_gapfill to be executed on the data nodes for
improved query performance. With this, time_bucket_gapfill is pushed down
to the data nodes under the following conditions:
1. when only one data node has all the chunks
2. when the space dimension does not overlap across data nodes
3. when the group-by matches the space dimension
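A sketch of a query satisfying condition 3, assuming metrics_dist is
space-partitioned by device_id (names are illustrative):
-- the group-by matches the space dimension, so the gapfill call can be
-- executed on the data nodes
select time_bucket_gapfill('1 hour', time) as hour, device_id, avg(value)
from metrics_dist
where time >= '2022-01-01' and time < '2022-01-02'
group by hour, device_id;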
This patch changes the extension function list to include the signature
as well, since functions with different signatures are separate objects
in postgres. This also changes the list to include all functions. Even
though functions in internal schemas are not considered public API, they
still need to be treated the same as functions in other schemas with
regards to extension upgrade/downgrade.
This patch also moves the test to regresscheck-shared since we do not use
a dedicated database to run these tests.
Change the prefix for continuous aggregate tests from
continuous_aggs_ to cagg_. This is similar to commit 6a8c2b66
which did this adjustment for isolation tests because we were
running into length limitations for the spec name. This patch
adjusts the remaining tests to be consistent with the naming
used in isolation tests.
When interpolating float values the result of the calculation
might be unstable for certain values when y0 and y1 are equal.
This patch short-circuits the formula and returns y0 immediately when y0
and y1 are identical.
Fixes#1528
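A sketch of the kind of instability involved, assuming a weighted
interpolation form (an assumption; the exact formula in the source may
differ):
-- y0 = y1 = 0.1, x0 = 0, x1 = 3, x = 1; the exact result should be 0.1
select (0.1::float8 * (3 - 1) + 0.1::float8 * (1 - 0)) / 3;
-- returns 0.10000000000000002 due to floating point rounding;
-- returning y0 directly when y0 = y1 avoids this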
This patch fixes subtract_integer_from_now on 32-bit platforms, improves
error handling and adds some basic tests. subtract_integer_from_now would
trigger an assert when called on a hypertable without an integer time
dimension (found by sqlsmith). Additionally it would segfault when called
on a hypertable without partitioning dimensions.
When getting the next tuple from the subplan, gapfill would apply the
projection to it. This was incorrect since the subplan already did the
projection; the projection for the gapfill tuple has to be done when the
tuple is handed to the parent node.
Fixes#3834
When a query has a filter that only needs to be evaluated once per query,
it is represented as a Result node with the filter condition attached and
the actual query as a child of the Result node.
find_data_node_scan_state_child did not consider a Result node a valid
node to contain a DataNodeScan node, leading to an
`unexpected child node of Append or MergeAppend: 62` error for queries
that had a one-time filter with a subquery.
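A sketch of a query shape that triggered the error (names are
illustrative):
-- the uncorrelated subquery becomes a one-time filter, so the planner
-- puts a Result node between the Append and the DataNodeScan
select * from metrics_dist where (select true);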
The code in cursor_fetcher_rewind asserted that there is always an
associated request, which is not true if EOF was already reached. Found
by sqlsmith.
Fixes#3786
The row-by-row fetcher is more efficient, so we want to use it when we
can -- that is, when we have to read only one table from the data node,
without interleaving it with anything else. This patch adds an
option of choosing the fetcher type automatically. It detects the
simplest case of only one distributed table in the entire query, and
enables row-by-row fetcher. For other cases, the cursor fetcher is
used.
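A sketch of selecting the new behavior, assuming the
timescaledb.remote_data_fetcher setting (the setting name is an
assumption):
set timescaledb.remote_data_fetcher = 'auto';
-- only one distributed hypertable in the query: row-by-row is chosen
select * from metrics_dist where time > now() - '1 day'::interval;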
Previously, we would push DISTINCT ON down to the data nodes even when
the pathkeys of the resulting paths on the data nodes were not
compatible with the given DISTINCT ON columns. This commit disables
pushdown when the sorting is not compatible.
Fixes#3784
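A sketch of a pushdown-compatible query (names are illustrative):
-- the ORDER BY leads with the DISTINCT ON column, so the data-node paths
-- can produce compatibly sorted rows and the DISTINCT ON is pushed down
select distinct on (device_id) device_id, time, value
from metrics_dist
order by device_id, time desc;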
This patch adds support for transparent decompression in queries
on individual chunks.
This is required for distributed hypertables with compression
when enable_per_data_node_queries is set to false. Without
this functionality queries on distributed hypertables with
compression would not return data for compressed chunks as
the generated FDW queries would target individual chunks.
Fixes#3714
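A sketch of the affected configuration (the setting prefix is an
assumption):
set timescaledb.enable_per_data_node_queries = false;
-- the generated FDW queries target individual chunks, which with this
-- patch are transparently decompressed
select * from metrics_dist where time > now() - '1 day'::interval;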
PostgreSQL 14 introduced the new Memoize node that serves as a cache of
results from parameterized nodes.
We should make sure it works correctly together with the ChunkAppend
custom node over hypertables (compressed and uncompressed).
Closes#3684
With memoize enabled, PG14 append tests produce a very different plan
compared to previous PG versions. To make comparing plans between PG
versions easier we disable memoize for PG14.
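For reference, memoize is disabled via the standard PG14 setting:
set enable_memoize to off;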
PG14 also modified how EXTRACT is shown in EXPLAIN output
so any query using EXTRACT will have different EXPLAIN output
between PG14 and previous versions.