This patch changes the code that blocks modifications to frozen
chunks to use a custom node instead of triggers. Frozen chunks are
a TimescaleDB-internal object and should therefore not be protected
by a TRIGGER, which is external and creates several hazards. First,
triggers created to protect internal state contend with user-created
triggers. Second, the trigger created to protect frozen chunks does
not work well with our restoring GUC, which we use when restoring
logical dumps. Third, triggers do not cover internal operations; they
only work in code paths that explicitly added trigger support.
The commit 9f4dcea30135d1e36d1c452d631fc8b8743b3995 introduces frozen
chunks. So far, checking whether a chunk is frozen has been done in
the query planner. If the planner cannot determine which chunks are
affected by a query (e.g., due to a cast in the WHERE condition), all
chunks are checked. This leads (1) to increased planning time and
(2) to the situation that a single frozen chunk could reject queries,
even if the frozen chunk is not addressed by the query.
A chunk in the frozen state cannot be dropped.
drop_chunks will skip over frozen chunks without erroring.
The internal API drop_chunk will error if you attempt to drop
a frozen chunk without unfreezing it first.
This PR also adds a new internal API to unfreeze a chunk.
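Roughly, the resulting behavior looks as follows (hypertable and
chunk names are illustrative, and placing the internal functions in
the _timescaledb_internal schema is an assumption):

    -- freeze a chunk, then try to drop it directly
    SELECT _timescaledb_internal.freeze_chunk('_timescaledb_internal._hyper_1_1_chunk');
    SELECT _timescaledb_internal.drop_chunk('_timescaledb_internal._hyper_1_1_chunk');  -- errors: chunk is frozen
    -- drop_chunks() on the hypertable skips the frozen chunk without erroring
    SELECT drop_chunks('conditions', older_than => INTERVAL '1 month');
    -- unfreeze first, then the chunk can be dropped
    SELECT _timescaledb_internal.unfreeze_chunk('_timescaledb_internal._hyper_1_1_chunk');
    SELECT _timescaledb_internal.drop_chunk('_timescaledb_internal._hyper_1_1_chunk');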
Add _timescaledb_internal.attach_osm_table_chunk.
This treats a pre-existing foreign table as a
hypertable chunk by adding dummy metadata to the
catalog tables.
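A rough usage sketch, assuming the function takes the hypertable and
the existing foreign table as regclass arguments (table names are
illustrative):

    -- register an existing OSM foreign table as a chunk of the hypertable
    SELECT _timescaledb_internal.attach_osm_table_chunk('conditions', 'osm_chunk_2020_01');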
Add an internal API to drop a single chunk.
This function drops the storage and metadata
associated with the chunk.
Note that chunk dependencies are not affected;
e.g., continuous aggregates are not updated when
the chunk is dropped.
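A minimal sketch, assuming the function is
_timescaledb_internal.drop_chunk(chunk REGCLASS) (chunk name
illustrative):

    -- drop one chunk's storage and metadata; dependent continuous aggregates are not refreshed
    SELECT _timescaledb_internal.drop_chunk('_timescaledb_internal._hyper_1_2_chunk');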
This is an internal function to freeze a chunk
for PG14 and later.
It sets the chunk status to frozen. Operations
that modify the chunk data (INSERT, UPDATE,
DELETE) are not supported. Frozen chunks can be
dropped.
Additionally, chunk status is cached as part of
classify_relation.
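A short sketch of the intended usage (hypertable and chunk names
illustrative; the schema-qualified function name is an assumption):

    -- mark a chunk as frozen; data-modifying operations on it are rejected afterwards
    SELECT _timescaledb_internal.freeze_chunk('_timescaledb_internal._hyper_1_3_chunk');
    INSERT INTO conditions VALUES ('2020-01-01', 1, 22.5);  -- errors if the row falls into the frozen chunk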
When executing `recompress_chunk` and a query at the same time, a
deadlock can be generated because the chunk relation, the chunk
index, and the compressed and uncompressed chunks are locked in
different orders. In particular, while `recompress_chunk` is
executing, it first decompresses the chunk and, as part of that,
locks the uncompressed chunk index in AccessExclusive mode; when
compressing the chunk again, it then tries to lock the uncompressed
chunk in AccessExclusive mode as part of truncating it.
Note that `decompress_chunk` and `compress_chunk` lock the relations
in the same order; the issue arises because the two procedures are
combined into a single transaction.
To avoid the deadlock, this commit rewrites `recompress_chunk` as a
procedure and adds a commit between the decompression and compression
steps. Committing the transaction after the decompression allows
reads and inserts to proceed by working on the uncompressed chunk,
and the compression part of the procedure takes the necessary locks
in strict order, thereby avoiding a deadlock.
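Since the rewritten `recompress_chunk` commits internally, it has to
be invoked with CALL outside an explicit transaction block; a minimal
sketch (chunk name illustrative):

    -- recompress a compressed chunk that has received new uncompressed data
    CALL recompress_chunk('_timescaledb_internal._hyper_1_4_chunk');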
In addition, the isolation test is rewritten: instead of adding a
waitpoint in the PL/pgSQL function, the test takes a lock on the
compressed table after the decompression.
Fixes #3846
Remove copy_chunk_data() function and code needed to support it,
such as the 'transactional' argument.
Rework copy chunk logic using separate stages.
Introduce the copy_chunk() API function as an internal wrapper for
move_chunk().
Add internal copy_chunk_data() function which implements a way
to copy chunk data between data nodes using logical
replication.
This patch was prepared together with @nikkhils.
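A hedged sketch of the resulting call shape, assuming the user-facing
wrapper is exposed as a procedure taking the chunk plus source and
destination data node names (schema, procedure, and parameter names
are assumptions):

    -- copy a chunk from one data node to another using logical replication
    CALL timescaledb_experimental.copy_chunk(
        '_timescaledb_internal._dist_hyper_1_1_chunk',
        source_node => 'data_node_1',
        destination_node => 'data_node_2');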
The `create_chunk` API has been extended to allow creating a chunk
from an existing relational table. The table is turned into a chunk by
attaching it to the root hypertable via inheritance.
The purpose of this functionality is to allow copying a chunk to
another node. First, the chunk table and data is copied. After that,
the `create_chunk` can be executed to make the new table part of the
hypertable.
Currently, the relational table used to create the chunk has to match
the hypertable in terms of constraints, triggers, etc. PostgreSQL
itself enforces the existence of same-named CHECK constraints, but no
enforcement currently exists for other objects, including triggers
and UNIQUE, PRIMARY KEY, or FOREIGN KEY constraints. Such enforcement can
be implemented in the future, if deemed necessary. Another option is
to automatically add all the required objects (triggers, constraints)
based on the hypertable equivalents. However, that might also lead to
duplicate objects in case some of them exist on the table prior to
creating the chunk.
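A hedged sketch of the extended call, assuming the existing table is
passed via a chunk_table regclass argument and the slices use the
same JSON format as the chunk export function (all names and values
illustrative):

    -- turn the pre-populated table copied_chunk_data into a chunk of the hypertable
    SELECT _timescaledb_internal.create_chunk(
        'conditions',
        '{"time": [1482969600000000, 1483574400000000]}',
        chunk_table => 'copied_chunk_data');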
Adds an internal API function to create an empty chunk table
according to the given hypertable for the given chunk table name and
dimension slices. The function creates a chunk table inheriting from
the hypertable, so it guarantees the same schema. No TimescaleDB
metadata is updated.
To be able to create the chunk table in a tablespace attached to the
hypertable, this commit allows calculating the tablespace id without
the dimension slice existing in the catalog.
If there is already a chunk that collides on the dimension slices,
the function fails to create the chunk table.
The function will be used internally in multi-node to be able to
replicate a chunk from one data node to another.
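A hedged sketch, assuming the function is named create_chunk_table
and takes the hypertable, the dimension slices as JSON, and the
schema and name of the table to create (all names illustrative):

    -- create an empty table with the chunk's schema, without updating TimescaleDB metadata
    SELECT _timescaledb_internal.create_chunk_table(
        'conditions',
        '{"time": [1482969600000000, 1483574400000000]}',
        '_timescaledb_internal',
        'chunk_copy_target');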
This change fixes the stats collecting code to also return the slot
collation fields for PG12. This fixes a bug (#2093) where running an
ANALYZE in PG12 would break queries on distributed tables.
This change adds a new command to return a subset of the column
stats for a hypertable (column width, percent null, and percent
distinct). As part of the execution of this command on an access
node, these stats will be collected for distributed chunks and
updated on the access node.
A new function, `get_chunk_relstats()`, allows fetching relstats
(basically `pg_class.{relpages,reltuples}`) from remote chunks on data
nodes and writing it to the `pg_class` entry for the corresponding
local chunk. The function expects either a chunk or a hypertable as
input and returns the relstats for the given chunk or all chunks for
the given hypertable, respectively.
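A short usage sketch, assuming the function lives in the
_timescaledb_internal schema (hypertable name illustrative):

    -- import relpages/reltuples for all chunks of a distributed hypertable
    SELECT * FROM _timescaledb_internal.get_chunk_relstats('conditions');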
Importing relstats as described is useful as part of a distributed
ANALYZE/VACUUM that won't require fetching all data into the access
node for local sampling (like the current implementation does).
In a future change, this function will be called as part of a local
ANALYZE on the access node that runs ANALYZE on all data nodes
followed by importing of the resulting relstats for the analyzed
chunks.
This change fixes the following:
* Refactor the code for setting the default data node for a chunk. The
`set_chunk_default_data_node()` API function now takes a
`regclass`/`oid` instead of separate schema + table names and
returns `true` when a new data node is set and `false` if called
with a data node that is already the default. Like before,
exceptions are thrown on errors. It also does proper permissions
checks. The related code has been moved from `data_node.c` to
`chunk.c` since this is an operation on a chunk, and the code now
also lives in the `tsl` directory since this is non-trivial logic
that should fall under the TSL license.
* When setting the default data node on a chunk (failing over to
another data node), it is now verified that the new data node
actually has a replica of the chunk and that the corresponding
foreign server belongs to the "right" foreign data wrapper.
* Error messages and permissions handling have been tweaked.
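A hedged sketch of the refactored `set_chunk_default_data_node()`
call, assuming the second argument is the data node name (chunk and
node names illustrative):

    -- fail over the chunk's default data node to another node that holds a replica
    SELECT set_chunk_default_data_node('_timescaledb_internal._dist_hyper_1_1_chunk', 'data_node_2');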
This change ensures that chunk replicas are created on remote
(datanode) servers whenever a chunk is created in a local distributed
hypertable.
Remote chunks are created using the `create_chunk()` function, which
has been slightly refactored to allow specifying an explicit chunk
table name. The node making the remote call also records the
resulting remote chunk IDs in its `chunk_server` mappings table.
Since remote command invocation without super-user permissions
requires password authentication, the test configuration files have
been updated to require password authentication for a cluster test
user that is used in tests.
A frontend node will now maintain mappings from a local chunk to the
corresponding remote chunks in a `chunk_server` table.
The frontend creates local chunks as foreign tables and adds entries
to `chunk_server` for each chunk it creates on a remote data node.
Currently, the creation of remote chunks is not implemented, so a
dummy chunk_id for the remote chunk will be added instead for testing
purposes.
This adds an internal API function to create a chunk using explicit
constraints (dimension slices). A function to export a chunk in a
format consistent with the chunk creation function is also added.
The chunk export/create functions are needed for distributed
hypertables so that an access node can create chunks on data nodes
according to its own (global) partitioning configuration.
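A hedged sketch of how the export and create functions are meant to
compose, assuming the exported slices use the same JSON format that
create_chunk accepts (names and values illustrative):

    -- export a chunk's definition, including its dimension slices
    SELECT * FROM _timescaledb_internal.show_chunk('_timescaledb_internal._hyper_1_1_chunk');
    -- create an equivalent chunk (e.g., on a data node) from explicit constraints
    SELECT * FROM _timescaledb_internal.create_chunk(
        'conditions',
        '{"time": [1482969600000000, 1483574400000000]}');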
The following functions have had permission checks
added or adjusted:
ts_chunk_index_clone
ts_chunk_index_replace
ts_hypertable_insert_blocker_trigger_add
ts_current_license_key
ts_calculate_chunk_interval
ts_chunk_adaptive_set
The following functions have been removed from the regular SQL install.
They are only installed and used in tests:
dimension_calculate_default_range_open
dimension_calculate_default_range_closed
We replace chunk_for_tuple with chunk_id_from_relid for getting
chunk id fields when materializing continuous aggs. The old
function required passing in the entire row. This was very slow
because a lot of data was passed around at execution time.
The new function just uses the internal `tableoid` attribute to
convert the table relid to a chunk_id. This is much more efficient.
We also add memoization to the new function because it is most often
called consecutively for the same chunk.
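A short sketch of the new lookup, assuming the function is exposed as
_timescaledb_internal.chunk_id_from_relid (hypertable name
illustrative):

    -- resolve the chunk id for each row via the tableoid system column
    SELECT _timescaledb_internal.chunk_id_from_relid(tableoid) AS chunk_id, count(*)
    FROM conditions
    GROUP BY 1;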
In some cases a user might already know which chunks need to be
scanned to answer a particular query. Using the `chunks_in` function,
we can skip calculating the chunks involved in a particular query,
which should also result in better performance.
A simple example:
`SELECT * FROM hypertable WHERE chunks_in(hypertable, ARRAY[1,2])`
We've decided to adopt the ts_ prefix on all exported C functions in
order to avoid having symbol conflicts with future postgres functions.
We've already started using this prefix on new functions and this commit
adds the prefix to the old functions.
Users can now (optionally) set a target chunk size and TimescaleDB
will try to adapt the interval length of the first open ("time")
dimension in order to reach that target chunk size. If a hypertable
has more than one open dimension, only the first one will have a
dynamically adapting interval.
Users can optionally specify their own function that calculates the
new dimension interval. They can also set a target size of 0 in order
to estimate a suitable target size for a chunk based on available
memory.
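A hedged sketch of the user-facing call, assuming a
set_adaptive_chunking function that takes the hypertable and a target
chunk size (names illustrative):

    -- aim for roughly 1 GB chunks on the first open ("time") dimension
    SELECT * FROM set_adaptive_chunking('conditions', '1GB');
    -- a target size of '0' asks TimescaleDB to estimate a suitable size from available memory
    SELECT * FROM set_adaptive_chunking('conditions', '0');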
This PR adds the ability to have multiple different versions of the timescaledb
extension be used by different databases in the same PostgreSQL
instance (server).
This is accomplished by splitting this extension into two .so files.
1) timescaledb.so -- stuff under loader/. Really not a lot of code.
This code MUST be backwards compatible in the future.
2) timescaledb-version.so (most of our code). Need
not be backwards compatible.
Timescaledb.so becomes a small stub which is preloaded and whose main
reason for existing is to dynamically load the right
timescaledb-version.so when the time comes.
This change allows either of the above .so files to be loaded in
shared_preload_libraries. But timescaledb.so allows for multiple
versions used on different databases in the same instance along
with smoother upgrades. Using timescaledb-version.so allows for
finer-grained control and lock-in and is appropriate in only a few
production environments.
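For illustration, preloading the stub loader then looks like this (a
sketch; the setting requires a server restart to take effect):

    -- preload the loader stub, which picks the right versioned .so per database
    ALTER SYSTEM SET shared_preload_libraries = 'timescaledb';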
This PR also adds version checking so that a clear failure message
will be displayed if the .so version does not match the SQL extension
version.
To support multi-version functionality we changed the way SQL update
scripts are generated. Previously, the system used a bunch of
intermediate upgrade scripts. So with 3 versions, you would have an
update script of 1--2, 2--3. But, this PR changes things so that we
produce direct "shortcut" update files: 1--3, 2--3.
This is done for 2 reasons:
1) Each of the update files should point to
$libdir/timescaledb-current_version, since you cannot guarantee that
the .so for each intermediate version has been installed.
2) You don't want intermediate version updates installed without the
.so. For example, if you have versions 1, 2, and 3 and you are
installing version 3, you want the update files 1--3 and 2--3 but not
1--2, because with 1--2 present a user could do ALTER EXTENSION
timescaledb UPDATE TO 2 even though the .so for version 2 may not be
installed.
In order to test this functionality, we add a mock extension version
.so so that we can test extension loading inside the regression
framework.
This change is part of an effort to create a consistent way
of dealing with metadata catalog updates, which is currently
a mix of C API and INSERT/UPDATE/DELETE statements from SQL
code. This mix makes catalog handling unnecessarily complex as
there are multiple ways to update metadata, increasing the risk
of security issues with publicly exposed SQL functions. It also
complicates things like cache invalidation, requiring different
mechanisms for C and SQL code. Catalog updates from SQL code
require triggers on metadata tables for cache invalidation that
do not work with native catalog updates.
The creation of chunks has been particularly messy in this regard,
making the code hard to follow. Especially the handling of a chunk's
constraints, where dimensional and other constraints were handled
differently. With this change, constraint handling is now consistent
across constraint types with a single API for updating metadata.
Reduce memory usage for out-of-order inserts
The chunk_result_relation_info should be put on the chunk memory
context. This will cause the rri constraint expr to also go onto
that context and be correctly freed when the chunk insert state
is destroyed.
This change refactors the chunk index handling to make better use
of standard PostgreSQL catalog information, while removing the
hypertable_index metadata table and associated triggers, including
those on the chunk_index table. The chunk_index table itself is
also simplified.
A benefit of this refactoring is that indexes are no longer
created using string mangling to construct the CREATE INDEX command
for a chunk, based on the string definition of the hypertable
index. Instead, indexes are created in C using proper index-related
internal data structures.
Chunk indexes can now also be renamed and are added in the parent
index tablespace. Changing tablespace on a hypertable index also
recurses to chunks, as expected. Default indexes that are added when
creating a hypertable use the hypertable's tablespace.
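A hedged illustration of the resulting behavior (table, index, and
tablespace names illustrative):

    -- the index is created on every chunk, using the parent index's tablespace
    CREATE INDEX conditions_device_idx ON conditions (device_id) TABLESPACE fast_ssd;
    -- changing the tablespace of the hypertable index recurses to the chunk indexes
    ALTER INDEX conditions_device_idx SET TABLESPACE history;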
Creating hypertable indexes with the CONCURRENTLY modifier is
currently blocked, due to unclear semantics regarding concurrent
creation over many tables, including how to deal with snapshots.
The update test script compares a TimescaleDB instance, which is
updated to the latest version from an older version, to a cleanly
installed one at the latest version. It does this by inserting the
same data on both and comparing the data and metadata tables. This
comparison breaks if the new version has changed some behavior (e.g.,
chunk indexes might use a different naming scheme), because the
updated version will retain the metadata related to the old behavior
while the latest version will create metadata according to the new
behavior.
To fix this issue, the update script now performs a database backup on
the old version and restores it on the cleanly installed version of
TimescaleDB. This retains the metadata in accordance with the old
behavior, although the new version will create new metadata according
to the new behavior. This has the further benefit of testing the
restore process during upgrades, which is probably the way people will
move to a new version of TimescaleDB if they do not update the
extension with the data in place.
To capture errors related to broken update scripts, the update test
now includes an integrity test for some metadata tables.
Applying triggers to chunks requires taking the definition
of a trigger on a hypertable and executing it on a chunk. Previously
this was done with string replacement in the trigger definition.
This was not especially safe, and thus we moved the logic to C
where we can do proper parsing/deparsing and replacement of the table
name. Another positive aspect is that we got rid of some DDL triggers.
Streamline code and remove triggers from chunk and
chunk_constraint. Lots of additional cleanup. Also removes need to CASCADE
hypertable drops (fixes #88).
This PR adds support for primary-key, foreign-key, unique, and
exclusion constraints. CHECK and NOT NULL constraints were supported
previously. Foreign key constraints where a hypertable references a
plain table are now supported (while the reverse, where a plain table
references a hypertable, is still not).
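A short illustration of the supported direction (table and column
names illustrative):

    -- supported: the hypertable references a plain table
    CREATE TABLE devices (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE conditions (
        time        TIMESTAMPTZ NOT NULL,
        device_id   INTEGER REFERENCES devices (id),
        temperature DOUBLE PRECISION,
        UNIQUE (time, device_id)
    );
    SELECT create_hypertable('conditions', 'time');
    -- not (yet) supported: a plain table with a foreign key referencing the hypertable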
When new chunks are created, the calculated chunk hypercube might
collide or not align with existing chunks when partitioning has
changed in one or more dimensions. In such cases, the chunk should be
cut to fit the alignment criteria and any collisions should be
resolved. Unfortunately, alignment and collision detection wasn't
properly handled.
This refactoring adds proper axis-aligned bounding box collision
detection generalized to N dimensions. It also correctly handles
dimension alignment.
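For reference, the overlap test for two N-dimensional hypercubes A
and B with half-open slice ranges [start, end) can be sketched as:

    \text{collide}(A, B) \iff \forall i \in \{1, \dots, N\}:\; A.start_i < B.end_i \;\wedge\; B.start_i < A.end_i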
Previously dimension_slice would only allow ranges to start and
end with values >= 0. However, this did not allow for timestamps
prior to 1970 because those times would be represented by negative
integers. With this PR we now support dates prior to 1970.
Addresses issue #153.
This adds support for all types of triggers on a hypertable except
INSERT AFTER. UPDATE and DELETE ROW triggers are automatically copied
from the hypertable onto its chunks, so any such trigger defined on
the parent hypertable will apply to any row in any of the chunks as
well. STATEMENT-level triggers and INSERT triggers need not be copied
in this way.
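A short illustration (trigger, function, and table names are
illustrative; the deletion_log table is assumed to exist):

    -- a row-level trigger on the hypertable applies to rows in every chunk
    CREATE FUNCTION record_deletion() RETURNS trigger AS $$
    BEGIN
        INSERT INTO deletion_log VALUES (OLD.time, OLD.device_id, now());
        RETURN OLD;
    END;
    $$ LANGUAGE plpgsql;

    CREATE TRIGGER log_deletes
        BEFORE DELETE ON conditions
        FOR EACH ROW EXECUTE PROCEDURE record_deletion();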
It is not an error for range_end of a new chunk to be equal to
range_start of an existing chunk because range_end is exclusive.
Fix logic not to throw an error in that case.
Previously, catalog tables were not fully protected from malicious
non-superusers. This PR fixes permission handling by severely
restricting permissions on the catalog and instead using SECURITY
DEFINER functions to alter the catalog when needed, without giving
users permission to do those same operations outside of these
functions. In addition, these functions check for proper permissions
themselves, so they are safe to use.
This PR also makes sure that chunk tables have the same owner as the
hypertable and correctly handles `ALTER TABLE...OWNER TO` commands to
keep this info in sync.