66 Commits

Author SHA1 Message Date
Sven Klemm
b2a91494a1 Move ddl_internal functions to _timescaledb_functions schema
To increase schema security we do not want to mix our own internal
objects with user objects. Since chunks are created in the
_timescaledb_internal schema, our internal functions should live in
a different, dedicated schema. This patch makes the necessary
adjustments for the following functions (an example call follows
the list):

- chunk_constraint_add_table_constraint(_timescaledb_catalog.chunk_constraint)
- chunk_drop_replica(regclass,name)
- chunk_index_clone(oid)
- chunk_index_replace(oid,oid)
- create_chunk_replica_table(regclass,name)
- drop_stale_chunks(name,integer[])
- health()
- hypertable_constraint_add_table_fk_constraint(name,name,name,integer)
- process_ddl_event()
- wait_subscription_sync(name,name,integer,numeric)
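
A minimal sketch of the resulting change for a caller, using health()
from the list above:

    -- before this patch
    SELECT * FROM _timescaledb_internal.health();
    -- after this patch
    SELECT * FROM _timescaledb_functions.health();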
2023-08-29 11:15:39 +02:00
Dmitry Simonenko
5813173e07 Introduce drop_stale_chunks() function
This function drops chunks on a specified data node if those chunks are
not known by the access node.

Call drop_stale_chunks() automatically when a data node becomes
available again.

Fix #4848
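
A minimal usage sketch, assuming a data node named 'dn1' (the node
name is illustrative; the chunk-ID array from the signature above is
passed explicitly as NULL here):

    -- drop chunks on 'dn1' that the access node does not know about
    SELECT _timescaledb_internal.drop_stale_chunks('dn1', NULL);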
2022-11-23 19:21:05 +02:00
Erik Nordström
4b05402580 Add health check function
A new health check function _timescaledb_internal.health() returns the
health and status of the database instance, including any configured
data nodes (in case the instance is an access node).

Since the function also returns the health of the data nodes, it tries
hard to avoid throwing errors. An error would fail the whole function
and therefore return no node statuses, even though some of the nodes
might be healthy.

The health check on the data nodes is a recursive (remote) call to the
same function on those nodes. Unfortunately, the check will fail with
an error if a connection cannot be established to a node (or an error
occurs on the connection), which means the whole function call will
fail. This will be addressed in a future change by returning the error
in the function result instead.
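
For illustration, a minimal check of the instance (the exact result
columns are not spelled out in this message):

    SELECT * FROM _timescaledb_internal.health();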
2022-10-21 10:34:16 +02:00
Sven Klemm
a4081516ca Append pg_temp to search_path
Postgres will prepend pg_temp to the effective search_path if it
is not present in the search_path. While pg_temp will never be
used to look up functions or operators unless explicitly requested,
it will be used to look up relations. Appending pg_temp to the
search_path ensures that objects in pg_temp are considered last
and cannot be used to mask existing objects.
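
For illustration, listing pg_temp explicitly and last keeps temporary
objects from shadowing permanent ones during relation lookup:

    SET search_path TO pg_catalog, public, pg_temp;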
2022-05-03 07:55:43 +02:00
Sven Klemm
6dddfaa54e Lock down search_path in install scripts
This patch locks down search_path in extension install and update
scripts to only contain pg_catalog; this requires that any reference
in those scripts is fully qualified. Additionally, we add explicit
create commands to all update scripts for objects added to the
public schema. This change will make update scripts fail if a
function with an identical signature already exists when installing
or upgrading, instead of reusing the existing object.
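
A hedged sketch of the pattern (the function name is hypothetical):
the script pins search_path, fully qualifies every reference, and uses
plain CREATE so a pre-existing function makes the script fail instead
of being silently reused:

    SET search_path TO pg_catalog;
    -- plain CREATE (not CREATE OR REPLACE): errors out if public.my_func exists
    CREATE FUNCTION public.my_func(a pg_catalog.int4, b pg_catalog.int4)
    RETURNS pg_catalog.int4 LANGUAGE sql
    AS $$ SELECT a OPERATOR(pg_catalog.+) b $$;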
2022-02-09 17:53:20 +01:00
Dmitry Simonenko
2c66c1fd64 Introduce function to copy chunk data between data nodes
Add an internal copy_chunk_data() function which implements a way
to copy chunk data between data nodes using logical
replication.

This patch prepared together with @nikkhils.
2021-07-29 16:53:12 +03:00
Nikhil
762053431e Implement drop_chunk_replica API
This function drops a chunk on a specified data node. It then removes
the metadata about the data node/chunk association on the access node.

This function is meant for internal use as part of the "move chunk"
functionality.

If only one chunk replica remains then this function refuses to drop it
to avoid data loss.
2021-07-29 16:53:12 +03:00
Ruslan Fomkin
404f1cdbad Create chunk table from access node
Creates a table for a chunk replica on the given data node. The table
gets the same schema and name as the chunk. The created chunk replica
table is not added into metadata on the access node or data node.

The primary goal is to use it during copy/move chunk.
2021-07-29 16:53:12 +03:00
Erik Nordström
264b77eb20 Move internal API functions to experimental schema
Move the "block new chunks" functions and the chunk-based continuous
aggregate refresh function to the new experimental schema.
2021-06-09 14:47:16 +02:00
Ruslan Fomkin
791b0a4db7 Add API to refresh continuous aggregate on chunk
Function refresh_continuous_aggregate, which takes a continuous
aggregate and a chunk, is added. It refreshes the continuous aggregate
on the given chunk if there are invalidations. The function can be
used in a transaction, e.g., together with a following drop_chunks. This
allows users to create a user-defined action to refresh and drop
chunks. Therefore, the refresh on drop is removed from drop_chunks.
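
A hedged sketch of such a user-defined action (names hypothetical; the
chunk-based refresh function is assumed to be callable as
_timescaledb_internal.refresh_continuous_aggregate(cagg, chunk) at the
time of this commit):

    BEGIN;
    -- refresh the continuous aggregate on one chunk, then drop old chunks
    SELECT _timescaledb_internal.refresh_continuous_aggregate(
        'daily_summary', '_timescaledb_internal._hyper_1_10_chunk');
    SELECT drop_chunks('conditions', older_than => INTERVAL '3 months');
    COMMIT;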
2020-11-12 08:33:35 +01:00
Erik Nordström
1068188128 Make block chunks API internal
This change moves the block chunks functionality to the internal
namespace since it won't be part of the public API for the 2.0
release.

Fixes #2236
2020-09-14 12:01:43 +02:00
Sven Klemm
f89fd07c5b Remove year from SQL file license text
This changes the license text for SQL files to be identical
with the license text for C files.
2019-01-13 23:30:22 +01:00
Joshua Lockerman
47b5b7d553 Log which chunks are dropped by background workers
We don't want to do this silently, so that users are
able to debug where their chunks went.
2019-01-10 13:53:38 -05:00
Amy Tai
83014ee2b0 Implement drop_chunks in C
Remove the existing PLPGSQL function that implements drop_chunks,
replacing it with a direct call to the C function, which also implements
the old PLPGSQL checks in C. Refactor out much of the code shared between
the C implementations of show_chunks and drop_chunks.
2018-12-06 13:27:12 -05:00
Narek Galstyan
9a3402809f Implement show_chunks in C and have drop_chunks use it
TimescaleDB provides an efficient and easy-to-use API to drop individual
chunks from the database through drop_chunks. This PR builds on
that functionality: the new show_chunks function makes it possible to see
the chunks that would be dropped if drop_chunks were run.
Additionally, it adds a newer_than option to drop_chunks (also supported
by show_chunks) that allows seeing/dropping chunks in an interval or newer
than a point in time.

This commit includes:
    - Implementation of show_chunks in C
    - Additional helper functions to work with chunks
    - New version of drop_chunks in sql that uses show_chunks. This
      also adds a newer_than option to drop_chunks
    - More enhanced tests of drop_chunks and new tests for show_chunks

Among other reasons, show_chunks was implemented in C in order
to allow both the older_than and newer_than arguments to be null. This
was not possible in SQL because the arguments had to have polymorphic
types, and PL/pgSQL requires such arguments to typecheck whether or not
they are used in the function body.
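
A hedged sketch (hypertable name hypothetical; argument names as
described, though the exact signatures of this era may differ):

    -- preview the chunks that the drop below would remove
    SELECT show_chunks('conditions',
                       older_than => INTERVAL '3 months',
                       newer_than => INTERVAL '4 months');
    -- drop only chunks in that interval
    SELECT drop_chunks(older_than => INTERVAL '3 months',
                       newer_than => INTERVAL '4 months',
                       table_name => 'conditions');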
2018-11-28 13:46:07 -05:00
Joshua Lockerman
e06733acf0 Fix casing in SQL license header to be consistent with elsewhere 2018-11-15 15:18:58 -05:00
Joshua Lockerman
20ec6914c0 Add license headers to SQL files and test code 2018-10-29 13:28:19 -04:00
Joshua Lockerman
b43574f82e Switch 'IO' error prefix to 'TS'
Our error codes have an 'IO' prefix from when we were Iobeam. This commit
switches that prefix to 'TS' for consistency.
2018-09-27 13:06:11 -04:00
Joshua Lockerman
974788516a Prefix public C functions with ts_
We've decided to adopt the ts_ prefix on all exported C functions in
order to avoid having symbol conflicts with future postgres functions.
We've already started using this prefix on new functions and this commit
adds the prefix to the old functions.
2018-09-27 11:45:04 -04:00
Mike Futerko
4f2f1a6eb7 Update the error messages to conform with the style guide; Fix tests
An attempt to unify the error messages to conform with the PostgreSQL error
messages style guide. See the link below:
https://www.postgresql.org/docs/current/static/error-style-guide.html
2018-07-10 12:55:02 -04:00
Erik Nordström
6adce4cbd8 Handle TRUNCATE without upcall and handle ONLY modifier
This change refactors the handling of TRUNCATE so
that it is performed directly in process utility without
doing an upcall to PL/pgSQL.

It also adds handling for the ONLY modifier to TRUNCATE,
which shouldn't work on a hypertable. TRUNCATE now generates
an error if TRUNCATE ONLY is used on a hypertable.
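
For illustration on a hypothetical hypertable:

    TRUNCATE conditions;       -- allowed: truncates the hypertable and its chunks
    TRUNCATE ONLY conditions;  -- now raises an error for hypertables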
2018-02-01 10:12:48 +01:00
Erik Nordström
b7ebe06f2e Handle change owner without upcall
Changing the owner of a hypertable is now handled
entirely in the process utility hook without doing
an upcall to PL/pgSQL.
2018-01-31 22:42:37 +01:00
Erik Nordström
fa19a54a88 Handle deletes on metadata objects via native catalog API
Deletes on metadata in the TimescaleDB catalog have so far been a mix
of native deletes using the C-based catalog API and SQL-based DELETE
statements that CASCADE.

This mixed environment is confusing, and SQL-based DELETEs do not
consistently clean up objects that are related to the deleted
metadata.

This change moves towards a C-based API for deletes that consistently
also deletes the dependent objects (such as indexes, tables and
constraints). Ideally, we should prohibit direct manipulation of
catalog tables using SQL statements to avoid ending up in a bad state.

Once all catalog manipulations happen via the native API, we can also
remove the cache invalidation triggers on the catalog tables.
2018-01-26 21:39:12 +01:00
Erik Nordström
6e011d12fb Refactor hypertable-related API functions
This is a continuation of prior efforts to refactor API functions in C
to:

- improve usage of proper error codes
- use error messages that better conform with the PostgreSQL standard
- improve security by avoiding running lots of code under SECURITY DEFINER
- move towards doing all metadata updates using a consistent catalog API

Most importantly, `create_hypertable()` has been refactored in C,
which simplifies a lot of code that previously required
upcalls/downcalls between C code and plpgsql code, or duplicated
functionality between the two environments.
2018-01-26 18:42:20 +01:00
Erik Nordström
71962b86ec Refactor dimension-related API functions
The functions for adding and updating dimensions have been refactored
in C to:

- improve usage of proper error codes
- make messages that better conform with the PostgreSQL standard
- improve security by avoiding running lots of code under SECURITY DEFINER

A new if_not_exists option has also been added to add_dimension(),
and the number of partitions can now be set using the new
set_number_partitions() function.
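
A hedged sketch of the two additions (table and column names
hypothetical):

    SELECT add_dimension('conditions', 'device_id',
                         number_partitions => 4, if_not_exists => true);
    SELECT set_number_partitions('conditions', 8);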

A bug in the validation of smallint time intervals has been fixed. The
previous code didn't check for intervals > 0 and smallint intervals
accepted values up to UINT16_MAX instead of INT16_MAX.
2018-01-25 19:02:34 +01:00
Matvey Arye
da8cc797a4 Add support for multiple extension version in one pg instance
This PR adds the ability for multiple different versions of the timescaledb
extension to be used by different databases in the same PostgreSQL
instance (server).

This is accomplished by splitting this extension into two .so files.
1) timescaledb.so -- stuff under loader/. Really not a lot of code.
    This code MUST be backwards compatible in the future.
2) timescaledb-version.so (most of our code). Need
   not be backwards compatible.

Timescaledb.so becomes a small stub which is preloaded and whose main
reason for existing is to dynamically load the right
timescaledb-version.so when the time comes.

This change allows either of the above .so to be loaded in
shared_preload_libraries. But timescaledb.so allows for multiple
versions used on different databases in the same instance along
with smoother upgrades. Using timescaledb-version.so allows for
finer-grained control and lock-in and is appropriate in only a few
production environments.

This PR also adds version checking so that a clear failure message
will be displayed if the .so version does not match the SQL extension
version.

To support multi-version functionality we changed the way SQL update
scripts are generated. Previously, the system used a bunch of
intermediate upgrade scripts.  So with 3 versions, you would have an
update script of 1--2, 2--3.  But, this PR changes things so that we
produce direct "shortcut" update files: 1--3, 2--3.
This is done for 2 reasons:
 1) Each of the update files should point to
    $libdir/timescaledb-current_version, since you cannot guarantee
    that the .so for each intermediate version has been installed.
 2) You don't want intermediate version updates installed without the
    .so. For example, if you have versions 1, 2, and 3 and you are
    installing version 3, you want the upgrade files 1--3 and 2--3 but
    not 1--2, because with 1--2 a user could do ALTER EXTENSION
    timescaledb UPDATE TO 2 even though the .so for version 2 may not
    be installed.

In order to test this functionality, we add a mock extension version .so
that we can test extension loading inside the regression framework.
2018-01-05 12:15:54 -05:00
Erik Nordström
4df8f287a6 Add proper permissions handling for associated (chunk) schemas
A hypertable's associated schema is used to create and store internal
data tables (chunks). A hypertable creates tables in that schema,
typically with full superuser permissions, regardless of whether the
hypertable's owner or the current user has permissions on the schema.
If the schema doesn't exist, the hypertable will create it when
creating the first chunk, even though the user or table owner does
not have permissions to create schemas in the database.

This change adds proper permissions checks to create_hypertable() so
that users cannot create hypertables with a custom associated schema
unless they have the proper permissions on the schema or the database.

Chunks are also no longer created with internal schema permissions if
the associated schema is something different from the internal schema.
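
As an illustration (names hypothetical), the new check applies when a
custom associated schema is requested:

    -- fails unless the caller has suitable privileges on schema my_space
    SELECT create_hypertable('conditions', 'time',
                             associated_schema_name => 'my_space');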
2017-12-28 11:24:29 +01:00
Erik Nordström
21efcce95c Refactor chunk table creation and unify constraint handling
This change is part of an effort to create a consistent way
of dealing with metadata catalog updates, which is currently
a mix of C API and INSERT/UPDATE/DELETE statements from SQL
code. This mix makes catalog handling unnecessarily complex as
there are multiple ways to update metadata, increasing the risk
of security issues with publicly exposed SQL functions. It also
complicates things like cache invalidation, requiring different
mechanisms for C and SQL code. Catalog updates from SQL code
require triggers on metadata tables for cache invalidation that
do not work with native catalog updates.

The creation of chunks has been particularly messy in this regard,
making the code hard to follow. Especially the handling of a chunk's
constraints, where dimensional and other constraints were handled
differently. With this change, constraint handling is now consistent
across constraint types with a single API for updating metadata.

Reduce memory usage for out-of-order inserts

The chunk_result_relation_info should be put on the chunk memory
context. This will cause the rri constraint expr to also go onto
that context and be correctly freed when the chunk insert state
is destroyed.
2017-12-28 11:24:29 +01:00
Matvey Arye
2fe447ba14 Make TimescaleDB work with pg_upgrade
Compatibility with pg_upgrade required 2 changes:
1) search_path on functions cannot be blank for pg_upgrade.
2) The timescaledb.restoring GUC had to apply to more code (now moved to
   a higher-level check)

`pg_upgrade` must be passed the following option: `-O "-c timescaledb.restoring='on'"`
2017-12-19 11:47:49 -05:00
Erik Nordström
176b75e43d Add command to show tablespaces attached to a hypertable
Users can now call `show_tablespaces()` to list the tablespaces
attached to a particular hypertable.
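
For illustration (hypertable name hypothetical):

    SELECT * FROM show_tablespaces('conditions');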
2017-12-09 18:27:50 +01:00
Erik Nordström
6e92383592 Add function to detach tablespaces from hypertables
Tablespaces can now be detached from hypertables using
`tablespace_detach()`. This function can either detach
a tablespace from all tables or only a specific table.

Having the ability to detach tablespaces allows more
advanced storage management; for instance, one can detach
tablespaces that are running low on disk space while attaching
new ones to replace the old ones.
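
A hedged sketch (tablespace and table names hypothetical, and the
argument order is assumed):

    -- detach from one hypertable only
    SELECT tablespace_detach('disk1', 'conditions');
    -- detach from all hypertables
    SELECT tablespace_detach('disk1');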
2017-12-09 18:27:50 +01:00
Erik Nordström
e593876cb0 Refactor tablespace handling
Attaching tablespaces to hypertables is now handled
in native code, with improved permissions checking and
caching of tablespaces in the Hypertable data object.
2017-12-09 18:27:50 +01:00
Rob Kiefer
e44e47ed88 Update add_dimension to take INTERVAL times
The user should be able to add time dimensions using INTERVAL when
the column type is TIMESTAMP/TIMESTAMPTZ/DATE, so this change adds
that support.

Additionally, it adds further tests and checks for
add_dimension, e.g., a nice error when the table is not a
hypertable.
2017-12-07 12:09:35 -05:00
Matvey Arye
8b772be994 Change time handling in drop_chunks for TIMESTAMP times
This PR fixes the handling of drop_chunks when the hypertable's
time field is a TIMESTAMP or DATE field. Previously, such
hypertables needed drop_chunks to be given a timestamptz in UTC.
Now, drop_chunks can take a DATE or TIMESTAMP. Also, the INTERVAL
version of drop_chunks correctly handles these cases.

A consequence of this change is that drop_chunks cannot be called
on multiple tables (with table_name = NULL or schema_name = NULL)
if the tables have different time column types.
2017-11-27 16:17:42 -05:00
Erik Nordström
1e947da456 Permission fixes and allow SET ROLE
This change reduces the usage of SECURITY DEFINER on SQL
functions and fixes related permissions issues. It also
properly checks hypertable permissions relative to the current_user
instead of the session_user, which otherwise breaks SET ROLE,
among other things.
2017-11-27 15:55:26 +01:00
Matvey Arye
13e1cb5343 Add reindex function
reindex allows you to reindex the indexes of only certain chunks,
filtering by time. This is a common use case because a user may
want to reindex chunks after they are no longer getting new data.

reindex also has a recreate option, which does not use REINDEX
but instead creates a new index with CREATE INDEX and then does
DROP INDEX / RENAME new_index to old_name. This approach has the
advantage of blocking reads for a much shorter period of time. However,
it does more work and will use more disk space during the operation.
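
The recreate strategy corresponds roughly to this plain-SQL sequence
on a single chunk index (names hypothetical):

    CREATE INDEX chunk_time_idx_new ON chunk_table ("time");
    DROP INDEX chunk_time_idx;
    ALTER INDEX chunk_time_idx_new RENAME TO chunk_time_idx;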
2017-11-21 14:08:57 -05:00
Erik Nordström
741b25662e Mark IMMUTABLE functions as PARALLEL SAFE
Functions marked IMMUTABLE should also be parallel safe, but
aren't by default. This change marks all immutable functions
as parallel safe and removes the IMMUTABLE definitions on
some functions that have been wrongly labeled as IMMUTABLE.

If functions that are IMMUTABLE do not have the PARALLEL SAFE
label, some standard PostgreSQL regression tests will fail
(this is true for PostgreSQL >= 10).
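
For illustration, an immutable function carrying both labels:

    CREATE FUNCTION add_one(i integer) RETURNS integer
        LANGUAGE sql IMMUTABLE PARALLEL SAFE
        AS $$ SELECT i + 1 $$;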
2017-11-17 20:24:30 +01:00
Olof Rensfelt
201a948452 Check that time dimensions are set as NOT NULL.
Add a check that time dimensions are set as NOT NULL in the
main table that a hypertable is created from. If it is not
set, the constraint will be added.
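
The added constraint is equivalent to (names hypothetical):

    ALTER TABLE conditions ALTER COLUMN "time" SET NOT NULL;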
2017-11-02 09:12:15 +01:00
Erik Nordström
4532650411 Allow setting partitioning function
Users might want to implement their own partitioning function
or use the legacy one included with TimescaleDB. This change
adds support for setting the partitioning function in
create_hypertable() and add_dimension().
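
A hedged sketch (function and table names hypothetical):

    SELECT create_hypertable('conditions', 'time', 'device_id', 4,
                             partitioning_func => 'my_schema.my_hash_fn');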
2017-10-31 10:20:52 +01:00
Erik Nordström
cf009cc584 Avoid string conversion in hash partitioning
Hash partitioning previously relied on coercing (casting) values to
strings before calculating a hash value, including creating CHECK
constraints with casts. This approach is fairly suboptimal from a
performance perspective and might have issues related to different
character encodings depending on the system.

Hash partitioning now instead uses a partitioning function that takes
an anyelement argument and calls type-dependent hash functions internal
to PostgreSQL. This should provide more efficient hashing both by
avoiding unnecessary string conversions and by using more optimal
type-specific hash functions.

Support for the previous hash partitioning function is preserved for
backwards compatibility. Hypertables created with the previous
function will continue to use the old hashing strategy, while new
tables will default to the updated hash partitioning.

For safety, this change also blocks changing types on hash-partitioned
columns, since it seems hard to guarantee the same hash result between
different types.
2017-10-25 15:30:56 +02:00
Rob Kiefer
fbd4349234 Change integral drop_chunks() to use BIGINT
Previously drop_chunks() only took INTEGER, which prevented it
from being called with BIGINT values, e.g. for nanoseconds.
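
For illustration with nanosecond timestamps (the positional style of
this era is assumed; the value is hypothetical):

    SELECT drop_chunks(1508532900000000000::BIGINT, 'events');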
2017-10-19 16:23:32 -04:00
Matvey Arye
c3ebc676e3 Fix permission problems with dropping hypertables and chunks
This change fixes permissions with dropping hypertables and chunks.
Fixes #226.
2017-10-05 12:06:09 -04:00
Erik Nordström
040e815dba Remove truncate and hypertable metadata triggers
This is part of the ongoing effort to simplify the metadata tables and
remove any triggers on them that cause side effects.

This change includes the following:

- Remove the on_change_hypertable() trigger on the hypertable catalog
  table.
- Remove the TRUNCATE blocking triggers on all metadata tables. If
  we think such blocking is important, we should do this in an
  event trigger or the processUtility hook.
- Put all SQL files in a single load_order.txt instead of splitting
  across three distinct files. Now all SQL files are included in
  update scripts as well for simplicity and consistency.
- As a result of removing triggers and related functions, the
  setup_main() and restore_timescaledb() functions are no longer
  needed. This also further simplifies the database restore process
  as calling restore_timescaledb() is no longer needed (or possible).
- Refactor create_hypertable_row() to do more validation before
  allocating a new hypertable ID. This avoids incrementing the serial
  ID unnecessarily in case some validations fail.
2017-10-05 09:56:32 +02:00
Erik Nordström
097db3d589 Refactor chunk index handling
This change refactors the chunk index handling to make better use
of standard PostgreSQL catalog information, while removing the
hypertable_index metadata table and associated triggers, including
those on the chunk_index table. The chunk_index table itself is
also simplified.

A benefit of this refactoring is that indexes are no longer
created using string mangling to construct the CREATE INDEX command
for a chunk, based on the string definition of the hypertable
index. Instead, indexes are created in C using proper index-related
internal data structures.

Chunk indexes can now also be renamed and are added in the parent
index tablespace. Changing tablespace on a hypertable index also
recurses to chunks, as expected. Default indexes that are added when
creating a hypertable use the hypertable's tablespace.

Creating hypertable indexes with the CONCURRENTLY modifier is
currently blocked, due to unclear semantics regarding concurrent
creation over many tables, including how to deal with snapshots.
2017-10-03 10:51:32 +02:00
Matvey Arye
5cee104d57 Allow chunk_time_interval to be specified as an INTERVAL type 2017-09-15 12:48:14 -04:00
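For the commit above, a hedged sketch (table name hypothetical):

    SELECT create_hypertable('conditions', 'time',
                             chunk_time_interval => INTERVAL '1 week');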
Matvey Arye
51821b3709 Move trigger handling from PLPGSQL to C
Applying triggers to chunks requires taking the definition
of a trigger on a hypertable and executing it on a chunk. Previously
this was done with string replacement in the trigger definition.
This was not especially safe, and thus we moved the logic to C
where we can do proper parsing/deparsing and replacement of the table
name. Another positive aspect is that we got rid of some DDL triggers.
2017-09-14 13:01:46 -04:00
Matvey Arye
72d668150e Move security checks for ALTER TABLE ALTER COLUMN to C 2017-09-13 18:28:24 -04:00
Matvey Arye
19d3d8981b Handle changing the type of dimension columns correctly.
Update the type in the dimension metadata table and recreate
the check constraints.
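
For illustration (names hypothetical):

    -- the dimension metadata and chunk check constraints follow the new type
    ALTER TABLE conditions ALTER COLUMN "time" TYPE timestamptz;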
2017-09-13 18:28:24 -04:00
Matvey Arye
17c4ba9ec3 Handle ALTER TABLE rename column
Update the column names stored in the dimension metadata table.
2017-09-13 18:28:24 -04:00
Matvey Arye
d2561cc4fd Add ability to partition by a date type 2017-09-07 12:22:03 -04:00