timescaledb

mirror of https://github.com/timescale/timescaledb.git synced 2025-05-19 20:24:46 +08:00

Author	SHA1	Message	Date
Alexander Kuzmenkov	f08dd10425	Check the spelling in our code with codespell	2024-01-11 12:18:50 +01:00
Sven Klemm	6395b249a9	Remove remote connection handling code Remove the code used by multinode to handle remote connections. This patch completely removes tsl/src/remote and any remaining distributed hypertable checks.	2023-12-15 19:13:08 +01:00
Lakshmi Narayanan Sreethar	4bd704f3fc	Further code cleanup after PG12 removal Removed PG12 specific code guarded by the PG13_LT and PG13_GE macros.	2023-08-11 00:31:48 +05:30
Lakshmi Narayanan Sreethar	933285e646	PG16: Remove MemoryContextContains usage Remove the usage of MemoryContextContains as it has been removed in PG16. postgres/postgres@9543eff	2023-06-21 22:52:22 +05:30
Sven Klemm	336f4b513f	Fix hash_create calls without HASH_CONTEXT flag This patch fixes callsites that set an explicit memory context in the control structure but do not specify the HASH_CONTEXT flag leading to the hash table being created in TopMemoryContext. This patch also changes call sites that want to create the hash table in TopMemoryContext to be explicit about this. Additionally this patch adds a coccinelle script to detect these errors and prevent adding similar code in the future.	2022-07-31 21:47:26 +02:00
Erik Nordström	9ee9a1e1c0	Call cleanup functions on backend exit A number of cleanup functions for various data structures (e.g., caches) are designed to be called when the extension library is unloaded (via `_PG_fini`). But unloading is disabled in PostgreSQL and the library is also not unloaded when the backend exits. Therefore, cleanup functions are never called in practice. In most cases this is not a problem since memory is released in any case. However, in the case of the connection cache, connections need to be closed before memory is released or otherwise the DNs will emit "Connection reset by peer" log entries when the AN backend exits. Add proper cleanup of caches and other data structures by adding a callback via `on_proc_exit`, which gets called when the backend exits. The closing of AN to DN backend connections are also logged on the AN if `log_connections=true`.	2021-09-17 11:17:57 +02:00
Sven Klemm	d0426ff234	Move all compatibility related files into compat directory	2021-08-28 05:17:22 +02:00
Dmitry Simonenko	16accae67d	Fix use after free in cache When calling hash_search with HASH_REMOVE the returned pointer should not be dereferenced because it returns a dangling pointer	2021-04-19 12:43:56 +03:00
Sven Klemm	b1a9c3b7b7	Add support for PG13 List implementation PG13 changes the List implementation from a linked list to an array while most of the API functions did not change a few them have slightly different signature in PG13, additionally the list_make5 functions got removed. https://github.com/postgres/postgres/commit/1cff1b95ab	2021-01-04 11:18:33 +01:00
Matvey Arye	02c178d9ca	Add connection caching infrastructure This commit adds the ability to cache remote connections across commands and transactions. This is needed since establishing new connections is expensive. The cache is invalidated when the foreign server or user mapping is changed. Because of this the cache is tied to a user mapping (it is keyed by the user mapping's oid and requires a user_mapping for invalidation). We use the syscache invalidation mechanism since foreign servers and user mappings are already invalidated using this mechanism. This requires some extra plumbing in our cache invalidation handling. This cache will be used in txn callback handling and so the regular auto-release of caches on (sub)txn commits/aborts that happens with most caches is inappropriate. Therefore we added a new flag to the cache called `handle_txn_callbacks` that allows a cache to turn off the auto-release mechanism	2020-05-27 17:31:09 +02:00
Erik Nordström	ae587c9964	Add API function for explicit chunk creation This adds an internal API function to create a chunk using explicit constraints (dimension slices). A function to export a chunk in a format consistent with the chunk creation function is also added. The chunk export/create functions are needed for distributed hypertables so that an access node can create chunks on data nodes according to its own (global) partitioning configuration.	2020-05-27 17:31:09 +02:00
Erik Nordström	36af23ec94	Use flags for cache query options Cache queries support multiple optional behaviors, such as "missing ok" (do not fail on cache miss) and "no create" (do not create a new entry if one doesn't exist in the cache). With multiple boolean parameters, the query API has become unwieldy so this change turns these booleans into one flag parameter.	2020-04-14 23:12:15 +02:00
Erik Nordström	afb4c7ba51	Refactor planner hooks This change refactors our main planner hooks in `planner.c` with the intention of providing a consistent way to classify planned relations across hooks. In our hooks, we'd like to know whether a planned relation (`RelOptInfo`) is one of the following: * Hypertable * Hypertable child (a hypertable can appear as a child of itself) * Chunk as a child of hypertable (from expansion) * Chunk as standalone (operation directly on chunk) * Any other relation Previously, there was no way to consistently know which of these one was dealing with. Instead, a mix of various functions was used without "remembering" the classification for reuse in later sections of the code. When classifying relations according to the above categories, the only source of truth about a relation is our catalog metadata. In case of hypertables, this is cached in the hypertable cache. However, this cache is read-through, so, in case of a cache miss, the metadata will always be scanned to resolve a new entry. To avoid unnecessary metadata scans, this change introduces a way to do cache-only queries. This requires maintaining a single warmed cache throughout planning and is enabled by using a planner-global cache object. The pre-planning query processing warms the cache by populating it with all hypertables in the to-be-planned query.	2020-04-14 23:12:15 +02:00
Joshua Lockerman	949b88ef2e	Initial support for PostgreSQL 12 This change includes a major refactoring to support PostgreSQL 12. Note that many tests aren't passing at this point. Changes include, but are not limited to: - Handle changes related to table access methods - New way to expand hypertables since expansion has changed in PostgreSQL 12 (more on this below). - Handle changes related to table expansion for UPDATE/DELETE - Fixes for various TimescaleDB optimizations that were affected by planner changes in PostgreSQL (gapfill, first/last, etc.) Before PostgreSQL 12, planning was organized something like as follows: 1. construct add `RelOptInfo` for base and appendrels 2. add restrict info, joins, etc. 3. perform the actual planning with `make_one_rel` For our optimizations we would expand hypertables in the middle of step 1; since nothing in the query planner before `make_one_rel` cared about the inheritance children, we didn’t have to be too precises about where we were doing it. However, with PG12, and the optimizations around declarative partitioning, PostgreSQL now does care about when the children are expanded, since it wants as much information as possible to perform partition-pruning. Now planning is organized like: 1. construct add RelOptInfo for base rels only 2. add restrict info, joins, etc. 3. expand appendrels, removing irrelevant declarative partitions 4. perform the actual planning with make_one_rel Step 3 always expands appendrels, so when we also expand them during step 1, the hypertable gets expanded twice, and things in the planner break. The changes to support PostgreSQL 12 attempts to solve this problem by keeping the hypertable root marked as a non-inheritance table until `make_one_rel` is called, and only then revealing to PostgreSQL that it does in fact have inheritance children. While this strategy entails the least code change on our end, the fact that the first hook we can use to re-enable inheritance is `set_rel_pathlist_hook` it does entail a number of annoyances: 1. this hook is called after the sizes of tables are calculated, so we must recalculate the sizes of all hypertables, as they will not have taken the chunk sizes into account 2. the table upon which the hook is called will have its paths planned under the assumption it has no inheritance children, so if it's a hypertable we have to replan it's paths Unfortunately, the code for doing these is static, so we need to copy them into our own codebase, instead of just using PostgreSQL's. In PostgreSQL 12, UPDATE/DELETE on inheritance relations have also changed and are now planned in two stages: - In stage 1, the statement is planned as if it was a `SELECT` and all leaf tables are discovered. - In stage 2, the original query is planned against each leaf table, discovered in stage 1, directly, not part of an Append. Unfortunately, this means we cannot look in the appendrelinfo during UPDATE/DELETE planning, in particular to determine if a table is a chunk, as the appendrelinfo is not at the point we wish to do so initialized. This has consequences for how we identify operations on chunks (sometimes for blocking and something for enabling functionality).	2020-04-14 23:12:15 +02:00
Matvey Arye	34edba16a9	Run clang-format on code	2019-02-05 16:55:16 -05:00
Joshua Lockerman	acc41a7712	Update license header Only have the copyright in the NOTICE. Hopefully only having to update one place each year will keep it consistent.	2019-01-03 11:57:51 -05:00
Joshua Lockerman	4e1e15f079	Add reorder command New cluster-like command which writes to a new index than swaps, much like is done for the data table, and only acquires exclusive locks for said swap. This trades off disk usage for lower contention: we hold locks for a much lower period of time, allowing reads to work concurrently, but we have both the old and new versions of the table existing at once, approximately doubling storage usage while reorder is running. Currently only works on chunks.	2019-01-02 15:43:48 -05:00
Sven Klemm	92586d8fc9	Fix typos in comments	2018-12-31 18:36:05 +01:00
Sven Klemm	b1378449bc	Remove unused functions Remove the following unused functions: ts_cache_switch_to_memory_context ts_chunk_free ts_chunk_exists ts_chunk_index_delete_children_of ts_chunk_index_delete_by_hypertable_id ts_hypertable_scan_relid ts_tablespaces_clear ts_tablespaces_delete	2018-12-18 10:35:04 +01:00
Joshua Lockerman	9de504f958	Add ts_ prefix to everything in headers Future proofing: if we ever want to make our functions available to others they’d need to be prefixed to prevent name collisions. In order to avoid having some functions with the ts_ prefix and others without, we’re adding the prefix to all non-static functions now.	2018-12-05 14:43:22 -05:00
Joshua Lockerman	d8e41ddaba	Add Apache License header to all C files	2018-10-29 13:28:19 -04:00
Erik Nordström	3c28f65f44	Release cache pin memory Cache pins are allocated on the CacheMemoryContext in order to survive subtransactions, and, optionally, main transactions (in case of, e.g., clustering or vaccuming). However, cache pins that are released also needs to free memory on the CacheMemoryContext in order to avoid leaking memory in a session. This change ensures cache pin memory is freed when a cache pin is released. It also allocates cache pins on a separate memory context, which is a child of the CacheMemoryContext. This separate memory context makes it easier to track the memory used by cache pins and also release it when necessary.	2018-07-10 10:15:30 +02:00
Erik Nordström	2e1f3b9fd0	Improve memory allocation during cache lookups Previously, cache lookups were run on the cache's memory context. While simple, this risked allocating transient (work) data on that memory context, e.g., when scanning for new cache data during cache misses. This change makes scan functions take a memory context, which the found data should be allocated on. All other data is allocated on the current memory (typically the transaction's memory context). With this functionality, a cache can pass its memory context to the scan, thus avoiding taking on unnecessary memory allocations.	2018-06-22 16:45:07 +02:00
Matvey Arye	744ca0920c	Fix Cache Pinning for Subtxns Previously, all Subtxn aborts released all cache pins. This is wrong because that can release pins that are still in use by higher-level subtxns and top-level txns. Now, we release only pins corresponding to the appropriate subtxn. That means that we now track the subtxn a pin was created in.	2018-03-15 11:52:51 -04:00
Matvey Arye	b31c8dbd1e	Address coverity complaints	2018-02-16 15:58:44 -05:00
Matvey Arye	78d36b52d4	Handle subtxn for cache pinning This commit adds logic for cache pinning to handle subtxn. It also makes it easier to find cache pinning leaks. Finally, it fixes handling of cross-commit operations like VACUUM and CLUSTER. Previously, such operations incorrectly released that cache pin on the first commit even though the object was used after that.	2018-01-11 09:35:37 -05:00
Erik Nordström	55bfdf753a	Release all cache pins when a transaction ends Currently, if a transaction ends (normally or abnormally) while a cache is pinned, that cache will never be released, leading to a memory leak. This change ensures that all taken cache pins are released when a transaction ends by registering all pins in a list and cleaning it up in a transaction end callback. This makes the use of cache_release() optional. However, timely calls to cache_release() is still encouraged since it is better to release memory as soon as possible during long-running transactions.	2017-08-09 13:53:03 +02:00
Matvey Arye	aca7f326b3	Fix extension drop handling Previously, the extension could end up in a bad state if it was dropped as part of a cascade. This PR fixes that by checking explicitly for the presence of the proxy table to make sure we are not in the middle of an extension drop. Fixes #73.	2017-06-27 10:57:10 -04:00
Matvey Arye	14ac892c88	Fix possible segfault Previously cache_destroy NULLed out memory in a MemoryContext that it just deleted. This could cause a segfault and is fixed by this PR.	2017-06-27 09:27:17 -04:00
Robin Thomas	7b2097a975	ran pgindent successfully with local typedefs; ended up correcting several code style discrepancies from master.	2017-04-29 18:32:07 -04:00
Erik Nordström	5c2bef47f1	Use a cache's own memory context on cache updates In case new allocations are made during a cache entry update, we should use the cache's internal memory context to ensure that new data is allocated and freed in a way consistent with the rest of the cache.	2017-03-24 08:16:36 +01:00
Erik Nordström	89692c9761	Add cache statistics and do minor cleanup Track statistics for cache hits and misses in the cache module. Currently not exposed to SQL, but might be useful for internal debugging.	2017-03-22 09:57:44 +01:00
Matvey Arye	32c45b75b2	formatting with pgindent	2017-03-06 15:20:00 -05:00
Matvey Arye	4d4ac78ef5	cleanup	2017-03-05 10:13:45 -05:00
Matvey Arye	64e8ec1877	Ordering inserts to avoid deadlocks	2017-03-03 10:10:14 -05:00
Matvey Arye	67ad21ee36	Cache hypertable metadata information for faster inserts. Add caches for hypertable metadata to make it faster to map INSERT rows to the chunk they should go into.	2017-03-02 12:24:53 +01:00

36 Commits