citus

Commit Graph

Author	SHA1	Message	Date
Naisila Puka	da2624cee8	PG17 compatibility: Resolve compilation issues (#7699 ) This PR provides successful compilation against PG17.0. - Remove ExecFreeExprContext call Relevant PG commit d060e921ea5aa47b6265174c32e1128cebdbc3df `d060e921ea` - PG17 uses streaming IO in analyze, fix scan_analyze_next_block function Relevant PG commit 041b96802efa33d2bc9456f2ad946976b92b5ae1 `041b96802e` - Define ObjectClass for PG17+ only since it's removed Relevant PG commit: 89e5ef7e21812916c9cf9fcf56e45f0f74034656 `89e5ef7e21` - Remove ReorderBufferTupleBuf structure. Relevant PG commit: 08e6344fd6423210b339e92c069bb979ba4e7cd6 `08e6344fd6` - Define colliculocale and daticulocale since they have been renamed Relevant PG commit: f696c0cd5f299f1b51e214efc55a22a782cc175d `f696c0cd5f` - makeStringConst defined in PG17 Relevant PG commit: de3600452b61d1bc3967e9e37e86db8956c8f577 `de3600452b` - RangeVarCallbackOwnsTable was replaced by RangeVarCallbackMaintainsTable Relevant PG commit: ecb0fd33720fab91df1207e85704f382f55e1eb7 `ecb0fd3372` - attstattarget is nullable, define pg compatible functions for it Relevant PG commit: 4f622503d6de975ac87448aea5cea7de4bc140d5 `4f622503d6` - stxstattarget is nullable in PG17, write compat functions for it Relevant PG commit: 012460ee93c304fbc7220e5b55d9d0577fc766ab `012460ee93` - Use ResourceOwner to track WaitEventSet in PG17 Relevant PG commit: 50c67c2019ab9ade8aa8768bfe604cd802fe8591 `50c67c2019` - getIdentitySequence now uses Relation instead of relation_id Relevant PG commit: 509199587df73f06eda898ae13284292f4ae573a `509199587d` - Remove no-op tuplestore_donestoring function Relevant PG commit: 75680c3d805e2323cd437ac567f0677fdfc7b680 `75680c3d80` - MergeAction can have 3 merge kinds (now enum) in PG17, write compat Relevant PG commit: 0294df2f1f842dfb0eed79007b21016f486a3c6c `0294df2f1f` - EXPLAIN (MEMORY) is added, make changes to ExplainOnePlan Relevant PG commit: 5de890e3610d5a12cdaea36413d967cf5c544e20 `5de890e361` - LIMIT_OPTION_DEFAULT has been removed as it's useless, use LIMIT_OPTION_COUNT Relevant PG commit: a6be0600ac3b71dda8277ab0fcbe59ee101ac1ce `a6be0600ac` - write compat for create_foreignscan_path bcs of more arguments in PG17 Relevant PG commit: 9e9931d2bf40e2fea447d779c2e133c2c1256ef3 `9e9931d2bf` - pgprocno and lxid have been combined into a struct in PGPROC Relevant PG commits: 28f3915b73f75bd1b50ba070f56b34241fe53fd1 `28f3915b73` ab355e3a88de745607f6dd4c21f0119b5c68f2ad `ab355e3a88` 024c521117579a6d356050ad3d78fdc95e44eefa `024c521117` - Simplify CitusNewNode (#7434) postgres refactored newNode() in PG 17, the main point for doing this is the original tricks is no longer neccessary for modern compilers[1]. This does the same for Citus. This should have no backward compatibility issues since it just replaces palloc0fast with palloc0. This is good for forward compatibility since palloc0fast no longer exists in PG 17. [1] https://www.postgresql.org/message-id/b51f1fa7-7e6a-4ecc-936d-90a8a1659e7c@iki.fi (cherry picked from commit `4b295cc`)	2024-10-17 15:37:13 +03:00
Naisila Puka	9d364332ac	Rename foreach_ macros to foreach_declared_ macros (#7700 ) This is prep work for successful compilation with PG17 PG17added foreach_ptr, foreach_int and foreach_oid macros Relevant PG commit 14dd0f27d7cd56ffae9ecdbe324965073d01a9ff `14dd0f27d7` We already have these macros, but they are different with the PG17 ones because our macros take a DECLARED variable, whereas the PG16 macros declare a locally-scoped loop variable themselves. Hence I am renaming our macros to foreach_declared_ I am separating this into its own PR since it touches many files. The main compilation PR is https://github.com/citusdata/citus/pull/7699	2024-10-16 17:01:39 +03:00
Hanefi Onaldi	15ecc37ecd	Bump Citus to 12.1.5	2024-07-17 15:11:38 +03:00
Parag Jain	6349f2d52d	Support MERGE command for single_shard_distributed Target (#7643 ) This PR has following changes : 1. Enable MERGE command for single_shard_distributed targets. (cherry picked from commit `3c467e6e02`)	2024-07-17 15:11:38 +03:00
Nils Dijk	9dcfcb92ff	CI: move to github container registry (#7652 ) We move the CI images to the github container registry. Given we mostly (if not solely) run these containers on github actions infra it makes sense to have them hosted closer to where they are needed. Image changes: https://github.com/citusdata/the-process/pull/157 (cherry picked from commit `e776a7ebbb`)	2024-07-17 14:57:50 +03:00
paragjain	caee20ad7c	fixing expected file of multi_move_mx test	2024-06-18 16:49:39 +02:00
Onur Tirtir	d9635609f4	Fix flaky multi_mx_node_metadata.sql test (#7317 ) Fixes the flaky test that results in following diff: ```diff --- /__w/citus/citus/src/test/regress/expected/multi_mx_node_metadata.out.modified 2023-11-01 14:22:12.890476575 +0000 +++ /__w/citus/citus/src/test/regress/results/multi_mx_node_metadata.out.modified 2023-11-01 14:22:12.914476657 +0000 @@ -840,24 +840,26 @@ (1 row) \c :datname - - :master_port SELECT datname FROM pg_stat_activity WHERE application_name LIKE 'Citus Met%'; datname ------------ db_to_drop (1 row) DROP DATABASE db_to_drop; +ERROR: database "db_to_drop" is being accessed by other users SELECT datname FROM pg_stat_activity WHERE application_name LIKE 'Citus Met%'; datname ------------ -(0 rows) + db_to_drop +(1 row) -- cleanup DROP SEQUENCE sequence CASCADE; NOTICE: drop cascades to default value for column a of table reference_table ``` (cherry picked from commit `9867c5b949`)	2024-06-18 16:49:39 +02:00
Jelte Fennema-Nio	4f0053ed6d	Redo #7620 : Fix merge command when insert value does not have source distributed column (#7627 ) Related to issue #7619, #7620 Merge command fails when source query is single sharded and source and target are co-located and insert is not using distribution key of source. Example ``` CREATE TABLE source (id integer); CREATE TABLE target (id integer ); -- let's distribute both table on id field SELECT create_distributed_table('source', 'id'); SELECT create_distributed_table('target', 'id'); MERGE INTO target t USING ( SELECT 1 AS somekey FROM source WHERE source.id = 1) s ON t.id = s.somekey WHEN NOT MATCHED THEN INSERT (id) VALUES (s.somekey) ERROR: MERGE INSERT must use the source table distribution column value HINT: MERGE INSERT must use the source table distribution column value ``` Author's Opinion: If join is not between source and target distributed column, we should not force user to use source distributed column while inserting value of target distributed column. Fix: If user is not using distributed key of source for insertion let's not push down query to workers and don't force user to use source distributed column if it is not part of join. This reverts commit `fa4fc0b372`. Co-authored-by: paragjain <paragjain@microsoft.com> (cherry picked from commit `aaaf637a6b`)	2024-06-18 16:49:39 +02:00
Jelte Fennema-Nio	3594bd7ac0	Fix CI issues after Github Actions networking changes (#7624 ) For some reason using localhost in our hba file doesn't have the intended effect anymore in our Github Actions runners. Probably because of some networking change (IPv6 maybe) or some change in the `/etc/hosts` file. Replacing localhost with the equivalent loopback IPv4 and IPv6 addresses resolved this issue. (cherry picked from commit `8c9de08b76`)	2024-06-18 16:49:39 +02:00
Gürkan İndibay	7e0dc18b22	Bump Citus version to 12.1.4 (#7610 )	2024-05-29 11:35:08 +03:00
Gürkan İndibay	4e838a471a	Adds null check for node in HasRangeTableRef (#7604 ) DESCRIPTION: Adds null check for node in HasRangeTableRef to prevent errors When executing the query below, users encountered an error due to a null Node object. This PR adds a null check to handle this error. Query: ```sql select ct.conname as constraint_name, a.attname as column_name, fc.relname as foreign_table_name, fns.nspname as foreign_table_schema, fa.attname as foreign_column_name from (SELECT ct.conname, ct.conrelid, ct.confrelid, ct.conkey, ct.contype, ct.confkey, generate_subscripts(ct.conkey, 1) AS s FROM pg_constraint ct ) AS ct inner join pg_class c on c.oid=ct.conrelid inner join pg_namespace ns on c.relnamespace=ns.oid inner join pg_attribute a on a.attrelid=ct.conrelid and a.attnum = ct.conkey[ct.s] left join pg_class fc on fc.oid=ct.confrelid left join pg_namespace fns on fc.relnamespace=fns.oid left join pg_attribute fa on fa.attrelid=ct.confrelid and fa.attnum = ct.confkey[ct.s] where ct.contype='f' and c.relname='table1' and ns.nspname='schemauser' order by fns.nspname, fc.relname, a.attnum ; ``` Error: ``` #0 HasRangeTableRef (node=0x0, varno=varno@entry=0x7ffe18cc3674) at worker/worker_shard_visibility.c:507 507 if (IsA(node, RangeTblRef)) #0 HasRangeTableRef (node=0x0, varno=varno@entry=0x7ffe18cc3674) at worker/worker_shard_visibility.c:507 #1 0x0000561b0aae390e in expression_tree_walker_impl (node=0x561b0d19cc78, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2091 #2 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #3 0x0000561b0aae3e09 in expression_tree_walker_impl (node=0x561b0d19cd68, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=context@entry=0x7ffe18cc3674) at nodeFuncs.c:2405 #4 0x0000561b0aae3945 in expression_tree_walker_impl (node=0x561b0d19d0f8, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2111 #5 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #6 0x0000561b0aae3e09 in expression_tree_walker_impl (node=0x561b0d19cb38, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=context@entry=0x7ffe18cc3674) at nodeFuncs.c:2405 #7 0x0000561b0aae396d in expression_tree_walker_impl (node=0x561b0d19d198, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2127 #8 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #9 0x0000561b0aae3ef7 in expression_tree_walker_impl (node=0x561b0d183e88, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2464 #10 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #11 0x0000561b0aae3ed3 in expression_tree_walker_impl (node=0x561b0d184278, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2460 #12 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #13 0x0000561b0aae3ed3 in expression_tree_walker_impl (node=0x561b0d184668, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2460 #14 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #15 0x0000561b0aae3ed3 in expression_tree_walker_impl (node=0x561b0d184f68, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=0x7ffe18cc3674) at nodeFuncs.c:2460 #16 0x00007f2a73249f26 in HasRangeTableRef (node=<optimized out>, varno=<optimized out>) at worker/worker_shard_visibility.c:513 #17 0x0000561b0aae3e09 in expression_tree_walker_impl (node=0x7f2a68010148, walker=walker@entry=0x7f2a73249f0a <HasRangeTableRef>, context=context@entry=0x7ffe18cc3674) at nodeFuncs.c:2405 #18 0x00007f2a7324a0eb in FilterShardsFromPgclass (node=node@entry=0x561b0d185de8, context=context@entry=0x0) at worker/worker_shard_visibility.c:464 #19 0x00007f2a7324a5ff in HideShardsFromSomeApplications (query=query@entry=0x561b0d185de8) at worker/worker_shard_visibility.c:294 #20 0x00007f2a731ed7ac in distributed_planner (parse=0x561b0d185de8, query_string=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"..., cursorOptions=<optimized out>, boundParams=0x0) at planner/distributed_planner.c:237 #21 0x00007f2a7311a52a in pgss_planner (parse=0x561b0d185de8, query_string=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"..., cursorOptions=2048, boundParams=0x0) at pg_stat_statements.c:953 #22 0x0000561b0ab65465 in planner (parse=parse@entry=0x561b0d185de8, query_string=query_string@entry=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"..., cursorOptions=cursorOptions@entry=2048, boundParams=boundParams@entry=0x0) at planner.c:279 #23 0x0000561b0ac53aa3 in pg_plan_query (querytree=querytree@entry=0x561b0d185de8, query_string=query_string@entry=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"..., cursorOptions=cursorOptions@entry=2048, boundParams=boundParams@entry=0x0) at postgres.c:904 #24 0x0000561b0ac53b71 in pg_plan_queries (querytrees=0x7f2a68012878, query_string=query_string@entry=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"..., cursorOptions=cursorOptions@entry=2048, boundParams=boundParams@entry=0x0) at postgres.c:996 #25 0x0000561b0ac5408e in exec_simple_query ( query_string=query_string@entry=0x561b0d009478 "select\n ct.conname as constraint_name,\n a.attname as column_name,\n fc.relname as foreign_table_name,\n fns.nspname as foreign_table_schema,\n fa.attname as foreign_column_name\nfrom\n (S"...) at postgres.c:1193 #26 0x0000561b0ac56116 in PostgresMain (dbname=<optimized out>, username=<optimized out>) at postgres.c:4637 #27 0x0000561b0abab7a7 in BackendRun (port=port@entry=0x561b0d0caf50) at postmaster.c:4464 #28 0x0000561b0abae969 in BackendStartup (port=port@entry=0x561b0d0caf50) at postmaster.c:4192 #29 0x0000561b0abaeaa6 in ServerLoop () at postmaster.c:1782 ``` Fixes #7603	2024-05-28 08:54:40 +03:00
Gürkan İndibay	035aa6eada	Bump Citus version to 12.1.3 (#7588 )	2024-04-24 11:15:04 +03:00
Jelte Fennema-Nio	bac95cc523	Greatly speed up "\d tablename" on servers with many tables (#7577 ) DESCRIPTION: Fix performance issue when using "\d tablename" on a server with many tables We introduce a filter to every query on pg_class to automatically remove shards. This is useful to make sure \d and PgAdmin are not cluttered with shards. However, the way we were introducing this filter was using `securityQuals` which can have negative impact on query performance. On clusters with 100k+ tables this could cause a simple "\d tablename" command to take multiple seconds, because a skipped optimization by Postgres causes a full table scan. This changes the code to introduce this filter in the regular `quals` list instead of in `securityQuals`. Which causes Postgres to use the intended optimization again. For reference, this was initially reported as a Postgres issue by me: https://www.postgresql.org/message-id/flat/4189982.1712785863%40sss.pgh.pa.us#b87421293b362d581ea8677e3bfea920 (cherry picked from commit `a0151aa31d`)	2024-04-17 10:26:50 +02:00
Xing Guo	38967491ef	Add missing volatile qualifier. (#7570 ) Variables being modified in the PG_TRY block and read in the PG_CATCH block should be qualified with volatile. The variable waitEventSet is modified in the PG_TRY block (line 1085) and read in the PG_CATCH block (line 1095). The variable relation is modified in the PG_TRY block (line 500) and read in the PG_CATCH block (line 515). Besides, the variable objectAddress doesn't need the volatile qualifier. Ref: C99 7.13.2.1[^1], > All accessible objects have values, and all other components of the abstract machine have state, as of the time the longjmp function was called, except that the values of objects of automatic storage duration that are local to the function containing the invocation of the corresponding setjmp macro that do not have volatile-qualified type and have been changed between the setjmp invocation and longjmp call are indeterminate. [^1]: https://www.open-std.org/jtc1/sc22/wg14/www/docs/n1256.pdf DESCRIPTION: Correctly mark some variables as volatile --------- Co-authored-by: Hong Yi <zouzou0208@gmail.com> (cherry picked from commit `ada3ba2507`)	2024-04-17 10:26:50 +02:00
Filip Sedlák	fc09e1cfdc	Log username in the failed connection message (#7432 ) This patch includes the username in the reported error message. This makes debugging easier when certain commands open connections as other users than the user that is executing the command. ``` monitora_snapshot=# SELECT citus_move_shard_placement(102030, 'monitora.db-dev-worker-a', 6005, 'monitora.db-dev-worker-a', 6017); ERROR: connection to the remote node monitora_user@monitora.db-dev-worker-a:6017 failed with the following error: fe_sendauth: no password supplied Time: 40,198 ms ``` (cherry picked from commit `8b48d6ab02`)	2024-04-17 10:26:50 +02:00
Karina	7513061057	Make isolation_update_node test system independent (#7423 ) Test isolation_update_node fails on some systems with the following error: ``` -s2: WARNING: connection to the remote node non-existent:57637 failed with the following error: could not translate host name "non-existent" to address: Name or service not known +s2: WARNING: connection to the remote node non-existent:57637 failed with the following error: could not translate host name "non-existent" to address: Temporary failure in name resolution ``` This slightly modifies an already existing [normalization rule](`739c6d26df/src/test/regress/bin/normalize.sed (L217-L218)`) to fix it. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com> (cherry picked from commit `21464adfec`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	f4af59ab4b	Support running isolation_update_node in flaky test detection (#7425 ) I noticed in #7423 that `isolation_update_node` could not be run using flaky test detection. This fixes that.	2024-04-17 10:26:50 +02:00
sminux	5708fca1ea	fix bad copy-paste rightComparisonLimit (#7547 ) DESCRIPTION: change for #7543 (cherry picked from commit `d59c93bc50`)	2024-04-17 10:26:50 +02:00
LightDB Enterprise Postgres	2a6164d2d9	Fix timeout when underlying socket is changed in a MultiConnection (#7377 ) When there are multiple localhost entries in /etc/hosts like following /etc/hosts: ``` 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6 127.0.0.1 localhost ``` multi_cluster_management check will failed: ``` @@ -857,20 +857,21 @@ ERROR: group 14 already has a primary node -- check that you can add secondaries and unavailable nodes to a group SELECT groupid AS worker_2_group FROM pg_dist_node WHERE nodeport = :worker_2_port \gset SELECT 1 FROM master_add_node('localhost', 9998, groupid => :worker_1_group, noderole => 'secondary'); ?column? ---------- 1 (1 row) SELECT 1 FROM master_add_node('localhost', 9997, groupid => :worker_1_group, noderole => 'unavailable'); +WARNING: could not establish connection after 5000 ms ?column? ---------- 1 (1 row) ``` This actually isn't just a problem in test environments, but could occur as well during actual usage when a hostname in pg_dist_node resolves to multiple IPs and one of those IPs is unreachable. Postgres will then automatically continue with the next IP, but Citus should listen for events on the new socket. Not on the old one. Co-authored-by: chuhx43211 <chuhx43211@hundsun.com> (cherry picked from commit `9a91136a3d`)	2024-04-17 10:26:50 +02:00
eaydingol	db391c0bb7	Change the order in which the locks are acquired (#7542 ) This PR changes the order in which the locks are acquired (for the target and reference tables), when a modify request is initiated from a worker node that is not the "FirstWorkerNode". To prevent concurrent writes, locks are acquired on the first worker node for the replicated tables. When the update statement originates from the first worker node, it acquires the lock on the reference table(s) first, followed by the target table(s). However, if the update statement is initiated in another worker node, the lock requests are sent to the first worker in a different order. This PR unifies the modification order on the first worker node. With the third commit, independent of the node that received the request, the locks are acquired for the modified table and then the reference tables on the first node. The first commit shows a sample output for the test prior to the fix. Fixes #7477 --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com> (cherry picked from commit `8afa2d0386`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	146725fc9b	Replace more spurious strdups with pstrdups (#7441 ) DESCRIPTION: Remove a few small memory leaks In #7440 one instance of a strdup was removed. But there were a few more. This removes the ones that are left over, or adds a comment why strdup is on purpose. (cherry picked from commit `9683bef2ec`)	2024-04-17 10:26:50 +02:00
Marco Slot	94ab1dc240	Replace spurious strdup with pstrdup (#7440 ) Not sure why we never found this using valgrind, but using strdup will cause memory leaks because the pointer is not tracked in a memory context. (cherry picked from commit `72fbea20c4`)	2024-04-17 10:26:50 +02:00
Onur Tirtir	812a2b759f	Improve error message for recursive CTEs (#7407 ) Fixes #2870 (cherry picked from commit `5aedec4242`)	2024-04-17 10:26:50 +02:00
Onur Tirtir	452564c19b	Allow providing "host" parameter via citus.node_conninfo (#7541 ) And when that is the case, directly use it as "host" parameter for the connections between nodes and use the "hostname" provided in pg_dist_node / pg_dist_poolinfo as "hostaddr" to avoid host name lookup. This is to avoid allowing dns resolution (and / or setting up DNS names for each host in the cluster). This already works currently when using IPs in the hostname. The only use of setting host is that you can then use sslmode=verify-full and it will validate that the hostname matches the certificate provided by the node you're connecting too. It would be more flexible to make this a per-node setting, but that requires SQL changes. And we'd like to backport this change, and backporting such a sql change would be quite hard while backporting this change would be very easy. And in many setups, a different hostname for TLS validation is actually not needed. The reason for that is query-from-any node: With query-from-any-node all nodes usually have a certificate that is valid for the same "cluster hostname", either using a wildcard cert or a Subject Alternative Name (SAN). Because if you load balance across nodes you don't know which node you're connecting to, but you still want TLS validation to do it's job. So with this change you can use this same "cluster hostname" for TLS validation within the cluster. Obviously this means you don't validate that you're connecting to a particular node, just that you're connecting to one of the nodes in the cluster, but that should be fine from a security perspective (in most cases). Note to self: This change requires updating https://docs.citusdata.com/en/latest/develop/api_guc.html#citus-node-conninfo-text. DESCRIPTION: Allows overwriting host name for all inter-node connections by supporting "host" parameter in citus.node_conninfo (cherry picked from commit `3586aab17a`)	2024-04-17 10:26:50 +02:00
Karina	9b06d02c43	Fix error in master_disable_node/citus_disable_node (#7492 ) This fixes #7454: master_disable_node() has only two arguments, but calls citus_disable_node() that tries to read three arguments Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com> (cherry picked from commit `683e10ab69`)	2024-04-17 10:26:50 +02:00
copetol	2ee43fd00c	Fix segfault when using certain DO block in function (#7554 ) When using a CASE WHEN expression in the body of the function that is used in the DO block, a segmentation fault occured. This fixes that. Fixes #7381 --------- Co-authored-by: Konstantin Morozov <vzbdryn@yahoo.com> (cherry picked from commit `12f56438fc`)	2024-04-17 10:26:50 +02:00
Emel Şimşek	f2d102d54b	Fix crash caused by some form of ALTER TABLE ADD COLUMN statements. (#7522 ) DESCRIPTION: Fixes a crash caused by some form of ALTER TABLE ADD COLUMN statements. When adding multiple columns, if one of the ADD COLUMN statements contains a FOREIGN constraint ommitting the referenced columns in the statement, a SEGFAULT occurs. For instance, the following statement results in a crash: ``` ALTER TABLE lt ADD COLUMN new_col1 bool, ADD COLUMN new_col2 int references rt; ``` Fixes #7520. (cherry picked from commit `fdd658acec`)	2024-04-17 10:26:50 +02:00
Karina	82637f3e13	Use expecteddir option in _run_pg_regress() (#7582 ) Fix check-arbitrary-configs tests failure with current REL_16_STABLE. This is the same problem as described in #7573. I missed pg_regress call in _run_pg_regress() in that PR. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com> (cherry picked from commit `41e2af8ff5`)	2024-04-17 10:26:50 +02:00
Karina	79616bc7db	Use expecteddir option when running vanilla tests (#7573 ) In PostgreSQL 16 a new option expecteddir was introduced to pg_regress. Together with fix in [196eeb6b](https://github.com/postgres/postgres/commit/196eeb6b) it causes check-vanilla failure if expecteddir is not specified. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com> (cherry picked from commit `41d99249d9`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	364e8ece14	Speed up GetForeignKeyOids (#7578 ) DESCRIPTION: Fix performance issue in GetForeignKeyOids on systems with many constraints GetForeignKeyOids was showing up in CPU profiles when distributing schemas on systems with 100k+ constraints. The reason was that this function was doing a sequence scan of pg_constraint to get the foreign keys that referenced the requested table. This fixes that by finding the constraints referencing the table through pg_depend instead of pg_constraint. We're doing this indirection, because pg_constraint doesn't have an index that we can use, but pg_depend does. (cherry picked from commit `a263ac6f5f`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	62c32067f1	Use an index to get FDWs that depend on extensions (#7574 ) DESCRIPTION: Fix performance issue when distributing a table that depends on an extension When the database contains many objects this function would show up in profiles because it was doing a sequence scan on pg_depend. And with many objects pg_depend can get very large. This starts using an index scan to only look for rows containing FDWs, of which there are expected to be very few (often even zero). (cherry picked from commit `16604a6601`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	d9069b1e01	Speed up SequenceUsedInDistributedTable (#7579 ) DESCRIPTION: Fix performance issue when creating distributed tables if many already exist This builds on the work to speed up EnsureSequenceTypeSupported, and now does something similar for SequenceUsedInDistributedTable. SequenceUsedInDistributedTable had a similar O(number of citus tables) operation. This fixes that and speeds up creation of distributed tables significantly when many distributed tables already exist. Fixes #7022 (cherry picked from commit `cdf51da458`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	fa6743d436	Speed up EnsureSequenceTypeSupported (#7575 ) DESCRIPTION: Fix performance issue when creating distributed tables and many already exist EnsureSequenceTypeSupported was doing an O(number of distributed tables) operation. This can become very slow with lots of Citus tables, which now happens much more frequently in practice due to schema based sharding. Partially addresses #7022 (cherry picked from commit `381f31756e`)	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	26178fb538	Add missing postgres.h includes After sorting includes in the previous commit some files were now invalid because they were not including postgres.h	2024-04-17 10:26:50 +02:00
Jelte Fennema-Nio	48c62095ff	Actually sort includes after cherry-pick	2024-04-17 10:26:50 +02:00
Nils Dijk	75df19b616	move pg_version_constants.h to toplevel include (#7335 ) In preparation of sorting and grouping all includes we wanted to move this file to the toplevel includes for good grouping/sorting. (cherry picked from commit `0dac63afc0`)	2024-04-17 10:26:50 +02:00
Gürkan İndibay	e2d18c5472	Bump Citus version to 12.1.2 (#7504 )	2024-02-14 08:41:15 +03:00
Teja Mupparti	a945971f48	Fix the incorrect column count after ALTER TABLE, this fixes the bug #7378 (please read the analysis in the bug for more information) (cherry picked from commit `00068e07c5`)	2024-01-24 11:48:06 -08:00
Hanefi Onaldi	2c630eca50	Bump Citus version to 12.1.1	2023-11-13 14:47:11 +03:00
Onur Tirtir	a4fe969947	Make sure to disallow creating a replicated distributed table concurrently (#7219 ) See explanation in https://github.com/citusdata/citus/issues/7216. Fixes https://github.com/citusdata/citus/issues/7216. DESCRIPTION: Makes sure to disallow creating a replicated distributed table concurrently (cherry picked from commit `111b4c19bc`)	2023-10-24 14:04:37 +03:00
Nils Dijk	e59ffbf549	Fix leaking of memory and memory contexts in Foreign Constraint Graphs (#7236 ) DESCRIPTION: Fix leaking of memory and memory contexts in Foreign Constraint Graphs Previously, every time we (re)created the Foreign Constraint Relationship Graph, we created a new Memory Context while loosing a reference to the previous context. This old context could still have left over memory in there causing a memory leak. With this patch we statically have one memory context that we lazily initialize the first time we create our foreign constraint relationship graph. On every subsequent creation, beside destroying our previous hashmap we also reset our memory context to remove any left over references.	2023-10-09 13:07:30 +02:00
Naisila Puka	9b6ffece5e	Adds PostgreSQL 16.0 Support (#7201 ) This commit concludes PG16.0 Support in Citus. The main PG16 support work has been done for 16beta3 https://github.com/citusdata/citus/pull/6952 There was some extra work needed for 16rc1 https://github.com/citusdata/citus/pull/7173 And this PR yet introduces some extra work needed to 16.0 :) `pgstat_fetch_stat_local_beentry` has been renamed to `pgstat_get_local_beentry_by_index` in PG16.0 Relevant PG commit: `8dfa37b797` 8dfa37b797843a83a5756ea3309055e8953e1a86 Sister PR https://github.com/citusdata/the-process/pull/150 (cherry picked from commit `4e46708789`)	2023-09-15 12:27:09 +03:00
aykut-bozkurt	1b4d7a51f8	bump citus into 12.1.0	2023-09-13 14:20:21 +03:00
Gürkan İndibay	e5e64b7454	Adds alter database propagation - with and refresh collation (#7172 ) DESCRIPTION: Adds ALTER DATABASE WITH ... and REFRESH COLLATION VERSION support This PR adds supports for basic ALTER DATABASE statements propagation support. Below statements are supported: ALTER DATABASE <database_name> with IS_TEMPLATE <true/false>; ALTER DATABASE <database_name> with CONNECTION LIMIT <integer_value>; ALTER DATABASE <database_name> REFRESH COLLATION VERSION; --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2023-09-12 14:09:15 +03:00
Naisila Puka	1da99f8423	PG16 - Don't propagate GRANT ROLE with INHERIT/SET option (#7190 ) We currently don't support propagating these options in Citus Relevant PG commits: https://github.com/postgres/postgres/commit/e3ce2de https://github.com/postgres/postgres/commit/3d14e17 Limitation: We also need to take care of generated GRANT statements by dependencies in attempt to distribute something else. Specifically, this part of the code in `GenerateGrantRoleStmtsOfRole`: ``` grantRoleStmt->admin_opt = membership->admin_option; ``` In PG16, membership also has `inherit_option` and `set_option` which need to properly be part of the `grantRoleStmt`. We can skip for now since #7164 will take care of this soon, and also this is not an expected use-case.	2023-09-12 12:47:37 +03:00
Naisila Puka	c1dc378504	Fix WITH ADMIN FALSE propagation (#7191 )	2023-09-11 15:58:24 +03:00
Onur Tirtir	d628a4c21a	Add citus_schema_move() function (#7180 ) Add citus_schema_move() that can be used to move tenant tables within a distributed schema to another node. The function has two variations as simple wrappers around citus_move_shard_placement() and citus_move_shard_placement_with_nodeid() respectively. They pick a shard that belongs to the given tenant schema and resolve the source node that contain the shards under given tenant schema. Hence their signatures are quite similar to underlying functions: ```sql -- citus_schema_move(), using target node name and node port CREATE OR REPLACE FUNCTION pg_catalog.citus_schema_move( schema_id regnamespace, target_node_name text, target_node_port integer, shard_transfer_mode citus.shard_transfer_mode default 'auto') RETURNS void LANGUAGE C STRICT AS 'MODULE_PATHNAME', $$citus_schema_move$$; -- citus_schema_move(), using target node id CREATE OR REPLACE FUNCTION pg_catalog.citus_schema_move( schema_id regnamespace, target_node_id integer, shard_transfer_mode citus.shard_transfer_mode default 'auto') RETURNS void LANGUAGE C STRICT AS 'MODULE_PATHNAME', $$citus_schema_move_with_nodeid$$; ```	2023-09-08 12:03:53 +03:00
Naisila Puka	8894c76ec0	PG16 - Add rules option to CREATE COLLATION (#7185 ) Relevant PG commit: https://github.com/postgres/postgres/commit/30a53b7 30a53b7	2023-09-07 13:50:47 +03:00
Naisila Puka	2df88042b3	Add tests with JSON_ARRAYAGG and JSON_OBJECTAGG aggregates (#7186 ) Relevant PG commit: `7081ac46ac` 7081ac46ace8c459966174400b53418683c9fe5c	2023-09-07 13:29:39 +03:00
Naisila Puka	7e5136f2de	Add tests with publications with schema and table of the same schema (#7184 ) Relevant PG commit: https://github.com/postgres/postgres/commit/13a185f 13a185f It was backpatched through PG15 so I added this test in publication.sql instead of pg16.sql	2023-09-06 16:40:36 +03:00
Naisila Puka	b2fc763bc3	PG16 - Add tests with random_normal (#7183 ) Relevant PG commit: https://github.com/postgres/postgres/commit/38d8176	2023-09-06 14:57:24 +03:00
Naisila Puka	5c658b4eb7	PG16 - Add citus_truncate_trigger for Citus foreign tables (#7170 ) Since in PG16, truncate triggers are supported on foreign tables, we add the citus_truncate_trigger to Citus foreign tables as well, such that the TRUNCATE command is propagated to the table's single local shard as well. Note that TRUNCATE command was working for foreign tables even before this commit: see https://github.com/citusdata/citus/pull/7170#issuecomment-1706240593 for details This commit also adds tests with user-enabled truncate triggers on Citus foreign tables: both trigger on the shell table and on its single foreign local shard. Relevant PG commit: https://github.com/postgres/postgres/commit/3b00a94	2023-09-05 19:42:39 +03:00
zhjwpku	205b159606	get rid of {Push/Pop}OverrideSearchPath (#7145 )	2023-09-05 17:40:22 +02:00
aykut-bozkurt	8eb3360017	Fixes visibility problems with dependency propagation (#7028 ) Problem: Previously we always used an outside superuser connection to overcome permission issues for the current user while propagating dependencies. That has mainly 2 problems: 1. Visibility issues during dependency propagation, (metadata connection propagates some objects like a schema, and outside transaction does not see it and tries to create it again) 2. Security issues (it is preferrable to use current user's connection instead of extension superuser) Solution (high level): Now, we try to make a smarter decision on whether should we use an outside superuser connection or current user's metadata connection. We prefer using current user's connection if any of the objects, which is already propagated in the current transaction, is a dependency for a target object. We do that since we assume if current user has permissions to create the dependency, then it can most probably propagate the target as well. Our assumption is expected to hold most of the times but it can still be wrong. In those cases, transaction would fail and user should set the GUC `citus.create_object_propagation` to `deferred` to work around it. Solution: 1. We track all objects propagated in the current transaction (we can handle subtransactions), 2. We propagate dependencies via the current user's metadata connection if any dependency is created in the current transaction to address issues listed above. Otherwise, we still use an outside superuser connection. DESCRIPTION: Fixes some object propagation errors seen with transaction blocks. Fixes https://github.com/citusdata/citus/issues/6614 --------- Co-authored-by: Nils Dijk <nils@citusdata.com>	2023-09-05 18:04:16 +03:00
Naisila Puka	9f067731c0	Adds PostgreSQL 16 RC1 support (#7173 )	2023-09-05 14:32:41 +03:00
Emel Şimşek	a849570f3f	Improve the performance of CitusHasBeenLoaded function for a database that does not do CREATE EXTENSION citus but load citus.so. (#7123 ) For a database that does not create the citus extension by running ` CREATE EXTENSION citus;` `CitusHasBeenLoaded ` function ends up querying the `pg_extension` table every time it is invoked. This is not an ideal situation for a such a database. The idea in this PR is as follows: ### A new field in MetadataCache. Add a new variable `extensionCreatedState `of the following type: ``` typedef enum ExtensionCreatedState { UNKNOWN = 0, CREATED = 1, NOTCREATED = 2, } ExtensionCreatedState; ``` When the MetadataCache is invalidated, `ExtensionCreatedState` will be set to UNKNOWN. ### Invalidate MetadataCache when CREATE/DROP/ALTER EXTENSION citus commands are run. - Register a callback function, named `InvalidateDistRelationCacheCallback`, for relcache invalidation during the shared library initialization for `citus.so`. This callback function is invoked in all the backends whenever the relcache is invalidated in one of the backends. (This could be caused many DDLs operations). - In the cache invalidation callback,` InvalidateDistRelationCacheCallback`, invalidate `MetadataCache` zeroing it out. - In `CitusHasBeenLoaded`, perform the costly citus is loaded check only if the `MetadataCache` is not valid. ### Downsides Any relcache invalidation (caused by various DDL operations) will case Citus MetadataCache to get invalidated. Most of the time it will be unnecessary. But we rely on that DDL operations on relations will not be too frequent.	2023-09-05 13:29:35 +03:00
Hanefi Onaldi	c22547d221	Create a new colocation properly after braking one When braking a colocation, we need to create a new colocation group record in pg_dist_colocation for the relation. It is not sufficient to have a new colocationid value in pg_dist_partition only. This patch also fixes a bug when deleting a colocation group if no tables are left in it. Previously we passed a relation id as a parameter to DeleteColocationGroupIfNoTablesBelong function, where we should have passed a colocation id.	2023-09-05 10:58:46 +03:00
Jelte Fennema	bdf085eabb	Add some small improvements to python testing framework (#7159 ) 1. Adds an `sql_row` function, for when a query returns a single row with multiple columns. 2. Include a `notice_handler` for easier debugging 3. Retry dropping replication slots when they are "in use", this is often an ephemeral state and can cause flaky tests	2023-09-05 09:34:56 +02:00
Ivan Vyazmitinov	e94bf93152	#6548 2PC recovery is extremely ineffective on a cluster with multiple DATABASEs fix (#7174 )	2023-09-04 15:28:22 +02:00
Naisila Puka	de9af078b0	PG16 - Add reindex database/system tests (#7167 ) In PG16, REINDEX DATABASE/SYSTEM name is optional. We already don't propagate these commands automatically. Testing here with run_command_on_workers. Relevant PG commit: https://github.com/postgres/postgres/commit/2cbc3c1	2023-09-04 11:31:57 +03:00
Naisila Puka	cf71e80bfd	PG16 - Add tests for createdb with ICU_RULES option (#7161 ) When we create a database, it already needs to be manually created in the workers as well. This new icu_rules option should work as the other options as well. Added a test for that. Relevant PG commit: https://github.com/postgres/postgres/commit/30a53b7	2023-09-04 11:13:46 +03:00
zhjwpku	9fd4ef042f	avoid rebuilding MetadataCache for each placement insertion (#7163 )	2023-09-04 09:57:25 +02:00
zhjwpku	5034f8eba5	polish the codebase by fixing dozens of typos (#7166 )	2023-09-01 12:21:53 +02:00
Naisila Puka	05443a77ad	Adds test for COPY FROM failure in Citus foreign tables (#7160 )	2023-09-01 12:20:07 +03:00
Gürkan İndibay	b8bded6454	Adds citus_pause_node udf (#7089 ) DESCRIPTION: Presenting citus_pause_node UDF enabling pausing by node_id. citus_pause_node takes a node_id parameter and fetches all the shards in that node and puts AccessExclusiveLock on all the shards inside that node. With this lock, insert is disabled, until citus_pause_node transaction is closed. --------- Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2023-09-01 11:39:30 +03:00
Gürkan İndibay	4a1a5491ce	Refactors grant statements (#7153 ) DESCRIPTION: Refactors all grant statements to use common code blocks to deparse	2023-09-01 09:49:46 +03:00
zhjwpku	f03291a8c8	remove useless code block (#7158 )	2023-08-29 17:15:22 +02:00
Naisila Puka	a17fae36b9	Disable statistics collection (#7162 ) Enabled by mistake in `ba40eb363c`	2023-08-29 16:09:19 +03:00
Onur Tirtir	a830862717	Not undistribute Citus local table when converting it to a reference table / single-shard table	2023-08-29 12:57:28 +03:00
Onur Tirtir	34e3119b48	Intersect shard placements in a table type agnostic way If we're in the middle of a table type conversion (such as from Citus local table to a reference table), the table might not have all the placements that we expect from the table type. For this reason, we should intersect the placements of tables at hand when creating inter-shard ddl tasks.	2023-08-29 12:57:28 +03:00
Onur Tirtir	5bdf19f517	Use CopyShardForeignConstraintCommandList in WorkerCreateShardCommandList What we do to collect foreign key constraint commands in WorkerCreateShardCommandList is quite similar to what we do in CopyShardForeignConstraintCommandList. Plus, the code that we used in WorkerCreateShardCommandList before was not able to properly handle foreign key constraints between Citus local tables --when creating a reference table from the referencing one. With a few slight modifications made to CopyShardForeignConstraintCommandList, we can use the same logic in WorkerCreateShardCommandList too.	2023-08-29 12:57:28 +03:00
zhjwpku	d97f786296	PQputCopyData's return value 0 should be considered fail (#7152 )	2023-08-29 11:19:18 +02:00
Onur Tirtir	d5d1684c45	Use correct errorCode for the errors thrown during recovery (#7146 )	2023-08-28 11:03:38 +03:00
Naisila Puka	afab879de3	PG16 - Add COPY FROM default tests (#7143 ) Already supported in Citus, adding the same tests as in PG Relevant PG commit: https://github.com/postgres/postgres/commit/9f8377f	2023-08-24 15:52:09 +03:00
Naisila Puka	70c8aba967	PG16 - Add tests for CREATE/ALTER TABLE .. STORAGE (#7140 ) Relevant PG commits: https://github.com/postgres/postgres/commit/784cedd https://github.com/postgres/postgres/commit/b9424d0	2023-08-24 15:26:40 +03:00
Gürkan İndibay	8d3a06c1c7	Adds grant/revoke privileges on database propagation (#7109 ) DESCRIPTION: Adds grant/revoke propagation support for database privileges Following the implementation of support for granting and revoking database privileges, certain tests that issued grants for worker nodes experienced failures. These ones are fixed in this PR as well.	2023-08-24 14:43:19 +03:00
Naisila Puka	b8c493f2c4	PG16 - Add GENERIC_PLAN option to EXPLAIN (#7141 )	2023-08-23 20:15:54 +03:00
Naisila Puka	c73ef405f5	PG16 - IS JSON predicate and SYSTEM_USER tests (#7137 ) Support the IS JSON predicate Relevant PG commit: https://github.com/postgres/postgres/commit/6ee30209 SYSTEM_USER Relevant PG commit: https://github.com/postgres/postgres/commit/0823d061	2023-08-23 14:13:56 +03:00
Marco Slot	ba55fd67d7	Rename planner_readme.md to README.md (#7139 )	2023-08-23 13:47:18 +03:00
Naisila Puka	36b51d617c	PG16 - Throw meaningful error for stats without a name on Citus tables (#7136 ) Relevant PG commit: `624aa2a13b` 624aa2a13bd02dd584bb0995c883b5b93b2152df	2023-08-23 10:25:01 +03:00
Gürkan İndibay	371f094b68	Removes pg_send_cancellation (#7135 ) DESCRIPTION: Removes pg_send_cancellation and all references	2023-08-21 17:29:44 +03:00
zhjwpku	ba2a0aec16	fix some obvious typo and reduce usage of magic number (#7130 ) fix some obvious typo and reduce usage of magic number Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2023-08-18 14:50:20 +00:00
Naisila Puka	682dca1f12	Adds PG16Beta3 support (#6952 ) DESCRIPTION: Adds PG16Beta3 support This is the final commit that adds PG16 compatibility with Citus's current features. You can use Citus community with PG16Beta3. This commit: - Enables PG16 in the configure script. - Adds PG16 tests to CI using test images that have 16beta3 - Skips wal2json cdc test since wal2json package is not available for PG16 yet - Fixes an isolation test Several PG16 Compatibility commits have been merged before this final one. All these subtasks are done https://github.com/citusdata/citus/issues/7017 See the list below: 1 - `42d956888d` Resolve compilation issues 2 - `0d503dd5ac` Ruleutils and successful CREATE EXTENSION 3 - `907d72e60d` Some test outputs 4 - `7c6b4ce103` Outer join checks, subscription password, crash fixes 5 - `6056cb2c29` get_relation_info hook to avoid crash from adjusted partitioning 6 - `b36c431abb` Rework PlannedStmt and Query's Permission Info 7 - `ee3153fe50` More test output fixes 8 - `2c50b5f7ff` varnullingrels additions 9 - `b2291374b4` More test output fixes 10- `a2315fdc67` New options to vacuum and analyze 11- `9fa72545e2` Fix AM dependency and grant's admin option 12- `2d6cf8e79a` One more outer join check Stay tuned for PG16 new features in Citus :)	2023-08-17 21:02:59 +03:00
Naisila Puka	2d6cf8e79a	PG16 compatibility - one more outer join check (#7126 ) PG16 compatibility - part 11 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` part 10 `a2315fdc67` part 11 `9fa72545e2` This commit is in the series of PG16 compatibility commits. We already took care of the majority of necessary outer join checks in part 4 `7c6b4ce103` However, In RelationInfoContainsOnlyRecurringTuples, we need to add one more check of whether we are dealing with an outer join RTE using IsRelOptOuterJoin function. This prevents an outer join crash in sqlancer_failures.sql test. We expect one more commit of PG compatibility with Citus's current features are regression tests sanity.	2023-08-17 19:07:18 +03:00
zhjwpku	b10320be6f	fix wrong type convertion (#7116 ) partitionMethod and replicationModel are both type char, there seems meaningless to convert them to type Oid implicitly.	2023-08-17 13:53:43 +02:00
Naisila Puka	a5ce601c07	Bump PG14 and PG15 versions for CI tests (#7111 ) Postgres got minor updates on Aug10, this commit starts using the images with the latest version for our tests, namely 14.9 and 15.4. Depends on https://github.com/citusdata/the-process/pull/147 For CI images, we needed to regenerate Pipfile.lock, mainly because of an issue with pyyaml version: https://github.com/yaml/pyyaml/issues/601 We also needed to remove a failing test in subquery_local_tables.sql. Relevant PG commit: `b0e390e6d1` b0e390e6d1d68b92e9983840941f8f6d9e083fe0 Issue: https://github.com/citusdata/citus/issues/7119 For joins where consider_join_pushdown is false, we cannot get the information that we used to get, which prevents doing the distributed planning. Team already contacted PG committers for this. Until then, we remove the test from the schedule.	2023-08-17 11:53:19 +03:00
Naisila Puka	9fa72545e2	PG16 compatibility - fix AM dependency and grant's admin option (#7113 ) PG16 compatibility - part 11 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` part 10 `a2315fdc67` This commit is in the series of PG16 compatibility commits. It fixes AM dependency and grant's admin option: - Fix with admin option in grants grantstmt->admin_opt no longer exists in PG16 instead, grantstmt has a list of options, one of them is admin option. Relevant PG commit: `e3ce2de09d` e3ce2de09d814f8770b2e3b3c152b7671bcdb83f - Fix pg_depend entry to AMs after ALTER TABLE .. SET ACCESS METHOD Relevant PG commit: `97d8910104` 97d89101045fac8cb36f4ef6c08526ea0841a596 More PG16 compatibility commits are coming soon: We are very close to merging "PG16Beta3 Support - Regression tests sanity"	2023-08-17 11:22:34 +03:00
Naisila Puka	71c475af52	Fix GetUndistributableDependency (#7124 ) This is a leftover task from merging enterprise to community. Roles are distributed in community now, the comment is stale and the check is redundant.	2023-08-17 10:57:22 +03:00
Naisila Puka	a2315fdc67	PG16 compatibility - new options to vacuum and analyze (#7114 ) PG16 compatibility - part 10 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` This commit is in the series of PG16 compatibility commits. It: - Adds buffer_usage_limit to vacuum and analyze - Adds process_main, skip_database_stats, only_database_stats to vacuum Important Note: adding these options is actually required for check-vanilla tests to succeed. However, in concept, this PR belongs to "PG16 new features", rather than "PG16 regression tests sanity" Relevant PG commits: `1cbbee0338` 1cbbee03385763b066ae3961fc61f2cd01a0d0d7 `4211fbd841` 4211fbd8413b26e0abedbe4338aa7cda2cd469b4 `a46a7011b2` a46a7011b27188af526047a111969f257aaf4db8 More PG16 compatibility commits are coming soon ...	2023-08-16 16:18:28 +03:00
Naisila Puka	b982f2dee6	Changes PROCESS_TOAST default value to true (#7122 ) Process toast should be true by default, like in PG.	2023-08-16 14:40:24 +03:00
Naisila Puka	b2291374b4	PG16 compatibility - more test output fixes (#7112 ) PG16 compatibility - part 9 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - Fix multi_subquery_in_where_reference_clause test somehow PG got rid of the outer join (e.g., explain doesn't show outer joins), hence we can pushdown the subquery. Changing to users_reference_table - Fix unqualified column names for views in PG16 Relevant PG commit: `47bb9db759` 47bb9db75996232ea71fc1e1888ffb0e70579b54 - Fix global_cancel test Error wording and detail changed Relevant PG commit: `2631ebab7b` 2631ebab7b18bdc079fd86107c47d6104a6b3c6e - Fix local_table_join_test with lateral subquery Possible relevant PG commit: `ae89129aa3` ae89129aa3555c263b8c3ccc4c0f1ef7e46201aa I removed the where clause and the limit count error was hit again. With the where clause the query unexpectedly works. - Fix test outputs Relevant PG commits: -- `1349d2790b` -- `f4c7c410ee` For multi_explain and multi_complex_count_distinct there were too many places touched so I just added an alternative test output. For the other tests I modified the problematic parts. More PG16 compatibility commits are coming soon ...	2023-08-15 13:49:25 +03:00
Naisila Puka	2c50b5f7ff	PG16 compatibility - varnullingrels additions (#7107 ) PG16 compatibility - part 7 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` This commit is in the series of PG16 compatibility commits. PG16 introduced a new entry varnnullingrels to Var, which represents our partkey in pg_dist_partition. This commit does the necessary changes in Citus to support this. Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-15 13:07:55 +03:00
Naisila Puka	ee3153fe50	PG16 compatibility - more test output fixes (#7108 ) PG16 compatibility - part 7 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - PG16 removed logic for converting a table to a view Relevant PG commit: `b23cd185fd` b23cd185fd5410e5204683933f848d4583e34b35 - Fix changed error message in certificate verification Relevant PG commit: `8eda731465` 8eda7314652703a2ae30d6c4a69c378f6813a7f2 - Fix backend type order in tests Relevant PG commit: `0c679464a8` 0c679464a837079acc75ff1d45eaa83f79e05690 - Reduce log level to omit extra NOTICE in create collation in PG16 Relevant PG commit: `a14e75eb0b` a14e75eb0b6a73821e0d66c0d407372ec8376105 That commit made LOCALE parameter apply regardless of the provider used, and it printed the following notice: NOTICE: using standard form "und-u-ks-level2" for ICU locale "@colStrength=secondary" We omit this notice to omit output change between pg versions. - Fix columnar_memory test TopMemoryContext now has more children contexts Possible relevant PG commit: `9d3ebba729` 9d3ebba729ebaf5882a92f0f5f662a3312037605 memusage is now around 8.5 MB, whereas it was less than 8MB before. To avoid differences between PG versions, I changed the test to compare to less than 9 MB. It still reflects very well the improvement from 28MB. - Alternative test output for GRANTOR values in pg_auth_members grantor changed in PG16 Relevant PG commit: `ce6b672e44` ce6b672e4455820a0348214be0da1a024c3f619f - Remove redundant grouping columns from our tests Relevant PG commit: `8d83a5d0a2` 8d83a5d0a2673174dc478e707de1f502935391a5 - Fix tests with different order in Filters Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-09 18:04:32 +03:00
Naisila Puka	b36c431abb	PG16 compatibility - Rework PlannedStmt and Query's Permission Info (#7098 ) PG16 compatibility - Part 6 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` This commit is in the series of PG16 compatibility commits. It handles the Permission Info changes in PG16. See below: The main issue lies in the following entries of PlannedStmt: { rtable permInfos } Each rtable has an int perminfoindex, and its actual permission info is obtained through the following: permInfos[perminfoindex] We had crashes because perminfoindexes were not updated in the finalized planned statement after distributed planner hook. So, basically, everywhere we set a query's or planned statement's rtable entry, we need to set the rteperminfos/permInfos accordingly. Relevant PG commits: `a61b1f7482` a61b1f74823c9c4f79c95226a461f1e7a367764b `b803b7d132` b803b7d132e3505ab77c29acf91f3d1caa298f95 More PG16 compatibility commits are coming soon ...	2023-08-09 15:23:00 +03:00
Naisila Puka	6056cb2c29	PG16 compatibility - get_relation_info hook to avoid crash from adjusted partitioning (#7099 ) PG16 compatibility - Part 5 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` This commit is in the series of PG16 compatibility commits. Find the explanation below: If we allow to adjust partitioning, we get a crash when accessing amcostestimate of partitioned indexes, because amcostestimate is NULL for them. The following PG commit is the culprit: `3c569049b7` 3c569049b7b502bb4952483d19ce622ff0af5fd6 Previously, partitioned indexes would just be ignored. Now, they are added in the list. However get_relation_info expects the tables which have partitioned indexes to have the inh flag set properly. AdjustPartitioningForDistributedPlanning plays with that flag, hence we don't get the desired behaviour. The hook is simply removing all partitioned indexes from the list. More PG16 compatibility commits are coming soon ...	2023-08-08 15:51:21 +03:00
Naisila Puka	7c6b4ce103	PG16 compatibility - outer join checks, subscription password, crash fixes (#7097 ) PG16 compatibility - Part 4 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` This commit is in the series of PG16 compatibility commits. It adds some outer join checks to the planner, the new password_required option to the subscription, and a crash fix related to PGIOAlignedBlock, see below for more details: - Fix PGIOAlignedBlock Assert crash in PG16 Relevant PG commit: `faeedbcefd` faeedbcefd40bfdf314e048c425b6d9208896d90 - Pass planner info as argument to make_simple_restrictinfo Pre PG16 passing plannerInfo to make_simple_restrictinfo was only needed for placeholder Vars, which is not the case in this part of the codebase because we are building the expression from shard intervals which don't have placeholder vars. However, PG16 is counting baserels appearing in clause_relids and is deleting the rels mentioned in plannerinfo->outer_join_rels Hence directly accessing plannerinfo. We will crash if we leave it as NULL. For reference `2489d76c49 (diff-e045c41eda9686451a7993e91518e40056b3739365e39eb1b70ae438dc1f7c76R207)` Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d - Add outer join checks, root->simple_rel_array - fix rebalancer to include passwork_required option Relevant PG commit: `c3afe8cf5a` c3afe8cf5a1e465bd71e48e4bc717f5bfdc7a7d6 More PG16 compatibility commits are coming soon ...	2023-08-04 14:51:28 +03:00
Naisila Puka	907d72e60d	PG16 compatibility - some test outputs (#7100 ) PG16 compatibility - Part 3 Check out part 1 `42d956888d` and part 2 `0d503dd5ac` This commit is in the series of PG compatibility. It makes some changes to our tests in order to be compatible with the following in PG16: Use debug_parallel_query in PG16+, force_parallel_mode otherwise Relevant PG commit `5352ca22e0` 5352ca22e0012d48055453ca9992a9515d811291 HINT changed to DETAIL in PG16 Relevant PG commit: `56d0ed3b75` 56d0ed3b756b2e3799a7bbc0ac89bc7657ca2c33 Fix removed read-only server setting lc_collate Relevant PG commit: `b0f6c43716` b0f6c437160db640d4ea3e49398ebc3ba39d1982 Fix unsupported join alias expression in sqlancer_failures Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-04 13:03:15 +03:00
Önder Kalacı	4ae3982d14	Add single-shard router Merge command support (#7088 ) Similar to https://github.com/citusdata/citus/pull/7077. As PG 16+ has changed the join restriction information for certain outer joins, MERGE is also impacted given that is is also underlying an outer join. See #7077 for the details.	2023-08-04 08:16:29 +03:00
Naisila Puka	0d503dd5ac	PG16 compatibility: ruleutils and successful CREATE EXTENSION (#7087 ) PG16 compatibility - Part 2 Part 1 provided successful compilation against pg16beta2. `42d956888d` This PR provides ruleutils changes with pg16beta2 and successful CREATE EXTENSION command. Note that more changes are needed in order to have successful regression tests. More commits are coming soon ... For any_value changes, I referred to this commit `8ef94dc1f5` where we did something similar for PG14 support.	2023-08-02 16:04:51 +03:00
Önder Kalacı	960a5f6104	Improve failure handling of distributed execution (#7090 ) Prior to this commit, the code would skip processing the errors happened for local commands. Prior to https://github.com/citusdata/citus/pull/5379, it might make sense to allow the execution continue. But, as of today, if a modification fails on any placement, we can safely fail the execution. The first commit show the problem in action. The second commit includes the fix and the test fixes.	2023-08-01 16:47:59 +03:00
Onur Tirtir	dd6ea1ebd5	Makes sure to handle NULL constraints for ADD COLUMN commands (#7093 ) DESCRIPTION: Fixes a bug that causes an unexpected error when adding a column with a NULL constraint Fixes https://github.com/citusdata/citus/issues/7092.	2023-08-01 11:07:47 +03:00
Önder Kalacı	cb5eb73048	Add support for router INSERT .. SELECT commands (#7077 ) Tradionally our planner works in the following order: router - > pushdown -> repartition -> pull to coordinator However, for INSERT .. SELECT commands, we did not support "router". In practice, that is not a big issue, because pushdown planning can handle router case as well. However, with PG 16, certain outer joins are converted to JOIN without any conditions (e.g., JOIN .. ON (true)) and the filters are pushed down to the tables. When the filters are pushed down to the tables, router planner can detect. However, pushdown planner relies on JOIN conditions. An example query: ``` INSERT INTO agg_events (user_id) SELECT raw_events_first.user_id FROM raw_events_first LEFT JOIN raw_events_second ON raw_events_first.user_id = raw_events_second.user_id WHERE raw_events_first.user_id = 10; ``` As a side effect of this change, now we can also relax certain limitation that "pushdown" planner emposes, but not "router". So, with this PR, we also allow those. Closes https://github.com/citusdata/citus/pull/6772 DESCRIPTION: Prevents unnecessarily pulling the data into coordinator for some INSERT .. SELECT queries that target a single-shard group	2023-07-28 15:07:20 +03:00
Teja Mupparti	846cbc3a39	In the MERGE join clause, there is a datatype mismatch between target's distribution column and the expression originating from the source. If the types are different, Citus uses different hash functions for the two column types, which might lead to incorrect repartitioning of the result data	2023-07-27 16:06:00 -07:00
Nils Dijk	186804c119	fix flappyness of shard_rebalancer operations test (#7083 ) Fixes flappyness where the order of shards was dependent on the physical layout in the heap. Failed here https://app.circleci.com/pipelines/github/citusdata/citus/33844/workflows/1651f8f5-6e6a-457e-9d35-34b8788ea6d1/jobs/1189836 ```diff --- /home/circleci/project/src/test/regress/expected/shard_rebalancer.out.modified 2023-07-24 12:51:27.126284675 +0000 +++ /home/circleci/project/src/test/regress/results/shard_rebalancer.out.modified 2023-07-24 12:51:27.170285079 +0000 @@ -2571,24 +2571,24 @@ CREATE TABLE test_with_all_shards_excluded(a int PRIMARY KEY); SELECT create_distributed_table('test_with_all_shards_excluded', 'a', colocate_with:='none', shard_count:=4); create_distributed_table -------------------------- (1 row) SELECT shardid FROM pg_dist_shard; shardid --------- - 433504 433505 433506 433507 + 433504 (4 rows) SELECT rebalance_table_shards('test_with_all_shards_excluded', excluded_shard_list:='{102073, 102074, 102075, 102076}'); rebalance_table_shards ------------------------ (1 row) DROP TABLE test_with_all_shards_excluded; SET citus.shard_count TO 2; ```	2023-07-27 16:24:35 +02:00
zhjwpku	6a00517312	[typo] fix typo in comments (#7073 ) %s/pg_dist_local_node_group/pg_dist_local_group/g Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2023-07-25 16:43:55 +03:00
Önder Kalacı	862dae823e	Expand EnableNonColocatedRouterQueryPushdown to cover shard colocation (e.g., shard index) (#7076 ) Previously, we only checked whether the relations are colocated, but we ignore the shard indexes. That causes certain queries still to be accidentally router. We should enforce colocation checks for both shard index and table colocation id to make the check restrictive enough. For example, the following query should not be router, and after this patch, it won't: ```SQL SELECT user_id FROM ((SELECT user_id FROM raw_events_first WHERE user_id = 15) EXCEPT (SELECT user_id FROM raw_events_second where user_id = 17)) as foo; ``` DESCRIPTION: Enforce shard level colocation with citus.enable_non_colocated_router_query_pushdown	2023-07-25 16:20:13 +03:00
ahmet gedemenli	3f11139b5c	Do not move a shard to a node that it already exists on	2023-07-25 13:38:33 +03:00
ahmet gedemenli	c968dc9c27	Do not rebalance if replication factor is greater than the node count	2023-07-25 13:38:33 +03:00
Naisila Puka	42d956888d	PG16 compatibility: Resolve compilation issues (#7005 ) This PR provides successful compilation against PG16Beta2. It does some necessary refactoring to prepare for full support of version 16, in https://github.com/citusdata/citus/pull/6952 . Change RelFileNode to RelFileNumber or RelFileLocator Relevant PG commit b0a55e43299c4ea2a9a8c757f9c26352407d0ccc new header for varatt.h Relevant PG commit: d952373a987bad331c0e499463159dd142ced1ef drop support for Abs, use fabs Relevant PG commit 357cfefb09115292cfb98d504199e6df8201c957 tuplesort PGcommit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Relevant PG commit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Fix vacuum in columnar Relevant PG commit: 4ce3afb82ecfbf64d4f6247e725004e1da30f47c older one: b6074846cebc33d752f1d9a66e5a9932f21ad177 Add alloc_flags to pg_clean_ascii Relevant PG commit: 45b1a67a0fcb3f1588df596431871de4c93cb76f Merge GetNumConfigOptions() into get_guc_variables() Relevant PG commit: 3057465acfbea2f3dd7a914a1478064022c6eecd Minor PG refactor PG_FUNCNAME_MACRO __func__ Relevant PG commit 320f92b744b44f961e5d56f5f21de003e8027a7f Pass NULL context to stringToQualifiedNameList, typeStringToTypeName The pre-PG16 error behaviour for the following stringToQualifiedNameList & typeStringToTypeName was ereport(ERROR, ...) Now with PG16 we have this context input. We preserve the same behaviour by passing a NULL context, because of the following: (copy paste comment from PG16) If "context" isn't an ErrorSaveContext node, this behaves as errstart(ERROR, domain), and the errsave() macro ends up acting exactly like ereport(ERROR, ...). Relevant PG commit 858e776c84f48841e7e16fba7b690b76e54f3675 Use RangeVarCallbackMaintainsTable instead of RangeVarCallbackOwnsTable Relevant PG commit: 60684dd834a222fefedd49b19d1f0a6189c1632e FIX THIS: Not implemented grant-level control of role inheritance see PG commit e3ce2de09d814f8770b2e3b3c152b7671bcdb83f Make Scan node abstract PG commit: 8c73c11a0d39049de2c1f400d8765a0eb21f5228 Change in Var representations, get_relids_in_jointree PG commit 2489d76c4906f4461a364ca8ad7e0751ead8aa0d Deadlock detection changes because SHM_QUEUE is removed Relevant PG Commit: d137cb52cb7fd44a3f24f3c750fbf7924a4e9532 TU_UpdateIndexes Relevant PG commit 19d8e2308bc51ec4ab993ce90077342c915dd116 Use object_ownercheck and object_aclcheck functions Relevant PG commits: afbfc02983f86c4d71825efa6befd547fe81a926 c727f511bd7bf3c58063737bcf7a8f331346f253 Rework Permission Info for successful compilation Relevant PG commits: postgres/postgres@a61b1f7 postgres/postgres@b803b7d --------- Co-authored-by: onderkalaci <onderkalaci@gmail.com>	2023-07-21 14:32:37 +03:00
Naisila Puka	a282953274	Fix ScanKeyInit RegProcedure and Datum arguments (#7072 ) Index scans in PG16 return empty sets because of extra compatibility enforcement for `ScanKeyInit` arguments. Could be one of the relevant PG commits: `c8b2ef05f4` This PR fixes all incompatible `RegProcedure` and `Datum` arguments in all `ScanKeyInit` functions used throughout the codebase. Helpful for https://github.com/citusdata/citus/pull/6952	2023-07-21 14:11:10 +03:00
Teja Mupparti	87dc88f837	Isolate schema sharding/MERGE tests into a new file, and use the new GUC parameter	2023-07-19 12:23:45 -07:00
Halil Ozan Akgül	c99a93ffa7	Move SQL file changes for citus_shard_sizes fixes into the new 11.3-2 version (#7050 ) This PR moves `citus_shard_sizes` changes from #7003, and #7018 to into a new Citus version, 11.3-2	2023-07-14 17:19:54 +03:00
aykut-bozkurt	609a5465ea	Bump Citus version into 12.1devel (#7061 )	2023-07-14 13:12:30 +03:00
Gürkan İndibay	0f0b60c29c	Fix format attribute and IsLocalReplicationOriginSessionActive errors (#7055 ) This PR fixes the following: - in oraclelinux-7 `Make` step ``` /usr/bin/ld: utils/replication_origin_session_utils.o: relocation R_X86_64_PC32 against undefined symbol `IsLocalReplicationOriginSessionActive' can not be used when making a shared object; recompile with -fPIC /usr/bin/ld: final link failed: Bad value collect2: error: ld returned 1 exit status ``` `IsLocalReplicationOriginSessionActive` function has improper inline declaration, fixed that - in centos-7 `Make` step ``` utils/background_jobs.c: In function 'StartCitusBackgroundTaskExecutor': utils/background_jobs.c:1746:6: warning: function might be possible candidate for 'gnu_printf' format attribute [-Wsuggest-attribute=format] database, user, jobId, taskId); ^ ``` should use `pg_attribute_printf(3,4)` instead of `pg_attribute_printf(3,0)` since the number of arguments varies for `SafeSnprintf(char str, rsize_t count, const char fmt, ...)` --------- Co-authored-by: naisila <nicypp@gmail.com>	2023-07-13 17:41:57 +03:00
Onur Tirtir	f3cdb6d1bf	Deparse ALTER TABLE commands if ADD COLUMN is the only subcommand And stabilize multi_alter_table_statements.sql.	2023-07-12 18:17:47 +03:00
Onur Tirtir	6365f47b57	Properly handle index storage options for ADD CONSTRAINT / COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	ae142e1764	Properly handle IF NOT EXISTS for ADD COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	d4789a2c3a	Stabilize test helper sql files multi_test_helpers is run in parallel with others, so need to stabilize other test helpers too to make multi_test_helpers runnable multiple times.	2023-07-06 10:47:41 +03:00
Onur Tirtir	001437bdfe	Refactor AppendAlterTableCmdAddConstraint to reuse it for ADD COLUMN too	2023-07-06 10:47:41 +03:00
Onur Tirtir	56f1daa800	Refactor the code that extends constraint/index names on shards into a func	2023-07-06 10:47:41 +03:00
Onur Tirtir	ba1ea9b5bd	Refactor the code that prepares constraint objects in an alter table stmt into a func	2023-07-06 10:47:41 +03:00
Halil Ozan Akgül	613cced1ae	Use citus_shard_sizes in citus_tables (#7018 ) Fixes #7019 This PR updates citus_tables view to use citus_shard_sizes function, instead of citus_total_relation_size to improve performance.	2023-07-05 11:40:34 +03:00
aykut-bozkurt	719d92c8b9	mat view should not be converted to tenant table (#7043 ) We allow materialized view to exist in distrbuted schema but they should not be tried to be converted to a tenant table since they cannot be distributed. Fixes https://github.com/citusdata/citus/issues/7041	2023-07-04 17:28:03 +03:00
Ahmet Gedemenli	5051be86ff	Skip distributed schema insertion into pg_dist_schema, if already exists (#7044 ) Inserting into `pg_dist_schema` causes unexpected duplicate key errors, for distributed schemas that already exist. With this commit we skip the insertion if the schema already exists in `pg_dist_schema`. The error: ```sql SET citus.enable_schema_based_sharding TO ON; CREATE SCHEMA sc2; CREATE SCHEMA IF NOT EXISTS sc2; NOTICE: schema "sc2" already exists, skipping ERROR: duplicate key value violates unique constraint "pg_dist_schema_pkey" DETAIL: Key (schemaid)=(17294) already exists. ``` fixes: #7042	2023-07-04 15:19:07 +03:00
Gokhan Gulbiz	e0d3476526	Add locking mechanism for tenant monitoring probabilistic approach (#7026 ) This PR * Addresses a concurrency issue in the probabilistic approach of tenant monitoring by acquiring a shared lock for tenant existence checks. * Changes `citus.stat_tenants_sample_rate_for_new_tenants` type to double * Renames `citus.stat_tenants_sample_rate_for_new_tenants` to `citus.stat_tenants_untracked_sample_rate`	2023-07-03 13:08:03 +03:00
Jelte Fennema	ac24e11986	Change default rebalance strategy to by_disk_size (#7033 ) DESCRIPTION: Change default rebalance strategy to by_disk_size When introducing rebalancing by disk size we didn't make it the default initially. The main reason was, because we expected some problems with it. We have indeed had some problems/bugs with it over the years, and have fixed all of them. By now we're quite confident in its stability, and that it pretty much always gives better results than by_shard_count. So this PR makes by_disk_size the new default. We don't change the default when some other strategy than by_shard_count is the current default. This is in case someone defined their own rebalance strategy and marked this as the default themselves. Note: It explicitly does nothing during a downgrade, because there's no way of knowing if the rebalance strategy before the upgrade was by_disk_size or by_shard_count. And even in previous versions by_disk_size is considered superior for quite some time.	2023-07-03 11:08:24 +02:00
Jelte Fennema	fd1427de2c	Change by_disk_size rebalance strategy to have a base size (#7035 ) One problem with rebalancing by disk size is that shards in newly created collocation groups are considered extremely small. This can easily result in bad balances if there are some other collocation groups that do have some data. One extremely bad example of this is: 1. You have 2 workers 2. Both contain about 100GB of data, but there's a 70MB difference. 3. You create 100 new distributed schemas with a few empty tables in them 4. You run the rebalancer 5. Now all new distributed schemas are placed on the node with that had 70MB less. 6. You start loading some data in these shards and quickly the balance is completely off To address this edge case, this PR changes the by_disk_size rebalance strategy to add a a base size of 100MB to the actual size of each shard group. This can still result in a bad balance when shard groups are empty, but it solves some of the worst cases.	2023-06-27 16:37:09 +02:00
Halil Ozan Akgül	03a4769c3a	Fix Reference Table Check for CDC (#7025 ) Previously reference table check only looked at `partition method = 'n'`. This PR adds `replication model = 't'` to that.	2023-06-23 16:37:35 +03:00
Teja Mupparti	387b5f80f9	Fixes the bug#6785	2023-06-22 10:44:45 -07:00
Ahmet Gedemenli	99edb2675f	Improve error/hint messages related to schema-based sharding (#7027 ) Improve error/hint messages related to schema-based sharding	2023-06-22 18:10:12 +03:00
Ahmet Gedemenli	44e3c3b9c6	Improve error message for CREATE SCHEMA .. CREATE TABLE (#7024 ) Improve error message for CREATE SCHEMA .. CREATE TABLE when enable_schema_based_sharding is enabled.	2023-06-21 15:24:09 +03:00
aykut-bozkurt	565c5260fd	Properly handle error at owner check (#6984 ) We did not properly handle the error at ownership check method, which causes `max stack depth for errors` as in https://github.com/citusdata/citus/issues/6980. Fix: In case of an error, we should rollback subtransaction and throw the message with log level to `LOG_SERVER_ONLY`. Note: We prevent logs from the client to prevent pg vanilla test failures due to Citus logs which differs from the actual Postgres logs. (For context: https://github.com/citusdata/citus/pull/6130) I also needed to fix a flaky test: `multi_schema_support` DESCRIPTION: Fixes a bug related to non-existent objects in DDL commands. Fixes https://github.com/citusdata/citus/issues/6980	2023-06-21 14:50:01 +03:00
Naisila Puka	69af3e8509	Drop PG13 Support Phase 2 - Remove PG13 specific paths/tests (#7007 ) This commit is the second and last phase of dropping PG13 support. It consists of the following: - Removes all PG_VERSION_13 & PG_VERSION_14 from codepaths - Removes pg_version_compat entries and columnar_version_compat entries specific for PG13 - Removes alternative pg13 test outputs - Removes PG13 normalize lines and fix the test outputs based on that It is a continuation of `5bf163a27d`	2023-06-21 14:18:23 +03:00
aykut-bozkurt	1bb667ce6e	Fix create schema authorization bug (#7015 ) Fixes a bug related to `CREATE SCHEMA AUTHORIZATION <rolename>` for single shard tables. We should properly fetch schema name from role specification if schema name is not given.	2023-06-20 22:05:17 +03:00
aykut-bozkurt	f667f14029	Rewind tuple store to fix scrollable with hold cursor fetches (#7014 ) We need to rewind the tuplestorestate's tuple index to get correct results on fetching scrollable with hold cursors. `PersistHoldablePortal` is responsible for persisting out tuplestorestate inside a with hold cursor before commiting a transaction. It rewinds the cursor like below (`ExecutorRewindcalls` calls `rescan`): ```c if (portal->cursorOptions & CURSOR_OPT_SCROLL) { ExecutorRewind(queryDesc); } ``` At the end, it adjusts tuple index for holdStore in the portal properly. ```c if (portal->cursorOptions & CURSOR_OPT_SCROLL) { if (!tuplestore_skiptuples(portal->holdStore, portal->portalPos, true)) elog(ERROR, "unexpected end of tuple stream"); } ``` DESCRIPTION: Fixes incorrect results on fetching scrollable with hold cursors. Fixes https://github.com/citusdata/citus/issues/7010	2023-06-19 23:00:18 +03:00
Teja Mupparti	58da8771aa	This pull request introduces support for nonroutable merge commands in the following scenarios: 1) For distributed tables that are not colocated. 2) When joining on a non-distribution column for colocated tables. 3) When merging into a distributed table using reference or citus-local tables as the data source. This is accomplished primarily through the implementation of the following two strategies. Repartition: Plan the source query independently, execute the results into intermediate files, and repartition the files to co-locate them with the merge-target table. Subsequently, compile a final merge query on the target table using the intermediate results as the data source. Pull-to-coordinator: Execute the plan that requires evaluation at the coordinator, run the query on the coordinator, and redistribute the resulting rows to ensure colocation with the target shards. Direct the MERGE SQL operation to the worker nodes' target shards, using the intermediate files colocated with the data as the data source.	2023-06-19 12:23:40 -07:00
Xin Li	c10cb50aa9	Support custom cast from / to timestamptz in time partition management UDFs (#6923 ) This is to implement custom cast of table partition column type from / to `timestamptz` in time partition management UDFs, as proposed in ticket #6454 The general idea is for a time partition column with type other than `date`, `timestamp`, or `timestamptz`, users can provide custom bidirectional cast between the column type and `timestamptz`, the UDFs then will be able to create and drop time partitions for such tables. Fixes #6454 --------- Signed-off-by: Xin Li <xin@swirldslabs.com> Co-authored-by: Marco Slot <marco.slot@microsoft.com> Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2023-06-19 17:49:05 +03:00
Halil Ozan Akgül	d71ad4b65a	Add Publication Tests for Tenant Schema Tables (#7011 ) This PR adds schema based sharding tests to publication.sql file	2023-06-19 12:39:41 +03:00
aykut-bozkurt	fba5c8dd30	ALTER TABLE <tblname> SET SCHEMA <schemaname> for single shard tables (#7004 ) Adds support for altering schema of single shard tables. We do that in 2 steps. 1. Undistribute the tenant table at `preprocess` step, 2. Distribute new schema if it is a distributed schema after DDLs are propagated. DESCRIPTION: Adds support for altering a table's schema to/from distributed schemas.	2023-06-19 10:21:13 +03:00
Nils Dijk	ce2ba1d07e	Optimize QueryPushdownSqlTaskList on memory and cpu (#6945 ) While going over this piece of code (a long time ago) it was bothering to me we keep a bool array with the size of shardcount to iterate only over shards present in the list of non-pruned shards. Especially since we keep min/max of the set shards to optimize iteration. Postgres has the bitmapset datastructure which a) takes significantly less space, b) has iterator functions to only iterate over set bits, c) can efficiently skip long sequences of unset bits and d) stops quickly once the last set bit has been reached. I have been contemplating if it is worth to keep the minShardOffset because of readability and the efficient skipping of unset bits, however, I have decided to keep it -although less readable-, as there are known usecases where 100k+ shards are pruned to single digit shards. If these would end up at the end of `shardcount` a hotloop of zero checks on the first iteration _could_ cause a theoretical performance regression. All in all, this code is using less memory in all cases where it matters, and less cpu in most cases, while using more idiomatic datastructures for the task at hand.	2023-06-16 16:06:22 +02:00
Marco Slot	3adc1575d9	Fix DROP CONSTRAINT in command string with other commands (#7012 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-06-16 15:54:37 +02:00
Onur Tirtir	12a093b456	Allow using generated identity column based on int/smallint when creating a distributed table (#7008 ) Allow using generated identity column based on int/smallint when creating a distributed table so that applications that rely on those data types don't break. Inserting into / modifying such columns from workers is not allowed but it's better than not allowing such columns altogether.	2023-06-16 14:34:23 +03:00
Halil Ozan Akgül	04f6868ed2	Add citus_schemas view (#6979 ) DESCRIPTION: Adds citus_schemas view The citus_schemas view will be created in public schema if it exists, if not the view will be created in pg_catalog. Need to: - [x] Add tests - [x] Fix tests	2023-06-16 14:21:58 +03:00
Naisila Puka	5bf163a27d	Remove PG13 from CI and Configure (#7002 ) DESCRIPTION: Drops PG13 Support This commit is the first phase of dropping PG13 support. It consists of the following: - Removes pg13 from CI tests Among other things, Citus upgrade tests should now use PG14. Earliest Citus version supporting PG14 is 10.2. We also pick 11.3 version for upgrade_pg_dist_cleanup tests. Therefore, we run the citus upgrade tests with versions 10.2 and 11.3. - Removes pg13 from configure script - Remove upgrade_columnar_metapage upgrade tests We populate first_row_number column of columnar.stripe table during citus 10.1-10.2 upgrade. Given that we start from citus 10.2.0, which is the oldest version supporting PG14, we don't have that upgrade path anymore. Hence we remove these tests. - Removes upgrade_pg_dist_object_test and upgrade_partition_constraints tests These upgrade tests require the citus old version to be less than 10.0. Given that we drop support for PG13, we run upgrade tests with PG14, which starts with 10.2. So we remove these upgrade tests. - Documents that upgrade_post_11 should upgrade from version less than 11 In this way we make sure we run citus_finalize_upgrade_to_citus11 script - Adds needed alternative output for upgrade_citus_finish_citus_upgrade Given that we use 11.3 as the citus old version as well, we add this alternative output because pg_catalog.citus_finish_citus_upgrade() makes sense if last_upgrade_major_version < 11. See below for reference: pg_catalog.citus_finish_citus_upgrade(): ... IF last_upgrade_major_version < 11 THEN PERFORM citus_finalize_upgrade_to_citus11(); performed_upgrade := true; END IF; IF NOT performed_upgrade THEN RAISE NOTICE 'already at the latest distributed schema version (%)', last_upgrade_version_string; RETURN; END IF; ... And that's it :) The second phase of dropping PG13 support will consist in removing all the PG13 specific compilation paths/tests in the Citus repo. Will be done soon.	2023-06-15 14:54:06 +03:00
Ahmet Gedemenli	002a88ae7f	Error for single shard table creation if replication factor > 1 (#7006 ) Error for single shard table creation if replication factor > 1	2023-06-15 13:13:45 +03:00
Emel Şimşek	4f793abc4a	Turn on GUC_REPORT flag for search_path to enable reporting back the parameter value upon change. (#6983 ) DESCRIPTION: Turns on the GUC_REPORT flag for search_path. This results in postgres to report the parameter status back in addition to Command Complete packet. In response to the following command, > SET search_path TO client1; postgres sends back the following packets (shown in pseudo form): C (Command Complete) SET + S (Parameter Status) search_path = client1	2023-06-14 17:35:52 +03:00
Naisila Puka	3cc7a4aa42	Fix pg14-pg15 upgrade_distributed_triggers test (#6981 ) This test is only relevant for pg14-15 upgrade. However, the check on `upgrade_distributed_triggers_after` didn't take into consideration the case when we are doing pg15-16 upgrade. Hence, I added one more condition to the test: existence of `upgrade_distributed_triggers` schema which can only be created in pg14.	2023-06-14 15:32:38 +03:00
Onur Tirtir	dbdf04e8ba	Rename pg_dist tenant_schema to pg_dist_schema (#7001 )	2023-06-14 12:12:15 +03:00
Naisila Puka	ba40eb363c	Fix some gucs' initial and boot values, and flag combinations (#6957 ) PG16beta1 added some sanity checks for GUCS, find the Relevant PG commits below: 1- Add check on initial and boot values when loading GUCs `a73952b795` 2- Extend check_GUC_init() with checks on flag combinations when loading GUCs `009f8d1714` I fixed our currently problematic GUCS, we can merge this directly into main as these make sense for any PG version. There was a particular NodeConninfo issue: Previously we would rely on the fact that NodeConninfo initial value is an empty string. However, with PG16 enforcing same initial and boot values, we can't use an empty initial value for NodeConninfo anymore. Therefore we add a new flag to indicate whether we are at boot check.	2023-06-14 11:55:52 +03:00
Ahmet Gedemenli	7b0bc62173	Support CREATE TABLE .. AS SELECT .. commands for tenant tables (#6998 ) Support CREATE TABLE .. AS SELECT .. commands for tenant tables	2023-06-13 17:54:09 +03:00

1 2 3 4 5 ...

4556 Commits (e34dd6b134f1549fabd2562ec6a94dc6d84e8a56)