citus

Commit Graph

Author	SHA1	Message	Date
Mehmet YILMAZ	188c182be4	PG18 - Enable NUMA syscalls in CI containers to fix PG18 numa.out regression test failures (#8258 ) fixes #8246 PostgreSQL 18 introduced stricter NUMA page-inquiry permissions for the `pg_shmem_allocations_numa` view. Without the required kernel capabilities, the test fails with: ``` ERROR: failed NUMA pages inquiry status: Operation not permitted ``` This PR updates our test containers to include the necessary privileges: * Adds `--cap-add=SYS_NICE` and `--security-opt seccomp=unconfined` When PostgreSQL’s new NUMA views (`pg_shmem_allocations_numa`, `pg_buffercache_numa`) run, they call `move_pages()` to ask the kernel which NUMA node holds each shared memory page. https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=8cc139bec That syscall (`move_pages()`) requires `CAP_SYS_NICE` when inspecting another process. So: `--cap-add=SYS_NICE` grants the container permission to perform that NUMA page query. https://man7.org/linux/man-pages/man2/move_pages.2.html#:~:text=must%20be%20privileged%0A%20%20%20%20%20%20%20%20%20%20(-,CAP_SYS_NICE,-)%20or%20the%20real `--security-opt seccomp=unconfined` Docker containers still run under a seccomp filter which a kernel-level sandbox that blocks many system calls entirely for safety. The default Docker seccomp profile blocks `move_pages()` outright, because it can expose kernel memory layout information. https://docs.docker.com/engine/security/seccomp/#:~:text=You%20can%20pass-,unconfined,-to%20run%20a In combination Both flags are required for NUMA introspection inside a container: - `SYS_NICE` → permission - `seccomp=unconfined` → ability	2025-10-27 21:00:32 +03:00
Mehmet YILMAZ	dba9379ea5	PG18: Normalize EXPLAIN ANALYZE output for drop “Index Searches” line (#8291 ) fixes #8265 `0fbceae841` PostgreSQL 18 started printing an extra line in `EXPLAIN (ANALYZE …)` for index scans: ``` Index Searches: N ``` normalize.sed: add a rule to remove the PG18-only line ``` /^\sIndex Searches:\s\d+\s*$/d ```	2025-10-27 14:37:20 +03:00
Mehmet YILMAZ	785a87c659	PG18: adapt multi_subquery_misc expected output to SQL-function plan cache (#8289 ) `0dca5d68d7` fixes #8153 ```diff /citus/src/test/regress/expected/multi_subquery_misc.out -- should error out SELECT sql_subquery_test(1,1); -ERROR: could not create distributed plan -DETAIL: Possibly this is caused by the use of parameters in SQL functions, which is not supported in Citus. -HINT: Consider using PL/pgSQL functions instead. -CONTEXT: SQL function "sql_subquery_test" statement 1 + sql_subquery_test +------------------- + 307 +(1 row) + ``` PostgreSQL 18 changes planner behavior for inlining/parameter handling in SQL functions (pg18 commit `0dca5d68d`). As a result, a query in `multi_subquery_misc` that previously failed to create a distributed plan now succeeds. This PR updates the regression expected file to reflect the new outcome on PG18+. ### What changed * Updated `src/test/regress/expected/multi_subquery_misc.out`: * Replaced the previous error block: ``` ERROR: could not create distributed plan DETAIL: Possibly this is caused by the use of parameters in SQL functions, which is not supported in Citus. HINT: Consider using PL/pgSQL functions instead. CONTEXT: SQL function "sql_subquery_test" statement 1 ``` * with the actual successful result on PG18+: ``` sql_subquery_test -------------------------------------------- 307 (1 row) ``` * PG < 18: Behavior remains unchanged; the test still errors on older versions. * PG ≥ 18: The call succeeds; the updated expected output matches actual results. similar PR: https://github.com/citusdata/citus/pull/8184	2025-10-24 15:48:08 +03:00
Mehmet YILMAZ	95477e6d02	PG18 - Add BUFFERS OFF to remaining EXPLAIN calls (#8288 ) fixes #8093 `c2a4078eba` - Enable buffer-usage reporting by default in `EXPLAIN ANALYZE` on PostgreSQL 18 and above. - Introduce the explicit `BUFFERS OFF` option in every existing regression test to maintain pre-PG18 output consistency. - This appends, `BUFFERS OFF` to all `EXPLAIN(...)` calls in src/test/regress/sql and the corresponding .out files.	2025-10-24 15:09:49 +03:00
ibrahim halatci	5fc4cea1ce	pin PostgreSQL server development package version to 17 (#8286 ) DESCRIPTION: pin PostgreSQL server development package version to 17 rather than full dev package which now pulls in 18 and Citus does not yet support pg18	2025-10-21 11:32:32 +03:00
Colm	bf959de39e	PG18: Fix diffs in EXPLAINs introduced by PR #8242 in pg18 goldfile (#8262 )	2025-10-19 21:20:16 +01:00
Onur Tirtir	90f2ab6648	Actually deprecate mark_tables_colocated()	2025-10-17 11:57:36 +00:00
Colm	3ca66e1fcc	PG18: Fix "Unrecognized range table id" in INSERT .. SELECT planning (#8256 ) The error `Unrecognized range table id` seen in regress test `insert_select_into_local_tables` is a consequence of the INSERT .. SELECT planner getting confused by a SELECT query with a GROUP BY and hence a Group RTE, introduced in PG18 (commit 247dea89f). The solution is to flatten the relevant parts of the SELECT query before preparing the INSERT .. SELECT query tree for use by Citus.	2025-10-17 11:21:25 +01:00
Colm	5d71fca3b4	PG18 regress sanity: disable `enable_self_join_elimination` on queries (#8242 ) .. involving Citus tables. Interim fix for #8217 to achieve regress sanity with PG18. A complete fix will follow with PG18 feature integration.	2025-10-17 10:25:33 +01:00
naisila	76f18624e5	PG18 - use systable_inplace_update_* in UpdateStripeMetadataRow PG18 has removed heap_inplace_update(), which is crucial for citus_columnar extension because we always want to update stripe entries for columnar in-place. Relevant PG18 commit: https://github.com/postgres/postgres/commit/a07e03f heap_inplace_update() has been replaced by heap_inplace_update_and_unlock, which is used inside systable_inplace_update_finish, which is used together with systable_inplace_update_begin. This change has been back-patched up to v12, which is enough for us since the oldest version Citus supports is v15. In PG<18, a deprecated heap_inplace_update() is retained, however, let's start using the new functions because they are better, and such that we don't need to wrap these changes in PG18 version conditionals. Basically, in this commit we replace the following: SysScanDesc scanDescriptor = systable_beginscan(columnarStripes, indexId, indexOk, &dirtySnapshot, 2, scanKey); heap_inplace_update(columnarStripes, modifiedTuple); with the following: systable_inplace_update_begin(columnarStripes, indexId, indexOk, NULL, 2, scanKey, &tuple, &state); systable_inplace_update_finish(state, tuple); For more understanding, it's best to refer to an example: REL_18_0/src/backend/catalog/toasting.c#L349-L371 of how systable_inplace_update_begin and systable_inplace_update_finish are used in PG18, because they mirror the need of citus columnar. Fixes #8207	2025-10-17 11:11:08 +03:00
naisila	abd50a0bb8	Revert "PG18 - Adapt columnar stripe metadata updates (#8030 )" This reverts commit `5d805eb10b`. heap_inplace_update was incorrectly replaced by CatalogTupleUpdate in `5d805eb`. In Citus, we assume a stripe entry with some columns set to null means that a write is in-progress, because otherwise we wouldn't see a such row. But this breaks when we use CatalogTupleUpdate because it inserts a new version of the row, which leaves the in-progress version behind. Among other things, this was causing various issues in PG18 - check-columnar test.	2025-10-17 11:11:08 +03:00
eaydingol	aa0ac0af60	Citus upgrade tests (#8237 ) Expand the citus upgrade tests matrix: PG15: v11.1.0 v11.3.0 v12.1.10 PG16: v12.1.10 See https://github.com/citusdata/the-process/pull/174	2025-10-15 15:28:44 +03:00
Naisila Puka	432b69eb9d	PG18 - fix naming diffs of child FK constraints (#8247 ) PG18 changed the names generated for child foreign key constraints. https://github.com/postgres/postgres/commit/3db61db48 The test failures in Citus regression suite are all changing the name of a constraint from `'sensors%'` to `'%to_parent%_1'`: the naming is very nice here because `to_parent` means that we have a foreign key to a parent table. To fix the diff, we exclude those constraints from the output. To verify correctness, we still count the problematic constraints to make sure they are there - we are simply removing them from the first output (we add this count query right after the previous one) Fixes #8126 Co-authored-by: Mehmet YILMAZ <mehmety87@gmail.com>	2025-10-13 13:33:38 +03:00
Naisila Puka	f1dd976a14	Fix vanilla tests with domain creation (#8238 ) Qualify create domain stmt after local execution, to avoid such diffs in PG vanilla tests: ```diff create domain d_fail as anyelement; -ERROR: "anyelement" is not a valid base type for a domain +ERROR: "pg_catalog.anyelement" is not a valid base type for a domain ``` These tests were newly added in PG18, however this is not new PG18 behavior, just some added tests. https://github.com/postgres/postgres/commit/0172b4c94 Fixes #8042	2025-10-10 15:34:32 +03:00
Naisila Puka	351cb2044d	PG18 - define some EXPLAIN funcs and structs only in PG17 (#8239 ) PG18 changed the visibility of various Explain Serialize functions and structs to `extern`. Previously, for PG17 support, these were `static`, so we had to copy paste their definitions from `explain.c` to Citus's `multi_explain.c`. Relevant PG18 commits: https://github.com/postgres/postgres/commit/555960a0 https://github.com/postgres/postgres/commit/77cb08be Now we don't need to define the following anymore in Citus, since they are extern in PG18: - typedef struct SerializeMetrics - void ExplainIndentText(ExplainState es); - SerializeMetrics GetSerializationMetrics(DestReceiver dest); - typedef struct SerializeDestReceiver (this is not extern, however it is only used by GetSerializationMetrics function) This was incorrectly handled in https://github.com/citusdata/citus/commit/9e42f3f2c by wrapping these definitions and usages in PG17 only, causing such diffs in PG18 (not able to see serialization at all): ```diff citus/src/test/regress/expected/pg17.out select public.explain_filter('explain (analyze, serialize binary,buffers,timing) select * from int8_tbl i8'); ... Planning Time: N.N ms - Serialization: time=N.N ms output=NkB format=binary Execution Time: N.N ms Planning Time: N.N ms Serialization: time=N.N ms output=NkB format=binary Execution Time: N.N ms -(14 rows) +(13 rows) ```	2025-10-10 15:05:47 +03:00
Naisila Puka	287abea661	PG18 compatibility - varreturningtype additions (#8231 ) This PR solves the following diffs, originating from the addition of `varreturningtype` field to the `Var` struct in PG18: https://github.com/postgres/postgres/commit/80feb727c Previously we didn't account for this new field (as it's new), so this wouldn't allow the parser to correctly reconstruct the `Var` node structure, but rather it would error out with `did not find '}' at end of input node`: ```diff SELECT column_to_column_name(logicalrelid, partkey) FROM pg_dist_partition WHERE partkey IS NOT NULL ORDER BY 1 LIMIT 1; - column_to_column_name ---------------------------------------------------------------------- - a -(1 row) - +ERROR: did not find '}' at end of input node ``` Solution follows precedent https://github.com/citusdata/citus/pull/7107, when varnullingrels field was added to the `Var` struct in PG16. The solution includes: - Taking care of the `partkey` in `pg_dist_partition` table because it's coming from the `Var` struct. This mainly includes fixing the upgrade script to PG18, by saving all the `partkey` infos before upgrading to PG18 (in `citus_prepare_pg_upgrade`), and then re-generating `partkey` columns in `pg_dist_partition` (using `UPDATE`) after upgrading to PG18 (in `citus_finish_pg_upgrade`). - Adding a normalize rule to fix output differences among PG versions. Note that we need two normalize lines: one for PG15 since it doesn't have `varnullingrels`, and one for PG16/PG17. - Small trick on `metadata_sync_helpers` to use different text when generating the `partkey`, based on the PG version. Fixes #8189	2025-10-09 17:35:03 +03:00
Naisila Puka	f0014cf0df	PG18 compatibility: misc output diffs pt2 (#8234 ) 3 minor changes to reduce some noise from the regression diffs. 1 - Reduce verbosity when ALTER EXTENSION fails PG18 has improved reporting of errors in extension script files Relevant PG commit: https://github.com/postgres/postgres/commit/774171c4f There was more context in PG18, so reducing verbosity ``` ALTER EXTENSION citus UPDATE TO '11.0-1'; ERROR: cstore_fdw tables are deprecated as of Citus 11.0 HINT: Install Citus 10.2 and convert your cstore_fdw tables to the columnar access method before upgrading further CONTEXT: PL/pgSQL function inline_code_block line 4 at RAISE +SQL statement "DO LANGUAGE plpgsql +$$ +BEGIN + IF EXISTS (SELECT 1 FROM pg_dist_shard where shardstorage = 'c') THEN + RAISE EXCEPTION 'cstore_fdw tables are deprecated as of Citus 11.0' + USING HINT = 'Install Citus 10.2 and convert your cstore_fdw tables to the columnar access method before upgrading further'; + END IF; +END; +$$" +extension script file "citus--10.2-5--11.0-1.sql", near line 532 ``` 2 - Fix backend type order in tests for PG18 PG18 added another backend type which messed the order in this test Adding a separate IF condition for PG18 Relevant PG commit: https://github.com/postgres/postgres/commit/18d67a8d7d 3 - Ignore "DEBUG: find_in_path" lines in output Relevant PG commit: https://github.com/postgres/postgres/commit/4f7f7b0375 The new GUC extension_control_path specifies a path to look for extension control files.	2025-10-09 16:50:41 +03:00
Naisila Puka	d9652bf5f9	PG18 compatibility: misc output diffs (#8233 ) 6 minor changes to reduce some noise from the regression diffs. 1 - Add ORDER BY to fix subquery_in_where diff 2 - Disable buffers in explain analyze calls Leftover work from https://github.com/citusdata/citus/commit/f1f0b09f7 3 - Reduce verbosity to avoid diffs between PG versions Relevant PG commit: https://github.com/postgres/postgres/commit/0dca5d68d7 diff was: ``` CALL test_procedure_commit(2,5); ERROR: COMMIT is not allowed in an SQL function -CONTEXT: SQL function "test_procedure_commit" during startup +CONTEXT: SQL function "test_procedure_commit" statement 2 ``` 4 - Rename array_sort to array_sort_citus since PG18 added array_sort Relevant PG commit: https://github.com/postgres/postgres/commit/6c12ae09f5a Diff we were seeing in multi_array_agg, because the PG18 test was using PG18's array_sort function instead: ``` -- Check that we return NULL in case there are no input rows to array_agg() SELECT array_sort(array_agg(l_orderkey)) FROM lineitem WHERE l_orderkey < 0; array_sort ------------ - {} + (1 row) ``` 5 - Exclude not-null constraints from output to avoid diffs PG18 has added pg_constraint rows for not-null constraints Relevant PG commit https://github.com/postgres/postgres/commit/14e87ffa5c Remove them by condition contype <> 'n' 6 - Reduce verbosity to avoid md5 pwd deprecation warning in PG18 PG18 has deprecated MD5 passwords Relevant PG commit: https://github.com/postgres/postgres/commit/db6a4a985 Fixes #8154 Fixes #8157	2025-10-08 13:23:55 +03:00
Naisila Puka	77d5807fd6	Update changelog with 12.1.9, 12.1.10, 13.0.5, 13.1.1 entries (#8224 )	2025-10-07 22:33:12 +03:00
Naisila Puka	2a6414c727	PG18: use data-checksums by default in upgrades (#8230 ) Checksums are now on by default in PG18: 04bec894a Upgrade to PG18 fails with the following error: `old cluster does not use data checksums but the new one does` To overcome this error, we add --data-checksums option such that clusters with PG less than 18 also use data checksums. Fixes #8229	2025-10-07 12:17:08 +03:00
Naisila Puka	c5dde4b115	Fix crash on create statistics with non-RangeVar type pt2 (#8227 ) Fixes #8225 very similar to #8213 Also the error message changed between pg18rc1 and pg18.0	2025-10-07 11:56:20 +03:00
Naisila Puka	5a3648b2cb	PG18: fix yet another unregistered snapshot crash (#8228 ) PG18 added an assertion that a snapshot is active or registered before it's used. Relevant PG commit `8076c00592` Fixes #8209	2025-10-07 11:31:41 +03:00
Mehmet YILMAZ	d4dfdd765b	PG18 - Normalize \d+ output in PG18 by filtering “Not-null constraints” blocks (#8183 ) DESCRIPTION: Normalize \d+ output in PG18 by filtering “Not-null constraints” blocks fixes #8095 PR Description Postgres 18 started representing column `NOT NULL` as named constraints in `pg_constraint`, and `psql \d+` now prints them under a `Not-null constraints:` section. This caused extra diffs in our regression tests. `14e87ffa5c` This PR updates the normalization rules to strip those sections during diff filtering by adding two regex rules: * remove the `Not-null constraints:` header * remove any indented constraint lines ending in `_not_null`	2025-10-02 13:48:27 +03:00
Mehmet YILMAZ	cec1848b13	PG18: adapt multi_sql_function expected output to SQL-function plan cache (#8184 ) `0dca5d68d7` fixes #8153 ```diff diff -dU10 -w /__w/citus/citus/src/test/regress/expected/multi_sql_function.out /__w/citus/citus/src/test/regress/results/multi_sql_function.out --- /__w/citus/citus/src/test/regress/expected/multi_sql_function.out.modified 2025-08-25 12:43:24.373634581 +0000 +++ /__w/citus/citus/src/test/regress/results/multi_sql_function.out.modified 2025-08-25 12:43:24.383634533 +0000 @@ -317,24 +317,25 @@ $$ LANGUAGE SQL STABLE; INSERT INTO test_parameterized_sql VALUES(1, 1); -- all of them should fail SELECT * FROM test_parameterized_sql_function(1); ERROR: cannot perform distributed planning on this query because parameterized queries for SQL functions referencing distributed tables are not supported HINT: Consider using PL/pgSQL functions instead. SELECT (SELECT 1 FROM test_parameterized_sql limit 1) FROM test_parameterized_sql_function(1); ERROR: cannot perform distributed planning on this query because parameterized queries for SQL functions referencing distributed tables are not supported HINT: Consider using PL/pgSQL functions instead. SELECT test_parameterized_sql_function_in_subquery_where(1); -ERROR: could not create distributed plan -DETAIL: Possibly this is caused by the use of parameters in SQL functions, which is not supported in Citus. -HINT: Consider using PL/pgSQL functions instead. -CONTEXT: SQL function "test_parameterized_sql_function_in_subquery_where" statement 1 + test_parameterized_sql_function_in_subquery_where +--------------------------------------------------- + 1 +(1 row) + ``` allows custom vs. generic plans for SQL functions; arguments can be folded to consts, enabling more rewrites/optimizations (and in your case, routable Citus plans) seems that P18 rewrote how LANGUAGE SQL functions are planned/executed: they now go through the plan cache (like PL/pgSQL does) and can produce custom plans with the function arguments substituted as constants. That means your call SELECT test_parameterized_sql_function_in_subquery_where(1); is planned with org_id_val = 1 baked in, so Citus no longer sees an unresolved Param inside the function body and is able to build a distributed plan instead of tripping the old “params in SQL functions” error path. What’s in here - Update `expected/multi_sql_function.out` to reflect PG18 behavior - Add `expected/multi_sql_function_0.out` as an alternate expected file that retains the pre-PG18 error output for the same test	2025-10-01 16:03:21 +03:00
Naisila Puka	bb840e58a7	Fix crash on create statistics with non-RangeVar type (#8213 ) This crash has been there for a while but wasn't tested before pg18. PG18 added this test: CREATE STATISTICS tst ON a FROM (VALUES (x)) AS foo; which tries to create statistics on a derived-on-the-fly table (which is not allowed) However Citus assumes we always have a valid table when intercepting CREATE STATISTICS command to check for Citus tables Added a check to return early if needed. pg18 commit: https://github.com/postgres/postgres/commit/3eea4dc2c Fixes #8212	2025-10-01 00:09:11 +03:00
Onur Tirtir	5eb1d93be1	Properly detect no-op shard-key updates via UPDATE / MERGE (#8214 ) DESCRIPTION: Fixes a bug that causes allowing UPDATE / MERGE queries that may change the distribution column value. Fixes: #8087. Probably as of #769, we were not properly checking if UPDATE may change the distribution column. In #769, we had these checks: ```c if (targetEntry->resno != column->varattno) { /* target entry of the form SET some_other_col = <x> / isColumnValueChanged = false; } else if (IsA(setExpr, Var)) { Var newValue = (Var ) setExpr; if (newValue->varattno == column->varattno) { / target entry of the form SET col = table.col / isColumnValueChanged = false; } } ``` However, what we check in "if" and in the "else if" are not so different in the sense they both attempt to verify if SET expr of the target entry points to the attno of given column. So, in #5220, we even removed the first check because it was redundant. Also see this PR comment from #5220: https://github.com/citusdata/citus/pull/5220#discussion_r699230597. In #769, probably we actually wanted to first check whether both SET expr of the target entry and given variable are pointing to the same range var entry, but this wasn't what the "if" was checking, so removed. As a result, in the cases that are mentioned in the linked issue, we were incorrectly concluding that the SET expr of the target entry won't change given column just because it's pointing to the same attno as given variable, regardless of what range var entries the column and the SET expr are pointing to. Then we also started using the same function to check for such cases for update action of MERGE, so we have the same bug there as well. So with this PR, we properly check for such cases by comparing varno as well in TargetEntryChangesValue(). However, then some of the existing tests started failing where the SET expr doesn't directly assign the column to itself but the "where" clause could actually imply that the distribution column won't change. Even before we were not attempting to verify if "where" cluse quals could imply a no-op assignment for the SET expr in such cases but that was not a problem. This is because, for the most cases, we were always qualifying such SET expressions as a no-op update as long as the SET expr's attno is the same as given column's. For this reason, to prevent regressions, this PR also adds some extra logic as well to understand if the "where" clause quals could imply that SET expr for the distribution key is a no-op. Ideally, we should instead use "relation restriction equivalence" mechanism to understand if the "where" clause implies a no-op update. This is because, for instance, right now we're not able to deduce that the update is a no-op when the "where" clause transitively implies a no-op update, as in the case where we're setting "column a" to "column c" and where clause looks like: "column a = column b AND column b = column c". If this means a regression for some users, we can consider doing it that way. Until then, as a workaround, we can suggest adding additional quals to "where" clause that would directly imply equivalence. Also, after fixing TargetEntryChangesValue(), we started successfully deducing that the update action is a no-op for such MERGE queries: ```sql MERGE INTO dist_1 USING dist_1 src ON (dist_1.a = src.b) WHEN MATCHED THEN UPDATE SET a = src.b; ``` However, we then started seeing below error for above query even though now the update is qualified as a no-op update: ``` ERROR: Unexpected column index of the source list ``` This was because of #8180 and #8201 fixed that. In summary, with this PR: We disallow such queries, ```sql -- attno for dist_1.a, dist_1.b: 1, 2 -- attno for dist_different_order_1.a, dist_different_order_1.b: 2, 1 UPDATE dist_1 SET a = dist_different_order_1.b FROM dist_different_order_1 WHERE dist_1.a dist_different_order_1.a; -- attno for dist_1.a, dist_1.b: 1, 2 -- but ON (..) doesn't imply a no-op update for SET expr MERGE INTO dist_1 USING dist_1 src ON (dist_1.a = src.b) WHEN MATCHED THEN UPDATE SET a = src.a; ``` * .. and allow such queries, ```sql MERGE INTO dist_1 USING dist_1 src ON (dist_1.a = src.b) WHEN MATCHED THEN UPDATE SET a = src.b; ```	2025-09-30 10:13:47 +00:00
Naisila Puka	de045402f3	PG18 - register snapshot where needed (#8196 ) Register and push snapshots as needed per the relevant PG18 commits `8076c00592` https://github.com/postgres/postgres/commit/706054b `citus_split_shard_columnar_partitioned`, `multi_partitioning` tests are handled. Fixes #8195	2025-09-26 18:04:34 +03:00
Colm McHugh	81776fe190	Fix crash in Range Table identity check. The range table entry array created by the Postgres planner for each SELECT in a query may have NULL entries as of PG18. Add a NULL check to skip over these when looking for matches in rte identities.	2025-09-26 14:53:30 +01:00
Colm	80945212ae	PG18 regress sanity: update pg18 ruleutils with fix #7675 (#8216 ) Fix deparsing of UPDATE statements with indirection (#7675) involved changing ruleutils of our supported Postgres versions. It means that when integrating a new Postgres version we need to update its ruleutils with the relevant parts of #7675; basically PG ruleutils needs to call the `citus_ruleutils.c` functions added by #7675.	2025-09-26 13:19:47 +01:00
Onur Tirtir	83b25e1fb1	Fix unexpected column index error for repartitioned merge (#8201 ) DESCRIPTION: Fixes a bug that causes an unexpected error when executing repartitioned merge. Fixes #8180. This was happening because of a bug in SourceResultPartitionColumnIndex(). And to fix it, this PR avoids using DistributionColumnIndex() in SourceResultPartitionColumnIndex(). Instead, invents FindTargetListEntryWithVarExprAttno(), which finds the index of the target entry in the source query's target list that can be used to repartition the source for a repartitioned merge. In short, to find the source target entry that refences the Var used in ON (..) clause and that references the source rte, we should check the varattno of the underlying expr, which presumably is always a Var for repartitioned merge as we always wrap the source rte with a subquery, where all target entries point to the columns of the original source relation. Using DistributionColumnIndex() prior to 13.0 wasn't causing such an issue because prior to 13.0, the varattno of the underlying expr of the source target entries was almost (1) always equal to resno of the target entry as we were including all target entries of the source relation. However, starting with #7659, which is merged to main before 13.0, we started using CreateFilteredTargetListForRelation() instead of CreateAllTargetListForRelation() to compute the target entry list for the source rte to fix another bug. So we cannot revert to using CreateAllTargetListForRelation() because otherwise we would re-introduce bug that it helped fixing, so we instead had to find a way to properly deal with the "filtered target list"s, as in this commit. Plus (1), even before #7659, probably we would still fail when the source relation has dropped attributes or such because that would probably also cause such a mismatch between the varattno of the underlying expr of the target entry and its resno.	2025-09-23 11:17:51 +00:00
Colm	b5e70f56ab	Postgres 18: Fix regress tests caused by GROUP RTE. (#8206 ) The change in `merge_planner.c` fixes _unrecognized range table entry_ diffs in merge regress tests (category 2 diffs in #7992), the change in `multi_router_planner.c` fixes _column reference ... is ambiguous_ diffs in `multi_insert_select` and `multi_insert_select_window` (category 3 diffs in #7992). Edit to `common.py` enables standalone regress tests with pg18 (e..g `citus_tests/run_test.py merge`).	2025-09-22 16:13:59 +03:00
Colm	d2ea4043d4	Postgres 18: fix 'column does not exist' errors in grouping regress tests. (#8199 ) DESCRIPTION: Fix 'column does not exist' errors in grouping regress tests. Postgres 18's GROUP RTE was being ignored by query pushdown planning when constructing the query tree for the worker subquery. The solution is straightforward - ensure the worker subquery tree has the same groupRTE property as the original query. Postgres ruleutils then does the right thing when generating the pushed down query. Fixes category 1 in #7992.	2025-09-22 16:13:59 +03:00
Mehmet YILMAZ	10d62d50ea	Stabilize table_checks across PG15–PG18: switch to pg_constraint, remove dupes, exclude NOT NULL (#8140 ) DESCRIPTION: Stabilize table_checks across PG15–PG18: switch to pg_constraint, remove dupes, exclude NOT NUL fixes #8138 fixes #8131 Problem ```diff diff -dU10 -w /__w/citus/citus/src/test/regress/expected/multi_create_table_constraints.out /__w/citus/citus/src/test/regress/results/multi_create_table_constraints.out --- /__w/citus/citus/src/test/regress/expected/multi_create_table_constraints.out.modified 2025-08-18 12:26:51.991598284 +0000 +++ /__w/citus/citus/src/test/regress/results/multi_create_table_constraints.out.modified 2025-08-18 12:26:52.004598519 +0000 @@ -403,22 +403,30 @@ relid = 'check_example_partition_col_key_365068'::regclass; Column \| Type \| Definition ---------------+---------+--------------- partition_col \| integer \| partition_col (1 row) SELECT "Constraint", "Definition" FROM table_checks WHERE relid='public.check_example_365068'::regclass; Constraint \| Definition -------------------------------------+----------------------------------- check_example_other_col_check \| CHECK other_col >= 100 + check_example_other_col_check \| CHECK other_col >= 100 + check_example_other_col_check \| CHECK other_col >= 100 + check_example_other_col_check \| CHECK other_col >= 100 + check_example_other_col_check \| CHECK other_col >= 100 check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 -(2 rows) + check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 + check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 + check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 + check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 +(10 rows) ``` On PostgreSQL 18, `NOT NULL` is represented as a cataloged constraint and surfaces through `information_schema.check_constraints`. `14e87ffa5c` Our helper view `table_checks` (built on `information_schema.check_constraints` + `constraint_column_usage`) started returning: * Extra `…_not_null` rows (noise for our tests) * Duplicate rows for real CHECKs due to the one-to-many join via `constraint_column_usage` * Occasional literal formatting differences (e.g., dates) coming from the information\_schema deparser ### What changed 1. Rewrite `table_checks` to use system catalogs directly We now select only expression-based, table-level constraints—excluding NOT NULL—by filtering on `contype <> 'n'` and requiring `conbin IS NOT NULL`. This yields the same effective set as real CHECKs while remaining future-proof against non-CHECK constraint types. ```sql CREATE OR REPLACE VIEW table_checks AS SELECT c.conname AS "Constraint", 'CHECK ' \|\| -- drop a single pair of outer parens if the deparser adds them regexp_replace(pg_get_expr(c.conbin, c.conrelid, true), '^$(.)$$', '\1') AS "Definition", c.conrelid AS relid FROM pg_catalog.pg_constraint AS c WHERE c.contype <> 'n' -- drop NOT NULL (PG18) AND c.conbin IS NOT NULL -- only expression-bearing constraints (i.e., CHECKs) AND c.conrelid <> 0 -- table-level only (exclude domains) ORDER BY "Constraint", "Definition"; ``` Why this filter? `contype <> 'n'` excludes PG18’s NOT NULL rows. * `conbin IS NOT NULL` restricts to expression-backed constraints (CHECKs); PK/UNIQUE/FK/EXCLUSION don’t have `conbin`. * `conrelid <> 0` removes domain constraints. 2. Add a PG18-specific regression test for `contype = 'n'` New test (`pg18_not_null_constraints`) verifies: * Coordinator tables have `n` rows for NOT NULL (columns `a`, `c`), * A worker shard has matching `n` rows, * Dropping a NOT NULL on the coordinator propagates to shards (count goes from 2 → 1), * `table_checks` never reports NOT NULL, but does report a real CHECK added for the test. --- ### Why this works (PG15–PG18) * Stable source of truth: Directly reads `pg_constraint` instead of `information_schema`. * No duplicates: Eliminates the `constraint_column_usage` join, removing multiplicity. * No NOT NULL noise: PG18’s `contype = 'n'` is filtered out by design. * Deterministic text: Uses `pg_get_expr` and strips a single outer set of parentheses for consistent output. --- ### Impact on tests * Removes spurious `…_not_null` entries and duplicate `checky_…` rows (e.g., in `multi_name_lengths` and similar). * Existing expected files stabilize without adding brittle normalizations. * New PG18 test asserts correct catalog behavior and Citus propagation while remaining a no-op on earlier PG versions. ---	2025-09-22 15:50:32 +03:00
Naisila Puka	b4cb1a94e9	Bump citus and citus_columnar to 14.0devel (#8170 )	2025-09-19 12:54:55 +03:00
Naisila Puka	becc02b398	Cleanup from dropping pg14 in merge isolation tests (#8204 ) These alternative test outputs are redundant since we have dropped PG14 support on main.	2025-09-19 12:01:29 +03:00
eaydingol	360fbe3b99	Technical document update for outer join pushdown (#8200 ) Outer join pushdown entry and an example.	2025-09-17 17:01:45 +03:00
Mehmet YILMAZ	b58af1c8d5	PG18: stabilize constraint-name tests by filtering pg_constraint on contype (#8185 ) `14e87ffa5c` PostgreSQL 18 now records column `NOT NULL` constraints in `pg_constraint` (`contype = 'n'`). That means queries that previously listed “all constraints” for a relation now return extra rows, causing noisy diffs in Citus regression tests. This PR narrows each catalog probe to the specific constraint type under test (PK/UNIQUE/EXCLUDE/CHECK), keeping results stable across PG15–PG18. ## What changed * Update `src/test/regress/sql/multi_alter_table_add_constraints_without_name.sql` to: * Add `AND con.contype IN ('p'\|'u'\|'x'\|'c')` in each query, matching the constraint just created. * Join namespace via `rel.relnamespace` for robustness. * Refresh `src/test/regress/expected/multi_alter_table_add_constraints_without_name.out` to reflect the filtered results. ## Why * PG18 adds named `NOT NULL` entries to `pg_constraint`, which previously lived only in `pg_attribute`. Tests that select from `pg_constraint` without filtering now see extra rows (e.g., `*_not_null`), breaking expectations. Filtering by `contype` validates exactly what the test intends (PK/UNIQUE/EXCLUDE/CHECK naming/propagation) and ignores unrelated `NOT NULL` rows. ```diff diff -dU10 -w /__w/citus/citus/src/test/regress/expected/multi_alter_table_add_constraints_without_name.out /__w/citus/citus/src/test/regress/results/multi_alter_table_add_constraints_without_name.out --- /__w/citus/citus/src/test/regress/expected/multi_alter_table_add_constraints_without_name.out.modified 2025-09-11 14:36:52.521254512 +0000 +++ /__w/citus/citus/src/test/regress/results/multi_alter_table_add_constraints_without_name.out.modified 2025-09-11 14:36:52.549254440 +0000 @@ -20,34 +20,36 @@ ALTER TABLE AT_AddConstNoName.products ADD PRIMARY KEY(product_no); SELECT con.conname FROM pg_catalog.pg_constraint con INNER JOIN pg_catalog.pg_class rel ON rel.oid = con.conrelid INNER JOIN pg_catalog.pg_namespace nsp ON nsp.oid = connamespace WHERE rel.relname = 'products'; conname ------------------------------ products_pkey -(1 row) + products_product_no_not_null +(2 rows) -- Check that the primary key name created on the coordinator is sent to workers and -- the constraints created for the shard tables conform to the <conname>_shardid naming scheme. \c - - :public_worker_1_host :worker_1_port SELECT con.conname FROM pg_catalog.pg_constraint con INNER JOIN pg_catalog.pg_class rel ON rel.oid = con.conrelid INNER JOIN pg_catalog.pg_namespace nsp ON nsp.oid = connamespace WHERE rel.relname = 'products_5410000'; conname -------------------------------------- + products_5410000_product_no_not_null products_pkey_5410000 -(1 row) +(2 rows) ``` after pr: https://github.com/citusdata/citus/actions/runs/17697415668/job/50298622183#step:5:265	2025-09-17 14:12:15 +03:00
Mehmet YILMAZ	4012e5938a	PG18 - normalize PG18 “RESTRICT” FK error wording to legacy form (#8188 ) fixes #8186 `086c84b23d` PG18 emitting a more specific message for foreign-key violations when the action is `RESTRICT` (SQLSTATE 23001), e.g. `violates RESTRICT setting of foreign key constraint ...` and `Key (...) is referenced from table ...`. Older versions printed the generic FK text (SQLSTATE 23503), e.g. `violates foreign key constraint ...` and `Key (...) is still referenced from table ...`. This change was causing noisy diffs in our regression tests (e.g., `multi_foreign_key.out`). To keep a single set of expected files across PG15–PG18, this PR adds two normalization rules to the test filter: ```sed # PG18 FK wording -> legacy generic form s/violates RESTRICT setting of foreign key constraint/violates foreign key constraint/g # DETAIL line: "is referenced" -> "is still referenced" s/\<is referenced from table\>/is still referenced from table/g ``` Scope / impact * Test-only change; runtime behavior is unaffected. * Keeps outputs stable across PG15–PG18 without version-splitting expected files. * Rules are narrowly targeted to the FK wording introduced in PG18. with pr: https://github.com/citusdata/citus/actions/runs/17698469722/job/50300960878#step:5:252	2025-09-17 10:46:36 +03:00
Mehmet YILMAZ	8bb8b2ce2d	Remove Code Climate coverage upload steps from GitHub Actions workflow (#8182 ) DESCRIPTION: Remove Code Climate coverage upload steps from GitHub Actions workflow CI: remove Code Climate coverage reporting (cc-test-reporter) and related jobs; keep Codecov as source of truth * Why Code Climate’s test-reporter has been archived; their download/API path is no longer served, which breaks our CC upload step (`cc-test-reporter …` ends up downloading HTML/404). * What changed * Drop the Code Climate formatting/artifact steps from the composite action `.github/actions/upload_coverage/action.yml`. * Delete the `upload-coverage` job that aggregated and pushed to Code Climate (`cc-test-reporter sum-coverage` / `upload-coverage`). * Impact * Codecov uploads remain; coverage stays visible via Codecov. * No test/build behavior change—only removes a failing reporter path.	2025-09-15 13:53:35 +03:00
Colm	b7bfe42f1a	Document delayed fast path planning in README (#8176 ) Added detailed explanation of delayed fast path planning in Citus 13.2, including conditions and processes involved. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-09-09 11:54:13 +01:00
Onur Tirtir	0c658b73fc	Fix an assertion failure in Citus maintenance daemon that can happen in very slow systems (#8158 ) Fixes #5808. DESCRIPTION: Fixes an assertion failure in Citus maintenance daemon that can happen in very slow systems. Try running `make -C src/test/regress/ check-multi-1-vg` - while the tests will exit with code 2 at least %50 of the times in the very early stages of the test suite by producing a core-dump on main, it won't be the case on this branch, at least based on my trials :)	2025-09-04 12:13:57 +00:00
manaldush	2834fa26c9	Fix an undefined behavior for bit shift in citus_stat_tenants.c (#7954 ) DESCRIPTION: Fixes an undefined behavior that could happen when computing tenant score for citus_stat_tenants Add check for shift size, reset to zero in case of overflow Fixes #7953. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-09-04 10:57:45 +00:00
Onur Tirtir	8ece8acac7	Check citus version in citus_promote_clone_and_rebalance (#8169 )	2025-08-29 11:19:50 +03:00
Naisila Puka	0fd95d71e4	Order same frequency common values, and add test (#8167 ) Added similar test to what @colm-mchugh tested in the original PR https://github.com/citusdata/citus/pull/8026#discussion_r2279021218	2025-08-29 01:41:32 +03:00
Naisila Puka	d5f0ec5cd1	Fix invalid input syntax for type bigint (#8166 ) Fixes #8164	2025-08-29 01:01:18 +03:00
Naisila Puka	544b6c4716	Add GUC for queries with outer joins and pseudoconstant quals (#8163 ) Users can turn on this GUC at their own risk.	2025-08-27 22:31:22 +03:00
Onur Tirtir	2e1de77744	Also use pid in valgrind logfile name (#8150 ) Also use pid in valgrind logfile name to avoid overwriting the valgrind logs due to the memory errors that can happen in different processes concurrently: (from https://valgrind.org/docs/manual/manual-core.html) ``` --log-file=<filename> Specifies that Valgrind should send all of its messages to the specified file. If the file name is empty, it causes an abort. There are three special format specifiers that can be used in the file name. %p is replaced with the current process ID. This is very useful for program that invoke multiple processes. WARNING: If you use --trace-children=yes and your program invokes multiple processes OR your program forks without calling exec afterwards, and you don't use this specifier (or the %q specifier below), the Valgrind output from all those processes will go into one file, possibly jumbled up, and possibly incomplete. ``` With this change, we'll start having lots of valgrind output files generated under "src/test/regress" with the same prefix, citus_valgrind_test_log.txt, by default, during valgrind tests, so it'll look a bit ugly; but one can use `cat src/test/regress/citus_valgrind_test_log.txt.[0-9]*"` or such to combine them into a single valgrind log file later.	2025-08-27 14:01:25 +00:00
Colm	bb6eeb17cc	Fix bug in redundant WHERE clause detection. (#8162 ) Need to also check Postgres plan's rangetables for relations used in Initplans. DESCRIPTION: Fix a bug in redundant WHERE clause detection; we need to additionally check the Postgres plan's range tables for the presence of citus tables, to account for relations that are referenced from scalar subqueries. There is a fundamental flaw in `4139370`, the assumption that, after Postgres planning has completed, all tables used in a query can be obtained by walking the query tree. This is not the case for scalar subqueries, which will be referenced by `PARAM` nodes. The fix adds an additional check of the Postgres plan range tables; if there is at least one citus table in there we do not need to change the needs distributed planning flag. Fixes #8159	2025-08-27 13:32:02 +01:00
Colm	0a5cae19ed	In UPDATE deparse, check for a subscript before processing the targets. (#8155 ) DESCRIPTION: Checking first for the presence of subscript ops avoids a shallow copy of the target list for target lists where there are no array or json subscripts. Commit `0c1b31c` fixed a bug in UPDATE statements with array or json subscripting in the target list. This commit modifies that to first check that the target list has a subscript and avoid a shallow copy of the target list for UPDATE statements with no array/json subscripting.	2025-08-27 11:00:27 +00:00
Muhammad Usama	62e5fcfe09	Enhance clone node replication status messages (#8152 ) - Downgrade replication lag reporting from NOTICE to DEBUG to reduce noise and improve regression test stability. - Add hints to certain replication status messages for better clarity. - Update expected output files accordingly.	2025-08-26 21:48:07 +03:00

1 2 3 4 5 ...

7154 Commits (188c182be4796babe77b10ac50ed729d75e385cf) All Branches Search

7154 Commits (188c182be4796babe77b10ac50ed729d75e385cf)

All Branches