citus

Commit Graph

Author	SHA1	Message	Date
Onur Tirtir	8783cae57f	Avoid publishing artifacts with conflicting names .. as documented in actions/upload-artifact#480. (cherry picked from commit `0d4c676b07`)	2025-02-04 16:49:20 +03:00
Onur Tirtir	b6e3f39583	Fix flaky citus upgrade test (cherry picked from commit `4cad81d643`)	2025-02-04 16:49:12 +03:00
Onur Tirtir	a28f75cc77	Upgrade download-artifacts action to 4.1.8 (cherry picked from commit `5317cc7310`)	2025-02-04 16:49:06 +03:00
Onur Tirtir	af5fced935	Upgrade upload-artifacts action to 4.6.0 (cherry picked from commit `398a2ea197`)	2025-02-04 16:47:04 +03:00
Naisila Puka	7b6a828c74	Changelog entries for 13.0.0 (#7850 )	2025-01-22 12:22:31 +03:00
Naisila Puka	f7bead22d4	Remove accidentally added citus-tools empty submodule (#7842 ) Accidentally added here `4775715691`	2025-01-13 16:49:50 +03:00
Naisila Puka	5ef2cd67ed	Bump pg versions 14.15, 15.10, 16.6 (#7829 ) Bump PG versions to the latest minors 14.15, 15.10, 16.6 There is a libpq symlink issue when the images are built remotely https://github.com/citusdata/citus/actions/runs/12583502447/job/35071296238 Hence, we use the commit sha of a local build of the images, pushed. This is temporary, until we find the underlying cause of the symlink issue. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-01-13 16:24:51 +03:00
Seda Gündoğdu	70f84e4aee	Remove Debian Buster support from packaging pipelines (#7828 ) Remove Debian Buster support from packaging-test-pipelines Co-authored-by: Gürkan İndibay <gindibay@microsoft.com>	2025-01-02 12:22:22 +03:00
Naisila Puka	0a6adf4ccc	EXPLAIN generic_plan NOT supported in Citus (#7825 ) We thought we provided support for this in `b8c493f2c4` However the use of parameters in SQL is not supported in Citus. Since generic plan queries use parameters, we can't support for now. Relevant PG16 commit https://github.com/postgres/postgres/commit/3c05284 Fixes #7813 with proper error message	2025-01-02 01:00:40 +03:00
Teja Mupparti	ab7c13beb5	For scenarios, such as, Bug 3697586: Server crashes when assigning distributed transaction: Raise an ERROR instead of a crash	2024-12-26 10:45:59 -08:00
Onur Tirtir	73411915a4	Avoid re-assigning the global pid for client backends and bg workers when the application_name changes (#7791 ) DESCRIPTION: Fixes a crash that happens because of unsafe catalog access when re-assigning the global pid after application_name changes. When application_name changes, we don't actually need to try re-assigning the global pid for external client backends because application_name doesn't affect the global pid for such backends. Plus, trying to re-assign the global pid for external client backends would unnecessarily perform a catalog access when the cached local node id is invalidated. However, accessing the catalog tables is dangerous in certain situations, such as when we're not in a transaction block. And for the other types of backends, i.e., the Citus internal backends, we need to re-assign the global pid when the application_name changes because for such backends we simply extract the global pid inherited from the originating backend from the application_name (which is specified by the originating backend when opening that connection), and this doesn't require catalog access.	2024-12-23 14:01:53 +00:00
Naisila Puka	665d72a2f5	Bump postgres versions in CI and dev: 14.14, 15.9, 16.5 (#7779 ) Upgrade postgres versions to: - 14.14 - 15.9 - 16.5 Depends on https://github.com/citusdata/the-process/pull/163 We had some errors with the latest minors, so this is a 2-level bump for now.	2024-12-23 15:15:15 +03:00
Emel Şimşek	0355b12c7f	Add changelog entries for 12.1.6 (#7770 ) Add changelog entries for 12.1.6	2024-12-04 08:11:33 +00:00
Pavel Seleznev	fe6d198ab2	Remove warnings on some builds (#7680 ) Co-authored-by: Pavel Seleznev <PNSeleznev@sberbank.ru>	2024-12-03 17:10:36 +03:00
Colm	248ff5d52a	[Bug Fix] Query on distributed tables with window partition may cause segfault #7705 (#7718 ) This PR is a proposed fix for issue [7705](https://github.com/citusdata/citus/issues/7705). The following is the background and rationale for the fix (please refer to [7705](https://github.com/citusdata/citus/issues/7705) for context); The `varnullingrels` field was introduced to the Var node struct definition in Postgres 16. Its purpose is to associate a variable with the set of outer join relations that can cause the variable to be NULL. The `varnullingrels` for the variable `"gianluca_camp_test"."start_timestamp"` in the problem query is 3, because the variable "gianluca_camp_test"."start_timestamp" is coming from the inner (nullable) side of an outer join and 3 is the RT index (aka relid) of that outer join. The problem occurs when the Postgres planner attempts to plan the combine query. The format of a combine query is: ``` SELECT <targets> FROM pg_catalog.citus_extradata_container(); ``` There is only one relation in a combine query, so no outer joins are present, but the non-empty `varnullingrels` field causes the Postgres planner to access structures for a non-existent relation. The source of the problem is that, when creating the target list for the combine query, function MasterAggregateMutator() uses copyObject() to construct a Var node before setting the master table ID, and this copies over the non-empty varnullingrels field in the case of the `"gianluca_camp_test"."start_timestamp"` var. The proposed solution is to have MasterAggregateMutator() use makeVar() instead of copyObject(), and only set the fields that make sense for the combine query; var type, collation and type modifier. The `varnullingrels` field can be left empty because there is only one relation in the combine query. A new regress test issue_7705.sql is added to exercise the fix. The issue is not specific to window functions, any target expression that cannot be pushed down and contains at least one column from the inner side of a left outer join (so has a non-empty varnullingrels field) can cause the same issue. More about Citus combine queries [here](https://github.com/citusdata/citus/tree/main/src/backend/distributed#combine-query-planner). More about Postgres varnullingrels [here](https://github.com/postgres/postgres/blob/master/src/backend/optimizer/README).	2024-11-13 15:19:59 +00:00
Colm McHugh	c52f36019f	[Bug Fix] [SEGFAULT] Querying distributed tables with window partition may cause segfault #7705 In function MasterAggregateMutator(), when the original Node is a Var node, use makeVar() instead of copyObject() when constructing the Var node for the target list of the combine query. The varnullingrels field of the original Var node is ignored because it is not relevant for the combine query; copying it causes the problem in issue 7705, where a coordinator query had a Var with a reference to a non-existent join relation.	2024-11-06 19:26:29 +00:00
Erik Karsten	f6959715dc	fix: typo runnnig -> running (#7686 ) Very small PR, no changes to behaviour. Just a typo fix :-) Under `src/backend/distributed/sql/udfs/citus_finalize_upgrade_to_citus11/` the sql has a typo "runnnig", which will be displayed to the user if the `citus_check_cluster_node_health()` fails when calling `citus_finish_citus_upgrade();` Co-authored-by: eaydingol <60466783+eaydingol@users.noreply.github.com>	2024-09-17 09:28:46 +03:00
Parag Jain	5bad6c6a1d	[Bug Fix] : writing incorrect data to target Merge repartition Command (#7659 ) We were writing incorrect data to target collection in some cases of merge command. In case of repartition when source query is RELATION. We were referring to incorrect attribute number that was resulting into this incorrect behavior. Example : ![image](https://github.com/user-attachments/assets/a101cb36-7976-459c-befb-96a55a5b3dc1) ![image](https://github.com/user-attachments/assets/e5c83b7b-5b8e-4d79-a927-95684dc9ba49) I have added fixed tests as part of this PR , Thanks.	2024-09-12 21:16:39 -07:00
Mehmet YILMAZ	4775715691	Fix race condition in citus_set_coordinator_host when adding multiple coordinator nodes concurrently (#7682 ) When multiple sessions concurrently attempt to add the same coordinator node using `citus_set_coordinator_host`, there is a potential race condition. Both sessions may pass the initial metadata check (`isCoordinatorInMetadata`), but only one will succeed in adding the node. The other session will fail with an assertion error (`Assert(!nodeAlreadyExists)`), causing the server to crash. Even though the `AddNodeMetadata` function takes an exclusive lock, it appears that the lock is not preventing the race condition before the initial metadata check. - Issue: The current logic allows concurrent sessions to pass the check for existing coordinators, leading to an attempt to insert duplicate nodes, which triggers the assertion failure. - Impact: This race condition leads to crashes during operations that involve concurrent coordinator additions, as seen in https://github.com/citusdata/citus/issues/7646. Test Plan: - Isolation Test Limitation: An isolation test was added to simulate concurrent additions of the same coordinator node, but due to the behavior of PostgreSQL locking mechanisms, the test does not trigger the edge case. The lock applied within the function serializes the operations, preventing the race condition from occurring in the isolation test environment. While the edge case is difficult to reproduce in an isolation test, the fix addresses the core issue by ensuring concurrency control through proper locking. - Existing Tests: All existing tests related to node metadata and coordinator management have been run to ensure that no regressions were introduced. After the Fix: - Concurrent attempts to add the same coordinator node will be serialized. One session will succeed in adding the node, while the others will skip the operation without crashing the server. Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2024-09-09 17:09:56 +03:00
Mehmet YILMAZ	68d28ecdc0	Add Debugging Instructions to Devcontainer Setup in CONTRIBUTING.md (#7673 ) Description: This PR adds a section to CONTRIBUTING.md that explains how to set up debugging in the devcontainer using VS Code. Changes: - New Debugging Section: Clear instructions on starting the debugger, selecting the appropriate PostgreSQL process, and setting breakpoints for easier troubleshooting. Purpose: - Improved Contributor Workflow: Enables contributors to debug the Citus extension within the devcontainer, enhancing productivity and making it easier to resolve issues. --------- Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2024-08-23 12:16:18 +03:00
eaydingol	9e1852eac7	Check if the limit is null (#7665 ) DESCRIPTION: Add a check to see if the given limit is null. Fixes a bug by checking if the limit given in the query is null when the actual limit is computed with respect to the given offset. Prior to this change, null is interpreted as 0 during the limit calculation when both limit and offset are given. Fixes #7663	2024-07-31 14:53:38 +03:00
Hanefi Onaldi	2a263fe69a	Add changelog entries for 12.1.5 (#7648 )	2024-07-17 12:21:51 +00:00
Parag Jain	3c467e6e02	Support MERGE command for single_shard_distributed Target (#7643 ) This PR has following changes : 1. Enable MERGE command for single_shard_distributed targets.	2024-07-16 08:08:44 -07:00
Nils Dijk	accb7d09f7	bump postgres versions in CI and dev (#7655 ) Upgrade postgres versions to: - 14.12 - 15.7 - 16.3 Depends on https://github.com/citusdata/the-process/pull/158	2024-07-12 15:26:23 +00:00
Gürkan İndibay	8ac9f0fcee	Adds changelog for 12.1.4 (#7632 )	2024-07-12 09:43:33 +00:00
Gürkan İndibay	c603c3ed74	Removes el/7 and ol/7 as runners (#7650 ) Removes el/7 and ol/7 as runners and update checkout action to v4 We use EL/7 and OL/7 runners to test packaging for these distributions. However, for the past two weeks, we've encountered errors during the checkout step in the pipelines. The error message is as follows: ``` /__e/node20/bin/node: /lib64/libm.so.6: version `GLIBC_2.27' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.20' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `CXXABI_1.3.9' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.21' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libc.so.6: version `GLIBC_2.28' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libc.so.6: version `GLIBC_2.25' not found (required by /__e/node20/bin/node) ``` The glibc version within the EL/7 and OL/7 Docker images is 2.17, and we cannot upgrade it. Therefore, we need to remove these images from the packaging test pipelines. Consequently, we will no longer verify if the code builds for EL/7 and OL/7. However, we are not using these packaging images as runners within the packaging infrastructure, so we can continue to use these images for packaging. Additional Info: I learned that Marlin team fully dropped the el/7 support so we will drop in further releases as well	2024-07-12 12:25:12 +03:00
Nils Dijk	e776a7ebbb	CI: move to github container registry (#7652 ) We move the CI images to the github container registry. Given we mostly (if not solely) run these containers on github actions infra it makes sense to have them hosted closer to where they are needed. Image changes: https://github.com/citusdata/the-process/pull/157	2024-07-12 11:26:38 +03:00
Jelte Fennema-Nio	58fef24142	Update Citus Technical Documentation about the rebalancer (#7638 ) The sections about the rebalancer algorithm and the background tasks were empty. --------- Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: Steven Sheehy <17552371+steven-sheehy@users.noreply.github.com>	2024-06-27 16:07:38 +02:00
Jelte Fennema-Nio	aaaf637a6b	Redo #7620 : Fix merge command when insert value does not have source distributed column (#7627 ) Related to issue #7619, #7620 Merge command fails when source query is single sharded and source and target are co-located and insert is not using distribution key of source. Example ``` CREATE TABLE source (id integer); CREATE TABLE target (id integer ); -- let's distribute both table on id field SELECT create_distributed_table('source', 'id'); SELECT create_distributed_table('target', 'id'); MERGE INTO target t USING ( SELECT 1 AS somekey FROM source WHERE source.id = 1) s ON t.id = s.somekey WHEN NOT MATCHED THEN INSERT (id) VALUES (s.somekey) ERROR: MERGE INSERT must use the source table distribution column value HINT: MERGE INSERT must use the source table distribution column value ``` Author's Opinion: If join is not between source and target distributed column, we should not force user to use source distributed column while inserting value of target distributed column. Fix: If user is not using distributed key of source for insertion let's not push down query to workers and don't force user to use source distributed column if it is not part of join. This reverts commit `fa4fc0b372`. Co-authored-by: paragjain <paragjain@microsoft.com>	2024-06-17 14:07:25 +00:00
Jelte Fennema-Nio	fa4fc0b372	Revert rebase merge of #7620 (#7626 ) Because we want to track PR numbers and to make backporting easy we (pretty much always) use squash-merges when merging to master. We accidentally used a rebase merge for PR #7620. This reverts those changes so we can redo the merge using squash merge. This reverts all commits from `eedb607c` to `9e71750fc`.	2024-06-17 15:46:00 +02:00
paragjain	9e71750fcd	fixing flakyness in test	2024-06-15 14:55:36 -07:00
paragjain	e62ae64d00	some more	2024-06-15 14:55:36 -07:00
paragjain	76f68f47c4	removing flakyness from test	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	d5231c34ab	Revert "Try to fix failure" This reverts commit `89f7217660`.	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	f883cfdd77	Try to fix failure	2024-06-15 14:55:36 -07:00
paragjain	7c8a366ba2	some more	2024-06-15 14:55:36 -07:00
paragjain	06e9c29950	some more	2024-06-15 14:55:36 -07:00
paragjain	493140287a	fix some indent	2024-06-15 14:55:36 -07:00
paragjain	ec25b433d4	adding update and delete tests	2024-06-15 14:55:36 -07:00
paragjain	eedb607cd5	merge command fix	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	8c9de08b76	Fix CI issues after Github Actions networking changes (#7624 ) For some reason using localhost in our hba file doesn't have the intended effect anymore in our Github Actions runners. Probably because of some networking change (IPv6 maybe) or some change in the `/etc/hosts` file. Replacing localhost with the equivalent loopback IPv4 and IPv6 addresses resolved this issue.	2024-06-14 16:20:23 +02:00
Gürkan İndibay	2874d7af46	Updates github checkout actions to v4 (#7611 ) Updates checkout plugin for github actions to v4. Can not update the version for check-sql-snapshots since new plugin causes below error in the docker image this step is using . Please refer to: https://github.com/citusdata/citus/actions/runs/9286197994/job/25552373953 Error: ``` /__e/node20/bin/node: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.27' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.28' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.25' not found (required by /__e/node20/bin/node) ```	2024-05-31 20:52:17 +03:00
Gürkan İndibay	0ab42e7a80	Adds null check for node in HasRangeTableRef (#7609 ) DESCRIPTION: Adds null check for node in HasRangeTableRef to prevent errors	2024-05-28 11:03:38 +03:00
Evgeny Nechayev	fcc72d8a23	Use macro wrapper to access PGPROC data, which allow to improve compa… (#7607 ) DESCRIPTION: Use macro wrapper to access PGPROC data, to improve compatibility with PostgreSQL forks.	2024-05-28 00:39:13 +00:00
Gürkan İndibay	553d5ba15d	Adds changelog for 12.1.3 (#7587 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2024-04-22 15:38:51 +03:00
Jelte Fennema-Nio	a0151aa31d	Greatly speed up "\d tablename" on servers with many tables (#7577 ) DESCRIPTION: Fix performance issue when using "\d tablename" on a server with many tables We introduce a filter to every query on pg_class to automatically remove shards. This is useful to make sure \d and PgAdmin are not cluttered with shards. However, the way we were introducing this filter was using `securityQuals` which can have negative impact on query performance. On clusters with 100k+ tables this could cause a simple "\d tablename" command to take multiple seconds, because a skipped optimization by Postgres causes a full table scan. This changes the code to introduce this filter in the regular `quals` list instead of in `securityQuals`. Which causes Postgres to use the intended optimization again. For reference, this was initially reported as a Postgres issue by me: https://www.postgresql.org/message-id/flat/4189982.1712785863%40sss.pgh.pa.us#b87421293b362d581ea8677e3bfea920	2024-04-16 17:26:12 +02:00
Xing Guo	ada3ba2507	Add missing volatile qualifier. (#7570 ) Variables being modified in the PG_TRY block and read in the PG_CATCH block should be qualified with volatile. The variable waitEventSet is modified in the PG_TRY block (line 1085) and read in the PG_CATCH block (line 1095). The variable relation is modified in the PG_TRY block (line 500) and read in the PG_CATCH block (line 515). Besides, the variable objectAddress doesn't need the volatile qualifier. Ref: C99 7.13.2.1[^1], > All accessible objects have values, and all other components of the abstract machine have state, as of the time the longjmp function was called, except that the values of objects of automatic storage duration that are local to the function containing the invocation of the corresponding setjmp macro that do not have volatile-qualified type and have been changed between the setjmp invocation and longjmp call are indeterminate. [^1]: https://www.open-std.org/jtc1/sc22/wg14/www/docs/n1256.pdf DESCRIPTION: Correctly mark some variables as volatile --------- Co-authored-by: Hong Yi <zouzou0208@gmail.com>	2024-04-16 15:29:14 +02:00
Karina	41e2af8ff5	Use expecteddir option in _run_pg_regress() (#7582 ) Fix check-arbitrary-configs tests failure with current REL_16_STABLE. This is the same problem as described in #7573. I missed pg_regress call in _run_pg_regress() in that PR. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-04-16 08:44:47 +00:00
Jelte Fennema-Nio	a263ac6f5f	Speed up GetForeignKeyOids (#7578 ) DESCRIPTION: Fix performance issue in GetForeignKeyOids on systems with many constraints GetForeignKeyOids was showing up in CPU profiles when distributing schemas on systems with 100k+ constraints. The reason was that this function was doing a sequence scan of pg_constraint to get the foreign keys that referenced the requested table. This fixes that by finding the constraints referencing the table through pg_depend instead of pg_constraint. We're doing this indirection, because pg_constraint doesn't have an index that we can use, but pg_depend does.	2024-04-16 08:16:40 +00:00
Jelte Fennema-Nio	110b4192b2	Fix PG upgrades when invalid rebalance strategies exist (#7580 ) DESCRIPTION: Fix PG upgrades when invalid rebalance strategies exist Without this change an upgrade of a cluster with an invalid rebalance strategy would fail with an error like this: ``` cache lookup failed for shard_cost_function with oid 6077337 CONTEXT: SQL statement "SELECT citus_validate_rebalance_strategy_functions( NEW.shard_cost_function, NEW.node_capacity_function, NEW.shard_allowed_on_node_function)" PL/pgSQL function citus_internal.pg_dist_rebalance_strategy_trigger_func() line 5 at PERFORM SQL statement "INSERT INTO pg_catalog.pg_dist_rebalance_strategy SELECT name, default_strategy, shard_cost_function::regprocedure::regproc, node_capacity_function::regprocedure::regproc, shard_allowed_on_node_function::regprocedure::regproc, default_threshold, minimum_threshold, improvement_threshold FROM public.pg_dist_rebalance_strategy" PL/pgSQL function citus_finish_pg_upgrade() line 115 at SQL statement ``` This fixes that by disabling the trigger and simply re-inserting the invalid rebalance strategy without checking. We could also silently remove it, but this seems nicer.	2024-04-15 14:26:33 +00:00
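
Note on ada3ba2507 (Add missing volatile qualifier): PostgreSQL's PG_TRY/PG_CATCH macros are built on sigsetjmp/siglongjmp, so the C99 rule quoted in that commit message applies to any local variable that is changed inside PG_TRY and then read inside PG_CATCH. The following is only a minimal sketch of that rule in plain C, using setjmp/longjmp directly rather than the PostgreSQL macros. The names `raise_error` and `step_seen_in_error_path` are hypothetical and do not appear in Citus.

```c
#include <assert.h>
#include <setjmp.h>

/* Hypothetical stand-in for code that reports an error by jumping out. */
static void raise_error(jmp_buf env)
{
    assert(env != 0);
    longjmp(env, 1);
}

/*
 * Returns the value of `step` as observed in the error path.
 * `step` is modified after setjmp() and read after longjmp(), which is
 * the same pattern as a variable modified in PG_TRY and read in PG_CATCH.
 */
int step_seen_in_error_path(void)
{
    jmp_buf env;
    /* volatile: without it, the value read after longjmp is indeterminate */
    volatile int step = 0;

    if (setjmp(env) == 0)
    {
        /* "try" part: change the variable, then fail */
        step = 1;
        raise_error(env);
    }

    /* "catch" part: reached only via longjmp in this sketch */
    return step;
}
```

With the volatile qualifier in place, `step_seen_in_error_path()` returns 1. Dropping the qualifier would make the value read in the error path indeterminate under the C99 text cited in the commit, even if a particular compiler happens to produce 1.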


7014 Commits (680b870d4593146c51a318789aac5c39a1b32009)