citus

Commit Graph

Author	SHA1	Message	Date
Emel Şimşek	28ed013a91	Enable ALTER TABLE ... ADD CHECK (#6606 ) DESCRIPTION: Enable adding CHECK constraints on distributed tables without the client having to provide a constraint name. This PR enables the following command syntax for adding check constraints to distributed tables. ALTER TABLE ... ADD CHECK ... by creating a default constraint name and transforming the command into the below syntax before sending it to workers. ALTER TABLE ... ADD CONSTRAINT \<conname> CHECK ...	2023-01-12 23:31:06 +03:00
Ahmet Gedemenli	a6ad4574f6	Transfer shard with nodeid (#6612 ) DESCRIPTION: Introduce citus_copy_shard_placement UDF with node id DESCRIPTION: Introduce citus_move_shard_placement UDF with node id DESCRIPTION: Use new shard transfer functions with node id for rebalancing New shard transfer functions to be used with nodeid instead of hostname and port. Use these functions in shard rebalancer.	2023-01-12 18:33:46 +03:00
Ahmet Gedemenli	9b9d8e7abd	Use shard transfer UDFs with node ids for rebalancing	2023-01-12 16:57:51 +03:00
Ahmet Gedemenli	e5fef40c06	Introduce citus_move_shard_placement UDF with nodeid	2023-01-12 16:57:51 +03:00
Ahmet Gedemenli	e19c545fbf	Introduce citus_copy_shard_placement UDF with nodeid	2023-01-12 16:57:51 +03:00
Emel Şimşek	d322f9e382	Handle DEFERRABLE option for the relevant constraints at deparser. (#6613 ) Table Constraints UNIQUE, PRIMARY KEY and EXCLUDE may have option DEFERRABLE in their command syntax. This PR handles the option when deparsing the relevant constraint statements. NOT DEFERRABLE and INITIALLY IMMEDIATE (if DEFERRABLE} are the default values for the option so we only append the non-default values to the alter table statement.	2023-01-12 12:32:38 +03:00
Jelte Fennema	34df853bda	Fix bug introduced by #6412 (#6590 ) In #6412 I made a change to not re-assign the global PID if it was already set. This inadvertently introduced a regression where `userId` and `databaseId` would not be set on the backend data when the global PID was assigned in the authentication hook. This fixes it by doing two things: 1. Removing `userId` from `BackendData`, since it's not used anywhere anyway. 2. Move assignment of `databaseId` to dedicated `SetBackendDataDatabaseId` function, that isn't a no-op when global pid is already set. Since #6412 is not released yet this does not need a description.	2023-01-10 16:21:57 +01:00
Jelte Fennema	17775dad5d	Only run package builds on pull requests (#6605 ) Recently a package test build pipeline was introduced, to build citus on all OS that we build packages for. However, every pull request would run each build twice. This fixes that by only running it for the pull request event, not for the push event. Example of duplicate run: ![image](https://user-images.githubusercontent.com/1162278/211028723-8c0e8aa0-e267-4665-811c-6cecd4286621.png)	2023-01-06 16:02:49 +00:00
Jelte Fennema	c2b4087ff0	Quote all identifiers that we use for logical replication (#6604 ) In #6598 it was noticed that Citus could generate syntactically invalid statements during logical replication. With #6603 we resolved the direct issue, by only generating valid subscription names. But there was also the underlying problem that we did not escape certain identifier strings. While in theory this should be okay since we should only generate names that are valid, this issue reiterated that we should not take this for granted. As an extra line of defense this quotes all identifiers we use during logical replication setup.	2023-01-06 14:12:03 +00:00
Jelte Fennema	44e09128f0	Fix failures in mx_base_schedule (#6601 ) Apparently no-one actually ran the mx_base_schedule, because the tests in schedule itself were already failing. This updates it to be in line with multi_mx_schedule again to make the tests pass again. Notably it doesn't contain multi_mx_node_metadata and multi_extension. Because those tests take long to run and the were not necessary to make multi_mx_create_table pass again.	2023-01-06 14:48:18 +01:00
Ahmet Gedemenli	26b170e1a8	Use %u instead of %i for naming subscriptions & roles (#6603 ) DESCRIPTION: Fix the modifier for subscription and role creation fixes: #6598 Reported by @ivyazmitinov	2023-01-06 14:38:32 +01:00
Ahmet Gedemenli	bc3383170e	Fix crash when trying to replicate a ref table that is actually dropped (#6595 ) DESCRIPTION: Fix crash when trying to replicate a ref table that is actually dropped see #6592 We should have a real solution for it.	2023-01-06 14:52:08 +03:00
Emel Şimşek	db7a70ef3e	Enable ALTER TABLE ... ADD UNIQUE and ADD EXCLUDE. (#6582 ) DESCRIPTION: Adds support for creating table constraints UNIQUE and EXCLUDE via ALTER TABLE command without client having to specify a name. ALTER TABLE ... ADD CONSTRAINT <conname> UNIQUE ... ALTER TABLE ... ADD CONSTRAINT <conname> EXCLUDE ... commands require the client to provide an explicit constraint name. However, in postgres it is possible for clients not to provide a name and let the postgres generate it using the following commands ALTER TABLE ... ADD UNIQUE ... ALTER TABLE ... ADD EXCLUDE ... This PR enables the same functionality for citus tables.	2023-01-05 18:12:32 +03:00
Emel Şimşek	135c519c62	Fix flakyness for multi_name_lengths test (#6599 ) Fix flakyness for multi_name_lengths test.	2023-01-05 17:27:16 +03:00
aykut-bozkurt	f79ee13eef	we can reuse some steps in jobs in circle-ci config. (#6164 ) - Reuse steps in jobs via common commands, - Use yaml alias and anchors to share common params for workflow jobs.	2023-01-05 11:23:39 +03:00
Önder Kalacı	eb75decbeb	Undo planner extended statistics override (#6492 )	2023-01-04 13:25:57 +01:00
Önder Kalacı	a1aa96b32c	Make the metadata syncing less resource invasive [Phase-1] (#6537 )	2023-01-04 11:36:45 +01:00
Ahmet Gedemenli	235047670d	Drop SHARD_STATE_TO_DELETE (#6494 ) DESCRIPTION: Drop `SHARD_STATE_TO_DELETE` and use the cleanup records instead Drops the shard state that is used to mark shards as orphaned. Now we insert cleanup records into `pg_dist_cleanup` so "orphaned" shards will be dropped either by maintenance daemon or internal cleanup calls. With this PR, we make the "cleanup orphaned shards" functions to be no-op, as they would not be needed anymore. This PR includes some naming changes about placement functions. We don't need functions that filter orphaned shards, as there will be no orphaned shards anymore. We will also be introducing a small script with this PR, for users with orphaned shards. We'll basically delete the orphaned shard entries from `pg_dist_placement` and insert cleanup records into `pg_dist_cleanup` for each one of them, during Citus upgrade. We also have a lot of flakiness fixes in this PR. Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2023-01-03 14:38:16 +03:00
Jelte Fennema	f56904fe04	Fix flakyness in isolation_insert_vs_vacuum (#6589 ) Sometimes our `isolation_insert_vs_vacuum` test would fail like this. ```diff step s2-vacuum-analyze: VACUUM ANALYZE test_insert_vacuum; - + <waiting ...> step s1-commit: COMMIT; +step s2-vacuum-analyze: <... completed> ``` The reason seems to be that VACUUM ANALYZE tries to take some locks that conflict with the other transaction, but these locks somehow get released or VACUUM ANALYZE stops waiting for them. This is somewhat expected since VACUUM has some special locking logic. To solve the flakyness we now trigger VACUUM ANALYZE to always report as blocking and after that we wait explicitly wait for it to complete. This is done like is suggested by the flaky test tips from postgres: `c68a183990/src/test/isolation/README (L152)` I've confirmed that this fixes the issue suing our flaky-test-debugging CI workflow.	2023-01-02 16:51:58 +01:00
Ahmet Gedemenli	0c74e4cc0f	Fix some flaky tests (#6587 ) Fix for some simple flakiness'. All `DROP USER' and cleanup function calls.	2022-12-29 10:19:09 +03:00
Ahmet Gedemenli	f824c996b3	Remove duplicate split entry in run_test.py (#6586 ) Nothing important. Removing the duplicate `"split" in test_schedule` check	2022-12-28 18:18:24 +03:00
Ahmet Gedemenli	1b1e737e51	Drop cleanup on failure (#6584 ) DESCRIPTION: Defers cleanup after a failure in shard move or split We don't need to do a cleanup in case of failure on a shard transfer or split anymore. Because, * Maintenance daemon will clean them up anyway. * We trigger a cleanup at the beginning of shard transfers/splits. * The cleanup on failure logic also can fail sometimes and instead of the original error, we throw the error that is raised by the cleanup procedure, and it causes confusion.	2022-12-28 15:48:44 +03:00
Ahmet Gedemenli	cfc17385e9	Some minor improvements on flakiness detection (#6585 ) * Skip some exceptional test files in the flaky workflow, like multi_extension * Run some tests without a schedule, like single_node_enterprise * Use minimal schedule for the tests in split and operations schedules	2022-12-28 15:08:39 +03:00
Ahmet Gedemenli	eba9abeee2	Fix leftover shard copy on the target node when tx with move is aborted (#6583 ) DESCRIPTION: Cleanup the shard on the target node in case of a failed/aborted shard move Inserts a cleanup record for the moved shard placement on the target node. If the move operation succeeds, the record will be deleted. If not, it will remain there to be cleaned up later. fixes: #6580	2022-12-27 22:42:46 +03:00
Naisila Puka	e937935935	Clean up normalize file (#6578 )	2022-12-26 12:08:27 +03:00
Naisila Puka	bc49616426	Fix link of path to flaky tests docs (#6579 )	2022-12-23 12:09:22 +03:00
Naisila Puka	9c78f693f2	Fix typo in fix_style.sh (#6577 ) For reference, this is the print stack trace PR https://github.com/citusdata/citus/pull/6539	2022-12-22 16:24:26 +03:00
Ahmet Gedemenli	cf93eb46c6	Fix split schedule (#6571 ) * Drop enterprise_split_schedule as it's not even called in our CI pipeline. It's actually a subset of split_schedule, except for `citus_split_shard_by_split_points_deferred_drop`. Added that one into split_schedule and dropped the enterprise one. * Delete `citus_non_blocking_shard_split_cleanup.out`, as there is no sql file for it. It seems it's renamed to some other test and the sql file is deleted, but we forgot to delete the output file. * 6 test files are chained to each other with dependent objects. Unified them into one test file so that the flaky check will not fail for them anymore. * Some cleanup lines to prevent the flakiness check from failing.	2022-12-22 13:20:06 +03:00
Ahmet Gedemenli	96bf648d1a	Unify dependent test files into one	2022-12-22 13:06:26 +03:00
Ahmet Gedemenli	acf3539a90	Fix split schedule	2022-12-22 13:06:26 +03:00
Ahmet Gedemenli	497a7589d0	Use failtester for flakyness detection (#6569 ) We need failtester image to run failure tests in flakiness detection workflow.	2022-12-21 18:56:51 +03:00
Hanefi Onaldi	303db172f8	Use Citus version comparison in upgrade tests, not equality (#6568 ) We have several version checks in our Citus upgrade tests. However, as we drop support for PG versions, we need to update the Citus versions used in our CI images. Therefore we must compare Citus versions in our tests instead of using equality checks so that the queries are ran in all the associated Citus versions. For example, we have many conditionals where we early exit if the Citus version is not equal to 9.0. However, as of today we never use version 9.0 and thus we always early exit in those tests.	2022-12-21 14:01:57 +03:00
Hanefi Onaldi	d6833c877b	Add changelog entries for 11.1.5 (#6567 )	2022-12-20 14:56:26 +03:00
Teja Mupparti	9a9989fc15	Support MERGE Phase – I All the tables (target, source or any CTE present) in the SQL statement are local i.e. a merge-sql with a combination of Citus local and Non-Citus tables (regular Postgres tables) should work and give the same result as Postgres MERGE on regular tables. Catch and throw an exception (not-yet-supported) for all other scenarios during Citus-planning phase.	2022-12-18 20:32:15 -08:00
Emel Şimşek	5268d0a6cb	Enable PRIMARY KEY generation via ALTER TABLE even if the constraint name is not provided (#6520 ) DESCRIPTION: Support ALTER TABLE .. ADD PRIMARY KEY ... command Before processing > ALTER TABLE ... ADD PRIMARY KEY ... command 1. Create a primary key name to use as the constraint name. 2. Change the ALTER TABLE ... ADD PRIMARY KEY ... command to into ALTER TABLE ... ADD CONSTRAINT \<constraint name> PRIMARY KEY ... form. This is the only form we can specify a name for a primary key. If we run ALTER TABLE .. ADD PRIMARY KEY, postgres would create a constraint name internally in its own scheme. But the problem is that we need to create constraint names for shards in our own scheme which is \<constraint name>_\<shardid>. Hence we need to create a name and send it to workers so that the workers can append the shardid. 4. Run the changed command on the coordinator to make sure we are using the same constraint name across the board. 5. Send the changed command to workers such that it is executed for the main table as well as for the shards. Fixes #6515.	2022-12-16 20:34:00 +03:00
aykut-bozkurt	9c0073ba57	remove unused boundary type (#6563 ) Removes unused job boundary tag `SUBQUERY_MAP_MERGE_JOB`. Only usage is at `BuildMapMergeJob`, which is only called when the boundary = `JOIN_MAP_MERGE_JOB`. Hence, it should be safe to remove.	2022-12-16 18:19:22 +03:00
Önder Kalacı	f7e881a4c4	Do not create additional WaitEventSet for RemoteSocketClosed checks (#6505 ) Fixes #6501 Before this commit, we created an additional WaitEventSet for checking whether the remote socket is closed per connection - only once at the start of the execution. However, for certain workloads, such as pgbench select-only workloads, the creation/deletion of the additional WaitEventSet adds ~7% CPU overhead, which is also reflected on the benchmark results. With this commit, we use the same WaitEventSet for the purposes of checking the remote socket at the start of the execution. We use "rebuildWaitEventSet" flag so that the executor can re-use the existing WaitEventSet. As a result, we see the following improvements on PG 15: main : 120051 tps, 0.532 ms latency avg. avoid_wes_rebuild: 127119 tps, 0.503 ms latency avg. And, on PG 14, as expected, there is no difference main : 129191 tps, 0.495 ms latency avg. avoid_wes_rebuild: 129480 tps, 0.494 ms latency avg. But, note that PG 15 is slightly (~1.5%) slower than PG 14. That is probably the overhead of checking the remote socket.	2022-12-14 22:53:38 +01:00
Onder Kalaci	feb5534c65	Do not create additional WaitEventSet for RemoteSocketClosed checks Before this commit, we created an additional WaitEventSet for checking whether the remote socket is closed per connection - only once at the start of the execution. However, for certain workloads, such as pgbench select-only workloads, the creation/deletion of the additional WaitEventSet adds ~7% CPU overhead, which is also reflected on the benchmark results. With this commit, we use the same WaitEventSet for the purposes of checking the remote socket at the start of the execution. We use "rebuildWaitEventSet" flag so that the executor can re-use the existing WaitEventSet. As a result, we see the following improvements on PG 15: main : 120051 tps, 0.532 ms latency avg. avoid_wes_rebuild: 127119 tps, 0.503 ms latency avg. And, on PG 14, as expected, there is no difference main : 129191 tps, 0.495 ms latency avg. avoid_wes_rebuild: 129480 tps, 0.494 ms latency avg. But, note that PG 15 is slightly (~1.5%) slower than PG 14. That is probably the overhead of checking the remote socket.	2022-12-14 22:42:55 +01:00
Onder Kalaci	d52da55ac0	Move WaitEvent to DistributedExecution Prep. for caching WaitEventsSet/WaitEvents	2022-12-14 21:59:19 +01:00
Nils Dijk	b5b73d78c3	add prepare and finish pg upgrade functions to 11.2-1 (#6560 ) Fixes a missed include in #6315. While adding the cluster clock we have added some extra steps to `citus_prepare_pg_upgrade` and `citus_finish_pg_upgrade`. These changes were not added to the citus upgrade and downgrade scripts, this allowed for a syntax error to slip in. This PR adds the new versions of both UDF's to the upgrade script while adding the old version to the downgrade script. This exposed the syntax error which is also solved.	2022-12-14 12:34:22 +01:00
Gokhan Gulbiz	556161be32	Fix make recipe mapping in test runner (#6561 )	2022-12-14 12:57:13 +03:00
aykut-bozkurt	8be4ce546e	fix vanilla test status on CI (#6555 ) - Because of the make command used for vanilla tests, test status is always shown as success on CI. As a fix, I added `&& false` at the end of the copying diff file to make the command fail when check-vanilla fails. ```make check-vanilla: all $(pg_regress_multi_check) --vanillatest \|\| (cp $(vanilla_diffs_file) $(citus_abs_srcdir)/regression.diffs && false) ``` - I also fixed some vanilla tests that fails due to recently added clock related operators shown up at some queries.	2022-12-13 11:15:47 +03:00
Gürkan İndibay	3f091e3493	Give nicer error message when using alter_table_set_access_method on a view (#6553 ) DESCRIPTION: Fixes alter_table_set_access_method error for views. Fixes #6001	2022-12-12 23:56:22 +03:00
aykut-bozkurt	1ad1a0a336	add citus_task_wait udf to wait on desired task status (#6475 ) We already have citus_job_wait to wait until the job reaches the desired state. That PR adds waiting on task state to allow more granular waiting. It can be used for Citus operations. Moreover, it is also useful for testing purposes. (wait until a task reaches specified state) Related to #6459.	2022-12-12 22:41:03 +03:00
aykut-bozkurt	80686907a3	print stack trace from core files instead of uploading them (#6539 ) Prints stack trace from core files instead of uploading them.	2022-12-12 18:53:17 +03:00
Gokhan Gulbiz	d307e342a2	Use test name parameter in flakiness detection (#6559 ) This PR changes test-flakyness CI job to pass the test name instead of the file path to `run_test.py` script.	2022-12-12 17:53:25 +03:00
aykut-bozkurt	3da6e3e743	bgworkers with backend connection should handle SIGTERM properly (#6552 ) Fixes task executor SIGTERM handling. Problem: When task executors are sent SIGTERM, their default handler `bgworker_die`, which is set at worker startup, logs FATAL error. But they do not release locks there before logging the error, which sometimes causes hanging of the monitor. e.g. Monitor waits for the lock forever at pg_stat flush after calling proc_exit. Solution: Because executors have connection to backend, they should handle SIGTERM similar to normal backends. Normal backends uses `die` handler, in which they set ProcDiePending flag and the next CHECK_FOR_INTERRUPTS call handles it gracefully by releasing any lock before termination.	2022-12-12 16:44:36 +03:00
dependabot[bot]	f6b8990fc7	Bump certifi from 2022.9.14 to 2022.12.7 in /src/test/regress (#6554 ) Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.9.14 to 2022.12.7. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-12-12 14:13:08 +01:00
Gokhan Gulbiz	e2a73ad8a8	Flaky Test Detection CI Workflow (#6495 ) This PR adds a new CI workflow named ```flaky-test``` to run flaky test detection on newly introduced regression tests. Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2022-12-12 14:36:23 +03:00
Ahmet Gedemenli	190307e8d8	Wait for cleanup function (#6549 ) Adding a testing function `wait_for_resource_cleanup` which waits until all records in `pg_dist_cleanup` are cleaned up. The motivation is to prevent flakiness in our tests, since the `NOTICE: cleaned up X orphaned resources` message is not consistent in many cases. This PR replaces `citus_cleanup_orphaned_resources` calls with `wait_for_resource_cleanup` calls.	2022-12-08 13:19:25 +03:00

... 3 4 5 6 7 ...

6530 Commits (350a0f64173306c7dadcfabba21979a7ca865bcd) All Branches Search

6530 Commits (350a0f64173306c7dadcfabba21979a7ca865bcd)

All Branches