citus

Commit Graph

Author	SHA1	Message	Date
Burak Velioglu	e244e9ffb6	Fix dropping temporary view without specifying the explicit schema name (#6003 )	2022-06-15 16:41:12 +02:00
Marco Slot	ee34e1ed9d	Fix bug in unqualified, non-existing DROP DOMAIN IF EXISTS	2022-06-15 13:59:08 +02:00
Ahmet Gedemenli	268d3fa3a6	Fix materialized view intermediate result filename (#5982 )	2022-06-14 15:07:08 +03:00
Onder Kalaci	af22a30b48	Use citus_finish_citus_upgrade() in the tests We already have tests relying on citus_finalize_upgrade_to_citus11(). Now, adjust those to rely on citus_finish_citus_upgrade() and always call citus_finish_citus_upgrade().	2022-06-13 13:15:15 +02:00
Marco Slot	36c4ec6d53	Introduce a citus_finish_citus_upgrade() function	2022-06-13 13:15:15 +02:00
Halil Ozan Akgul	b255706189	Fixes the bug where undistribute can drop Citus extension	2022-05-31 16:23:28 +03:00
Onder Kalaci	89c1ccb7a5	Show that no metadata is sent when disabled	2022-05-30 13:41:06 +02:00
Onder Kalaci	7157152f6c	Do not send metadata changes during add node if citus.enable_metadata_sync is set to false	2022-05-30 13:24:31 +02:00
Onder Kalaci	010a2a408e	Avoid assertion failure on citus_add_node	2022-05-30 12:22:09 +02:00
Gledis Zeneli	beef392f5a	Fix memory error with citus_add_node reported by valgrind test (#5967 ) The error comes due to the datum jsonb in pg_dist_metadata_node.metadata being 0 in some scenarios. This is likely due to not copying the data when receiving a datum from a tuple and pg deciding to deallocate that memory when the table that the tuple was from is closed. Also fix another place in the code that might have been susceptible to this issue. I tested on both multi-vg and multi-1-vg and the test were successful.	2022-05-28 00:22:00 +03:00
Ahmet Gedemenli	26d927178c	Propagate dependent views upon distribution (#5950 )	2022-05-26 14:23:45 +03:00
jeff-davis	74ce210f8b	Columnar: fix wraparound bug. (#5962 ) columnar_vacuum_rel() now advances relfrozenxid. Fixes #5958.	2022-05-25 07:50:48 -07:00
Burak Velioglu	1d7dda991f	Create view and materialized views with right schema and owner while altering the distributed table. To be able to alter view's owner without enforcing sequential mode. Alter view process functions have been udpated to use metadata connection.	2022-05-24 15:27:30 +03:00
Gledis Zeneli	27ddb4fc8e	Do not obtain AccessShareLock before actual lock (#5965 ) Do not obtain AccessShareLock before acquiring the distributed locks. Acquiring an AccessShareLock ensures that the relations which we are trying to get a distributed lock on will not be dropped in the time between when the LOCK command is issued and the LOCK commands are send to the worker. However, this also leads to distributed deadlocks in such scenarios: ```sql -- for dist lock acquiring order coor, w1, w2 -- on w2 LOCK t1 IN ACCESS EXLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- concurrently on w1 LOCK t1 IN ACCESS EXLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- acquire dist lock on coor, w1, gets blocked on local AccessShareLock on w2 -- on w2 continuation of the execution above -- starts to acquire dist locks and gets blocked on the coor by the lock acquired by w1 -- distributed deadlock ``` We opt for avoiding such deadlocks with the cost of the possibility of running into errors when the relations on which we are trying to acquire locks on get dropped.	2022-05-23 13:06:38 +03:00
Onder Kalaci	dd02e1755f	Parallelize metadata syncing on node activate It is often useful to be able to sync the metadata in parallel across nodes. Also citus_finalize_upgrade_to_citus11() uses start_metadata_sync_to_primary_nodes() after this commit. Note that this commit does not parallelize all pieces of node activation or metadata syncing. Instead, it tries to parallelize potenially large parts of metadata, which is the objects and distributed tables (in general Citus tables). In the future, it would be nice to sync the reference tables in parallel across nodes. Create ~720 distributed tables / ~23450 shards ```SQL -- declaratively partitioned table CREATE TABLE github_events_looooooooooooooong_name ( event_id bigint, event_type text, event_public boolean, repo_id bigint, payload jsonb, repo jsonb, actor jsonb, org jsonb, created_at timestamp ) PARTITION BY RANGE (created_at); SELECT create_time_partitions( table_name := 'github_events_looooooooooooooong_name', partition_interval := '1 day', end_at := now() + '24 months' ); CREATE INDEX ON github_events_looooooooooooooong_name USING btree (event_id, event_type, event_public, repo_id); SELECT create_distributed_table('github_events_looooooooooooooong_name', 'repo_id'); SET client_min_messages TO ERROR; ``` across 1 node: almost same as expected ```SQL SELECT start_metadata_sync_to_primary_nodes(); Time: 15664.418 ms (00:15.664) select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 14284.069 ms (00:14.284) ``` across 7 nodes: ~3.5x improvement ```SQL SELECT start_metadata_sync_to_primary_nodes(); ┌──────────────────────────────────────┐ │ start_metadata_sync_to_primary_nodes │ ├──────────────────────────────────────┤ │ t │ └──────────────────────────────────────┘ (1 row) Time: 25711.192 ms (00:25.711) -- across 7 nodes select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 82126.075 ms (01:22.126) ```	2022-05-23 09:15:48 +02:00
jeff-davis	a2f5b068e6	Columnar: tighten security and improve visibility. (#5922 ) Move internal storage details to a separate schema with no public access to limit the possibility for information leakage. Create views with public access that show storage details for those columnar tables where the user has ownership privileges. Include mapping between relation ID and storage ID for easier interpretation.	2022-05-20 15:30:31 -07:00
Hanefi Onaldi	52541c5802	Add normalization rules for flaky isolation tests We remove `<waiting ...>` and `<... completed>` outputs for some CREATE INDEX CONCURRENTLY commands since they can cause flakiness in some scenarios. Postgres calls WaitForOlderSnapshots() and this can cause CREATE INDEX CONCURRENTLY commands for shards to get blocked by each other for brief periods of time. The extra waits can pop-up, or they can get completed at different lines in the output files. To remedy that, we rename those indexes to be captured by the new normalization rule.	2022-05-21 00:55:47 +03:00
Ying Xu	a1151c2395	Clear metadatacache during abort for create extension (#5907 ) * Bug fix for bug #5876. Memset MetadataCacheSystem every time there is an abort * Created an ObjectAccessHook that saves the transactionlevel of when citus was created and will clear metadatacache if that transaction level is rolled back. Added additional tests to make sure metadatacache is cleared	2022-05-20 13:47:58 -07:00
Marco Slot	7abcfac61f	Add caching for functions that check the backend type	2022-05-20 19:02:37 +02:00
Marco Slot	09ec366ff5	Improve nested execution checks and add GUC to disable	2022-05-20 18:55:43 +02:00
Marco Slot	e683993449	Fix prepared statement bug when switching from local to remote execution	2022-05-20 18:55:43 +02:00
jeff-davis	a9f8a60007	Columnar: support relation options with ALTER TABLE. (#5935 ) Columnar: support relation options with ALTER TABLE. Use ALTER TABLE ... SET/RESET to specify relation options rather than alter_columnar_table_set() and alter_columnar_table_reset(). Not only is this more ergonomic, but it also allows better integration because it can be treated like DDL on a regular table. For instance, citus can use its own ProcessUtility_hook to distribute the new settings to the shards. DESCRIPTION: Columnar: support relation options with ALTER TABLE.	2022-05-20 08:35:00 -07:00
Marco Slot	ad5214b50c	Allow distributed execution from run_command_on_* functions	2022-05-20 15:26:47 +02:00
gledis69	4731630741	Add distributing lock command support	2022-05-20 12:28:07 +03:00
Marco Slot	79d7e860e6	Add a run_command_on_coordinator function	2022-05-19 10:26:09 +02:00
Marco Slot	fa9cee409c	Fix downgrade scripts and add new downgrade tests	2022-05-19 10:26:09 +02:00
Ahmet Gedemenli	48d5c9a1b5	Fix schemaname qualify for rename seq stmts	2022-05-18 19:04:22 +03:00
Onder Kalaci	0596062f96	Serialize reference table modifications with node changes & restore point With Citus MX enabled, when a reference table is modified, it does some operations on the first worker node(e.g., acquire locks). If node metadata is locked (via add node or create restore point), the changes to the reference tables should be blocked.	2022-05-18 17:23:38 +02:00
Onder Kalaci	127450466e	Do not warn unncessarily when a node is removed In the past (pre-11), we allowed removing worker nodes that had active placements for replicated distributed table, without even checking if there are any other replicas of the same placement. However, with #5469, we prevent disabling nodes via a hard error when there is the last active placement of shard, as we do for reference tables. Note that otherwise, we'd allow users to lose data. As of today, the NOTICE is completely irrelevant.	2022-05-18 17:23:38 +02:00
Onder Kalaci	b4dbd84743	Prevent distributed queries while disabling first worker node First worker node has a special meaning for modifications on the replicated tables It is used to acquire a remote lock, such that the modifications are serialized. With this commit, we make sure that we do not let any distributed query to see a different 'first worker node' while first worker node is disabled. Note that, maybe implicitly mentioned above, when first worker node is disabled, the first worker node changes, that's why we have to handle the situation.	2022-05-18 17:21:12 +02:00
Onder Kalaci	db998b3d66	Adds "sync" option to citus_disable_node() UDF Before this commit, we had: ```SQL SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false) ``` Where, we allow forcing to disable first worker node with `force:=true`. However, it entails the risk for losing data / diverging placement data etc. With `force` flag, we control disabling the first worker node, and with `async` flag we control whether the changes are done via bg worker or immediately. ```SQL SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false, sync boolean DEFAULT false) ``` Where we can achieve all the following: \| Mode \| Data loss possibility \| Can run in 2PC \| Handle multiple node failures \| Immediately effective \| \| --- \|--- \|--- \|--- \|--- \| \| force:false, sync: false \| false \| true \| true \| false \| \| force:false, sync: true \| false \| false \| false \| true \| \| force:true, sync: false \| true \| true \| true \| false \| \| force:true, sync: true \| false \| false \| false \| true \|	2022-05-18 17:21:12 +02:00
Onder Kalaci	2cc4053fc1	Fixes a bug that prevents dropping/altering indexes There are two problems in this area. First, when there are expressions on the index name, we should call `transformIndexExpression()` before generating the index name. That is what Postgres does. Second, because of `40c24bfef9` PG 13 and PG 14 generates different names for indexes with function calls even for local PG tables. Assume we have: ```SQL create table t(id int); select create_distributed_table('t', 'id'); create index ON t (my_very_boring_function(id)); ``` On PG 13, the name of the index is `t_expr_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_expr_idx" btree (my_very_boring_function(id::bigint)) ``` On PG 14, the name of the index is `t_my_very_boring_function_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_my_very_boring_function_idx" btree (my_very_boring_function(id::bigint)) ``` The second issue is not very critical. The important part is that we adjust regression tests to drop all the indexes, which ensures the index names are sane on any version.	2022-05-18 16:35:17 +02:00
Nils Dijk	b71a08955a	Refactor: reduce complexity and code duplication for Object Propagation Over time we have added significantly improved the support for objects to be propagated by Citus as to make scaling out the database more seamless. It became evident that there was a lot of code duplication that got into the codebase to implement the propagation. This PR tries to reduce the amount of repeated code that is at most only slightly different. To make things worse, most of the differences were actually oversights instead of correct. This Patch introduces 3 reusable sets of pre/post processing steps for respectively - create - alter - drop With the use of the common functionality we should have more coherent behaviour between different supported object by Citus. Some steps either omit the Pre or Post processing step if they would not make sense to include. All tests pass, only 1 test needed changing, foreign servers, as the dropping of foreign servers didn't implement support for dropping multiple foreign servers at once. Given the common approach correctly supports dropping of multiple objects, either distributed or not, the test that assumed it wouldn't work was now obsolete.	2022-05-18 15:58:28 +02:00
Onder Kalaci	ee45e7bfbf	Mark existing views as distributed when upgrade to 11.0+ We have a mechanism which ensures that newly distributed objects are recorded during `alter extension citus update`. However, the logic was lacking "view"s. With this commit, we make sure that existing views are also marked as distributed during upgrade.	2022-05-18 15:43:17 +02:00
Nils Dijk	14c6c799f2	suppress notices when more dependencies are found (#5954 ) We are nearing the 100 objects being propagated in `master_copy_shard_placement` and with the extra supported objects this gets pushed over a 100 objects. When a 100 objects are reached for propagation a notice will be shown to the user, informing them it might take a while to finish the operation. During testing this is not important to see. Since the message contains the exact number of objects to be propagated the tests becomes very unstable when merging community into enterprsie. This change makes that the test output stays stable.	2022-05-18 14:31:10 +03:00
Hanefi Onaldi	313104ab9b	Grep logs for deterministic global_cancel test results (#5948 )	2022-05-18 11:09:54 +03:00
Halil Ozan Akgul	d171a736ab	Revert "Creates new colocation for colocate_with:='none' too" This reverts commit `f74447b3b7`.	2022-05-17 15:32:22 +03:00
Ahmet Gedemenli	aa8f46ead0	Fix schema name bug for sequences (#5937 )	2022-05-16 18:11:57 +03:00
Halil Ozan Akgul	f74447b3b7	Creates new colocation for colocate_with:='none' too	2022-05-16 13:39:05 +03:00
Teja Mupparti	e56fc34404	Fixes: #5787 In prepared statements, map any unused parameters to a generic type.	2022-05-13 19:31:05 -07:00
Burak Velioglu	1875516ae9	Add ALTER VIEW support Adds support for propagation ALTER VIEW commands to - Change owner of view - SET/RESET option - Rename view and view's column name - Change schema of the view Since PG also supports targeting views with ALTER TABLE commands, related code also added to direct such ALTER TABLE commands to ALTER VIEW commands while sending them to workers.	2022-05-13 13:21:53 +03:00
Marco Slot	6fad5dc207	Add a citus_is_coordinator function	2022-05-13 10:02:52 +02:00
Ahmet Gedemenli	00e0f4d8e6	Fix alter statistics namespace name	2022-05-11 18:44:37 +03:00
Gledis Zeneli	4c6f62efc6	Switch to using LOCK instead of lock_relation_if_exists in TRUNCATE (#5930 ) Breaking down #5899 into smaller PR-s This particular PR changes the way TRUNCATE acquires distributed locks on the relations it is truncating to use the LOCK command instead of lock_relation_if_exists. This has the benefit of using pg's recursive locking logic it implements for the LOCK command instead of us having to resolve relation dependencies and lock them explicitly. While this does not directly affect truncate, it will allow us to generalize this locking logic to then log different relations where the pg recursive locking will become useful (e.g. locking views). This implementation is a bit more complex that it needs to be due to pg not supporting locking foreign tables. We can however, still lock foreign tables with lock_relation_if_exists. So for a command: TRUNCATE dist_table_1, dist_table_2, foreign_table_1, foreign_table_2, dist_table_3; We generate and send the following command to all the workers in metadata: ```sql SEL citus.enable_ddl_propagation TO FALSE; LOCK dist_table_1, dist_table_2 IN ACCESS EXCLUSIVE MODE; SELECT lock_relation_if_exists('foreign_table_1', 'ACCESS EXCLUSIVE'); SELECT lock_relation_if_exists('foreign_table_2', 'ACCESS EXCLUSIVE'); LOCK dist_table_3 IN ACCESS EXCLUSIVE MODE; SEL citus.enable_ddl_propagation TO TRUE; ``` Note that we need to alternate between the lock command and lock_table_if_exists in order to preserve the TRUNCATE order of relations. When pg supports locking foreign tables, we will be able to massive simplify this logic and send a single LOCK command.	2022-05-11 18:38:48 +03:00
Burak Velioglu	1460452442	Introduce CREATE/DROP VIEW Adds support for propagating create/drop view commands and views to worker node while scaling out the cluster. Since views are dropped while converting the table type, metadata connection will be used while propagating view commands to not switch to sequential mode.	2022-05-10 13:07:14 +03:00
Burak Velioglu	06a94d167e	Use object address instead of relation id on DDLJob to decide on syncing metadata	2022-05-05 17:59:44 +03:00
Onder Kalaci	f193e16a01	Refrain reading the metadata cache for all tables during upgrade First, it is not needed. Second, in the past we had issues regarding this: https://github.com/citusdata/citus/pull/4344 When I create 10k tables, ~120K shards, this saves 40Mb of memory during ALTER EXTENSION citus UPDATE. Before the change: MetadataCacheMemoryContext: 41943040 ~ 40MB After the change: MetadataCacheMemoryContext: 8192	2022-05-04 16:44:06 +02:00
Marco Slot	ceb593c9da	Convert citus.hide_shards_from_app_name_prefixes to citus.show_shards_for_app_name_prefixes	2022-05-03 14:22:13 +02:00
Jeff Davis	3e1180de78	PG15: handle extra argument to parse_analyze_varparams(). From PG commit 25751f54b8.	2022-05-02 10:12:03 -07:00
Jeff Davis	b6a5617ea8	PG15: handle pg_analyze_and_rewrite_* renaming. From PG commit 791b1b71da.	2022-05-02 10:12:03 -07:00
Jeff Davis	33ee4877d4	PG15: rename pgstat_initstats() -> pgstat_init_relation(). From PG commits bff258a273 and be902e2651.	2022-05-02 10:12:03 -07:00
Jeff Davis	033f9cfff7	PG15: update copied pg_get_object_address() code. Account for PG commits 5a2832465fd8 and a0ffa885e478.	2022-05-02 10:12:03 -07:00
Jeff Davis	bd455f42e3	PG15: handle change to SeqScan structure. Account for PG commit 2226b4189b. The one site dependent on it can do just as well with a Scan instead of a SeqScan.	2022-05-02 10:12:03 -07:00
Jeff Davis	3799f95742	PG15: Value -> String, Integer, Float. Handle PG commit 639a86e36a.	2022-05-02 10:12:03 -07:00
Jeff Davis	26f5e20580	PG15: update integer parsing APIs. Account for PG commits 3c6f8c011f and cfc7191dfe.	2022-05-02 10:12:03 -07:00
Jeff Davis	70c915a0f2	PG15: Handle data type changes in pg_collation. Account for PG commit 54637508f8.	2022-05-02 10:12:03 -07:00
Jeff Davis	9915fe8a1a	PG15: Handle different ways to get publication actions. Account for PG commit 52e4f0cd47.	2022-05-02 10:12:03 -07:00
Jeff Davis	1c1ef7ab8d	PG15: Handle extra argument to RelationCreateStorage. Account for PG commit 9c08aea6a309. Introduce RelationCreateStorage_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	ac952b2cc2	PG15: Handle extra argument to ExecARDeleteTriggers. Account for PG commit ba9a7e3921. Introduce ExecARDeleteTriggers_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	f944722c6a	PG15: Use RelationGetSmgr() instead of RelationOpenSmgr(). Handle PG commit f10f0ae420.	2022-05-02 10:12:03 -07:00
Hanefi Onaldi	518fb0873e	Introduce one new alternative text output to fix flakiness (#5913 ) Here is a flaky test output that is quite hard to fix: ```diff diff -dU10 -w /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out /home/circleci/project/src/test/regress/results/isolation_master_update_node.out --- /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out.modified 2022-03-21 19:03:54.237042562 +0000 +++ /home/circleci/project/src/test/regress/results/isolation_master_update_node.out.modified 2022-03-21 19:03:54.257043084 +0000 @@ -49,18 +49,20 @@ <waiting ...> step s2-update-node-1-force: <... completed> master_update_node ------------------ (1 row) step s2-abort: ABORT; step s1-abort: ABORT; FATAL: terminating connection due to administrator command -SSL connection has been closed unexpectedly +server closed the connection unexpectedly + This probably means the server terminated abnormally + before or while processing the request. ``` I could not come up with a solution that would decrease the flakiness in the test outputs. We already have 3 output files for the same test and now I introduced a 4th one. I can also add complex regular expressions that span multiple lines, and normalize these error messages. Feel free to suggest a normalized error message in a comment here. ## Current alternative file contents `isolation_master_update_node.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` `isolation_master_update_node_0.out` ``` step s1-abort: ABORT; WARNING: this step had a leftover error message FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. ``` `isolation_master_update_node_1.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` new file: `isolation_master_update_node_2.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. ```	2022-04-28 16:52:02 +03:00
Onder Kalaci	5fc7661169	Do not set coordinator's metadatasynced column to false After a disable_node	2022-04-25 09:25:59 +02:00
Onder Kalaci	a2debe0f02	Do not assign distributed transaction ids for local execution In the past, for all modifications on the local execution, we enabled 2PC (with `6a7ed7b309`). This also required us to enable coordinated transactions via https://github.com/citusdata/citus/pull/4831 . However, it does have a very substantial impact on the distributed deadlock detection. The distributed deadlock detection is designed to avoid single-statement transactions because they cannot lead to any actual deadlocks. The implementation is to skip backends without distributed transactions are assigned. Now that we assign single statement local executions in the lock graphs, we are conflicting with the design of distributed deadlock detection. In general, we should fix it. However, one might think that it is not a big deal, even if the processes show up in the lock graphs, the deadlock detection should not be causing any false positives. That is false, unless https://github.com/citusdata/citus/issues/1803 is fixed. Now that local processes are considered as a single distributed backend, the lock graphs might find: local execution 1 [tx id: 1] -> any local process [tx id: 0] any local process [tx id: 0] -> local execution 2 [tx id: 2] And, decides that there is a distributed deadlock. This commit is: (a) right thing to do, as local execuion should not need any distributed tx id (b) Eliminates performance issues that might come up with deadlock detection does a lot of unncessary checks (c) After moving local execution after the remote execution via https://github.com/citusdata/citus/pull/4301, the vauge requirement for assigning distributed tx ids are already gone.	2022-04-13 13:25:12 +02:00
Hanefi Onaldi	6254f30305	Add arbitrary config tests for function DDL statements (#5885 )	2022-04-12 16:03:10 +03:00
Önder Kalacı	dd78c81378	Fix flaky isolation - 1 (#5900 ) * Do not show any PG internal queries	2022-04-11 20:43:51 -07:00
Burak Velioglu	5d9599f964	Create function in transaction according to create object propagation guc	2022-04-08 17:15:31 +03:00
Nils Dijk	8897361f95	Implement DOMAIN propagation for citus	2022-04-08 15:25:39 +02:00
Jelte Fennema	6d8c5931d6	Work around flaky test related to search_path (#5894 ) For some reason search_path is not always set correctly on the worker when calling a distributed function, this shows up when calling `insert_document` in our distributed_triggers test. The underlying reason is currently unknown and warrants deeper investigation. Currently this test is one of the main causes for random CI failures. So this change sets the search_path of each function explicitly, to reduce these failures. So other devs can be more efficient, while I continue investigating the root cause of this issue. Also changes explicit `SET citus.enable_unsafe_triggers = false` to `RESET citus.enable_unsafe_triggers` in passing.	2022-04-08 16:09:33 +03:00
Onder Kalaci	b0b91bab04	Rename metadata sync to node metadata sync where applicable	2022-04-07 17:51:31 +02:00
Marco Slot	2304815356	Allow adding a unique constraint with an index	2022-04-07 16:00:31 +02:00
Marco Slot	c0827703ec	Fix EXPLAIN ANALYZE JSON format for subplans	2022-04-07 11:38:20 +02:00
Marco Slot	544dce919a	Handle user-defined type parameters in EXPLAIN ANALYZE	2022-04-07 11:14:32 +02:00
Marco Slot	9476f377b5	Remove old re-partitioning functions	2022-04-04 18:11:52 +02:00
Marco Slot	8c8c3b665d	Add TABLESAMPLE support	2022-04-01 15:51:40 +02:00
Ahmet Gedemenli	a62de6494d	Add schema tests to arbitrary configs	2022-04-01 13:57:17 +03:00
jeff-davis	c485a04139	Separate build of citus.so and citus_columnar.so. (#5805 ) * Separate build of citus.so and citus_columnar.so. Because columnar code is statically-linked to both modules, it doesn't make sense to load them both at once. A subsequent commit will make the modules entirely separate and allow loading them both simultaneously. Author: Yanwen Jin * Separate citus and citus_columnar modules. Now the modules are independent. Columnar can be loaded by itself, or along with citus. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2022-03-31 19:47:17 -07:00
Gledis Zeneli	c9aab7fb8b	Add TRUNCATE arbitrary config tests (#5848 ) Adds TRUNCATE arbitrary config tests. Also adds the ability to skip tests from particular configs.	2022-03-31 14:14:47 +03:00
Onder Kalaci	9043a1ed3f	Only hide shards from client backends and pg bg workers The aim of hiding shards is to hide shards from client applications. Certain bg workers (such as pg_cron or Citus maintanince daemon) should be treated like client applications because users can run queries from such bg workers. And, these bg workers should follow the similar application_name checks as client backeends. Certain other bg workers, such as logical replication or postgres' parallel workers, should never hide shards. They are internal operations. Similarly the other backend types like the walsender or checkpointer or autovacuum should never hide shards.	2022-03-30 16:56:12 +02:00
Ahmet Gedemenli	f74d3eedc8	Add tests for materialized views	2022-03-30 16:01:11 +03:00
Ahmet Gedemenli	8ef2da8192	Add view tests to arbitrary configs	2022-03-30 12:28:31 +03:00
Önder Kalacı	670fae99f7	Add tests with function dependencies on tables (#5866 ) We are not sure if we have such tests, but lets add anyway	2022-03-29 18:04:07 +03:00
Ahmet Gedemenli	1e1e66eeed	Add index tests to arbitrary configs (#5862 )	2022-03-29 13:49:05 +03:00
Ahmet Gedemenli	b52823f8b4	Fix typo in error message for truncating foreign tables (#5864 )	2022-03-29 13:14:16 +03:00
Onder Kalaci	23ff095905	add missing check_mx	2022-03-29 10:35:12 +02:00
Halil Ozan Akgul	c843ebe48e	Turn metadata sync on in arbitrary config tests	2022-03-23 15:19:52 +03:00
Jelte Fennema	3a44fa827a	Add versions of forboth that don't need ListCell (#5856 ) We've had custom versions of Postgres its `foreach` macro which with a hidden ListCell for quite some time now. People like these custom macros, because they are easier to use and require less boilerplate. This adds similar custom versions of Postgres its `forboth` macro. Now you don't need ListCells anymore when looping over two lists at the same time.	2022-03-23 14:50:36 +03:00
Ahmet Gedemenli	b5448e43e3	Fix aggregate signature bug (#5854 )	2022-03-23 13:42:03 +03:00
Burak Velioglu	db9f0d926c	Add support for deparsing ALTER FUNCION ... SUPPORT ... commands	2022-03-22 21:55:55 +03:00
Onder Kalaci	af4ba3eb1f	Remove citus.enable_cte_inlining GUC In Postgres 12+, users can adjust whether to inline/not inline CTEs by [NOT] MATERIALIZED keywords. So, this GUC is already useless.	2022-03-22 17:14:44 +01:00
Halil Ozan Akgul	4690c42121	Fixes ALTER COLLATION encoding does not exist bug	2022-03-22 17:42:20 +03:00
Marco Slot	32c23c2775	Disallow re-partition joins when no hash function defined	2022-03-22 13:42:53 +01:00
Onur Tirtir	11433ed357	Create DDL job for create enum command in postprocess as we do for composite types Since now we don't throw an error for enums that user attempts creating in temp schema, the preprocess / DDL job that contains the prepared statement (to idempotently create the enum type) gets executed. As a result, we were emitting the following warning because of the error the underlying worker connection throws: ```sql WARNING: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx WARNING: connection to the remote node localhost:xxxxx failed with the following error: another command is already in progress ERROR: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx ```	2022-03-22 15:09:23 +03:00
Onur Tirtir	dc31102630	Locally create objects having a dependency that we cannot distribute We were already doing so for functions & types believing that this cannot be the case for other object types. However, as in #5830, we cannot distribute an object that user attempts creating in temp schema. Even more, this doesn't only apply to functions and types but also to many other object types. So with this commit, we teach preprocess/postprocess functions (that need to create dependencies on worker nodes) how to skip trying to distribute such objects. We also start identifying temp schemas as the objects that we don't know how to propagate to worker nodes so that we can simply create objects locally if user attempts creating them in a temp schema. There are 36 callers of `EnsureDependenciesExistOnAllNodes` in the codebase atm and for the most we still need to throw a hard error (i.e.: not use `DeferErrorIfHasUnsupportedDependency` beforehand), such as: i) user explicitly wants to create a distributed object * CreateCitusLocalTable * CreateDistributedTable * master_create_worker_shards * master_create_empty_shard * create_distributed_function * EnsureExtensionFunctionCanBeDistributed ii) we don't want to skip altering distributed table on worker nodes * PostprocessIndexStmt * PostprocessCreateTriggerStmt * PostprocessCreateStatisticsStmt iii) object is already distributed / handled by Citus before, so we aren't okay with not propagating the ALTER command * PostprocessAlterTableSchemaStmt * PostprocessAlterCollationOwnerStmt * PostprocessAlterCollationSchemaStmt * PostprocessAlterDatabaseOwnerStmt * PostprocessAlterExtensionSchemaStmt * PostprocessAlterFunctionOwnerStmt * PostprocessAlterFunctionSchemaStmt * PostprocessAlterSequenceOwnerStmt * PostprocessAlterSequenceSchemaStmt * PostprocessAlterStatisticsSchemaStmt * PostprocessAlterStatisticsOwnerStmt * PostprocessAlterTextSearchConfigurationSchemaStmt * PostprocessAlterTextSearchDictionarySchemaStmt * PostprocessAlterTextSearchConfigurationOwnerStmt * PostprocessAlterTextSearchDictionaryOwnerStmt * PostprocessAlterTypeSchemaStmt * PostprocessAlterForeignServerOwnerStmt iv) we already cannot create those objects in temp schemas, so skipping for now * PostprocessCreateExtensionStmt * PostprocessCreateForeignServerStmt Also note that there are 3 more callers of `EnsureDependenciesExistOnAllNodes` in enterprise in addition to those 36 but we don't need to do anything specific about them due to the same reasoning given in iii).	2022-03-22 15:09:23 +03:00
Halil Ozan Akgul	50bace9cfb	Fixes the type names that start with underscore bug	2022-03-22 14:24:30 +03:00
Halil Ozan Akgul	4dbc760603	Introduces citus_coordinator_node_id	2022-03-22 10:34:22 +03:00
Hanefi Onaldi	9f204600af	Allow all possible option types for text search objects (#5838 )	2022-03-21 20:01:53 +01:00
Halil Ozan Akgül	6c05e4b35c	Add check_mx to operations schedule (#5818 )	2022-03-21 19:09:26 +03:00
Burak Velioglu	d4625ec6a1	Add support for zero-argument polymorphic aggregates	2022-03-21 16:10:40 +03:00
Ahmet Gedemenli	46c6630328	Qualify CREATE AGGREGATE stmts in Preprocess (#5834 )	2022-03-21 13:55:09 +03:00
Burak Velioglu	2c2064bf36	Create type locally if it has undistributable dependency	2022-03-18 18:23:32 +03:00
Marco Slot	055bbd6212	Use coordinated transaction when there are multiple queries per task	2022-03-18 15:04:27 +01:00
Marco Slot	cab243218d	Avoid locks in relation_is_a_known_shard	2022-03-18 14:37:39 +01:00
Marco Slot	5bb5359da0	Fix worker node version check	2022-03-17 13:23:02 +01:00
Marco Slot	22a18fc1f2	Fix typo in upgrade function	2022-03-17 13:23:02 +01:00
Jelte Fennema	68bfc8d1c0	Use good initdb options in arbitrary configs tests (#5802 ) In `pg_regress_multi.pl` we're running `initdb` with some options that the `common.py` `initdb` is currently not using. All these flags seem reasonable, so this brings `common.py` in line with `pg_regress_multi.pl`. In passing change the `--nosync` flag to `--no-sync`, since that's what the PG documentation lists as the official option name (but both work).	2022-03-17 13:22:23 +01:00
Jelte Fennema	b0e406a478	Disable ddl propagation when creating users in arbitrary config tests (#5814 ) This should help with failing enterprise tests.	2022-03-16 15:12:20 +01:00
Ahmet Gedemenli	eddfea18c2	Fix role creation issue on schema tests (#5812 )	2022-03-16 13:49:28 +01:00
Burak Velioglu	333c73a53c	Drop distributed table on worker with ProcessUtilityParseTree	2022-03-15 17:42:01 +03:00
Gledis Zeneli	56ab64b747	Patches #5758 with some more error checks (#5804 ) Add error checks to detect failed connection and don't ping secondary nodes to detect self reference.	2022-03-15 15:02:47 +03:00
Hanefi Onaldi	c0cd8f3d56	Wait until metadata sync before testing distributed sequences	2022-03-15 10:28:51 +01:00
Marco Slot	e42a798707	Always use RowShareLock in pg_dist_node when syncing metadata	2022-03-15 10:28:51 +01:00
Ahmet Gedemenli	36b33e2491	Add sequence tests to arbitrary config (#5771 ) Add sequence tests to arbitrary config (#5771)	2022-03-14 19:16:24 +03:00
Jelte Fennema	41c6393e82	Parallelize cluster setup in arbitrary config tests (#5738 ) Cluster setup time is significant in arbitrary configs. We can parallelize this a bit more. Runtime of the following command decreases from ~25 seconds to ~22 seconds on my machine with this change: ``` make -C src/test/regress/ check-arbitrary-base CONFIGS=CitusDefaultClusterConfig EXTRA_TESTS=prepared_statements_1 ``` Currently we can only run different configs in parallel. However, when working on a feature or trying to fix a bug this is not important. In those cases you simply want to run a single test file on a single config. And you want to run that every time you made a change to the code that you think fixes the issue. This PR allows parallelising running of bash commands. So `initdb` and `pg_ctl start` is run in parallel for all nodes in the cluster. Instead of one waiting for the other. When you run the above command nothing is being run in parallel. After this PR, cluster setup is being run in parallel.	2022-03-14 16:42:20 +01:00
Jelte Fennema	5063257252	Disable fsync in arbitrary config tests (#5800 ) We have fsync enabled for regular tests already in `pg_regress_multi.pl`. This does the same for the arbitrary config tests. On my machine this changes the runtime from the following command from ~37 to ~25 seconds: ```bash make -C src/test/regress/ check-arbitrary-configs CONFIGS=CitusDefaultClusterConfig ```	2022-03-14 18:12:38 +03:00
Onder Kalaci	338752d96e	Guard against hard wait event set errors Similar to https://github.com/citusdata/citus/pull/5158, but this time instead of the executor, use this in all the remaining places.	2022-03-14 14:35:56 +01:00
Onder Kalaci	953951007c	Move wait event error checks to connection manager	2022-03-14 14:35:56 +01:00
Onur Tirtir	216b9b5b7a	Fix an incorrect error message related with fkeys between replicated dist tables (#5796 ) This is not supported in enterprise too.	2022-03-14 14:34:09 +01:00
Hanefi Onaldi	b24e1dfccc	Propagate text search commands to all worker nodes (#5797 ) Here is a list of some functions, and the `TargetWorkerSet` parameters they supply to `NodeDDLTaskList`: PostprocessCreateTextSearchConfigurationStmt - NON_COORDINATOR_NODES PreprocessDropTextSearchConfigurationStmt - NON_COORDINATOR_METADATA_NODES PreprocessAlterTextSearchConfigurationSchemaStmt - NON_COORDINATOR_METADATA_NODES I guess this means that, if metadata syncing is disabled on the node, we may have some issues. Consider the following: Let's assume the user has metadata syncing disabled. 2 workers. `CREATE TEXT SEARCH CONFIGURATION ...` will get propagated to all workers. `ALTER ... CONFIGURATION ...` will not get propagated to workers. After adding a new non-metadata node, the new node will get the altered configuration as it reads from catalog. At this point CONFIGURATION definitions got diverged in the cluster. I suggest that we always use `NON_COORDINATOR_METADATA_NODES` in all the TEXT SEARCH operations here.	2022-03-14 14:44:34 +03:00
Onder Kalaci	db529facab	Only change the sequence types if the target column type is a supported sequence type Before this commit, we erroneously converted the sequence type to the column's type it is used. However, it is possible that the sequence is used in an expression which then converted to a type that cannot be a sequence, such as text. With this commit, we only try this conversion if the column type is a supported sequence type (e.g., smallint, int and bigint). Note that we do this conversion because if the column type is a bigint and the sequence is NOT a bigint, users would be in trouble because sequences would generate values that are out of the range of the column. (The other ways are already not supported such as the column is int and the sequence is bigint would fail on the worker.) In other words, with this commit, we scope this optimization only when the target column type is a supported sequence type. Otherwise, we let users to more freely use the sequences.	2022-03-11 16:06:00 +01:00
Halil Ozan Akgül	37fafd007c	Turn metadata sync on in isolation_update_node and isolation_update_node_lock_writes tests (#5779 )	2022-03-11 16:39:20 +03:00
Ahmet Gedemenli	d06146360d	Support GRANT ON SCHEMA commands in CREATE SCHEMA statements (#5789 ) * Support GRANT ON SCHEMA commands in CREATE SCHEMA statements * Add test * add comment * Rename to GetGrantCommandsFromCreateSchemaStmt	2022-03-11 14:47:45 +03:00
Jelte Fennema	e5d5c7be93	Start erroring out for unsupported lateral subqueries (#5753 ) With the introduction of #4385 we inadvertently started allowing and pushing down certain lateral subqueries that were unsafe to push down. To be precise the type of LATERAL subqueries that is unsafe to push down has all of the following properties: 1. The lateral subquery contains some non recurring tuples 2. The lateral subquery references a recurring tuple from outside of the subquery (recurringRelids) 3. The lateral subquery requires a merge step (e.g. a LIMIT) 4. The reference to the recurring tuple should be something else than an equality check on the distribution column, e.g. equality on a non distribution column. Property number four is considered both hard to detect and probably not used very often. Thus this PR ignores property number four and causes query planning to error out if the first three properties hold. Fixes #5327	2022-03-11 11:59:18 +01:00
Halil Ozan Akgül	c9913b135c	Turn metadata sync on in isolation_ref2ref_foreign_keys test (#5791 )	2022-03-11 13:30:11 +03:00
Halil Ozan Akgül	2edaf0971c	Turn metadata sync on in isolation reference copy vs all (#5790 ) * Turn metadata sync on in isolation_reference_copy_vs_all test * Update the output of isolation_reference_copy_vs_all test	2022-03-11 11:27:46 +03:00
Hanefi Onaldi	b0eb685101	Add support for TEXT SEARCH DICTIONARY objects TEXT SEARCH DICTIONARY objects depend on TEXT SEARCH TEMPLATE objects. Since we do not yet support distributed TS TEMPLATE objects, we skip dependency checks for text search templates, similar to what we do for roles. The user is expected to manually create the TEXT SEARCH TEMPLATE objects before a) adding new nodes, b) creating TEXT SEARCH DICTIONARY objects.	2022-03-11 03:40:20 +03:00
Marco Slot	49467e27e6	Ensure worker_save_query_explain_analyze always fully qualifies types (#5776 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-03-10 07:30:11 -08:00
Gledis Zeneli	2cb02bfb56	Fix node adding itself with citus_add_node leading to deadlock (Fix #5720 ) (#5758 ) If a worker node is being added, a command is sent to get the server_id of the worker from the pg_dist_node_metadata table. If the worker's id is the same as the node executing the code, we will know the node is trying to add itself. If the node tries to add itself without specifying `groupid:=0` the operation will result in an error.	2022-03-10 17:46:33 +03:00
Burak Velioglu	547f6b18ef	Ensure dependencies exists for all alter owner commands	2022-03-10 16:37:55 +03:00
Ahmet Gedemenli	4312486141	Remove unnecessary schema name from CREATE SCHEMA stmts (#5785 )	2022-03-10 15:19:14 +03:00
Hanefi Onaldi	d153c2de0d	Fix some typos in comments	2022-03-10 15:03:26 +03:00
Ahmet Gedemenli	551a7d1383	Support CREATE SCHEMA without name (#5782 )	2022-03-10 13:38:00 +03:00
Marco Slot	8e43c8094d	Fix CREATE EXTENSION propagation with custom version	2022-03-09 17:40:50 +01:00
Marco Slot	7559ad12ba	Change create_object_propagation default to immediate	2022-03-09 17:40:50 +01:00
Burak Velioglu	bbe1b16125	Check whether the object has unsupported or circular dependency	2022-03-09 16:37:53 +03:00
Jelte Fennema	c8839de68b	Don't use cascading deletes in Citus 11 migration script (#5767 ) Using CASCADE in a DELETE can inadvertently delete things we don't intend to. It's safer to fail hard and make the user delete depending things manually.	2022-03-09 14:35:23 +01:00
Halil Ozan Akgül	333bcc7948	Global PID Helper Functions (#5768 ) * Introduces citus_nodename_for_nodeid and citus_nodeport_for_nodeid functions * Introduces citus_nodeid_for_gpid and citus_pid_for_gpid functions * Add tests	2022-03-09 13:15:59 +03:00
Ahmet Gedemenli	264cf78842	Disable use_citus_managed_tables for Postgres config (#5773 )	2022-03-08 17:13:49 +03:00
Onder Kalaci	c32b2de1a7	Improve citus_lock_waits 1) Remove useless columns 2) Show backends that are blocked on a DDL even before gpid is assigned 3) One minor bugfix, where we clear distributedCommandOriginator properly.	2022-03-07 11:10:44 +01:00
Ahmet Gedemenli	2a3c0c1914	Revert upgrade script changes (#5757 )	2022-03-07 13:04:58 +03:00
Onder Kalaci	24fcd2a88c	Handle dropping the partitioned tables properly Before this commit, we might be leaving some metadata on the workers. Now, we handle DROP SCHEMA .. CASCADE properly to avoid any metadata leakage.	2022-03-07 10:02:54 +01:00
Nils Dijk	3801576dfb	Move pg_dist_object to pg_catalog (#5765 ) DESCRIPTION: Move pg_dist_object to pg_catalog Historically `pg_dist_object` had been created in the `citus` schema as an experiment to understand if we could move our catalog tables to a branded schema. We quickly realised that this interfered with the UX on our managed services and other environments, where users connected via a user with the name of `citus`. By default postgres put the username on the search_path. To be able to read the catalog in the `citus` schema we would need to grant access permissions to the schema. This caused newly created objects like tables etc, to default to this schema for creation. This failed due to the write permissions to that schema. With this change we move the `pg_dist_object` catalog table to the `pg_catalog` schema, where our other schema's are also located. This makes the catalog table visible and readable by any user, like our other catalog tables, for debugging purposes. Note: due to the change of schema, we had to disable 1 test that was running into a discrepancy between the schema and binary. Secondly, we needed to make the lookup functions for the `pg_dist_object` relation and their indexes less strict on the fallback of the naming due to an other test that, due to an unfortunate cache invalidation, needed to lookup the relation again. This makes that we won't default to _only_ resolving from `pg_catalog` outside of upgrades.	2022-03-04 17:40:38 +00:00
Halil Ozan Akgul	0500a62515	Updates citus_dist_stat_activity to use citus_stat_activity	2022-03-04 17:28:17 +03:00
Ahmet Gedemenli	b8eedcd261	Notice when create_distributed_function called without params (#5752 ) * Notice when create_distributed_function called without params * Move variable comments to top * Add valid check for cache entry * add objtype to notice msg * update test outputs * Add more tests * Address feedback	2022-03-04 17:26:39 +03:00
Önder Kalacı	bd6a6563ff	Merge branch 'master' into calculate_gpid	2022-03-04 11:34:12 +01:00
Burak Velioglu	cb6d67a9a9	Make sure that all dependencies of citus tables can be distributed	2022-03-03 20:08:09 +03:00
Onder Kalaci	c7b67ba0ea	Add citus_backend_gpid() And also citus_calculate_gpid(nodeId,pid). These UDFs are just wrappers for the existing functions. Useful for testing and simple manipulation of citus_stat_activity.	2022-03-03 15:29:40 +01:00
Halil Ozan Akgul	06a0509b1a	Introduces citus_stat_activity view	2022-03-03 16:19:20 +03:00
Marco Slot	ddf7cf29f3	Sync pg_dist_colocation as a batch	2022-03-03 12:48:48 +01:00
Marco Slot	3ba61244b8	Synchronize pg_dist_colocation metadata	2022-03-03 11:01:59 +01:00
Marco Slot	43e4dd3808	Add a citus.internal_reserved_connections setting	2022-03-02 19:13:53 +01:00

1 2 3 4 5 ...

3826 Commits (cc694b6bcfb13d02aa00ba467acf356b61d612a1)