citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	c7b67ba0ea	Add citus_backend_gpid() And also citus_calculate_gpid(nodeId,pid). These UDFs are just wrappers for the existing functions. Useful for testing and simple manipulation of citus_stat_activity.	2022-03-03 15:29:40 +01:00
Halil Ozan Akgul	06a0509b1a	Introduces citus_stat_activity view	2022-03-03 16:19:20 +03:00
Marco Slot	ddf7cf29f3	Sync pg_dist_colocation as a batch	2022-03-03 12:48:48 +01:00
Marco Slot	3ba61244b8	Synchronize pg_dist_colocation metadata	2022-03-03 11:01:59 +01:00
Marco Slot	43e4dd3808	Add a citus.internal_reserved_connections setting	2022-03-02 19:13:53 +01:00
Onder Kalaci	e80a36c4b6	Improve visibility rules for non-priviledge roles It seems like our approach is way too restrictive and some places are wrong. Now, we follow very similar approach to pg_stat_activity. Some of the changes are pre-requsite for implementing citus_dist_stat_activity via citus_stat_activity.	2022-03-02 18:04:01 +01:00
Onder Kalaci	35ec9721b4	Add a new API for enabling Citus MX for clusters upgrading from earlier versions Clusters created pre-Citus 11 mostly didn't have metadata sync enabled. For those clusters, we add a utility UDF which fixes some minor issues and sync the necessary objects to the workers.	2022-03-02 17:02:55 +01:00
Onder Kalaci	98751058a9	Add Primary key to the table Otherwise enterprise tests fail	2022-03-02 12:03:59 +01:00
Marco Slot	dcfbb51b6b	Revert "Build Columnar.so and make Citus depends on it (#5661 )" This reverts commit `a4133c69e8`.	2022-03-02 11:33:15 +01:00
Ahmet Gedemenli	e1809af376	Propagate CREATE AGGREGATE commands	2022-03-02 10:52:43 +03:00
Onder Kalaci	b79a0052a4	Drop function in the tests on a never version As dropping the function now relies on pg_dist_object, which exists with 9.0+	2022-03-02 08:45:35 +01:00
ywj	a4133c69e8	Build Columnar.so and make Citus depends on it (#5661 ) * [Columnar] Build columnar.so and let citus depends on it Co-authored-by: Yanwen Jin <yanwjin@microsoft.com> Co-authored-by: Ying Xu <32597660+yxu2162@users.noreply.github.com> Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-03-01 23:31:14 +03:00
Nils Dijk	65bd540943	Feature: configure object propagation behaviour in transactions (#5724 ) DESCRIPTION: Add GUC to control ddl creation behaviour in transactions Historically we would _not_ propagate objects when we are in a transaction block. Creation of distributed tables would not always work in sequential mode, hence objects created in the same transaction as distributing a table that would use the just created object wouldn't work. The benefit was that the user could still benefit from parallelism. Now that the creation of distributed tables is supported in sequential mode it would make sense for users to force transactional consistency of ddl commands for distributed tables. A transaction could switch more aggressively to sequential mode when creating new objects in a transaction. We don't change the default behaviour just yet. Also, many objects would not even propagate their creation when the transaction was already set to sequential, leaving the probability of a self deadlock. The new policy checks solve this discrepancy between objects as well.	2022-03-01 17:29:31 +03:00
Burak Velioglu	f17872aed4	Expand functions while resolving dependencies	2022-03-01 17:08:46 +03:00
Gledis Zeneli	b825232ecb	Handle rebalance / replication when a node is disabled (Fix #5664 ) (#5729 ) The issue in question is caused when rebalance / replication call `FullShardPlacementList` which returns all shard placements (including those in disabled nodes with `citus_disable_node`). Eventually, `FindFillStateForPlacement` looks for the state across active workers and fails to find a state for the placements which are in the disabled workers causing a seg fault shortly after. Approach: * `ActivePlacementHash` was not using the status of the shard placement's node to determine if the node it is active. Initially, I just fixed that. * Additionally, I refactored the code which handles active shards in replication / rebalance to: * use a single function to determine if a shard placement is active. * do the shard active shard filtering before calling `RebalancePlacementUpdates` and `ReplicationPlacementUpdates`, so test methods like `shard_placement_rebalance_array` and `shard_placement_replication_array` which have different shard placement active requirements can do their own filtering while using the same rebalance / replicate logic that `rebalance_table_shards` and `replicate_table_shards` use. Fix #5664	2022-02-25 19:54:30 +03:00
Hanefi Onaldi	6c25eea62f	Fix some typos in comments	2022-02-24 19:48:52 +03:00
Onder Kalaci	df95d59e33	Drop support for CitusInitiatedBackend CitusInitiatedBackend was a pre-mature implemenation of the whole GlobalPID infrastructure. We used it to track whether any individual query is triggered by Citus or not. As of now, after GlobalPID is already in place, we don't need CitusInitiatedBackend, in fact it could even be wrong.	2022-02-24 12:12:43 +01:00
Marco Slot	0c4e3cb69c	Drop worker_partition_query_result on downgrade	2022-02-24 10:18:56 +01:00
Hanefi Onaldi	7bd6c2c9ac	Isolation tests for various ddl operations and metadata sync	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	f4e8af2c22	Do not acquire locks on node metadata explicitly	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	b70949ae8c	Lock nodes when building ddl task lists	2022-02-24 03:19:56 +03:00
Marco Slot	ef1ceb3953	Only use a single placement for map tasks	2022-02-23 19:40:21 +01:00
Marco Slot	8de802eec5	Enable local_shared_pool_size 5 in arbitrary configs test	2022-02-23 19:40:21 +01:00
Marco Slot	490765a754	Enable re-partition joins after local execution	2022-02-23 19:40:21 +01:00
Marco Slot	3cd9aa655a	Stop using citus.binary_worker_copy_format	2022-02-23 19:40:21 +01:00
Marco Slot	5ac0d31e8b	Fix re-partition hash range generation	2022-02-23 19:40:21 +01:00
Marco Slot	72d8fde28b	Use intermediate results for re-partition joins	2022-02-23 19:40:21 +01:00
Nils Dijk	1fb970224e	Fix: partitioned index dependencies (#5741 ) #5685 introduced the resolution of dependencies for indices. This missed support for indices on partitioned tables. This change adds support for partitioned indices to the dependency resolution code.	2022-02-23 17:53:26 +03:00
Jelte Fennema	e1afd30263	Speed up test runs on WSL2 a lot (#5736 ) It turns out `whereis` is incredibly slow on WSL2 (at least on my machine): ``` $ time whereis diff diff: /usr/bin/diff /usr/share/man/man1/diff.1.gz real 0m0.408s user 0m0.010s sys 0m0.101s ``` This command is run by our custom `diff` script, which is run for every test file that is run. So this adds lots of unnecessary runtime time to tests. This changes our custom `diff` script to only call `whereis` in the strange case that `/usr/bin/diff` does not exist. The impact of this small change on the total runtime of the tests on WSL is huge. As an example the following command takes 18 seconds without this change and 7 seconds with it: ``` make -C src/test/regress/ check-arbitrary-configs CONFIGS=PostgresConfig ```	2022-02-23 13:03:29 +01:00
Ahmet Gedemenli	8b9402540f	Add use_citus_managed_tables to arbitrary configs (cherry picked from commit 4e93afd1f78854e1aaab63690c441b0b0598a82c) (cherry picked from commit `0295fe2f5b`) (cherry picked from commit 878510725fab9cb6870b4504e0b1f055d7bbc68d)	2022-02-22 11:39:30 +03:00
Teja Mupparti	a62901396b	Allow unsafe triggers via a GUC	2022-02-21 22:45:17 -08:00
Onder Kalaci	95d5918967	Properly set worker_query and use	2022-02-21 18:22:33 +01:00
Onder Kalaci	dffcafc096	Use global pids in citus_lock_waits	2022-02-21 17:46:34 +01:00
Onder Kalaci	331af3dce8	Dumping wait edges becomes optionally scan all backends Before this commit, dumping wait edges can only be used for distributed deadlock detection purposes. With this commit, we open the possibility that we can use it for any backend.	2022-02-21 17:37:07 +01:00
Halil Ozan Akgul	f6cd4d0f07	Overrides pg_cancel_backend and pg_terminate_backend to accept global pid	2022-02-21 16:41:35 +03:00
Ahmet Gedemenli	c1d5ca9896	Do distributed check first, for DropSchema stmts	2022-02-21 14:43:04 +03:00
Ahmet Gedemenli	28aa715ce2	Add test for citus local tables with dropped columns	2022-02-21 12:07:17 +03:00
Ahmet Gedemenli	2bc6a00408	Refactor CreateDistributedTable to take column name	2022-02-21 12:07:17 +03:00
yxu2162	8974b2de66	Copied CheckCitusVersion over to Columnar to handle dependency issue. If we split columnar into two extensions, this will later be changed tl CheckColumnarVersion.	2022-02-18 09:47:39 -08:00
Philip Dubé	3d044dc543	Merge branch 'master' into avoid-exceptional-control-flow-in-fluent-py	2022-02-18 16:10:45 +00:00
Burak Velioglu	fa6866ed36	Start to propagate functions to worker nodes with CREATE FUNCTION command together with it's dependencies. If the function depends on any nondistributable object, function will be created only locally. Parameterless version of create_distributed_function becomes obsolete with this change, it will deprecated from the code with a subsequent PR.	2022-02-18 13:56:51 +03:00
gledis69	a14fada153	Prevent Deadlocks When a Worker Tries to Create Collation (Fix #5583 ) * When a worker tried to create a collation which had a dependency in the same worker node, it would cause a deadlock, now it throws the correct "not a coordinator" error.	2022-02-18 12:28:02 +03:00
Teja Mupparti	46fa47beea	Force-delegated functions' distribution argument must be reset as soon as the routine completes execution, and not wait until the top level Executor ends. This fixes issue #5687	2022-02-17 10:48:30 -08:00
Philip Dubé	e4420a6252	fluent.py: prefer simpler return based control flow in _accept rather than relying on raising an exception	2022-02-17 13:30:17 +00:00
Nils Dijk	768b320470	reuse GetRoleSpecObjectForUser	2022-02-17 13:16:10 +01:00
Nils Dijk	ea86f9f94e	Add support for TEXT SEARCH CONFIGURATION objects (#5685 ) DESCRIPTION: Implement TEXT SEARCH CONFIGURATION propagation The change adds support to Citus for propagating TEXT SEARCH CONFIGURATION objects. TSConfig objects cannot always be created in one create statement, and instead require a create statement followed by many alter statements to get turned into the object they should represent. To support this we add functionality to the worker to create or replace objects based on a list of statements. When the lists of the local object and the remote object correspond 1:1 we skip the creation of the object and simply mark it distributed. This is especially important for TSConfig objects as initdb pre-populates databases with a dozen configurations (for many different languages). When the user creates a new TSConfig based on the copy of an existing configuration there is no direct link to the object copied from. Since there is no link we can't simply rely on propagating the dependencies to the worker and send a qualified	2022-02-17 13:12:46 +01:00
Hanefi Onaldi	ccc4cc6bf0	Move test in isolation schedule to prevent failure We check for metadata consistency across the cluster in the test isolation_metadata_sync_vs_all. However, some earlier tests in enterprise repo leave invalid pg_dist_node entries in the worker nodes that have Oid values for already dropped role objects. To remedy that, I suggest that we move the test to earlier in the schedule, thereby making the tests pass for the time being. We should later introduce metadata checking either in a new isolation test or by moving this test later in the schedule. However, we should do that after we fix the underlying issue.	2022-02-17 13:15:21 +03:00
Ahmet Gedemenli	a1c3580c64	Support TRUNCATE for foreign tables	2022-02-17 09:59:53 +03:00
Onder Kalaci	abd5b1c506	Prevent any monitoring view/udf to show already exited backends The low-level StoreAllActiveTransactions() function filters out backends that exited. Before this commit, if you run a pgbench, after that you'd still see the backends show up: ```SQL select count() from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 538 │ └───────┘ ``` After this patch, only active backends show-up: ```SQL select count() from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 72 │ └───────┘ ```	2022-02-14 17:34:32 +01:00
Ahmet Gedemenli	0411a98c99	Refactor EnsureSequentialMode functions (#5704 )	2022-02-14 18:38:21 +03:00
Gledis Zeneli	badfd561b2	Prevent Citus table functions from being called on shards (Fix #5610 ) (#5694 ) DESCRIPTION: Prevent Citus table functions from being called on shards The operations that guard against using shards are: * Create Local Table * Create distributed table (which affects reference table creation as well). * I used a `ErrorIfRaltionIsKnownShard` instead of `ErrorIfIllegallyChangingKnownShard`. `ErrorIfIllegallyChangingKnownShard` allows the operation if `citus.enable_manual_changes_to_shards`, but I am not sure if it ever makes sense to create a distributed, reference, or citus local table out of a shard. I tried to go over the code to identify other UDF-s where shards could be illegaly changed, but I could not find any other. My knowledge of the codebase is not solid enough for me to say for sure. Fixes #5610	2022-02-14 16:06:48 +03:00
Hanefi Onaldi	2e5ca8ba2b	Add isolation tests for metadata sync vs all This commit introduces several test cases for concurrent operations that change metadata, and a concurrent metadata sync operation. The overall structure is as follows: - Session#1 starts metadata syncing in a transaction block - Session#2 does an operation that change metadata - Both sessions are committed - Another session checks whether the metadata are the same accross all nodes in the cluster.	2022-02-11 01:55:04 +03:00
Önder Kalacı	dc6c194916	Show IDLE backends in citus_dist_stat_activity (#5700 ) * Break the dependency to CitusInitiatedBackend infrastructure With this change, we start to show non-distributed backends as well in citus_dist_stat_activity. I think that (a) it is essential for making citus_lock_waits to work for blocked on DDL commands. (b) it is more expected from the user's perspective. The name of the view is a little inconsistent now (e.g., citus_dist_stat_activity) but we are already planning to improve the names with followup PRs. Also, we have global pids assigned, the CitusInitiatedBackend becomes obsolete.	2022-02-10 08:59:28 -08:00
Ahmet Gedemenli	76b63a307b	Propagate create/drop schema commands	2022-02-10 14:58:09 +03:00
Marco Slot	d0711ea9b4	Delegate function calls in FROM outside of transaction block	2022-02-09 20:56:25 +01:00
Onder Kalaci	1c30f61a70	Prevent citus.node_conninfo to use "application_name" With https://github.com/citusdata/citus/pull/5657, Citus uses a fixed application_name while connecting to remote nodes for internal purposes. It means that we cannot allow users to override it via citus.node_conninfo.	2022-02-09 13:22:04 +01:00
Teja Mupparti	1e3c8e34c0	Allow create_distributed_function() on a function owned by an extension Implement #5649 Allow create_distributed_function() on functions owned by extensions 1) Only update pg_dist_object, and do not propagate CREATE FUNCTION. 2) Ensure corresponding extension is in pg_dist_object. 3) Verify if dependencies exist on the function they should resolve to the extension. 4) Impact on node-scaling: We build a list of ddl commands based on all objects in pg_dist_object. We need to omit the ddl's for the extension-function, as it will get propagated by the virtue of the extension creation. 5) Extra checks for functions coming from extensions, to not propagate changes via ddl commands, even though the function is marked as distributed in pg_dist_object	2022-02-08 11:52:56 -08:00
Halil Ozan Akgul	8ee02b29d0	Introduce global PID	2022-02-08 16:49:38 +03:00
Burak Velioglu	0a70b78bf5	Add test for dist type	2022-02-07 17:50:49 +03:00
Burak Velioglu	c0aece64d0	Add test for checking distributed extension function	2022-02-07 17:50:48 +03:00
Burak Velioglu	ab248c1785	Check object ownership while creating pg_dist_object entries on remote	2022-02-07 17:50:48 +03:00
Burak Velioglu	8ae7577581	Use superuser connection while syncing dependent objects' pg_dist_object tuples	2022-02-07 17:50:45 +03:00
Marco Slot	872f0a79db	Remove random shard placement policy	2022-02-06 21:55:58 +01:00
Marco Slot	0cae8e7d6b	Remove local-node-first shard placement	2022-02-06 21:36:34 +01:00
Teja Mupparti	c8e504dd69	Fix the issue #5673 If the expression is simple, such as, SELECT function() or PEFORM function() in PL/PgSQL code, PL engine does a simple expression evaluation which can't interpret the Citus CustomScan Node. Code checks for simple expressions when executing an UDF but missed the DO-Block scenario, this commit fixes it.	2022-02-04 15:44:53 -08:00
Ying Xu	b5c116449b	Removed dependency from EnsureTableOwner (#5676 ) Removed dependency for EnsureTableOwner. Also removed pg_fini() and columnar_tableam_finish() Still need to remove CheckCitusVersion dependency to make Columnar_tableam.h dependency free from Citus.	2022-02-04 12:45:07 -08:00
Onur Tirtir	79442df1b7	Fix coordinator/worker query targetlists for agg. that we cannot push-down (#5679 ) Previously, we were wrapping targetlist nodes with Vars that reference to the result of the worker query, if the node itself is not `Const` or not a `Param`. Indeed, we should not do that unless the node itself is a `Var` node or contains a `Var` within it (e.g.: `OpExpr(Var(column_a) > 2)`). Otherwise, when worker query returns empty result set, then combine query exec would crash since the `Var` would be pointing to an empty tuple slot, which is not desirable for the node-executor methods.	2022-02-04 05:37:25 -08:00
Onder Kalaci	72d7d92611	Apply code review feedback	2022-02-04 10:52:57 +01:00
Onder Kalaci	923bb194a4	Move isolation_multiuser_locking to MX tests	2022-02-04 10:52:57 +01:00
Onder Kalaci	bcb00e3318	remove not used files	2022-02-04 10:52:57 +01:00
Onder Kalaci	ff234fbfd2	Unify old GUCs into a single one Replaces citus.enable_object_propagation with citus.enable_metadata_sync Also, within Citus 11 release cycle, we added citus.enable_metadata_sync_by_default, that is also replaced with citus.enable_metadata_sync. In essence, when citus.enable_metadata_sync is set to true, all the objects and the metadata is send to the remote node. We strongly advice that the users never changes the value of this GUC.	2022-02-04 10:52:56 +01:00
Teja Mupparti	f31bce5b48	Fixes the issue seen in https://github.com/citusdata/citus-enterprise/issues/745 With this commit, rebalancer backends are identified by application_name = citus_rebalancer and the regular internal backends are identified by application_name = citus_internal	2022-02-03 09:40:46 -08:00
jeff-davis	b072b9235e	Columnar: fix checksums, broken in `a4067913`. (#5669 ) Checksums must be set directly before writing the page. log_newpage() sets the page LSN, and therefore invalidates the checksum.	2022-02-02 13:22:11 -08:00
Onder Kalaci	650243927c	Relax some transactional limications on activate node We already enforce EnsureSequentialModeMetadataOperations(), and given that all activate node is transaction, we should be fine	2022-02-01 15:56:55 +01:00
Onder Kalaci	34d91009ed	Update outdated comment As of the current HEAD, we support sequences as first class objects	2022-02-01 15:37:10 +01:00
Marco Slot	63c6896716	Enable function call pushdown from workers	2022-02-01 14:13:25 +01:00
Önder Kalacı	f712dfc558	Add tests coverage (#5672 ) For extension owned tables with sequences	2022-02-01 15:39:52 +03:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Önder Kalacı	f68ac4a7cf	Consider foreign keys between reference tables (#5659 ) On #5071, we avoid edge cases, but below there are foreign key constraints as well This commit makes sure we cover those as well	2022-01-28 13:38:14 +01:00
Heikki Linnakangas	a40679139b	Use smgrextend() when extending relation, and WAL-log first. (#5654 ) When creating a new table, we bypass the buffer cache and write the initial pages directly with smgrwrite(). However, you're supposed to use smgrextend() when extending a relation, rather than smgrwrite(). There isn't much difference between them, but smgrextend() updates the relation size cache, which seems important, although I haven't seen any real bugs caused by that. Also, write the block to disk only after WAL-logging it, so that we can include the LSN of the WAL record in the version that we write out. Currently, the page as written to disk has LSN 0. That doesn't cause any user-visible issues either, at worst it could make us WAL-log a full page image of the page earlier than necessary, but that doesn't matter currently because we WAL-log full page images of all changes anyway. I bumped into that issue with LSN 0 in the page header when testing Citus with Zenith (https://github.com/zenithdb/zenith/issues/1176). Zenith contains a check that PANICs if you write a block to disk without WAL-logging it, and it works by checking the LSN of the page that's written out. In this case, we are WAL-logging the page even though the LSN on the page is 0, so it was a false alarm, but I'd love to get this changed in Citus to keep the check in Zenith simple. A downside of WAL-logging the page first is that if you run out of disk space, you have already created the WAL record. So if you then crash and restart, WAL recovery will likely run out of disk space, too, which is bad. In practice, we have the same problem in other places, like rewriteheap.c. Also, if you are on the brink of running out of disk space, you will probably run out at WAL replay anyway, regardless of which order we write these few pages. But if we wanted to fix that, we could first extend the relation with zeros, and then WAL-log the pages. That's how heap extension works. It would be even nicer to use the buffer cache for this, and skip the smgrimmedsync() on the relation. However, that would require more work, because we don't have the Relation struct for the relation here. We could use ReadBufferWithoutRelcache(), but that doesn't work for unlogged tables. Unlogged tables are currently not supported (https://github.com/citusdata/citus/issues/4742), but that would become a problem if we want to support them in the future. CreateFakeRelcacheEntry() also doesn't work with unlogged tables. We could do things differently for logged and unlogged tables, but that complicates the code further. Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-01-27 12:04:08 -08:00
Onder Kalaci	303540e494	Add PGAPPNAME env. variable to arbitrary configs	2022-01-27 11:00:15 +01:00
Onder Kalaci	b26eeaecd3	Use a fixed application_name while connecting to remote nodes Citus heavily relies on application_name, see `IsCitusInitiatedRemoteBackend()`. But if the user set the application name, such as export PGAPPNAME=test_name, Citus uses that name while connecting to the remote node. With this commit, we ensure that Citus always connects with the "citus" user name to the remote nodes.	2022-01-27 10:46:25 +01:00
Onder Kalaci	b9b419ef16	Allow creating distributed tables in sequential mode With https://github.com/citusdata/citus/pull/2780, we allow COPY to use any number of connections that the executor used in a tx block. Meaning that, while COPYing data to the shards, create_distributed_table could allow sequential mode.	2022-01-26 12:58:18 +01:00
Onur Tirtir	8c8d696621	Not fail over to local execution when it's not supported (#5625 ) We fall back to local execution if we cannot establish any more connections to local node. However, we should not do that for the commands that we don't know how to execute locally (or we know we shouldn't execute locally). To fix that, we take localExecutionSupported take into account in CanFailoverPlacementExecutionToLocalExecution too. Moreover, we also prompt a more accurate hint message to inform user about whether the execution is failed because local execution is disabled by them, or because local execution wasn't possible for given command.	2022-01-25 16:43:21 +01:00
Onur Tirtir	ff3913ad99	Copy errmsg for distributed deadlock error into heap (#5641 ) multi_log_hook() hook is called by EmitErrorReport() when emitting the ereport either to frontend or to the server logs. And some callers of EmitErrorReport() (e.g.: errfinish()) seems to assume that string fields of given ErrorData object needs to be freed. For this reason, we copy the message into heap here. I don't think we have faced with such a problem before but it seems worth fixing as it is theoretically possible due to the reasoning above.	2022-01-24 06:27:41 -08:00
Ahmet Gedemenli	c838fb428f	Refactor GenerateGrantOnSchemaStmtForRights	2022-01-24 11:31:59 +03:00
Ahmet Gedemenli	e6fc0c6f36	Turn mx on for test: multi_colocation_utils	2022-01-21 19:31:47 +03:00
Onur Tirtir	4dc38e9e3d	Use EnsureCompatibleLocalExecutionState instead (#5640 )	2022-01-21 15:37:59 +01:00
Ahmet Gedemenli	8647682c11	Fix typo: taget/target	2022-01-21 10:35:56 +03:00
Onur Tirtir	181111b84f	Drop ruleutils copied for statistics	2022-01-20 17:28:19 +03:00
Onur Tirtir	7b59295af2	Drop ruleutils copied for triggers	2022-01-20 17:28:19 +03:00
Önder Kalacı	e8ba9dd9d3	Merge branch 'master' into make_minimal_work_again	2022-01-20 11:48:53 +01:00
Teja Mupparti	54862f8c22	(1) Functions will be delegated even when present in the scope of an explicit BEGIN/COMMIT transaction block or in a UDF calling another UDF. (2) Prohibit/Limit the delegated function not to do a 2PC (or any work on a remote connection). (3) Have a safety net to ensure the (2) i.e. we should block the connections from the delegated procedure or make sure that no 2PC happens on the node. (4) Such delegated functions are restricted to use only the distributed argument value. Note: To limit the scope of the project we are considering only Functions(not procedures) for the initial work. DESCRIPTION: Introduce a new flag "force_delegation" in create_distributed_function(), which will allow a function to be delegated in an explicit transaction block. Fixes #3265 Once the function is delegated to the worker, on that node during the planning distributed_planner() TryToDelegateFunctionCall() CheckDelegatedFunctionExecution() EnableInForceDelegatedFuncExecution() Save the distribution argument (Constant) ExecutorStart() CitusBeginScan() IsShardKeyValueAllowed() Ensure to not use non-distribution argument. ExecutorRun() AdaptiveExecutor() StartDistributedExecution() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the remoteTaskList. NonPushableInsertSelectExecScan() InitializeCopyShardState() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the placementList. This also fixes a minor issue: Properly handle expressions+parameters in distribution arguments	2022-01-19 16:43:33 -08:00
Onder Kalaci	7f30222c90	Fix check-minimal It seems like we broke check-minimal with the refactor on #5486 This commit fixes the minor issue	2022-01-19 16:21:59 +01:00
Ahmet Gedemenli	9e6ebe4826	Turn mx on for test file citus_local_tables, on multi-1 schedule	2022-01-19 13:55:51 +03:00
Onur Tirtir	4a53967bdd	Remove an outdated comment from RelationIsAKnownShard (#5629 )	2022-01-19 11:24:10 +01:00
Ahmet Gedemenli	37b3f50447	Turn mx on for multi-1 schedule (#5627 ) For test files: multi_generate_ddl_commands, multi_repair_shards, multi_create_shards, mixed_relkind_tests	2022-01-19 12:05:54 +03:00
Marco Slot	33bfa0b191	Hide shards from application_name's with a specific prefix	2022-01-18 15:20:55 +04:00
Onur Tirtir	d98500ac22	Fix a flaky test related with temp columnar table cleanup (#5599 ) Wait until old backend to expire to make sure that temp table cleanup is complete.	2022-01-17 09:26:30 -08:00
Ahmet Gedemenli	e564220dd5	Fix typo: GetRelationTriggerFunctionDependencyList (#5626 )	2022-01-17 18:17:07 +03:00
Ahmet Gedemenli	8936543b80	Create wrapper function CreateObjectAddressDependencyDefList (#5623 )	2022-01-17 15:35:40 +03:00
Ying Xu	4dca662e97	Making Columnar Dependency Free from Citus (#5622 ) * Removed distributed dependency in columnar_metadata.c * Changed columnar_debug.c so that it no longer needed distributed/tuplestore and made it return a record instead of a tuplestore * removed distributed/commands.h dependency * Made columnar_tableam.c dependency-free * Fixed spacing for columnar_store_memory_stats function * indentation fix * fixed test failures	2022-01-14 09:43:05 -08:00
Onur Tirtir	70d8e1fe97	Assert that we will create indexes on shards via local execution (#5620 )	2022-01-13 17:09:57 +01:00
Halil Ozan Akgul	63cd90e5dd	Add missing library to dependencies.c	2022-01-11 18:36:43 +03:00
Önder Kalacı	46ec7cd5cf	Enable MX for rebalancer tests	2022-01-11 12:07:39 +01:00
Önder Kalacı	885601c02c	Require superuser while activating a node (#5609 ) * Require superuser while activating a node With this change, we require ActiveNode() (hence citus_add_node(), citus_activate_node()) explicitly require for a superuser. Before this commit, these functions were designed to work with non-superuser roles with the relevent GRANTs given. However, that is not a widely used way for calling the functions above. Due to possibility of non-super user calling the UDFs, they were designed in a way that some commands were using some additional short-lived superuser connections. That is: (a) breaking transactional behavior (e.g., ROLLBACK wouldn't fully rollback the whole transaction) (b) Making it very complicated to reason about which parts of the node activation goes over which connections, and becoming vulnerable to deadlocks / visibility issues.	2022-01-10 08:30:13 -08:00
Onur Tirtir	3cc44ed8b3	Tell other backends it's safe to ignore the backend that concurrently built the shell table index (#5520 ) In addition to starting a new transaction, we also need to tell other backends --including the ones spawned for connections opened to localhost to build indexes on shards of this relation-- that concurrent index builds can safely ignore us. Normally, DefineIndex() only does that if index doesn't have any predicates (i.e.: where clause) and no index expressions at all. However, now that we already called standard process utility, index build on the shell table is finished anyway. The reason behind doing so is that we cannot guarantee not grabbing any snapshots via adaptive executor, and the backends creating indexes on local shards (if any) might block on waiting for current xact of the current backend to finish, which would cause self deadlocks that are not detectable.	2022-01-10 10:23:09 +03:00
Marco Slot	ee3b50b026	Disallow remote execution from queries on shards	2022-01-07 17:46:21 +01:00
Önder Kalacı	8d1b188620	Enable MX for the remaining failure tests (#5606 )	2022-01-07 17:24:31 +01:00
Ahmet Gedemenli	3c834e6693	Disable foreign distributed tables (#5605 ) * Disable foreign distributed tables * Add warning for existing distributed foreign tables	2022-01-07 18:12:23 +03:00
Onder Kalaci	7cb1d6ae06	Improve metadata connections With https://github.com/citusdata/citus/pull/5493 we introduced metadata specific connections. With this connection we guarantee that there is a single metadata connection. But note that this connection can be used for any other operation. In other words, this connection is not only reserved for metadata operations. However, as https://github.com/citusdata/citus-enterprise/issues/715 showed us that the logic has a flaw. We allowed ineligible connections to be picked as metadata connections: such as exclusively claimed connections or not fully initialized connections. With this commit, we make sure that we only consider eligable connections for metadata operations.	2022-01-07 10:36:32 +01:00
Onder Kalaci	9f2d9e1487	Move placement deletion from disable node to activate node We prefer the background daemon to only sync node metadata. That's why we move placement metadata changes from disable node to activate node. With that, we can make sure that disable node only changes node metadata, whereas activate node syncs all the metadata changes. In essence, we already expect all nodes to be up when a node is activated. So, this does not change the behavior much.	2022-01-07 09:56:03 +01:00
Hanefi Onaldi	9edfbe7718	Fix the default value for DeferShardDeleteOnMove The default for GUC citus.defer_drop_after_shard_move is true. However we initialize the global variable with a false value.	2022-01-07 11:01:49 +03:00
Ahmet Gedemenli	45e423136c	Support foreign tables in MX (#5461 )	2022-01-06 18:50:34 +03:00
Önder Kalacı	5305aa4246	Do not drop sequences when dropping metadata (#5584 ) Dropping sequences means we need to recreate and hence losing the sequence. With this commit, we keep the existing sequences such that resyncing wouldn't drop the sequence. We do that by breaking the dependency of the sequence from the table.	2022-01-06 09:48:34 +01:00
Önder Kalacı	8007adda25	Convert the function to a distributed function (#5596 ) so that when metadata is synced, the table is on the worker	2022-01-06 11:32:40 +03:00
Önder Kalacı	6d9218540b	Enable single node tests with Citus MX (#5595 ) * Enable single node tests with Citus MX The test already has comment on the changes	2022-01-05 16:00:44 +03:00
jeff-davis	2e03efd91e	Columnar: move DDL hooks to citus to remove dependency. (#5547 ) Add a new hook ColumnarTableSetOptions_hook so that citus can get control when the columnar table options change.	2022-01-04 23:26:46 -08:00
jeff-davis	c9292cfad1	Make pg_version_compat.h and listutils.c dependency-free. (#5548 ) Split distributed/version_compat.h into dependency-free pg_version_compat.h, and the original which still has dependencies. The original doesn't have much purpose, but until other files have better discipline about including the correct header files, then it's still needed. Also make distributed/listutils.h dependency-free. Should be moved outside of 'distributed' subdirectory, but that will cause significant code churn, so leave for another cleanup patch. Now both files can be included in columnar without creating a dependency on citus.	2022-01-04 23:02:08 -08:00
jeff-davis	1546aa0d9f	Columnar: use proper generic WAL interface. (#5543 ) Previously, we cheated by using the RM_GENERIC_ID record type, but not actually using the generic WAL API. This worked because we always took a full page image, and saved the extra work of allocating and copying to a temporary page. But it introduced complexity, and perhaps fragility, so better to just use the API properly. The performance penalty for a serial data load seems to be less than 1%.	2022-01-04 22:42:21 -08:00
Onder Kalaci	22b5175fd1	Make sure that the community and enterprise tests produce the same output	2022-01-04 13:30:31 +01:00
Önder Kalacı	0a8b0b06c6	Do not allow distributed functions on non-metadata synced nodes (#5586 ) Before this commit, Citus was triggering metadata syncing in the background when a function is distributed. However, with Citus 11, we expect all clusters to have metadata synced enabled. So, we do not expect any nodes not to have the metadata. This change: (a) pro: simplifies the code and opens up possibilities to simplify futher by reducing the scope of bg worker to only sync node metadata (b) pro: explicitly asks users to sync the metadata such that any unforseen impact can be easily detected (c) con: For distributed functions without distribution argument, we do not necessarily require the metadata sycned. However, for completeness and simplicity, we do so.	2022-01-04 13:12:57 +01:00
Halil Ozan Akgul	9547228e8d	Add isolation_check_mx test	2021-12-30 14:58:30 +03:00
Halil Ozan Akgul	aef2d83c7d	Fix metadata sync fails on multi_transaction_recovery	2021-12-29 11:21:32 +03:00
Önder Kalacı	d33650d1c1	Record if any partitioned Citus tables during upgrade (#5555 ) With Citus 11, the default behavior is to sync the metadata. However, partitioned tables created pre-Citus 11 might have index names that are not compatiable with metadata syncing. See https://github.com/citusdata/citus/issues/4962 for the details. With this commit, we record the existence of partitioned tables such that we can fix it later if any exists.	2021-12-27 03:33:34 -08:00
Halil Ozan Akgul	0c292a74f5	Fix metadata sync fails on multi_truncate	2021-12-27 13:54:53 +03:00
Önder Kalacı	c9127f921f	Avoid round trips while fixing index names (#5549 ) With this commit, fix_partition_shard_index_names() works significantly faster. For example, 32 shards, 365 partitions, 5 indexes drop from ~120 seconds to ~44 seconds 32 shards, 1095 partitions, 5 indexes drop from ~600 seconds to ~265 seconds `queryStringList` can be really long, because it may contain #partitions * #indexes entries. Before this change, we were actually going through the executor where each command in the query string triggers 1 round trip per entry in queryStringList. The aim of this commit is to avoid the round-trips by creating a single query string. I first simply tried sending `q1;q2;..;qn` . However, the executor is designed to handle `q1;q2;..;qn` type of query executions via the infrastructure mentioned above (e.g., by tracking the query indexes in the list and doing 1 statement per round trip). One another option could have been to change the executor such that only track the query index when `queryStringList` is provided not with queryString including multiple `;`s . That is (a) more work (b) could cause weird edge cases with failure handling (c) felt like coding a special case in to the executor	2021-12-27 10:29:37 +01:00
Halil Ozan Akgul	bb636e6a29	Fix metadata sync fails on multi_function_evaluation	2021-12-24 19:32:58 +03:00
Halil Ozan Akgul	70e68d5312	Fix metadata sync fails on multi_name_lengths	2021-12-24 14:33:32 +03:00
Halil Ozan Akgul	5c2fb06322	Fix metadata sync fails on multi_sequence_default	2021-12-24 14:33:32 +03:00
Halil Ozan Akgul	b9c06a6762	Turn metadata sync on in multi_metadata_sync	2021-12-24 10:58:13 +03:00
Hanefi Onaldi	479b2da740	Fix one flaky failure test	2021-12-23 20:11:45 +03:00
Ahmet Gedemenli	042d45b263	Propagate foreign server ops	2021-12-23 17:54:04 +03:00
Onur Tirtir	61b5fb1cfc	Run failure_test_helpers in base schedule (#5559 )	2021-12-23 12:54:12 +01:00
Talha Nisanci	e196d23854	Refactor AttributeEquivalenceId (#5006 )	2021-12-23 13:19:02 +03:00
Hanefi Onaldi	76176caea7	Fix typo s/exlusive/exclusive/	2021-12-23 01:35:01 +03:00
Hanefi Onaldi	1af8ca8f7c	Fix statical analysis findings (#5550 )	2021-12-22 18:16:11 +03:00
Ahmet Gedemenli	8e4ff34a2e	Do not include return table params in the function arg list (cherry picked from commit `90928cfd74`) Fix function signature generation Fix comment typo Add test for worker_create_or_replace_object Add test for recreating distributed functions with OUT/TABLE params Add test for recreating distributed function that returns setof int Fix test output Fix comment	2021-12-21 19:01:42 +03:00
Marco Slot	2eef71ccab	Propagate SET TRANSACTION commands	2021-12-18 11:31:39 +01:00
Halil Ozan Akgul	46f718c76d	Turn metadata sync on in add_coordinator, foreign_key_to_reference_table and replicate_reference_tables_to_coordinator	2021-12-17 16:33:25 +03:00
Halil Ozan Akgul	25755a7094	Turn ddl propagation off in worker on multi_copy	2021-12-17 15:54:20 +03:00
Onder Kalaci	fc98f83af2	Add citus.grep_remote_commands Simply applies ```SQL SELECT textlike(command, citus.grep_remote_commands) ``` And, if returns true, the command is logged. Else, the log is ignored. When citus.grep_remote_commands is empty string, all commands are logged.	2021-12-17 11:47:40 +01:00
Halil Ozan Akgul	df8d0f3db1	Turn metadata sync on in multi_replicate_reference_table and multi_citus_tools	2021-12-17 10:25:57 +03:00
Onur Tirtir	cc4c83b1e5	HAVE_LZ4 -> HAVE_CITUS_LZ4 (#5541 )	2021-12-16 16:21:52 +03:00
Talha Nisanci	c0945d88de	Normalize a debug failure to WARNING failure (#4996 )	2021-12-16 13:43:49 +03:00
Halil Ozan Akgul	8943d7b52f	Turn metadata sync on in mx_regular_user and remove_coordinator	2021-12-16 11:26:24 +03:00
Halil Ozan Akgul	b82af4db3b	Turn metadata sync on in multi_size_queries, multi_drop_extension and multi_unsupported_worker_operations	2021-12-16 11:10:54 +03:00
Hanefi Onaldi	9d4d73898a	Move healthcheck logic into new file (#5531 ) and add a missing `CheckCitusVersion(ERROR)` call	2021-12-15 15:58:20 -08:00
Hanefi Onaldi	acdcd9422c	Fix one flaky failure test (#5528 ) Removes flaky test	2021-12-15 18:59:58 +03:00
Hanefi Onaldi	29e4516642	Introduce citus_check_cluster_node_health UDF This UDF coordinates connectivity checks accross the whole cluster. This UDF gets the list of active readable nodes in the cluster, and coordinates all connectivity checks in sequential order. The algorithm is: for sourceNode in activeReadableWorkerList: c = connectToNode(sourceNode) for targetNode in activeReadableWorkerList: result = c.execute( "SELECT citus_check_connection_to_node(targetNode.name, targetNode.port") emit sourceNode.name, sourceNode.port, targetNode.name, targetNode.port, result - result -> true -> connection attempt from source to target succeeded - result -> false -> connection attempt from source to target failed - result -> NULL -> connection attempt from the current node to source node failed I suggest you use the following query to get an overview on the connectivity: SELECT bool_and(COALESCE(result, false)) FROM citus_check_cluster_node_health(); Whenever this query returns false, there is a connectivity issue, check in detail.	2021-12-15 01:41:51 +03:00

1 2 3 4 5 ...

3681 Commits (6da2d41e00eb33d4257e46ffbaaed131e2a89f8b)