citus

Commit Graph

Author	SHA1	Message	Date
Jeff Davis	33ee4877d4	PG15: rename pgstat_initstats() -> pgstat_init_relation(). From PG commits bff258a273 and be902e2651.	2022-05-02 10:12:03 -07:00
Jeff Davis	033f9cfff7	PG15: update copied pg_get_object_address() code. Account for PG commits 5a2832465fd8 and a0ffa885e478.	2022-05-02 10:12:03 -07:00
Jeff Davis	bd455f42e3	PG15: handle change to SeqScan structure. Account for PG commit 2226b4189b. The one site dependent on it can do just as well with a Scan instead of a SeqScan.	2022-05-02 10:12:03 -07:00
Jeff Davis	3799f95742	PG15: Value -> String, Integer, Float. Handle PG commit 639a86e36a.	2022-05-02 10:12:03 -07:00
Jeff Davis	26f5e20580	PG15: update integer parsing APIs. Account for PG commits 3c6f8c011f and cfc7191dfe.	2022-05-02 10:12:03 -07:00
Jeff Davis	70c915a0f2	PG15: Handle data type changes in pg_collation. Account for PG commit 54637508f8.	2022-05-02 10:12:03 -07:00
Jeff Davis	9915fe8a1a	PG15: Handle different ways to get publication actions. Account for PG commit 52e4f0cd47.	2022-05-02 10:12:03 -07:00
Jeff Davis	1c1ef7ab8d	PG15: Handle extra argument to RelationCreateStorage. Account for PG commit 9c08aea6a309. Introduce RelationCreateStorage_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	ac952b2cc2	PG15: Handle extra argument to ExecARDeleteTriggers. Account for PG commit ba9a7e3921. Introduce ExecARDeleteTriggers_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	f944722c6a	PG15: Use RelationGetSmgr() instead of RelationOpenSmgr(). Handle PG commit f10f0ae420.	2022-05-02 10:12:03 -07:00
Hanefi Onaldi	518fb0873e	Introduce one new alternative text output to fix flakiness (#5913 ) Here is a flaky test output that is quite hard to fix: ```diff diff -dU10 -w /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out /home/circleci/project/src/test/regress/results/isolation_master_update_node.out --- /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out.modified 2022-03-21 19:03:54.237042562 +0000 +++ /home/circleci/project/src/test/regress/results/isolation_master_update_node.out.modified 2022-03-21 19:03:54.257043084 +0000 @@ -49,18 +49,20 @@ <waiting ...> step s2-update-node-1-force: <... completed> master_update_node ------------------ (1 row) step s2-abort: ABORT; step s1-abort: ABORT; FATAL: terminating connection due to administrator command -SSL connection has been closed unexpectedly +server closed the connection unexpectedly + This probably means the server terminated abnormally + before or while processing the request. ``` I could not come up with a solution that would decrease the flakiness in the test outputs. We already have 3 output files for the same test and now I introduced a 4th one. I can also add complex regular expressions that span multiple lines, and normalize these error messages. Feel free to suggest a normalized error message in a comment here. ## Current alternative file contents `isolation_master_update_node.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` `isolation_master_update_node_0.out` ``` step s1-abort: ABORT; WARNING: this step had a leftover error message FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. ``` `isolation_master_update_node_1.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` new file: `isolation_master_update_node_2.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. ```	2022-04-28 16:52:02 +03:00
Onder Kalaci	5fc7661169	Do not set coordinator's metadatasynced column to false After a disable_node	2022-04-25 09:25:59 +02:00
Onder Kalaci	a2debe0f02	Do not assign distributed transaction ids for local execution In the past, for all modifications on the local execution, we enabled 2PC (with `6a7ed7b309`). This also required us to enable coordinated transactions via https://github.com/citusdata/citus/pull/4831 . However, it does have a very substantial impact on the distributed deadlock detection. The distributed deadlock detection is designed to avoid single-statement transactions because they cannot lead to any actual deadlocks. The implementation is to skip backends without distributed transactions are assigned. Now that we assign single statement local executions in the lock graphs, we are conflicting with the design of distributed deadlock detection. In general, we should fix it. However, one might think that it is not a big deal, even if the processes show up in the lock graphs, the deadlock detection should not be causing any false positives. That is false, unless https://github.com/citusdata/citus/issues/1803 is fixed. Now that local processes are considered as a single distributed backend, the lock graphs might find: local execution 1 [tx id: 1] -> any local process [tx id: 0] any local process [tx id: 0] -> local execution 2 [tx id: 2] And, decides that there is a distributed deadlock. This commit is: (a) right thing to do, as local execuion should not need any distributed tx id (b) Eliminates performance issues that might come up with deadlock detection does a lot of unncessary checks (c) After moving local execution after the remote execution via https://github.com/citusdata/citus/pull/4301, the vauge requirement for assigning distributed tx ids are already gone.	2022-04-13 13:25:12 +02:00
Hanefi Onaldi	6254f30305	Add arbitrary config tests for function DDL statements (#5885 )	2022-04-12 16:03:10 +03:00
Önder Kalacı	dd78c81378	Fix flaky isolation - 1 (#5900 ) * Do not show any PG internal queries	2022-04-11 20:43:51 -07:00
Burak Velioglu	5d9599f964	Create function in transaction according to create object propagation guc	2022-04-08 17:15:31 +03:00
Nils Dijk	8897361f95	Implement DOMAIN propagation for citus	2022-04-08 15:25:39 +02:00
Jelte Fennema	6d8c5931d6	Work around flaky test related to search_path (#5894 ) For some reason search_path is not always set correctly on the worker when calling a distributed function, this shows up when calling `insert_document` in our distributed_triggers test. The underlying reason is currently unknown and warrants deeper investigation. Currently this test is one of the main causes for random CI failures. So this change sets the search_path of each function explicitly, to reduce these failures. So other devs can be more efficient, while I continue investigating the root cause of this issue. Also changes explicit `SET citus.enable_unsafe_triggers = false` to `RESET citus.enable_unsafe_triggers` in passing.	2022-04-08 16:09:33 +03:00
Onder Kalaci	b0b91bab04	Rename metadata sync to node metadata sync where applicable	2022-04-07 17:51:31 +02:00
Marco Slot	2304815356	Allow adding a unique constraint with an index	2022-04-07 16:00:31 +02:00
Marco Slot	c0827703ec	Fix EXPLAIN ANALYZE JSON format for subplans	2022-04-07 11:38:20 +02:00
Marco Slot	544dce919a	Handle user-defined type parameters in EXPLAIN ANALYZE	2022-04-07 11:14:32 +02:00
Marco Slot	9476f377b5	Remove old re-partitioning functions	2022-04-04 18:11:52 +02:00
Marco Slot	8c8c3b665d	Add TABLESAMPLE support	2022-04-01 15:51:40 +02:00
Ahmet Gedemenli	a62de6494d	Add schema tests to arbitrary configs	2022-04-01 13:57:17 +03:00
jeff-davis	c485a04139	Separate build of citus.so and citus_columnar.so. (#5805 ) * Separate build of citus.so and citus_columnar.so. Because columnar code is statically-linked to both modules, it doesn't make sense to load them both at once. A subsequent commit will make the modules entirely separate and allow loading them both simultaneously. Author: Yanwen Jin * Separate citus and citus_columnar modules. Now the modules are independent. Columnar can be loaded by itself, or along with citus. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2022-03-31 19:47:17 -07:00
Gledis Zeneli	c9aab7fb8b	Add TRUNCATE arbitrary config tests (#5848 ) Adds TRUNCATE arbitrary config tests. Also adds the ability to skip tests from particular configs.	2022-03-31 14:14:47 +03:00
Onder Kalaci	9043a1ed3f	Only hide shards from client backends and pg bg workers The aim of hiding shards is to hide shards from client applications. Certain bg workers (such as pg_cron or Citus maintanince daemon) should be treated like client applications because users can run queries from such bg workers. And, these bg workers should follow the similar application_name checks as client backeends. Certain other bg workers, such as logical replication or postgres' parallel workers, should never hide shards. They are internal operations. Similarly the other backend types like the walsender or checkpointer or autovacuum should never hide shards.	2022-03-30 16:56:12 +02:00
Ahmet Gedemenli	f74d3eedc8	Add tests for materialized views	2022-03-30 16:01:11 +03:00
Ahmet Gedemenli	8ef2da8192	Add view tests to arbitrary configs	2022-03-30 12:28:31 +03:00
Önder Kalacı	670fae99f7	Add tests with function dependencies on tables (#5866 ) We are not sure if we have such tests, but lets add anyway	2022-03-29 18:04:07 +03:00
Ahmet Gedemenli	1e1e66eeed	Add index tests to arbitrary configs (#5862 )	2022-03-29 13:49:05 +03:00
Ahmet Gedemenli	b52823f8b4	Fix typo in error message for truncating foreign tables (#5864 )	2022-03-29 13:14:16 +03:00
Onder Kalaci	23ff095905	add missing check_mx	2022-03-29 10:35:12 +02:00
Halil Ozan Akgul	c843ebe48e	Turn metadata sync on in arbitrary config tests	2022-03-23 15:19:52 +03:00
Jelte Fennema	3a44fa827a	Add versions of forboth that don't need ListCell (#5856 ) We've had custom versions of Postgres its `foreach` macro which with a hidden ListCell for quite some time now. People like these custom macros, because they are easier to use and require less boilerplate. This adds similar custom versions of Postgres its `forboth` macro. Now you don't need ListCells anymore when looping over two lists at the same time.	2022-03-23 14:50:36 +03:00
Ahmet Gedemenli	b5448e43e3	Fix aggregate signature bug (#5854 )	2022-03-23 13:42:03 +03:00
Burak Velioglu	db9f0d926c	Add support for deparsing ALTER FUNCION ... SUPPORT ... commands	2022-03-22 21:55:55 +03:00
Onder Kalaci	af4ba3eb1f	Remove citus.enable_cte_inlining GUC In Postgres 12+, users can adjust whether to inline/not inline CTEs by [NOT] MATERIALIZED keywords. So, this GUC is already useless.	2022-03-22 17:14:44 +01:00
Halil Ozan Akgul	4690c42121	Fixes ALTER COLLATION encoding does not exist bug	2022-03-22 17:42:20 +03:00
Marco Slot	32c23c2775	Disallow re-partition joins when no hash function defined	2022-03-22 13:42:53 +01:00
Onur Tirtir	11433ed357	Create DDL job for create enum command in postprocess as we do for composite types Since now we don't throw an error for enums that user attempts creating in temp schema, the preprocess / DDL job that contains the prepared statement (to idempotently create the enum type) gets executed. As a result, we were emitting the following warning because of the error the underlying worker connection throws: ```sql WARNING: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx WARNING: connection to the remote node localhost:xxxxx failed with the following error: another command is already in progress ERROR: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx ```	2022-03-22 15:09:23 +03:00
Onur Tirtir	dc31102630	Locally create objects having a dependency that we cannot distribute We were already doing so for functions & types believing that this cannot be the case for other object types. However, as in #5830, we cannot distribute an object that user attempts creating in temp schema. Even more, this doesn't only apply to functions and types but also to many other object types. So with this commit, we teach preprocess/postprocess functions (that need to create dependencies on worker nodes) how to skip trying to distribute such objects. We also start identifying temp schemas as the objects that we don't know how to propagate to worker nodes so that we can simply create objects locally if user attempts creating them in a temp schema. There are 36 callers of `EnsureDependenciesExistOnAllNodes` in the codebase atm and for the most we still need to throw a hard error (i.e.: not use `DeferErrorIfHasUnsupportedDependency` beforehand), such as: i) user explicitly wants to create a distributed object * CreateCitusLocalTable * CreateDistributedTable * master_create_worker_shards * master_create_empty_shard * create_distributed_function * EnsureExtensionFunctionCanBeDistributed ii) we don't want to skip altering distributed table on worker nodes * PostprocessIndexStmt * PostprocessCreateTriggerStmt * PostprocessCreateStatisticsStmt iii) object is already distributed / handled by Citus before, so we aren't okay with not propagating the ALTER command * PostprocessAlterTableSchemaStmt * PostprocessAlterCollationOwnerStmt * PostprocessAlterCollationSchemaStmt * PostprocessAlterDatabaseOwnerStmt * PostprocessAlterExtensionSchemaStmt * PostprocessAlterFunctionOwnerStmt * PostprocessAlterFunctionSchemaStmt * PostprocessAlterSequenceOwnerStmt * PostprocessAlterSequenceSchemaStmt * PostprocessAlterStatisticsSchemaStmt * PostprocessAlterStatisticsOwnerStmt * PostprocessAlterTextSearchConfigurationSchemaStmt * PostprocessAlterTextSearchDictionarySchemaStmt * PostprocessAlterTextSearchConfigurationOwnerStmt * PostprocessAlterTextSearchDictionaryOwnerStmt * PostprocessAlterTypeSchemaStmt * PostprocessAlterForeignServerOwnerStmt iv) we already cannot create those objects in temp schemas, so skipping for now * PostprocessCreateExtensionStmt * PostprocessCreateForeignServerStmt Also note that there are 3 more callers of `EnsureDependenciesExistOnAllNodes` in enterprise in addition to those 36 but we don't need to do anything specific about them due to the same reasoning given in iii).	2022-03-22 15:09:23 +03:00
Halil Ozan Akgul	50bace9cfb	Fixes the type names that start with underscore bug	2022-03-22 14:24:30 +03:00
Halil Ozan Akgul	4dbc760603	Introduces citus_coordinator_node_id	2022-03-22 10:34:22 +03:00
Hanefi Onaldi	9f204600af	Allow all possible option types for text search objects (#5838 )	2022-03-21 20:01:53 +01:00
Halil Ozan Akgül	6c05e4b35c	Add check_mx to operations schedule (#5818 )	2022-03-21 19:09:26 +03:00
Burak Velioglu	d4625ec6a1	Add support for zero-argument polymorphic aggregates	2022-03-21 16:10:40 +03:00
Ahmet Gedemenli	46c6630328	Qualify CREATE AGGREGATE stmts in Preprocess (#5834 )	2022-03-21 13:55:09 +03:00
Burak Velioglu	2c2064bf36	Create type locally if it has undistributable dependency	2022-03-18 18:23:32 +03:00
Marco Slot	055bbd6212	Use coordinated transaction when there are multiple queries per task	2022-03-18 15:04:27 +01:00
Marco Slot	cab243218d	Avoid locks in relation_is_a_known_shard	2022-03-18 14:37:39 +01:00
Marco Slot	5bb5359da0	Fix worker node version check	2022-03-17 13:23:02 +01:00
Marco Slot	22a18fc1f2	Fix typo in upgrade function	2022-03-17 13:23:02 +01:00
Jelte Fennema	68bfc8d1c0	Use good initdb options in arbitrary configs tests (#5802 ) In `pg_regress_multi.pl` we're running `initdb` with some options that the `common.py` `initdb` is currently not using. All these flags seem reasonable, so this brings `common.py` in line with `pg_regress_multi.pl`. In passing change the `--nosync` flag to `--no-sync`, since that's what the PG documentation lists as the official option name (but both work).	2022-03-17 13:22:23 +01:00
Jelte Fennema	b0e406a478	Disable ddl propagation when creating users in arbitrary config tests (#5814 ) This should help with failing enterprise tests.	2022-03-16 15:12:20 +01:00
Ahmet Gedemenli	eddfea18c2	Fix role creation issue on schema tests (#5812 )	2022-03-16 13:49:28 +01:00
Burak Velioglu	333c73a53c	Drop distributed table on worker with ProcessUtilityParseTree	2022-03-15 17:42:01 +03:00
Gledis Zeneli	56ab64b747	Patches #5758 with some more error checks (#5804 ) Add error checks to detect failed connection and don't ping secondary nodes to detect self reference.	2022-03-15 15:02:47 +03:00
Hanefi Onaldi	c0cd8f3d56	Wait until metadata sync before testing distributed sequences	2022-03-15 10:28:51 +01:00
Marco Slot	e42a798707	Always use RowShareLock in pg_dist_node when syncing metadata	2022-03-15 10:28:51 +01:00
Ahmet Gedemenli	36b33e2491	Add sequence tests to arbitrary config (#5771 ) Add sequence tests to arbitrary config (#5771)	2022-03-14 19:16:24 +03:00
Jelte Fennema	41c6393e82	Parallelize cluster setup in arbitrary config tests (#5738 ) Cluster setup time is significant in arbitrary configs. We can parallelize this a bit more. Runtime of the following command decreases from ~25 seconds to ~22 seconds on my machine with this change: ``` make -C src/test/regress/ check-arbitrary-base CONFIGS=CitusDefaultClusterConfig EXTRA_TESTS=prepared_statements_1 ``` Currently we can only run different configs in parallel. However, when working on a feature or trying to fix a bug this is not important. In those cases you simply want to run a single test file on a single config. And you want to run that every time you made a change to the code that you think fixes the issue. This PR allows parallelising running of bash commands. So `initdb` and `pg_ctl start` is run in parallel for all nodes in the cluster. Instead of one waiting for the other. When you run the above command nothing is being run in parallel. After this PR, cluster setup is being run in parallel.	2022-03-14 16:42:20 +01:00
Jelte Fennema	5063257252	Disable fsync in arbitrary config tests (#5800 ) We have fsync enabled for regular tests already in `pg_regress_multi.pl`. This does the same for the arbitrary config tests. On my machine this changes the runtime from the following command from ~37 to ~25 seconds: ```bash make -C src/test/regress/ check-arbitrary-configs CONFIGS=CitusDefaultClusterConfig ```	2022-03-14 18:12:38 +03:00
Onder Kalaci	338752d96e	Guard against hard wait event set errors Similar to https://github.com/citusdata/citus/pull/5158, but this time instead of the executor, use this in all the remaining places.	2022-03-14 14:35:56 +01:00
Onder Kalaci	953951007c	Move wait event error checks to connection manager	2022-03-14 14:35:56 +01:00
Onur Tirtir	216b9b5b7a	Fix an incorrect error message related with fkeys between replicated dist tables (#5796 ) This is not supported in enterprise too.	2022-03-14 14:34:09 +01:00
Hanefi Onaldi	b24e1dfccc	Propagate text search commands to all worker nodes (#5797 ) Here is a list of some functions, and the `TargetWorkerSet` parameters they supply to `NodeDDLTaskList`: PostprocessCreateTextSearchConfigurationStmt - NON_COORDINATOR_NODES PreprocessDropTextSearchConfigurationStmt - NON_COORDINATOR_METADATA_NODES PreprocessAlterTextSearchConfigurationSchemaStmt - NON_COORDINATOR_METADATA_NODES I guess this means that, if metadata syncing is disabled on the node, we may have some issues. Consider the following: Let's assume the user has metadata syncing disabled. 2 workers. `CREATE TEXT SEARCH CONFIGURATION ...` will get propagated to all workers. `ALTER ... CONFIGURATION ...` will not get propagated to workers. After adding a new non-metadata node, the new node will get the altered configuration as it reads from catalog. At this point CONFIGURATION definitions got diverged in the cluster. I suggest that we always use `NON_COORDINATOR_METADATA_NODES` in all the TEXT SEARCH operations here.	2022-03-14 14:44:34 +03:00
Onder Kalaci	db529facab	Only change the sequence types if the target column type is a supported sequence type Before this commit, we erroneously converted the sequence type to the column's type it is used. However, it is possible that the sequence is used in an expression which then converted to a type that cannot be a sequence, such as text. With this commit, we only try this conversion if the column type is a supported sequence type (e.g., smallint, int and bigint). Note that we do this conversion because if the column type is a bigint and the sequence is NOT a bigint, users would be in trouble because sequences would generate values that are out of the range of the column. (The other ways are already not supported such as the column is int and the sequence is bigint would fail on the worker.) In other words, with this commit, we scope this optimization only when the target column type is a supported sequence type. Otherwise, we let users to more freely use the sequences.	2022-03-11 16:06:00 +01:00
Halil Ozan Akgül	37fafd007c	Turn metadata sync on in isolation_update_node and isolation_update_node_lock_writes tests (#5779 )	2022-03-11 16:39:20 +03:00
Ahmet Gedemenli	d06146360d	Support GRANT ON SCHEMA commands in CREATE SCHEMA statements (#5789 ) * Support GRANT ON SCHEMA commands in CREATE SCHEMA statements * Add test * add comment * Rename to GetGrantCommandsFromCreateSchemaStmt	2022-03-11 14:47:45 +03:00
Jelte Fennema	e5d5c7be93	Start erroring out for unsupported lateral subqueries (#5753 ) With the introduction of #4385 we inadvertently started allowing and pushing down certain lateral subqueries that were unsafe to push down. To be precise the type of LATERAL subqueries that is unsafe to push down has all of the following properties: 1. The lateral subquery contains some non recurring tuples 2. The lateral subquery references a recurring tuple from outside of the subquery (recurringRelids) 3. The lateral subquery requires a merge step (e.g. a LIMIT) 4. The reference to the recurring tuple should be something else than an equality check on the distribution column, e.g. equality on a non distribution column. Property number four is considered both hard to detect and probably not used very often. Thus this PR ignores property number four and causes query planning to error out if the first three properties hold. Fixes #5327	2022-03-11 11:59:18 +01:00
Halil Ozan Akgül	c9913b135c	Turn metadata sync on in isolation_ref2ref_foreign_keys test (#5791 )	2022-03-11 13:30:11 +03:00
Halil Ozan Akgül	2edaf0971c	Turn metadata sync on in isolation reference copy vs all (#5790 ) * Turn metadata sync on in isolation_reference_copy_vs_all test * Update the output of isolation_reference_copy_vs_all test	2022-03-11 11:27:46 +03:00
Hanefi Onaldi	b0eb685101	Add support for TEXT SEARCH DICTIONARY objects TEXT SEARCH DICTIONARY objects depend on TEXT SEARCH TEMPLATE objects. Since we do not yet support distributed TS TEMPLATE objects, we skip dependency checks for text search templates, similar to what we do for roles. The user is expected to manually create the TEXT SEARCH TEMPLATE objects before a) adding new nodes, b) creating TEXT SEARCH DICTIONARY objects.	2022-03-11 03:40:20 +03:00
Marco Slot	49467e27e6	Ensure worker_save_query_explain_analyze always fully qualifies types (#5776 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-03-10 07:30:11 -08:00
Gledis Zeneli	2cb02bfb56	Fix node adding itself with citus_add_node leading to deadlock (Fix #5720 ) (#5758 ) If a worker node is being added, a command is sent to get the server_id of the worker from the pg_dist_node_metadata table. If the worker's id is the same as the node executing the code, we will know the node is trying to add itself. If the node tries to add itself without specifying `groupid:=0` the operation will result in an error.	2022-03-10 17:46:33 +03:00
Burak Velioglu	547f6b18ef	Ensure dependencies exists for all alter owner commands	2022-03-10 16:37:55 +03:00
Ahmet Gedemenli	4312486141	Remove unnecessary schema name from CREATE SCHEMA stmts (#5785 )	2022-03-10 15:19:14 +03:00
Hanefi Onaldi	d153c2de0d	Fix some typos in comments	2022-03-10 15:03:26 +03:00
Ahmet Gedemenli	551a7d1383	Support CREATE SCHEMA without name (#5782 )	2022-03-10 13:38:00 +03:00
Marco Slot	8e43c8094d	Fix CREATE EXTENSION propagation with custom version	2022-03-09 17:40:50 +01:00
Marco Slot	7559ad12ba	Change create_object_propagation default to immediate	2022-03-09 17:40:50 +01:00
Burak Velioglu	bbe1b16125	Check whether the object has unsupported or circular dependency	2022-03-09 16:37:53 +03:00
Jelte Fennema	c8839de68b	Don't use cascading deletes in Citus 11 migration script (#5767 ) Using CASCADE in a DELETE can inadvertently delete things we don't intend to. It's safer to fail hard and make the user delete depending things manually.	2022-03-09 14:35:23 +01:00
Halil Ozan Akgül	333bcc7948	Global PID Helper Functions (#5768 ) * Introduces citus_nodename_for_nodeid and citus_nodeport_for_nodeid functions * Introduces citus_nodeid_for_gpid and citus_pid_for_gpid functions * Add tests	2022-03-09 13:15:59 +03:00
Ahmet Gedemenli	264cf78842	Disable use_citus_managed_tables for Postgres config (#5773 )	2022-03-08 17:13:49 +03:00
Onder Kalaci	c32b2de1a7	Improve citus_lock_waits 1) Remove useless columns 2) Show backends that are blocked on a DDL even before gpid is assigned 3) One minor bugfix, where we clear distributedCommandOriginator properly.	2022-03-07 11:10:44 +01:00
Ahmet Gedemenli	2a3c0c1914	Revert upgrade script changes (#5757 )	2022-03-07 13:04:58 +03:00
Onder Kalaci	24fcd2a88c	Handle dropping the partitioned tables properly Before this commit, we might be leaving some metadata on the workers. Now, we handle DROP SCHEMA .. CASCADE properly to avoid any metadata leakage.	2022-03-07 10:02:54 +01:00
Nils Dijk	3801576dfb	Move pg_dist_object to pg_catalog (#5765 ) DESCRIPTION: Move pg_dist_object to pg_catalog Historically `pg_dist_object` had been created in the `citus` schema as an experiment to understand if we could move our catalog tables to a branded schema. We quickly realised that this interfered with the UX on our managed services and other environments, where users connected via a user with the name of `citus`. By default postgres put the username on the search_path. To be able to read the catalog in the `citus` schema we would need to grant access permissions to the schema. This caused newly created objects like tables etc, to default to this schema for creation. This failed due to the write permissions to that schema. With this change we move the `pg_dist_object` catalog table to the `pg_catalog` schema, where our other schema's are also located. This makes the catalog table visible and readable by any user, like our other catalog tables, for debugging purposes. Note: due to the change of schema, we had to disable 1 test that was running into a discrepancy between the schema and binary. Secondly, we needed to make the lookup functions for the `pg_dist_object` relation and their indexes less strict on the fallback of the naming due to an other test that, due to an unfortunate cache invalidation, needed to lookup the relation again. This makes that we won't default to _only_ resolving from `pg_catalog` outside of upgrades.	2022-03-04 17:40:38 +00:00
Halil Ozan Akgul	0500a62515	Updates citus_dist_stat_activity to use citus_stat_activity	2022-03-04 17:28:17 +03:00
Ahmet Gedemenli	b8eedcd261	Notice when create_distributed_function called without params (#5752 ) * Notice when create_distributed_function called without params * Move variable comments to top * Add valid check for cache entry * add objtype to notice msg * update test outputs * Add more tests * Address feedback	2022-03-04 17:26:39 +03:00
Önder Kalacı	bd6a6563ff	Merge branch 'master' into calculate_gpid	2022-03-04 11:34:12 +01:00
Burak Velioglu	cb6d67a9a9	Make sure that all dependencies of citus tables can be distributed	2022-03-03 20:08:09 +03:00
Onder Kalaci	c7b67ba0ea	Add citus_backend_gpid() And also citus_calculate_gpid(nodeId,pid). These UDFs are just wrappers for the existing functions. Useful for testing and simple manipulation of citus_stat_activity.	2022-03-03 15:29:40 +01:00
Halil Ozan Akgul	06a0509b1a	Introduces citus_stat_activity view	2022-03-03 16:19:20 +03:00
Marco Slot	ddf7cf29f3	Sync pg_dist_colocation as a batch	2022-03-03 12:48:48 +01:00
Marco Slot	3ba61244b8	Synchronize pg_dist_colocation metadata	2022-03-03 11:01:59 +01:00
Marco Slot	43e4dd3808	Add a citus.internal_reserved_connections setting	2022-03-02 19:13:53 +01:00
Onder Kalaci	e80a36c4b6	Improve visibility rules for non-priviledge roles It seems like our approach is way too restrictive and some places are wrong. Now, we follow very similar approach to pg_stat_activity. Some of the changes are pre-requsite for implementing citus_dist_stat_activity via citus_stat_activity.	2022-03-02 18:04:01 +01:00
Onder Kalaci	35ec9721b4	Add a new API for enabling Citus MX for clusters upgrading from earlier versions Clusters created pre-Citus 11 mostly didn't have metadata sync enabled. For those clusters, we add a utility UDF which fixes some minor issues and sync the necessary objects to the workers.	2022-03-02 17:02:55 +01:00
Onder Kalaci	98751058a9	Add Primary key to the table Otherwise enterprise tests fail	2022-03-02 12:03:59 +01:00
Marco Slot	dcfbb51b6b	Revert "Build Columnar.so and make Citus depends on it (#5661 )" This reverts commit `a4133c69e8`.	2022-03-02 11:33:15 +01:00
Ahmet Gedemenli	e1809af376	Propagate CREATE AGGREGATE commands	2022-03-02 10:52:43 +03:00
Onder Kalaci	b79a0052a4	Drop function in the tests on a never version As dropping the function now relies on pg_dist_object, which exists with 9.0+	2022-03-02 08:45:35 +01:00
ywj	a4133c69e8	Build Columnar.so and make Citus depends on it (#5661 ) * [Columnar] Build columnar.so and let citus depends on it Co-authored-by: Yanwen Jin <yanwjin@microsoft.com> Co-authored-by: Ying Xu <32597660+yxu2162@users.noreply.github.com> Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-03-01 23:31:14 +03:00
Nils Dijk	65bd540943	Feature: configure object propagation behaviour in transactions (#5724 ) DESCRIPTION: Add GUC to control ddl creation behaviour in transactions Historically we would _not_ propagate objects when we are in a transaction block. Creation of distributed tables would not always work in sequential mode, hence objects created in the same transaction as distributing a table that would use the just created object wouldn't work. The benefit was that the user could still benefit from parallelism. Now that the creation of distributed tables is supported in sequential mode it would make sense for users to force transactional consistency of ddl commands for distributed tables. A transaction could switch more aggressively to sequential mode when creating new objects in a transaction. We don't change the default behaviour just yet. Also, many objects would not even propagate their creation when the transaction was already set to sequential, leaving the probability of a self deadlock. The new policy checks solve this discrepancy between objects as well.	2022-03-01 17:29:31 +03:00
Burak Velioglu	f17872aed4	Expand functions while resolving dependencies	2022-03-01 17:08:46 +03:00
Gledis Zeneli	b825232ecb	Handle rebalance / replication when a node is disabled (Fix #5664 ) (#5729 ) The issue in question is caused when rebalance / replication call `FullShardPlacementList` which returns all shard placements (including those in disabled nodes with `citus_disable_node`). Eventually, `FindFillStateForPlacement` looks for the state across active workers and fails to find a state for the placements which are in the disabled workers causing a seg fault shortly after. Approach: * `ActivePlacementHash` was not using the status of the shard placement's node to determine if the node it is active. Initially, I just fixed that. * Additionally, I refactored the code which handles active shards in replication / rebalance to: * use a single function to determine if a shard placement is active. * do the shard active shard filtering before calling `RebalancePlacementUpdates` and `ReplicationPlacementUpdates`, so test methods like `shard_placement_rebalance_array` and `shard_placement_replication_array` which have different shard placement active requirements can do their own filtering while using the same rebalance / replicate logic that `rebalance_table_shards` and `replicate_table_shards` use. Fix #5664	2022-02-25 19:54:30 +03:00
Hanefi Onaldi	6c25eea62f	Fix some typos in comments	2022-02-24 19:48:52 +03:00
Onder Kalaci	df95d59e33	Drop support for CitusInitiatedBackend CitusInitiatedBackend was a pre-mature implemenation of the whole GlobalPID infrastructure. We used it to track whether any individual query is triggered by Citus or not. As of now, after GlobalPID is already in place, we don't need CitusInitiatedBackend, in fact it could even be wrong.	2022-02-24 12:12:43 +01:00
Marco Slot	0c4e3cb69c	Drop worker_partition_query_result on downgrade	2022-02-24 10:18:56 +01:00
Hanefi Onaldi	7bd6c2c9ac	Isolation tests for various ddl operations and metadata sync	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	f4e8af2c22	Do not acquire locks on node metadata explicitly	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	b70949ae8c	Lock nodes when building ddl task lists	2022-02-24 03:19:56 +03:00
Marco Slot	ef1ceb3953	Only use a single placement for map tasks	2022-02-23 19:40:21 +01:00
Marco Slot	8de802eec5	Enable local_shared_pool_size 5 in arbitrary configs test	2022-02-23 19:40:21 +01:00
Marco Slot	490765a754	Enable re-partition joins after local execution	2022-02-23 19:40:21 +01:00
Marco Slot	3cd9aa655a	Stop using citus.binary_worker_copy_format	2022-02-23 19:40:21 +01:00
Marco Slot	5ac0d31e8b	Fix re-partition hash range generation	2022-02-23 19:40:21 +01:00
Marco Slot	72d8fde28b	Use intermediate results for re-partition joins	2022-02-23 19:40:21 +01:00
Nils Dijk	1fb970224e	Fix: partitioned index dependencies (#5741 ) #5685 introduced the resolution of dependencies for indices. This missed support for indices on partitioned tables. This change adds support for partitioned indices to the dependency resolution code.	2022-02-23 17:53:26 +03:00
Jelte Fennema	e1afd30263	Speed up test runs on WSL2 a lot (#5736 ) It turns out `whereis` is incredibly slow on WSL2 (at least on my machine): ``` $ time whereis diff diff: /usr/bin/diff /usr/share/man/man1/diff.1.gz real 0m0.408s user 0m0.010s sys 0m0.101s ``` This command is run by our custom `diff` script, which is run for every test file that is run. So this adds lots of unnecessary runtime time to tests. This changes our custom `diff` script to only call `whereis` in the strange case that `/usr/bin/diff` does not exist. The impact of this small change on the total runtime of the tests on WSL is huge. As an example the following command takes 18 seconds without this change and 7 seconds with it: ``` make -C src/test/regress/ check-arbitrary-configs CONFIGS=PostgresConfig ```	2022-02-23 13:03:29 +01:00
Ahmet Gedemenli	8b9402540f	Add use_citus_managed_tables to arbitrary configs (cherry picked from commit 4e93afd1f78854e1aaab63690c441b0b0598a82c) (cherry picked from commit `0295fe2f5b`) (cherry picked from commit 878510725fab9cb6870b4504e0b1f055d7bbc68d)	2022-02-22 11:39:30 +03:00
Teja Mupparti	a62901396b	Allow unsafe triggers via a GUC	2022-02-21 22:45:17 -08:00
Onder Kalaci	95d5918967	Properly set worker_query and use	2022-02-21 18:22:33 +01:00
Onder Kalaci	dffcafc096	Use global pids in citus_lock_waits	2022-02-21 17:46:34 +01:00
Onder Kalaci	331af3dce8	Dumping wait edges becomes optionally scan all backends Before this commit, dumping wait edges can only be used for distributed deadlock detection purposes. With this commit, we open the possibility that we can use it for any backend.	2022-02-21 17:37:07 +01:00
Halil Ozan Akgul	f6cd4d0f07	Overrides pg_cancel_backend and pg_terminate_backend to accept global pid	2022-02-21 16:41:35 +03:00
Ahmet Gedemenli	c1d5ca9896	Do distributed check first, for DropSchema stmts	2022-02-21 14:43:04 +03:00
Ahmet Gedemenli	28aa715ce2	Add test for citus local tables with dropped columns	2022-02-21 12:07:17 +03:00
Ahmet Gedemenli	2bc6a00408	Refactor CreateDistributedTable to take column name	2022-02-21 12:07:17 +03:00
yxu2162	8974b2de66	Copied CheckCitusVersion over to Columnar to handle dependency issue. If we split columnar into two extensions, this will later be changed tl CheckColumnarVersion.	2022-02-18 09:47:39 -08:00
Philip Dubé	3d044dc543	Merge branch 'master' into avoid-exceptional-control-flow-in-fluent-py	2022-02-18 16:10:45 +00:00
Burak Velioglu	fa6866ed36	Start to propagate functions to worker nodes with CREATE FUNCTION command together with it's dependencies. If the function depends on any nondistributable object, function will be created only locally. Parameterless version of create_distributed_function becomes obsolete with this change, it will deprecated from the code with a subsequent PR.	2022-02-18 13:56:51 +03:00
gledis69	a14fada153	Prevent Deadlocks When a Worker Tries to Create Collation (Fix #5583 ) * When a worker tried to create a collation which had a dependency in the same worker node, it would cause a deadlock, now it throws the correct "not a coordinator" error.	2022-02-18 12:28:02 +03:00
Teja Mupparti	46fa47beea	Force-delegated functions' distribution argument must be reset as soon as the routine completes execution, and not wait until the top level Executor ends. This fixes issue #5687	2022-02-17 10:48:30 -08:00
Philip Dubé	e4420a6252	fluent.py: prefer simpler return based control flow in _accept rather than relying on raising an exception	2022-02-17 13:30:17 +00:00
Nils Dijk	768b320470	reuse GetRoleSpecObjectForUser	2022-02-17 13:16:10 +01:00
Nils Dijk	ea86f9f94e	Add support for TEXT SEARCH CONFIGURATION objects (#5685 ) DESCRIPTION: Implement TEXT SEARCH CONFIGURATION propagation The change adds support to Citus for propagating TEXT SEARCH CONFIGURATION objects. TSConfig objects cannot always be created in one create statement, and instead require a create statement followed by many alter statements to get turned into the object they should represent. To support this we add functionality to the worker to create or replace objects based on a list of statements. When the lists of the local object and the remote object correspond 1:1 we skip the creation of the object and simply mark it distributed. This is especially important for TSConfig objects as initdb pre-populates databases with a dozen configurations (for many different languages). When the user creates a new TSConfig based on the copy of an existing configuration there is no direct link to the object copied from. Since there is no link we can't simply rely on propagating the dependencies to the worker and send a qualified	2022-02-17 13:12:46 +01:00
Hanefi Onaldi	ccc4cc6bf0	Move test in isolation schedule to prevent failure We check for metadata consistency across the cluster in the test isolation_metadata_sync_vs_all. However, some earlier tests in enterprise repo leave invalid pg_dist_node entries in the worker nodes that have Oid values for already dropped role objects. To remedy that, I suggest that we move the test to earlier in the schedule, thereby making the tests pass for the time being. We should later introduce metadata checking either in a new isolation test or by moving this test later in the schedule. However, we should do that after we fix the underlying issue.	2022-02-17 13:15:21 +03:00
Ahmet Gedemenli	a1c3580c64	Support TRUNCATE for foreign tables	2022-02-17 09:59:53 +03:00
Onder Kalaci	abd5b1c506	Prevent any monitoring view/udf to show already exited backends The low-level StoreAllActiveTransactions() function filters out backends that exited. Before this commit, if you run a pgbench, after that you'd still see the backends show up: ```SQL select count() from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 538 │ └───────┘ ``` After this patch, only active backends show-up: ```SQL select count() from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 72 │ └───────┘ ```	2022-02-14 17:34:32 +01:00
Ahmet Gedemenli	0411a98c99	Refactor EnsureSequentialMode functions (#5704 )	2022-02-14 18:38:21 +03:00
Gledis Zeneli	badfd561b2	Prevent Citus table functions from being called on shards (Fix #5610 ) (#5694 ) DESCRIPTION: Prevent Citus table functions from being called on shards The operations that guard against using shards are: * Create Local Table * Create distributed table (which affects reference table creation as well). * I used a `ErrorIfRaltionIsKnownShard` instead of `ErrorIfIllegallyChangingKnownShard`. `ErrorIfIllegallyChangingKnownShard` allows the operation if `citus.enable_manual_changes_to_shards`, but I am not sure if it ever makes sense to create a distributed, reference, or citus local table out of a shard. I tried to go over the code to identify other UDF-s where shards could be illegaly changed, but I could not find any other. My knowledge of the codebase is not solid enough for me to say for sure. Fixes #5610	2022-02-14 16:06:48 +03:00
Hanefi Onaldi	2e5ca8ba2b	Add isolation tests for metadata sync vs all This commit introduces several test cases for concurrent operations that change metadata, and a concurrent metadata sync operation. The overall structure is as follows: - Session#1 starts metadata syncing in a transaction block - Session#2 does an operation that change metadata - Both sessions are committed - Another session checks whether the metadata are the same accross all nodes in the cluster.	2022-02-11 01:55:04 +03:00
Önder Kalacı	dc6c194916	Show IDLE backends in citus_dist_stat_activity (#5700 ) * Break the dependency to CitusInitiatedBackend infrastructure With this change, we start to show non-distributed backends as well in citus_dist_stat_activity. I think that (a) it is essential for making citus_lock_waits to work for blocked on DDL commands. (b) it is more expected from the user's perspective. The name of the view is a little inconsistent now (e.g., citus_dist_stat_activity) but we are already planning to improve the names with followup PRs. Also, we have global pids assigned, the CitusInitiatedBackend becomes obsolete.	2022-02-10 08:59:28 -08:00
Ahmet Gedemenli	76b63a307b	Propagate create/drop schema commands	2022-02-10 14:58:09 +03:00
Marco Slot	d0711ea9b4	Delegate function calls in FROM outside of transaction block	2022-02-09 20:56:25 +01:00
Onder Kalaci	1c30f61a70	Prevent citus.node_conninfo to use "application_name" With https://github.com/citusdata/citus/pull/5657, Citus uses a fixed application_name while connecting to remote nodes for internal purposes. It means that we cannot allow users to override it via citus.node_conninfo.	2022-02-09 13:22:04 +01:00
Teja Mupparti	1e3c8e34c0	Allow create_distributed_function() on a function owned by an extension Implement #5649 Allow create_distributed_function() on functions owned by extensions 1) Only update pg_dist_object, and do not propagate CREATE FUNCTION. 2) Ensure corresponding extension is in pg_dist_object. 3) Verify if dependencies exist on the function they should resolve to the extension. 4) Impact on node-scaling: We build a list of ddl commands based on all objects in pg_dist_object. We need to omit the ddl's for the extension-function, as it will get propagated by the virtue of the extension creation. 5) Extra checks for functions coming from extensions, to not propagate changes via ddl commands, even though the function is marked as distributed in pg_dist_object	2022-02-08 11:52:56 -08:00
Halil Ozan Akgul	8ee02b29d0	Introduce global PID	2022-02-08 16:49:38 +03:00
Burak Velioglu	0a70b78bf5	Add test for dist type	2022-02-07 17:50:49 +03:00
Burak Velioglu	c0aece64d0	Add test for checking distributed extension function	2022-02-07 17:50:48 +03:00
Burak Velioglu	ab248c1785	Check object ownership while creating pg_dist_object entries on remote	2022-02-07 17:50:48 +03:00
Burak Velioglu	8ae7577581	Use superuser connection while syncing dependent objects' pg_dist_object tuples	2022-02-07 17:50:45 +03:00
Marco Slot	872f0a79db	Remove random shard placement policy	2022-02-06 21:55:58 +01:00
Marco Slot	0cae8e7d6b	Remove local-node-first shard placement	2022-02-06 21:36:34 +01:00
Teja Mupparti	c8e504dd69	Fix the issue #5673 If the expression is simple, such as, SELECT function() or PEFORM function() in PL/PgSQL code, PL engine does a simple expression evaluation which can't interpret the Citus CustomScan Node. Code checks for simple expressions when executing an UDF but missed the DO-Block scenario, this commit fixes it.	2022-02-04 15:44:53 -08:00
Ying Xu	b5c116449b	Removed dependency from EnsureTableOwner (#5676 ) Removed dependency for EnsureTableOwner. Also removed pg_fini() and columnar_tableam_finish() Still need to remove CheckCitusVersion dependency to make Columnar_tableam.h dependency free from Citus.	2022-02-04 12:45:07 -08:00
Onur Tirtir	79442df1b7	Fix coordinator/worker query targetlists for agg. that we cannot push-down (#5679 ) Previously, we were wrapping targetlist nodes with Vars that reference to the result of the worker query, if the node itself is not `Const` or not a `Param`. Indeed, we should not do that unless the node itself is a `Var` node or contains a `Var` within it (e.g.: `OpExpr(Var(column_a) > 2)`). Otherwise, when worker query returns empty result set, then combine query exec would crash since the `Var` would be pointing to an empty tuple slot, which is not desirable for the node-executor methods.	2022-02-04 05:37:25 -08:00
Onder Kalaci	72d7d92611	Apply code review feedback	2022-02-04 10:52:57 +01:00
Onder Kalaci	923bb194a4	Move isolation_multiuser_locking to MX tests	2022-02-04 10:52:57 +01:00
Onder Kalaci	bcb00e3318	remove not used files	2022-02-04 10:52:57 +01:00
Onder Kalaci	ff234fbfd2	Unify old GUCs into a single one Replaces citus.enable_object_propagation with citus.enable_metadata_sync Also, within Citus 11 release cycle, we added citus.enable_metadata_sync_by_default, that is also replaced with citus.enable_metadata_sync. In essence, when citus.enable_metadata_sync is set to true, all the objects and the metadata is send to the remote node. We strongly advice that the users never changes the value of this GUC.	2022-02-04 10:52:56 +01:00
Teja Mupparti	f31bce5b48	Fixes the issue seen in https://github.com/citusdata/citus-enterprise/issues/745 With this commit, rebalancer backends are identified by application_name = citus_rebalancer and the regular internal backends are identified by application_name = citus_internal	2022-02-03 09:40:46 -08:00
jeff-davis	b072b9235e	Columnar: fix checksums, broken in `a4067913`. (#5669 ) Checksums must be set directly before writing the page. log_newpage() sets the page LSN, and therefore invalidates the checksum.	2022-02-02 13:22:11 -08:00
Onder Kalaci	650243927c	Relax some transactional limications on activate node We already enforce EnsureSequentialModeMetadataOperations(), and given that all activate node is transaction, we should be fine	2022-02-01 15:56:55 +01:00
Onder Kalaci	34d91009ed	Update outdated comment As of the current HEAD, we support sequences as first class objects	2022-02-01 15:37:10 +01:00
Marco Slot	63c6896716	Enable function call pushdown from workers	2022-02-01 14:13:25 +01:00
Önder Kalacı	f712dfc558	Add tests coverage (#5672 ) For extension owned tables with sequences	2022-02-01 15:39:52 +03:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Önder Kalacı	f68ac4a7cf	Consider foreign keys between reference tables (#5659 ) On #5071, we avoid edge cases, but below there are foreign key constraints as well This commit makes sure we cover those as well	2022-01-28 13:38:14 +01:00
Heikki Linnakangas	a40679139b	Use smgrextend() when extending relation, and WAL-log first. (#5654 ) When creating a new table, we bypass the buffer cache and write the initial pages directly with smgrwrite(). However, you're supposed to use smgrextend() when extending a relation, rather than smgrwrite(). There isn't much difference between them, but smgrextend() updates the relation size cache, which seems important, although I haven't seen any real bugs caused by that. Also, write the block to disk only after WAL-logging it, so that we can include the LSN of the WAL record in the version that we write out. Currently, the page as written to disk has LSN 0. That doesn't cause any user-visible issues either, at worst it could make us WAL-log a full page image of the page earlier than necessary, but that doesn't matter currently because we WAL-log full page images of all changes anyway. I bumped into that issue with LSN 0 in the page header when testing Citus with Zenith (https://github.com/zenithdb/zenith/issues/1176). Zenith contains a check that PANICs if you write a block to disk without WAL-logging it, and it works by checking the LSN of the page that's written out. In this case, we are WAL-logging the page even though the LSN on the page is 0, so it was a false alarm, but I'd love to get this changed in Citus to keep the check in Zenith simple. A downside of WAL-logging the page first is that if you run out of disk space, you have already created the WAL record. So if you then crash and restart, WAL recovery will likely run out of disk space, too, which is bad. In practice, we have the same problem in other places, like rewriteheap.c. Also, if you are on the brink of running out of disk space, you will probably run out at WAL replay anyway, regardless of which order we write these few pages. But if we wanted to fix that, we could first extend the relation with zeros, and then WAL-log the pages. That's how heap extension works. It would be even nicer to use the buffer cache for this, and skip the smgrimmedsync() on the relation. However, that would require more work, because we don't have the Relation struct for the relation here. We could use ReadBufferWithoutRelcache(), but that doesn't work for unlogged tables. Unlogged tables are currently not supported (https://github.com/citusdata/citus/issues/4742), but that would become a problem if we want to support them in the future. CreateFakeRelcacheEntry() also doesn't work with unlogged tables. We could do things differently for logged and unlogged tables, but that complicates the code further. Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-01-27 12:04:08 -08:00
Onder Kalaci	303540e494	Add PGAPPNAME env. variable to arbitrary configs	2022-01-27 11:00:15 +01:00
Onder Kalaci	b26eeaecd3	Use a fixed application_name while connecting to remote nodes Citus heavily relies on application_name, see `IsCitusInitiatedRemoteBackend()`. But if the user set the application name, such as export PGAPPNAME=test_name, Citus uses that name while connecting to the remote node. With this commit, we ensure that Citus always connects with the "citus" user name to the remote nodes.	2022-01-27 10:46:25 +01:00
Onder Kalaci	b9b419ef16	Allow creating distributed tables in sequential mode With https://github.com/citusdata/citus/pull/2780, we allow COPY to use any number of connections that the executor used in a tx block. Meaning that, while COPYing data to the shards, create_distributed_table could allow sequential mode.	2022-01-26 12:58:18 +01:00
Onur Tirtir	8c8d696621	Not fail over to local execution when it's not supported (#5625 ) We fall back to local execution if we cannot establish any more connections to local node. However, we should not do that for the commands that we don't know how to execute locally (or we know we shouldn't execute locally). To fix that, we take localExecutionSupported take into account in CanFailoverPlacementExecutionToLocalExecution too. Moreover, we also prompt a more accurate hint message to inform user about whether the execution is failed because local execution is disabled by them, or because local execution wasn't possible for given command.	2022-01-25 16:43:21 +01:00
Onur Tirtir	ff3913ad99	Copy errmsg for distributed deadlock error into heap (#5641 ) multi_log_hook() hook is called by EmitErrorReport() when emitting the ereport either to frontend or to the server logs. And some callers of EmitErrorReport() (e.g.: errfinish()) seems to assume that string fields of given ErrorData object needs to be freed. For this reason, we copy the message into heap here. I don't think we have faced with such a problem before but it seems worth fixing as it is theoretically possible due to the reasoning above.	2022-01-24 06:27:41 -08:00
Ahmet Gedemenli	c838fb428f	Refactor GenerateGrantOnSchemaStmtForRights	2022-01-24 11:31:59 +03:00
Ahmet Gedemenli	e6fc0c6f36	Turn mx on for test: multi_colocation_utils	2022-01-21 19:31:47 +03:00
Onur Tirtir	4dc38e9e3d	Use EnsureCompatibleLocalExecutionState instead (#5640 )	2022-01-21 15:37:59 +01:00
Ahmet Gedemenli	8647682c11	Fix typo: taget/target	2022-01-21 10:35:56 +03:00
Onur Tirtir	181111b84f	Drop ruleutils copied for statistics	2022-01-20 17:28:19 +03:00
Onur Tirtir	7b59295af2	Drop ruleutils copied for triggers	2022-01-20 17:28:19 +03:00
Önder Kalacı	e8ba9dd9d3	Merge branch 'master' into make_minimal_work_again	2022-01-20 11:48:53 +01:00
Teja Mupparti	54862f8c22	(1) Functions will be delegated even when present in the scope of an explicit BEGIN/COMMIT transaction block or in a UDF calling another UDF. (2) Prohibit/Limit the delegated function not to do a 2PC (or any work on a remote connection). (3) Have a safety net to ensure the (2) i.e. we should block the connections from the delegated procedure or make sure that no 2PC happens on the node. (4) Such delegated functions are restricted to use only the distributed argument value. Note: To limit the scope of the project we are considering only Functions(not procedures) for the initial work. DESCRIPTION: Introduce a new flag "force_delegation" in create_distributed_function(), which will allow a function to be delegated in an explicit transaction block. Fixes #3265 Once the function is delegated to the worker, on that node during the planning distributed_planner() TryToDelegateFunctionCall() CheckDelegatedFunctionExecution() EnableInForceDelegatedFuncExecution() Save the distribution argument (Constant) ExecutorStart() CitusBeginScan() IsShardKeyValueAllowed() Ensure to not use non-distribution argument. ExecutorRun() AdaptiveExecutor() StartDistributedExecution() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the remoteTaskList. NonPushableInsertSelectExecScan() InitializeCopyShardState() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the placementList. This also fixes a minor issue: Properly handle expressions+parameters in distribution arguments	2022-01-19 16:43:33 -08:00
Onder Kalaci	7f30222c90	Fix check-minimal It seems like we broke check-minimal with the refactor on #5486 This commit fixes the minor issue	2022-01-19 16:21:59 +01:00
Ahmet Gedemenli	9e6ebe4826	Turn mx on for test file citus_local_tables, on multi-1 schedule	2022-01-19 13:55:51 +03:00
Onur Tirtir	4a53967bdd	Remove an outdated comment from RelationIsAKnownShard (#5629 )	2022-01-19 11:24:10 +01:00
Ahmet Gedemenli	37b3f50447	Turn mx on for multi-1 schedule (#5627 ) For test files: multi_generate_ddl_commands, multi_repair_shards, multi_create_shards, mixed_relkind_tests	2022-01-19 12:05:54 +03:00
Marco Slot	33bfa0b191	Hide shards from application_name's with a specific prefix	2022-01-18 15:20:55 +04:00
Onur Tirtir	d98500ac22	Fix a flaky test related with temp columnar table cleanup (#5599 ) Wait until old backend to expire to make sure that temp table cleanup is complete.	2022-01-17 09:26:30 -08:00
Ahmet Gedemenli	e564220dd5	Fix typo: GetRelationTriggerFunctionDependencyList (#5626 )	2022-01-17 18:17:07 +03:00
Ahmet Gedemenli	8936543b80	Create wrapper function CreateObjectAddressDependencyDefList (#5623 )	2022-01-17 15:35:40 +03:00
Ying Xu	4dca662e97	Making Columnar Dependency Free from Citus (#5622 ) * Removed distributed dependency in columnar_metadata.c * Changed columnar_debug.c so that it no longer needed distributed/tuplestore and made it return a record instead of a tuplestore * removed distributed/commands.h dependency * Made columnar_tableam.c dependency-free * Fixed spacing for columnar_store_memory_stats function * indentation fix * fixed test failures	2022-01-14 09:43:05 -08:00
Onur Tirtir	70d8e1fe97	Assert that we will create indexes on shards via local execution (#5620 )	2022-01-13 17:09:57 +01:00
Halil Ozan Akgul	63cd90e5dd	Add missing library to dependencies.c	2022-01-11 18:36:43 +03:00
Önder Kalacı	46ec7cd5cf	Enable MX for rebalancer tests	2022-01-11 12:07:39 +01:00

... 2 3 4 5 6 ...

3826 Commits (cc694b6bcfb13d02aa00ba467acf356b61d612a1)