citus

Commit Graph

Author	SHA1	Message	Date
Onur Tirtir	452b6a2212	Hint users to call "citus_set_coordinator_host" first (#6425 ) If an operation requires the coordinator to be in pg_dist_node and it is not, we automatically add the coordinator to pg_dist_node if the user has not added any worker nodes yet. However, if the user has already added some worker nodes, we throw an error. With this commit, we improve the error thrown in that case. Closes #6423 based on the discussion there. (cherry picked from commit `20847515fa`)	2022-10-12 18:29:39 +03:00
Hanefi Onaldi	64db74c051	Remove references to optimization PG15 reverted PG15 introduced an optimization on GROUP BY keys that is now reverted on RC2. Relevant PG commit: Revert "Optimize order of GROUP BY keys". 443df6e2db932a7cd6d85ddfb67e11a43345130d (cherry picked from commit `cbe4298c5b`)	2022-10-12 17:01:05 +03:00
Hanefi Onaldi	f06ae0c106	Add tests for CREATE DATABASE with OID option (#6376 ) PG15 now allows users to specify oids when creating databases. This feature is a side effect of a bigger feature in pg_upgrade. Relevant PG Commit: pg_upgrade: Preserve database OIDs. aa01051418f10afbdfa781b8dc109615ca785ff9 (cherry picked from commit `7e0edee4ec`)	2022-10-12 15:15:39 +03:00
Naisila Puka	2e06e62476	Adds tests for suppressed constants in postgres_fdw queries (#6370 ) PG15 has suppressed some casts on constants when querying foreign tables. For example, we can use text to represent a type that's an enum on the remote side. A comparison on such a column will get shipped as "var = 'foo'::text". But there's no enum = text operator on the remote side. If we leave off the explicit cast, the comparison will work. Test we behave in the same way with a Citus foreign table Reminder: foreign tables cannot be distributed/reference, can only be Citus local Relevant PG commit: `f8abb0f5e1` (cherry picked from commit `1b26d57288`)	2022-10-12 15:14:04 +03:00
Hanefi Onaldi	5294df1602	Add tests for jsonpath changes on PG15 PostgreSQL 15 had some changes to jsonpath to conform with ECMA-262 referenced by SQL standard. This commit adds tests to make sure Citus also supports the same standards. Relevant pg commit: e26114c817b610424010cfbe91a743f591246ff1 (cherry picked from commit `30ac6f0fe9`)	2022-10-12 15:13:08 +03:00
Naisila Puka	781960d16b	Comment about column list for fk ON DELETE SET in PG15 (#6372 ) As a part of `a868cc049a` (cherry picked from commit `dc9723fa45`)	2022-10-12 15:12:00 +03:00
Naisila Puka	4e54d1f0be	Add tests to verify we support security invoker views (#6362 ) PG15 added support for security invoker views. Relevant PG commit: `7faa5fc84b` These views check the permissions for the underlying tables of the view invoker user, not the view definer user. When the view has underlying distributed tables, the queries to the shards are sent by opening connections with the current user, which is the view invoker, no matter what the type of the view is. This means that, for distributed views, they were always behaving like security invoker views. Check the following issue for more details: https://github.com/citusdata/citus/issues/6161 So, Citus doesn't fully support security definer views. However Citus does fully support security invoker views. We add tests to make sure we cover different cases. (cherry picked from commit `1ede0b9db3`)	2022-10-12 15:10:37 +03:00
Onder Kalaci	270a18ca06	Add tests for PG15 new aggregate commands Both tests include pushdown and pull to coordinator type of aggregate execution. Relevant PG commits: Add min() and max() aggregates for xid8 400fc6b6487ddf16aa82c9d76e5cfbe64d94f660 Add range_agg with multirange inputs 7ae1619bc5b1794938c7387a766b8cae34e38d8a Co-authored-by: Onder Kalaci <onderkalaci@gmail.com> (cherry picked from commit `03ac8b4f82`)	2022-10-12 15:07:15 +03:00
Naisila Puka	71d049025d	Fixes empty password issue (#6417 ) (cherry picked from commit `89aa9a015f`)	2022-10-11 15:59:43 +03:00
Onur Tirtir	c2c0b97a5b	Retain trigger settings when re-creating the triggers (on shards) (#6398 ) Fixes https://github.com/citusdata/citus/issues/6394. DESCRIPTION: Fixes a bug that causes creating disabled-triggers on shards as enabled Since CREATE TRIGGER doesn't have syntax support to specify whether the trigger should be enabled/disabled, the underlying PG function (`pg_get_triggerdef()`) that we use to generate the command to create the trigger is not enough. For this reason, we append a second command to enable/disable trigger, right after creating it. We don't retain explicit extension dependencies set by using `ALTER trigger DEPENDS ON EXTENSION` commands too, but apparently right fix for that is to throw an error as in `PreprocessAlterTriggerDependsStmt()`; so, opened a separate PR to fix that #6399. (cherry picked from commit `86e186f671`)	2022-10-10 11:24:36 +03:00
Ying Xu	cf3018bd5a	[Columnar] Bugfix for Columnar: options ignored during ALTER TABLE (#6411 ) DESCRIPTION: Fixes a bug that prevents retaining columnar table options after a table-rewrite A fix for this issue: Columnar: options ignored during ALTER TABLE rewrite #5927 The OID for the temporary table created during ALTER TABLE was not the same as the original table's OID so the columnar options were not being applied during rewrite. The change is that I applied the original table's columnar options to the new table so that it has the correct options during write. I also added a test. Cherry-pick from commit `f21cbe68f8`	2022-10-09 22:30:54 -07:00
Naisila Puka	b926fe8114	Use original relation to retrieve column name because of syscache (#6387 ) During alter_distributed_table, we create a new table like the original table but with the altered options. To retrieve the name of the distribution column, we were using the attribute syscache of the new table, since we already created the new table as identical to the original table. However, the attribute syscaches of these two tables are not the same if the original table has dropped columns. The reason is that dropped columns are all still present in the cache. Hence, for example, the attnos would be different in the syscaches. So, let's use the attribute syscache of the original table.	2022-10-06 12:13:57 +03:00
Hanefi Onaldi	d2181aec7f	Document failing downgrades from 10.2-4 to 10.2-2 (cherry picked from commit `5ddd4754a2`)	2022-10-04 21:08:56 +03:00
Hanefi Onaldi	8a1c0ae821	Fix tests for missing downgrades (cherry picked from commit `0efd6f7829`)	2022-10-04 21:08:56 +03:00
Jelte Fennema	006f6aceaf	Reuse connections for Splits and Logical Replication (#6314 ) In Split, Logical replication logic and ShardCleaner we call `SendCommandListToWorkerOutsideTransaction` and `SendOptionalCommandListToWorkerOutsideTransaction` frequently. This opens new connection for each of those calls, even though we already have a perfectly good connection lying around. This PR adds two new APIs `SendCommandListToWorkerOutsideTransactionWithConnection` and `SendOptionalCommandListToWorkerOutsideTransactionWithConnection` that allow sending a list of queries in a transaction over an existing connection. We also update the callers (Split, ShardCleaner, Logical Replication) to use these new APIs instead. Co-authored-by: Nitish Upreti <niupre@microsoft.com> Co-authored-by: Onder Kalaci <onderkalaci@gmail.com> (cherry picked from commit `24e06af6d2`)	2022-09-26 16:53:38 +02:00
Onur Tirtir	b9e4364acc	Not allow ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences (#6340 ) Given that we drop DEFAULT nextval('sequence') expressions from shard relation columns, allowing `ON DELETE/UPDATE SET DEFAULT` on such columns might cause inserting NULL values as a result of a delete/update operation. For this reason, we disallow ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences. DESCRIPTION: Disallows having ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences Fixes #6339. (cherry picked from commit `a868cc049a`) Conflicts: src/test/regress/expected/pg15.out src/test/regress/sql/pg15.sql	2022-09-23 13:55:51 +03:00
Onur Tirtir	53ec5abb75	Not drop default col exprs from shard when adding local table to metadata (#6323 ) As we did for GENERATED STORED columns in #4613, we should not drop column default expressions that are not based on sequences from shard relation since such expressions need to exist e.g. for foreign key actions. For the column default expressions that are based on sequences we cannot do much, so we need to disallow having ON DELETE SET DEFAULT actions on such columns in a separate PR, see #6339. Fixes #6318. DESCRIPTION: Fixes a bug that might cause inserting incorrect DEFAULT values when applying foreign key actions (cherry picked from commit `de24a3eda5`)	2022-09-23 13:53:04 +03:00
Marco Slot	d5db0adc17	Allow create_distributed_table_concurrently on an empty node (#6353 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-16 12:17:25 +03:00
Onder Kalaci	af448da1a7	Prevent failures on partitioned distributed tables with statistics objects on PG 15 Comment from the code is clear on this: /* * The statistics objects of the distributed table are not relevant * for the distributed planning, so we can override it. * * Normally, we should not need this. However, the combination of * Postgres commit 269b532aef55a579ae02a3e8e8df14101570dfd9 and * Citus function AdjustPartitioningForDistributedPlanning() * forces us to do this. The commit expects statistics objects * of partitions to have "inh" flag set properly. Whereas, the * function overrides "inh" flag. To avoid Postgres to throw error, * we override statlist such that Postgres does not try to process * any statistics objects during the standard_planner() on the * coordinator. In the end, we do not need the standard_planner() * on the coordinator to generate an optimized plan. We call * into standard_planner() for other purposes, such as generating the * relationRestrictionContext here. * * AdjustPartitioningForDistributedPlanning() is a hack that we use * to prevent Postgres' standard_planner() to expand all the partitions * for the distributed planning when a distributed partitioned table * is queried. It is required for both correctness and performance * reasons. Although we can eliminate the use of the function for * the correctness (e.g., make sure that rest of the planner can handle * partitions), its performance implication is hard to avoid. Certain * planning logic of Citus (such as router or query pushdown) relies * heavily on the relationRestrictionList. If * AdjustPartitioningForDistributedPlanning() is removed, all the * partitions show up in the, causing high planning times for * such queries. */	2022-09-15 14:37:28 +03:00
aykut-bozkurt	77947da17c	ensure we have more active nodes than replication factor. (#6341 ) DESCRIPTION: Fixes floating exception during create_distributed_table_concurrently. Fixes #6332. During create_distributed_table_concurrently, when there is no active primary node, it fails with a floating exception. We added a check similar to the one in create_distributed_table. It now fails with a proper message if the number of active nodes is less than the replication factor. (cherry picked from commit `739b91afa6`)	2022-09-14 18:22:58 +03:00
Nils Dijk	7b51f3eee2	Fix: rebalance stop non super user (#6334 ) No need for description, fixing issue introduced with new feature for 11.1 Fixes #6333 Due to Postgres' C API being 0-indexed and Postgres' attributes being 1-indexed, we were reading the wrong Datum as the Task owner when cancelling. Here we add a test to show the error and fix the off-by-one error.	2022-09-13 23:20:06 +02:00
Naisila Puka	76ff4ab188	Adds support for unlogged distributed sequences (#6292 ) We can now do the following: - Distribute sequence with logged/unlogged option - ALTER TABLE my_sequence SET LOGGED/UNLOGGED - ALTER SEQUENCE my_sequence SET LOGGED/UNLOGGED Relevant PG commit `344d62fb9a`	2022-09-13 10:53:39 +03:00
Hanefi Onaldi	5cfcc63308	Add warning messages for cluster commands on partitioned tables (#6306 ) PG15 introduces `CLUSTER` commands for partitioned tables. Similar to a `CLUSTER` command with no supplied table names, these commands also can not be run inside transaction blocks and therefore can not be propagated in a distributed transaction block with ease. Therefore we raise warnings. Relevant PG commit: cfdd03f45e6afc632fbe70519250ec19167d6765	2022-09-13 00:05:58 +03:00
Hanefi Onaldi	164f2fa0a6	PG15: Add support for NULLS NOT DISTINCT (#6308 ) Relevant PG commit: 94aa7cc5f707712f592885995a28e018c7c80488	2022-09-12 23:47:37 +03:00
Marco Slot	b79111527e	Avoid blocking writes in create_distributed_table_concurrently (#6324 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 12:09:37 -07:00
Nils Dijk	cda3686d86	Feature: run rebalancer in the background (#6215 ) DESCRIPTION: Add a rebalancer that uses background tasks for its execution Based on the background jobs and tasks introduced in #6296 we implement a new rebalancer on top of the primitives of background execution. This allows the user to initiate a rebalance and let Citus execute the long running steps in the background until completion. Users can invoke the new background rebalancer with `SELECT citus_rebalance_start();`. It will output information on its job id and how to track progress. Also it returns its job id for automation purposes. If you simply want to wait till the rebalance is done you can use `SELECT citus_rebalance_wait();` A running rebalance can be cancelled/stopped with `SELECT citus_rebalance_stop();`.	2022-09-12 20:46:53 +03:00
Marco Slot	48f7d6c279	Show local managed tables in citus_tables and hide tables owned by extensions (#6321 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 17:49:17 +03:00
Marco Slot	b036e44aa4	Fix bug preventing isolate_tenant_to_new_shard with text column (#6320 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 16:29:57 +02:00
naisila	47bea76c6c	Revert "Support JSON_TABLE on PG 15 (#6241 )" This reverts commit `1f4fe35512`.	2022-09-12 15:20:17 +03:00
Onder Kalaci	36f8c48560	Add tests for allowing SET NULL/DEFAULT for subset of columns PG 15 added support for that (d6f96ed94e73052f99a2e545ed17a8b2fdc1fb8a). We also add support, but we already do not support ON DELETE SET NULL/DEFAULT for distribution column. So, in essence, we add support for reference tables and Citus local tables.	2022-09-12 13:56:09 +03:00
Marco Slot	2e943a64a0	Make shard moves more idempotent (#6313 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-09 18:21:36 +02:00
Jelte Fennema	a2d86214b2	Share more replication code between moves and splits (#6310 ) The logical replication catchup part for shard splits and shard moves is very similar. This abstracts most of that similarity away into a single function. This also improves the logic for non blocking shard splits a bit, by using faster foreign key creation. It also parallelizes index creation which shard moves were already doing, but shard splits did not.	2022-09-09 16:45:38 +02:00
Marco Slot	ba2fe3e3c4	Remove do_repair option from citus_copy_shard_placement (#6299 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-09 15:44:30 +02:00
Nils Dijk	00a94c7f13	Implement infrastructure to run sql jobs in the background (#6296 ) DESCRIPTION: Add infrastructure to run long running management operations in background This infrastructure introduces the primitives of jobs and tasks. A task consists of a sql statement and an owner. Tasks belong to a Job and can depend on other tasks from the same job. When there are either runnable or running tasks we would like to make sure a background task queue monitor process is running. A Task could be in running state while there is actually no monitor present due to a database restart or failover. Once the monitor starts it will reset any running task to its runnable state. To make sure only one background task queue monitor is ever running at once it will acquire an advisory lock that self conflicts. Once a task is done it will find all tasks depending on this task. After checking that the task doesn't have unmet dependencies it will transition the task from blocked to runnable state for the task to be picked up on a subsequent task start. Currently only one task can be running at a time. This can be improved upon in later releases without changes to the higher level API. The initial goal for these background tasks is to allow a rebalance to run in the background. This will be implemented in a subsequent PR.	2022-09-09 16:11:19 +03:00
Jelte Fennema	76137e967f	Create all foreign keys quickly at the end of a shard move (#6148 ) Previously we would create foreign keys to reference table in an extra fast way at the end of a shard move. This uses that same logic to also do it for foreign keys between distributed tables. Fixes #6141	2022-09-09 09:58:33 +02:00
Ahmet Gedemenli	eadc88a800	Introduce GUC citus.skip_constraint_validation (#6281 ) Introduces a new GUC named citus.skip_constraint_validation, which skips constraint validation when set to on. In the several places where we hack to skip the foreign key validation phase, we now use this GUC.	2022-09-08 18:13:18 +03:00
Hanefi Onaldi	a557a196aa	Add tests for numeric with scale greater than precision	2022-09-07 13:12:04 +03:00
Hanefi Onaldi	4db113496f	Add tests for new COPY features in PG15	2022-09-07 13:12:04 +03:00
Hanefi Onaldi	3e4e42253f	Add tests for new regexp sql functions	2022-09-07 13:12:04 +03:00
Nitish Upreti	d7404a9446	'Deferred Drop' and robust 'Shard Cleanup' for Splits. (#6258 ) DESCRIPTION: This PR adds support for 'Deferred Drop' and robust 'Shard Cleanup' for Splits. Common Infrastructure This PR introduces new common infrastructure so that any operation that wants robust cleanup of resources can register with the cleaner and have the resources cleaned appropriately based on a specified policy. 'Shard Split' is the first consumer using this new infrastructure. Note : We only support adding 'shards' as resources to be cleaned-up right now but the framework will be extended to support other resources in future. Deferred Drop for Split Deferred Drop Support ensures that shards undergoing split are not dropped inline as part of operation but dropped later when no active read queries are running on shard. This helps with : Avoids any potential deadlock scenarios that can cause long running Split operation to rollback. Avoids Split operation blocking writes and then getting blocked (due to running queries on the shard) when trying to drop shards. Deferred drop is the new default behavior going forward. Shard Cleaner Extension Shard Cleaner is a background task responsible for deferred drops in case of 'Move' operations. The cleaner has been extended to ensure robust cleanup of shards (dummy shards and split children) in case of a failure based on the new infrastructure mentioned above. The cleaner also handles deferred drop for 'Splits'. TESTING: New test 'citus_split_shard_by_split_points_deferred_drop' to test deferred drop support. New test 'failure_split_cleanup' to test shard cleanup with failures in different stages. Update 'isolation_blocking_shard_split' and 'isolation_non_blocking_shard_split' for deferred drop. Added non-deferred drop version of existing tests : 'citus_split_shard_no_deferred_drop' and 'citus_non_blocking_splits_no_deferred_drop'	2022-09-06 12:11:20 -07:00
Gokhan Gulbiz	ac96370ddf	Use IsMultiStatementTransaction for SELECT .. FOR UPDATE queries (#6288 ) * Use IsMultiStatementTransaction instead of IsTransaction for row-locking operations. * Add regression test for SELECT..FOR UPDATE statement	2022-09-06 16:38:41 +02:00
Emel Şimşek	6f06ff78cc	Throw an error if there is a RangeTblEntry that is not assigned an RTE identity. (#6295 ) * Fix issue : 6109 Segfault or (assertion failure) is possible when using a SQL function * DESCRIPTION: Ensures disallowing the usage of SQL functions referencing to a distributed table and prevents a segfault. Using a SQL function may result in segmentation fault in some cases. This change fixes the issue by throwing an error message when a SQL function cannot be handled. Fixes #6109. Co-authored-by: Emel Simsek <emel.simsek@microsoft.com>	2022-09-06 15:46:41 +02:00
Hanefi Onaldi	85b19c851a	Disallow distributing by numeric with negative scale PG15 allows numeric scale to be negative or greater than precision. This causes issues and we may end up routing queries to a wrong shard due to differing hash results after rounding. Formerly, when specifying NUMERIC(precision, scale), the scale had to be in the range [0, precision], which was per SQL spec. PG15 extends the range of allowed scales to [-1000, 1000]. A negative scale implies rounding before the decimal point. For example, a column might be declared with a scale of -3 to round values to the nearest thousand. Note that the display scale remains non-negative, so in this case the display scale will be zero, and all digits before the decimal point will be displayed. Relevant PG commit: 085f931f52494e1f304e35571924efa6fcdc2b44	2022-09-06 12:40:56 +03:00
Naisila Puka	d7f41cacbe	Prohibit renaming child trigger on distributed partition pre PG15 (#6290 ) Pre PG15, renaming child triggers on partitions is allowed. When creating a trigger in a distributed parent partitioned table, the triggers on the shards of the partitions have the same name with the triggers on the corresponding parent shards of the parent table. Therefore, they don't have the same appended shard id as the shard id of the partition. Hence, when trying to rename a child trigger on a partition of a distributed table, we can't correctly find the triggers on the shards of the partition in order to rename them since we append a different shard id to the name of the trigger. Since we can't find the trigger we get a misleading error of inexistent trigger. In this commit we prohibit renaming child triggers on distributed partitions altogether.	2022-09-06 12:19:25 +03:00
Naisila Puka	fd9b3f4ae9	Add tests to make sure distributed clone trigger rename fails in PG15 (#6291 ) Relevant PG commit: 80ba4bb383538a2ee846fece6a7b8da9518b6866	2022-09-06 11:04:14 +03:00
Marco Slot	e6b1845931	Change split logic to avoid EnsureReferenceTablesExistOnAllNodesExtended (#6208 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-05 22:02:18 +02:00
Önder Kalacı	bd13836648	Add citus.skip_advisory_lock_permission_checks (#6293 )	2022-09-05 17:47:41 +02:00
Ahmet Gedemenli	7c8cc7fc61	Fix flakiness for view tests (#6284 )	2022-09-02 10:12:07 +03:00
Marco Slot	432f399a5d	Allow citus_internal application_name with additional suffix (#6282 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-01 14:26:43 +02:00
Naisila Puka	9e2b96caa5	Add pg14->pg15 upgrade test for dist. triggers on part. tables (#6265 ) PRE PG15, Renaming the parent triggers on partitioned tables doesn't recurse to renaming the child triggers on the partitions as well. In PG15, Renaming triggers on partitioned tables recurses to renaming the triggers on the partitions as well. Add an upgrade test to make sure we are not breaking anything with distributed triggers on distributed partitioned tables. Relevant PG commit: 80ba4bb383538a2ee846fece6a7b8da9518b6866	2022-09-01 12:32:44 +03:00
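
The job and task state transitions described in commit 00a94c7f13 above can be sketched roughly as follows. This is a minimal in-memory illustration in Python, not the Citus implementation: the `Task` and `Job` classes and their method names are hypothetical, and only the states the commit message names (blocked, runnable, running, done) are modelled.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    # Hypothetical model: per the commit message, a task is a SQL statement
    # plus an owner, and may depend on other tasks of the same job.
    task_id: int
    sql: str
    owner: str
    depends_on: set = field(default_factory=set)
    status: str = "blocked"


class Job:
    """Hypothetical container for the tasks of one background job."""

    def __init__(self):
        self.tasks = {}

    def add_task(self, task):
        # A task without dependencies can run right away.
        task.status = "blocked" if task.depends_on else "runnable"
        self.tasks[task.task_id] = task

    def reset_running(self):
        # When a monitor starts (e.g. after a restart or failover), tasks
        # left in "running" state are reset to "runnable".
        for task in self.tasks.values():
            if task.status == "running":
                task.status = "runnable"

    def start_next(self):
        # Only one task may be running at a time.
        if any(t.status == "running" for t in self.tasks.values()):
            return None
        for task in sorted(self.tasks.values(), key=lambda t: t.task_id):
            if task.status == "runnable":
                task.status = "running"
                return task
        return None

    def finish(self, task_id):
        # Mark the task done, then unblock dependents whose dependencies
        # are now all done.
        self.tasks[task_id].status = "done"
        for task in self.tasks.values():
            if task.status == "blocked" and task_id in task.depends_on:
                if all(self.tasks[d].status == "done" for d in task.depends_on):
                    task.status = "runnable"
```

In this sketch a task that depends on two others stays blocked until both are done, which mirrors the "unmet dependencies" check in the commit message. The advisory lock that keeps a single monitor running is not modelled here.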

1925 Commits (8e0ce65d6dad1262270234bc32776b92ce574a63)