citus

Commit Graph

Author	SHA1	Message	Date
Naisila Puka	498131b4f6	Use RelationGetPrimaryKeyIndex for citus catalog tables (#6262) pg_dist_node and pg_dist_colocation have a primary key index, not a replica identity index. Citus catalog tables are created in public schema, which has replica identity index by default as primary key index. Later the citus catalog tables are moved to pg_catalog schema. During pg_upgrade, all tables are recreated, and given that pg_dist_colocation is found in pg_catalog schema, it is recreated in that schema, and when it is recreated it doesn't have a replica identity index, because catalog tables have no replica identity. Further action: Do we even need to acquire this lock on the primary key index? Postgres doesn't acquire such locks on indexes before deleting catalog tuples. Also, catalog tuples don't have replica identities by definition.	2022-09-22 12:50:11 +03:00
Jelte Fennema	75cf7a748d	Define symbols required for downgrade from 11.1 (#6301) Since #6300/e29db74 changed the C symbol that our bigint overrides of pg_cancel_backend and pg_terminate_backend called, we needed to do something to keep these functions working after downgrading. Recreating the old definition with a downgrade script is not really possible, since people are expected to run the downgrade steps when using the new .so file, which does not contain the old symbols. So, the easiest way to solve it was to also define the new symbols in our old Citus versions. Luckily our overrides haven't existed for long, so these symbol definitions only needed to be backported to 11.0.	2022-09-07 12:18:39 +02:00
Marco Slot	5f57d77899	Allow citus_internal application_name with additional suffix (#6282) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-05 21:41:06 +02:00
Marco Slot	0a11da1291	Add an allow_unsafe_constraints flag for constraints without distribution column (#6237) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-25 16:13:07 +02:00
Gokhan Gulbiz	07143e7d12	Use the same colocation group for child and parent rels when altering a distributed table (#6225) * Alter_distributed_table colocateWith:none bug fix for partitioned tables. * Regression tests added for alter_distributed_table colocateWith:none for partitioned tables * Update query comparison to be more accurate (cherry picked from commit `69d2fcf5c0`)	2022-08-25 11:47:06 +03:00
Marco Slot	9dc6273b88	Set application_name to citus_rebalancer when copying reference tables	2022-08-23 23:26:49 +02:00
Nils Dijk	08dee6fe08	Fix reference table lock contention (#6173) DESCRIPTION: Fix reference table lock contention Dropping and creating reference tables unintentionally blocked on each other due to the use of an ExclusiveLock for both the Drop and conditionally copying existing reference tables to (new) nodes. The patch does the following: - Lower lock level for dropping (reference) tables to `ShareLock` so they don't self conflict - Treat reference tables and distributed tables equally and acquire the colocation lock when dropping any table that is in a colocation group - Perform the precondition check for copying reference tables twice, first time with a lower lock that doesn't conflict with anything. Could have been a NoLock, however, in preparation for dropping a colocation group, it is an `AccessShareLock`. During normal operation the first check will always pass and we don't have to escalate that lock. This means we won't be blocked when adding and removing reference tables. Only after a node addition the first `create_reference_table` will still need to acquire an `ExclusiveLock` on the colocation group to perform the copy.	2022-08-18 13:22:31 +02:00
Onder Kalaci	87787dd146	Support Sequences owned by columns before distributing tables There are 3 different ways that a sequence can be interacting with tables. (1) and (2) are already supported. This commit adds support for (3). (1) column DEFAULT nextval('seq'): The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \|------------------ sequence <--------\| (2) serial columns: Bigserial/small serial etc: The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \| \| sequence <--------\| (3) Sequence OWNED BY table.column: Added support for this type of resolution in this commit. The dependency is almost like the following, and ExpandCitusSupportedTypes() is NOT responsible for finding the dependency. schema <--- table <--- column ^ \| sequence (cherry picked from commit `9ec8e627c1`)	2022-08-18 11:22:25 +02:00
Marco Slot	56939f0d14	Fix relation access tracking for local only transactions on release-11.0 (#6182) Co-authored-by: Onder Kalaci <onderkalaci@gmail.com>	2022-08-18 10:13:41 +02:00
Ahmet Gedemenli	7df8588107	Fix upgrade paths for 11.0 (#6171) * Fix upgrade paths for 11.0	2022-08-17 21:34:23 +03:00
aykut-bozkurt	e0b4455e45	sysid should be parsed as int. (#6150) (cherry picked from commit `898801504e`)	2022-08-11 11:03:41 +03:00
Ying Xu	a8aa82a3ec	Bugfix for IN clause to be considered during planner phase in Columnar (#6030) Reported bug #5803 shows that we are currently not sending the IN clause to our planner for columnar. This PR fixes it by checking for ScalarArrayOpExpr in ExtractPushdownClause so that we do not skip it. Also added a test case for this new addition.	2022-07-29 17:56:24 +02:00
Ahmet Gedemenli	2f1719c149	Do not create truncate triggers on foreign tables (#6103)	2022-07-29 16:43:09 +03:00
Marco Slot	4eb0749369	Avoid catalog read via superuser() call in DecrementSharedConnectionCounter	2022-07-29 14:22:51 +02:00
Marco Slot	4439124b6d	Fix issues with insert..select casts and column ordering	2022-07-28 13:54:04 +02:00
Jelte Fennema	1cf079581f	Avoid possible information leakage about existing users (#6090) (cherry picked from commit `0f50bef696`)	2022-07-27 17:58:24 +02:00
Ahmet Gedemenli	4d01af5160	Error out for views with circular dependencies (#6051) Adds error check for views with circular dependencies (cherry picked from commit `2b2a529653`)	2022-07-27 17:59:49 +03:00
Marco Slot	e45b6ece0d	Allow WITH HOLD cursors with parameters	2022-07-27 14:08:18 +02:00
Onder Kalaci	9af736c7a6	Concurrent shard move/copy and colocated table creation fix It turns out that create_distributed_table and citus_move/copy_shard_placement do not work well concurrently. To fix that, we need to acquire a lock, which sounds like a good use of the colocation lock. However, the current usage of the colocation lock is limited to higher level UDFs like rebalance_table_shards etc. That usage of the lock is still useful, but we cannot acquire the same lock in citus_move_shard_placement etc. because the coordinator connects to itself to acquire the lock. Hence, the high level UDF blocks itself. To fix that, we use one more colocation lock, with the placements as the main objects to consider. (cherry picked from commit `12fa3aaf6b`)	2022-07-27 10:10:46 +02:00
Onder Kalaci	a21a4e128c	Optimize StringJoin() for when prefix-postfix is needed Before this commit, we required multiple copies of the same stringInfo if we needed to append/prepend data to the stringInfo. Now, we optionally get prefix/postfix. For large string operations, this can save up to 10% memory. (cherry picked from commit `26fdcb68f0`)	2022-07-27 10:02:32 +02:00
Onder Kalaci	2a684e426c	Do not cache all the metadata during fix_all_partition_shard_index_names (cherry picked from commit `f076e81166`)	2022-07-27 10:02:05 +02:00
Onder Kalaci	377375de2a	Reduce memory consumption while adjust partition index names Previously, CreateFixPartitionShardIndexNames() created all the relevant query strings for all the shards, and executed the large query string. And, in terms of the memory consumption, this huge command (and its ExprContext generated while running the command) is the main bottleneck. With this change, we are reducing the total amount of memory usage to almost 1/shard_count. On my local machine, for a distributed partitioned table with 120 partitions, each with 32 shards, the total memory consumption was reduced from ~3GB to ~0.1GB. And, the total execution time increased from ~28 seconds to ~30 seconds. This seems like a good trade-off. (cherry picked from commit `b8008999dc`)	2022-07-27 10:02:00 +02:00
Nitish Upreti	fcdf4434c6	Fix blocking shard moves failure due to constraint failure. DESCRIPTION: Fix Bug #4949 where blocking shard moves fail if there is a foreign key between partitioned distributed tables (from child to parent). This is because we try to create constraints before attaching child partitions to the parent. This causes a constraint failure as the parent table will be empty. The fix is to reverse the order, i.e. attach partitions before we create constraints. TESTING: Added a new test 'shard_move_constraints_blocking' inspired by the existing 'shard_move_constraints' where we trigger shard move with 'block_writes' instead of 'force_logical' to add coverage for this scenario.	2022-07-24 21:21:25 -07:00
Onder Kalaci	06e55df141	Make sure citus_is_coordinator works on read replicas (cherry picked from commit `b2e9a5baf1`)	2022-07-13 15:15:46 +02:00
Onder Kalaci	06d6ffbb6e	LOCK COMMAND does not require primaries at the start (cherry picked from commit `8ab696f7e2`)	2022-07-13 15:15:40 +02:00
Ahmet Gedemenli	ac7511de7d	Fix matviews for citus_add_local_table_to_metadata (#6023) (cherry picked from commit `c8e1e243b8`)	2022-07-04 17:01:40 +03:00
Hanefi Onaldi	0eee7fd9b8	Fix downgrade scripts from 11.0-2 to 11.0-1 (cherry picked from commit `f60809a6c1`) Conflicts: src/test/regress/expected/multi_extension.out src/test/regress/sql/multi_extension.sql	2022-06-29 22:52:07 +03:00
Önder Kalacı	03a4305e06	Fixes a bug that prevents upgrades when there are no worker nodes (#6037) (cherry picked from commit `bab4c0a8c3`)	2022-06-29 14:36:24 +03:00
Onder Kalaci	d397dd0dfe	Fixes a bug that prevents upgrades when there are COMPRESSION and DEFAULT columns	2022-06-29 10:45:33 +02:00
Ahmet Gedemenli	b559ae5813	Fix creating stats bug when CREATE TABLE LIKE (#6006) (cherry picked from commit `1ee3e8b7f4`)	2022-06-16 12:45:23 +03:00
Jelte Fennema	a01e45f3df	Make enterprise features open source This PR makes all of the features open source that were previously only available in Citus Enterprise. Features that this adds: 1. Non blocking shard moves/shard rebalancer (`citus.logical_replication_timeout`) 2. Propagation of CREATE/DROP/ALTER ROLE statements 3. Propagation of GRANT statements 4. Propagation of CLUSTER statements 5. Propagation of ALTER DATABASE ... OWNER TO ... 6. Optimization for COPY when loading JSON to avoid double parsing of the JSON object (`citus.skip_jsonb_validation_in_copy`) 7. Support for row level security 8. Support for `pg_dist_authinfo`, which allows storing different authentication options for different users, e.g. you can store passwords or certificates here. 9. Support for `pg_dist_poolinfo`, which allows using connection poolers in between coordinator and workers 10. Tracking distributed query execution times using citus_stat_statements (`citus.stat_statements_max`, `citus.stat_statements_purge_interval`, `citus.stat_statements_track`). This is disabled by default. 11. Blocking tenant_isolation 12. Support for `sslkey` and `sslcert` in `citus.node_conninfo`	2022-06-16 08:09:45 +02:00
Marco Slot	0861c80c8b	Fix bug in unqualified, non-existing DROP DOMAIN IF EXISTS (cherry picked from commit `ee34e1ed9d`)	2022-06-15 16:53:25 +02:00
Burak Velioglu	de6373b842	Fix dropping temporary view without specifying the explicit schema name (cherry picked from commit `4d533c3c56`)	2022-06-15 16:36:52 +02:00
Ahmet Gedemenli	4345627480	Fix materialized view intermediate result filename (#5982) (cherry picked from commit `268d3fa3a6`)	2022-06-14 15:43:18 +03:00
Marco Slot	4bcffce036	Introduce a citus_finish_citus_upgrade() function	2022-06-13 13:28:31 +02:00
Halil Ozan Akgul	7166901492	Fixes the bug where undistribute can drop Citus extension (cherry picked from commit `b255706189`)	2022-06-01 18:56:56 +03:00
Gledis Zeneli	c440cbb643	Fix memory error with citus_add_node reported by valgrind test (#5967) The error comes due to the datum jsonb in pg_dist_metadata_node.metadata being 0 in some scenarios. This is likely due to not copying the data when receiving a datum from a tuple and pg deciding to deallocate that memory when the table that the tuple was from is closed. Also fix another place in the code that might have been susceptible to this issue. I tested on both multi-vg and multi-1-vg and the tests were successful. (cherry picked from commit `beef392f5a`)	2022-06-01 13:06:54 +03:00
gledis69	a64e135a36	Revert "Copy data from heap tuples instead of using references" This reverts commit `50e8638ede`.	2022-06-01 13:06:38 +03:00
gledis69	50e8638ede	Copy data from heap tuples instead of using references The general rule is: If the data is used within the bounds of table_open ... table_close > no need to copy If the data is required for use even after the table is closed > copy (cherry picked from commit `dc9da7630f`)	2022-06-01 12:27:11 +03:00
jeff-davis	b34b1ce06b	Columnar: fix wraparound bug. (#5962) columnar_vacuum_rel() now advances relfrozenxid. Fixes #5958. (cherry picked from commit `74ce210f8b`)	2022-05-31 07:46:12 -07:00
Onder Kalaci	3227d6551e	Do not send metadata changes during add node if citus.enable_metadata_sync is set to false (cherry picked from commit `7157152f6c`)	2022-05-30 17:01:44 +02:00
Onder Kalaci	d147d5d0c5	Avoid assertion failure on citus_add_node (cherry picked from commit `010a2a408e`)	2022-05-30 17:01:38 +02:00
Ahmet Gedemenli	4b5f749c23	Propagate dependent views upon distribution (#5950) (cherry picked from commit `26d927178c`)	2022-05-26 18:58:04 +03:00
Burak Velioglu	29c67c660d	Create view and materialized views with right schema and owner while altering the distributed table. To be able to alter a view's owner without enforcing sequential mode, the alter view process functions have been updated to use the metadata connection.	2022-05-25 10:42:54 +03:00
Gledis Zeneli	6da2d41e00	Do not obtain AccessShareLock before actual lock (#5965) Do not obtain AccessShareLock before acquiring the distributed locks. Acquiring an AccessShareLock ensures that the relations which we are trying to get a distributed lock on will not be dropped in the time between when the LOCK command is issued and the LOCK commands are sent to the workers. However, this also leads to distributed deadlocks in such scenarios: ```sql -- for dist lock acquiring order coor, w1, w2 -- on w2 LOCK t1 IN ACCESS EXCLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- concurrently on w1 LOCK t1 IN ACCESS EXCLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- acquire dist lock on coor, w1, gets blocked on local AccessShareLock on w2 -- on w2 continuation of the execution above -- starts to acquire dist locks and gets blocked on the coor by the lock acquired by w1 -- distributed deadlock ``` We opt for avoiding such deadlocks at the cost of possibly running into errors when the relations on which we are trying to acquire locks get dropped. (cherry picked from commit `27ddb4fc8e`)	2022-05-23 17:28:37 +03:00
Onder Kalaci	8b0499c91a	Parallelize metadata syncing on node activate It is often useful to be able to sync the metadata in parallel across nodes. Also citus_finalize_upgrade_to_citus11() uses start_metadata_sync_to_primary_nodes() after this commit. Note that this commit does not parallelize all pieces of node activation or metadata syncing. Instead, it tries to parallelize potentially large parts of metadata, which is the objects and distributed tables (in general Citus tables). In the future, it would be nice to sync the reference tables in parallel across nodes. Create ~720 distributed tables / ~23450 shards ```SQL -- declaratively partitioned table CREATE TABLE github_events_looooooooooooooong_name ( event_id bigint, event_type text, event_public boolean, repo_id bigint, payload jsonb, repo jsonb, actor jsonb, org jsonb, created_at timestamp ) PARTITION BY RANGE (created_at); SELECT create_time_partitions( table_name := 'github_events_looooooooooooooong_name', partition_interval := '1 day', end_at := now() + '24 months' ); CREATE INDEX ON github_events_looooooooooooooong_name USING btree (event_id, event_type, event_public, repo_id); SELECT create_distributed_table('github_events_looooooooooooooong_name', 'repo_id'); SET client_min_messages TO ERROR; ``` across 1 node: almost same as expected ```SQL SELECT start_metadata_sync_to_primary_nodes(); Time: 15664.418 ms (00:15.664) select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 14284.069 ms (00:14.284) ``` across 7 nodes: ~3.5x improvement ```SQL SELECT start_metadata_sync_to_primary_nodes(); ┌──────────────────────────────────────┐ │ start_metadata_sync_to_primary_nodes │ ├──────────────────────────────────────┤ │ t │ └──────────────────────────────────────┘ (1 row) Time: 25711.192 ms (00:25.711) -- across 7 nodes select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 82126.075 ms (01:22.126) ``` (cherry picked from commit `dd02e1755f`)	2022-05-23 09:25:31 +02:00
Onder Kalaci	513e073206	Fixes a bug that prevents dropping/altering indexes There are two problems in this area. First, when there are expressions on the index name, we should call `transformIndexExpression()` before generating the index name. That is what Postgres does. Second, because of `40c24bfef9` PG 13 and PG 14 generate different names for indexes with function calls even for local PG tables. Assume we have: ```SQL create table t(id int); select create_distributed_table('t', 'id'); create index ON t (my_very_boring_function(id)); ``` On PG 13, the name of the index is `t_expr_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_expr_idx" btree (my_very_boring_function(id::bigint)) ``` On PG 14, the name of the index is `t_my_very_boring_function_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_my_very_boring_function_idx" btree (my_very_boring_function(id::bigint)) ``` The second issue is not very critical. The important part is that we adjust regression tests to drop all the indexes, which ensures the index names are sane on any version. (cherry picked from commit `2cc4053fc1`)	2022-05-23 09:22:25 +02:00
Onder Kalaci	4b5cb7e2b9	Mark existing views as distributed when upgrade to 11.0+ We have a mechanism which ensures that newly distributed objects are recorded during `alter extension citus update`. However, the logic was lacking "view"s. With this commit, we make sure that existing views are also marked as distributed during upgrade. (cherry picked from commit `ee45e7bfbf`)	2022-05-23 09:22:17 +02:00
Marco Slot	8c5035c0a5	Improve nested execution checks and add GUC to disable	2022-05-20 19:35:59 +02:00
Marco Slot	7c6784b1f4	Add caching for functions that check the backend type	2022-05-20 19:35:52 +02:00


2880 Commits (498131b4f669af036f0e922401df1de3b69f2d82)