citus

Commit Graph

Author	SHA1	Message	Date
Jelte Fennema	6460fc45e4	Fix crash in shard rebalancer when no distributed tables exist (#5205 ) The logging of the amount of ignored moves crashed when no distributed tables existed in a cluster. This also fixes in passing that the logging of ignored moves logs the correct number of ignored moves if there exist multiple colocation groups and all are rebalanced at the same time. (cherry picked from commit `481f8be084`)	2021-09-01 11:55:31 +02:00
Hanefi Onaldi	280bc704d0	Bump Citus version to 10.1.2	2021-08-16 17:26:07 +03:00
Hanefi Onaldi	fb0bb40225	Add changelog entries for 10.1.2 (cherry picked from commit `da29a57837`)	2021-08-16 17:24:45 +03:00
Onder Kalaci	9f4b6a6cb9	Guard against hard WaitEvenSet errors In short, add wrappers around Postgres' AddWaitEventToSet() and ModifyWaitEvent(). AddWaitEventToSet()/ModifyWaitEvent*() may throw hard errors. For example, when the underlying socket for a connection is closed by the remote server and already reflected by the OS, however Citus hasn't had a chance to get this information. In that case, if replication factor is >1, Citus can failover to other nodes for executing the query. Even if replication factor = 1, Citus can give much nicer errors. So CitusAddWaitEventSetToSet()/CitusModifyWaitEvent() simply puts AddWaitEventToSet()/ModifyWaitEvent() into a PG_TRY/PG_CATCH block in order to catch any hard errors, and returns this information to the caller.	2021-08-10 09:36:11 +02:00
Onder Kalaci	3c5ea1b1f2	Adjust the tests to earlier versions - Drop PRIMARY KEY for Citus 10 compatibility - Drop columnar for PG 12 - Do not start/stop metadata sync as stop is not implemented in 10.1	2021-08-06 15:56:17 +02:00
Onder Kalaci	0eb5c144ed	Dropped columns do not diverge distribution column for partitioned tables Before this commit, creating a partition after a DROP column on the parent (position before dist. key) was leading to partition to have the wrong distribution column.	2021-08-06 13:42:40 +02:00
Hanefi Onaldi	b3947510b9	Bump Citus version to 10.1.1	2021-08-05 20:50:42 +03:00
Hanefi Onaldi	e64e627e31	Add changelog entries for 10.1.1 (cherry picked from commit `bc5553b5d1`)	2021-08-05 20:49:43 +03:00
naisila	c456a933f0	Fix master_update_table_statistics scripts for 9.5	2021-08-03 16:46:05 +03:00
naisila	86f1e181c4	Fix master_update_table_statistics scripts for 9.4	2021-08-03 16:46:05 +03:00
Jelte Fennema	84410da2ba	Fix showing target shard size in the rebalance progress monitor (#5136 ) The progress monitor wouldn't actually update the size of the shard on the target node when using "block_writes" as the `shard_transfer_mode`. The reason for this is that the CREATE TABLE part of the shard creation would only be committed once all data was moved as well. This caused our size calculation to always return 0, since the table did not exist yet in the session that the progress monitor used. This is fixed by first committing creation of the table, and only then starting the actual data copy. The test output changes slightly. Apparently splitting this up in two transactions instead of one, increases the table size after the copy by about 40kB. The additional size used doesn't increase when with the amount of data in the table is larger (it stays ~40kB per shard). So this small change in test output is not considered an actual problem. (cherry picked from commit `2aa67421a7`)	2021-07-23 16:49:39 +02:00
Önder Kalacı	106d68fd61	CLUSTER ON deparser should consider schemas (#5122 ) (cherry picked from commit `87a51ae552`)	2021-07-16 19:16:37 +03:00
Hanefi Onaldi	f571abcca6	Add changelog entries for 10.1.0 This patch also moves the section to the top of the changelog (cherry picked from commit `6b4996f47e`)	2021-07-16 18:09:17 +03:00
Sait Talha Nisanci	6fee3068e3	Not include to-be-deleted shards while finding shard placements Ignore orphaned shards in more places Only use active shard placements in RouterInsertTaskList Use IncludingOrphanedPlacements in some more places Fix comment Add tests (cherry picked from commit `e7ed16c296`) Conflicts: src/backend/distributed/planner/multi_router_planner.c Quite trivial conflict that was easy to resolve	2021-07-14 19:28:32 +03:00
Jelte Fennema	6f400dab58	Fix check to always allow foreign keys to reference tables (#5073 ) With the previous version of this check we would disallow distributed tables that did not have a colocationid, to have a foreign key to a reference table. This fixes that, since there's no reason to disallow that. (cherry picked from commit `e9bfb8eddd`)	2021-07-14 19:09:58 +03:00
Jelte Fennema	90da684f56	Only allow moves of shards of distributed tables (#5072 ) Moving shards of reference tables was possible in at least one case: ```sql select citus_disable_node('localhost', 9702); create table r(x int); select create_reference_table('r'); set citus.replicate_reference_tables_on_activate = off; select citus_activate_node('localhost', 9702); select citus_move_shard_placement(102008, 'localhost', 9701, 'localhost', 9702); ``` This would then remove the reference table shard on the source, causing all kinds of issues. This fixes that by disallowing all shard moves except for shards of distributed tables. Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> (cherry picked from commit `d1d386a904`)	2021-07-14 19:07:45 +03:00
Jelte Fennema	6986ac2f17	Avoid two race conditions in the rebalance progress monitor (#5050 ) The first and main issue was that we were putting absolute pointers into shared memory for the `steps` field of the `ProgressMonitorData`. This pointer was being overwritten every time a process requested the monitor steps, which is the only reason why this even worked in the first place. To quote a part of a relevant stack overflow answer: > First of all, putting absolute pointers in shared memory segments is > terrible terible idea - those pointers would only be valid in the > process that filled in their values. Shared memory segments are not > guaranteed to attach at the same virtual address in every process. > On the contrary - they attach where the system deems it possible when > `shmaddr == NULL` is specified on call to `shmat()` Source: https://stackoverflow.com/a/10781921/2570866 In this case a race condition occurred when a second process overwrote the pointer in between the first process its write and read of the steps field. This issue is fixed by not storing the pointer in shared memory anymore. Instead we now calculate it's position every time we need it. The second race condition I have not been able to trigger, but I found it while investigating this. This issue was that we published the handle of the shared memory segment, before we initialized the data in the steps. This means that during initialization of the data, a call to `get_rebalance_progress()` could read partial data in an unsynchronized manner. (cherry picked from commit `ca00b63272`)	2021-07-14 19:06:32 +03:00
Marco Slot	998b044fdc	Fix a bug that causes worker_create_or_alter_role to crash with NULL input (cherry picked from commit `a7e4d6c94a`)	2021-07-14 13:56:43 +03:00
Naisila Puka	1507f32282	Fix nextval('seq_name'::text) bug, and schema for seq tests (#5046 ) (cherry picked from commit `e26b29d3bb`)	2021-07-14 13:55:48 +03:00
Hanefi Onaldi	20e500f96b	Remove public schema dependency for 10.1 upgrades This commit contains a subset of the changes that should be cherry picked to 10.1 releases. (cherry picked from commit `efc5776451`)	2021-07-09 12:12:19 +03:00
Hanefi Onaldi	60424534ef	Remove public schema dependency for 10.0 upgrades This commit contains a subset of the changes that should be cherry picked to 10.0 releases. (cherry picked from commit `8e9cc229ff`)	2021-07-09 12:12:01 +03:00
Nils Dijk	fefaed37e7	fix 10.1-1 upgrade script to adhere to idempotency	2021-07-08 12:25:32 +02:00
Nils Dijk	5adc151e7c	fix 9.5-2 upgrade script to adhere to idempotency	2021-07-08 12:25:32 +02:00
Nils Dijk	6192dc2bff	Add test for idempotency of citus_prepare_pg_upgrade	2021-07-08 12:25:32 +02:00
Onur Tirtir	3f6e903722	Fix lower boundary calculation when pruning range dist table shards (#5082 ) This happens only when we have a "<" or "<=" filter on distribution column of a range distributed table and that filter falls in between two shards. When the filter falls in between two shards: If the filter is ">" or ">=", then UpperShardBoundary was returning "upperBoundIndex - 1", where upperBoundIndex is exclusive shard index used during binary seach. This is expected since upperBoundIndex is an exclusive index. If the filter is "<" or "<=", then LowerShardBoundary was returning "lowerBoundIndex + 1", where lowerBoundIndex is inclusive shard index used during binary seach. On the other hand, since lowerBoundIndex is an inclusive index, we should just return lowerBoundIndex instead of doing "+ 1". Before this commit, we were missing leftmost shard in such queries. * Remove useless conditional branches The branch that we delete from UpperShardBoundary was obviously useless. The other one in LowerShardBoundary became useless after we remove "+ 1" from there. This indeed is another proof of what & how we are fixing with this pr. * Improve comments and add more * Add some tests for upper bound calculation too (cherry picked from commit `b118d4188e`)	2021-07-07 13:14:27 +03:00
Marco Slot	9efd8e05d6	Fix PG upgrade scripts for 10.1	2021-07-06 16:08:47 +02:00
Marco Slot	210bcdcc08	Fix PG upgrade scripts for 10.0	2021-07-06 16:08:46 +02:00
Marco Slot	d3417a5e34	Fix PG upgrade scripts for 9.5	2021-07-06 16:08:46 +02:00
Marco Slot	e2330e8f87	Fix PG upgrade scripts for 9.4	2021-07-06 16:08:46 +02:00
Onder Kalaci	690dab316a	fix regression tests to avoid any conflicts in enterprise	2021-06-22 08:49:48 +03:00
Onder Kalaci	be6e372b27	Deparse/parse the local cached queries With local query caching, we try to avoid deparse/parse stages as the operation is too costly. However, we can do deparse/parse operations once per cached queries, right before we put the plan into the cache. With that, we avoid edge cases like (4239) or (5038). In a sense, we are making the local plan caching behave similar for non-cached local/remote queries, by forcing to deparse the query once. (cherry picked from commit `69ca943e58`)	2021-06-22 08:24:15 +03:00
Onder Kalaci	3d6bc315ab	Get ready for Improve index backed constraint creation for online rebalancer See: https://github.com/citusdata/citus-enterprise/issues/616 (cherry picked from commit `bc09288651`)	2021-06-22 08:23:57 +03:00
Ahmet Gedemenli	4a904e070d	Set table size to zero if no size is read (#5049 ) (#5056 ) * Set table size to zero if no size is read * Add comment to relation size bug fix (cherry picked from commit `5115100db0`)	2021-06-21 14:36:13 +03:00
Halil Ozan Akgul	f8e06fb1ed	Bump citus version to 10.1.0	2021-06-15 18:50:09 +03:00
Halil Ozan Akgül	72eb37095b	Merge pull request #5043 from citusdata/citus-10.1.0-changelog-1623733267 Update Changelog for 10.1.0	2021-06-15 17:21:19 +03:00
Halil Ozan Akgul	91db015051	Add changelog entry for 10.1.0	2021-06-15 14:28:15 +03:00
Jelte Fennema	4c3934272f	Improve performance of citus_shards (#5036 ) We were effectively joining on a calculated column because of our calls to `shard_name`. This caused a really bad plan to be generated. In my specific case it was taking ~18 seconds to show the output of citus_shards. It had this explain plan: ``` QUERY PLAN ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── Subquery Scan on citus_shards (cost=18369.74..18437.34 rows=5408 width=124) (actual time=18277.461..18278.509 rows=5408 loops=1) -> Sort (cost=18369.74..18383.26 rows=5408 width=156) (actual time=18277.457..18277.726 rows=5408 loops=1) Sort Key: ((pg_dist_shard.logicalrelid)::text), pg_dist_shard.shardid Sort Method: quicksort Memory: 1629kB CTE shard_sizes -> Function Scan on citus_shard_sizes (cost=0.00..10.00 rows=1000 width=40) (actual time=71.137..71.934 rows=5413 loops=1) -> Hash Join (cost=177.62..18024.42 rows=5408 width=156) (actual time=77.985..18257.237 rows=5408 loops=1) Hash Cond: ((pg_dist_shard.logicalrelid)::oid = (pg_dist_partition.logicalrelid)::oid) -> Hash Join (cost=169.81..371.98 rows=5408 width=48) (actual time=1.415..13.166 rows=5408 loops=1) Hash Cond: (pg_dist_placement.groupid = pg_dist_node.groupid) -> Hash Join (cost=168.68..296.49 rows=5408 width=16) (actual time=1.403..10.011 rows=5408 loops=1) Hash Cond: (pg_dist_placement.shardid = pg_dist_shard.shardid) -> Seq Scan on pg_dist_placement (cost=0.00..113.60 rows=5408 width=12) (actual time=0.004..3.684 rows=5408 loops=1) Filter: (shardstate = 1) -> Hash (cost=101.08..101.08 rows=5408 width=12) (actual time=1.385..1.386 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 318kB -> Seq Scan on pg_dist_shard (cost=0.00..101.08 rows=5408 width=12) (actual time=0.003..0.688 rows=5408 loops=1) -> Hash (cost=1.06..1.06 rows=6 width=40) (actual time=0.007..0.007 rows=6 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 9kB -> Seq Scan on pg_dist_node (cost=0.00..1.06 rows=6 width=40) (actual time=0.004..0.005 rows=6 loops=1) -> Hash (cost=5.69..5.69 rows=169 width=130) (actual time=0.070..0.071 rows=169 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 36kB -> Seq Scan on pg_dist_partition (cost=0.00..5.69 rows=169 width=130) (actual time=0.009..0.041 rows=169 loops=1) SubPlan 2 -> Limit (cost=0.00..3.25 rows=1 width=8) (actual time=3.370..3.370 rows=1 loops=5408) -> CTE Scan on shard_sizes (cost=0.00..32.50 rows=10 width=8) (actual time=3.369..3.369 rows=1 loops=5408) Filter: ((shard_name(pg_dist_shard.logicalrelid, pg_dist_shard.shardid) = table_name) OR (('public.'::text \|\| shard_name(pg_dist_shard.logicalrelid, pg_dist_shard.shardid)) = table_name)) Rows Removed by Filter: 2707 Planning Time: 0.705 ms Execution Time: 18278.877 ms ``` With the changes it only takes 180ms to show the same output: ``` QUERY PLAN ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── Sort (cost=904.59..918.11 rows=5408 width=156) (actual time=182.508..182.960 rows=5408 loops=1) Sort Key: ((pg_dist_shard.logicalrelid)::text), pg_dist_shard.shardid Sort Method: quicksort Memory: 1629kB -> Hash Join (cost=418.03..569.27 rows=5408 width=156) (actual time=136.333..146.591 rows=5408 loops=1) Hash Cond: ((pg_dist_shard.logicalrelid)::oid = (pg_dist_partition.logicalrelid)::oid) -> Hash Join (cost=410.22..492.83 rows=5408 width=56) (actual time=136.231..140.132 rows=5408 loops=1) Hash Cond: (pg_dist_placement.groupid = pg_dist_node.groupid) -> Hash Right Join (cost=409.09..417.34 rows=5408 width=24) (actual time=136.218..138.890 rows=5408 loops=1) Hash Cond: ((((regexp_matches(citus_shard_sizes.table_name, '_(\d+)$'::text))[1])::integer) = pg_dist_shard.shardid) -> HashAggregate (cost=45.00..48.50 rows=200 width=12) (actual time=131.609..132.481 rows=5408 loops=1) Group Key: ((regexp_matches(citus_shard_sizes.table_name, '_(\d+)$'::text))[1])::integer Batches: 1 Memory Usage: 737kB -> Result (cost=0.00..40.00 rows=1000 width=12) (actual time=107.786..129.831 rows=5408 loops=1) -> ProjectSet (cost=0.00..22.50 rows=1000 width=40) (actual time=107.780..128.492 rows=5408 loops=1) -> Function Scan on citus_shard_sizes (cost=0.00..10.00 rows=1000 width=40) (actual time=107.746..108.107 rows=5414 loops=1) -> Hash (cost=296.49..296.49 rows=5408 width=16) (actual time=4.595..4.598 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 339kB -> Hash Join (cost=168.68..296.49 rows=5408 width=16) (actual time=1.702..3.783 rows=5408 loops=1) Hash Cond: (pg_dist_placement.shardid = pg_dist_shard.shardid) -> Seq Scan on pg_dist_placement (cost=0.00..113.60 rows=5408 width=12) (actual time=0.004..0.837 rows=5408 loops=1) Filter: (shardstate = 1) -> Hash (cost=101.08..101.08 rows=5408 width=12) (actual time=1.683..1.685 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 318kB -> Seq Scan on pg_dist_shard (cost=0.00..101.08 rows=5408 width=12) (actual time=0.004..0.824 rows=5408 loops=1) -> Hash (cost=1.06..1.06 rows=6 width=40) (actual time=0.007..0.008 rows=6 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 9kB -> Seq Scan on pg_dist_node (cost=0.00..1.06 rows=6 width=40) (actual time=0.004..0.006 rows=6 loops=1) -> Hash (cost=5.69..5.69 rows=169 width=130) (actual time=0.079..0.079 rows=169 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 36kB -> Seq Scan on pg_dist_partition (cost=0.00..5.69 rows=169 width=130) (actual time=0.011..0.046 rows=169 loops=1) Planning Time: 0.789 ms Execution Time: 184.095 ms ```	2021-06-14 13:32:30 +02:00
Onur Tirtir	a209999618	Enforce table opt constraints when using alter_columnar_table_set (#5029 )	2021-06-08 17:39:16 +03:00
Hanefi Onaldi	5c6069a74a	Do not rely on fk cache when truncating local data (#5018 )	2021-06-07 11:56:48 +03:00
Marco Slot	9770a1bf00	Merge pull request #5020 from citusdata/disable-dropping-shards	2021-06-04 14:48:11 +02:00
Jelte Fennema	c113cb3198	Merge pull request #5024 from citusdata/cleanup-old-shards-before-rebalance	2021-06-04 14:37:22 +02:00
Jelte Fennema	1a83628195	Use "orphaned shards" naming in more places We were not very consistent in how we named these shards.	2021-06-04 11:39:19 +02:00
Jelte Fennema	3f60e4f394	Add ExecuteCriticalCommandInDifferentTransaction function We use this pattern multiple times throughout the codebase now. Seems like a good moment to abstract it away.	2021-06-04 11:30:27 +02:00
Jelte Fennema	503c70b619	Cleanup orphaned shards before moving when necessary A shard move would fail if there was an orphaned version of the shard on the target node. With this change before actually fail, we try to clean up orphaned shards to see if that fixes the issue.	2021-06-04 11:23:07 +02:00
Jelte Fennema	280b9ae018	Cleanup orphaned shards at the start of a rebalance In case the background daemon hasn't cleaned up shards yet, we do this manually at the start of a rebalance.	2021-06-04 11:23:07 +02:00
Jelte Fennema	7015049ea5	Add citus_cleanup_orphaned_shards UDF Sometimes the background daemon doesn't cleanup orphaned shards quickly enough. It's useful to have a UDF to trigger this removal when needed. We already had a UDF like this but it was only used during testing. This exposes that UDF to users. As a safety measure it cannot be run in a transaction, because that would cause the background daemon to stop cleaning up shards while this transaction is running.	2021-06-04 11:23:07 +02:00
Naisila Puka	0f37ab5f85	Fixes column default coming from a sequence (#4914 ) * Add user-defined sequence support for MX * Remove default part when propagating to workers * Fix ALTER TABLE with sequences for mx tables * Clean up and add tests * Propagate DROP SEQUENCE * Removing function parts * Propagate ALTER SEQUENCE * Change sequence type before propagation & cleanup * Revert "Propagate ALTER SEQUENCE" This reverts commit 2bef64c5a29f4e7224a7f43b43b88e0133c65159. * Ensure sequence is not used in a different column with different type * Insert select tests * Propagate rename sequence stmt * Fix issue with group ID cache invalidation * Add ALTER TABLE ALTER COLUMN TYPE .. precaution * Fix attnum inconsistency and add various tests * Add ALTER SEQUENCE precaution * Remove Citus hook * More tests Co-authored-by: Marco Slot <marco.slot@gmail.com>	2021-06-03 23:02:09 +03:00
Marco Slot	ec9664c5a4	Merge pull request #5021 from citusdata/marcocitus/fix-remove-node	2021-06-03 11:27:57 +02:00
Hanefi Onaldi	056005db4d	Improve tests for truncating local data (#5012 ) We have a slightly different behavior when using truncate_local_data_after_distributing_table UDF on metadata synced clusters. This PR aims to add tests to cover such cases. We allow distributing tables with data that have foreign keys to reference tables only on metadata synced clusters. This is the reason why some of my earlier tests failed when run on a single node Citus cluster.	2021-06-03 08:51:32 +03:00
Nils Dijk	5f76b93eac	fix link to codecov report from badge (#5022 ) links the codecov badge to the codecov report instead of the badge	2021-06-02 16:48:33 +02:00

1 2 3 4 5 ...

4887 Commits (6460fc45e41cb3c057df45c6b5c498b4d557a30f) All Branches Search

4887 Commits (6460fc45e41cb3c057df45c6b5c498b4d557a30f)

All Branches