citus

Commit Graph

Author	SHA1	Message	Date
Naisila Puka	b982f2dee6	Changes PROCESS_TOAST default value to true (#7122 ) Process toast should be true by default, like in PG.	2023-08-16 14:40:24 +03:00
Naisila Puka	b2291374b4	PG16 compatibility - more test output fixes (#7112 ) PG16 compatibility - part 9 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - Fix multi_subquery_in_where_reference_clause test somehow PG got rid of the outer join (e.g., explain doesn't show outer joins), hence we can pushdown the subquery. Changing to users_reference_table - Fix unqualified column names for views in PG16 Relevant PG commit: `47bb9db759` 47bb9db75996232ea71fc1e1888ffb0e70579b54 - Fix global_cancel test Error wording and detail changed Relevant PG commit: `2631ebab7b` 2631ebab7b18bdc079fd86107c47d6104a6b3c6e - Fix local_table_join_test with lateral subquery Possible relevant PG commit: `ae89129aa3` ae89129aa3555c263b8c3ccc4c0f1ef7e46201aa I removed the where clause and the limit count error was hit again. With the where clause the query unexpectedly works. - Fix test outputs Relevant PG commits: -- `1349d2790b` -- `f4c7c410ee` For multi_explain and multi_complex_count_distinct there were too many places touched so I just added an alternative test output. For the other tests I modified the problematic parts. More PG16 compatibility commits are coming soon ...	2023-08-15 13:49:25 +03:00
Naisila Puka	2c50b5f7ff	PG16 compatibility - varnullingrels additions (#7107 ) PG16 compatibility - part 7 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` This commit is in the series of PG16 compatibility commits. PG16 introduced a new entry varnnullingrels to Var, which represents our partkey in pg_dist_partition. This commit does the necessary changes in Citus to support this. Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-15 13:07:55 +03:00
Naisila Puka	ee3153fe50	PG16 compatibility - more test output fixes (#7108 ) PG16 compatibility - part 7 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - PG16 removed logic for converting a table to a view Relevant PG commit: `b23cd185fd` b23cd185fd5410e5204683933f848d4583e34b35 - Fix changed error message in certificate verification Relevant PG commit: `8eda731465` 8eda7314652703a2ae30d6c4a69c378f6813a7f2 - Fix backend type order in tests Relevant PG commit: `0c679464a8` 0c679464a837079acc75ff1d45eaa83f79e05690 - Reduce log level to omit extra NOTICE in create collation in PG16 Relevant PG commit: `a14e75eb0b` a14e75eb0b6a73821e0d66c0d407372ec8376105 That commit made LOCALE parameter apply regardless of the provider used, and it printed the following notice: NOTICE: using standard form "und-u-ks-level2" for ICU locale "@colStrength=secondary" We omit this notice to omit output change between pg versions. - Fix columnar_memory test TopMemoryContext now has more children contexts Possible relevant PG commit: `9d3ebba729` 9d3ebba729ebaf5882a92f0f5f662a3312037605 memusage is now around 8.5 MB, whereas it was less than 8MB before. To avoid differences between PG versions, I changed the test to compare to less than 9 MB. It still reflects very well the improvement from 28MB. - Alternative test output for GRANTOR values in pg_auth_members grantor changed in PG16 Relevant PG commit: `ce6b672e44` ce6b672e4455820a0348214be0da1a024c3f619f - Remove redundant grouping columns from our tests Relevant PG commit: `8d83a5d0a2` 8d83a5d0a2673174dc478e707de1f502935391a5 - Fix tests with different order in Filters Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-09 18:04:32 +03:00
Naisila Puka	b36c431abb	PG16 compatibility - Rework PlannedStmt and Query's Permission Info (#7098 ) PG16 compatibility - Part 6 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` This commit is in the series of PG16 compatibility commits. It handles the Permission Info changes in PG16. See below: The main issue lies in the following entries of PlannedStmt: { rtable permInfos } Each rtable has an int perminfoindex, and its actual permission info is obtained through the following: permInfos[perminfoindex] We had crashes because perminfoindexes were not updated in the finalized planned statement after distributed planner hook. So, basically, everywhere we set a query's or planned statement's rtable entry, we need to set the rteperminfos/permInfos accordingly. Relevant PG commits: `a61b1f7482` a61b1f74823c9c4f79c95226a461f1e7a367764b `b803b7d132` b803b7d132e3505ab77c29acf91f3d1caa298f95 More PG16 compatibility commits are coming soon ...	2023-08-09 15:23:00 +03:00
Naisila Puka	6056cb2c29	PG16 compatibility - get_relation_info hook to avoid crash from adjusted partitioning (#7099 ) PG16 compatibility - Part 5 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` This commit is in the series of PG16 compatibility commits. Find the explanation below: If we allow to adjust partitioning, we get a crash when accessing amcostestimate of partitioned indexes, because amcostestimate is NULL for them. The following PG commit is the culprit: `3c569049b7` 3c569049b7b502bb4952483d19ce622ff0af5fd6 Previously, partitioned indexes would just be ignored. Now, they are added in the list. However get_relation_info expects the tables which have partitioned indexes to have the inh flag set properly. AdjustPartitioningForDistributedPlanning plays with that flag, hence we don't get the desired behaviour. The hook is simply removing all partitioned indexes from the list. More PG16 compatibility commits are coming soon ...	2023-08-08 15:51:21 +03:00
Naisila Puka	7c6b4ce103	PG16 compatibility - outer join checks, subscription password, crash fixes (#7097 ) PG16 compatibility - Part 4 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` This commit is in the series of PG16 compatibility commits. It adds some outer join checks to the planner, the new password_required option to the subscription, and a crash fix related to PGIOAlignedBlock, see below for more details: - Fix PGIOAlignedBlock Assert crash in PG16 Relevant PG commit: `faeedbcefd` faeedbcefd40bfdf314e048c425b6d9208896d90 - Pass planner info as argument to make_simple_restrictinfo Pre PG16 passing plannerInfo to make_simple_restrictinfo was only needed for placeholder Vars, which is not the case in this part of the codebase because we are building the expression from shard intervals which don't have placeholder vars. However, PG16 is counting baserels appearing in clause_relids and is deleting the rels mentioned in plannerinfo->outer_join_rels Hence directly accessing plannerinfo. We will crash if we leave it as NULL. For reference `2489d76c49 (diff-e045c41eda9686451a7993e91518e40056b3739365e39eb1b70ae438dc1f7c76R207)` Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d - Add outer join checks, root->simple_rel_array - fix rebalancer to include passwork_required option Relevant PG commit: `c3afe8cf5a` c3afe8cf5a1e465bd71e48e4bc717f5bfdc7a7d6 More PG16 compatibility commits are coming soon ...	2023-08-04 14:51:28 +03:00
Naisila Puka	907d72e60d	PG16 compatibility - some test outputs (#7100 ) PG16 compatibility - Part 3 Check out part 1 `42d956888d` and part 2 `0d503dd5ac` This commit is in the series of PG compatibility. It makes some changes to our tests in order to be compatible with the following in PG16: Use debug_parallel_query in PG16+, force_parallel_mode otherwise Relevant PG commit `5352ca22e0` 5352ca22e0012d48055453ca9992a9515d811291 HINT changed to DETAIL in PG16 Relevant PG commit: `56d0ed3b75` 56d0ed3b756b2e3799a7bbc0ac89bc7657ca2c33 Fix removed read-only server setting lc_collate Relevant PG commit: `b0f6c43716` b0f6c437160db640d4ea3e49398ebc3ba39d1982 Fix unsupported join alias expression in sqlancer_failures Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-04 13:03:15 +03:00
Önder Kalacı	4ae3982d14	Add single-shard router Merge command support (#7088 ) Similar to https://github.com/citusdata/citus/pull/7077. As PG 16+ has changed the join restriction information for certain outer joins, MERGE is also impacted given that is is also underlying an outer join. See #7077 for the details.	2023-08-04 08:16:29 +03:00
Naisila Puka	0d503dd5ac	PG16 compatibility: ruleutils and successful CREATE EXTENSION (#7087 ) PG16 compatibility - Part 2 Part 1 provided successful compilation against pg16beta2. `42d956888d` This PR provides ruleutils changes with pg16beta2 and successful CREATE EXTENSION command. Note that more changes are needed in order to have successful regression tests. More commits are coming soon ... For any_value changes, I referred to this commit `8ef94dc1f5` where we did something similar for PG14 support.	2023-08-02 16:04:51 +03:00
Önder Kalacı	960a5f6104	Improve failure handling of distributed execution (#7090 ) Prior to this commit, the code would skip processing the errors happened for local commands. Prior to https://github.com/citusdata/citus/pull/5379, it might make sense to allow the execution continue. But, as of today, if a modification fails on any placement, we can safely fail the execution. The first commit show the problem in action. The second commit includes the fix and the test fixes.	2023-08-01 16:47:59 +03:00
Onur Tirtir	dd6ea1ebd5	Makes sure to handle NULL constraints for ADD COLUMN commands (#7093 ) DESCRIPTION: Fixes a bug that causes an unexpected error when adding a column with a NULL constraint Fixes https://github.com/citusdata/citus/issues/7092.	2023-08-01 11:07:47 +03:00
Önder Kalacı	cb5eb73048	Add support for router INSERT .. SELECT commands (#7077 ) Tradionally our planner works in the following order: router - > pushdown -> repartition -> pull to coordinator However, for INSERT .. SELECT commands, we did not support "router". In practice, that is not a big issue, because pushdown planning can handle router case as well. However, with PG 16, certain outer joins are converted to JOIN without any conditions (e.g., JOIN .. ON (true)) and the filters are pushed down to the tables. When the filters are pushed down to the tables, router planner can detect. However, pushdown planner relies on JOIN conditions. An example query: ``` INSERT INTO agg_events (user_id) SELECT raw_events_first.user_id FROM raw_events_first LEFT JOIN raw_events_second ON raw_events_first.user_id = raw_events_second.user_id WHERE raw_events_first.user_id = 10; ``` As a side effect of this change, now we can also relax certain limitation that "pushdown" planner emposes, but not "router". So, with this PR, we also allow those. Closes https://github.com/citusdata/citus/pull/6772 DESCRIPTION: Prevents unnecessarily pulling the data into coordinator for some INSERT .. SELECT queries that target a single-shard group	2023-07-28 15:07:20 +03:00
Teja Mupparti	846cbc3a39	In the MERGE join clause, there is a datatype mismatch between target's distribution column and the expression originating from the source. If the types are different, Citus uses different hash functions for the two column types, which might lead to incorrect repartitioning of the result data	2023-07-27 16:06:00 -07:00
Nils Dijk	186804c119	fix flappyness of shard_rebalancer operations test (#7083 ) Fixes flappyness where the order of shards was dependent on the physical layout in the heap. Failed here https://app.circleci.com/pipelines/github/citusdata/citus/33844/workflows/1651f8f5-6e6a-457e-9d35-34b8788ea6d1/jobs/1189836 ```diff --- /home/circleci/project/src/test/regress/expected/shard_rebalancer.out.modified 2023-07-24 12:51:27.126284675 +0000 +++ /home/circleci/project/src/test/regress/results/shard_rebalancer.out.modified 2023-07-24 12:51:27.170285079 +0000 @@ -2571,24 +2571,24 @@ CREATE TABLE test_with_all_shards_excluded(a int PRIMARY KEY); SELECT create_distributed_table('test_with_all_shards_excluded', 'a', colocate_with:='none', shard_count:=4); create_distributed_table -------------------------- (1 row) SELECT shardid FROM pg_dist_shard; shardid --------- - 433504 433505 433506 433507 + 433504 (4 rows) SELECT rebalance_table_shards('test_with_all_shards_excluded', excluded_shard_list:='{102073, 102074, 102075, 102076}'); rebalance_table_shards ------------------------ (1 row) DROP TABLE test_with_all_shards_excluded; SET citus.shard_count TO 2; ```	2023-07-27 16:24:35 +02:00
Carol Smith	df86a91393	Rename CODEOFCONDUCT.MD to CODE_OF_CONDUCT.md	2023-07-25 08:18:22 -07:00
Carol Smith	a42f58c7c4	Create CODEOFCONDUCT.MD Adding Code of Conduct file to /citus repo reflecting the Microsoft Open Source Code of Conduct.	2023-07-25 08:18:22 -07:00
zhjwpku	6a00517312	[typo] fix typo in comments (#7073 ) %s/pg_dist_local_node_group/pg_dist_local_group/g Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2023-07-25 16:43:55 +03:00
Önder Kalacı	862dae823e	Expand EnableNonColocatedRouterQueryPushdown to cover shard colocation (e.g., shard index) (#7076 ) Previously, we only checked whether the relations are colocated, but we ignore the shard indexes. That causes certain queries still to be accidentally router. We should enforce colocation checks for both shard index and table colocation id to make the check restrictive enough. For example, the following query should not be router, and after this patch, it won't: ```SQL SELECT user_id FROM ((SELECT user_id FROM raw_events_first WHERE user_id = 15) EXCEPT (SELECT user_id FROM raw_events_second where user_id = 17)) as foo; ``` DESCRIPTION: Enforce shard level colocation with citus.enable_non_colocated_router_query_pushdown	2023-07-25 16:20:13 +03:00
ahmet gedemenli	3f11139b5c	Do not move a shard to a node that it already exists on	2023-07-25 13:38:33 +03:00
ahmet gedemenli	c968dc9c27	Do not rebalance if replication factor is greater than the node count	2023-07-25 13:38:33 +03:00
Nils Dijk	c2f46f0f3f	Update README.md - slack badge (#7075 ) Use a badge for slack again, although no member count, still better compared to the text.	2023-07-24 14:48:49 +02:00
Gürkan İndibay	3f0e1efb5a	Fixes error surpressions in packaging pipelines (#7054 ) DESCRIPTION: PR description that will go into the change log, up to 78 characters There are 4 errors arised recently and I fixed them in this PR. Problems and fixes are as below: 1. When executing make step in packaging pipeline, if it gets error, we can not detect it since there are additional operations after make in one line. With this fix, now if an error occured after make execution, we can detect and see the step red and failed here, 2. Recently we started to get the error ` fatal: detected dubious ownership in repository at '/__w/citus/citus' ` as below https://github.com/citusdata/citus/actions/runs/5542692968/jobs/10117706723#step:7:9 There is a fix for that one as well. 3. fixed the requirements issue arised related to urllib3 library version 4. Getting errors with centos-8 docker image with the new postgres-dev packages. Now, changed centos-8 image with almalinux-8 and now it works	2023-07-24 14:44:27 +03:00
Carol Smith	da7dd1cc54	Update README.md Adding code of conduct language to README doc.	2023-07-21 17:10:45 -07:00
Naisila Puka	42d956888d	PG16 compatibility: Resolve compilation issues (#7005 ) This PR provides successful compilation against PG16Beta2. It does some necessary refactoring to prepare for full support of version 16, in https://github.com/citusdata/citus/pull/6952 . Change RelFileNode to RelFileNumber or RelFileLocator Relevant PG commit b0a55e43299c4ea2a9a8c757f9c26352407d0ccc new header for varatt.h Relevant PG commit: d952373a987bad331c0e499463159dd142ced1ef drop support for Abs, use fabs Relevant PG commit 357cfefb09115292cfb98d504199e6df8201c957 tuplesort PGcommit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Relevant PG commit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Fix vacuum in columnar Relevant PG commit: 4ce3afb82ecfbf64d4f6247e725004e1da30f47c older one: b6074846cebc33d752f1d9a66e5a9932f21ad177 Add alloc_flags to pg_clean_ascii Relevant PG commit: 45b1a67a0fcb3f1588df596431871de4c93cb76f Merge GetNumConfigOptions() into get_guc_variables() Relevant PG commit: 3057465acfbea2f3dd7a914a1478064022c6eecd Minor PG refactor PG_FUNCNAME_MACRO __func__ Relevant PG commit 320f92b744b44f961e5d56f5f21de003e8027a7f Pass NULL context to stringToQualifiedNameList, typeStringToTypeName The pre-PG16 error behaviour for the following stringToQualifiedNameList & typeStringToTypeName was ereport(ERROR, ...) Now with PG16 we have this context input. We preserve the same behaviour by passing a NULL context, because of the following: (copy paste comment from PG16) If "context" isn't an ErrorSaveContext node, this behaves as errstart(ERROR, domain), and the errsave() macro ends up acting exactly like ereport(ERROR, ...). Relevant PG commit 858e776c84f48841e7e16fba7b690b76e54f3675 Use RangeVarCallbackMaintainsTable instead of RangeVarCallbackOwnsTable Relevant PG commit: 60684dd834a222fefedd49b19d1f0a6189c1632e FIX THIS: Not implemented grant-level control of role inheritance see PG commit e3ce2de09d814f8770b2e3b3c152b7671bcdb83f Make Scan node abstract PG commit: 8c73c11a0d39049de2c1f400d8765a0eb21f5228 Change in Var representations, get_relids_in_jointree PG commit 2489d76c4906f4461a364ca8ad7e0751ead8aa0d Deadlock detection changes because SHM_QUEUE is removed Relevant PG Commit: d137cb52cb7fd44a3f24f3c750fbf7924a4e9532 TU_UpdateIndexes Relevant PG commit 19d8e2308bc51ec4ab993ce90077342c915dd116 Use object_ownercheck and object_aclcheck functions Relevant PG commits: afbfc02983f86c4d71825efa6befd547fe81a926 c727f511bd7bf3c58063737bcf7a8f331346f253 Rework Permission Info for successful compilation Relevant PG commits: postgres/postgres@a61b1f7 postgres/postgres@b803b7d --------- Co-authored-by: onderkalaci <onderkalaci@gmail.com>	2023-07-21 14:32:37 +03:00
Naisila Puka	a282953274	Fix ScanKeyInit RegProcedure and Datum arguments (#7072 ) Index scans in PG16 return empty sets because of extra compatibility enforcement for `ScanKeyInit` arguments. Could be one of the relevant PG commits: `c8b2ef05f4` This PR fixes all incompatible `RegProcedure` and `Datum` arguments in all `ScanKeyInit` functions used throughout the codebase. Helpful for https://github.com/citusdata/citus/pull/6952	2023-07-21 14:11:10 +03:00
Teja Mupparti	87dc88f837	Isolate schema sharding/MERGE tests into a new file, and use the new GUC parameter	2023-07-19 12:23:45 -07:00
mulander	6498e1eb6c	Fix typo in distributed (#7069 ) DESCRIPTION: PR description that will go into the change log, up to 78 characters	2023-07-18 21:02:09 +02:00
aykut-bozkurt	832fc4a8f0	readme update for 12.0 (#7068 )	2023-07-18 20:09:27 +03:00
Nils Dijk	96a3d82e13	Update slack link in README.md for self-serve signup (#7058 ) The link in our readme directly goes to our channel, meaning people finding the link here for the first time are unable to join slack this way. Given that the target audience using this link is most likely not part of the slack channel yet it would be better to link to our self serve signup flow at slack.citusdata.com, which is the same we use on citusdata.com. From simple testing you should still get redirected to the channel if you are already joined and signed in.	2023-07-17 12:59:46 +02:00
Halil Ozan Akgül	c99a93ffa7	Move SQL file changes for citus_shard_sizes fixes into the new 11.3-2 version (#7050 ) This PR moves `citus_shard_sizes` changes from #7003, and #7018 to into a new Citus version, 11.3-2	2023-07-14 17:19:54 +03:00
aykut-bozkurt	609a5465ea	Bump Citus version into 12.1devel (#7061 )	2023-07-14 13:12:30 +03:00
Gürkan İndibay	0f0b60c29c	Fix format attribute and IsLocalReplicationOriginSessionActive errors (#7055 ) This PR fixes the following: - in oraclelinux-7 `Make` step ``` /usr/bin/ld: utils/replication_origin_session_utils.o: relocation R_X86_64_PC32 against undefined symbol `IsLocalReplicationOriginSessionActive' can not be used when making a shared object; recompile with -fPIC /usr/bin/ld: final link failed: Bad value collect2: error: ld returned 1 exit status ``` `IsLocalReplicationOriginSessionActive` function has improper inline declaration, fixed that - in centos-7 `Make` step ``` utils/background_jobs.c: In function 'StartCitusBackgroundTaskExecutor': utils/background_jobs.c:1746:6: warning: function might be possible candidate for 'gnu_printf' format attribute [-Wsuggest-attribute=format] database, user, jobId, taskId); ^ ``` should use `pg_attribute_printf(3,4)` instead of `pg_attribute_printf(3,0)` since the number of arguments varies for `SafeSnprintf(char str, rsize_t count, const char fmt, ...)` --------- Co-authored-by: naisila <nicypp@gmail.com>	2023-07-13 17:41:57 +03:00
aykut-bozkurt	ee255cd46e	Changelog entries for 12.0.0 (#7049 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Gokhan Gulbiz <ggulbiz@gmail.com>	2023-07-13 14:46:58 +03:00
Onur Tirtir	2c11e4d7f9	Deparse ALTER TABLE commands if ADD COLUMN is the only subcommand (#7032 ) Some clients send ALTER TABLE .. ADD COLUMN .. commands together with some other DDLs and this makes it impossible to directly send the original DDL command to the workers. For this reason, this commit adds support for deparsing such ALTER TABLE commands so that we can avoid from directly sending the original one to the workers. Partially fixes https://github.com/citusdata/citus/issues/690. Fixes #3678	2023-07-12 18:28:45 +03:00
Onur Tirtir	f3cdb6d1bf	Deparse ALTER TABLE commands if ADD COLUMN is the only subcommand And stabilize multi_alter_table_statements.sql.	2023-07-12 18:17:47 +03:00
Onur Tirtir	6365f47b57	Properly handle index storage options for ADD CONSTRAINT / COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	ae142e1764	Properly handle IF NOT EXISTS for ADD COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	d4789a2c3a	Stabilize test helper sql files multi_test_helpers is run in parallel with others, so need to stabilize other test helpers too to make multi_test_helpers runnable multiple times.	2023-07-06 10:47:41 +03:00
Onur Tirtir	001437bdfe	Refactor AppendAlterTableCmdAddConstraint to reuse it for ADD COLUMN too	2023-07-06 10:47:41 +03:00
Onur Tirtir	56f1daa800	Refactor the code that extends constraint/index names on shards into a func	2023-07-06 10:47:41 +03:00
Onur Tirtir	ba1ea9b5bd	Refactor the code that prepares constraint objects in an alter table stmt into a func	2023-07-06 10:47:41 +03:00
Halil Ozan Akgül	613cced1ae	Use citus_shard_sizes in citus_tables (#7018 ) Fixes #7019 This PR updates citus_tables view to use citus_shard_sizes function, instead of citus_total_relation_size to improve performance.	2023-07-05 11:40:34 +03:00
aykut-bozkurt	719d92c8b9	mat view should not be converted to tenant table (#7043 ) We allow materialized view to exist in distrbuted schema but they should not be tried to be converted to a tenant table since they cannot be distributed. Fixes https://github.com/citusdata/citus/issues/7041	2023-07-04 17:28:03 +03:00
Ahmet Gedemenli	5051be86ff	Skip distributed schema insertion into pg_dist_schema, if already exists (#7044 ) Inserting into `pg_dist_schema` causes unexpected duplicate key errors, for distributed schemas that already exist. With this commit we skip the insertion if the schema already exists in `pg_dist_schema`. The error: ```sql SET citus.enable_schema_based_sharding TO ON; CREATE SCHEMA sc2; CREATE SCHEMA IF NOT EXISTS sc2; NOTICE: schema "sc2" already exists, skipping ERROR: duplicate key value violates unique constraint "pg_dist_schema_pkey" DETAIL: Key (schemaid)=(17294) already exists. ``` fixes: #7042	2023-07-04 15:19:07 +03:00
Gokhan Gulbiz	e0d3476526	Add locking mechanism for tenant monitoring probabilistic approach (#7026 ) This PR * Addresses a concurrency issue in the probabilistic approach of tenant monitoring by acquiring a shared lock for tenant existence checks. * Changes `citus.stat_tenants_sample_rate_for_new_tenants` type to double * Renames `citus.stat_tenants_sample_rate_for_new_tenants` to `citus.stat_tenants_untracked_sample_rate`	2023-07-03 13:08:03 +03:00
Jelte Fennema	ac24e11986	Change default rebalance strategy to by_disk_size (#7033 ) DESCRIPTION: Change default rebalance strategy to by_disk_size When introducing rebalancing by disk size we didn't make it the default initially. The main reason was, because we expected some problems with it. We have indeed had some problems/bugs with it over the years, and have fixed all of them. By now we're quite confident in its stability, and that it pretty much always gives better results than by_shard_count. So this PR makes by_disk_size the new default. We don't change the default when some other strategy than by_shard_count is the current default. This is in case someone defined their own rebalance strategy and marked this as the default themselves. Note: It explicitly does nothing during a downgrade, because there's no way of knowing if the rebalance strategy before the upgrade was by_disk_size or by_shard_count. And even in previous versions by_disk_size is considered superior for quite some time.	2023-07-03 11:08:24 +02:00
Jelte Fennema	fd1427de2c	Change by_disk_size rebalance strategy to have a base size (#7035 ) One problem with rebalancing by disk size is that shards in newly created collocation groups are considered extremely small. This can easily result in bad balances if there are some other collocation groups that do have some data. One extremely bad example of this is: 1. You have 2 workers 2. Both contain about 100GB of data, but there's a 70MB difference. 3. You create 100 new distributed schemas with a few empty tables in them 4. You run the rebalancer 5. Now all new distributed schemas are placed on the node with that had 70MB less. 6. You start loading some data in these shards and quickly the balance is completely off To address this edge case, this PR changes the by_disk_size rebalance strategy to add a a base size of 100MB to the actual size of each shard group. This can still result in a bad balance when shard groups are empty, but it solves some of the worst cases.	2023-06-27 16:37:09 +02:00
Halil Ozan Akgül	03a4769c3a	Fix Reference Table Check for CDC (#7025 ) Previously reference table check only looked at `partition method = 'n'`. This PR adds `replication model = 't'` to that.	2023-06-23 16:37:35 +03:00
Teja Mupparti	387b5f80f9	Fixes the bug#6785	2023-06-22 10:44:45 -07:00

... 3 4 5 6 7 ...

6847 Commits (d28a5eae6c78935313824d319480632783d48d10) All Branches Search

6847 Commits (d28a5eae6c78935313824d319480632783d48d10)

All Branches