citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	38b08ebde9	Generalize the error checks while removing node The checks for preventing the removal of a node are very much reference table centric. We are soon going to add the same checks for replicated tables. So, make the checks generic such that: (a) replicated tables fit naturally (b) we can use the same checks in `citus_disable_node`.	2021-11-26 14:25:29 +01:00
Hanefi Onaldi	4c135de9e4	Introduce CI checks for hash comments in specs We do not use comments starting with # in spec files because it creates errors from C preprocessor that expects directives after this character. Instead use C style comments, i.e: // single line comment You can also use multiline comments as well /* * multi line comment */	2021-11-26 14:52:51 +03:00
Halil Ozan Akgul	87a1c760d9	Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 12:09:53 +03:00
Onder Kalaci	121f5c4271	Active placements can only be on active nodes We re-define the meaning of active shard placement. It used to only be defined via shardstate == SHARD_STATE_ACTIVE. Now, we also add one more check. The worker node that the placement is on should be active as well. This is a preparation for supporting citus_disable_node() for MX with multiple failures at the same time. With this change, the maintenance daemon only needs to sync the "node metadata" (e.g., pg_dist_node), not the shard metadata.	2021-11-26 09:14:33 +01:00
Onder Kalaci	b4931f7345	Do not acquire locks on reference tables when a node is removed/disabled Before this commit, we acquire the metadata locks on the reference tables while removing/disabling a node on all the MX nodes. Although it has some marginal benefits, such as a concurrent modification during remove/disable node blocks, instead of erroring out, the drawbacks seem worse. Both citus_remove_node and citus_disable_node are not tolerant to multiple node failures. With this commit, we relax the locks. The implication is that while a node is removed/disabled, users might see query errors. On the other hand, this change makes removing/disabling nodes more tolerant to multiple node failures.	2021-11-26 09:08:25 +01:00
Onur Tirtir	76b8006a9e	Allow overwriting columnar storage pages written by aborted xacts (#5484) When refactoring storage layer in #4907, we deleted the code that allows overwriting a disk page previously written but not known by metadata. Readers can see the change that introduced the code allows doing so in commit `a8da9acc63`. The reasoning was that; as of 10.2, we started aligning page reservations (`AlignReservation`) for subsequent writes right after allocating pages from disk. That means, even if writer transaction fails, subsequent writes are guaranteed to allocate a new page and write to there. For this reason, attempting to write to a page allocated before is not possible for a columnar table that user created when using v10.2.x. However, since the older versions of columnar doesn't do that, following example scenario can still result in writing to such disk page, even if user now upgraded to v10.2.x. This is because, when upgrading storage to 2.0 (`ColumnarStorageUpdateIfNeeded`), we calculate `reservedOffset` of the metapage based on the highest used address known by stripe metadata (`GetHighestUsedAddressAndId`). However, stripe metadata doesn't have entries for aborted writes. As a result, highest used address would be computed by ignoring pages that are allocated but not used. - User attempts writing to columnar table on Citus v10.0x/v10.1x. - Write operation fails for some reason. - User upgrades Citus to v10.2.x. - When attempting to write to same columnar table, they hit to "attempt to write columnar data .." error since write operation done in the older version of columnar already allocated that page, and now we are overwriting it. For this reason, with this commit, we re-do the change done in `a8da9acc63`. And for the reasons given above, it wasn't possible to add a test for this commit via usual code-paths. For this reason, added a UDF only for testing purposes so that we can reproduce the exact scenario in our regression test suite.	2021-11-26 07:51:13 +01:00
Onur Tirtir	85da4fc2e0	Merge branch 'master' into col/pg-upgrade-dependency	2021-11-26 09:34:43 +03:00
Onur Tirtir	81af605e07	Fix typo: "no sharding pruning constraints" -> "no shard pruning constraints" (#5490 )	2021-11-25 21:00:44 +01:00
Onur Tirtir	73f06323d8	Introduce dependencies from columnarAM to columnar metadata objects During pg upgrades, we have seen that it is not guaranteed that a columnar table will be created after metadata objects got created. Prior to changes done in this commit, we had such a dependency relationship in `pg_depend`: ``` columnar_table ----> columnarAM ----> citus extension ^ ^ \| \| columnar.storage_id_seq -------------------- \| \| columnar.stripe ------------------------------- ``` Since `pg_upgrade` just knows to follow topological sort of the objects when creating database dump, above dependency graph doesn't imply that `columnar_table` should be created before metadata objects such as `columnar.storage_id_seq` and `columnar.stripe` are created. For this reason, with this commit we add new records to `pg_depend` to make columnarAM depending on all rel objects living in `columnar` schema. That way, `pg_upgrade` will know it needs to create those before creating `columnarAM`, and similarly, before creating any tables using `columnarAM`. Note that in addition to inserting those records via installation script, we also do the same in `citus_finish_pg_upgrade()`. This is because, `pg_upgrade` rebuilds catalog tables in the new cluster and that means, we must insert them in the new cluster too.	2021-11-23 13:14:00 +03:00
Onur Tirtir	ef2ca03f24	Reproduce bug via test suite	2021-11-23 13:14:00 +03:00
Burak Velioglu	6590f12de4	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-22 13:55:36 +03:00
Burak Velioglu	12e05ad196	Sorted addresses before getting lock	2021-11-22 11:43:32 +03:00
Marco Slot	f49d26fbeb	Remove citus_update_table_statistics isolation test	2021-11-19 10:51:15 +01:00
Marco Slot	56eae48daf	Stop updating shard range in citus_update_shard_statistics	2021-11-19 10:51:15 +01:00
Burak Velioglu	3a68263cc7	Change lock type	2021-11-19 12:03:17 +03:00
Burak Velioglu	baeaca7bc5	Update comment	2021-11-19 10:51:56 +03:00
Hanefi Onaldi	c0d43d4905	Prevent cache usage on citus_drop_trigger codepaths	2021-11-18 20:24:51 +03:00
Burak Velioglu	77dd12c09d	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-18 20:18:07 +03:00
Hanefi Onaldi	e6160ad131	Document failing tests for issue 5099	2021-11-18 20:01:34 +03:00
Hanefi Onaldi	a3cc9b4e53	Remove case block that is identical to its neighbor (#5472 )	2021-11-18 19:41:39 +03:00
Burak Velioglu	b484d9b234	Make object locking explicit while adding dependencies	2021-11-18 19:34:00 +03:00
Marco Slot	9e6ca23286	Remove cstore_fdw-related logic	2021-11-16 13:59:03 +01:00
Önder Kalacı	8c0bc94b51	Enable replication factor > 1 in metadata syncing (#5392 ) - [x] Add some more regression test coverage - [x] Make sure returning works fine in case of local execution + remote execution (task->partiallyLocalOrRemote works as expected, already added tests) - [x] Implement locking properly (and add isolation tests) - [x] We do #shardcount round-trips on `SerializeNonCommutativeWrites`. We made it a single round-trip. - [x] Acquire locks for subselects on the workers & add isolation tests - [x] Add a GUC to prevent modification from the workers, hence increase the coordinator-only throughput - The performance slightly drops (~%15), unless `citus.allow_modifications_from_workers_to_replicated_tables` is set to false	2021-11-15 15:10:18 +03:00
Onur Tirtir	25024b776e	Skip deleting options if columnar.options is already dropped (#5458) Drop extension might cascade to columnar.options before dropping a columnar table. In that case, we were getting below error when opening columnar.options to delete records for the columnar table that we are about to drop: "ERROR: could not open relation with OID 0". I somehow reproduced this bug easily when upgrading pg, that is why I added the test to after_pg_upgrade_schedule.	2021-11-12 12:30:09 +03:00
Ahmet Gedemenli	14a33d4e8e	Introduce GUC citus.use_citus_managed_tables	2021-11-11 14:09:06 +03:00
Hanefi Onaldi	3d9cec70fd	Update migration paths from 10.2 to 11.0 (#5459 ) We recently introduced a set of patches to 10.2, and introduced 10.2-4 migration version. This migration version only resides on `release-10.2` branch, and is missing on our default branch. This creates a problem because we do not have a valid migration path from 10.2 to latest 11.0. To remedy this issue, I copied the relevant migration files from `release-10.2` branch, and renamed some of our migration files on default branch to make sure we have a linear upgrade path.	2021-11-11 13:55:28 +03:00
Önder Kalacı	6f5a343ff4	Make sure that enterprise tests pass (#5451 )	2021-11-08 18:11:19 +03:00
Önder Kalacı	98ca6ba6ca	Allow lock_shard_resources to be called by the users with privileges (#5441 ) Before this commit, we required the user to be owner of the shard/table in order to call lock_shard_resources. However, that is too restrictive. We can have users with GRANTS to the table who are not owners of the tables/shards. With this commit, we allow such patterns.	2021-11-08 15:36:51 +01:00
Onder Kalaci	d5e89b1132	Unify distributed execution logic for single replicated tables Citus does not acquire any executor locks for shard replication == 1. With this commit, we unify this decision and exit early.	2021-11-08 13:52:20 +01:00
Önder Kalacı	d5b371b2e0	Merge branch 'master' into naisila/fix-partitioned-index	2021-11-08 10:53:16 +01:00
naisila	385ba94d15	Run fix_partition_shard_index_names after each wrong naming command	2021-11-08 10:43:34 +01:00
Marco Slot	78866df13c	Remove master_append_table_to_shard UDF	2021-11-08 10:43:24 +01:00
Marco Slot	fba93df4b0	Remove copy into new append shard logic	2021-11-07 21:01:40 +01:00
Marco Slot	27ba19f7e1	Fix a flappy test in drop_column_partitioned_table	2021-11-07 18:25:44 +01:00
Nils Dijk	3fcb456381	Refactor/partitioned result destreceiver (#5432) This change creates a slightly higher abstraction of the `PartitionedResultDestReceiver` where it decouples the partitioning from writing it to a file. This allows for easier reuse for other `DestReceiver`'s that would like to route different tuples to different `DestReceiver`'s. Originally there was a lot of state kept in `PartitionedResultDestReceiver` to be able to lazily create `FileDestReceivers` when the first tuple arrived for that target. This convoluted the implementation of the processing of tuples with where they should go. This refactor changes that where it makes the `PartitionedResultDestReceiver` completely agnostic of what kind of Receivers it is writing to. When constructed you pass it a list of `DestReceiver` compatible pointers with the length of `partitionCount`. Internally the `PartitionedResultDestReceiver` keeps track of which `DestReceiver`'s have been started or not, and start them when they first receive a tuple. Alternatively, if the instantiating code of the `PartitionedResultDestReceiver` wants, the startup can be turned from lazily to eagerly. When the startup is eager (not lazy) all `rStartup` functions on the list of `DestReceiver`'s are called during the startup of the `PartitionedResultDestReceiver` and marked as such. A downside of this approach is the following. On highly partitioned destinations we now need to allocate a `FileDestReceiver` for every target, _always_. When the data passed into the `PartitionedResultDestReceiver` is highly skewed to a small set of `FileDestReceiver`'s this will waste some memory. Given the small size of a `FileDestReceiver`, and the fact that actual file handles are only created during the processing of the startup of the `FileDestReceiver` I think this memory waste is not a problem. If this would become a problem we could refactor the source list into some kind of generator object which can generate the `DestReceiver`'s on the fly.	2021-11-05 13:31:18 +01:00
Nils Dijk	0e7cf9f0ca	reinstate optimization that got unintentionally broken in `366461ccdb` (#5418 ) DESCRIPTION: Reinstate optimisation for uniform shard interval ranges During a refactor introduced in #4132 the following change was made, which made the optimisation in `CalculateUniformHashRangeIndex` unreachable: `366461ccdb (diff-565a339ed3c78bc5a0d4ffeb4e91032150b1dffbeeff59cd3e65981d20b998c7L319-R319)` This PR reinstates the path to the optimisation!	2021-11-05 13:07:51 +01:00
Önder Kalacı	763176a4d9	Some minor improvements on top of 5314 (#5428 ) * Refactor some checks in citus local tables * all existing citus local tables are auto converted after upgrade * Update warning messages in CreateCitusLocalTable * Hide notice msg for auto converting local tables * Hide hint msg Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2021-11-05 13:59:13 +03:00
Sait Talha Nisanci	ab29c25658	Fix missing from entry	2021-11-04 18:54:52 +03:00
Halil Ozan Akgul	a8f3f712cc	Turns mx on in isolations tests	2021-11-04 17:12:30 +03:00
Ahmet Gedemenli	b30ed46068	Fixes ALTER STATISTICS IF EXISTS bug (#5435 ) * Fix ALTER STATISTICS IF EXISTS bug	2021-11-04 16:14:05 +03:00
Halil Ozan Akgul	91b377490b	Fix multi_cluster_management fails for metadata syncing	2021-11-04 11:09:21 +03:00
Talha Nisanci	19f28eabae	Fix citus upgrade local run issues (#5414 ) This PR is fixing 2 separate issues related to the local run of citus upgrade tests. `d3e7c825ab` fixes the issue that, with our new testing infrastructure, we moved/renamed some of existing folders. This created a problem for local runs of citus upgrade tests since some paths were sensitive to such changes. This commit tries to make it more generic so that this issue is less likely to happen in the future, while also fixing the current issue. `93de6b60c3` we are fixing an issue that a new environment variable was added for citus upgrade tests, which is defined in the CI. `0cb51f8c37/.circleci/config.yml (L294)` This environment variable wasn't set in our local runs hence it would create problems. Instead of defining this environment variable in the local run, we change the citus_upgrade run command to use an existing env variable, which is now also set in the CI.	2021-11-03 16:17:36 +03:00
Jelte Fennema	9b784e58bf	Add tests for special hash values (#5431) We fixed some crashes a while back that would only occur in cases where the value of a distribution column would result in a very high or a very low hash value. This adds a regression test for those crashes.	2021-11-03 13:42:39 +01:00
Jelte Fennema	0cb51f8c37	Test a query that failed on 9.5.8 when coordinator is in metadata (#5412 ) This test starts passing because of PR #4508, to be precise commit: `24e60b44a1` When I undo that commit this newly added test starts failing. This adds this test to make sure we don't regress on this again.	2021-11-03 12:27:28 +01:00
Halil Ozan Akgul	c0785d570c	Remove EnsureSuperUser from start and stop metadata sync to node	2021-11-01 18:01:49 +03:00
Halil Ozan Akgul	c0eb67b24f	Skip forceCloseAtTransactionEnd connections only if BEGIN was not sent on them	2021-11-01 17:43:04 +03:00
Jelte Fennema	57a0228c52	Fix string-concatenation warning on Clang 13 (#5425 ) Clang 13 complains about a suspicious string concatenation. It thinks we might have missed a comma. This adds parentheses to make it clear that concatenation is indeed what we meant.	2021-11-01 13:55:43 +03:00
naisila	796d56a7b1	Rename ddlJob->commandString to ddlJob->metadataSyncCommand	2021-10-29 23:45:43 +03:00
Ahmet Gedemenli	67dca4363d	Dont auto-undistribute user-added citus local tables (#5314 ) * Disable auto-undistribute for user-added citus local tables	2021-10-28 12:10:26 +03:00
Nils Dijk	f4297f774a	Bump mitmproxy version (#5334 ) There is a vulnerability in mitmproxy with the version we are using. It would be hard to exploit anything with regards to the artifacts we ship as its only used in our test suite. Still its good hygiene to _not_ use software with known vulnerabilities. This PR updates the version of python, mitmproxy and the crypto libraries used. The latest version of mitmproxy for python 3.6 is not patched, hence the upgrade of python. For our CI images this cascades into upgrading debian as well :) For CI we bake these versions in our images so we need to update them as well. Changes to the CI images: https://github.com/citusdata/the-process/pull/65	2021-10-27 17:57:13 +02:00

3405 Commits (7b6588fec00af113707cd976d7d507048879b429)