citus

Commit Graph

Author	SHA1	Message	Date
Ahmet Gedemenli	8936543b80	Create wrapper function CreateObjectAddressDependencyDefList (#5623 )	2022-01-17 15:35:40 +03:00
Ying Xu	4dca662e97	Making Columnar Dependency Free from Citus (#5622 ) * Removed distributed dependency in columnar_metadata.c * Changed columnar_debug.c so that it no longer needed distributed/tuplestore and made it return a record instead of a tuplestore * removed distributed/commands.h dependency * Made columnar_tableam.c dependency-free * Fixed spacing for columnar_store_memory_stats function * indentation fix * fixed test failures	2022-01-14 09:43:05 -08:00
Onur Tirtir	70d8e1fe97	Assert that we will create indexes on shards via local execution (#5620 )	2022-01-13 17:09:57 +01:00
Halil Ozan Akgul	63cd90e5dd	Add missing library to dependencies.c	2022-01-11 18:36:43 +03:00
Önder Kalacı	46ec7cd5cf	Enable MX for rebalancer tests	2022-01-11 12:07:39 +01:00
Önder Kalacı	885601c02c	Require superuser while activating a node (#5609 ) * Require superuser while activating a node With this change, ActiveNode() (hence citus_add_node(), citus_activate_node()) explicitly requires a superuser. Before this commit, these functions were designed to work with non-superuser roles with the relevant GRANTs given. However, that is not a widely used way of calling the functions above. Because a non-superuser could call the UDFs, they were designed so that some commands used additional short-lived superuser connections. That is: (a) breaking transactional behavior (e.g., ROLLBACK wouldn't fully roll back the whole transaction) (b) making it very complicated to reason about which parts of the node activation go over which connections, and becoming vulnerable to deadlocks / visibility issues.	2022-01-10 08:30:13 -08:00
Onur Tirtir	3cc44ed8b3	Tell other backends it's safe to ignore the backend that concurrently built the shell table index (#5520 ) In addition to starting a new transaction, we also need to tell other backends --including the ones spawned for connections opened to localhost to build indexes on shards of this relation-- that concurrent index builds can safely ignore us. Normally, DefineIndex() only does that if index doesn't have any predicates (i.e.: where clause) and no index expressions at all. However, now that we already called standard process utility, index build on the shell table is finished anyway. The reason behind doing so is that we cannot guarantee not grabbing any snapshots via adaptive executor, and the backends creating indexes on local shards (if any) might block on waiting for current xact of the current backend to finish, which would cause self deadlocks that are not detectable.	2022-01-10 10:23:09 +03:00
Marco Slot	ee3b50b026	Disallow remote execution from queries on shards	2022-01-07 17:46:21 +01:00
Önder Kalacı	8d1b188620	Enable MX for the remaining failure tests (#5606 )	2022-01-07 17:24:31 +01:00
Ahmet Gedemenli	3c834e6693	Disable foreign distributed tables (#5605 ) * Disable foreign distributed tables * Add warning for existing distributed foreign tables	2022-01-07 18:12:23 +03:00
Onder Kalaci	7cb1d6ae06	Improve metadata connections With https://github.com/citusdata/citus/pull/5493 we introduced metadata specific connections. With this connection we guarantee that there is a single metadata connection. But note that this connection can be used for any other operation. In other words, this connection is not only reserved for metadata operations. However, https://github.com/citusdata/citus-enterprise/issues/715 showed us that the logic has a flaw. We allowed ineligible connections to be picked as metadata connections, such as exclusively claimed connections or not fully initialized connections. With this commit, we make sure that we only consider eligible connections for metadata operations.	2022-01-07 10:36:32 +01:00
Onder Kalaci	9f2d9e1487	Move placement deletion from disable node to activate node We prefer the background daemon to only sync node metadata. That's why we move placement metadata changes from disable node to activate node. With that, we can make sure that disable node only changes node metadata, whereas activate node syncs all the metadata changes. In essence, we already expect all nodes to be up when a node is activated. So, this does not change the behavior much.	2022-01-07 09:56:03 +01:00
Hanefi Onaldi	9edfbe7718	Fix the default value for DeferShardDeleteOnMove The default for GUC citus.defer_drop_after_shard_move is true. However we initialize the global variable with a false value.	2022-01-07 11:01:49 +03:00
Ahmet Gedemenli	45e423136c	Support foreign tables in MX (#5461 )	2022-01-06 18:50:34 +03:00
Önder Kalacı	5305aa4246	Do not drop sequences when dropping metadata (#5584 ) Dropping sequences means we need to recreate and hence losing the sequence. With this commit, we keep the existing sequences such that resyncing wouldn't drop the sequence. We do that by breaking the dependency of the sequence from the table.	2022-01-06 09:48:34 +01:00
Önder Kalacı	8007adda25	Convert the function to a distributed function (#5596 ) so that when metadata is synced, the table is on the worker	2022-01-06 11:32:40 +03:00
Önder Kalacı	6d9218540b	Enable single node tests with Citus MX (#5595 ) * Enable single node tests with Citus MX The test already has comment on the changes	2022-01-05 16:00:44 +03:00
jeff-davis	2e03efd91e	Columnar: move DDL hooks to citus to remove dependency. (#5547 ) Add a new hook ColumnarTableSetOptions_hook so that citus can get control when the columnar table options change.	2022-01-04 23:26:46 -08:00
jeff-davis	c9292cfad1	Make pg_version_compat.h and listutils.c dependency-free. (#5548 ) Split distributed/version_compat.h into dependency-free pg_version_compat.h, and the original which still has dependencies. The original doesn't have much purpose, but until other files have better discipline about including the correct header files, then it's still needed. Also make distributed/listutils.h dependency-free. Should be moved outside of 'distributed' subdirectory, but that will cause significant code churn, so leave for another cleanup patch. Now both files can be included in columnar without creating a dependency on citus.	2022-01-04 23:02:08 -08:00
jeff-davis	1546aa0d9f	Columnar: use proper generic WAL interface. (#5543 ) Previously, we cheated by using the RM_GENERIC_ID record type, but not actually using the generic WAL API. This worked because we always took a full page image, and saved the extra work of allocating and copying to a temporary page. But it introduced complexity, and perhaps fragility, so better to just use the API properly. The performance penalty for a serial data load seems to be less than 1%.	2022-01-04 22:42:21 -08:00
Onder Kalaci	22b5175fd1	Make sure that the community and enterprise tests produce the same output	2022-01-04 13:30:31 +01:00
Önder Kalacı	0a8b0b06c6	Do not allow distributed functions on non-metadata synced nodes (#5586 ) Before this commit, Citus was triggering metadata syncing in the background when a function is distributed. However, with Citus 11, we expect all clusters to have metadata sync enabled. So, we do not expect any nodes not to have the metadata. This change: (a) pro: simplifies the code and opens up possibilities to simplify further by reducing the scope of bg worker to only sync node metadata (b) pro: explicitly asks users to sync the metadata such that any unforeseen impact can be easily detected (c) con: For distributed functions without distribution argument, we do not necessarily require the metadata synced. However, for completeness and simplicity, we do so.	2022-01-04 13:12:57 +01:00
Halil Ozan Akgul	9547228e8d	Add isolation_check_mx test	2021-12-30 14:58:30 +03:00
Halil Ozan Akgul	aef2d83c7d	Fix metadata sync fails on multi_transaction_recovery	2021-12-29 11:21:32 +03:00
Önder Kalacı	d33650d1c1	Record if any partitioned Citus tables during upgrade (#5555 ) With Citus 11, the default behavior is to sync the metadata. However, partitioned tables created pre-Citus 11 might have index names that are not compatible with metadata syncing. See https://github.com/citusdata/citus/issues/4962 for the details. With this commit, we record the existence of partitioned tables such that we can fix it later if any exists.	2021-12-27 03:33:34 -08:00
Halil Ozan Akgul	0c292a74f5	Fix metadata sync fails on multi_truncate	2021-12-27 13:54:53 +03:00
Önder Kalacı	c9127f921f	Avoid round trips while fixing index names (#5549 ) With this commit, fix_partition_shard_index_names() works significantly faster. For example: 32 shards, 365 partitions, 5 indexes drop from ~120 seconds to ~44 seconds; 32 shards, 1095 partitions, 5 indexes drop from ~600 seconds to ~265 seconds. `queryStringList` can be really long, because it may contain #partitions * #indexes entries. Before this change, we were actually going through the executor where each command in the query string triggers 1 round trip per entry in queryStringList. The aim of this commit is to avoid the round-trips by creating a single query string. I first simply tried sending `q1;q2;..;qn` . However, the executor is designed to handle `q1;q2;..;qn` type of query executions via the infrastructure mentioned above (e.g., by tracking the query indexes in the list and doing 1 statement per round trip). Another option could have been to change the executor so that it only tracks the query index when `queryStringList` is provided, not when queryString includes multiple `;`s. That is (a) more work (b) could cause weird edge cases with failure handling (c) felt like coding a special case into the executor	2021-12-27 10:29:37 +01:00
Halil Ozan Akgul	bb636e6a29	Fix metadata sync fails on multi_function_evaluation	2021-12-24 19:32:58 +03:00
Halil Ozan Akgul	70e68d5312	Fix metadata sync fails on multi_name_lengths	2021-12-24 14:33:32 +03:00
Halil Ozan Akgul	5c2fb06322	Fix metadata sync fails on multi_sequence_default	2021-12-24 14:33:32 +03:00
Halil Ozan Akgul	b9c06a6762	Turn metadata sync on in multi_metadata_sync	2021-12-24 10:58:13 +03:00
Hanefi Onaldi	479b2da740	Fix one flaky failure test	2021-12-23 20:11:45 +03:00
Ahmet Gedemenli	042d45b263	Propagate foreign server ops	2021-12-23 17:54:04 +03:00
Onur Tirtir	61b5fb1cfc	Run failure_test_helpers in base schedule (#5559 )	2021-12-23 12:54:12 +01:00
Talha Nisanci	e196d23854	Refactor AttributeEquivalenceId (#5006 )	2021-12-23 13:19:02 +03:00
Hanefi Onaldi	76176caea7	Fix typo s/exlusive/exclusive/	2021-12-23 01:35:01 +03:00
Hanefi Onaldi	1af8ca8f7c	Fix statical analysis findings (#5550 )	2021-12-22 18:16:11 +03:00
Ahmet Gedemenli	8e4ff34a2e	Do not include return table params in the function arg list (cherry picked from commit `90928cfd74`) Fix function signature generation Fix comment typo Add test for worker_create_or_replace_object Add test for recreating distributed functions with OUT/TABLE params Add test for recreating distributed function that returns setof int Fix test output Fix comment	2021-12-21 19:01:42 +03:00
Marco Slot	2eef71ccab	Propagate SET TRANSACTION commands	2021-12-18 11:31:39 +01:00
Halil Ozan Akgul	46f718c76d	Turn metadata sync on in add_coordinator, foreign_key_to_reference_table and replicate_reference_tables_to_coordinator	2021-12-17 16:33:25 +03:00
Halil Ozan Akgul	25755a7094	Turn ddl propagation off in worker on multi_copy	2021-12-17 15:54:20 +03:00
Onder Kalaci	fc98f83af2	Add citus.grep_remote_commands Simply applies ```SQL SELECT textlike(command, citus.grep_remote_commands) ``` And, if returns true, the command is logged. Else, the log is ignored. When citus.grep_remote_commands is empty string, all commands are logged.	2021-12-17 11:47:40 +01:00
Halil Ozan Akgul	df8d0f3db1	Turn metadata sync on in multi_replicate_reference_table and multi_citus_tools	2021-12-17 10:25:57 +03:00
Onur Tirtir	cc4c83b1e5	HAVE_LZ4 -> HAVE_CITUS_LZ4 (#5541 )	2021-12-16 16:21:52 +03:00
Talha Nisanci	c0945d88de	Normalize a debug failure to WARNING failure (#4996 )	2021-12-16 13:43:49 +03:00
Halil Ozan Akgul	8943d7b52f	Turn metadata sync on in mx_regular_user and remove_coordinator	2021-12-16 11:26:24 +03:00
Halil Ozan Akgul	b82af4db3b	Turn metadata sync on in multi_size_queries, multi_drop_extension and multi_unsupported_worker_operations	2021-12-16 11:10:54 +03:00
Hanefi Onaldi	9d4d73898a	Move healthcheck logic into new file (#5531 ) and add a missing `CheckCitusVersion(ERROR)` call	2021-12-15 15:58:20 -08:00
Hanefi Onaldi	acdcd9422c	Fix one flaky failure test (#5528 ) Removes flaky test	2021-12-15 18:59:58 +03:00
Hanefi Onaldi	29e4516642	Introduce citus_check_cluster_node_health UDF This UDF coordinates connectivity checks across the whole cluster. This UDF gets the list of active readable nodes in the cluster, and coordinates all connectivity checks in sequential order. The algorithm is: for sourceNode in activeReadableWorkerList: c = connectToNode(sourceNode) for targetNode in activeReadableWorkerList: result = c.execute( "SELECT citus_check_connection_to_node(targetNode.name, targetNode.port)") emit sourceNode.name, sourceNode.port, targetNode.name, targetNode.port, result - result -> true -> connection attempt from source to target succeeded - result -> false -> connection attempt from source to target failed - result -> NULL -> connection attempt from the current node to source node failed I suggest you use the following query to get an overview on the connectivity: SELECT bool_and(COALESCE(result, false)) FROM citus_check_cluster_node_health(); Whenever this query returns false, there is a connectivity issue, check in detail.	2021-12-15 01:41:51 +03:00
Hanefi Onaldi	13fff9c37a	Remove NOOP tuplestore_donestoring calls PostgreSQL has not needed calls to this function since the 7.4 release, and it is a NOOP. For more details, check the PostgreSQL commit below: commit dd04e958c8b03c0f0512497651678c7816af3198 Author: Tom Lane <tgl@sss.pgh.pa.us> Date: Sun Mar 9 03:34:10 2003 +0000 tuplestore_donestoring() isn't needed anymore, but provide a no-op macro definition so as not to create compatibility problems. diff --git a/src/include/utils/tuplestore.h b/src/include/utils/tuplestore.h index b46babacd1..76fe9fb428 100644 --- a/src/include/utils/tuplestore.h +++ b/src/include/utils/tuplestore.h @@ -17,7 +17,7 @@ * Portions Copyright (c) 1996-2002, PostgreSQL Global Development Group * Portions Copyright (c) 1994, Regents of the University of California * - * $Id: tuplestore.h,v 1.8 2003/03/09 02:19:13 tgl Exp $ + * $Id: tuplestore.h,v 1.9 2003/03/09 03:34:10 tgl Exp $ * ------------------------------------------------------------------------- */ @@ -41,6 +41,9 @@ extern Tuplestorestate *tuplestore_begin_heap(bool randomAccess, extern void tuplestore_puttuple(Tuplestorestate *state, void *tuple); +/* tuplestore_donestoring() used to be required, but is no longer used */ +#define tuplestore_donestoring(state) ((void) 0) + /* backwards scan is only allowed if randomAccess was specified 'true' */ extern void *tuplestore_gettuple(Tuplestorestate *state, bool forward, bool *should_free);	2021-12-14 18:55:02 +03:00
Halil Ozan Akgul	e060720370	Fix metadata sync fails in multi_index_statements	2021-12-14 11:28:08 +03:00
Halil Ozan Akgul	a951e52ce8	Fix drop index trying to drop coordinator local indexes on metadata worker nodes	2021-12-14 11:28:08 +03:00
Halil Ozan Akgul	1d7dde2c4c	Fix metadata sync fails on multi_copy	2021-12-14 10:59:59 +03:00
Halil Ozan Akgul	98e38e2e4e	Fix metadata sync fails on failure_connection_establishment	2021-12-13 11:51:56 +03:00
Halil Ozan Akgul	507df08422	Fix metadata sync fails on propagate_statistics and pg13_propagate_statistics tests	2021-12-09 12:28:11 +03:00
Halil Ozan Akgul	351314f8a1	Turn metadata sync on in base/minimal schedules	2021-12-08 13:34:41 +03:00
Halil Ozan Akgul	ee894c9e73	Fix metadata sync fails on multi_follower_schedule	2021-12-08 13:07:37 +03:00
Halil Ozan Akgul	4c8f79d7dd	Turn metadata sync on in failure schedule	2021-12-08 11:22:56 +03:00
Halil Ozan Akgul	4f272ea0e5	Fix metadata sync fails in multi_extension	2021-12-08 10:25:43 +03:00
Halil Ozan Akgul	a3834edeaa	Turn metadata sync on in multi_mx_schedule	2021-12-08 10:25:43 +03:00
Halil Ozan Akgul	ea37f4fd29	Turn metadata sync on in upgrade schedules	2021-12-08 10:19:02 +03:00
Hanefi Onaldi	05a3dfa8a9	Remove redundant arbitrary config class We had 2 class definitions for CitusCacheManyConnectionsConfig, where one of them was a copy of CitusSmallCopyBuffersConfig. This commit leaves the intended class definition that configures caching many connections, and removes the one that is a copy of another class	2021-12-08 04:47:08 +03:00
Burak Velioglu	e8534c1dd5	Drop sequence metadata from workers explicitly	2021-12-06 19:25:51 +03:00
Burak Velioglu	21194c3b9d	Mark sequence distributed explicitly while syncing metadata Since sequences are not marked as distributed while creating table if no metadata worker node exists, we are marking all sequences distributed while syncing metadata explicitly.	2021-12-06 19:25:51 +03:00
Burak Velioglu	6d849cf394	Allow delegating function from worker nodes We've both allowed delegating functions and procedures from worker nodes and also prevented delegation if a function/procedure has already been propagated from another node.	2021-12-06 19:25:51 +03:00
Burak Velioglu	a8b1ee87f7	Increment command counter after altering the sequence type	2021-12-06 19:25:51 +03:00
Burak Velioglu	ed8e32de5e	Sync pg_dist_object on an update and propagate while syncing to a new node Before that PR we were updating citus.pg_dist_object metadata, which keeps the metadata related to objects on Citus, only on the coordinator node. In order to allow using those objects from worker nodes (or erroring out with a proper error message) we've started to propagate that metadata to worker nodes as well.	2021-12-06 19:25:50 +03:00
Halil Ozan Akgul	ef09ba0d06	Fix metadata sync fails of multi_table_ddl	2021-12-06 13:44:30 +03:00
Halil Ozan Akgul	a6d0de060c	Fix fails with metadata syncing in undistribute_table	2021-12-03 13:58:53 +03:00
Hanefi Onaldi	56e9b1b968	Introduce UDF to check worker connectivity citus_check_connection_to_node runs a simple query on a remote node and reports whether this attempt was successful. This UDF will be used to make sure each worker node can connect to all the worker nodes in the cluster. parameters: nodename: required nodeport: optional (default: 5432) return value: boolean success	2021-12-03 02:30:28 +03:00
Talha Nisanci	e4ead8f408	Update broken link for upgrade tests (#5408 ) * Update broken link for upgrade tests * Update src/test/regress/README.md Co-authored-by: Nils Dijk <nils@citusdata.com> Co-authored-by: Nils Dijk <nils@citusdata.com>	2021-12-02 15:25:36 +01:00
Onder Kalaci	549edcabb6	Allow disabling node(s) when multiple failures happen As of master branch, Citus does all the modifications to replicated tables (e.g., reference tables and distributed tables with replication factor > 1), via 2PC and avoids any shardstate=3. As a side-effect of those changes, handling node failures for replicated tables changes. With this PR, when one (or multiple) node failures happen, the users would see query errors on modifications. If the problem is intermittent, that's OK, once the node failure(s) recover by themselves, the modification queries would succeed. If the node failure(s) are permanent, the users should call `SELECT citus_disable_node(...)` to disable the node. As soon as the node is disabled, modification would start to succeed. However, now the old node gets behind. It means that, when the node is up again, the placements should be re-created on the node. First, use `SELECT citus_activate_node()`. Then, use `SELECT replicate_table_shards(...)` to replicate the missing placements on the re-activated node.	2021-12-01 10:19:48 +01:00
Halil Ozan Akgul	316274b5f0	Add normalize.sed item for multi_fix_partition_shard_index_names test	2021-11-30 13:28:41 +03:00
Halil Ozan Akgul	11072b4cb8	Normalize create role command in drop_partitioned_table test	2021-11-30 12:46:22 +03:00
Onder Kalaci	d405993b57	Make sure to use a dedicated metadata connection With this commit, we make sure to use a dedicated connection per node for all the metadata operations within the same transaction. This is needed because the same metadata (e.g., metadata includes the distributed table on the workers) can be modified across multiple connections. With this connection we guarantee that there is a single metadata connection. But note that this connection can be used for any other operation. In other words, this connection is not only reserved for metadata operations.	2021-11-26 14:36:28 +01:00
Onder Kalaci	38b08ebde9	Generalize the error checks while removing node The checks that prevent removing a node are very much reference table centric. We are soon going to add the same checks for replicated tables. So, make the checks generic such that: (a) replicated tables fit naturally (b) we can use the same checks in `citus_disable_node`.	2021-11-26 14:25:29 +01:00
Hanefi Onaldi	4c135de9e4	Introduce CI checks for hash comments in specs We do not use comments starting with # in spec files because it creates errors from C preprocessor that expects directives after this character. Instead use C style comments, i.e: // single line comment You can also use multiline comments as well /* * multi line comment */	2021-11-26 14:52:51 +03:00
Halil Ozan Akgul	87a1c760d9	Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 12:09:53 +03:00
Onder Kalaci	121f5c4271	Active placements can only be on active nodes We re-define the meaning of active shard placement. It used to only be defined via shardstate == SHARD_STATE_ACTIVE. Now, we also add one more check. The worker node that the placement is on should be active as well. This is a preparation for supporting citus_disable_node() for MX with multiple failures at the same time. With this change, the maintenance daemon only needs to sync the "node metadata" (e.g., pg_dist_node), not the shard metadata.	2021-11-26 09:14:33 +01:00
Onder Kalaci	b4931f7345	Do not acquire locks on reference tables when a node is removed/disabled Before this commit, we acquire the metadata locks on the reference tables while removing/disabling a node on all the MX nodes. Although this has some marginal benefits, such as a concurrent modification during remove/disable node blocking instead of erroring out, the drawbacks seem worse. Both citus_remove_node and citus_disable_node are not tolerant to multiple node failures. With this commit, we relax the locks. The implication is that while a node is removed/disabled, users might see query errors. On the other hand, this change makes removing/disabling nodes more tolerant to multiple node failures.	2021-11-26 09:08:25 +01:00
Onur Tirtir	76b8006a9e	Allow overwriting columnar storage pages written by aborted xacts (#5484 ) When refactoring storage layer in #4907, we deleted the code that allows overwriting a disk page previously written but not known by metadata. Readers can see the change that introduced the code allows doing so in commit `a8da9acc63`. The reasoning was that; as of 10.2, we started aligning page reservations (`AlignReservation`) for subsequent writes right after allocating pages from disk. That means, even if writer transaction fails, subsequent writes are guaranteed to allocate a new page and write to there. For this reason, attempting to write to a page allocated before is not possible for a columnar table that user created when using v10.2.x. However, since the older versions of columnar doesn't do that, following example scenario can still result in writing to such disk page, even if user now upgraded to v10.2.x. This is because, when upgrading storage to 2.0 (`ColumnarStorageUpdateIfNeeded`), we calculate `reservedOffset` of the metapage based on the highest used address known by stripe metadata (`GetHighestUsedAddressAndId`). However, stripe metadata doesn't have entries for aborted writes. As a result, highest used address would be computed by ignoring pages that are allocated but not used. - User attempts writing to columnar table on Citus v10.0x/v10.1x. - Write operation fails for some reason. - User upgrades Citus to v10.2.x. - When attempting to write to same columnar table, they hit to "attempt to write columnar data .." error since write operation done in the older version of columnar already allocated that page, and now we are overwriting it. For this reason, with this commit, we re-do the change done in `a8da9acc63`. And for the reasons given above, it wasn't possible to add a test for this commit via usual code-paths. For this reason, added a UDF only for testing purposes so that we can reproduce the exact scenario in our regression test suite.	2021-11-26 07:51:13 +01:00
Onur Tirtir	85da4fc2e0	Merge branch 'master' into col/pg-upgrade-dependency	2021-11-26 09:34:43 +03:00
Onur Tirtir	81af605e07	Fix typo: "no sharding pruning constraints" -> "no shard pruning constraints" (#5490 )	2021-11-25 21:00:44 +01:00
Onur Tirtir	73f06323d8	Introduce dependencies from columnarAM to columnar metadata objects During pg upgrades, we have seen that it is not guaranteed that a columnar table will be created after metadata objects got created. Prior to changes done in this commit, we had such a dependency relationship in `pg_depend`: ``` columnar_table ----> columnarAM ----> citus extension ^ ^ \| \| columnar.storage_id_seq -------------------- \| \| columnar.stripe ------------------------------- ``` Since `pg_upgrade` just knows to follow topological sort of the objects when creating database dump, above dependency graph doesn't imply that `columnar_table` should be created before metadata objects such as `columnar.storage_id_seq` and `columnar.stripe` are created. For this reason, with this commit we add new records to `pg_depend` to make columnarAM depending on all rel objects living in `columnar` schema. That way, `pg_upgrade` will know it needs to create those before creating `columnarAM`, and similarly, before creating any tables using `columnarAM`. Note that in addition to inserting those records via installation script, we also do the same in `citus_finish_pg_upgrade()`. This is because, `pg_upgrade` rebuilds catalog tables in the new cluster and that means, we must insert them in the new cluster too.	2021-11-23 13:14:00 +03:00
Onur Tirtir	ef2ca03f24	Reproduce bug via test suite	2021-11-23 13:14:00 +03:00
Burak Velioglu	6590f12de4	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-22 13:55:36 +03:00
Burak Velioglu	12e05ad196	Sorted addresses before getting lock	2021-11-22 11:43:32 +03:00
Marco Slot	f49d26fbeb	Remove citus_update_table_statistics isolation test	2021-11-19 10:51:15 +01:00
Marco Slot	56eae48daf	Stop updating shard range in citus_update_shard_statistics	2021-11-19 10:51:15 +01:00
Burak Velioglu	3a68263cc7	Change lock type	2021-11-19 12:03:17 +03:00
Burak Velioglu	baeaca7bc5	Update comment	2021-11-19 10:51:56 +03:00
Hanefi Onaldi	c0d43d4905	Prevent cache usage on citus_drop_trigger codepaths	2021-11-18 20:24:51 +03:00
Burak Velioglu	77dd12c09d	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-18 20:18:07 +03:00
Hanefi Onaldi	e6160ad131	Document failing tests for issue 5099	2021-11-18 20:01:34 +03:00
Hanefi Onaldi	a3cc9b4e53	Remove case block that is identical to its neighbor (#5472 )	2021-11-18 19:41:39 +03:00
Burak Velioglu	b484d9b234	Make object locking explicit while adding dependencies	2021-11-18 19:34:00 +03:00
Marco Slot	9e6ca23286	Remove cstore_fdw-related logic	2021-11-16 13:59:03 +01:00
Önder Kalacı	8c0bc94b51	Enable replication factor > 1 in metadata syncing (#5392 ) - [x] Add some more regression test coverage - [x] Make sure returning works fine in case of local execution + remote execution (task->partiallyLocalOrRemote works as expected, already added tests) - [x] Implement locking properly (and add isolation tests) - [x] We do #shardcount round-trips on `SerializeNonCommutativeWrites`. We made it a single round-trip. - [x] Acquire locks for subselects on the workers & add isolation tests - [x] Add a GUC to prevent modification from the workers, hence increase the coordinator-only throughput - The performance slightly drops (~%15), unless `citus.allow_modifications_from_workers_to_replicated_tables` is set to false	2021-11-15 15:10:18 +03:00
Onur Tirtir	25024b776e	Skip deleting options if columnar.options is already dropped (#5458 ) Drop extension might cascade to columnar.options before dropping a columnar table. In that case, we were getting the error below when opening columnar.options to delete records for the columnar table that we are about to drop: "ERROR: could not open relation with OID 0". I reproduced this bug easily when upgrading pg, which is why the test is added to after_pg_upgrade_schedule.	2021-11-12 12:30:09 +03:00
Ahmet Gedemenli	14a33d4e8e	Introduce GUC citus.use_citus_managed_tables	2021-11-11 14:09:06 +03:00
Hanefi Onaldi	3d9cec70fd	Update migration paths from 10.2 to 11.0 (#5459 ) We recently introduced a set of patches to 10.2, and introduced 10.2-4 migration version. This migration version only resides on `release-10.2` branch, and is missing on our default branch. This creates a problem because we do not have a valid migration path from 10.2 to latest 11.0. To remedy this issue, I copied the relevant migration files from `release-10.2` branch, and renamed some of our migration files on default branch to make sure we have a linear upgrade path.	2021-11-11 13:55:28 +03:00
Önder Kalacı	6f5a343ff4	Make sure that enterprise tests pass (#5451 )	2021-11-08 18:11:19 +03:00
Önder Kalacı	98ca6ba6ca	Allow lock_shard_resources to be called by the users with privileges (#5441 ) Before this commit, we required the user to be owner of the shard/table in order to call lock_shard_resources. However, that is too restrictive. We can have users with GRANTS to the table who are not owners of the tables/shards. With this commit, we allow such patterns.	2021-11-08 15:36:51 +01:00
Onder Kalaci	d5e89b1132	Unify distributed execution logic for single replicated tables Citus does not acquire any executor locks for shard replication == 1. With this commit, we unify this decision and exit early.	2021-11-08 13:52:20 +01:00
Önder Kalacı	d5b371b2e0	Merge branch 'master' into naisila/fix-partitioned-index	2021-11-08 10:53:16 +01:00
naisila	385ba94d15	Run fix_partition_shard_index_names after each wrong naming command	2021-11-08 10:43:34 +01:00
Marco Slot	78866df13c	Remove master_append_table_to_shard UDF	2021-11-08 10:43:24 +01:00
Marco Slot	fba93df4b0	Remove copy into new append shard logic	2021-11-07 21:01:40 +01:00
Marco Slot	27ba19f7e1	Fix a flappy test in drop_column_partitioned_table	2021-11-07 18:25:44 +01:00
Nils Dijk	3fcb456381	Refactor/partitioned result destreceiver (#5432 ) This change creates a slightly higher abstraction of the `PartitionedResultDestReceiver` where it decouples the partitioning from writing it to a file. This allows for easier reuse for other `DestReceiver`'s that would like to route different tuples to different `DestReceiver`'s. Originally there was a lot of state kept in `PartitionedResultDestReceiver` to be able to lazily create `FileDestReceivers` when the first tuple arrived for that target. This convoluted the implementation of the processing of tuples with where they should go. This refactor changes that where it makes the `PartitionedResultDestReceiver` completely agnostic of what kind of Receivers it is writing to. When constructed you pass it a list of `DestReceiver` compatible pointers with the length of `partitionCount`. Internally the `PartitionedResultDestReceiver` keeps track of which `DestReceiver`'s have been started or not, and start them when they first receive a tuple. Alternatively, if the instantiating code of the `PartitionedResultDestReceiver` wants, the startup can be turned from lazily to eagerly. When the startup is eager (not lazy) all `rStartup` functions on the list of `DestReceiver`'s are called during the startup of the `PartitionedResultDestReceiver` and marked as such. A downside of this approach is the following. On highly partitioned destinations we now need to allocate a `FileDestReceiver` for every target, _always_. When the data passed into the `PartitionedResultDestReceiver` is highly skewed to a small set of `FileDestReceiver`'s this will waste some memory. Given the small size of a `FileDestReceiver`, and the fact that actual file handles are only created during the processing of the startup of the `FileDestReceiver` I think this memory waste is not a problem. If this would become a problem we could refactor the source list into some kind of generator object which can generate the `DestReceiver`'s on the fly.	2021-11-05 13:31:18 +01:00
Nils Dijk	0e7cf9f0ca	reinstate optimization that got unintentionally broken in `366461ccdb` (#5418 ) DESCRIPTION: Reinstate optimisation for uniform shard interval ranges During a refactor introduced in #4132 the following change was made, which made the optimisation in `CalculateUniformHashRangeIndex` unreachable: `366461ccdb (diff-565a339ed3c78bc5a0d4ffeb4e91032150b1dffbeeff59cd3e65981d20b998c7L319-R319)` This PR reinstates the path to the optimisation!	2021-11-05 13:07:51 +01:00
Önder Kalacı	763176a4d9	Some minor improvements on top of 5314 (#5428 ) * Refactor some checks in citus local tables * all existing citus local tables are auto converted after upgrade * Update warning messages in CreateCitusLocalTable * Hide notice msg for auto converting local tables * Hide hint msg Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2021-11-05 13:59:13 +03:00
Sait Talha Nisanci	ab29c25658	Fix missing from entry	2021-11-04 18:54:52 +03:00
Halil Ozan Akgul	a8f3f712cc	Turns mx on in isolations tests	2021-11-04 17:12:30 +03:00
Ahmet Gedemenli	b30ed46068	Fixes ALTER STATISTICS IF EXISTS bug (#5435 ) * Fix ALTER STATISTICS IF EXISTS bug	2021-11-04 16:14:05 +03:00
Halil Ozan Akgul	91b377490b	Fix multi_cluster_management fails for metadata syncing	2021-11-04 11:09:21 +03:00
Talha Nisanci	19f28eabae	Fix citus upgrade local run issues (#5414 ) This PR is fixing 2 separate issues related to the local run of citus upgrade tests. `d3e7c825ab` fixes the issue that, with our new testing infrastructure, we moved/renamed some of existing folders. This created a problem for local runs of citus upgrade tests since some paths were sensitive to such changes. This commit tries to make it more generic so that this issue is less likely to happen in the future, while also fixing the current issue. `93de6b60c3` fixes an issue where a new environment variable was added for citus upgrade tests, which is defined in the CI. `0cb51f8c37/.circleci/config.yml (L294)` This environment variable wasn't set in our local runs hence it would create problems. Instead of defining this environment variable in the local run, we change the citus_upgrade run command to use an existing env variable, which is now also set in the CI.	2021-11-03 16:17:36 +03:00
Jelte Fennema	9b784e58bf	Add tests for special hash values (#5431 ) We fixed some crashes a while back that would only occur in cases where the value of a distribution column would result in a high or a very low hash value. This adds a regression test for those crashes.	2021-11-03 13:42:39 +01:00
Jelte Fennema	0cb51f8c37	Test a query that failed on 9.5.8 when coordinator is in metadata (#5412 ) This test starts passing because of PR #4508, to be precise commit: `24e60b44a1` When I undo that commit this newly added test starts failing. This adds this test to make sure we don't regress on this again.	2021-11-03 12:27:28 +01:00
Halil Ozan Akgul	c0785d570c	Remove EnsureSuperUser from start and stop metadata sync to node	2021-11-01 18:01:49 +03:00
Halil Ozan Akgul	c0eb67b24f	Skip forceCloseAtTransactionEnd connections only if BEGIN was not sent on them	2021-11-01 17:43:04 +03:00
Jelte Fennema	57a0228c52	Fix string-concatenation warning on Clang 13 (#5425 ) Clang 13 complains about a suspicious string concatenation. It thinks we might have missed a comma. This adds parentheses to make it clear that concatenation is indeed what we meant.	2021-11-01 13:55:43 +03:00
naisila	796d56a7b1	Rename ddlJob->commandString to ddlJob->metadataSyncCommand	2021-10-29 23:45:43 +03:00
Ahmet Gedemenli	67dca4363d	Don't auto-undistribute user-added citus local tables (#5314 ) * Disable auto-undistribute for user-added citus local tables	2021-10-28 12:10:26 +03:00
Nils Dijk	f4297f774a	Bump mitmproxy version (#5334 ) There is a vulnerability in mitmproxy with the version we are using. It would be hard to exploit anything with regards to the artifacts we ship as it's only used in our test suite. Still it's good hygiene to _not_ use software with known vulnerabilities. This PR updates the version of python, mitmproxy and the crypto libraries used. The latest version of mitmproxy for python 3.6 is not patched, hence the upgrade of python. For our CI images this cascades into upgrading debian as well :) For CI we bake these versions in our images so we need to update them as well. Changes to the CI images: https://github.com/citusdata/the-process/pull/65	2021-10-27 17:57:13 +02:00
Jelte Fennema	a8cbeb1047	Fix docs of arbitrary configs (#5413 ) The old command would run none of the tests. The new command runs all of the tests for the given configs.	2021-10-27 17:16:24 +02:00
Philip Dubé	cc50682158	Fix typos. Spurred spotting "connectios" in logs	2021-10-25 13:54:09 +00:00
Jelte Fennema	3bdbfc3edf	Fix duplicate typedef which can cause compile failures (#5406 ) ColumnarScanDesc is already defined in columnar_tableam.h. Redefining it again causes a compiler error on some C compilers. Useful reference: https://bugzilla.redhat.com/show_bug.cgi?id=767538 Fixes #5404	2021-10-25 12:20:13 +00:00
Onder Kalaci	ce4c4540c5	Simplify 2PC decision in the executor It seems like the decision for 2PC is more complicated than it should be. With this change, we do one behavioral change. In essence, before this commit, when a SELECT task with replication factor > 1 is executed, the executor was triggering 2PC. And, in fact, the transaction manager (`ConnectionModifiedPlacement()`) was able to understand not to trigger 2PC when no modification happens. However, for transaction blocks like: BEGIN; -- a command that triggers 2PC -- A SELECT command on replication > 1 .. COMMIT; The SELECT used to be qualified as requiring 2PC. And, as a side-effect the executor was setting `xactProperties.errorOnAnyFailure = true;` So, the commands were failing at the time of execution. Now, they fail at the end of the transaction.	2021-10-23 09:06:28 +02:00
Onder Kalaci	575bb6dde9	Drop support for Inactive Shard placements Given that we do all operations via 2PC, there is no way for any placement to be marked as INACTIVE.	2021-10-22 18:03:35 +02:00
Önder Kalacı	b3299de81c	Drop support for citus.multi_shard_commit_protocol (#5380 ) In the past, we allowed users to manually switch to 1PC (e.g., one phase commit). However, with this commit, we don't. All multi-shard modifications are done via 2PC.	2021-10-21 14:01:28 +02:00
Marco Slot	df43868369	Remove PG11 expected upgrade_list_citus_objects output	2021-10-21 12:08:05 +02:00
Marco Slot	dafba6c242	Deprecate master_get_table_metadata UDF	2021-10-21 12:08:05 +02:00
Marco Slot	defb97b7f5	Support operator class parameters in indexes	2021-10-20 17:03:59 +02:00
Önder Kalacı	3f726c72e0	When replication factor > 1, all modifications are done via 2PC (#5379 ) With Citus 9.0, we introduced `citus.single_shard_commit_protocol` which defaults to 2PC. With this commit, we prevent any user from setting it to 1PC and drop support for `citus.single_shard_commit_protocol`. Although this might add some overhead for users, it is already the default behaviour (so less likely) and marking placements as INVALID is much worse.	2021-10-20 01:39:03 -07:00
Sait Talha Nisanci	a851211dbc	Run tests sequentially	2021-10-19 18:35:26 +03:00
Marco Slot	641ef9bd6f	Fix flappy subquery_append test	2021-10-19 15:29:01 +02:00
Sait Talha Nisanci	56abd3d501	Increase parallelism	2021-10-19 15:38:58 +03:00
Marco Slot	096660d61d	Remove master_apply_delete_command	2021-10-18 22:29:37 +02:00
Marco Slot	bece86b2f7	Add some subquery on append-distributed table tests	2021-10-18 21:11:16 +02:00
Marco Slot	93e79b9262	Never allow co-located joins of append-distributed tables	2021-10-18 21:11:16 +02:00
Marco Slot	b97e5081c7	Disable co-located joins for append-distributed tables	2021-10-18 21:11:16 +02:00
Marco Slot	dfad73d918	Disable implicit single re-partition joins for append tables	2021-10-18 21:11:16 +02:00
Marco Slot	2206e64e42	Disable single-repartition joins for append tables	2021-10-18 21:11:16 +02:00
Sait Talha Nisanci	6ff2083311	Remove base test as it is not useful anymore	2021-10-18 20:31:18 +03:00
Sait Talha Nisanci	7336c03c22	Add local-dist table joins to arbitrary configs	2021-10-18 20:31:18 +03:00
Önder Kalacı	31c8f279ac	Add helper UDFs to inspect object dependencies (#5293 ) - citus_get_all_dependencies_for_object: emulate what Citus would qualify as dependency when adding a new node - citus_get_dependencies_for_object: emulate what Citus would qualify as dependency when creating an object Example use: ```SQL -- find all the dependencies of table test SELECT pg_identify_object(t.classid, t.objid, t.objsubid) FROM (SELECT * FROM pg_get_object_address('table', '{test}', '{}')) as addr JOIN LATERAL citus_get_all_dependencies_for_object(addr.classid, addr.objid, addr.objsubid) as t(classid oid, objid oid, objsubid int) ON TRUE ORDER BY 1; ```	2021-10-18 14:46:49 +03:00
Halil Ozan Akgul	e3446692f3	Fix the bug by adding comma before the values	2021-10-15 18:42:23 +03:00
Halil Ozan Akgul	3fb996f6de	Fix the tests that fail with MX in columnar_schedule	2021-10-15 13:09:01 +03:00
Halil Ozan Akgul	b710e0064d	Fix tests that fail with MX in multi_schedule	2021-10-15 12:58:38 +03:00
Ahmet Gedemenli	35f6fe5f9f	Refactor/Improve PreprocessAlterTableStmtAttachPartition (#5366 ) * Refactor/Improve PreprocessAlterTableStmtAttachPartition	2021-10-14 11:39:39 +03:00
SaitTalhaNisanci	de61a89083	Fix sql_schedule_name problem (#5371 )	2021-10-13 13:10:00 +02:00
Hanefi Onaldi	3e64dc44c8	Fix some typos in comments (#5369 )	2021-10-13 13:00:39 +03:00
Önder Kalacı	af876bf452	Add value materialization test (#5368 )	2021-10-13 09:08:24 +02:00
SaitTalhaNisanci	a39859bc74	Remove unnecessary output (#5367 )	2021-10-13 09:28:01 +03:00
SaitTalhaNisanci	3f65751d43	Add an infrastructure to run same tests with arbitrary configs (#5316 ) To run tests in parallel use: ```bash make check-arbitrary-configs parallel=4 ``` To run tests sequentially use: ```bash make check-arbitrary-configs parallel=1 ``` To run only some configs: ```bash make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig,CitusSmallSharedPoolSizeConfig ``` To run only some test files with some config: ```bash make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig EXTRA_TESTS=dropped_columns_1 ``` To get a deterministic run, you can give the random seed: ```bash make check-arbitrary-configs parallel=4 seed=12312 ``` The `seed` will be in the output of the run. In our regular regression tests, we can see all the details about either planning or execution but this means we need to run the same query under different configs/cluster setups again and again, which is not really maintainable. When we don't care about the internals of how planning/execution is done but the correctness, especially with different configs this infrastructure can be used. With `check-arbitrary-configs` target, the following happens: - a bunch of configs are loaded, which are defined in `config.py`. These configs have different settings such as different shard count, different citus settings, postgres settings, worker amount, or different metadata. - For each config, a separate data directory is created for tests in `tmp_citus_test` with the config's name. - For each config, `create_schedule` is run on the coordinator to setup the necessary tables. - For each config, `sql_schedule` is run. `sql_schedule` is run on the coordinator if it is a non-mx cluster. And if it is mx, it is either run on the coordinator or a random worker. - Test results are checked if they match with the expected. When test results don't match, you can see the regression diffs in a config's datadir, such as `tmp_citus_tests/dataCitusSingleNodeClusterConfig`. We also have a PostgresConfig which runs all the test suite with Postgres. By default configs use regular user, but we have a config to run as a superuser as well. So the infrastructure tests: - Postgres vs Citus - Mx vs Non-Mx - Superuser vs regular user - Arbitrary Citus configs When you want to add a new test, you can add the create statements to `create_schedule` and add the sql queries to `sql_schedule`. If you are adding Citus UDFs that should be a NO-OP for Postgres, make sure to override the UDFs in `postgres.sql`. You can add your new config to `config.py`. Make sure to extend either `CitusDefaultClusterConfig` or `CitusMXBaseClusterConfig`. On the CI, upon a failure, all logfiles will be uploaded as artifacts, so you can check the artifacts tab. All the regressions will be shown as part of the job on CI. In your local, you can check the regression diffs in config's datadirs as in `tmp_citus_tests/dataCitusSingleNodeClusterConfig`.	2021-10-12 14:24:19 +03:00
Teja Mupparti	a8348047c5	Pushdown procedures with OUT parameters (#5348 )	2021-10-11 23:14:36 -07:00
Onur Tirtir	f7f4a93073	Remove get_relation_trigger_oid_compat	2021-10-11 11:53:00 +03:00
Onur Tirtir	a1e0511583	Remove get_relation_constraint_oid_compat	2021-10-11 11:53:00 +03:00
Ahmet Gedemenli	d19793c174	Add partitioning support for citus local tables Add/fix tests Fix creating partitions Add test for mx - partition creating case Enable cascading to partitioned tables Fix mx partition adding test Fix cascading through fkeys Style Disable converting with non-inherited fkeys Fix detach bug Early return in case of cascade & Add tests Style Fix undistribute_table bug & Fix test outputs Remove RemovePartitionRelationIds Test with undistribute_table Add test for mx+convert+undistribute Remove redundant usage of CreatePartitionedCitusLocalTable Add some comments Introduce bulk functions for generating attach/detach partition commands Fix: Convert partitioned tables after adding fkey Change the error message for partitions Introduce function ErrorIfPartitionTableAddedToMetadata Polish attach/detach command generation functions Use time_partitions for testing Move mx tests to citus_local_tables_mx Add new partitioned table to cascade test Add test with time series management UDFs Fix test output Fix: Assertion fail on relation access tracking Style Refactor creating partitioned citus local tables Remove CreatePartitionedCitusLocalTable Style Error out if converting multi-level table Revert some old tests Error out adding partitioned partition Polish Polish/address Fix create table partition of case Use CascadeOperationForRelationIdList if no cascade needed Fix create partition bug Revert / Add new tests to mx Style Fix dropping fkey bug Add test with IF NOT EXISTS Convert to CLT when doing ATTACH PARTITION Add comments Add more tests with time series management Edit the error message for converting the child Use OR instead of AND in ErrorIfUnsupportedAlterTableStmt Edit/improve tests Disable ddl prop when dropping default column definitions Disable/enable ddl prop just before/after the command Add comment Add sequence test Add trigger test Remove NeedCascadeViaForeignKeys Add one more insert to sequence test Add comment Style Fix test output shard ids Update comments Disable creating fkey on partitions Move partition check to CreateCitusLocalTable Add comment Add check for attaching multi-level partition Add test for pg_constraint Check pg_dist_partition in tests Add test inserting on the worker	2021-10-11 10:45:07 +03:00
Marco Slot	386d2567d4	Reduce reliance on append tables in regression tests	2021-10-08 21:27:14 +02:00
Halil Ozan Akgul	9c9d4b5eeb	Turn MX on by default	2021-10-08 18:17:21 +03:00
Naisila Puka	99d3785b5c	Fix flaky test in multi_fix_partition_shard_index_names.sql (#5364 )	2021-10-08 18:03:34 +03:00
Naisila Puka	d0390af72d	Add fix_partition_shard_index_names udf to fix currently broken names (#5291 ) * Add udf to include shardId in broken partition shard index names * Address reviews: rename index such that operations can be done on it * More comprehensive index tests * Final touches and formatting	2021-10-07 19:34:52 +03:00
Marco Slot	91b647024a	Fixes CREATE INDEX deparsing issue	2021-10-06 13:08:16 +02:00
Onur Tirtir	5d8f74bd0b	(Share) Lock buffer page when reading from columnar storage (#5338 ) Under high write concurrency, we were sometimes reading columnar metapage as all zeros. In `WriteToBlock()`, if `clear == true`, then it will clear the page before writing the new one, rather than just adding data to the page. That means any concurrent connection that is holding only a pin will be able to see the all-zero state between the `InitPage()` and the `memcpy_s()`. Moreover, postgres/storage/buffer/README states that: > Buffer access rules: > > 1. To scan a page for tuples, one must hold a pin and either shared or > exclusive content lock. To examine the commit status (XIDs and status bits) > of a tuple in a shared buffer, one must likewise hold a pin and either shared > or exclusive lock. For those reasons, we have to make sure to never keep a pin on the page without (at least) the shared lock, to avoid having such problems.	2021-10-06 11:57:02 +03:00
Halil Ozan Akgul	43d5853b6d	Fixes function names in comments	2021-10-06 09:24:43 +03:00
Hanefi Onaldi	a74409f24c	Bump Citus to 11.0devel	2021-10-01 22:21:22 +03:00
Onur Tirtir	fe72e8bb48	Discard index deletion requests made to columnarAM (#5331 ) A write operation might trigger index deletion if index already had dead entries for the key we are about to insert. There are two ways of index deletion: a) simple deletion b) bottom-up deletion (>= pg14) Since columnar_index_fetch_tuple never sets all_dead to true, columnarAM doesn't ever expect to receive simple deletion requests (columnar_index_delete_tuples) as we don't mark any index entries as dead. However, since columnarAM doesn't delete any dead entries via simple deletion, postgres might ask for a more comprehensive deletion (i.e.: bottom-up) at some point when pg >= 14. So with this commit, we start gracefully ignoring bottom-up deletion requests made to columnar_index_delete_tuples. Given that users can anyway "VACUUM FULL" their columnar tables, we don't see any problem in ignoring deletion requests.	2021-10-01 14:32:47 +03:00
Önder Kalacı	c2311b4c0c	Make (columnar.stripe) first_row_number index a unique constraint (#5324 ) * Make (columnar.stripe) first_row_number index a unique constraint Since stripe_first_row_number_idx is required to scan a columnar table, we need to make sure that it is created before doing anything with columnar tables during pg upgrades. However, a plain btree index is not a dependency of a table, so pg_upgrade cannot guarantee that stripe_first_row_number_idx gets created when creating columnar.stripe, unless we make it a unique "constraint". To do that, drop stripe_first_row_number_idx and create a unique constraint with the same name to keep the code change at minimum. * Add more pg upgrade tests for columnar * Fix a logic error in uprade_columnar_after test Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-30 10:51:56 +03:00
Jelte Fennema	97077c5c4a	Check more exit codes in upgrade tests (#5323 ) We were trying to find the cause for a strange update bug. We thought `pg_upgrade` succeeded and then were surprised that certain data was not in the database after the upgrade. Instead `pg_upgrade` had failed halfway through with an actionable error. It took us pretty long to realise this. This commit adds checking of exit codes to a lot more subprocess executions. That should make debugging in the future much easier.	2021-09-24 15:51:00 +02:00
Jeff Davis	d49d321eac	Columnar: only call BuildStripeMetadata() with heap tuple. BuildStripeMetadata() calls HeapTupleHeaderGetXmin(), which must only be called on a proper heap tuple with MVCC information. Make sure the caller passes the heap tuple, and not a datum tuple. Fixes #5318.	2021-09-23 15:51:01 -07:00
tejeswarm	a1604a87e6	Partition shards to be colocated with the parent shards	2021-09-22 14:47:04 -07:00
Onur Tirtir	77a2dd68da	Revoke read access to columnar.chunk from unprivileged user (#5313 ) Since this could expose chunk min/max values to unprivileged users.	2021-09-22 16:23:02 +03:00
Onur Tirtir	68335285b4	Columnar CustomScan: Pushdown BoolExpr's as we do before	2021-09-22 10:51:34 +03:00
Onur Tirtir	e6ed764f63	Check if xact id is in progress before checking if aborted (#5312 )	2021-09-21 21:20:31 +03:00
Onur Tirtir	f8b1ff7214	Add CheckCitusVersion() calls to columnarAM (#5308 ) Considering all code-paths that we might interact with a columnar table, add `CheckCitusVersion` calls to tableAM callbacks: - initializing table scan (`columnar_beginscan` & `columnar_index_fetch_begin`) - setting a new filenode for a relation (storage initialization or a table rewrite) - truncating the storage - inserting tuple (single and multi) Also add `CheckCitusVersion` call to: - drop hook (`ColumnarTableDropHook`) - `alter_columnar_table_set` & `alter_columnar_table_reset` UDFs	2021-09-20 17:26:41 +03:00
Onder Kalaci	cea937f52f	Add missing version checks for citus_internal_XXX functions	2021-09-20 09:54:35 +02:00
SaitTalhaNisanci	35ff513dfe	Give proper error while distributing a temp table (#5269 )	2021-09-17 14:34:40 +03:00
jeff-davis	6e8b19984e	Columnar: separate plan and runtime quals. (#5261 ) * Columnar: separate plain and exec quals. Make a clear separation between plain quals, which contain constants or extern params; and exec quals, which contain exec params and can't be evaluated until a rescan. Fixes #5258. * more vanilla tests Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-13 10:54:53 -07:00
jeff-davis	d48ceee238	Columnar: add method ReparameterizeCustomPathByChild. (#5275 ) When performing a partition-wise join, the planner will adjust paths parameterized by the parent rel to instead parameterize by the child rel directly. When this reparameterization happens, we also need to adjust the join quals to reference the child rather than the parent. Fixes #5257.	2021-09-13 10:33:48 -07:00
Onur Tirtir	ea61efb63a	Not flush writes until need to read them when doing index-scan on columnar (#5247 ) Not flush pending writes if given tid belongs to a "flushed" or "aborted" stripe write, or to an "in-progress" stripe write of another backend. That way, we would reduce the cases where we flush single-tuple stripes during index scan. To do that, we follow below steps for index look-up's: - Do not flush any pending writes and do stripe metadata look-up for given tid. If tuple with tid is found, then no need to do another look-up since we already found the tuple without needing to flush pending writes. - If tuple is not found without flushing pending writes, then we have two scenarios: - If given tid belongs to a pending write of my backend, then do stripe metadata look-up for given tid. But this time first flush any pending writes. - Otherwise, just return false from `index_fetch_tuple` since flushing pending writes wouldn't help.	2021-09-13 18:41:20 +02:00
Onur Tirtir	4ee0fb2758	Make sure to skip aborted writes when reading the first tuple (#5274 ) With `5825c44d5f`, we made the changes to skip aborted writes when scanning a columnar table. However, looks like we forgot to handle such cases for the very first call made to columnar_getnextslot. That means, that commit only considered the intermediate stripe read operations. However, functions called by columnar_getnextslot to find first stripe to read (ColumnarBeginRead & ColumnarRescan) were not caring about those aborted writes. To fix that, we teach AdvanceStripeRead to find the very first stripe to read, and then start using it where we were blindly calling FindNextStripeByRowNumber.	2021-09-13 11:50:53 +03:00
Burak Velioglu	ceec5d72e3	Swallow errors while aborting remote transactions	2021-09-10 11:06:16 +03:00
Naisila Puka	a69abe3be0	Fixes bug about int and smallint sequences on MX (#5254 ) * Introduce worker_nextval udf for int&smallint column defaults * Fix current tests and add new ones for worker_nextval	2021-09-09 23:41:07 +03:00
Nils Dijk	80a44a7b93	prevent double inclusion of columnar_tableam.h (#5266 ) Recently there are some warnings during the compilation of Citus. Part of the warnings come due to the `columnar_tableam.h` header not being properly guarded with defines and ifndef's. This PR fixes these warnings.	2021-09-09 17:37:58 +02:00
Onur Tirtir	be74518965	Improve memset calls made to reset bool arrays (#5262 )	2021-09-09 17:56:03 +03:00
Halil Ozan Akgul	19af1cef2f	Errors for CTEs with search clause Relevant PG commit: 3696a600e2292d43c00949ddf0352e4ebb487e5b	2021-09-09 13:48:24 +03:00
Marco Slot	f84164a000	Avoid switch to superuser in worker_merge_files_into_table	2021-09-09 11:00:29 +02:00
Marco Slot	04388e13b0	Add worker_append_table_to_shard permissions tests	2021-09-09 11:00:29 +02:00
Marco Slot	4faa49775b	Perform copy command as regular user in worker_append_table_to_shard	2021-09-09 11:00:29 +02:00
Hanefi Onaldi	9ae912a8c8	Prevent C-style comments in all directories (#5250 )	2021-09-09 11:54:58 +03:00
SaitTalhaNisanci	e3e0a028c7	return early in case we want to skip outer vars (#5259 )	2021-09-09 10:53:36 +03:00
Onur Tirtir	32e3e51ed4	Fix a compiler warning that we get on debian (#5260 )	2021-09-08 20:03:59 +03:00
Onur Tirtir	9935dfb958	Remove a flaky test from columnar_paths We already knew that it was flaky. Moreover, now it failed on my branch too. So removing it with this commit.	2021-09-08 14:15:22 +03:00
Onur Tirtir	be3914ae28	Prevent generating index-only "Path"s for columnar tables Previously, even when `EXPLAIN` output tells that we will do index-only scan, it was never the case since columnar tables don't have the visibility fork that postgres is looking for. For this reason, visibility check done in `IndexOnlyNext->VM_ALL_VISIBLE` code-path was always returning false and postgres was reading the tuple from the columnar relation itself.	2021-09-08 14:14:24 +03:00
Onur Tirtir	cc49e63222	Not read heaptuple after closing pg_rewrite (#5255 )	2021-09-08 13:03:17 +02:00
Onur Tirtir	3340f17c4e	Prevent planner from choosing parallel scan for columnar tables (#5245 ) Previously, for regular table scans, we were setting `RelOptInfo->partial_pathlist` to `NIL` via `set_rel_pathlist_hook` to discard scan `Path`s that need to use any parallel workers, this was working nicely. However, when building indexes, this hook doesn't get called so we were not able to prevent spawning parallel workers when building an index. For this reason, `9b4dc2f804` added basic implementation for `columnar_parallelscan_*` callbacks but also made some changes to skip using those workers when building the index. However, now that we are doing stripe reservation in two stages, we call `heap_inplace_update` at some point to complete stripe reservation. However, postgres throws an error if we call `heap_inplace_update` during a parallel operation, even if we don't actually make use of those workers. For this reason, with this pr, we make sure to not generate scan `Path`s that need to use any parallel workers by using `get_relation_info_hook`. This is indeed useful to prevent spawning parallel workers during index builds.	2021-09-08 13:53:43 +03:00
Onur Tirtir	5825c44d5f	Handle aborted writes properly when scanning a columnar table (#5244 ) If it is certain that we will not use any `parallel_worker`s for a columnar table, then stripe entries inserted by aborted transactions become visible to `SnapshotAny` and that causes `REINDEX` to fail by throwing a duplicate key error. To fix that: * consider three states for a stripe write operation: "flushed", "aborted", or "in-progress", * make sure to have a clear separation between them, and * act according to those three states when reading from a columnar table	2021-09-08 13:26:11 +03:00
Onur Tirtir	5dc619162d	Add valgrind test target for multi-1 (#5251 )	2021-09-07 16:27:34 +03:00
Jelte Fennema	bb5c494104	Enable binary encoding by default on PG14 Since PG14 we can now use binary encoding for arrays and composite types that contain user defined types. This was fixed in this commit in Postgres: `670c0a1d47` This change starts using that knowledge, by not necessarily falling back to text encoding anymore for those types. While doing this and testing a bit more I found various cases where binary encoding would fail that our checks didn't cover. This fixes those cases and adds tests for those. It also fixes EXPLAIN ANALYZE never using binary encoding, which was a leftover of a workaround that was not necessary anymore. Finally, it changes the default for both `citus.enable_binary_protocol` and `citus.binary_worker_copy_format` to `true` for PG14 and up. In our cloud offering `binary_worker_copy_format` already was true by default. `enable_binary_protocol` had some bug with MX and user defined types, this bug was fixed by the above mentioned fixes.	2021-09-06 10:27:29 +02:00
Burak Velioglu	c3895f35cd	Add helper UDFs for easy time partition management - get_missing_time_partition_ranges: Gets the ranges of missing partitions for the given table, interval and range unless any existing partition conflicts with calculated missing ranges. - create_time_partitions: Creates partitions by getting range values from get_missing_time_partition_ranges. - drop_old_time_partitions: Drops partitions of the table older than given threshold.	2021-09-03 23:03:13 +03:00
Onur Tirtir	2b71263e40	Align columnar path costing functions (#5239 ) * Rename RecostColumnarPaths to CostColumnarPaths * Rename RecostColumnarIndexPath to CostColumnarIndexPath * Reorder args of CostColumnarScan to align with other two costing functions * Not adjust index scan start-up cost * Rename ColumnarIndexScanAddTotalCost to ColumnarIndexScanAdditionalCost * Reflect that index scan will at least read one stripe in totalCost calculation * Organize declarations in columnar_customscan.c	2021-09-03 19:37:42 +03:00
jeff-davis	cc58b58f73	Columnar: reserve metapage flag for UNLOGGED support. (#5237 ) Reserve space in the metapage for a flag to support UNLOGGED tables in the future without a metapage upgrade.	2021-09-03 08:40:55 -07:00
Halil Ozan Akgul	7fadfb74bb	Adds error message for REINDEX TABLE queries on distributed partitioned tables	2021-09-03 16:46:42 +03:00
Sait Talha Nisanci	3ad3bbba84	Apply latest version compat without conflicts	2021-09-03 16:09:59 +03:00
Sait Talha Nisanci	0b67fcf81d	Fix style	2021-09-03 16:09:59 +03:00
Halil Ozan Akgul	e1f5520e1a	Adds propagation of ALTER TABLE .. ALTER COLUMN .. SET COMPRESSION ..	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	902af39a04	Add join alias tests (#5233 ) PG COMMIT: 055fee7eb4dcc78e58672aef146334275e1cc40d	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	2a2ebab1fa	Add tests for jsonb subscripting (#5232 ) PG commit: 676887a3b0b8e3c0348ac3f82ab0d16e9a24bd43	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	2b263f9a2a	ALTER STATISTICS .. OWNER TO CURRENT_ROLE (#5225 ) (cherry picked from commit 42322caf90ca094777aa01376e02d1187afc1560)	2021-09-03 15:44:28 +03:00
Onder Kalaci	82a3b20fb3	Fix flaky test	2021-09-03 15:44:28 +03:00
Onder Kalaci	5844ab286c	Support OUT parameters in procedure pushdown delegation In PG 14, procedures can have OUT parameters. In Citus' procedure delegation framework, we need to adjust the function expression to get the outargs parameters. Relevant PG change: `e56bce5d43`	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	1ff7186d20	Extended statistics on expressions - PG14 a4d75c8 (#5224 ) (cherry picked from commit 1268415f123b5d99cfacfe207c8670240efc1c00)	2021-09-03 15:44:28 +03:00
Halil Ozan Akgul	113d5d6615	Adds support for column compression in table distribution	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	6fbdeb38a8	ALTER TABLE ... DETACH PARTITION ... CONCURRENTLY - PG14 #71f4c8c (#5223 )	2021-09-03 15:44:28 +03:00
Onder Kalaci	c431bb2e45	Add support for "COPY dist/ref tables FROM" progress report Simply call Postgres' function to report the progress on each row received. Note that we currently do not support "COPY dist/ref TO .." progress report nicely. Citus has some specialized logic to support "COPY dist/ref TO .." such that it either converts the underlying command into "COPY (SELECT * FROM dist/ref ) ..." or sends COPY command to shards directly. In the former case, "tuples_processed" is only updated when the executor returns all the tuples, so the progress is not accurate. In the latter case, Citus can actually implement the progress report. But, for the sake of consistency, we prefer to not implement at all. Added to PG 14 with https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=8a4f618e7ae3cb11b0b37d0f06f05c8ff905833f	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	66303785f3	Add option PROCESS_TOAST to VACUUM - PG14 #7cb3048 (#5219 ) (cherry picked from commit e63bdfc49f9203db14ef77313c1d5e3461a84a32)	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	35a3f7240d	CHANGELOG: Allow REINDEX to change the tablespace of the new index	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	4e85d9ffce	Add empty pg14 sql file	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	307eb81278	Fix failure for 1pc_copy_hash	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	a6c40ebd14	Fix multi_follower_dml When the_table is empty, we don't get an error with pg14 anymore so we replace it with generate_series so that we get the error.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	b16dadbe7c	Avoid NOTICE message to avoid an alternative output with pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6ff609fa86	Add alternative output for data_types It seems like there is a problem with Postgres14 with SELECT DISTINCT COUNT. The issue is reported to Postgres and an alternative output is added. We can remove the alternative output when the issue is fixed on PG. If this is not an issue on PG (which is unlikely) we should consider some other solution.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2fa1e5ffe3	Use the default max_parallel_workers_per_gather for vanilla	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	4b951a2ed9	Add alternative output for multi-mx	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	96964aeee5	Turn off debug for one query to avoid adding an alternative output	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e7607b6bed	Add a helper function to check explain has a single task In order to avoid adding an alternative output, a function to check if a given explain plan has a single task is added. This doesn't change what the changed tests intend to do.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e0faf34417	turn off costs in columnar_indexes explain query	2021-09-03 15:41:28 +03:00
Nils Dijk	e63302d012	update error messages for libpq 14beta3	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2656d885f9	Rewrite AppendColumnNames for Pg14 Postgres changed stats expression types as of PG14. Hence we needed to write the AppendColumnNames method. Also they removed the error on PG side so we remove it as well. Relevant commits on pg14: a4d75c86bf15220df22de0a92c819ecef9db3849 388e75ad33489b77cfb9a8590a91e9287d8fb960	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	d1c0403055	Disable Query Identifier calculation in tests When queryId is not 0 and verbose is true, the query identifier is emitted to the explain output. This is breaking Postgres outputs. We disable the query identifier calculation in the tests. Commit on PG that introduced the query identifier in the explain output: 4f0b0966c866ae9f0e15d7cc73ccf7ce4e1af84b	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	7c0389a7a1	Update propagate extension commands test for pg12 The test file was changed slightly to avoid adding an alternative output. We update the existing alternative output for pg12 with the new changes.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	cd402b6a2b	Add alternative output for pg12 for window_functions	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	c31b0c2652	Sets next_shard_id at partition_wise_join test	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	9fc4c27b08	Readds deleted resultRelInfo changes for previous PG versions These changes were removed in commit: Introduces ExecSimpleRelationInsert_compat and modifyStateResultRelInfo macros. We shouldn't have removed them but instead kept them for versions before PG14.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	aca2b8b675	Add alternative output for isolation_master_update_node	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	f3fa133caa	Bind seg version to 1.3 in isolation_textension_commands	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	75fff14792	Turn off VERBOSE to avoid alternative output With VERBOSE option, as of PG14, we get a line with "Query Identifier".	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6b65dbc492	Add partition_wise_join to avoid big alternative output There was a small part in multi_partitioning that would need an alternative output for pg14. Instead of adding an alternative for the whole file, we created a new file, called partition_wise_join.sql and added the alternative output for that.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	375a1adc9e	Check if extversion is the same for seg extension When we check the exact version of the seg extension, it becomes a problem when its version changes, such as from 1.3 to 1.4. So we now check that the version is the same across the whole cluster.	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	ca0d4c3bde	Includes pg_version_constants.h in columnar_version_compat.h	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	7823e49219	Introduces pg_get_statisticsobj_worker_compat macro Relevant PG commit: a4d75c86bf15220df22de0a92c819ecef9db3849	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	f16d5e1833	Introduces make_simple_restrictinfo_compat and pull_varnos_compat macros make_simple_restrictinfo and pull_varnos functions now have a new parameter. These new macros let us pass this new parameter for PG14 and omit it for previous versions. Relevant PG commit: 55dc86eca70b1dc18a79c141b3567efed910329d	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	9b6ce10892	Removes password outputs from alter_role_propagation tests	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	20c32a7a1d	Add alternative output for multi_deparse_function Postgres tightened up its checks for invalid GUC names hence we started to get an alternative output for one of our tests. We add an alternative output since the file is relatively small. Commit on PG: 3db826bd55cd1df0dd8c3d811f8e5b936d7ba1e4	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	256e7d1540	Add alternative output for window_functions	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	df9b7149c3	Add some normalization rules for pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	dc81cae18f	Turn off COSTS to avoid alternative output for pg14	2021-09-03 15:41:28 +03:00

3681 Commits (6da2d41e00eb33d4257e46ffbaaed131e2a89f8b)