citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	91b647024a	Fixes CREATE INDEX deparsing issue	2021-10-06 13:08:16 +02:00
Hanefi Onaldi	a74409f24c	Bump Citus to 11.0devel	2021-10-01 22:21:22 +03:00
Onur Tirtir	fe72e8bb48	Discard index deletion requests made to columnarAM (#5331 ) A write operation might trigger index deletion if index already had dead entries for the key we are about to insert. There are two ways of index deletion: a) simple deletion b) bottom-up deletion (>= pg14) Since columnar_index_fetch_tuple never sets all_dead to true, columnarAM doesn't ever expect to receive simple deletion requests (columnar_index_delete_tuples) as we don't mark any index entries as dead. However, since columnarAM doesn't delete any dead entries via simple deletion, postgres might ask for a more comprehensive deletion (i.e.: bottom-up) at some point when pg >= 14. So with this commit, we start gracefully ignoring bottom-up deletion requests made to columnar_index_delete_tuples. Given that users can anyway "VACUUM FULL" their columnar tables, we don't see any problem in ignoring deletion requests.	2021-10-01 14:32:47 +03:00
Önder Kalacı	c2311b4c0c	Make (columnar.stripe) first_row_number index a unique constraint (#5324 ) * Make (columnar.stripe) first_row_number index a unique constraint Since stripe_first_row_number_idx is required to scan a columnar table, we need to make sure that it is created before doing anything with columnar tables during pg upgrades. However, a plain btree index is not a dependency of a table, so pg_upgrade cannot guarantee that stripe_first_row_number_idx gets created when creating columnar.stripe, unless we make it a unique "constraint". To do that, drop stripe_first_row_number_idx and create a unique constraint with the same name to keep the code change at minimum. * Add more pg upgrade tests for columnar * Fix a logic error in uprade_columnar_after test Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-30 10:51:56 +03:00
Jelte Fennema	97077c5c4a	Check more exit codes in upgrade tests (#5323 ) We were trying to find the cause for a strange update bug. We thought `pg_upgrade` succeeded and then were surprised that certain data was not in the database after the upgrade. Instead `pg_upgrade` had failed halfway through with an actionable error. It took us pretty long to realise this. This commit adds checking of exit codes to a lot more subprocess executions. That should make debugging in the future much easier.	2021-09-24 15:51:00 +02:00
tejeswarm	a1604a87e6	Parition shards to be colocated with the parent shards	2021-09-22 14:47:04 -07:00
Onur Tirtir	77a2dd68da	Revoke read access to columnar.chunk from unprivileged user (#5313 ) Since this could expose chunk min/max values to unprivileged users.	2021-09-22 16:23:02 +03:00
Onur Tirtir	68335285b4	Columnar CustomScan: Pushdown BoolExpr's as we do before	2021-09-22 10:51:34 +03:00
Onur Tirtir	f8b1ff7214	Add CheckCitusVersion() calls to columnarAM (#5308 ) Considering all code-paths that we might interact with a columnar table, add `CheckCitusVersion` calls to tableAM callbacks: - initializing table scan (`columnar_beginscan` & `columnar_index_fetch_begin`) - setting a new filenode for a relation (storage initializiation or a table rewrite) - truncating the storage - inserting tuple (single and multi) Also add `CheckCitusVersion` call to: - drop hook (`ColumnarTableDropHook`) - `alter_columnar_table_set` & `alter_columnar_table_reset` UDFs	2021-09-20 17:26:41 +03:00
SaitTalhaNisanci	35ff513dfe	Give proper error while distributing a temp table (#5269 )	2021-09-17 14:34:40 +03:00
jeff-davis	6e8b19984e	Columnar: separate plan and runtime quals. (#5261 ) * Columnar: separate plain and exec quals. Make a clear separation between plain quals, which contain constants or extern params; and exec quals, which contain exec params and can't be evaluated until a rescan. Fixes #5258. * more vanilla tests Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-13 10:54:53 -07:00
jeff-davis	d48ceee238	Columnar: add method ReparameterizeCustomPathByChild. (#5275 ) When performing a partition-wise join, the planner will adjust paths parameterized by the parent rel to instead parameterize by the child rel directly. When this reparameterization happens, we also need to adjust the join quals to reference the child rather than the parent. Fixes #5257.	2021-09-13 10:33:48 -07:00
Onur Tirtir	ea61efb63a	Not flush writes until need to read them when doing index-scan on columnar (#5247 ) Not flush pending writes if given tid belongs to a "flushed" or "aborted" stripe write, or to an "in-progress" stripe write of another backend. That way, we would reduce the cases where we flush single-tuple stripes during index scan. To do that, we follow below steps for index look-up's: - Do not flush any pending writes and do stripe metadata look-up for given tid. If tuple with tid is found, then no need to do another look-up since we already found the tuple without needing to flush pending writes. - If tuple is not found without flushing pending writes, then we have two scenarios: - If given tid belongs to a pending write of my backend, then do stripe metadata look-up for given tid. But this time first flush any pending writes. - Otherwise, just return false from `index_fetch_tuple` since flushing pending writes wouldn't help.	2021-09-13 18:41:20 +02:00
Onur Tirtir	4ee0fb2758	Make sure to skip aborted writes when reading the first tuple (#5274 ) With `5825c44d5f`, we made the changes to skip aborted writes when scanning a columnar table. However, looks like we forgot to handle such cases for the very first call made to columnar_getnextslot. That means, that commit only considered the intermediate stripe read operations. However, functions called by columnar_getnextslot to find first stripe to read (ColumnarBeginRead & ColumnarRescan) were not caring about those aborted writes. To fix that, we teach AdvanceStripeRead to find the very first stripe to read, and then start using it where were blindly calling FindNextStripeByRowNumber.	2021-09-13 11:50:53 +03:00
Naisila Puka	a69abe3be0	Fixes bug about int and smallint sequences on MX (#5254 ) * Introduce worker_nextval udf for int&smallint column defaults * Fix current tests and add new ones for worker_nextval	2021-09-09 23:41:07 +03:00
Halil Ozan Akgul	19af1cef2f	Errors for CTEs with search clause Relevant PG commit: 3696a600e2292d43c00949ddf0352e4ebb487e5b	2021-09-09 13:48:24 +03:00
Marco Slot	04388e13b0	Add worker_append_table_to_shard permissions tests	2021-09-09 11:00:29 +02:00
SaitTalhaNisanci	e3e0a028c7	return early in case we want to skip outer vars (#5259 )	2021-09-09 10:53:36 +03:00
Onur Tirtir	9935dfb958	Remove a flaky test from columnar_paths We already knew that it was flaky. Moreover, now it failed on my branch too. So removing it with this commit.	2021-09-08 14:15:22 +03:00
Onur Tirtir	be3914ae28	Prevent generating index-only "Path"s for columnar tables Previously, even when `EXPLAIN` output tells that we will do index-only scan, it was never the case since columnar tables don't have the visibility fork that postgres is looking for. For this reason, visibility check done in `IndexOnlyNext->VM_ALL_VISIBLE` code-path was always returning false and postgres was reading the tuple from the columnar relation itself.	2021-09-08 14:14:24 +03:00
Onur Tirtir	3340f17c4e	Prevent planner from choosing parallel scan for columnar tables (#5245 ) Previously, for regular table scans, we were setting `RelOptInfo->partial_pathlist` to `NIL` via `set_rel_pathlist_hook` to discard scan `Path`s that need to use any parallel workers, this was working nicely. However, when building indexes, this hook doesn't get called so we were not able to prevent spawning parallel workers when building an index. For this reason, `9b4dc2f804` added basic implementation for `columnar_parallelscan_*` callbacks but also made some changes to skip using those workers when building the index. However, now that we are doing stripe reservation in two stages, we call `heap_inplace_update` at some point to complete stripe reservation. However, postgres throws an error if we call `heap_inplace_update` during a parallel operation, even if we don't actually make use of those workers. For this reason, with this pr, we make sure to not generate scan `Path`s that need to use any parallel workers by using `get_relation_info_hook`. This is indeed useful to prevent spawning parallel workers during index builds.	2021-09-08 13:53:43 +03:00
Onur Tirtir	5825c44d5f	Handle aborted writes properly when scanning a columnar table (#5244 ) If it is certain that we will not use any `parallel_worker`s for a columnar table, then stripe entries inserted by aborted transactions become visible to `SnapshotAny` and that causes `REINDEX` to fail by throwing a duplicate key error. To fix that: * consider three states for a stripe write operation: "flushed", "aborted", or "in-progress", * make sure to have a clear separation between them, and * act according to those three states when reading from a columnar table	2021-09-08 13:26:11 +03:00
Onur Tirtir	5dc619162d	Add valgrind test target for multi-1 (#5251 )	2021-09-07 16:27:34 +03:00
Jelte Fennema	bb5c494104	Enable binary encoding by default on PG14 Since PG14 we can now use binary encoding for arrays and composite types that contain user defined types. This was fixed in this commit in Postgres: `670c0a1d47` This change starts using that knowledge, by not necessarily falling back to text encoding anymore for those types. While doing this and testing a bit more I found various cases where binary encoding would fail that our checks didn't cover. This fixes those cases and adds tests for those. It also fixes EXPLAIN ANALYZE never using binary encoding, which was a leftover of workaround that was not necessary anymore. Finally, it changes the default for both `citus.enable_binary_protocol` and `citus.binary_worker_copy_format` to `true` for PG14 and up. In our cloud offering `binary_worker_copy_format` already was true by default. `enable_binary_protocol` had some bug with MX and user defined types, this bug was fixed by the above mentioned fixes.	2021-09-06 10:27:29 +02:00
Burak Velioglu	c3895f35cd	Add helper UDFs for easy time partition management - get_missing_time_partition_ranges: Gets the ranges of missing partitions for the given table, interval and range unless any existing partition conflicts with calculated missing ranges. - create_time_partitions: Creates partitions by getting range values from get_missing_time_partition_ranges. - drop_old_time_partitions: Drops partitions of the table older than given threshold.	2021-09-03 23:03:13 +03:00
Onur Tirtir	2b71263e40	Align columnar path costing functions (#5239 ) * Rename RecostColumnarPaths to CostColumnarPaths * Rename RecostColumnarIndexPath to CostColumnarIndexPath * Reorder args of CostColumnarScan to align with other two costing functions * Not adjust index scan start-up cost * Rename ColumnarIndexScanAddTotalCost to ColumnarIndexScanAdditionalCost * Reflect that index scan will at least read one stripe in totalCost calculation * Organize declarations in columnar_customscan.c	2021-09-03 19:37:42 +03:00
Halil Ozan Akgul	7fadfb74bb	Adds error message for REINDEX TABLE queries on distributed partitioned tables	2021-09-03 16:46:42 +03:00
Sait Talha Nisanci	0b67fcf81d	Fix style	2021-09-03 16:09:59 +03:00
Halil Ozan Akgul	e1f5520e1a	Adds propagation of ALTER TABLE .. ALTER COLUMN .. SET COMPRESSION ..	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	902af39a04	Add join alias tests (#5233 ) PG COMMIT: 055fee7eb4dcc78e58672aef146334275e1cc40d	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	2a2ebab1fa	Add tests for jsonb subscripting (#5232 ) PG commit: 676887a3b0b8e3c0348ac3f82ab0d16e9a24bd43	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	2b263f9a2a	ALTER STATISTICS .. OWNER TO CURRENT_ROLE (#5225 ) (cherry picked from commit 42322caf90ca094777aa01376e02d1187afc1560)	2021-09-03 15:44:28 +03:00
Onder Kalaci	82a3b20fb3	Fix flaky test	2021-09-03 15:44:28 +03:00
Onder Kalaci	5844ab286c	Support OUT parameters in procedure pushdown delegation In PG 14, procedures can have OUT parameters. In Citus' procedure delegation framework, we need to adjust the function expression to get the outargs parameters. Releven PG change: `e56bce5d43`	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	1ff7186d20	Extended statistics on expressions - PG14 a4d75c8 (#5224 ) (cherry picked from commit 1268415f123b5d99cfacfe207c8670240efc1c00)	2021-09-03 15:44:28 +03:00
Halil Ozan Akgul	113d5d6615	Adds support for column compression in table distribution	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	6fbdeb38a8	ALTER TABLE ... DETACH PARTITION ... CONCURRENTLY - PG14 #71f4c8c (#5223 )	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	66303785f3	Add option PROCESS_TOAST to VACUUM - PG14 #7cb3048 (#5219 ) (cherry picked from commit e63bdfc49f9203db14ef77313c1d5e3461a84a32)	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	35a3f7240d	CHANGELOG: Allow REINDEX to change the tablespace of the new index	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	4e85d9ffce	Add empty pg14 sql file	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	307eb81278	Fix failure for 1pc_copy_hash	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	a6c40ebd14	Fix multi_follower_dml When the_table is emtpy, we don't get an error with pg14 anymore so we replace it generate_series so that we get the error.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	b16dadbe7c	Avoid NOTICE message to avoid an alternative output with pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6ff609fa86	Add alternative output for data_types It seems like there is a problem with Postgres14 with SELECT DISTINCT COUNT. The issue is reported to Postgres and an alternative output is added. We can remove the alternative output when the issue is fixed on PG. If this is not an issue on PG(which is unlikely) we should consider some other solution.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2fa1e5ffe3	Use the default max_parallel_workers_per_gather for vanilla	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	4b951a2ed9	Add alternative output for multi-mx	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	96964aeee5	Turn off debug for one query to avoid adding an alternative output	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e7607b6bed	Add a helper function to check explain has a single task In order to avoid adding an alternative output, a function to check if a given explan plan has a single task added. This doesn't change what the changed tests intend to do.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e0faf34417	turn off costs in columnar_indexes explain query	2021-09-03 15:41:28 +03:00
Nils Dijk	e63302d012	update error messages for libpq 14beta3	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2656d885f9	Rewrite AppendColumnNames for Pg14 Postgres changed stats expression types as of PG14. Hence we needed to write the AppendColumnNames method. Also they removed the error on PG side so we remove it as well. Relevant commits on pg14: a4d75c86bf15220df22de0a92c819ecef9db3849 388e75ad33489b77cfb9a8590a91e9287d8fb960	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	d1c0403055	Disable Query Idenfifier calculation in tests When queryId is not 0 and verbose is true, the query identifier is emitted to the explain output. This is breaking Postgres outputs. We disable de query identifier calculation in the tests. Commit on PG that introduced the query identifier in the explain output: 4f0b0966c866ae9f0e15d7cc73ccf7ce4e1af84b	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	7c0389a7a1	Update propagate extension commands test for pg12 The test file was changes slightly to avoid adding an alternative output. We update the existing alternative output for pg12 with the new changes.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	cd402b6a2b	Add alternative output for pg12 for window_functions	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	c31b0c2652	Sets next_shard_id at partition_wise_join test	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	aca2b8b675	Add alternative output for isolation_master_update_node	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	f3fa133caa	Bind seg version to 1.3 in isolation_textension_commands	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	75fff14792	Turn off VERBOSE to avoid alternative output With VERBOSE option, as of PG14, we get a line with "Query Identifier".	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6b65dbc492	Add partition_wise_join to avoid big alternative output There was a small part in multi_partitioning that would need an alternative output for pg14. Instead of adding an alternative for the whole file, we created a new file, called partition_wise_join.sql and added the alternative output for that.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	375a1adc9e	Check if extversion is the same for seg extension When we check the exact version of the seg extension, it becomes a problem when its version changes, such as from 1.3 to 1.4. So now we modified the changes to check for that the version is the same in all the cluster.	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	9b6ce10892	Removes password outputs from alter_role_propagation tests	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	20c32a7a1d	Add alternative output for multi_deparse_function Postgres tightened up its checks for invalid GUC names hence we started to get an alternative output for one of our tests. We add an alternative output since the file is relatively small. Commit on PG: 3db826bd55cd1df0dd8c3d811f8e5b936d7ba1e4	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	256e7d1540	Add alternative output for window_functions	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	df9b7149c3	Add some normalization rules for pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	dc81cae18f	Turn off COSTS to avoid alternative output for pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	fb8671f291	Change pg13 test to not differ with pg14 to avoid adding alternative output	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	3f5c178c93	Remove VERBOSE output to make pg14 and pg13 output the same	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	8ef94dc1f5	Changes array_cat argument type from anyarray to anycompatiblearray Relevant PG commit: 9e38c2bb5093ceb0c04d6315ccd8975bd17add66 fix array_cat_agg for pg upgrades array_cat_agg now needs to take anycompatiblearray instead of anyarray because array_cat changed its type from anyarray to anycompatiblearray with pg14. To handle upgrades correctly, we drop the aggregate in citus_pg_prepare_upgrade. To be able to drop it, we first remove the dependency from pg_depend. Then we create the right aggregate in citus_finish_pg_upgrade and we also add the dependency back to pg_depend.	2021-09-03 15:41:28 +03:00
jeff-davis	4718b6bcdf	Generate parameterized paths for columnar scans. (#5172 ) Allow ColumnarScans to push down join quals by generating parameterized paths. This significantly expands the utility of chunk group filtering, making a ColumnarScan behave similar to an index when on the inner of a nested loop join. Also, evaluate all parameters on beginscan/rescan, which also works for external parameters. Fixes #4488.	2021-09-02 22:22:48 -07:00
Onur Tirtir	37d0ecfbb7	Show projected cols for columnar tables in EXPLAIN output	2021-09-02 19:05:32 +03:00
Naisila Puka	4fb05efabb	Distributes partition-to-be table before ProcessUtility (#5191 ) * Skip ALTER TABLE constraint checks while planning * Revert previous commit's solution, keep tests * Distribute partition-to-be table before ProcessUtility * Acquire locks in PreprocessAlterTableStmtAttachPartition	2021-09-02 13:07:42 +03:00
Onur Tirtir	889a2731cb	Split columnar stripe reservation into two phases (#5188 ) Previously, we were doing `first_row_number` reservation for the first row written to current `WriteState` but were doing `stripe_id` reservation when flushing the `WriteState` and were inserting the related record to `columnar.stripe` at that time as well. However, inserting `columnar.stripe` record at flush-time is problematic. This is because, as told in #5160, if relation has any index-based constraints and if there are two concurrent writes that are inserting conflicting key values for that constraint, then postgres relies on `tableAM->fetch_index_tuple` (=`columnar_fetch_index_tuple`) callback to return `true` when indexAM is checking against possible constraint violations. However, pending writes of other backends are not visible to concurrent sessions in columnar since we were not inserting the stripe metadata record until flushing the stripe. With this commit, we split stripe reservation into two phases: i) Reserve `stripe_id` and insert a "dummy" record to `columnar.stripe` at the very same time we reserve `first_row_number`, i.e. when writing the first row to the current `WriteState`. ii) At flush time, do the storage level allocation and complete the missing fields of the dummy record inserted into `columnar.stripe` during i). That way, any concurrent writes would be able to check against possible constraint violations by using `SnapshotDirty` when scanning `columnar.stripe`. Note that `columnar_fetch_index_tuple` still wouldn't be able to fill the output tupleslot for the requested tid but it would at least return `true` for such index look-up's and we believe this should be sufficient for the caller indexAM callback to make the concurrent writer block on prior one. That is how we fix #5160. Only downside of reserving `stripe_id` at the same time we reserve `first_row_number` is that now any aborted writes would also waste some amount of `stripe_id` as in the case of `first_row_number` but we are just wasting them one-by-one. Considering the fact that we waste `first_row_number` by the amount stripe row limit (=150k by default) in such cases, this shouldn't be important at all.	2021-09-02 11:49:14 +03:00
Onur Tirtir	bf4dfad6f7	Update curcid of given snapshot if it is MVCC Before starting to scan a columnar table, we always flush the pending writes to disk. However, we increment command counter after modifying metadata tables. On the other hand, now that we _don't always use_ xact snapshot to scan a columnar table, writes that we just flushed might not be visible to the query that just flushed pending writes to disk since curcid of provided snapshot would become smaller than the command id being used when modifying metadata tables. To give an example, before this change, below was a possible scenario due to the changes that we made to use the correct snapshot. ```sql CREATE TABLE t(a int, b int) USING columnar; BEGIN; INSERT INTO t VALUES (5, 10); SELECT * FROM t; ┌───┬───┐ │ a │ b │ ├───┼───┤ └───┴───┘ (0 rows) SELECT * FROM t; ┌───┬────┐ │ a │ b │ ├───┼────┤ │ 5 │ 10 │ └───┴────┘ (1 row) ```	2021-09-02 11:11:59 +03:00
Naisila Puka	acb5ae6ab6	Skip dropping shards when we know it's a partition (#5176 )	2021-08-31 17:41:37 +03:00
SaitTalhaNisanci	5ae01303d4	Use get_attnum to find the attribute number of target entry (#5220 ) * Use get_attnum to find the attribute number of target entry	2021-08-31 16:47:19 +03:00
Jelte Fennema	481f8be084	Fix crash in shard rebalancer when no distributed tables exist (#5205 ) The logging of the amount of ignored moves crashed when no distributed tables existed in a cluster. This also fixes in passing that the logging of ignored moves logs the correct number of ignored moves if there exist multiple colocation groups and all are rebalanced at the same time.	2021-08-31 14:15:24 +02:00
SaitTalhaNisanci	d50830d4cc	Update failure tests README (#5197 ) * Update failure tests README I keep finding this page when trying to run failure tests, so updating the README that way: https://github.com/pypa/pipenv/issues/3363#issuecomment-452171564 Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com> Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2021-08-26 12:35:06 +03:00
Hanefi Onaldi	7e39c7ea83	Replace master with citus in logs and comments (#5210 ) I replaced - master_add_node, - master_add_inactive_node - master_activate_node with - citus_add_node, - citus_add_inactive_node - citus_activate_node respectively.	2021-08-26 11:31:17 +03:00
SaitTalhaNisanci	b923d51fc6	Bump pg12 and pg13 images to pg12.8 and pg13.8 (#5208 ) In our testing infra structure, even though we use pinned versions of postgres, the auxiliary libraries might pull in newer versions. This is for example the case for libpq, which will now use the libpq libraries from 14beta3. The changes in this PR are a lot due to the libpq changes. We also have changed the citus version that is used as a base for the citus upgrades, from 10.0 to 10.1 . This caused columnar to enforce some extra limits on the settings, which conflicted with our upgrade tests. The changes in failure tests are due to the libpq changes. There are also a lot of changes on isolation tests outputs, hence we updated all of them. Co-authored-by: Nils Dijk <nils@citusdata.com>	2021-08-25 16:04:57 +03:00
Onur Tirtir	5af839ada0	Not print metapage.reserved_offset in regression tests (#5168 ) * We were anyway not testing reserved_offset in any of those tests but other fields. * This only happens with compressed columnar tables and is because the libzstd/liblz4 versions that we have on exttester ci image might be different than what we might have on our local environments.	2021-08-23 11:07:10 +03:00
jeff-davis	4f213f293e	Columnar: use generate_series for test rather than load. (#5181 )	2021-08-16 16:12:06 -07:00
Onur Tirtir	68f46c5dc9	Use scan context for intermediate mem allocs too	2021-08-16 11:06:03 +03:00
Burak Velioglu	4355ba0a38	Add CREATE INDEX ... ON ONLY and ALTER INDEX ... ATTACH PARTITION (#4938 #4980 ) - Add support for CRETE INDEX ... ON ONLY: Before that commit we were not sending "ONLY" option to the worker nodes at all. With this commit, "ONLY" parameter will be sent to the worker nodes if it is necessary. (#4938) - Add support for ALTER INDEX ... ATTACH PARTITION: Attach child_index to parent_index by creating same inheritance on shard level in addition to table level. (#4980)	2021-08-13 13:12:45 +03:00
Ahmet Gedemenli	9e90894f21	Synchronize hasmetadata flag on mx workers (#5086 ) * Synchronize hasmetadata flag on mx workers * Switch to sequential execution * Add test * Use SetWorkerColumn * Add test for stop_sync * Remove usage of UpdateHasmetadataOnWorkersWithMetadata * Remove MarkNodeMetadataSynced * Fix test for metadatasynced * Remove MarkNodeMetadataSynced * Style * Remove MarkNodeHasMetadata * Remove UpdateDistNodeBoolAttr * Refactor SetWorkerColumn * Use SetWorkerColumnLocalOnly when setting up dependencies * Use SetWorkerColumnLocalOnly in TriggerSyncMetadataToPrimaryNodes * Style * Make update command generator functions static * Set metadatasynced before syncing * Call SetWorkerColumn only if the sync is successful * Try to sync all nodes * Fix indexno * Update metadatasynced locally first * Break if a node fails to sync metadata * Send worker commands optional * Style & Rebase * Add raiseOnError param to SetWorkerColumn * Style * Set metadatasynced for all metadata nodes * Style * Introduce SetWorkerColumnOptional * Polish * Style * Dont send set command to not synced metadata nodes * Style * Polish * Add test for stop_sync * Add test for shouldhaveshards * Add test for isactive flag * Sort by placementid in the function verify_metadata * Cover edge cases for failing nodes * Add comments * Add nodeport to isactive test * Add warning if metadata out of sync * Update warning message	2021-08-12 14:16:18 +03:00
Naisila Puka	e5b32b2c3c	Acquire AccessShareLock before updating table statistics (#5155 )	2021-08-12 13:58:15 +03:00
Onder Kalaci	d4368ff2b3	Make sure that shouldhaveshards is synced to workers	2021-08-11 15:53:31 +02:00
Onder Kalaci	5f02d18ef8	transactional metadata sync for maintanince daemon As we use the current user to sync the metadata to the nodes with #5105 (and many other PRs), there is no reason that prevents us to use the coordinated transaction for metadata syncing. This commit also renames few functions to reflect their actual implementation.	2021-08-09 10:34:55 +02:00
Onder Kalaci	35964c6366	Dropped columns do not diverge distribution column for partitioned tables Before this commit, creating a partition after a DROP column on the parent (position before dist. key) was leading to partition to have the wrong distribution column.	2021-08-06 13:36:12 +02:00
naisila	798a7902bf	Fix master_update_table_statistics scripts for 9.5	2021-08-03 18:15:56 +03:00
naisila	f9fa5a3d69	Fix master_update_table_statistics scripts for 9.4	2021-08-03 18:15:56 +03:00
Onder Kalaci	482b8096e9	Introduce citus_internal_update_relation_colocation update_distributed_table_colocation can be called by the relation owner, and internally it updates pg_dist_partition. With this commit, update_distributed_table_colocation uses an internal UDF to access pg_dist_partition. As a result, this operation can now be done by regular users on MX.	2021-08-03 11:44:58 +02:00
Onur Tirtir	93ebbb0607	Re-cost SeqPath's as well for columnar tables	2021-08-02 11:32:25 +03:00
Onur Tirtir	297f59a70e	Re-cost columnar table index paths	2021-08-02 11:16:37 +03:00
Onur Tirtir	73058d35cc	Not free (stripe) chunk buffers after de-serializing Previously, we were only using chunk group reader for sequential scan. However, to support index scans on columnar tables, now we use very same low level functions for index scan too. Since those low-level functions were only used for sequential scan, it was guaranteed that we would never read the same chunk group more than once, so we were freeing chunk buffers after deserializing them into a separate buffer. Now that we use those low level functions for index scan, we cannot free chunk buffers since it's possible to read the same chunk group again, such that: - read chunk group 1 of stripe 5 - read chunk group 2 of stripe 5 - read chunk group 1 of stripe 5 again Here, when we decide to read chunk group 1 for a second time, chunk group 1 is not cached. Plus, before this commit, we were freeing the chunk buffers for chunk group 1 after the first read and then we were getting segfault or errors from low-level de-compression APIs.	2021-08-02 11:00:12 +03:00
Onur Tirtir	83f5d42365	Use long-lasting mem cxt & optimize correlated index scan	2021-08-02 11:00:12 +03:00
Onur Tirtir	84a49cc221	Improve error message for indexAMs not supported by columnar	2021-07-30 16:41:53 +03:00
Onur Tirtir	90e856d6bc	Keep supported indexes when converting table to columnar	2021-07-30 16:41:01 +03:00
SaitTalhaNisanci	4559d02c41	Fix union pushdown issue (#5079 ) * Fix UNION not being pushdown Postgres optimizes column fields that are not needed in the output. We were relying on these fields to understand if it is safe to push down a union query. This fix looks at the parse query, which has the original column fields to detect if it is safe to push down a union query. * Add more tests * Simplify code and make it more robust * Process varlevelsup > 0 in FindReferencedTableColumn * Only look for outers vars in union path * Add more comments * Remove UNION ALL specific logic for pulling up childvars	2021-07-29 13:52:55 +03:00
Jelte Fennema	2aa67421a7	Fix showing target shard size in the rebalance progress monitor (#5136 ) The progress monitor wouldn't actually update the size of the shard on the target node when using "block_writes" as the `shard_transfer_mode`. The reason for this is that the CREATE TABLE part of the shard creation would only be committed once all data was moved as well. This caused our size calculation to always return 0, since the table did not exist yet in the session that the progress monitor used. This is fixed by first committing creation of the table, and only then starting the actual data copy. The test output changes slightly. Apparently splitting this up in two transactions instead of one, increases the table size after the copy by about 40kB. The additional size used doesn't increase when with the amount of data in the table is larger (it stays ~40kB per shard). So this small change in test output is not considered an actual problem.	2021-07-23 16:37:00 +02:00
Jelte Fennema	7d0b6dc9be	Include data_type and cache in sequence definition on workers These two options were not included when creating the sequences on the workers as part of metadata syncing. The missing `data_type` part of the definition made finding the cause of #5126 harder than necessary, because of confusing errors.	2021-07-22 11:49:06 +02:00

1 2 3 4 5 ...

2102 Commits (d5b371b2e0d1b3b354b62c81cd0ab72af3fcafa3)