In the past, we allowed users to manually switch to 1PC
(i.e., one-phase commit). With this commit, we no longer do:
all multi-shard modifications are done via 2PC.
With Citus 9.0, we introduced `citus.single_shard_commit_protocol`, which
defaults to 2PC.
With this commit, we prevent users from setting it to 1PC and drop support
for `citus.single_shard_commit_protocol`.
Although this might add some overhead for users, 2PC is already the default
behaviour (so the overhead is unlikely to be new), and marking placements as
INVALID is much worse.
- citus_get_all_dependencies_for_object: emulates what Citus would qualify
  as a dependency when adding a new node
- citus_get_dependencies_for_object: emulates what Citus would qualify as a
  dependency when creating an object
Example use:
```SQL
-- find all the dependencies of table test
SELECT
    pg_identify_object(t.classid, t.objid, t.objsubid)
FROM
    (SELECT * FROM pg_get_object_address('table', '{test}', '{}')) as addr
JOIN LATERAL
    citus_get_all_dependencies_for_object(addr.classid, addr.objid, addr.objsubid) as t(classid oid, objid oid, objsubid int)
ON TRUE
ORDER BY 1;
```
To run tests in parallel use:
```bash
make check-arbitrary-configs parallel=4
```
To run tests sequentially use:
```bash
make check-arbitrary-configs parallel=1
```
To run only some configs:
```bash
make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig,CitusSmallSharedPoolSizeConfig
```
To run only some test files with some config:
```bash
make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig EXTRA_TESTS=dropped_columns_1
```
To get a deterministic run, you can provide the random seed:
```bash
make check-arbitrary-configs parallel=4 seed=12312
```
The `seed` will be in the output of the run.
In our regular regression tests, we can see all the details about planning and
execution, but this means we need to run the same query under different
configs/cluster setups again and again, which is not really maintainable.
This infrastructure can be used when we don't care about the internals of how
planning/execution is done but about correctness, especially under different
configs.
With `check-arbitrary-configs` target, the following happens:
- a bunch of configs are loaded, which are defined in `config.py`. These configs have different settings such as different shard count, different citus settings, postgres settings, worker amount, or different metadata.
- For each config, a separate data directory is created for tests in `tmp_citus_test` with the config's name.
- For each config, `create_schedule` is run on the coordinator to set up the necessary tables.
- For each config, `sql_schedule` is run: on the coordinator if it is a non-MX cluster, and on either the coordinator or a random worker if it is MX.
- Test results are checked against the expected output.
When test results don't match, you can see the regression diffs in a config's datadir, such as `tmp_citus_tests/dataCitusSingleNodeClusterConfig`.
We also have a PostgresConfig which runs the whole test suite with plain Postgres.
By default configs use a regular user, but we also have a config to run as a superuser.
So the infrastructure tests:
- Postgres vs Citus
- Mx vs Non-Mx
- Superuser vs regular user
- Arbitrary Citus configs
When you want to add a new test, you can add the create statements to `create_schedule` and add the sql queries to `sql_schedule`.
If you are adding Citus UDFs that should be a NO-OP for Postgres, make sure to override the UDFs in `postgres.sql`.
You can add your new config to `config.py`. Make sure to extend either `CitusDefaultClusterConfig` or `CitusMXBaseClusterConfig`.
On CI, upon a failure, all logfiles will be uploaded as artifacts, so you can check the artifacts tab.
All the regressions will be shown as part of the job on CI.
Locally, you can check the regression diffs in a config's datadir, as in `tmp_citus_tests/dataCitusSingleNodeClusterConfig`.
Add/fix tests
Fix creating partitions
Add test for mx - partition creating case
Enable cascading to partitioned tables
Fix mx partition adding test
Fix cascading through fkeys
Style
Disable converting with non-inherited fkeys
Fix detach bug
Early return in case of cascade & Add tests
Style
Fix undistribute_table bug & Fix test outputs
Remove RemovePartitionRelationIds
Test with undistribute_table
Add test for mx+convert+undistribute
Remove redundant usage of CreatePartitionedCitusLocalTable
Add some comments
Introduce bulk functions for generating attach/detach partition commands
Fix: Convert partitioned tables after adding fkey
Change the error message for partitions
Introduce function ErrorIfPartitionTableAddedToMetadata
Polish attach/detach command generation functions
Use time_partitions for testing
Move mx tests to citus_local_tables_mx
Add new partitioned table to cascade test
Add test with time series management UDFs
Fix test output
Fix: Assertion fail on relation access tracking
Style
Refactor creating partitioned citus local tables
Remove CreatePartitionedCitusLocalTable
Style
Error out if converting multi-level table
Revert some old tests
Error out adding partitioned partition
Polish
Polish/address
Fix create table partition of case
Use CascadeOperationForRelationIdList if no cascade needed
Fix create partition bug
Revert / Add new tests to mx
Style
Fix dropping fkey bug
Add test with IF NOT EXISTS
Convert to CLT when doing ATTACH PARTITION
Add comments
Add more tests with time series management
Edit the error message for converting the child
Use OR instead of AND in ErrorIfUnsupportedAlterTableStmt
Edit/improve tests
Disable ddl prop when dropping default column definitions
Disable/enable ddl prop just before/after the command
Add comment
Add sequence test
Add trigger test
Remove NeedCascadeViaForeignKeys
Add one more insert to sequence test
Add comment
Style
Fix test output shard ids
Update comments
Disable creating fkey on partitions
Move partition check to CreateCitusLocalTable
Add comment
Add check for attaching multi-level partition
Add test for pg_constraint
Check pg_dist_partition in tests
Add test inserting on the worker
* Add udf to include shardId in broken partition shard index names
* Address reviews: rename index such that operations can be done on it
* More comprehensive index tests
* Final touches and formatting
A write operation might trigger index deletion if the index already had
dead entries for the key we are about to insert.
There are two ways of index deletion:
a) simple deletion
b) bottom-up deletion (>= pg14)
Since columnar_index_fetch_tuple never sets all_dead to true,
columnarAM doesn't ever expect to receive simple deletion requests
(columnar_index_delete_tuples) as we don't mark any index entries
as dead.
However, since columnarAM doesn't delete any dead entries via simple
deletion, postgres might ask for a more comprehensive deletion
(i.e.: bottom-up) at some point when pg >= 14.
So with this commit, we start gracefully ignoring bottom-up deletion
requests made to columnar_index_delete_tuples.
Given that users can anyway "VACUUM FULL" their columnar tables,
we don't see any problem in ignoring deletion requests.
* Make (columnar.stripe) first_row_number index a unique constraint
Since stripe_first_row_number_idx is required to scan a columnar
table, we need to make sure that it is created before doing anything
with columnar tables during pg upgrades.
However, a plain btree index is not a dependency of a table, so
pg_upgrade cannot guarantee that stripe_first_row_number_idx gets
created when creating columnar.stripe, unless we make it a unique
"constraint".
To do that, drop stripe_first_row_number_idx and create a unique
constraint with the same name to keep the code change to a minimum.
* Add more pg upgrade tests for columnar
* Fix a logic error in upgrade_columnar_after test
Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>
Considering all code paths through which we might interact with a columnar table,
add `CheckCitusVersion` calls to tableAM callbacks:
- initializing table scan (`columnar_beginscan` & `columnar_index_fetch_begin`)
- setting a new filenode for a relation (storage initialization or a table rewrite)
- truncating the storage
- inserting tuple (single and multi)
Also add `CheckCitusVersion` call to:
- drop hook (`ColumnarTableDropHook`)
- `alter_columnar_table_set` & `alter_columnar_table_reset` UDFs
* Columnar: separate plain and exec quals.
Make a clear separation between plain quals, which contain constants
or extern params; and exec quals, which contain exec params and can't
be evaluated until a rescan.
Fixes #5258.
* more vanilla tests
Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>
When performing a partition-wise join, the planner will adjust paths
parameterized by the parent rel to instead parameterize by the child
rel directly. When this reparameterization happens, we also need to
adjust the join quals to reference the child rather than the parent.
Fixes #5257.
Do not flush pending writes if the given tid belongs to a "flushed" or
"aborted" stripe write, or to an "in-progress" stripe write of
another backend.
That way, we reduce the cases where we flush single-tuple
stripes during index scans.
To do that, we follow the steps below for index look-ups:
- Do not flush any pending writes and do a stripe metadata look-up for
  the given tid.
  If a tuple with that tid is found, then there is no need to do another
  look-up since we already found the tuple without needing to flush
  pending writes.
- If the tuple is not found without flushing pending writes, then we have
  two scenarios:
  - If the given tid belongs to a pending write of my backend, then do a
    stripe metadata look-up for the given tid, but this time first
    **flush any pending writes**.
  - Otherwise, just return false from `index_fetch_tuple` since flushing
    pending writes wouldn't help.
With 5825c44d5f, we made the changes to
skip aborted writes when scanning a columnar table.
However, it looks like we forgot to handle such cases for the very first
call made to columnar_getnextslot. That means that commit only
considered the intermediate stripe read operations.
However, functions called by columnar_getnextslot to find first stripe
to read (ColumnarBeginRead & ColumnarRescan) were not caring about
those aborted writes.
To fix that, we teach AdvanceStripeRead to find the very first stripe
to read, and then start using it where we were blindly calling
FindNextStripeByRowNumber.
Previously, for regular table scans, we were setting `RelOptInfo->partial_pathlist`
to `NIL` via `set_rel_pathlist_hook` to discard scan `Path`s that need to use any
parallel workers, and this was working nicely.
However, when building indexes, this hook doesn't get called so we were not
able to prevent spawning parallel workers when building an index. For this
reason, 9b4dc2f804 added basic
implementation for `columnar_parallelscan_*` callbacks but also made some
changes to skip using those workers when building the index.
However, now that we are doing stripe reservation in two stages, we call
`heap_inplace_update` at some point to complete stripe reservation.
However, postgres throws an error if we call `heap_inplace_update` during
a parallel operation, even if we don't actually make use of those workers.
For this reason, with this pr, we make sure to not generate scan `Path`s that
need to use any parallel workers by using `get_relation_info_hook`.
This is indeed useful to prevent spawning parallel workers during index builds.
Even when it is certain that we will not use any `parallel_worker`s for a
columnar table, stripe entries inserted by aborted transactions become
visible to `SnapshotAny` and that causes `REINDEX` to fail by throwing a
duplicate key error.
To fix that:
* consider three states for a stripe write operation:
"flushed", "aborted", or "in-progress",
* make sure to have a clear separation between them, and
* act according to those three states when reading from a columnar table
Since PG14 we can now use binary encoding for arrays and composite types
that contain user defined types. This was fixed in this commit in
Postgres: 670c0a1d47
This change starts using that knowledge, by not necessarily falling back
to text encoding anymore for those types.
While doing this and testing a bit more I found various cases where
binary encoding would fail that our checks didn't cover. This fixes
those cases and adds tests for them. It also fixes EXPLAIN ANALYZE
never using binary encoding, which was a leftover of a workaround that
is no longer necessary.
Finally, it changes the default for both `citus.enable_binary_protocol`
and `citus.binary_worker_copy_format` to `true` for PG14 and up. In our
cloud offering `binary_worker_copy_format` already was true by default.
`enable_binary_protocol` had some bug with MX and user defined types,
this bug was fixed by the above mentioned fixes.
- get_missing_time_partition_ranges: Gets the ranges of missing partitions for the given table, interval and range unless any existing partition conflicts with calculated missing ranges.
- create_time_partitions: Creates partitions by getting range values from get_missing_time_partition_ranges.
- drop_old_time_partitions: Drops partitions of the table older than given threshold.
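A hedged usage sketch of these UDFs (the table name, interval, and thresholds are invented, and the exact argument names and the function/procedure split are assumptions):
```sql
-- a partitioned table managed by the time-partitioning UDFs above
CREATE TABLE events (event_at timestamptz NOT NULL, payload jsonb)
    PARTITION BY RANGE (event_at);

-- create any missing daily partitions up to one week from now
-- (internally this uses get_missing_time_partition_ranges)
SELECT create_time_partitions(
    table_name         := 'events',
    partition_interval := '1 day',
    end_at             := now() + interval '7 days');

-- drop partitions whose range lies entirely before the given threshold
CALL drop_old_time_partitions('events', now() - interval '90 days');
```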
* Rename RecostColumnarPaths to CostColumnarPaths
* Rename RecostColumnarIndexPath to CostColumnarIndexPath
* Reorder args of CostColumnarScan to align with other two costing functions
* Not adjust index scan start-up cost
* Rename ColumnarIndexScanAddTotalCost to ColumnarIndexScanAdditionalCost
* Reflect that index scan will at least read one stripe in totalCost calculation
* Organize declarations in columnar_customscan.c
In PG 14, procedures can have OUT parameters. In Citus' procedure
delegation framework, we need to adjust the function expression
to get the outargs parameters.
Relevant PG change:
e56bce5d43
It seems like there is a problem with Postgres14 with SELECT DISTINCT
COUNT. The issue is reported to Postgres and an alternative output is
added. We can remove the alternative output when the issue is fixed on
PG. If this is not an issue on PG (which is unlikely), we should consider
some other solution.
In order to avoid adding an alternative output, a function to check whether a
given explain plan has a single task is added. This doesn't change what the
changed tests intend to do.
Postgres changed stats expression types as of PG14. Hence we needed to
write the AppendColumnNames method. They also removed the error on the PG
side, so we remove it as well.
Relevant commits on pg14:
a4d75c86bf15220df22de0a92c819ecef9db3849
388e75ad33489b77cfb9a8590a91e9287d8fb960
There was a small part in multi_partitioning that would need an
alternative output for pg14. Instead of adding an alternative for the
whole file, we created a new file, called partition_wise_join.sql and
added the alternative output for that.
Checking the exact version of the seg extension becomes a problem when its
version changes, such as from 1.3 to 1.4. So we modified the check to verify
that the version is the same across the whole cluster.
Postgres tightened up its checks for invalid GUC names hence we started
to get an alternative output for one of our tests. We add an alternative
output since the file is relatively small.
Commit on PG:
3db826bd55cd1df0dd8c3d811f8e5b936d7ba1e4
Allow ColumnarScans to push down join quals by generating
parameterized paths. This significantly expands the utility of chunk
group filtering, making a ColumnarScan behave similar to an index when
on the inner of a nested loop join.
Also, evaluate all parameters on beginscan/rescan, which also works
for external parameters.
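A hedged illustration of the kind of plan this enables (table and column names are invented):
```sql
CREATE TABLE dim (id int, val int);
CREATE TABLE fact (id int, payload text) USING columnar;

-- With parameterized paths, the join qual fact.id = dim.id can be pushed
-- into the ColumnarScan on the inner side of a nested loop, so chunk group
-- filtering can skip chunks whose min/max range excludes the parameter value.
EXPLAIN (COSTS OFF)
SELECT * FROM dim JOIN fact ON fact.id = dim.id;
```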
Fixes #4488.
Before starting to scan a columnar table, we always flush the pending
writes to disk.
However, we increment command counter after modifying metadata tables.
On the other hand, now that we _don't always use_ xact snapshot to scan
a columnar table, writes that we just flushed might not be visible to
the query that just flushed pending writes to disk, since the curcid of the
provided snapshot would be smaller than the command id being used
when modifying metadata tables.
To give an example, before this change, below was a possible scenario
due to the changes that we made to use the correct snapshot.
```sql
CREATE TABLE t(a int, b int) USING columnar;
BEGIN;
INSERT INTO t VALUES (5, 10);
SELECT * FROM t;
┌───┬───┐
│ a │ b │
├───┼───┤
└───┴───┘
(0 rows)
SELECT * FROM t;
┌───┬────┐
│ a │ b │
├───┼────┤
│ 5 │ 10 │
└───┴────┘
(1 row)
```
The logging of the amount of ignored moves crashed when no distributed
tables existed in a cluster. In passing, this also fixes the logging of
ignored moves so that it reports the correct number of ignored moves when
there are multiple colocation groups and all are rebalanced at the same time.
In our testing infrastructure, even though we use pinned versions of postgres, the auxiliary libraries might pull in newer versions. This is for example the case for libpq, which will now use the libpq libraries from 14beta3.
The changes in this PR are a lot due to the libpq changes.
We also have changed the citus version that is used as a base for the citus upgrades, from 10.0 to 10.1. This caused columnar to enforce some extra limits on the settings, which conflicted with our upgrade tests.
The changes in failure tests are due to the libpq changes.
There are also a lot of changes on isolation tests outputs, hence we
updated all of them.
Co-authored-by: Nils Dijk <nils@citusdata.com>
* We were not testing reserved_offset in any of those tests anyway, only
other fields.
* This only happens with compressed columnar tables and is because the
libzstd/liblz4 versions that we have on the exttester CI image might be different
from what we have in our local environments.
- Add support for CREATE INDEX ... ON ONLY: Before this commit, we were not sending the "ONLY" option to the worker nodes at all. With this commit, the "ONLY" parameter is sent to the worker nodes when necessary. (#4938)
- Add support for ALTER INDEX ... ATTACH PARTITION: Attach child_index to parent_index by creating the same inheritance on the shard level in addition to the table level. (#4980)
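A hedged sketch of both commands on a hypothetical partitioned, distributed table:
```sql
CREATE TABLE orders (order_id bigint, placed_at date) PARTITION BY RANGE (placed_at);
CREATE TABLE orders_2021 PARTITION OF orders
    FOR VALUES FROM ('2021-01-01') TO ('2022-01-01');
SELECT create_distributed_table('orders', 'order_id');

-- create the parent index without building it on the partitions ...
CREATE INDEX orders_placed_at_idx ON ONLY orders (placed_at);
-- ... then build the partition's index and attach it to the parent index
CREATE INDEX orders_2021_placed_at_idx ON orders_2021 (placed_at);
ALTER INDEX orders_placed_at_idx ATTACH PARTITION orders_2021_placed_at_idx;
```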
* Synchronize hasmetadata flag on mx workers
* Switch to sequential execution
* Add test
* Use SetWorkerColumn
* Add test for stop_sync
* Remove usage of UpdateHasmetadataOnWorkersWithMetadata
* Remove MarkNodeMetadataSynced
* Fix test for metadatasynced
* Remove MarkNodeMetadataSynced
* Style
* Remove MarkNodeHasMetadata
* Remove UpdateDistNodeBoolAttr
* Refactor SetWorkerColumn
* Use SetWorkerColumnLocalOnly when setting up dependencies
* Use SetWorkerColumnLocalOnly in TriggerSyncMetadataToPrimaryNodes
* Style
* Make update command generator functions static
* Set metadatasynced before syncing
* Call SetWorkerColumn only if the sync is successful
* Try to sync all nodes
* Fix indexno
* Update metadatasynced locally first
* Break if a node fails to sync metadata
* Send worker commands optional
* Style & Rebase
* Add raiseOnError param to SetWorkerColumn
* Style
* Set metadatasynced for all metadata nodes
* Style
* Introduce SetWorkerColumnOptional
* Polish
* Style
* Dont send set command to not synced metadata nodes
* Style
* Polish
* Add test for stop_sync
* Add test for shouldhaveshards
* Add test for isactive flag
* Sort by placementid in the function verify_metadata
* Cover edge cases for failing nodes
* Add comments
* Add nodeport to isactive test
* Add warning if metadata out of sync
* Update warning message
As we use the current user to sync the metadata to the nodes
with #5105 (and many other PRs), there is no reason that
prevents us from using the coordinated transaction for metadata syncing.
This commit also renames few functions to reflect their actual
implementation.
Before this commit, creating a partition after a DROP column
on the parent (at a position before the dist. key) was leading to
the partition having the wrong distribution column.
update_distributed_table_colocation can be called by the relation
owner, and internally it updates pg_dist_partition. With this
commit, update_distributed_table_colocation uses an internal
UDF to access pg_dist_partition.
As a result, this operation can now be done by regular users
on MX.
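For example, a (non-superuser) table owner can now run something along these lines (table names are invented, and the `colocate_with` argument name is an assumption):
```sql
-- colocate my_events with my_users, or break the colocation entirely
SELECT update_distributed_table_colocation('my_events', colocate_with => 'my_users');
SELECT update_distributed_table_colocation('my_events', colocate_with => 'none');
```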
Previously, we were only using chunk group reader for sequential scan.
However, to support index scans on columnar tables, now we use very
same low level functions for index scan too.
Since those low-level functions were only used for sequential scan, it
was guaranteed that we would never read the same chunk group more than
once, so we were freeing chunk buffers after deserializing them into a
separate buffer.
Now that we use those low level functions for index scan, we cannot
free chunk buffers since it's possible to read the same chunk group
again, such that:
- read chunk group 1 of stripe 5
- read chunk group 2 of stripe 5
- read chunk group 1 of stripe 5 again
Here, when we decide to read chunk group 1 for a second time,
chunk group 1 is not cached. Plus, before this commit, we were
freeing the chunk buffers for chunk group 1 after the first
read and then we were getting segfault or errors from low-level
de-compression APIs.
* Fix UNION not being pushed down
Postgres optimizes column fields that are not needed in the output. We
were relying on these fields to understand if it is safe to push down a
union query.
This fix looks at the parse query, which has the original column fields
to detect if it is safe to push down a union query.
* Add more tests
* Simplify code and make it more robust
* Process varlevelsup > 0 in FindReferencedTableColumn
* Only look for outers vars in union path
* Add more comments
* Remove UNION ALL specific logic for pulling up childvars
The progress monitor wouldn't actually update the size of the shard on
the target node when using "block_writes" as the `shard_transfer_mode`.
The reason for this is that the CREATE TABLE part of the shard creation
would only be committed once all data was moved as well. This caused
our size calculation to always return 0, since the table did not exist
yet in the session that the progress monitor used.
This is fixed by first committing creation of the table, and only then
starting the actual data copy.
The test output changes slightly. Apparently, splitting this up into two
transactions instead of one increases the table size after the copy by
about 40kB. The additional size used doesn't increase when the
amount of data in the table is larger (it stays ~40kB per shard). So
this small change in test output is not considered an actual problem.
These two options were not included when creating the sequences on the
workers as part of metadata syncing.
The missing `data_type` part of the definition made finding the cause
of #5126 harder than necessary, because of confusing errors.
Before this commit, we always synced the metadata with superuser.
However, that creates various edge cases such as visibility errors
or self distributed deadlocks or complicates user access checks.
Instead, with this commit, we use the current user to sync the metadata.
Note that `start_metadata_sync_to_node` still requires superuser
because accessing certain metadata (like pg_dist_node) always requires
superuser (i.e., the current user should be a superuser).
However, metadata syncing operations regarding the distributed
tables can now be done with regular users, as long as the user
is the owner of the table. A table owner can still insert nonsensical
metadata; however, it'd only affect their own table. So, we cannot do
anything about that.
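For reference, a minimal sketch (hostname and port are invented); only the call below still needs superuser, while table-level metadata changes are then propagated as the table owner:
```sql
-- must be run by a superuser because it accesses pg_dist_node
SELECT start_metadata_sync_to_node('worker-1.example.com', 5432);
```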
With this commit, we add (`CREATE INDEX` / `REINDEX`) `CONCURRENTLY` support for columnar tables.
For that, we implement `columnar_index_validate_scan` callback.
The reasoning behind the implementation is as follows:
* Postgres function `validate_index` provides all the TIDs that are currently in the
index to the `columnar_index_validate_scan` callback via a `tupleSort` object.
* We start scanning the table by using `columnar_getnextslot` as usual.
Before moving forward, note that `columnar_getnextslot` guarantees
to return tuples in the order of their TIDs.
* For us to use during table scan, postgres provides a snapshot guaranteeing
that any tuples that are valid according to that snapshot but are not in the
index must be added to the index.
* Then for each tuple that we read from our table, we continue iterating
given `tupleSort` to find the first TID that is greater than or equal to our
tuple's TID.
If both TID's are equal to each other, then we skip the tuple since it's already
indexed.
If the TID that we read from tupleSort is greater than our tuple's TID, then
we decide to insert this tuple into the index.
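A hedged example of what this enables (table and index names are invented):
```sql
CREATE TABLE measurements (ts timestamptz, value double precision) USING columnar;

-- both of these now go through columnar_index_validate_scan
CREATE INDEX CONCURRENTLY measurements_ts_idx ON measurements (ts);
REINDEX INDEX CONCURRENTLY measurements_ts_idx;
```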
This happens only when we have a "<" or "<=" filter on distribution
column of a range distributed table and that filter falls in between
two shards.
When the filter falls in between two shards:
If the filter is ">" or ">=", then UpperShardBoundary was
returning "upperBoundIndex - 1", where upperBoundIndex is
exclusive shard index used during binary search.
This is expected since upperBoundIndex is an exclusive
index.
If the filter is "<" or "<=", then LowerShardBoundary was
returning "lowerBoundIndex + 1", where lowerBoundIndex is
inclusive shard index used during binary search.
On the other hand, since lowerBoundIndex is an inclusive
index, we should just return lowerBoundIndex instead of
doing "+ 1". Before this commit, we were missing leftmost
shard in such queries.
* Remove useless conditional branches
The branch that we delete from UpperShardBoundary was obviously useless.
The other one in LowerShardBoundary became useless after we remove "+ 1"
from there.
This is indeed another proof of what we are fixing with this PR, and how.
* Improve comments and add more
* Add some tests for upper bound calculation too
* Add parameter to cleanup metadata
* Set clear metadata default to true
* Add test for clearing metadata
* Separate test file for start/stop metadata syncing
* Fix stop_sync bug for secondary nodes
* Use PreventInTransactionBlock
* Remove debugging logs
* Remove relation not found logs from mx test
* Revert localGroupId when doing stop_sync
* Move metadata sync test to mx schedule
* Add test with name that needs to be quoted
* Add test for views and matviews
* Add test for distributed table with custom type
* Add comments to test
* Add test with stats, indexes and constraints
* Fix matview test
* Add test for dropped column
* Add notice messages to stop_metadata_sync
* Add coordinator check to stop metadata sync
* Revert local_group_id only if clearMetadata is true
* Add a final check to see the metadata is sane
* Remove the drop verbosity in test
* Remove table description tests from sync test
* Add stop sync to coordinator test
* Change the order in stop_sync
* Add test for hybrid (columnar+heap) partitioned table
* Change error to notice for stop sync to coordinator
* Sync at the end of the test to prevent any failures
* Add test case in a transaction block
* Remove relation not found tests
Ignore orphaned shards in more places
Only use active shard placements in RouterInsertTaskList
Use IncludingOrphanedPlacements in some more places
Fix comment
Add tests
* Alter seq type when we first use the seq in a dist table
* Don't allow type changes when seq is used in dist table
* ALTER SEQUENCE propagation
* Tests for ALTER SEQUENCE propagation
* Relocate AlterSequenceType and ensure dependencies for sequence
* Support for citus local tables, and other fixes
* Final formatting
Moving shards of reference tables was possible in at least one case:
```sql
select citus_disable_node('localhost', 9702);
create table r(x int);
select create_reference_table('r');
set citus.replicate_reference_tables_on_activate = off;
select citus_activate_node('localhost', 9702);
select citus_move_shard_placement(102008, 'localhost', 9701, 'localhost', 9702);
```
This would then remove the reference table shard on the source, causing
all kinds of issues. This fixes that by disallowing all shard moves
except for shards of distributed tables.
Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>
With a recent commit (644b266dee), the behaviour of prepared statements
for local cached plans has slightly changed.
Now, Citus caches the plans when they are re-used. This makes local cached
plans trigger on the 7th execution, so the 8th execution is the first time
the plan is used from the cache.
So, the tests are improved to cover the 8th execution.
With local query caching, we try to avoid deparse/parse stages as the
operation is too costly.
However, we can do deparse/parse operations once per cached query, right
before we put the plan into the cache. With that, we avoid edge
cases like (4239) or (5038).
In a sense, we are making the local plan caching behave similar for non-cached
local/remote queries, by forcing to deparse the query once.
A shard move would fail if there was an orphaned version of the shard on
the target node. With this change, before actually failing, we try to clean
up orphaned shards to see if that fixes the issue.
Sometimes the background daemon doesn't cleanup orphaned shards quickly
enough. It's useful to have a UDF to trigger this removal when needed.
We already had a UDF like this but it was only used during testing. This
exposes that UDF to users. As a safety measure it cannot be run in a
transaction, because that would cause the background daemon to stop
cleaning up shards while this transaction is running.
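Assuming the exposed UDF is the orphaned-shard cleanup procedure `citus_cleanup_orphaned_shards` (the name is an assumption here), usage would look roughly like:
```sql
-- must be run outside an explicit transaction block
CALL citus_cleanup_orphaned_shards();
```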
* Add user-defined sequence support for MX
* Remove default part when propagating to workers
* Fix ALTER TABLE with sequences for mx tables
* Clean up and add tests
* Propagate DROP SEQUENCE
* Removing function parts
* Propagate ALTER SEQUENCE
* Change sequence type before propagation & cleanup
* Revert "Propagate ALTER SEQUENCE"
This reverts commit 2bef64c5a29f4e7224a7f43b43b88e0133c65159.
* Ensure sequence is not used in a different column with different type
* Insert select tests
* Propagate rename sequence stmt
* Fix issue with group ID cache invalidation
* Add ALTER TABLE ALTER COLUMN TYPE .. precaution
* Fix attnum inconsistency and add various tests
* Add ALTER SEQUENCE precaution
* Remove Citus hook
* More tests
Co-authored-by: Marco Slot <marco.slot@gmail.com>
We have a slightly different behavior when using truncate_local_data_after_distributing_table UDF on metadata synced clusters. This PR aims to add tests to cover such cases.
We allow distributing tables with data that have foreign keys to reference tables only on metadata synced clusters. This is the reason why some of my earlier tests failed when run on a single node Citus cluster.
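For context, a hedged sketch of the UDF being tested (table name is invented):
```sql
CREATE TABLE orders (order_id bigint PRIMARY KEY, total numeric);
-- pre-existing local rows stay on the coordinator after distribution ...
SELECT create_distributed_table('orders', 'order_id');
-- ... until the now-redundant local copy is truncated explicitly
SELECT truncate_local_data_after_distributing_table('orders');
```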
Without this change the rebalancer progress monitor gets the shard sizes
from the `shardlength` column in `pg_dist_placement`. This column needs to
be updated manually by calling `citus_update_table_statistics`.
However, `citus_update_table_statistics` could lead to distributed
deadlocks while database traffic is on-going (see #4752).
To work around this we don't use `shardlength` column anymore. Instead
for every rebalance we now fetch all shard sizes on the fly.
Three additional things this does are:
1. It adds tests for the rebalance progress function.
2. If a shard move cannot be done because a source or target node is
unreachable, then we error and stop the rebalance, instead of showing
a warning and continuing. When using the by_disk_size rebalance
strategy it's not safe to continue with other moves if a specific
move failed. It's possible that the failed move made space for the
next move, and because the failed move never happened this space now
does not exist.
3. Adds two new columns to the result of `get_rebalancer_progress` which
shows the size of the shard on the source and target node.
Fixes #4930
DESCRIPTION: Add support for ALTER DATABASE OWNER
This adds support for changing the database owner. It achieves this by marking the database as a distributed object. By marking the database as a distributed object it will look for its dependencies and order the user creation commands (enterprise only) before the alter of the database owner. This is mostly important when adding new nodes.
By having the database marked as a distributed object it can easily understand for which `ALTER DATABASE ... OWNER TO ...` commands to propagate by resolving the object address of the database and verifying it is a distributed object, and hence should propagate changes of ownership to all workers.
Given the ownership of the database might have implications on subsequent commands in transactions we force sequential mode for transactions that have a `ALTER DATABASE ... OWNER TO ...` command in them. This will fail the transaction with meaningful help when the transaction already executed parallel statements.
By default the feature is turned off since roles are not automatically propagated; having it turned on would cause hard-to-understand errors for the user. It can be turned on by the user by setting `citus.enable_alter_database_owner`.
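A hedged example (database and role names are invented):
```sql
-- opt in explicitly, since roles are not propagated automatically
SET citus.enable_alter_database_owner TO on;
ALTER DATABASE app_db OWNER TO app_admin;  -- now also propagated to the workers
```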
We often change result types of functions slightly. Our downgrade tests
wouldn't notice these changes. This change adds them to the description
of these items.
An example of an SQL change that isn't caught without this change and is
caught with the get_rebalance_progress change in this PR:
https://github.com/citusdata/citus/pull/4963
DESCRIPTION: introduce `citus.local_hostname` GUC for connections to the current node
Citus once in a while needs to connect to itself for some system operations. This used to be hardcoded to `localhost`. The hardcoded hostname causes some issues, for example in environments where `sslmode=verify-full` is required. It is not always desirable or even feasible to get `localhost` as an alt name on the certificate.
By introducing a GUC to use when connecting to the current instance, the user has more control over what network path is used and what hostname is required to be present in the server certificate.
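A hedged example (hostname invented) of pointing self-connections at a name that is present in the server certificate:
```sql
ALTER SYSTEM SET citus.local_hostname TO 'coordinator.internal.example.com';
SELECT pg_reload_conf();
```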
Every move in the rebalancer algorithm results in an improvement in the
balance. However, even if the improvement in the balance was very small
the move was still chosen. This is especially problematic if the shard
itself is very big and the move will take a long time.
This changes the rebalancer algorithm to take the relative size of the
balance improvement into account when choosing moves. By default a move
will not be chosen if it improves the balance by less than half of the
size of the shard. An extra argument is added to the rebalancer
functions so that the user can decide to lower the default threshold if
the ignored move is wanted anyway.
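Assuming the new argument is exposed as `improvement_threshold` (the name is an assumption), lowering it might look like this:
```sql
-- accept any move that improves the balance, as before this change
SELECT rebalance_table_shards(rebalance_strategy := 'by_disk_size',
                              improvement_threshold := 0);
```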
* Columnar: introduce columnar storage API.
This new API is responsible for the low-level storage details of
columnar; translating large reads and writes into individual block
reads and writes that respect the page headers and emit WAL. It's also
responsible for the columnar metapage, resource reservations (stripe
IDs, row numbers, and data), and truncation.
This new API is not used yet, but will be used in subsequent
forthcoming commits.
* Columnar: add columnar_storage_info() for debugging purposes.
* Columnar: expose ColumnarMetadataNewStorageId().
* Columnar: always initialize metapage at creation time.
This avoids the complexity of dealing with tables where the metapage
has not yet been initialized.
* Columnar: columnar storage upgrade/downgrade UDFs.
Necessary upgrade/downgrade step so that new code doesn't see an old
metapage.
* Columnar: improve metadata.c comment.
* Columnar: make ColumnarMetapage internal to the storage API.
Callers should not have or need direct access to the metapage.
* Columnar: perform resource reservation using storage API.
* Columnar: implement truncate using storage API.
* Columnar: implement read/write paths with storage API.
* Columnar: add storage tests.
* Revert "Columnar: don't include stripe reservation locks in lock graph."
This reverts commit c3dcd6b9f8.
No longer needed because the columnar storage API takes care of
concurrency for resource reservation.
* Columnar: remove unnecessary lock when reserving.
No longer necessary because the columnar storage API takes care of
concurrent resource reservation.
* Add simple upgrade tests for storage/ branch
* fix multi_extension.out
Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>
* When moving a shard to a new node ensure there is enough space
* Add WaitForMiliseconds time utility
* Add more tests and increase readability
* Remove the retry loop and use a single udf for disk stats
* Address review
* address review
Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>
As long as the VALUES clause contains constant values, we should not
recursively plan the queries/CTEs.
This is a follow-up work of #1805. So, we can easily apply OUTER join
checks as if VALUES clause is a reference table/immutable function.
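A hedged example of the query shape this enables (table and column names are invented):
```sql
-- the VALUES list contains only constants, so it is treated like a
-- reference table / immutable function and is not planned recursively
SELECT u.user_id, v.label
FROM users u
LEFT JOIN (VALUES (1, 'gold'), (2, 'silver')) AS v (user_id, label)
    ON u.user_id = v.user_id;
```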
This commit adds support for long partition names for distributed tables:
- ALTER TABLE dist_table ATTACH PARTITION ..
- CREATE TABLE .. PARTITION OF dist_table ..
Note: create_distributed_table UDF does not support long table and
partition names, and is not covered in this commit
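A hedged sketch (names invented, partition names kept just under the 63-byte limit):
```sql
CREATE TABLE dist_events (event_id bigint, created_at date) PARTITION BY RANGE (created_at);
SELECT create_distributed_table('dist_events', 'event_id');

-- CREATE TABLE ... PARTITION OF with a long partition name
CREATE TABLE dist_events_partition_with_a_very_very_very_long_name_2021_01
    PARTITION OF dist_events FOR VALUES FROM ('2021-01-01') TO ('2021-02-01');

-- ALTER TABLE ... ATTACH PARTITION with a long partition name
CREATE TABLE dist_events_partition_with_another_very_very_long_name_2021_02
    (event_id bigint, created_at date);
ALTER TABLE dist_events
    ATTACH PARTITION dist_events_partition_with_another_very_very_long_name_2021_02
    FOR VALUES FROM ('2021-02-01') TO ('2021-03-01');
```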
* Introduce 3 partitioned size udfs
* Add tests for new partition size udfs
* Fix type incompatibilities
* Convert UDFs into pure sql functions
* Fix function comment
Earlier versions of Citus (pre 9.0) had a bug where a user was able to get into a situation where a foreign key between two non-colocated tables was allowed. This was caused by wrongful scoping together with only ever setting a boolean variable to on inside a loop, causing the `true` from an earlier iteration to leak into a new iteration.
This was 'by accident' solved in a refactor that was executed in the preparation of the 9.0 release. Only recently we had a user running into this and it was tracked down to this behaviour.
Given the dire situation a user could get themselves into when running into this bug, we have backported a fix to the latest 8.3 release branch.
To make sure this regression does not happen anymore in the future I propose we add the tests from the backport to our mainline.
For reference: https://github.com/citusdata/citus/pull/4840
Because setting the flag doesn't necessarily mean that we'll
use 2PC. If connections are read-only, we will not use 2PC.
In other words, we'll use 2PC only for connections that modified
any placements.
Before this commit, Citus used 2PC no matter what kind of
local query execution happened.
For example, if the coordinator has shards (and the workers as well),
even a simple SELECT query could start 2PC:
```SQL
WITH cte_1 AS (SELECT * FROM test LIMIT 10) SELECT count(*) FROM cte_1;
```
In this query, the local execution of the shards (and also intermediate
result reads) triggers the 2PC.
To prevent that, Citus now distinguishes local reads and local writes.
And, Citus switches to 2PC only if a modification happens. This may
still lead to unnecessary 2PCs when there is a local modification
and remote SELECTs only. Though, we handle that separately
via #4587.
Postgres keeps AFTER trigger state for each transaction, because we can have deferred AFTER triggers which will be fired at the end of a transaction. Postgres cleans up this state at the end of transaction.
Postgres processes ON COMMIT triggers after cleaning-up the AFTER trigger states. So if we fire any triggers in ON COMMIT, the AFTER trigger state won't be cleaned-up properly and the transaction state will be left in an inconsistent state, which might result in assertion failure.
So with this commit, we remove foreign keys between columnar metadata tables and enforce constraints between them manually when dropping columnar tables.
* Skip 2PC for readonly connections in a transaction
* Use ConnectionModifiedPlacement() function
* Remove the second check of ConnectionModifiedPlacement()
* Add order by to prevent flaky output
* Test using pg_dist_transaction
With this commit, we make sure to prevent infinite recursion for queries
in the format: [subquery with a UNION ALL] JOIN [table or subquery]
Also fixes a bug where we would push down UNION ALL below a JOIN even if the
UNION ALL is not safe to push down.
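A hedged illustration of the query shape described above (names invented):
```sql
-- [subquery with a UNION ALL] JOIN [table]
SELECT count(*)
FROM (
    SELECT user_id FROM events_2020
    UNION ALL
    SELECT user_id FROM events_2021
) e
JOIN users u ON u.user_id = e.user_id;
```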
* Reimplement citus_update_table_statistics
* Update stats for the given table not colocation group
* Add tests for reimplemented citus_update_table_statistics
* Use coordinated transaction, merge with citus_shard_sizes functions
* Update the old master_update_table_statistics as well
* Use translated vars in postgres 13 as well
Postgres 13 removed translated vars, so we had special logic
for pg 13. However, it had a bug, so now we copy the translated vars
before postgres deletes them. This also simplifies the logic.
* fix rtoffset with pg >= 13
/*
* The physical planner assumes that all worker queries would have
* target list entries based on the fact that at least the column
* on the JOINs have to be on the target list. However, there is
* an exception to that if there is a cartesian product join and
* there is no additional target list entries belong to one side
* of the JOIN. Once we support cartesian product join, we should
* remove this error.
*/
When executing alter_table / undistribute_table UDFs, we should not try
to change sequence dependencies on MX workers if the new table wouldn't
require syncing metadata.
Previously, we were checking that for input table. But in some cases, the
fact that input table requires syncing metadata doesn't imply the same
for resulting table (e.g when undistributing a Citus table).
Even more, doing that was giving an unexpected error when undistributing
a Citus table so this commit actually fixes that.
It seems that we need to consider only pseudo constants while doing some
shortcuts in planning. For example, there could be a false clause, but it
can still contribute to the result, in which case it will not be a pseudo
constant.
We would exclude tables without relationRestriction from conversion
candidates in local-distributed table joins. This could leave a leftover
local table which should have been converted to a subquery.
Ideally I would expect that in each call to CreateDistributedPlan we
would pass a new plan id, but that seems like a bigger change.
/*
* Colocated intermediate results are just files and not required to use
* the same connections with their co-located shards. So, we are free to
* use any connection we can get.
*
* Also, the current connection re-use logic does not know how to handle
* intermediate results as the intermediate results always truncates the
* existing files. That's why, we use one connection per intermediate
* result.
*/
We do not include dummy column if original task didn't return any
columns.
Otherwise, number of columns that original task returned wouldn't
match number of columns returned by worker_save_query_explain_analyze.
When COPY is used for copying into co-located files, it was
not allowed to use local execution. The primary reason was
Citus treating co-located intermediate results as co-located
shards, and COPY into the distributed table was done via
"format result". And, local execution of such COPY commands
was not implemented.
With this change, we implement support for local execution with
"format result". To do that, we use the buffer for every file
on shardState->copyOutState, similar to how local copy on
shards are implemented. In fact, the logic is similar to
local copy on shards, but instead of writing to the shards,
Citus writes the results to a file.
The logic relies on LOCAL_COPY_FLUSH_THRESHOLD, and flushes
only when the size exceeds the threshold. But, unlike local
copy on shards, in this case we write the headers and footers
just once.
With #4338, the executor is smart enough to failover to
local node if there is not enough space in max_connections
for remote connections.
For COPY, the logic is different. With #4034, we made COPY
work with the adaptive connection management slightly
differently. The cause of the difference is that COPY doesn't
know which placements are going to be accessed hence requires
to get connections up-front.
Similarly, COPY decides to use local execution up-front.
With this commit, we change the logic for COPY on local nodes:
Try to reserve a connection to local host. This logic follows
the same logic (e.g., citus.local_shared_pool_size) as the
executor because COPY also relies on TryToIncrementSharedConnectionCounter().
If reservation to local node fails, switch to local execution
Apart from this, if local execution is disabled, we follow the
exact same logic for multi-node Citus. It means that if we are
out of the connection, we'd give an error.
It seems that we were not considering the case where coordinator was
added to the cluster as a worker in the optimization of intermediate
results.
This could lead to errors when coordinator was added as a worker.
pg_get_tableschemadef_string doesn't know how to deparse identity
columns, so we cannot reflect those columns when creating a table
from scratch. For this reason, we don't allow using alter_table UDFs
with tables having any identity cols.
pg_get_tableschemadef_string doesn't know how to deparse identity
columns, so we cannot reflect those columns when creating the shell
relation.
For this reason, we don't allow adding local tables (having identity cols)
to metadata.
Postgres doesn't allow inserting into columns having GENERATED ALWAYS
AS (...) STORED expressions.
For this reason, when executing undistribute_table or an alter_* udf,
we should skip copying such columns.
This is not bad since Postgres would already generate such columns.
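A hedged sketch of a table for which the generated column is skipped during the copy (names invented):
```sql
CREATE TABLE shapes (
    shape_id int,
    width    int,
    height   int,
    area     int GENERATED ALWAYS AS (width * height) STORED
);
SELECT create_distributed_table('shapes', 'shape_id');
-- "area" is not copied explicitly; Postgres regenerates it on the new relation
SELECT undistribute_table('shapes');
```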
Enables an overall plan to be parallel (e.g. over a partition
hierarchy), even though an individual ColumnarScan is not
parallel-aware.
Co-authored-by: Jeff Davis <jefdavi@microsoft.com>
Previously, if columnar.enable_custom_scan was false, parallel paths
could remain, leading to an unexpected error.
Also, ensure that cheapest_parameterized_paths is cleared if a custom
scan is used.
Co-authored-by: Jeff Davis <jefdavi@microsoft.com>
When finding columns owning sequences, we shouldn't rely on atthasdef
since it might be true when column has GENERATED ALWAYS AS (...)
STORED expression.
* Fix partition column index issue
We send column names to worker_hash/range_partition_table methods, and
in these methods we check the column name index from tuple descriptor.
Then this index is used to decide the bucket that the current row will
be sent for the repartition.
This becomes a problem when there are the same column names in the
tupleDescriptor. Then we can choose the wrong index. Hence the
partitioned data will be put to wrong workers. Then the result could
miss some data because workers might contain different range of data.
An example:
TupleDescriptor contains "trip_id", "car_id", "car_id" for one table.
It contains only "car_id" for the other table. And assuming that the
tables will be partitioned by car_id, it is not certain what should be
used for deciding the bucket number for the first table. Assuming value
2 goes to bucket 2 and value 3 goes to bucket 3, it is not certain which
bucket "1 2 3" (trip_id, car_id, car_id) row will go to.
As a solution we send the index of partition column in targetList
instead of the column name.
The old API is kept so that things still keep working while workers are being
upgraded (though the old API will have the same bug).
* Use the same method so that backporting is easier
Fixing a division by zero in the cost calculations for scanning a columnar table.
Due to how the columns in a columnar table are counted, an empty table would result in a division by zero. Instead, this patch keeps the column selection ratio at zero when this happens, resulting in an accurate cost of zero pages to scan a columnar table.
fixes #4589