Commit Graph

2003 Commits (10603ed5d45ad435b3d3122146f67b7de95b4d06)

Author SHA1 Message Date
Jelte Fennema 5c0205ce10
Fix flakiness in multi_replicate_reference_table (#6235)
In CI multi_replicate_reference_table would sometimes fail like this:

```diff
 -- detects correctly that referecence table doesn't have replica identity
 SELECT replicate_reference_tables();
-ERROR:  cannot use logical replication to transfer shards of the relation initially_not_replicated_reference_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY
+ERROR:  cannot use logical replication to transfer shards of the relation ref_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY
 DETAIL:  UPDATE and DELETE commands on the shard will error out during logical replication unless there is a REPLICA IDENTITY or PRIMARY KEY.
 HINT:  If you wish to continue without a replica identity set the shard_transfer_mode to 'force_logical' or 'block_writes'.
```

Because `CitusTableTypeIdList` returns tables in heap order, it's
a bit random which one comes first in the list. The test contained
multiple tables that didn't have a primary key or replica identity, so
the error could be reported for either one of these tables.
This PR makes the test output consistent by changing one of the tables
to have a primary key.
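A minimal sketch of this kind of fix; the table picked and its column name `a` are assumptions here, not taken from the actual patch:

```sql
-- Give one of the two tables a primary key so that only
-- initially_not_replicated_reference_table can still trigger the
-- REPLICA IDENTITY error, making the test output deterministic.
ALTER TABLE ref_table ADD PRIMARY KEY (a);
```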

Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26387/workflows/fc3196e7-ddf2-4000-a70b-5ac71c836321/jobs/748940
2022-08-24 13:34:10 +03:00
aykut-bozkurt 041f88d7bf
Revert "Revert "Creates new colocation for colocate_with:='none' too"" (#6227)
This reverts commit d171a736ab.
2022-08-24 10:54:04 +03:00
Marco Slot bad8196da3
Verify that we can replicate reference tables using rebalancer (#6232)
Co-authored-by: Marco Slot <marco.slot@gmail.com>
2022-08-24 00:34:21 +02:00
Jelte Fennema e0ada050aa
Enable binary logical replication for shard moves (#6017)
Using binary encoding can save a lot of CPU cycles, both on the sender
and on the receiver. Since the walsender and walreceiver processes are
single threaded, this can matter a lot for the throughput if they are
bottlenecked on CPU.

This feature is only available in PG14, not PG13. It should be safe to 
always enable because it's only used for types that support binary 
encoding according to the PG docs:
> Even when this option is enabled, only data types that have binary 
> send and receive functions will be transferred in binary.

But in case it causes problems, it can still be disabled by setting
`citus.enable_binary_protocol` to `false`.
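If it does cause problems, a hedged sketch of disabling it cluster-wide (to be run on the coordinator and workers):

```sql
-- turn off binary encoding for shard transfers
ALTER SYSTEM SET citus.enable_binary_protocol TO false;
SELECT pg_reload_conf();
```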
2022-08-23 16:38:00 +02:00
Jelte Fennema cc7e93a56a
Fix flakiness in failure_connection_establishment (#6226)
In CI our failure_connection_establishment test sometimes failed randomly
with the following error:
```diff
 -- verify a connection attempt was made to the intercepted node, this would have cause the
 -- connection to have been delayed and thus caused a timeout
 SELECT * FROM citus.dump_network_traffic() WHERE conn=0;
  conn | source | message
 ------+--------+---------
-    0 | coordinator | [initial message]
-(1 row)
+(0 rows)

 SELECT citus.mitmproxy('conn.allow()');
```
Source: https://app.circleci.com/pipelines/github/citusdata/citus/26318/workflows/d3354024-9a67-4b01-9416-5cf79aec6bd8/jobs/745558

The way I fixed this was by removing the dump_network_traffic call. This
might sound simple, but doing so while continuing to let the test
serve its intended purpose required quite a few more changes.

This dump_network_traffic call was there because we didn't want to show
warnings in the queries above, because the exact warnings were not
reliable. The main reason they were not reliable was that we were
using round-robin task assignment. We ran the same query twice, so
that it would hit the node with the intercepted connection in one of
those runs. Instead of doing that I'm now using the
"first-replica" policy and run the queries only once. This works because
the first placements by placementid for each of the used tables are on
the second node, so first-replica will cause the first connection to go
there.

This solved most of the flakiness, but when confirming that the
flakiness was fixed I found some additional errors:

```diff
 -- show that INSERT failed
 SELECT citus.mitmproxy('conn.allow()');
  mitmproxy
 -----------

 (1 row)

 SELECT count(*) FROM single_replicatated WHERE key = 100;
- count
----------------------------------------------------------------------
-     0
-(1 row)
-
+ERROR:  could not establish any connections to the node localhost:9060 after 400 ms
 RESET client_min_messages;
```
Source: https://app.circleci.com/pipelines/github/citusdata/citus/26321/workflows/fd5f4622-400c-465e-8d82-83f5f55a87ec/jobs/745666


I addressed this with a combination of two things:
1. Only change citus.node_connection_timeout for the queries that we
   want to test timeout behaviour for, and reset it to the default again
   when those queries are done (sketched below).
2. Change our mitm framework to only delay the initial connection packet
   instead of all packets. I think sometimes a follow-on packet of a previous
   connection attempt was causing the next connection attempt to be delayed
   even if `conn.allow()` was already called. For our tests we only care about
   connection timeouts, so there's no reason to delay any packets other than
   the initial connection packet.
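A hedged sketch of what that combination looks like in a test; the `connect_delay` method name and the exact timing values are illustrative, not verbatim from the PR:

```sql
-- raise the timeout only around the query under test
SET citus.node_connection_timeout TO 900;
-- delay only the initial connection packet, not every packet
SELECT citus.mitmproxy('conn.connect_delay(500)');
SELECT name FROM r1 WHERE id = 2;  -- the query whose timeout behaviour we test
SELECT citus.mitmproxy('conn.allow()');
RESET citus.node_connection_timeout;
```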

Then there was some remaining flakiness in the exact error that was given:

```diff
 -- tests for connectivity checks
 SELECT name FROM r1 WHERE id = 2;
 WARNING:  could not establish any connections to the node localhost:9060 after 900 ms
+WARNING:  connection to the remote node localhost:9060 failed with the following error:
  name
 ------
  bar
 (1 row)
```
Source: https://app.circleci.com/pipelines/github/citusdata/citus/26338/workflows/9610941c-4d01-4f62-84dc-b91abc56c252/jobs/746467

I don't have a good explanation for this slight change in error message, but
given that it is missing the actual error message I expected this to be related
to some small difference in timing: e.g. the server responding to the connection
attempt right after the coordinator determined that the connection timed out.
To solve this last flakiness I increased the connection timeouts and made the
difference between the timeout and the delay a bit bigger. With these tweaks
I wasn't able to reproduce this error on CI anymore.

Finally, I made most of the same changes to failure_failover_to_local_execution,
since it was using the `conn.delay()` mitm method too. The only change that
I left out was the timing increase, since it might not be strictly necessary and
it increases the time it takes to run the test. If this test ever becomes flaky, the
first thing we should try is increasing its timeouts.
2022-08-23 15:04:20 +03:00
Jelte Fennema 506c16efdf
Fix flakiness in failure_single_select (#6223)
The failure_single_select test would sometimes fail with an error that's
similar to this:
```diff
 -- cancel after first SELECT; txn should fail and nothing should be marked as invalid
 SELECT citus.mitmproxy('conn.onQuery(query="^SELECT").cancel(' ||  pg_backend_pid() || ')');
- mitmproxy
----------------------------------------------------------------------
-
-(1 row)
-
+ERROR:  canceling statement due to user request
+CONTEXT:  COPY mitmproxy_result, line 1: ""
+SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'"
+PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE
 BEGIN;
```

This error looked very similar to the one from #6217 and indeed the cause turned
out to be similar. Because we were canceling all SELECT queries, we
would actually sometimes cancel our own mitmproxy SELECT queries.

This PR puts some additional restrictions on the queries that we cancel;
most importantly, the query should contain the name of the table that we're
selecting from.
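A hedged sketch of such a restricted cancellation; the table-name pattern is illustrative:

```sql
-- only cancel SELECTs that reference the test table, so the mitmproxy
-- control statements (which are themselves SELECTs) are never cancelled
SELECT citus.mitmproxy('conn.onQuery(query="SELECT.*single_select_table").cancel(' || pg_backend_pid() || ')');
```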

I was able to reproduce the original issue locally pretty reliably. With
the changes in this PR it didn't happen again.

In passing this also changes one other failure test that was cancelling
all selects and puts similar additional restrictions on those
cancellations. 

Example of failed test in CI: https://app.circleci.com/pipelines/github/citusdata/citus/26305/workflows/4d942b91-f83c-453c-8d9a-ae22d608e756/jobs/745071
2022-08-22 20:06:33 +02:00
Hanefi Onaldi e33ba7da9e
Decrease min messages for normalization 2022-08-22 17:16:52 +03:00
Jelte Fennema e2a24b921e
Fix flakiness in failure_create_distributed_table_non_empty (#6217)
The failure_create_distributed_table_non_empty test would sometimes fail
like this:
```diff
 -- in the first test, cancel the first connection we sent from the coordinator
 SELECT citus.mitmproxy('conn.cancel(' ||  pg_backend_pid() || ')');
- mitmproxy
----------------------------------------------------------------------
-
-(1 row)
-
+ERROR:  canceling statement due to user request
+CONTEXT:  COPY mitmproxy_result, line 1: ""
+SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'"
+PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE
 SELECT create_distributed_table('test_table', 'id');
```

Because the cancel command had no filter it would actually sometimes
cancel the mitmproxy cancel command itself. This PR addresses that by
filtering on CREATE TABLE, which is one of the commands that
create_distributed_table will send to the workers.
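A hedged sketch of the filtered cancel; the exact pattern is illustrative:

```sql
-- cancel only when a CREATE TABLE is sent to a worker, so the
-- mitmproxy call itself can no longer be the victim
SELECT citus.mitmproxy('conn.onQuery(query="CREATE TABLE").cancel(' || pg_backend_pid() || ')');
```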

Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26252/workflows/1b7e5464-cca4-4ec1-99b3-48ddf25c29fa/jobs/742829
2022-08-20 01:23:25 +03:00
Jelte Fennema 4ce17f015b
Fix flakiness in columnar_memory test (#6216)
Sometimes in CI the columnar_memory test was using slightly more memory
than expected.
```diff
 SELECT CASE WHEN 1.0 * TopMemoryContext / :top_post BETWEEN 0.98 AND 1.02 THEN 1 ELSE 1.0 * TopMemoryContext / :top_post END AS top_growth
 FROM columnar_test_helpers.columnar_store_memory_stats();
--[ RECORD 1 ]-
-top_growth | 1
+-[ RECORD 1 ]------------------
+top_growth | 1.0206132116232119

 -- before this change, max mem usage while executing inserts was 28MB and
```

This PR changes the expectation to be slightly higher, such that this
random increase in memory usage doesn't cause a flaky test.

Failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26256/workflows/c0870f66-3346-4f8d-a1d3-36dfd7c98289/jobs/743028
2022-08-19 23:46:28 +02:00
Jelte Fennema de475feb69
Actually connect to the right database in logical_replication test (#6211)
In the logical_replication test we test that the cleanup logic at the
start of a shard move works as expected. To do so we create a
subscription and publication slot manually. This changes the test to
make that subscription actually connect to the database that the
publication is in.

Useful for #5987 #6085
2022-08-20 00:09:50 +03:00
Naisila Puka 9cfadd7965
Deletes unnecessary test outputs pt2 (#6214) 2022-08-19 18:21:13 +03:00
Jelte Fennema e6a1a86db0
Improve debuggability for columnar_memory flakiness (#6203)
Sometimes the columnar_memory test fails in CI with the following error:
```diff
 SELECT 1.0 * TopMemoryContext / :top_post BETWEEN 0.98 AND 1.02 AS top_growth_ok
 FROM columnar_test_helpers.columnar_store_memory_stats();
 -[ RECORD 1 ]-+--
-top_growth_ok | t
+top_growth_ok | f

 -- before this change, max mem usage while executing inserts was 28MB and
```

This is almost certainly a harmless failure that simply requires bumping
the margin a little bit. However, it's impossible to say with the
current output. I was unable to reproduce this on-demand on my local
machine or even in CI. So this changes the test to include the actual
value difference in the size of TopMemoryContext when it's outside the
expected range. Then next time it fails we at least have some
information about why.

Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/25966/workflows/d472a57b-419a-4f33-b8bc-2e174a98d4d6/jobs/730576
2022-08-19 15:41:16 +02:00
Jelte Fennema fe1668e43f
Fix flakyness in multi_utilities (#6204)
Sometimes the multi_utilities test would fail with the following error:

```diff
SET citus.log_remote_commands TO ON;
 -- should propagate to all workers because no table is specified
 ANALYZE;
 NOTICE:  issuing BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED;SELECT assign_distributed_transaction_id(0, 3461, '2022-08-19 01:56:06.35816-07');
 DETAIL:  on server postgres@localhost:57637 connectionId: 1
 NOTICE:  issuing BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED;SELECT assign_distributed_transaction_id(0, 3461, '2022-08-19 01:56:06.35816-07');
 DETAIL:  on server postgres@localhost:57638 connectionId: 2
 NOTICE:  issuing SET citus.enable_ddl_propagation TO 'off'
 DETAIL:  on server postgres@localhost:57637 connectionId: 1
-NOTICE:  issuing SET citus.enable_ddl_propagation TO 'off'
-DETAIL:  on server postgres@localhost:xxxxx connectionId: xxxxxxx
 NOTICE:  issuing ANALYZE
 DETAIL:  on server postgres@localhost:57637 connectionId: 1
+NOTICE:  issuing SET citus.enable_ddl_propagation TO 'off'
+DETAIL:  on server postgres@localhost:57638 connectionId: 2
 NOTICE:  issuing ANALYZE
 DETAIL:  on server postgres@localhost:57638 connectionId: 2
```

This is simply a harmless change in output due to some timing
differences. This PR makes the test output consistent by only logging
the remote ANALYZE commands, not the SET commands.
2022-08-19 12:38:55 +02:00
Jelte Fennema 8ce12eb51f
Fix flakyness in failure_insert_select_repartition (#6202)
This fixes the failure test that most often fails randomly. The failing
diff is as follows:

```diff
SELECT citus.mitmproxy('conn.onQuery(query="fetch_intermediate_results").kill()');
  mitmproxy
 -----------

 (1 row)

 INSERT INTO target_table SELECT * FROM source_table;
-ERROR:  connection to the remote node localhost:xxxxx failed with the following error: connection not open
+ERROR:  could not open file "base/pgsql_job_cache/10_0_40/repartitioned_results_20770193413_from_4213590_to_1.data": No such file or directory
+CONTEXT:  while executing command on localhost:9060
+while executing command on localhost:57637
 SELECT * FROM target_table ORDER BY a;
```

As far as I can tell this is caused by a race condition: After killing
fetch_intermediate_results on worker 9060, the previously created data
file gets cleaned up. The fetch_intermediate_results call that's sent
to worker 57637 will be cancelled and rolled back soon because of the
failure on the other connection. But if that fetch_intermediate_results
call is able to connect to 9060 before it is cancelled, it won't find
the file it's looking for there anymore. So while it's not the error we
expect, it does indicate that the kill succeeded.

To avoid this issue instead of killing the fetch_intermediate_results
call directly, we kill the COPY command that it uses to do the fetch.
This results in stable output as can be seen here, where 227 runs of
failure_insert_select_repartition succeeded:
https://app.circleci.com/pipelines/github/citusdata/citus/26168/workflows/9c64a3b6-f46c-4725-9fb4-8f6a2d00a023/jobs/739389
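A hedged sketch of the new kill target; the exact regex is illustrative:

```sql
-- kill the COPY that fetch_intermediate_results uses for the fetch,
-- instead of killing the fetch_intermediate_results() call directly
SELECT citus.mitmproxy('conn.onQuery(query="COPY").kill()');
```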

To be clear, this changes the test to affect the opposite
fetch_intermediate_results call. It now kills the fetch_intermediate_results
call of worker 57637, instead of killing the fetch_intermediate_results call
on worker 9060.

Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26147/workflows/780e95ea-264a-4c9f-ad2e-cf11449a795e/jobs/738467
2022-08-19 09:11:07 +00:00
Naisila Puka 5a9fdc221b
Add explicit alias to avoid debug output diff in pg15 (#6183) 2022-08-19 11:39:18 +03:00
Jelte Fennema d16b458e2a
Remove the flaky rollback_to_savepoint test (#6190)
This removes a flaky test that I introduced in #3868 after I fixed the
issue described in #3622. This test sometimes fails randomly in CI.
The way it fails indicates that there might be some bug: a connection
breaks after rolling back to a savepoint.

I tried reproducing this issue locally, but I wasn't able to. I don't
understand what causes the failure.

Things that I tried were:

1. Running the test with:
   ```sql
   SET citus.force_max_query_parallelization = true;
   ```
2. Running the test with:
   ```sql
   SET citus.max_adaptive_executor_pool_size = 1;
   ```
3. Running the test in parallel with the same tests that it is run in
   parallel with in multi_schedule.

None of these allowed me to reproduce the issue locally.

So I think it's time to give up on fixing this test and simply remove
it. The regression that this test protects against seems very unlikely
to reappear, since in #3868 I also added a big comment about the need
for the newly added `UnclaimConnection` call. So, I think the need for
the test is quite small, and removing it will make our CI less flaky.

In case the cause of the bug ever gets found, I tracked the bug in #6189

Example of a failing CI run:
https://app.circleci.com/pipelines/github/citusdata/citus/26098/workflows/f84741d9-13b1-4ae7-9155-c21ed3466951/jobs/736424

For reference the unexpected diff is this (so both warnings and an error):
```diff
 INSERT INTO t SELECT i FROM generate_series(1, 100) i;
+WARNING:  connection to the remote node localhost:57638 failed with the following error: 
+WARNING:  
+CONTEXT:  while executing command on localhost:57638
+ERROR:  connection to the remote node localhost:57638 failed with the following error: 
 ROLLBACK;
```

This test is also mentioned as the most failing regression test in #5975
2022-08-18 15:14:16 +03:00
Onder Kalaci 9ec8e627c1 Support Sequences owned by columns before distributing tables
There are 3 different ways that a sequence can interact
with tables. (1) and (2) are already supported. This commit adds
support for (3).

     (1) column DEFAULT nextval('seq'):

	The dependency is roughly like below,
	and ExpandCitusSupportedTypes() is responsible
	for finding the depending sequences.

        schema <--- table <--- column <---- default value
         ^                                     |
         |------------------ sequence <--------|

    (2) serial columns: Bigserial/small serial etc:

	The dependency is roughly like below,
	and ExpandCitusSupportedTypes() is responsible
	for finding the depending sequences.

        schema <--- table <--- column <---- default value
                                 ^             |
                                 |             |
                             sequence <--------|

   (3) Sequence OWNED BY table.column: Added support for
       this type of resolution in this commit.

       The dependency is almost like the following, and
       ExpandCitusSupportedTypes() is NOT responsible for finding
       the dependency.

        schema <--- table <--- column
                                 ^
                                 |
                             sequence
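A minimal example of case (3), with hypothetical table and sequence names:

```sql
CREATE TABLE events (id bigint, counter bigint);
-- the sequence depends on the column via OWNED BY, without appearing
-- in the column's DEFAULT expression
CREATE SEQUENCE events_counter_seq OWNED BY events.counter;
-- after this commit, the owned sequence is resolved and handled too
SELECT create_distributed_table('events', 'id');
```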
2022-08-18 10:29:40 +02:00
Ying Xu 91473635db
[Columnar] Check for existence of Citus before creating Citus_Columnar (#6178)
* Added a check to see if Citus has already been loaded before creating citus_columnar

* added tests
2022-08-17 15:12:42 -07:00
Ahmet Gedemenli 0631e1998b
Fix upgrade paths for #6100 (#6176)
* Fix upgrade paths for #6100

Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>
2022-08-17 18:56:53 +03:00
Naisila Puka 20a0e0ed39
Grant create on public to some users where necessary (for PG15) (#6180) 2022-08-17 17:35:10 +03:00
aykut-bozkurt 52efe08642
default mode for shard splitting is set to auto. (#6179) 2022-08-17 12:18:47 +03:00
aykut-bozkurt be06d65721
Nonblocking tenant isolation is supported by using split api. (#6167) 2022-08-17 11:13:07 +03:00
Jelte Fennema 78a5013e24
Support changing CPU priorities for backends and shard moves (#6126)
**Intro**
This adds support to Citus to change the CPU priority values of
backends. This is created with two main usecases in mind:

1. Users might want to run the logical replication part of the shard moves
   or shard splits at a higher speed than it would run by default.
   This might cause some small loss of DB performance for their regular
   queries, but this is often worth it. During high load it's very possible
   that the logical replication WAL sender is not able to keep up with the
   WAL that is generated. This is especially a big problem when the
   machine is close to running out of disk while doing a rebalance.
2. Users might have certain long-running queries that they want to run at a
   lower priority so they don't impact the regular workload too much.

**Be very careful!!!**
Using CPU priorities to control scheduling can be helpful in some cases
to control which processes are getting more CPU time than others.
However, due to an issue called "[priority inversion][1]" it's possible that
using CPU priorities together with the many locks that are used within
Postgres causes the exact opposite behavior of what you intended. This
is why this PR only allows the PG superuser to change the CPU priority
of its own processes. Currently it's not recommended to set `citus.cpu_priority`
directly. Currently the only recommended interface for users is the setting
called `citus.cpu_priority_for_logical_replication_senders`. This setting
controls CPU priority for a very limited set of processes (the logical
replication senders). So, the dangers of priority inversion are also limited
when using it for this use case.

**Background**
Before reading the rest it's important to understand some basic
background regarding process CPU priorities, because they are a bit
counterintuitive. A lower priority value means that the process will
be scheduled more and whatever it's doing will thus complete faster. The
default priority for processes is 0. Valid values are from -20 to 19
inclusive. On Linux a larger difference between values of two processes
will result in a bigger difference in percentage of scheduling.

**Handling the usecases**
Use case 1 can be achieved by setting `citus.cpu_priority_for_logical_replication_senders`
to the priority value that you want it to have. It's necessary to set
this on both the workers and the coordinator. Example:
```
citus.cpu_priority_for_logical_replication_senders = -10
```

Use case 2 can with this PR be achieved by running the following as
superuser. Note that this is only possible as superuser currently
due to the dangers mentioned in the "Be very careful!!!" section.
And although this is possible it's **NOT** recommended:
```sql
ALTER USER background_job_user SET citus.cpu_priority = 5;
```

**OS configuration**
To actually make these settings work well it's important to run Postgres
with a more permissive value for the 'nice' resource limit than
Linux uses by default. By default Linux will not allow a process to
set its priority lower than it currently is, even if it was lower when
the process originally started. This capability is necessary to reset
the CPU priority to its original value after a transaction finishes.
Depending on how you run Postgres this needs to be done in one of two
ways:

If you use systemd to start Postgres all you have to do is add a line
like this to the systemd service file:
```conf
LimitNice=+0 # the + is important, otherwise it's interpreted incorrectly as 20
```

If that's not the case you'll have to configure `/etc/security/limits.conf` 
like so, assuming that you are running Postgres as the `postgres` OS user:
```
postgres            soft    nice            0
postgres            hard    nice            0
```
Finally you'd have to add the following line to `/etc/pam.d/common-session`
```
session required pam_limits.so
```

These settings would allow changing the priority back after setting it
to a higher value.

However, to actually allow you to set priorities even lower than the
default priority value you would need to change the values in the 
config to something lower than 0. So for example:
```conf
LimitNice=-10
```

or

```
postgres            soft    nice            -10
postgres            hard    nice            -10
```

If you use WSL2 you'll likely have to do one more thing: you have to
open a new shell, because PAM is only used during login, and
WSL2 doesn't actually log you in. You can force a login like this:
```
sudo su $USER --shell /bin/bash
```
Source: https://stackoverflow.com/a/68322992/2570866

[1]: https://en.wikipedia.org/wiki/Priority_inversion
2022-08-16 13:07:17 +03:00
Jelte Fennema 43c2a1e88b
Share more code between splits and moves (#6152)
When introducing non-blocking shard split functionality it was based
heavily on the non-blocking shard moves. However, the differences in
usage were slightly too big to be able to reuse the existing functions
easily. So, most logical replication code was simply copied to dedicated
shard split functions and modified for that purpose.

This PR tries to create a more generic logical replication
infrastructure that can be used by both shard splits and shard moves.
There's probably more code sharing possible in the future, but I believe
this is at least a good start and addresses the lowest hanging fruit.

This also adds a CreateSimpleHash function that makes creating the
most common type of hashmap simple.
2022-08-15 20:21:51 +03:00
yxu2162 e1322ec905 Change PG15 test because hash_mem_multiplier's default was changed to 2, instead of 1 as in PG13/14 2022-08-11 09:49:56 -07:00
Önder Kalacı 73fcbdf12c
Merge branch 'main' into add_missing_schema 2022-08-11 11:28:41 +02:00
aykut-bozkurt 898801504e
sysid should be parsed as int. (#6150) 2022-08-11 10:44:46 +03:00
Hanefi Onaldi 294400b2eb
Fix typos in tests that fail on PG15 2022-08-10 22:45:28 +03:00
Onder Kalaci 00ce7235cb Set missing search_path in the tests
On PG 15, the public schema requires an explicit GRANT, so let's avoid the conflict

helpful for #6085
2022-08-10 18:04:10 +02:00
Onder Kalaci 44947d5634 This is not supported in PG15
so fix earlier
2022-08-10 17:44:03 +02:00
naisila ea209bd11d Rename remaining regclass to relation in columnar.options 2022-08-10 15:38:53 +02:00
aykut-bozkurt cc694b6bcf
we consider stat object as invalid if it is not owned by current user (#6130) 2022-08-09 20:59:30 +03:00
Hanefi Onaldi 6ef96ac560
Use client side \copy when accessing test files 2022-08-09 15:00:42 +03:00
Hanefi Onaldi 9f52fa7610
Remove dynamic translation of regression test scripts, step 2.
This commit is inspired by a commit
dc9c3b0ff21465fa89d71eecf5e6cc956d647eca from PostgreSQL 15 that shares
the same header.

I also removed some gitignore rules so that I can add some files to git
worktree. We used to ignore the generated files, that are no longer
generated after this commit.

--------------------

Below is the commit message from PostgreSQL 15 commit
dc9c3b0ff21465fa89d71eecf5e6cc956d647eca :

"git mv" all the input/*.source and output/*.source files into
the corresponding sql/ and expected/ directories.  Then remove
the pg_regress and Makefile infrastructure associated with
dynamic translation.

Discussion: https://postgr.es/m/1655733.1639871614@sss.pgh.pa.us
2022-08-09 14:15:52 +03:00
Jelte Fennema 8017693b2f
Allow specifying the shard_transfer_mode when replicating reference tables (#6070)
When using `citus.replicate_reference_tables_on_activate = off`,
reference tables need to be replicated later. This can be done using the
`replicate_reference_tables()` UDF. However, this function only allowed
blocking replication. This changes the function to default to logical
replication instead, and allows choosing any of our existing shard
transfer modes.
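A hedged usage sketch, assuming the new parameter is called `shard_transfer_mode` like in the other shard transfer UDFs:

```sql
-- defaults to logical replication after this change
SELECT replicate_reference_tables();
-- or pick a transfer mode explicitly
SELECT replicate_reference_tables(shard_transfer_mode := 'block_writes');
```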
2022-08-09 13:21:31 +03:00
Marco Slot 3b57ff2867 Fix crash in citus_copy_shard_placement 2022-08-09 09:31:05 +02:00
naisila 796d90d293 Explain w/out costs in ch_bench to avoid PG15 output diff 2022-08-09 07:53:27 +03:00
Naisila Puka bcbba99c96
Clean up large_table_shard_count guc leftovers (#6144) 2022-08-09 06:31:57 +03:00
Naisila Puka 3806f6f6a9
Add ORDER BY in pg_locks to avoid output order diffs (#6145) 2022-08-09 06:02:07 +03:00
Naisila Puka ce944c3c0f
Remove bogus guc citus.compression (#6142) 2022-08-09 05:21:32 +03:00
Jelte Fennema dd548ee3c7
Use faster custom copy logic for non-blocking shard moves (#6119)
DESCRIPTION: Use faster custom copy logic for non-blocking shard moves

Non-blocking shard moves consist of two main phases:
1. Initial data copy
2. Catchup phase

This changes the first of these phases significantly. Previously we used the
copy logic provided by postgres subscriptions. This meant we didn't have
to implement it ourselves, but it came with the downside of little control.
When implementing shard splits we needed more control to even make it
work, so we implemented our own logic for copying data between nodes.

This PR starts using that logic for non-blocking shard moves. Doing so
has four main advantages:
1. It uses COPY in binary format when possible, which is cheaper to encode 
    and decode. Furthermore it very often results in less data that needs to 
    be sent over the network.
2. It allows us to create the primary key (or other replica identity) after doing
    the initial data copy. This should give some speed up over the total run,
    because creating an index in bulk is much faster than incrementally building it.
3. It doesn't require a replication slot per parallel copy. Increasing the maximum
    number of replication slots uses resources in postgres, even if they are not used.
    So reducing the number of replication slots that shard moves need is nice.
4. Logical replication table_sync workers are slow to start up, so if lots of shards
    need to be copied that can make it quite slow. This can happen easily when
    combining Postgres partitioning with Citus.
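For reference, a hedged sketch of kicking off such a non-blocking move; the shard id, node names, and ports are hypothetical:

```sql
SELECT citus_move_shard_placement(
    102008,              -- hypothetical shard id
    'worker-1', 5432,    -- source node
    'worker-2', 5432,    -- target node
    shard_transfer_mode := 'force_logical');
```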
2022-08-08 17:09:43 +02:00
Marco Slot 6aee8f35a6 Fix tenant isolation failure tests 2022-08-08 13:33:23 +02:00
Marco Slot 044dd26e40 Reimplement tenant isolation on top of block shard split 2022-08-08 13:33:23 +02:00
Naisila Puka 3401b31c13
Deletes unnecessary test outputs (#6140) 2022-08-08 11:19:14 +03:00
Naisila Puka 9eedf6dcf8
Reduce log level to avoid alternative output for PG15 (#6139) 2022-08-07 16:07:58 +03:00
aykut-bozkurt 4992533e33
support grant statement propagation for aggregates (#6132) 2022-08-05 14:47:33 +03:00
Ahmet Gedemenli 8b68b0b5bb
Fix pg upgrade script for foreign tables (#6100)
Fixes unexpected error for foreign tables when upgrading pg
2022-08-05 13:35:17 +03:00
Sameer Awasekar e236711eea Introduce Non-Blocking Shard Split Workflow 2022-08-04 16:32:38 +02:00
Naisila Puka a1c630a16e
Reduce shard_count to reduce drain_node execution time (#6128)
master_drain_node in the distributed_triggers.sql test file takes too
long to execute. It is directly dependent on the shard count.
Hence I reduced the shard count from 32 to 4 (the default in tests),
since this doesn't affect the validity of the tests.
2022-08-04 15:34:13 +03:00
aykut-bozkurt 3ddc089651
stop distributing views with no distributed dependency if GUC DistributeLocalViews is set false. (#6083) 2022-08-04 12:34:40 +03:00
Jelte Fennema 8866d9ac32
Reduce setup time of check-minimal and check-minimal-mx (#6117)
This change reduces the setup time of our minimal schedules in two ways:
1. Don't run `multi_cluster_management`, but instead run a much smaller
   sql file with almost the same results. `multi_cluster_management`
   adds and removes lots of nodes and tests all kinds of failure
   scenarios. This is not needed for the minimal schedules. The only
   reason we were using it there was to get a working cluster of the
   layout that the tests expected. The new `minimal_cluster_management`
   test achieves this with much less work, going from ~2s to ~0.5s.
2. Parallelize a bit more of the helper tests.
2022-08-02 17:58:59 +03:00
Naisila Puka 28e22c4abf
Reduce log level to avoid alternative output for PG15 (#6118)
We are reducing the log level here to avoid alternative test output
in PG15 because of the change in the display of SQL-standard
function's arguments in INSERT/SELECT in PG15.
The log level changes can be reverted when we drop support for PG14
Relevant PG commit:
a8d8445a7b2f80f6d0bfe97b19f90bd2cbef8759
2022-08-02 11:56:28 +03:00
Jelte Fennema abffa6c3b9
Use shard split copy code for blocking shard moves (#6098)
The new shard copy code that was created for shard splits has some
advantages over the old shard copy code. The old code was using 
worker_append_table_to_shard, which wrote to disk twice. And it also 
didn't use binary copy when that was possible. Both of these issues
were fixed in the new copy code. This PR starts using this new copy
logic also for shard moves, not just for shard splits.

On my local machine I created a single shard table like this.
```sql
set citus.shard_count = 1;
create table t(id bigint, a bigint);
select create_distributed_table('t', 'id');

INSERT into t(id, a) SELECT i, i from generate_series(1, 100000000) i;
```

I then turned `fsync` off to make sure I wasn't bottlenecked by disk. 
Finally I moved this shard between nodes with `citus_move_shard_placement`
with `block_writes`.

Before this PR a move took ~127s, after this PR it took only ~38s. So for this 
small test this resulted in spending ~70% less time.

And I also tried the same test for a table that contained large strings:
```sql
set citus.shard_count = 1;
create table t(id bigint, a bigint, content text);
select create_distributed_table('t', 'id');

INSERT into t(id, a, content) SELECT i, i, 'aunethautnehoautnheaotnuhetnohueoutnehotnuhetncouhaeohuaeochgrhgd.athbetndairgexdbuhaobulrhdbaetoausnetohuracehousncaoehuesousnaceohuenacouhancoexdaseohusnaetobuetnoduhasneouhaceohusnaoetcuhmsnaetohuacoeuhebtokteaoshetouhsanetouhaoug.lcuahesonuthaseauhcoerhuaoecuh.lg;rcydabsnetabuesabhenth' from generate_series(1, 20000000) i;
```
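For reference, a hedged sketch of the move itself; the node names and ports are hypothetical:

```sql
SELECT citus_move_shard_placement(
    (SELECT shardid FROM pg_dist_shard WHERE logicalrelid = 't'::regclass),
    'localhost', 9701,   -- source node
    'localhost', 9702,   -- target node
    shard_transfer_mode := 'block_writes');
```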
2022-08-01 20:10:36 +03:00
Naisila Puka 5060d0ab17
Remove leftover PG version_above_11 checks from tests (#6112) 2022-08-01 15:38:19 +03:00
Onder Kalaci bdaeb40b51 Add missing relation access record for local utility command
While testing 5670dffd33, I realized
that we have a missing RecordNonDistTableAccessesForTask() for
local utility commands.

Although we don't have to record the relation access for local-only
cases, we really want the scale-out behaviour to be the same as
single node in all aspects. We wouldn't want any complex single-node
transaction to work on a single machine, but not on a multi-node
cluster. Hence, we apply the same restrictions.

For example, on a distributed cluster the following errors out, and
after this commit it errors out locally as well:

```SQL
CREATE TABLE ref(a int primary key);
INSERT INTO ref VALUES (1);

CREATE TABLE dist(a int REFERENCES ref(a));
SELECT create_reference_table('ref');
SELECT create_distributed_table('dist', 'a');

BEGIN;
		SELECT * FROM dist;
		TRUNCATE ref CASCADE;

ERROR:  cannot execute DDL on table "ref" because there was a parallel SELECT access to distributed table "dist" in the same transaction
HINT:  Try re-running the transaction with "SET LOCAL citus.multi_shard_modify_mode TO 'sequential';"

COMMIT;
```

We also add a comprehensive test suite and run the same tests locally.
2022-07-29 11:36:33 +02:00
Marco Slot cff013a057 Fix issues with insert..select casts and column ordering 2022-07-28 13:23:57 +02:00
Onder Kalaci b41c3fd30d Add tests 2022-07-28 11:27:59 +02:00
Ying Xu fdf090758b
Bugfix for IN clause to be considered during planner phase in Columnar (#6030)
Reported bug #5803 shows that we are currently not sending the IN clause to our planner for columnar. This PR fixes it by checking for ScalarArrayOpExpr in ExtractPushdownClause so that we do not skip it. Also added a test case for this new addition.
2022-07-27 11:06:49 -07:00
Ahmet Gedemenli 2b2a529653
Error out for views with circular dependencies (#6051)
Adds error check for views with circular dependencies
2022-07-27 17:57:45 +03:00
Naisila Puka 1259d83511
Smallfix in CreateCollationDDL logic (#6089) 2022-07-27 14:33:31 +03:00
aykut-bozkurt 67ac3da2b0
added citus_depended_objects udf and HideCitusDependentObjects GUC to hide citus depended objects from pg meta queries (#6055)
use RecurseObjectDependencies api to find if an object is citus depended

make vanilla tests runnable to see if citus_depended function is working correctly
2022-07-25 16:43:34 +03:00
Marco Slot 5fabf94e39 Allow WITH HOLD cursors with parameters 2022-07-21 12:00:59 +02:00
Hanefi Onaldi eb3e5ee227 Introduce citus_locks view
citus_locks combines the pg_locks views from all nodes and adds
global_pid, nodeid, and relation_name. The columns of citus_locks don't
change based on the Postgres version, however the pg_locks's columns do.
Postgres 14 added one more column to pg_locks (waitstart timestamptz).
citus_locks has the most expansive column set, including the newly added
column. If citus_locks is queried in a Postgres version where pg_locks
doesn't have some columns, the values for those columns in citus_locks
will be NULL
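A hedged example query against the view; it assumes pg_locks columns such as `mode` and `granted` carry through:

```sql
-- show which relations are locked where, across the whole cluster
SELECT global_pid, nodeid, relation_name, mode, granted
FROM citus_locks
WHERE relation_name IS NOT NULL;
```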
2022-07-21 03:06:57 +03:00
Nitish Upreti 3d569cc49a
Shard Split support for Columnar and Partitioned Table (#6067)
DESCRIPTION:
This PR extends support for Partitioned and Columnar tables in blocking 'citus_split_shard_by_split_points' workflow.
Columnar support: no special handling required; just removing checks that fail splits for columnar tables, and adding test coverage.
Partitioned table support:

Skip copying the parent table as it is empty; the partitions contain the data and are treated as co-located shards that will be copied separately.
Attach partitions to the parent on the destination after inserting new shard metadata and before creating foreign key constraints.
MISC:
Fix bug #4949 where blocking shard moves fail if there is a foreign key between partitioned distributed tables (from child to parent).

TEST:
Added new test 'citus_split_shards_columnar_partitioned' for splitting 'partitioned' and 'columnar + partitioned' table.
Added new test 'shard_move_constraints_blocking' to add coverage for shard move bug fix.
Updated test 'citus_split_shard_by_split_points_negative' to allow columnar and partitioned table.
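A hedged sketch of the blocking split workflow this PR extends; the shard id, split point, and node ids are hypothetical, and the final mode argument follows the `shard_transfer_mode` convention mentioned in the split PRs:

```sql
SELECT citus_split_shard_by_split_points(
    102008,              -- hypothetical shard id
    ARRAY['1073741823'], -- split points are text
    ARRAY[1, 2],         -- node ids that receive the new shards
    'block_writes');
```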
2022-07-20 12:24:50 -07:00
Naisila Puka 7d6410c838
Drop postgres 12 support (#6040)
* Remove if conditions with PG_VERSION_NUM < 13

* Remove server_above_twelve(&eleven) checks from tests

* Fix tests

* Remove pg12 and pg11 alternative test output files

* Remove pg12 specific normalization rules

* Some more if conditions in the code

* Change RemoteCollationIdExpression and some pg12/pg13 comments

* Remove some more normalization rules
2022-07-20 17:49:36 +03:00
Nitish Upreti 5b3537cdff
Shard Split for Citus (#6029)
* Blocking split setup

* Add missing type

* Missing API from Metadata Sync

* Shard Split e2e code

* Worker Split Copy DestReceiver skeleton

* Basic destreceiver code

* worker_split_copy UDF

* UDF calling

* Split points are text

* Isolate Tenant and Split Shard Unification

* Fixing executor and misc

* Reindent code

* Fixing UDF definitions

* Hello World Local Copy works

* Remote copy hello world works

* Local and Remote binary test

* Fixing text local copy and adding tests

* Hello World shard split works

* Negative tests

* Blocking Split workflow works

* Refactor

* Bug fix

* Reindent

* Cleaning up and adding comments

* Basic test for shard split workflow

* ReIndent

* Circle CI integration

* Removing include causing circle-ci build failure

* Remove SplitCopyDestReceiver and use PartitionedResultDestReceiver

* Add support for citus.enable_binary_protocol

* Reindent

* Fix build break

* Update Test

* Cleanup on catch

* Addressing open comments

* Update downgrade script and quote schema/table in COPY statement

* Fix metadata sync issue. Update regression test

* Isolation test and bug fix

* Add Isolation test, fix foreign constraint deadlock issue

* Misc code review comments

* Test name needing to be quoted

* Refactor code from review comments

* Explaining shardGroupSplitIntervalListList

* Fix upgrade & downgrade

* Fix broken test

* Test fix Round 2

* Fixing bug and modifying test appropriately

* Fully qualify copy udf name. Run Reindent

* Address PR comments

* Fix null handling when creating AuxiliaryStructures

* Ensure local copy is triggered in tests

* Limit max shards that can be created with split

* Test failure fix

* Remove split_mode and use shard_transfer_mode instead'

* Fix test failure

* Fix test failure

* Fixing permission issue when splitting non-superuser owned tables

* Fix test expected output

* Remove extra space

* Fix test

* attempt to fix test

* Addressing Marco's PR comment

* Only clean shards created by workflow

* Remove from merge

* Update test
2022-07-18 02:54:15 -07:00
ywj 1675519f93
Support citus_columnar as separate extension (#5911)
* Support upgrade and downgrade and separate columnar as citus_columnar extension

Co-authored-by: Yanwen Jin <yanwjin@microsoft.com>
Co-authored-by: Jeff Davis <jeff@j-davis.com>
2022-07-13 21:08:29 -07:00
Onder Kalaci 6cd7319f12 Add more generic read-replica tests 2022-07-13 14:58:30 +02:00
Onder Kalaci 3c343d4563 Add regression tests for LOCK command citus.use_secondary_nodes=always mode 2022-07-13 14:27:11 +02:00
aykutbozkurt d53a7760b0 * support the weird ALTER INDEX/TABLE ... RENAME syntax,
* use the correct lock level when the weird syntax is used
2022-07-04 21:27:47 +03:00
aykutbozkurt ba62c0a148 auto is a valid option for vacuum index_cleanup. 2022-07-04 19:27:55 +03:00
Ahmet Gedemenli c8e1e243b8
Fix matviews for citus_add_local_table_to_metadata (#6023) 2022-07-04 17:00:07 +03:00
Hanefi Onaldi f60809a6c1
Fix downgrade scripts from 11.0-2 to 11.0-1 (#6039) 2022-06-29 22:43:50 +03:00
Onder Kalaci bab4c0a8c3 Fixes a bug that prevents upgrades when there are no worker nodes 2022-06-28 15:54:49 +02:00
Onder Kalaci bd3a070369 Fixes a bug that prevents upgrades when there COMPRESSION and DEFAULT columns 2022-06-28 13:36:00 +02:00
aykutbozkurt 8194dc4c62 * Added isolation tests for vacuum,
* Added more regression tests for more vacuum options,
* Fixed deadlock for unqualified vacuum when there is only 1 worker,
* Supported lock_skipped for vacuum.
2022-06-23 15:33:14 +03:00
aykutbozkurt 1d6c81245c Fix a bug causing a column mismatch in shard tasks when specifying column names for Citus tables in VACUUM and ANALYZE commands 2022-06-23 15:33:14 +03:00
Aykut Bozkurt 6986f53835 propagate unqualified vacuum and analyze to all worker nodes 2022-06-23 15:33:14 +03:00
Ahmet Gedemenli 1ee3e8b7f4
Fix creating stats bug when CREATE TABLE LIKE (#6006) 2022-06-16 12:43:47 +03:00
Jelte Fennema 184c7c0bce
Make enterprise features open source (#6008)
This PR makes all of the features open source that were previously only
available in Citus Enterprise.

Features that this adds:
1. Non blocking shard moves/shard rebalancer
   (`citus.logical_replication_timeout`)
2. Propagation of CREATE/DROP/ALTER ROLE statements
3. Propagation of GRANT statements
4. Propagation of CLUSTER statements
5. Propagation of ALTER DATABASE ... OWNER TO ...
6. Optimization for COPY when loading JSON to avoid double parsing of
   the JSON object (`citus.skip_jsonb_validation_in_copy`)
7. Support for row level security
8. Support for `pg_dist_authinfo`, which allows storing different
   authentication options for different users, e.g. you can store
   passwords or certificates here.
9. Support for `pg_dist_poolinfo`, which allows using connection poolers
   in between coordinator and workers
10. Tracking distributed query execution times using
   citus_stat_statements (`citus.stat_statements_max`,
   `citus.stat_statements_purge_interval`,
   `citus.stat_statements_track`). This is disabled by default.
11. Blocking tenant_isolation
12. Support for `sslkey` and `sslcert` in `citus.node_conninfo`
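For example, item 12 means SSL client options can now be placed in `citus.node_conninfo`; a hedged sketch with hypothetical paths:

```sql
ALTER SYSTEM SET citus.node_conninfo =
    'sslmode=require sslcert=/etc/citus/client.crt sslkey=/etc/citus/client.key';
SELECT pg_reload_conf();
```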
2022-06-16 00:23:46 -07:00
Burak Velioglu e244e9ffb6
Fix dropping temporary view without specifying the explicit schema name (#6003) 2022-06-15 16:41:12 +02:00
Marco Slot ee34e1ed9d Fix bug in unqualified, non-existing DROP DOMAIN IF EXISTS 2022-06-15 13:59:08 +02:00
Ahmet Gedemenli 268d3fa3a6
Fix materialized view intermediate result filename (#5982) 2022-06-14 15:07:08 +03:00
Onder Kalaci af22a30b48 Use citus_finish_citus_upgrade() in the tests
We already have tests relying on citus_finalize_upgrade_to_citus11().
Now, adjust those to rely on citus_finish_citus_upgrade() and
always call citus_finish_citus_upgrade().
2022-06-13 13:15:15 +02:00
Halil Ozan Akgul b255706189 Fixes the bug where undistribute can drop Citus extension 2022-05-31 16:23:28 +03:00
Onder Kalaci 89c1ccb7a5 Show that no metadata is sent when disabled 2022-05-30 13:41:06 +02:00
Ahmet Gedemenli 26d927178c
Propagate dependent views upon distribution (#5950) 2022-05-26 14:23:45 +03:00
Burak Velioglu 1d7dda991f Create views and materialized views with the right schema and owner while
altering the distributed table.

To be able to alter a view's owner without enforcing sequential mode,
the ALTER VIEW processing functions have been updated to use the
metadata connection.
2022-05-24 15:27:30 +03:00
Onder Kalaci dd02e1755f Parallelize metadata syncing on node activate
It is often useful to be able to sync the metadata in parallel
across nodes.

Also citus_finalize_upgrade_to_citus11() uses
start_metadata_sync_to_primary_nodes() after this commit.

Note that this commit does not parallelize all pieces of node
activation or metadata syncing. Instead, it tries to parallelize
potentially large parts of the metadata, which are the objects and
distributed tables (in general, Citus tables).

In the future, it would be nice to sync the reference tables
in parallel across nodes.

Create ~720 distributed tables / ~23450 shards
```SQL
-- declaratively partitioned table
CREATE TABLE github_events_looooooooooooooong_name (
  event_id bigint,
  event_type text,
  event_public boolean,
  repo_id bigint,
  payload jsonb,
  repo jsonb,
  actor jsonb,
  org jsonb,
  created_at timestamp
) PARTITION BY RANGE (created_at);

SELECT create_time_partitions(
  table_name         := 'github_events_looooooooooooooong_name',
  partition_interval := '1 day',
  end_at             := now() + '24 months'
);

CREATE INDEX ON github_events_looooooooooooooong_name USING btree (event_id, event_type, event_public, repo_id);
SELECT create_distributed_table('github_events_looooooooooooooong_name', 'repo_id');

SET client_min_messages TO ERROR;

```

across 1 node: almost the same, as expected
```SQL

SELECT start_metadata_sync_to_primary_nodes();
Time: 15664.418 ms (00:15.664)

select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node;
Time: 14284.069 ms (00:14.284)
```

across 7 nodes: ~3.5x improvement
```SQL

SELECT start_metadata_sync_to_primary_nodes();
┌──────────────────────────────────────┐
│ start_metadata_sync_to_primary_nodes │
├──────────────────────────────────────┤
│ t                                    │
└──────────────────────────────────────┘
(1 row)

Time: 25711.192 ms (00:25.711)

-- across 7 nodes
select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node;
Time: 82126.075 ms (01:22.126)
```
2022-05-23 09:15:48 +02:00
jeff-davis a2f5b068e6
Columnar: tighten security and improve visibility. (#5922)
Move internal storage details to a separate schema with no public
access to limit the possibility for information leakage.

Create views with public access that show storage details for those
columnar tables where the user has ownership privileges. Include
mapping between relation ID and storage ID for easier interpretation.
2022-05-20 15:30:31 -07:00
Ying Xu a1151c2395
Clear metadatacache during abort for create extension (#5907)
* Bug fix for bug #5876. Memset MetadataCacheSystem every time there is an abort

* Created an ObjectAccessHook that saves the transaction level at which citus was created and will clear the metadata cache if that transaction level is rolled back. Added additional tests to make sure the metadata cache is cleared
2022-05-20 13:47:58 -07:00
Marco Slot 09ec366ff5 Improve nested execution checks and add GUC to disable 2022-05-20 18:55:43 +02:00
Marco Slot e683993449 Fix prepared statement bug when switching from local to remote execution 2022-05-20 18:55:43 +02:00
jeff-davis a9f8a60007
Columnar: support relation options with ALTER TABLE. (#5935)
Columnar: support relation options with ALTER TABLE.

Use ALTER TABLE ... SET/RESET to specify relation options rather than
alter_columnar_table_set() and alter_columnar_table_reset().

Not only is this more ergonomic, but it also allows better integration
because it can be treated like DDL on a regular table. For instance,
citus can use its own ProcessUtility_hook to distribute the new
settings to the shards.

DESCRIPTION: Columnar: support relation options with ALTER TABLE.
2022-05-20 08:35:00 -07:00
Marco Slot ad5214b50c Allow distributed execution from run_command_on_* functions 2022-05-20 15:26:47 +02:00
gledis69 4731630741 Add distributing lock command support 2022-05-20 12:28:07 +03:00
Marco Slot 79d7e860e6 Add a run_command_on_coordinator function 2022-05-19 10:26:09 +02:00
Marco Slot fa9cee409c Fix downgrade scripts and add new downgrade tests 2022-05-19 10:26:09 +02:00
Onder Kalaci 127450466e Do not warn unnecessarily when a node is removed
In the past (pre-11), we allowed removing worker nodes
that had active placements for a replicated distributed
table, without even checking if there were any other
replicas of the same placement.

However, with #5469, we prevent disabling nodes via a hard
error when the node has the last active placement of a shard, as we
do for reference tables. Note that otherwise we'd allow
users to lose data.

As of today, the NOTICE is completely irrelevant.
2022-05-18 17:23:38 +02:00
Onder Kalaci db998b3d66 Adds "sync" option to citus_disable_node() UDF
Before this commit, we had:
```SQL
SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false)
```

Where, we allow forcing to disable first worker node with
`force:=true`. However, it entails the risk for losing
data / diverging placement data etc.

With `force` flag, we control disabling the first worker node,
and with `async` flag we control whether the changes are done
via bg worker or immediately.

```SQL
SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false, sync boolean DEFAULT false)
```

Where we can achieve all the following:

| Mode  | Data loss possibility | Can run in 2PC | Handle multiple node failures | Immediately effective |
| --- |--- |--- |--- |--- |
| force:false, sync: false  | false   | true  | true  | false |
| force:false, sync: true   | false  | false | false | true |
| force:true, sync: false   | true   | true  | true   | false |
| force:true, sync: true    | false  | false | false  | true |
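A hedged usage sketch based on the signature above:

```sql
-- disable a node immediately (sync), without forcing: no data loss
-- possibility, but no 2PC and no multi-node failure handling
SELECT citus_disable_node('localhost', 9700, force := false, sync := true);
```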
2022-05-18 17:21:12 +02:00
Onder Kalaci 2cc4053fc1 Fixes a bug that prevents dropping/altering indexes
There are two problems in this area. First, when there are expressions
on the index name, we should call `transformIndexExpression()` before
generating the index name. That is what Postgres does.

Second, because of 40c24bfef9,
PG 13 and PG 14 generate different names for indexes with function calls, even for local PG tables.
Assume we have:
```SQL
create table t(id int);
select create_distributed_table('t', 'id');
create index ON t (my_very_boring_function(id));
```

On PG 13, the name of the index is `t_expr_idx`
```SQL
\d t
Table "public.t"
┌────────┬─────────┬───────────┬──────────┬─────────┐
│ Column │  Type   │ Collation │ Nullable │ Default │
├────────┼─────────┼───────────┼──────────┼─────────┤
│ id     │ integer │           │          │         │
└────────┴─────────┴───────────┴──────────┴─────────┘
Indexes:
    "t_expr_idx" btree (my_very_boring_function(id::bigint))
```

On PG 14, the name of the index is `t_my_very_boring_function_idx`
```SQL
\d t
 Table "public.t"
┌────────┬─────────┬───────────┬──────────┬─────────┐
│ Column │  Type   │ Collation │ Nullable │ Default │
├────────┼─────────┼───────────┼──────────┼─────────┤
│ id     │ integer │           │          │         │
└────────┴─────────┴───────────┴──────────┴─────────┘
Indexes:
    "t_my_very_boring_function_idx" btree (my_very_boring_function(id::bigint))

```

The second issue is not very critical. The important part is that
we adjust regression tests to drop all the indexes, which ensures
the index names are sane on any version.
2022-05-18 16:35:17 +02:00
Nils Dijk b71a08955a
Refactor: reduce complexity and code duplication for Object Propagation
Over time we have significantly improved the support for objects to be propagated by Citus, so as to make scaling out the database more seamless. It became evident that a lot of code duplication got into the codebase to implement the propagation.

This PR tries to reduce the amount of repeated code that is at most only slightly different. To make things worse, most of the differences were actually oversights rather than intentional.

This Patch introduces 3 reusable sets of pre/post processing steps for respectively
 - create
 - alter
 - drop

With the use of the common functionality we should have more coherent behaviour between the different objects supported by Citus.

Some steps either omit the Pre or Post processing step if they would not make sense to include.

All tests pass, only 1 test needed changing, foreign servers, as the dropping of foreign servers didn't implement support for dropping multiple foreign servers at once. Given the common approach correctly supports dropping of multiple objects, either distributed or not, the test that assumed it wouldn't work was now obsolete.
2022-05-18 15:58:28 +02:00
Onder Kalaci ee45e7bfbf Mark existing views as distributed when upgrade to 11.0+
We have a mechanism which ensures that newly distributed
objects are recorded during `alter extension citus update`.

However, the logic was lacking "view"s. With this commit, we make
sure that existing views are also marked as distributed during
upgrade.
2022-05-18 15:43:17 +02:00
Nils Dijk 14c6c799f2
suppress notices when more dependencies are found (#5954)
We are nearing 100 objects being propagated in `master_copy_shard_placement`, and with the extra supported objects this gets pushed over 100 objects.

When 100 objects are reached for propagation, a notice is shown to the user, informing them it might take a while to finish the operation.

During testing this is not important to see. Since the message contains the exact number of objects to be propagated, the test becomes very unstable when merging community into enterprise.

This change makes sure that the test output stays stable.
2022-05-18 14:31:10 +03:00
Hanefi Onaldi 313104ab9b
Grep logs for deterministic global_cancel test results (#5948) 2022-05-18 11:09:54 +03:00
Halil Ozan Akgul d171a736ab Revert "Creates new colocation for colocate_with:='none' too"
This reverts commit f74447b3b7.
2022-05-17 15:32:22 +03:00
Halil Ozan Akgul f74447b3b7 Creates new colocation for colocate_with:='none' too 2022-05-16 13:39:05 +03:00
Teja Mupparti e56fc34404 Fixes: #5787 In prepared statements, map any unused parameters
to a generic type.
2022-05-13 19:31:05 -07:00
Burak Velioglu 1875516ae9 Add ALTER VIEW support
Adds support for propagation ALTER VIEW commands to
- Change owner of view
- SET/RESET option
- Rename view and view's column name
- Change schema of the view

Since PG also supports targeting views with ALTER TABLE
commands, related code also added to direct such ALTER TABLE
commands to ALTER VIEW commands while sending them to workers.
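A hedged sketch of the newly propagated commands, with hypothetical view, role, and schema names:

```sql
ALTER VIEW active_users OWNER TO reporting_admin;
ALTER VIEW active_users SET (security_barrier = true);
ALTER VIEW active_users RENAME COLUMN uid TO user_id;
ALTER VIEW active_users SET SCHEMA reporting;
```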
2022-05-13 13:21:53 +03:00
Marco Slot 6fad5dc207 Add a citus_is_coordinator function 2022-05-13 10:02:52 +02:00
Gledis Zeneli 4c6f62efc6
Switch to using LOCK instead of lock_relation_if_exists in TRUNCATE (#5930)
Breaking down #5899 into smaller PR-s

This particular PR changes the way TRUNCATE acquires distributed locks on the relations it is truncating to use the LOCK command instead of lock_relation_if_exists. This has the benefit of using pg's recursive locking logic it implements for the LOCK command instead of us having to resolve relation dependencies and lock them explicitly. While this does not directly affect truncate, it will allow us to generalize this locking logic to lock different relations where the pg recursive locking will become useful (e.g. locking views).

This implementation is a bit more complex than it needs to be due to pg not supporting locking foreign tables. We can, however, still lock foreign tables with lock_relation_if_exists. So for a command:

TRUNCATE dist_table_1, dist_table_2, foreign_table_1, foreign_table_2, dist_table_3;

We generate and send the following command to all the workers in metadata:
```sql
SET citus.enable_ddl_propagation TO FALSE;
LOCK dist_table_1, dist_table_2 IN ACCESS EXCLUSIVE MODE;
SELECT lock_relation_if_exists('foreign_table_1', 'ACCESS EXCLUSIVE');
SELECT lock_relation_if_exists('foreign_table_2', 'ACCESS EXCLUSIVE');
LOCK dist_table_3 IN ACCESS EXCLUSIVE MODE;
SET citus.enable_ddl_propagation TO TRUE;
```

Note that we need to alternate between the LOCK command and lock_relation_if_exists in order to preserve the TRUNCATE order of relations.
When pg supports locking foreign tables, we will be able to massively simplify this logic and send a single LOCK command.
2022-05-11 18:38:48 +03:00
Burak Velioglu 1460452442 Introduce CREATE/DROP VIEW
Adds support for propagating create/drop view commands and views to
worker nodes while scaling out the cluster. Since views are dropped while
converting the table type, a metadata connection will be used while
propagating view commands so as not to switch to sequential mode.
2022-05-10 13:07:14 +03:00
Marco Slot ceb593c9da Convert citus.hide_shards_from_app_name_prefixes to citus.show_shards_for_app_name_prefixes 2022-05-03 14:22:13 +02:00
Onder Kalaci 5fc7661169 Do not set coordinator's metadatasynced column to false
After a disable_node
2022-04-25 09:25:59 +02:00
Onder Kalaci a2debe0f02 Do not assign distributed transaction ids for local execution
In the past, for all modifications on the local execution,
we enabled 2PC (with 6a7ed7b309).

This also required us to enable coordinated transactions
via https://github.com/citusdata/citus/pull/4831 .

However, it does have a very substantial impact on the
distributed deadlock detection. The distributed deadlock
detection is designed to avoid single-statement transactions
because they cannot lead to any actual deadlocks.

The implementation is to skip backends that have no distributed
transaction assigned. Now that single-statement local executions
are assigned distributed transaction ids and show up in the lock
graphs, we are conflicting with the design of distributed deadlock
detection.

In general, we should fix it. However, one might think that it is
not a big deal: even if the processes show up in the lock graphs,
the deadlock detection should not produce any false positives. That
is false unless https://github.com/citusdata/citus/issues/1803
is fixed. Because local processes are considered a single
distributed backend, the lock graphs might find:

    local execution 1 [tx id: 1] -> any local process [tx id: 0]
    any local process [tx id: 0] -> local execution 2 [tx id: 2]

And, decides that there is a distributed deadlock.

This commit:
   (a) is the right thing to do, as local execution should not need any
       distributed tx id
   (b) eliminates performance issues that might come up when deadlock
       detection does a lot of unnecessary checks
   (c) reflects that, after moving local execution after the remote execution
       via https://github.com/citusdata/citus/pull/4301, the
       vague requirement for assigning distributed tx ids is
       already gone.
2022-04-13 13:25:12 +02:00
Hanefi Onaldi 6254f30305
Add arbitrary config tests for function DDL statements (#5885) 2022-04-12 16:03:10 +03:00
Burak Velioglu 5d9599f964
Create function in transaction according to create object propagation guc 2022-04-08 17:15:31 +03:00
Nils Dijk 8897361f95
Implement DOMAIN propagation for citus 2022-04-08 15:25:39 +02:00
Jelte Fennema 6d8c5931d6
Work around flaky test related to search_path (#5894)
For some reason search_path is not always set correctly on the worker
when calling a distributed function; this shows up when calling
`insert_document` in our distributed_triggers test. The underlying
reason is currently unknown and warrants deeper investigation.

Currently this test is one of the main causes of random CI failures. So
this change sets the search_path of each function explicitly, to reduce
these failures and let other devs be more efficient while I continue
investigating the root cause of this issue.

Also changes explicit `SET citus.enable_unsafe_triggers = false` to
`RESET citus.enable_unsafe_triggers` in passing.
2022-04-08 16:09:33 +03:00
Marco Slot 2304815356 Allow adding a unique constraint with an index 2022-04-07 16:00:31 +02:00
Marco Slot c0827703ec Fix EXPLAIN ANALYZE JSON format for subplans 2022-04-07 11:38:20 +02:00
Marco Slot 544dce919a Handle user-defined type parameters in EXPLAIN ANALYZE 2022-04-07 11:14:32 +02:00
Marco Slot 9476f377b5 Remove old re-partitioning functions 2022-04-04 18:11:52 +02:00
Marco Slot 8c8c3b665d Add TABLESAMPLE support 2022-04-01 15:51:40 +02:00
Ahmet Gedemenli a62de6494d Add schema tests to arbitrary configs 2022-04-01 13:57:17 +03:00
jeff-davis c485a04139
Separate build of citus.so and citus_columnar.so. (#5805)
* Separate build of citus.so and citus_columnar.so.

Because columnar code is statically-linked to both modules, it doesn't
make sense to load them both at once.

A subsequent commit will make the modules entirely separate and allow
loading them both simultaneously.

Author: Yanwen Jin

* Separate citus and citus_columnar modules.

Now the modules are independent. Columnar can be loaded by itself, or
along with citus.

Co-authored-by: Jeff Davis <jefdavi@microsoft.com>
2022-03-31 19:47:17 -07:00
Gledis Zeneli c9aab7fb8b
Add TRUNCATE arbitrary config tests (#5848)
Adds TRUNCATE arbitrary config tests.
Also adds the ability to skip tests from particular configs.
2022-03-31 14:14:47 +03:00
Onder Kalaci 9043a1ed3f Only hide shards from client backends and pg bg workers
The aim of hiding shards is to hide shards from client applications.

Certain bg workers (such as pg_cron or the Citus maintenance daemon)
should be treated like client applications because users can run
queries from such bg workers. And, these bg workers should follow
the same application_name checks as client backends.

Certain other bg workers, such as logical replication or postgres'
parallel workers, should never hide shards. They are internal
operations.

Similarly the other backend types like the walsender or
checkpointer or autovacuum should never hide shards.
2022-03-30 16:56:12 +02:00
Ahmet Gedemenli f74d3eedc8 Add tests for materialized views 2022-03-30 16:01:11 +03:00
Ahmet Gedemenli 8ef2da8192 Add view tests to arbitrary configs 2022-03-30 12:28:31 +03:00
Önder Kalacı 670fae99f7
Add tests with function dependencies on tables (#5866)
We are not sure if we have such tests, but let's add them anyway
2022-03-29 18:04:07 +03:00
Ahmet Gedemenli 1e1e66eeed
Add index tests to arbitrary configs (#5862) 2022-03-29 13:49:05 +03:00
Ahmet Gedemenli b5448e43e3
Fix aggregate signature bug (#5854) 2022-03-23 13:42:03 +03:00
Burak Velioglu db9f0d926c
Add support for deparsing ALTER FUNCTION ... SUPPORT ... commands 2022-03-22 21:55:55 +03:00
Halil Ozan Akgul 4690c42121 Fixes the ALTER COLLATION "encoding does not exist" bug 2022-03-22 17:42:20 +03:00
Marco Slot 32c23c2775 Disallow re-partition joins when no hash function defined 2022-03-22 13:42:53 +01:00
Onur Tirtir dc31102630 Locally create objects having a dependency that we cannot distribute
We were already doing so for functions & types, believing that
this could not be the case for other object types.

However, as in #5830, we cannot distribute an object that the user
attempts to create in a temp schema. Moreover, this doesn't only
apply to functions and types but also to many other object types.

So with this commit, we teach preprocess/postprocess functions
(that need to create dependencies on worker nodes) how to skip
trying to distribute such objects.

We also start identifying temp schemas as the objects that we
don't know how to propagate to worker nodes so that we can
simply create objects locally if the user attempts to create them
in a temp schema.
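As a hypothetical illustration (the function name is made up), such a statement is now simply executed locally instead of failing during propagation:

```sql
-- hypothetical: an object created in a temp schema is not propagated;
-- it is created locally on the current node only
CREATE FUNCTION pg_temp.add_one(i int) RETURNS int
    AS $$ SELECT i + 1 $$ LANGUAGE sql;
```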

There are 36 callers of `EnsureDependenciesExistOnAllNodes` in
the codebase atm and for the most we still need to throw a hard
error (i.e.: not use `DeferErrorIfHasUnsupportedDependency`
beforehand), such as:

i) user explicitly wants to create a distributed object
* CreateCitusLocalTable
* CreateDistributedTable
* master_create_worker_shards
* master_create_empty_shard
* create_distributed_function
* EnsureExtensionFunctionCanBeDistributed

ii) we don't want to skip altering distributed table on worker nodes
* PostprocessIndexStmt
* PostprocessCreateTriggerStmt
* PostprocessCreateStatisticsStmt

iii) object is already distributed / handled by Citus before, so we
aren't okay with not propagating the ALTER command
* PostprocessAlterTableSchemaStmt
* PostprocessAlterCollationOwnerStmt
* PostprocessAlterCollationSchemaStmt
* PostprocessAlterDatabaseOwnerStmt
* PostprocessAlterExtensionSchemaStmt
* PostprocessAlterFunctionOwnerStmt
* PostprocessAlterFunctionSchemaStmt
* PostprocessAlterSequenceOwnerStmt
* PostprocessAlterSequenceSchemaStmt
* PostprocessAlterStatisticsSchemaStmt
* PostprocessAlterStatisticsOwnerStmt
* PostprocessAlterTextSearchConfigurationSchemaStmt
* PostprocessAlterTextSearchDictionarySchemaStmt
* PostprocessAlterTextSearchConfigurationOwnerStmt
* PostprocessAlterTextSearchDictionaryOwnerStmt
* PostprocessAlterTypeSchemaStmt
* PostprocessAlterForeignServerOwnerStmt

iv) we already cannot create those objects in temp schemas, so skipping
for now
* PostprocessCreateExtensionStmt
* PostprocessCreateForeignServerStmt

Also note that there are 3 more callers of
`EnsureDependenciesExistOnAllNodes` in enterprise in addition to those
36 but we don't need to do anything specific about them due to the same
reasoning given in iii).
2022-03-22 15:09:23 +03:00
Halil Ozan Akgul 50bace9cfb Fixes the bug with type names that start with an underscore 2022-03-22 14:24:30 +03:00
Halil Ozan Akgul 4dbc760603 Introduces citus_coordinator_node_id 2022-03-22 10:34:22 +03:00
Hanefi Onaldi 9f204600af
Allow all possible option types for text search objects (#5838) 2022-03-21 20:01:53 +01:00
Halil Ozan Akgül 6c05e4b35c
Add check_mx to operations schedule (#5818) 2022-03-21 19:09:26 +03:00
Burak Velioglu d4625ec6a1
Add support for zero-argument polymorphic aggregates 2022-03-21 16:10:40 +03:00
Burak Velioglu 2c2064bf36
Create type locally if it has undistributable dependency 2022-03-18 18:23:32 +03:00
Marco Slot 055bbd6212 Use coordinated transaction when there are multiple queries per task 2022-03-18 15:04:27 +01:00
Marco Slot cab243218d Avoid locks in relation_is_a_known_shard 2022-03-18 14:37:39 +01:00
Ahmet Gedemenli eddfea18c2
Fix role creation issue on schema tests (#5812) 2022-03-16 13:49:28 +01:00
Burak Velioglu 333c73a53c
Drop distributed table on worker with ProcessUtilityParseTree 2022-03-15 17:42:01 +03:00
Hanefi Onaldi c0cd8f3d56 Wait until metadata sync before testing distributed sequences 2022-03-15 10:28:51 +01:00
Ahmet Gedemenli 36b33e2491
Add sequence tests to arbitrary config (#5771)
2022-03-14 19:16:24 +03:00
Onder Kalaci db529facab Only change the sequence types if the target column type is a supported sequence type
Before this commit, we erroneously converted the sequence
type to the type of the column in which it is used. However, it is
possible that the sequence is used in an expression which is then
converted to a type that cannot be a sequence type, such as text.

With this commit, we only try this conversion if the column
type is a supported sequence type (e.g., smallint, int and bigint).

Note that we do this conversion because if the column type is a
bigint and the sequence is NOT a bigint, users would be in trouble
because sequences would generate values that are out of the range
of the column. (The other direction is already not supported; e.g.,
an int column with a bigint sequence would fail on the worker.)

In other words, with this commit, we scope this optimization only
when the target column type is a supported sequence type. Otherwise,
we let users use sequences more freely.
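A minimal sketch of the two cases, assuming made-up table and sequence names:

```sql
CREATE SEQUENCE seq;

-- the column type is a supported sequence type (bigint), so the
-- sequence type is aligned with the column type
CREATE TABLE t1 (a bigint DEFAULT nextval('seq'));

-- the sequence is used in an expression whose result type (text) cannot
-- be a sequence type, so no conversion is attempted
CREATE TABLE t2 (a text DEFAULT ('id-' || nextval('seq')::text));
```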
2022-03-11 16:06:00 +01:00
Ahmet Gedemenli d06146360d
Support GRANT ON SCHEMA commands in CREATE SCHEMA statements (#5789)
* Support GRANT ON SCHEMA commands in CREATE SCHEMA statements

* Add test

* add comment

* Rename to GetGrantCommandsFromCreateSchemaStmt
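A minimal sketch of the now-supported form, with hypothetical schema and role names:

```sql
-- a GRANT given as a schema element inside CREATE SCHEMA is now propagated
CREATE SCHEMA sales
    GRANT USAGE ON SCHEMA sales TO reporting_role;
```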
2022-03-11 14:47:45 +03:00
Jelte Fennema e5d5c7be93
Start erroring out for unsupported lateral subqueries (#5753)
With the introduction of #4385 we inadvertently started allowing and
pushing down certain lateral subqueries that were unsafe to push down.
To be precise the type of LATERAL subqueries that is unsafe to push down
has all of the following properties:
1. The lateral subquery contains some non recurring tuples
2. The lateral subquery references a recurring tuple from
   outside of the subquery (recurringRelids)
3. The lateral subquery requires a merge step (e.g. a LIMIT)
4. The reference to the recurring tuple should be something else than an
   equality check on the distribution column, e.g. equality on a non
   distribution column.


Property number four is considered both hard to detect and probably not
used very often. Thus this PR ignores property number four and causes
query planning to error out if the first three properties hold.

Fixes #5327
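A sketch of the now-rejected shape, assuming a reference table `ref` and a table `dist` distributed on `id` (names made up):

```sql
SELECT *
FROM ref
JOIN LATERAL (
    SELECT *
    FROM dist                           -- non-recurring tuples (property 1)
    WHERE dist.category = ref.category  -- recurring tuple referenced on a
                                        -- non-distribution column (2 and 4)
    LIMIT 5                             -- merge step (property 3)
) d ON true;
```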
2022-03-11 11:59:18 +01:00
Hanefi Onaldi b0eb685101
Add support for TEXT SEARCH DICTIONARY objects
TEXT SEARCH DICTIONARY objects depend on TEXT SEARCH TEMPLATE objects.
Since we do not yet support distributed TS TEMPLATE objects, we skip
dependency checks for text search templates, similar to what we do for
roles.

The user is expected to manually create the TEXT SEARCH TEMPLATE objects
before a) adding new nodes, b) creating TEXT SEARCH DICTIONARY objects.
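As a hedged sketch (the dictionary name is made up; `snowball` is a built-in template present on every node):

```sql
-- the dictionary can be distributed because its template already exists
-- on all nodes
CREATE TEXT SEARCH DICTIONARY english_stem_copy (
    TEMPLATE = snowball,
    Language = english
);
```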
2022-03-11 03:40:20 +03:00
Marco Slot 49467e27e6
Ensure worker_save_query_explain_analyze always fully qualifies types (#5776)
Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>
Co-authored-by: Marco Slot <marco.slot@gmail.com>
2022-03-10 07:30:11 -08:00
Gledis Zeneli 2cb02bfb56
Fix node adding itself with citus_add_node leading to deadlock (Fix #5720) (#5758)
If a worker node is being added, a command is sent to get the server_id of the worker from the pg_dist_node_metadata table. If the worker's id is the same as that of the node executing the code, we know the node is trying to add itself. If the node tries to add itself without specifying `groupid:=0`, the operation results in an error.
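A hedged sketch of the allowed form (the hostname is made up):

```sql
-- explicitly register the local node as the coordinator (group 0);
-- a self-addition without groupid := 0 now errors out instead of deadlocking
SELECT citus_add_node('10.0.0.1', 5432, groupid := 0);
```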
2022-03-10 17:46:33 +03:00
Hanefi Onaldi d153c2de0d Fix some typos in comments 2022-03-10 15:03:26 +03:00
Ahmet Gedemenli 551a7d1383
Support CREATE SCHEMA without name (#5782) 2022-03-10 13:38:00 +03:00
Marco Slot 7559ad12ba Change create_object_propagation default to immediate 2022-03-09 17:40:50 +01:00
Burak Velioglu bbe1b16125
Check whether the object has unsupported or circular dependency 2022-03-09 16:37:53 +03:00
Halil Ozan Akgül 333bcc7948
Global PID Helper Functions (#5768)
* Introduces citus_nodename_for_nodeid and citus_nodeport_for_nodeid functions

* Introduces citus_nodeid_for_gpid and citus_pid_for_gpid functions

* Add tests
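A hedged usage sketch; the node id and gpid values below are made-up examples:

```sql
-- resolve a node id to its name and port
SELECT citus_nodename_for_nodeid(1), citus_nodeport_for_nodeid(1);
-- decompose a global PID into its node id and local pid
SELECT citus_nodeid_for_gpid(10000000123), citus_pid_for_gpid(10000000123);
```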
2022-03-09 13:15:59 +03:00
Onder Kalaci 24fcd2a88c Handle dropping the partitioned tables properly
Before this commit, we might be leaving some metadata on the workers.
Now, we handle DROP SCHEMA .. CASCADE properly to avoid any metadata
leakage.
2022-03-07 10:02:54 +01:00
Nils Dijk 3801576dfb
Move pg_dist_object to pg_catalog (#5765)
DESCRIPTION: Move pg_dist_object to pg_catalog

Historically `pg_dist_object` had been created in the `citus` schema as an experiment to understand if we could move our catalog tables to a branded schema. We quickly realised that this interfered with the UX on our managed services and other environments, where users connected via a user with the name of `citus`.

By default postgres puts the username on the search_path. To be able to read the catalog in the `citus` schema we would need to grant access permissions to the schema. This caused newly created objects such as tables to default to this schema for creation, which then failed due to missing write permissions on that schema.

With this change we move the `pg_dist_object` catalog table to the `pg_catalog` schema, where our other catalog tables are also located. This makes the catalog table visible and readable by any user, like our other catalog tables, for debugging purposes.

Note: due to the change of schema, we had to disable 1 test that was running into a discrepancy between the schema and binary. Secondly, we needed to make the lookup functions for the `pg_dist_object` relation and its indexes less strict on the fallback of the naming, due to another test that, because of an unfortunate cache invalidation, needed to look up the relation again. As a result, we won't default to _only_ resolving from `pg_catalog` outside of upgrades.
2022-03-04 17:40:38 +00:00
Ahmet Gedemenli b8eedcd261
Notice when create_distributed_function called without params (#5752)
* Notice when create_distributed_function called without params

* Move variable comments to top

* Add valid check for cache entry

* add objtype to notice msg

* update test outputs

* Add more tests

* Address feedback
2022-03-04 17:26:39 +03:00
Önder Kalacı bd6a6563ff
Merge branch 'master' into calculate_gpid 2022-03-04 11:34:12 +01:00
Burak Velioglu cb6d67a9a9
Make sure that all dependencies of citus tables can be distributed 2022-03-03 20:08:09 +03:00
Onder Kalaci c7b67ba0ea Add citus_backend_gpid()
And also citus_calculate_gpid(nodeId,pid). These UDFs are just
wrappers for the existing functions. Useful for testing and simple
manipulation of citus_stat_activity.
2022-03-03 15:29:40 +01:00
Halil Ozan Akgul 06a0509b1a Introduces citus_stat_activity view 2022-03-03 16:19:20 +03:00
Marco Slot 3ba61244b8 Synchronize pg_dist_colocation metadata 2022-03-03 11:01:59 +01:00
Marco Slot 43e4dd3808 Add a citus.internal_reserved_connections setting 2022-03-02 19:13:53 +01:00
Onder Kalaci 35ec9721b4 Add a new API for enabling Citus MX for clusters upgrading from earlier versions
Clusters created pre-Citus 11 mostly didn't have metadata sync enabled.
For those clusters, we add a utility UDF which fixes some minor issues
and syncs the necessary objects to the workers.
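Assuming this refers to the finalize-upgrade UDF shipped with 11.0, usage would look like:

```sql
-- hedged sketch: run once after upgrading all nodes to 11.0+
SELECT citus_finalize_upgrade_to_citus11();
```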
2022-03-02 17:02:55 +01:00
Onder Kalaci 98751058a9 Add Primary key to the table
Otherwise enterprise tests fail
2022-03-02 12:03:59 +01:00
Ahmet Gedemenli e1809af376 Propagate CREATE AGGREGATE commands 2022-03-02 10:52:43 +03:00
Onder Kalaci b79a0052a4 Drop function in the tests on a newer version
As dropping the function now relies on pg_dist_object, which exists as of 9.0+
2022-03-02 08:45:35 +01:00
Nils Dijk 65bd540943
Feature: configure object propagation behaviour in transactions (#5724)
DESCRIPTION: Add GUC to control ddl creation behaviour in transactions

Historically we would _not_ propagate objects when we are in a transaction block. Creation of distributed tables would not always work in sequential mode, hence objects created in the same transaction as distributing a table that uses the just-created object wouldn't work. The benefit was that the user could still benefit from parallelism.

Now that the creation of distributed tables is supported in sequential mode it would make sense for users to force transactional consistency of ddl commands for distributed tables. A transaction could switch more aggressively to sequential mode when creating new objects in a transaction.

We don't change the default behaviour just yet.

Also, many objects would not even propagate their creation when the transaction was already set to sequential, leaving the possibility of a self-deadlock. The new policy checks solve this discrepancy between objects as well.
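A minimal sketch, assuming the GUC accepts an 'immediate' value that propagates objects right away (switching to sequential execution when needed); table and type names are made up:

```sql
BEGIN;
SET LOCAL citus.create_object_propagation TO 'immediate';
CREATE TYPE pair AS (a int, b int);             -- hypothetical type
CREATE TABLE pairs (id bigint, p pair);
SELECT create_distributed_table('pairs', 'id'); -- pair is already on workers
COMMIT;
```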
2022-03-01 17:29:31 +03:00
Burak Velioglu f17872aed4
Expand functions while resolving dependencies 2022-03-01 17:08:46 +03:00
Gledis Zeneli b825232ecb
Handle rebalance / replication when a node is disabled (Fix #5664) (#5729)
The issue in question is caused when rebalance / replication call `FullShardPlacementList`, which returns all shard placements (including those on nodes disabled with `citus_disable_node`). Eventually, `FindFillStateForPlacement` looks for the state across active workers and fails to find a state for the placements which are on the disabled workers, causing a seg fault shortly after.

Approach:
* `ActivePlacementHash` was not using the status of the shard placement's node to determine if the node is active. Initially, I just fixed that.
* Additionally, I refactored the code which handles active shards in replication / rebalance to:
	* use a single function to determine if a shard placement is active. 
	* do the active shard filtering before calling `RebalancePlacementUpdates` and `ReplicationPlacementUpdates`, so test methods like `shard_placement_rebalance_array` and `shard_placement_replication_array` which have different shard placement active requirements can do their own filtering while using the same rebalance / replicate logic that `rebalance_table_shards` and `replicate_table_shards` use. 

Fix #5664
2022-02-25 19:54:30 +03:00
Marco Slot 8de802eec5 Enable local_shared_pool_size 5 in arbitrary configs test 2022-02-23 19:40:21 +01:00
Marco Slot 490765a754 Enable re-partition joins after local execution 2022-02-23 19:40:21 +01:00
Marco Slot 72d8fde28b Use intermediate results for re-partition joins 2022-02-23 19:40:21 +01:00
Nils Dijk 1fb970224e
Fix: partitioned index dependencies (#5741)
#5685 introduced the resolution of dependencies for indices. This missed support for indices on partitioned tables. This change adds support for partitioned indices to the dependency resolution code.
2022-02-23 17:53:26 +03:00
Teja Mupparti a62901396b Allow unsafe triggers via a GUC 2022-02-21 22:45:17 -08:00
Halil Ozan Akgul f6cd4d0f07 Overrides pg_cancel_backend and pg_terminate_backend to accept global pid 2022-02-21 16:41:35 +03:00
Ahmet Gedemenli c1d5ca9896 Do distributed check first, for DropSchema stmts 2022-02-21 14:43:04 +03:00
Ahmet Gedemenli 28aa715ce2 Add test for citus local tables with dropped columns 2022-02-21 12:07:17 +03:00
yxu2162 8974b2de66 Copied CheckCitusVersion over to Columnar to handle a dependency issue. If we split columnar into two extensions, this will later be changed to CheckColumnarVersion. 2022-02-18 09:47:39 -08:00
Burak Velioglu fa6866ed36
Start to propagate functions to worker nodes with the
CREATE FUNCTION command together with its dependencies.

If the function depends on any nondistributable object, the
function will be created only locally. The parameterless
version of create_distributed_function becomes obsolete
with this change; it will be deprecated from the code with a subsequent PR.
2022-02-18 13:56:51 +03:00
gledis69 a14fada153 Prevent Deadlocks When a Worker Tries to Create Collation (Fix #5583)
* When a worker tried to create a collation which had a dependency in the same worker node,
it would cause a deadlock; now it throws the correct "not a coordinator" error.
2022-02-18 12:28:02 +03:00
Teja Mupparti 46fa47beea Force-delegated functions' distribution argument must be reset as soon as the routine completes execution,
rather than waiting until the top-level Executor ends. This fixes issue #5687
2022-02-17 10:48:30 -08:00
Nils Dijk ea86f9f94e
Add support for TEXT SEARCH CONFIGURATION objects (#5685)
DESCRIPTION: Implement TEXT SEARCH CONFIGURATION propagation

The change adds support to Citus for propagating TEXT SEARCH CONFIGURATION objects. TSConfig objects cannot always be created in one create statement, and instead require a create statement followed by many alter statements to get turned into the object they should represent.

To support this we add functionality to the worker to create or replace objects based on a list of statements. When the lists of the local object and the remote object correspond 1:1 we skip the creation of the object and simply mark it distributed. This is especially important for TSConfig objects as initdb pre-populates databases with a dozen configurations (for many different languages).

When the user creates a new TSConfig based on the copy of an existing configuration there is no direct link to the object copied from. Since there is no link we can't simply rely on propagating the dependencies to the worker and send a qualified
2022-02-17 13:12:46 +01:00
Ahmet Gedemenli a1c3580c64 Support TRUNCATE for foreign tables 2022-02-17 09:59:53 +03:00
Gledis Zeneli badfd561b2
Prevent Citus table functions from being called on shards (Fix #5610) (#5694)
DESCRIPTION: Prevent Citus table functions from being called on shards

The operations that guard against using shards are:
* Create Local Table
* Create distributed table (which affects reference table creation as well).

* I used `ErrorIfRelationIsAKnownShard` instead of `ErrorIfIllegallyChangingKnownShard`.
`ErrorIfIllegallyChangingKnownShard` allows the operation if `citus.enable_manual_changes_to_shards`,
but I am not sure if it ever makes sense to create a distributed, reference, or citus local table out of a shard.

I tried to go over the code to identify other UDFs where shards could be illegally changed, but I could not find any others.
My knowledge of the codebase is not solid enough for me to say for sure.

Fixes #5610
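A hedged sketch of the now-guarded call; the shard-suffixed relation name is made up:

```sql
-- previously this could operate on a shard relation directly;
-- it is now rejected with an error
SELECT create_distributed_table('orders_102008', 'id');
```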
2022-02-14 16:06:48 +03:00
Ahmet Gedemenli 76b63a307b Propagate create/drop schema commands 2022-02-10 14:58:09 +03:00
Marco Slot d0711ea9b4 Delegate function calls in FROM outside of transaction block 2022-02-09 20:56:25 +01:00
Onder Kalaci 1c30f61a70 Prevent citus.node_conninfo from using "application_name"
With https://github.com/citusdata/citus/pull/5657, Citus uses
a fixed application_name while connecting to remote nodes
for internal purposes.

It means that we cannot allow users to override it via
citus.node_conninfo.
2022-02-09 13:22:04 +01:00
Teja Mupparti 1e3c8e34c0 Allow create_distributed_function() on a function owned by an extension
Implement #5649
Allow create_distributed_function() on functions owned by extensions

1) Only update pg_dist_object, and do not propagate CREATE FUNCTION.
2) Ensure corresponding extension is in pg_dist_object.
3) Verify that, if dependencies exist on the function, they resolve to the extension.
4) Impact on node-scaling: We build a list of DDL commands based on all objects in
   pg_dist_object. We need to omit the DDLs for the extension-function, as it
   will get propagated by virtue of the extension creation.
5) Extra checks for functions coming from extensions, to not propagate changes
   via DDL commands, even though the function is marked as distributed in pg_dist_object
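As a hedged example, using the contrib `seg` extension's `seg_upper` function:

```sql
CREATE EXTENSION seg;
-- only records the function in pg_dist_object; no CREATE FUNCTION is
-- propagated, since the extension itself creates it on the workers
SELECT create_distributed_function('seg_upper(seg)');
```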
2022-02-08 11:52:56 -08:00
Halil Ozan Akgul 8ee02b29d0 Introduce global PID 2022-02-08 16:49:38 +03:00
Burak Velioglu 0a70b78bf5
Add test for dist type 2022-02-07 17:50:49 +03:00
Burak Velioglu c0aece64d0
Add test for checking distributed extension function 2022-02-07 17:50:48 +03:00
Teja Mupparti c8e504dd69 Fix the issue #5673
If the expression is simple, such as SELECT function() or PERFORM function()
in PL/pgSQL code, the PL engine does a simple expression evaluation which can't
interpret the Citus CustomScan node. The code checks for simple expressions when
executing a UDF but missed the DO-block scenario; this commit fixes it.
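A hypothetical repro shape for the DO-block case (the function name is made up):

```sql
DO $$
BEGIN
    -- a "simple expression" inside a DO block; previously the PL engine's
    -- fast path could not interpret the Citus CustomScan node here
    PERFORM my_distributed_function(42);
END;
$$;
```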
2022-02-04 15:44:53 -08:00
Ying Xu b5c116449b
Removed dependency from EnsureTableOwner (#5676)
Removed dependency for EnsureTableOwner. Also removed pg_fini() and columnar_tableam_finish(). Still need to remove the CheckCitusVersion dependency to make columnar_tableam.h dependency-free from Citus. 2022-02-04 12:45:07 -08:00
2022-02-04 12:45:07 -08:00