citus

Commit Graph

Author	SHA1	Message	Date
Halil Ozan Akgul	34c2b7e056	Fixes the psql connection bug	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	56e814a333	Adds public host to only hyperscale tests	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	d574ac33a8	Adds next shard ids to multi_create_table tests	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	a701fc774a	Adds multi_schedule_hyperscale schedule	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	5bf350faf9	Removes failing tests This task just removes the failing tests. It doesn't mean this tests cannot be saved. It's just a starting point	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	1aa1f55d8e	Adds check_multi_hyperscale_superuser schedule	2020-04-10 13:05:07 +03:00
Halil Ozan Akgul	c2edf989cf	Adds public host parameters	2020-04-10 13:04:24 +03:00
Halil Ozan Akgul	4b9705f714	Adds worker host parameters	2020-04-10 13:03:28 +03:00
Halil Ozan Akgul	119bf590c8	Creates normalize_modified.sed	2020-04-10 13:03:19 +03:00
Halil Ozan Akgul	c8a81ef1ce	Changes copy to \copy	2020-04-10 13:03:15 +03:00
Halil Ozan Akgul	93b97248b2	Adds a connection string to run tests on that connection	2020-04-10 13:03:03 +03:00
SaitTalhaNisanci	17373d51da	not wait forever in upgrade distributed function before (#3731 )	2020-04-10 09:43:42 +03:00
SaitTalhaNisanci	07f9a442b0	Refactor CopyLocalDataIntoShards (#3693 ) This PR: - Declares variables when they are needed. - Creates DoCopyFromLocalTableIntoShards for better readability. - Doesn't use a hardcoded value, instead use a variable for better readability.	2020-04-10 09:25:26 +03:00
Philip Dubé	d99043fe0c	Merge pull request #3690 from citusdata/fix/limit_non_const Correctly handle non-constant LIMIT/OFFSET clauses	2020-04-09 20:25:27 +00:00
Marco Slot	a4b2197450	Correctly handle non-constant LIMIT/OFFSET clauses	2020-04-09 19:59:50 +00:00
SaitTalhaNisanci	3dc7cad754	use an enum for local execution status (#3733 ) We have two variables that are related to local execution status. TransactionAccessedLocalPlacement and TransactionConnectedToLocalGroup. Only one of these fields should be set, however we didn't have any check for this contraint and it was error prone. What those two variables are used is that we are trying to understand if we should use local execution, the current session, or if we should be using a connection to execute the current query, therefore the tasks. In the enum, now it is more clear what these variables mean. Also, now we have a method to change the local execution status. The method will error if we are trying to transition from a state to a wrong state. This will help us avoid problems.	2020-04-09 19:11:04 +03:00
SaitTalhaNisanci	24dcb02bca	enable local table join with reference table (#3697 ) * enable local table join with reference table * test different cases with local table and reference join	2020-04-09 15:25:54 +03:00
SaitTalhaNisanci	ebda3eff61	read database name inside the function (#3730 )	2020-04-09 13:11:13 +03:00
SaitTalhaNisanci	233e4a24d1	use local execution within transaction block (#3714 ) * use local executon when in a transaction block When we are inside a transaction block, there could be other methods that need local execution, therefore we will use local execution in a transaction block. * update test outputs with transaction block local execution * add a test to verify we dont leak intermediate schemas	2020-04-09 12:41:58 +03:00
SaitTalhaNisanci	fa88046ce1	test that we don't leak intermediate schemas (#3737 ) * test that we don't leak intermediate schemas We have tests to make sure that we don't intermediate any intermediate files, tables etc but we don't test if we are leaking schemas. It makes sense to test this as well. * remove all repartition schemas in case of error This solution is not an ideal one but it seems to be doing the job. We should have a more generic solution for the cleanup but it seems that putting the cleanup in the abort handler is dangerous and it was crashing.	2020-04-09 12:17:41 +03:00
SaitTalhaNisanci	362d72853c	return early in ExecuteTaskListExtended (#3738 ) It is possible to return an error in ExecuteTaskListExtended after performing local execution with the current structure. However there is no point in execution the local tasks if we are going to return an error later. So the local execution is moved after the error check.	2020-04-09 10:10:49 +03:00
Hadi Moshayedi	117233c1e0	Merge pull request #3736 from citusdata/remove_todo Remove todo from reference_table_utils	2020-04-08 12:54:48 -07:00
Hadi Moshayedi	cd877f3fdd	Merge pull request #3637 from citusdata/defer_reference_table_replication_copy Defer reference table replication	2020-04-08 12:54:04 -07:00
Hadi Moshayedi	9b8802ba2d	Remove todo from reference_table_utils	2020-04-08 12:46:55 -07:00
Hadi Moshayedi	dda53a0bba	GUC for replicate reference tables on activate.	2020-04-08 12:42:45 -07:00
Hadi Moshayedi	c168a53ebc	Tests for replicate_reference_tables	2020-04-08 12:41:36 -07:00
Hadi Moshayedi	acfa850c38	Make multi_replicate_reference_table check-base friendly	2020-04-08 12:41:36 -07:00
Hadi Moshayedi	0758a81287	Prevent reference tables being dropped when replicating reference tables	2020-04-08 12:41:36 -07:00
Marco Slot	924cd7343a	Defer reference table replication to shard creation time	2020-04-08 12:41:36 -07:00
Philip Dubé	76a8a3c7c9	Merge pull request #3719 from citusdata/stricter-trigger-checks Verify trigger relation before reading old/new tuples	2020-04-07 16:18:36 +00:00
Philip Dubé	26797bfb94	Verify trigger relation before reading old/new tuples master_dist_placement_cache_invalidate: bail when triggering on pg_dist_shard_placement	2020-04-07 15:39:31 +00:00
Önder Kalacı	9fb83d6e5d	Merge pull request #3703 from citusdata/get_rid_of_side_channel Move connection establishment for intermediate results after query execution	2020-04-07 17:21:30 +02:00
Önder Kalacı	70012dfd33	Do not error when an intermediate file does not exit (#3707 ) When the file does not exist, it could mean two different things. First -- and a lot more common -- case is that a failure happened in a concurrent backend on the same distributed transaction. And, one of the backends in that transaction has already been roll backed, which has already removed the file. If we throw an error here, the user might see this error instead of the actual error message. Instead, we prefer to WARN the user and pretend that the file has no data in it. In the end, the user would see the actual error message for the failure. Second, in case of any bugs in intermediate result broadcasts, we could try to read a non-existing file. That is most likely to happen during development. Thus, when asserts enabled, we throw an error instead of WARNING so that the developers cannot miss.	2020-04-07 17:06:55 +02:00
Onder Kalaci	a695b44ce9	Add new regression tests	2020-04-07 17:06:55 +02:00
Onder Kalaci	4b3d17f466	Make sure that tests are not failing randomly	2020-04-07 17:06:55 +02:00
Onder Kalaci	4f7c902c6c	Move connection establishment for intermediate results after query execution When we have a query like the following: ```SQL WITH a AS (SELECT * FROM foo LIMIT 10) SELECT max(x) FROM a JOIN bar 2 USING (y); ``` Citus currently opens side channels for doing the `COPY "1_1"` FROM STDIN (format 'result') before starting the execution of `SELECT * FROM foo LIMIT 10` Since we need at least 1 connection per worker to do `SELECT * FROM foo LIMIT 10` We need to have 2 connections to worker in order to broadcast the results. However, we don't actually send a single row over the side channel until the execution of `SELECT * FROM foo LIMIT 10` is completely done (and connections unclaimed) and the results are written to a tuple store. We could actually reuse the same connection for doing the `COPY "1_1"` FROM STDIN (format 'result'). This also fixes the issue that Citus doesn't obey `citus.max_adaptive_executor_pool_size` when the query includes an intermediate result.	2020-04-07 17:06:55 +02:00
Onder Kalaci	721daec9a5	Move the logic that initilize connections/local files into a function	2020-04-07 17:06:55 +02:00
Onder Kalaci	9b29a32d7a	Remove all references for side channel connections We don't need any side channel connections. That is actually problematic in the sense that it creates extra connections. Say, citus.max_adaptive_executor_pool_size equals to 1, Citus ends up using one extra connection for the intermediate results. Thus, not obeying citus.max_adaptive_executor_pool_size. In this PR, we remove the following entities from the codebase to allow further commits to implement not requiring extra connection for the intermediate results: - The connection flag REQUIRE_SIDECHANNEL - The function GivePurposeToConnection - The ConnectionPurpose struct and related fields	2020-04-07 17:06:55 +02:00
Hanefi Onaldi	e31dcff178	Merge pull request #3666 from citusdata/size-functions-without-locks Remove metadata locks from size functions	2020-04-07 18:02:39 +03:00
Hanefi Onaldi	1d22d0c2ff	Remove metadata locks from size functions	2020-04-07 17:37:15 +03:00
SaitTalhaNisanci	0430b568be	explicitly return false if transaction connected to local node (#3715 ) * explicitly return false if transaction connected to local node * not set TransactionConnectedToLocalGroup if we are writing to a file We use TransactionConnectedToLocalGroup to prevent local execution from happening as that might cause visibility problems. As files are visible to all transactions, we shouldn't set this variable if we are writing to a file.	2020-04-07 17:30:34 +03:00
Marco Slot	225adbc7ac	Merge pull request #3720 from citusdata/fix/intermediate_result_pruning Simplify and fix issues in intermediate result pruning	2020-04-07 11:20:07 +02:00
Marco Slot	2632343f64	Fix intermediate result pruning for INSERT..SELECT	2020-04-07 11:07:49 +02:00
Marco Slot	84672c3dbd	Simplify intermediate result pruning logic	2020-04-07 10:53:29 +02:00
SaitTalhaNisanci	a710b3cdc5	fix null tupleStoreState case in ExecuteLocalTaskListExtended (#3711 ) In case we don't care about the tupleStoreState in ExecuteLocalTaskListExtended, it could be passed as null. In that case we will get a seg error. This changes it so that a dummy tuple store will be created when it is null. Do not use local execution in ExecuteTaskListOutsideTransaction. As we are going to run the tasks outside transaction, we shouldn't use local execution. However, there is some problem when using local execution related to repartition joins, when we solve that problem, we can execute the tasks coming to this path with local execution. Also logging the local command is simplified. normalize job id in worker_hash_partition_table in test outputs.	2020-04-07 11:47:09 +03:00
SaitTalhaNisanci	a369f9001d	fix incorrect groupid or nodeid (#3710 ) For shardplacements, we were setting nodeid, nodename, nodeport and nodegroup manually. This makes it very error prone, and it seems that we already forgot to set some of them. This would mean that they would have their default values, e.g group id would be 0 when its group id is not 0. So the implication is that we would have inconsistent worker metadata. A new method is introduced, and we call the method to set those fields now, so that as long as we call this method, we won't be setting inconsistent metadata. It probably makes sense to have a struct for these fields. We already have NodeMetadata but it doesn't have nodename or nodeport. So that could be done over another refactor to make things simpler.	2020-04-07 11:14:14 +03:00
Philip Dubé	ec734a643b	Merge pull request #3722 from citusdata/optimistic-duplicate-grouping Duplicate grouping on worker whenever possible	2020-04-06 21:31:23 +00:00
Philip Dubé	4860e11561	Duplicate grouping on worker whenever possible This is possible whenever we aren't pulling up intermediate rows We want to do this because this was done in 9.2, some queries rely on the performance of grouping causing distinct values This change was introduced when implementing window functions on coordinator	2020-04-06 18:51:30 +00:00
Philip Dubé	6a6d5af8a3	Merge pull request #3403 from citusdata/fix-rollback-savepoint-hang Check connections from connection_placement before polling	2020-04-06 18:04:03 +00:00
Philip Dubé	b01bae5937	Check connections from connection_placement before polling	2020-04-06 17:45:44 +00:00

... 5 6 7 8 9 ...

3827 Commits (1a7ccac6efa19d1b6e1c019898d95da3fed6e12a) All Branches Search

3827 Commits (1a7ccac6efa19d1b6e1c019898d95da3fed6e12a)

All Branches