citus

Commit Graph

Author	SHA1	Message	Date
Önder Kalacı	f41e1b1a60	Merge pull request #3923 from citusdata/assert_order Sort WorkerPool in executions	2020-06-22 18:27:54 +02:00
Onder Kalaci	88c473e007	Sort WorkerPool in executions We sort the workerList because adaptive connection management (e.g., OPTIONAL_CONNECTION) requires any concurrent executions to wait for the connections in the same order to prevent any starvation. If we don't sort, we might end up with: Execution 1: Get connection for worker 1, wait for worker 2 Execution 2: Get connection for worker 2, wait for worker 1 and, none could proceed. Instead, we enforce every execution establish the required connections to workers in the same order.	2020-06-22 16:39:27 +02:00
Onur Tirtir	fb46ef1d17	Merge pull request #3930 from citusdata/update-cl-0622 Update CHANGELOG for 9.2.6 & 9.3.2	2020-06-22 16:23:05 +03:00
Onur Tirtir	d41ad47579	Update CHANGELOG for 9.3.2	2020-06-22 14:20:16 +03:00
Onur Tirtir	4a38685744	Update CHANGELOG for 9.2.6	2020-06-22 14:19:56 +03:00
Hanefi Onaldi	ebd8de88d5	Merge pull request #3829 from citusdata/migrations-disallow-c-comment	2020-06-22 13:36:57 +03:00
Hanefi Önaldı	618453a2ba	Disallow C-style comments in migration files	2020-06-22 12:51:16 +03:00
Hanefi Önaldı	56285e6470	Use citus docker hub org	2020-06-22 12:51:16 +03:00
Jelte Fennema	b3ec6fbe7a	Make check_enterprise_merge script stricter (#3918 ) We've had two issues with merge conflicts to enterprise in the last week, that suddenly happened. Because of this CI check this actually blocks all community PRs from being merged. This PR tries to improve on the previous script we had, by putting tougher constraints on when a merge is allowed. Previously the check would pass in two cases: 1. This PR be merged without conflicts into `enterprise-master` 2. A branch exists with the same name as this PR on enterprise and that can be merged into `enterprise-master`. The first case stays the same, but I've changed the second case to require the following instead: 1. A branch exists on enterprise with the same name as this PR 2. NEW: This branch contains the the last commit of the community PR branch 3. This branch can be merged into enterprise-master This makes sure the enterprise branch is actually up to date and not forgotten about. If we still get problems with this change, future improvements could be: 1. Check that the PR on enterprise passes CI 2. Check that the PR on enterprise has been approved 3. Require the enterprise PR branch to be merged before merging community.	2020-06-19 12:45:36 +02:00
SaitTalhaNisanci	3a789352b6	rename citus hammerdb branch prefix as citus_github_push (#3925 ) When we are using hammerdb jobs, the job creates a branch on test automation, since that branch should be deleted, it would have `delete_me` prefix, however since the result branch on release-test-results will have the test automation branch as prefix, it will also have `delete_me` prefix, which seems a bit confusing. This PR updates it as citus_github_push	2020-06-18 21:11:58 +03:00
Onur Tirtir	c61e84c14b	Merge pull request #3921 from citusdata/update-cl-0617 Update CHANGELOG for 9.2.5 & 9.3.1	2020-06-17 19:05:45 +03:00
Onur Tirtir	4640f90933	Update CHANGELOG for 9.3.1	2020-06-17 18:45:54 +03:00
Onur Tirtir	74f20149cd	Update CHANGELOG for 9.2.5	2020-06-17 18:45:54 +03:00
Marco Slot	004e0e4617	Merge pull request #3919 from citusdata/fix/combine-query Rename masterQuery to combineQuery	2020-06-17 16:12:13 +02:00
Marco Slot	2a3234ca26	Rename masterQuery to combineQuery	2020-06-17 14:14:37 +02:00
Jelte Fennema	0259815d3a	Fix EXPLAIN ANALYZE received data counter issues (#3917 ) In #3901 the "Data received from worker(s)" sections were added to EXPLAIN ANALYZE. After merging @pykello posted some review comments. This addresses those comments as well as fixing a other issues that I found while addressing them. The things this does: 1. Fix `EXPLAIN ANALYZE EXECUTE p1` to not increase received data on every execution 2. Fix `EXPLAIN ANALYZE EXECUTE p1(1)` to not return 0 bytes as received data allways. 3. Move `EXPLAIN ANALYZE` specific logic to `multi_explain.c` from `adaptive_executor.c` 4. Change naming of new explain sections to `Tuple data received from node(s)`. Firstly because a task can reference the coordinator too, so "worker(s)" was incorrect. Secondly to indicate that this is tuple data and not all network traffic that was performed. 5. Rename `totalReceivedData` in our codebase to `totalReceivedTupleData` to make it clearer that it's a tuple data counter, not all network traffic. 6. Actually add `binary_protocol` test to `multi_schedule` (woops) 7. Fix a randomly failing test in `local_shard_execution.sql`.	2020-06-17 11:33:38 +02:00
Marco Slot	7bd93c8f2f	Merge pull request #3904 from citusdata/fix/remove-master Remove master terminology from file hierarchy	2020-06-16 18:00:25 +02:00
Marco Slot	d1bab78d79	Remove master from file hierarchy	2020-06-16 17:49:09 +02:00
Jelte Fennema	b71f82b31e	Use 5 second isolation test timeout (#3907 ) Sometimes isolation tests get stuck in CI and we cannot see why, because the job is killed by the CI runner. This will instead fail inside make the testsuite continue, but mark it as a failure like this in the diff output: ```diff +isolationtester: canceling step s2-ddl-create-index-concurrently after 5 seconds step s2-ddl-create-index-concurrently: CREATE INDEX CONCURRENTLY select_append_index ON select_append(id); +ERROR: CONCURRENTLY-enabled index command failed ``` We should detect blockages very quickly and the queries we run are also very fast, so 5 seconds should be more than enough to catch any random slowness. The default from Postgres is 5 minutes, which is waaay to much for us.	2020-06-16 14:57:49 +02:00
Jelte Fennema	799bfdab56	Temporarily disable connection leak tests that fail a lot (#3911 ) MX connection leak failures: 1. https://app.circleci.com/pipelines/github/citusdata/citus/9296/workflows/e36d1088-662a-4f60-acec-293132632c2f/jobs/131908/steps 2. https://app.circleci.com/pipelines/github/citusdata/citus/9258/workflows/37659d82-2c5b-495e-b0e7-905811e30444/jobs/131299 Failure connection leak failures: 1. https://app.circleci.com/pipelines/github/citusdata/citus/9297/workflows/c0ebc326-8c93-468f-8b70-f470bd492fb9/jobs/131920 2. https://app.circleci.com/pipelines/github/citusdata/citus/9283/workflows/9af154d0-ff96-4c5d-ae19-81faae1e0c18/jobs/131668	2020-06-16 13:48:48 +02:00
Philip Dubé	56eb5ee305	Merge pull request #3866 from citusdata/release-cache-entry-deferred Deferred release of metadata cache entries	2020-06-15 16:41:02 +00:00
Philip Dubé	39400319e6	Defer freeing CitusTableCacheEntry, as there were memory safety issues before Shard id to index mapping stored in cache entry as there may now be multiple entries alive for a given relation insert_select_executor: revert copying cache entry, which was a hack added to avoid memory safety issues	2020-06-15 16:20:50 +00:00
Jelte Fennema	927de6d187	Show amount of data received in EXPLAIN ANALYZE (#3901 ) Sadly this does not actually work yet for binary protocol data, because when doing EXPLAIN ANALYZE we send two commands at the same time. This means we cannot use `SendRemoteCommandParams`, and thus cannot use the binary protocol. This can still be useful though when using the text protocol, to find out that a lot of data is being sent.	2020-06-15 16:01:05 +02:00
SaitTalhaNisanci	077c784fe9	Create EnsureTableCanBeCreated for some checks (#3839 )	2020-06-14 14:25:58 +03:00
Hadi Moshayedi	b090dcd530	Merge pull request #3887 from citusdata/local-router-joins Implement local table joins in router planner	2020-06-12 18:45:13 -07:00
Hadi Moshayedi	ef778c1cd7	address feedback from Sait Talha & Hadi	2020-06-12 18:36:02 -07:00
Marco Slot	4f7989ad8e	Rename WorkersContainingAllShards to PlacementsForWorkersContainingAllShards	2020-06-12 18:36:02 -07:00
Marco Slot	080f711e62	Remove useless debug message in router planner	2020-06-12 18:36:02 -07:00
Marco Slot	d953f084db	Rename FindRouterWorkerList to CreateTaskPlacementListForShardIntervals	2020-06-12 18:36:01 -07:00
Marco Slot	24feadc230	Handle joins between local/reference/cte via router planner	2020-06-12 18:36:01 -07:00
Nils Dijk	f57711b3d2	fix test output for tdigest (#3909 ) Due to the problem described in #3908 we don't cover the tdigest integration (and other extensions) on CI. Due to this a bug got in the patch due to a change in `EXPLAIN VERBOSE` being merged concurrently with the tdigest integration. This PR fixes the test output that missed the newly added information.	2020-06-12 20:54:27 +02:00
Halil Ozan Akgül	8c5eb6b7ea	Insert Select Into Local Table (#3870 ) * Insert select with master query * Use relid to set custom_scan_tlist varno * Reviews * Fixes null check Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-06-12 17:06:31 +03:00
Jelte Fennema	0e12d045b1	Support use of binary protocol in between nodes (#3877 ) This can save a lot of data to be sent in some cases, thus improving performance for which inter query bandwidth is the bottleneck. There's some issues with enabling this as default, so that's currently not done.	2020-06-12 15:02:51 +02:00
Nils Dijk	da8f2b0134	Feature: tdigest aggregate (#3897 ) DESCRIPTION: Adds support to partially push down tdigest aggregates tdigest extensions: https://github.com/tvondra/tdigest This PR implements the partial pushdown of tdigest calculations when possible. The extension adds a tdigest type which can be combined into the same structure. There are several aggregate functions that can be used to get; - a quantile - a list of quantiles - the quantile of a hypothetical value - a list of quantiles for a list of hypothetical values These function can work both on values or tdigest types. Since we can create tdigest values either by combining them, or based on a group of values we can rewrite the aggregates in such a way that most of the computation gets delegated to the compute on the shards. This both speeds up the percentile calculations because the values don't have to be sorted while at the same time making the transfer size from the shards to the coordinator significantly less.	2020-06-12 13:50:28 +02:00
Philip Dubé	f69037c192	Merge pull request #3903 from citusdata/remove-misleading-iscitustable-check IsReferenceTable, ShardIntervalCount: remove misleading isCitusTable check	2020-06-11 18:50:36 +00:00
Philip Dubé	8faaaee6a5	IsReferenceTable, ShardIntervalCount: remove misleading isCitusTable check GetCitusTableCacheEntry raises an error if relationId is not distributed	2020-06-11 15:35:02 +00:00
Philip Dubé	f344c1a4bc	Merge pull request #3654 from citusdata/2776_modifying_ctes Modifying ctes in router planner	2020-06-11 15:26:00 +00:00
Philip Dubé	1722d8ac8b	Allow routing modifying CTEs We still recursively plan some cases, eg: - INSERTs - SELECT FOR UPDATE when reference tables in query - Everything must be same single shard & replication model	2020-06-11 15:14:06 +00:00
Hadi Moshayedi	e37c385d6c	Merge pull request #3899 from citusdata/explain_analyze_sort_by_time Include execution duration in worker_last_saved_explain_analyze	2020-06-11 04:05:41 -07:00
Hadi Moshayedi	0e3140c14d	Include execution duration in worker_last_saved_explain_analyze	2020-06-11 02:54:54 -07:00
Hadi Moshayedi	93b79880fe	Merge pull request #3864 from citusdata/explain_analyze_cte CTE statistics in EXPLAIN ANALYZE	2020-06-11 02:49:06 -07:00
Hadi Moshayedi	7c52c6edb0	CTE statistics in EXPLAIN ANALYZE	2020-06-11 02:39:59 -07:00
Hadi Moshayedi	fe8a9c721c	Merge pull request #3891 from citusdata/explain_analyze_worker_query Show query text in EXPLAIN output	2020-06-11 02:38:07 -07:00
Hadi Moshayedi	1f6d6ee4a5	Show query text in EXPLAIN output	2020-06-11 02:19:55 -07:00
Hadi Moshayedi	9a49f10c49	Merge pull request #3890 from citusdata/explain_analyze_exec_once Do EXPLAIN ANALYZE at the same time as execution to avoid executing twice.	2020-06-11 02:17:38 -07:00
Hadi Moshayedi	bb96ef5047	Does the EXPLAIN ANALYZE at the same time as execution, so avoids executing twice. We wrap worker tasks in worker_save_query_explain_analyze() so we can fetch their explain output later by a call worker_last_saved_explain_analyze(). Fixes #3519 Fixes #2347 Fixes #2613 Fixes #621	2020-06-11 01:55:57 -07:00
Hadi Moshayedi	8551affc1e	Merge pull request #3892 from citusdata/explain_execute Test we don't support multi-shard EXPLAIN EXECUTE	2020-06-10 17:19:10 -07:00
Hadi Moshayedi	6ca621bd16	Test we don't support multi-shard EXPLAIN EXECUTE	2020-06-10 17:11:27 -07:00
Jelte Fennema	6f2eb4cdb6	Remove FlattenJoinVars (#3880 ) This code is not needed anymore since #3668 was merged. It's actually causing some issues when using the binary Postgres protocol, because postgres thinks it gets a `bigint` from the worker, but actually gets an normal `int`. The query in question that fails is this: ```sql CREATE TABLE test_table_1(id int, val1 int); CREATE TABLE test_table_2(id int, val1 bigint); SELECT create_distributed_table('test_table_1', 'id'); SELECT create_distributed_table('test_table_2', 'id'); INSERT INTO test_table_1 VALUES(1,1),(2,2),(3,3); INSERT INTO test_table_2 VALUES(1,1),(3,3),(4,5); SELECT val1 FROM test_table_1 LEFT JOIN test_table_2 USING(id, val1) ORDER BY 1; ``` The difference in queries that is sent to the workers after this change is this, for this query: ```diff --- query_old.sql 2020-06-09 09:51:21.460000000 +0200 +++ query_new.sql 2020-06-09 09:51:39.500000000 +0200 @@ -1 +1 @@ -SELECT worker_column_1 AS val1 FROM (SELECT test_table_1.val1 AS worker_column_1 FROM (public.test_table_1_102015 test_table_1(id, val1) LEFT JOIN public.test_table_2_102019 test_table_2(id, val1) USING (id, val1))) worker_subquery +SELECT worker_column_1 AS val1 FROM (SELECT val1 AS worker_column_1 FROM (public.test_table_1_102015 test_table_1(id, val1) LEFT JOIN public.test_table_2_102019 test_table_2(id, val1) USING (id, val1))) worker_subquery ```	2020-06-10 17:24:53 +02:00
Jelte Fennema	f4791fcb10	Remove SwallowErrors by using PathNameDeleteTemporaryDir (#3893 ) This is a different version of #3634. It also removes SwallowErrors, but instead of modifying our own functions to not throw errors, it uses the postgres built in `PathNameDeleteTemporaryDir` function. This function does not throw errors. Since this change is for a bugfix, I tried to minimize the changes. PRs with the following changes would be good to do separately from this PR: 1. Use PathName(Create\|Open\|Delete)Temporary(File\|Dir) to open and remove all files/dirs instead of our own custom file functions. 2. Prefix our outmost files/directories with `PG_TEMP_FILE_PREFIX` so that they are identified by Postgres as temporary files, which will be removed at postmaster start. This way we do not have to do this cleanup ourselves. 3. Store the files in the temporary table space if it exists. Fixes #3634 Fixes #3618	2020-06-10 17:04:07 +02:00

1 2 3 4 5 ...

3751 Commits (change-ci-scripts-a-bit) All Branches Search

3751 Commits (change-ci-scripts-a-bit)

All Branches