citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	331b45348c	Fix error when using LEFT JOIN with GROUP BY on primary key	2020-03-30 16:42:22 +02:00
Jelte Fennema	3be665269f	Reintroduce ForceSearchShardPlacementInList (#3664) This was added to silence static analysis errors. It was removed accidentally in #3591. This reintroduces it again.	2020-03-27 14:28:50 +01:00
Hanefi Onaldi	c0930d157e	Merge pull request #3510 from citusdata/alter-role-set-propagation In PostgreSQL, user defaults for config parameters can be changed by ALTER ROLE .. SET statements. We wish to propagate those defaults across the Citus cluster so that the behavior will be similar in different workers. The defaults can either be set in a specific database, or the whole cluster, similarly they can be set for a single role or all roles. We propagate the ALTER ROLE .. SET if all the conditions below are met: - The query affects the current database, or all databases - The user is already created in worker nodes	2020-03-27 14:04:39 +03:00
Hanefi Onaldi	0e8103b101	Propagate ALTER ROLE .. SET statements In PostgreSQL, user defaults for config parameters can be changed by ALTER ROLE .. SET statements. We wish to propagate those defaults across the Citus cluster so that the behaviour will be similar in different workers. The defaults can either be set in a specific database, or the whole cluster, similarly they can be set for a single role or all roles. We propagate the ALTER ROLE .. SET if all the conditions below are met: - The query affects the current database, or all databases - The user is already created in worker nodes	2020-03-27 13:02:48 +03:00
Philip Dubé	bda1f1d530	Merge pull request #3661 from citusdata/fix/agg_evaluation Fixes a bug that causes some DML queries containing aggregates to fail	2020-03-26 16:14:42 +00:00
Marco Slot	a65ffee266	Fixes a bug that causes some DML queries containing aggregates to fail	2020-03-26 16:08:34 +00:00
SaitTalhaNisanci	d3fdade2e8	add missing perPlacementQueryStrings to copy and out funcs (#3657)	2020-03-26 17:16:29 +03:00
Marco Slot	6bc3895b02	Merge pull request #3651 from citusdata/fix/srf_evaluation Fix a bug which caused queries with SRFs and function evaluation to fail	2020-03-26 14:48:26 +01:00
SaitTalhaNisanci	dd1a456407	store query command list in task (#3649) Sometimes we have concatenated query strings for a task. However, when we want to find each query string, it is not a trivial task. Therefore, it makes sense to store this in task so that when we need each query string we can easily get it.	2020-03-26 12:04:08 +03:00
Philip Dubé	4686133bf2	Merge pull request #3653 from citusdata/fix-grouping-sets-segfault Don't segfault on queries using GROUPING	2020-03-25 17:43:26 +00:00
Philip Dubé	0ad1956551	Merge pull request #3537 from citusdata/master-window-functions Master window functions	2020-03-25 17:27:31 +00:00
Philip Dubé	917cb6ae93	Don't segfault on queries using GROUPING GROUPING will always return 0 outside of GROUPING SETS, CUBE, or ROLLUP Since we don't support those, it makes sense to reject GROUPING in queries	2020-03-25 15:46:43 +00:00
Philip Dubé	720525cfda	Add support for window functions on coordinator Some refactoring: Consolidate expression which decides whether GROUP BY/HAVING are pushed down Rename early pullUpIntermediateRows to hasNonDistributableAggregates Create WorkerColumnName to handle formatting WORKER_COLUMN_FORMAT Ignore NULL StringInfo pointers to SafeToPushdownWindowFunction Fix bug where SubqueryPushdownMultiNodeTree mutates supplied Query, SafeToPushdownWindowFunction requires the original query as it relies on rtable	2020-03-25 15:31:20 +00:00
Jelte Fennema	36ff150465	Update CHANGELOG for v9.2.3 (#3648)	2020-03-25 14:32:55 +01:00
Nils Dijk	4e611cfc25	Refactor dependency resolution and resolve from pg_shdepend (#3633) DESCRIPTION: Refactor dependency resolution and resolve from pg_shdepend This PR refactors how dependencies are resolved by not assuming solely a `pg_depend` record describing the dependency. Instead we keep a definition of the dependency around which records how the dependency is resolved. This can be one of the following ways - `pg_depend`, data will contain a copy of the `pg_depend` record - `pg_shdepend`, data will contain a copy of the `pg_shdepend` record - `ObjectAddress`, data will contain only an `ObjectAddress` describing a dependency Regardless of how the dependency was found, it is always possible to get to the address of the dependency, as that is the most important property. For some checks we can inspect the source where the dependency was found and perform a deep inspection to decide if we want to follow the dependency. This is important to not distribute dependencies coming from extensions for example.	2020-03-25 13:38:25 +01:00
Onur Tirtir	eaaf302795	Merge pull request #3644 from citusdata/refactor/small-typos-etc Move MakeNameListFromRangeVar function and some other small changes	2020-03-25 11:36:50 +03:00
Onur Tirtir	52fd58d51f	move MakeNameListFromRangeVar function to a more appropriate file	2020-03-25 11:01:50 +03:00
Onur Tirtir	2396b66ac5	remove an outdated comment in local executor	2020-03-25 11:01:40 +03:00
Onur Tirtir	8ebb8ef31d	use PG_USED_FOR_ASSERTS_ONLY	2020-03-25 11:01:33 +03:00
Onur Tirtir	81d48d3466	fix some typos	2020-03-25 11:01:26 +03:00
Marco Slot	b89e9dc158	Fix a bug which caused queries with SRFs and function evaluation to fail	2020-03-25 06:55:53 +01:00
Jelte Fennema	149f0b2122	Use Microsoft approved cipher string (#3639) This cipher string is approved by the Microsoft security team and only enables TLSv1.2 ciphers.	2020-03-24 15:51:44 +01:00
Jelte Fennema	2aabe3e2ef	Mark all connections for shutdown when citus.node_conninfo chan… (#3642) We cache connections between nodes in our connection management code. This is good for speed. For security this can be a problem though. If the user changes settings related to TLS encryption they want those to be applied to future queries. This is especially important when they did not have TLS enabled before and now they want to enable it. This can normally be achieved by changing citus.node_conninfo. However, because connections are not reopened there will still be old connections that might not be encrypted at all. This commit changes that by marking all connections to be shut down at the end of their current transaction. This way running transactions will succeed, even if placement requires connections to be reused for this transaction. But after this transaction completes any future statements will use a connection created with the new connection options. If a connection is requested and a connection is found that is marked for shutdown, then we don't return this connection. Instead a new one is created. This is needed to make sure that if there are no running transactions, then the next statement will not use an old cached connection, since connections are only actually shut down at the end of a transaction.	2020-03-24 15:31:41 +01:00
Hadi Moshayedi	b166105f16	Merge pull request #3591 from citusdata/copy_shard_placement Allow master_copy_shard_placement to replicate to new nodes	2020-03-23 08:45:21 -07:00
Hadi Moshayedi	b46b9a68ae	Tests for master_copy_shard_placement	2020-03-23 08:33:55 -07:00
Marco Slot	ede176d849	Implement shard placement copying	2020-03-23 08:33:08 -07:00
Philip Dubé	f77c71a9bd	Merge pull request #3625 from citusdata/avoid-execinitexpr-sublink PartiallyEvaluateExpression: Avoid unrecognized paramkind: 2	2020-03-23 14:25:28 +00:00
Philip Dubé	dd2bd53e5b	PartiallyEvaluateExpression: Avoid unrecognized paramkind: 2	2020-03-23 14:14:01 +00:00
SaitTalhaNisanci	3b7959a763	not run local shard copy test in parallel (#3640) It seems that when logging is enabled we should not run local shard copy in parallel with other tests. The reason is that it adds coordinator for reference tables and if the parallel test creates a schema before this test is run, the schema will be logged. So it is not deterministic.	2020-03-23 14:38:18 +03:00
SaitTalhaNisanci	c5c446f84f	not run local_shard_copy in parallel (#3635)	2020-03-23 13:56:25 +03:00
SaitTalhaNisanci	3df578010e	add a UDF to update colocation (#3623) If two tables have the same distribution column type, we implicitly colocate them. This is useful since colocation has a big performance impact in most applications. When a table is rebalanced, all of the colocated tables are also rebalanced. If table A and table B are colocated and we want to rebalance table A, table B will also be rebalanced. We need replica identity so that logical replication can replicate updates and deletes during rebalancing. If table B does not have a replica identity we error out. A solution to this is to introduce a UDF so that colocation can be updated. The remaining tables in the colocation group will stay colocated. For example, if tables A, B and C are colocated, then after updating table B's colocation, tables A and C stay colocated. The "updating colocation" step does not move any data around, it only updates the pg_dist_partition and pg_dist_colocation tables. Specifically it creates a new colocation group for the table and updates the entry in pg_dist_partition while invalidating any cache.	2020-03-23 13:22:24 +03:00
Önder Kalacı	3e980c81e9	Merge pull request #3631 from citusdata/improve_at_exit Properly terminate connections at the end session	2020-03-20 18:01:16 +01:00
Onder Kalaci	7b4eb9611b	Properly terminate connections at the end session Citus coordinator (or MX nodes) caches `citus.max_cached_conns_per_worker` connections per node. This means that those connections are not terminated after each statement. Instead, they are cached to avoid the cost of re-establishment. This is crucial for OLTP performance. The problem with that approach is that we never properly handle the termination of those cached connections. For instance, when a session on the coordinator disconnects, you'd see the following logs on the workers: ``` 2020-03-20 09:13:39.454 CET [64028] LOG: could not receive data from client: Connection reset by peer ``` With this patch, we're terminating the cached connections properly at the end of the connection.	2020-03-20 17:34:34 +01:00
Jelte Fennema	8deb805338	Ignore safestringlib sourcefiles in coverage (#3632) This is not our code, so we don't care about the coverage our tests generate for it.	2020-03-20 14:26:52 +01:00
Jelte Fennema	56863e8f0b	Really ignore -Wgnu-variable-sized-type-not-at-end (#3627)	2020-03-20 11:53:28 +01:00
Jelte Fennema	ed0376bb41	Unparallelize tests (#3629) We're getting a lot of random failures on CI regarding connection errors. This works around that by not running tests that create lots of connections in parallel.	2020-03-20 10:31:34 +01:00
Jelte Fennema	30ada54f6a	Merge pull request #3626 from citusdata/vendor-new-directory Compile safestringlib using regular configure	2020-03-19 12:36:38 +01:00
Jelte Fennema	a3513c8902	Ignore symlinks and directories editorconfig CI script	2020-03-19 11:53:05 +01:00
Jelte Fennema	605b901637	Update cherry-pick hash in vendor README	2020-03-19 11:53:05 +01:00
Jelte Fennema	dc2a371d9f	Fix compilation issues with safestringlib Based on `92d7a40d1d`	2020-03-19 11:52:20 +01:00
Jelte Fennema	9a79935f1f	Update safestringlib	2020-03-19 11:52:20 +01:00
Jelte Fennema	6db7d87618	Compile safestringlib using regular configure This is needed to automatically generate .bc (bitcode) files when postgres is compiled with llvmjit support. It also has the advantage that cmake is not required for the build anymore.	2020-03-19 11:52:20 +01:00
Nils Dijk	6ff79c5ea9	Revert: Semmle: Protect against theoretical race in recursive d… (#3619) As discussed with @JelteF; #3559 caused consistent errors on BSD (OSX). Given a group of people use this environment to develop on, it is an undesirable change. This reverts commit `ca8f7119fe`.	2020-03-18 13:48:05 +01:00
SaitTalhaNisanci	e5a2bbb2bd	Merge pull request #3557 from citusdata/enh/localExecutionCopy add local copy execution	2020-03-18 09:43:57 +03:00
SaitTalhaNisanci	2eaf7bba69	not use local copy if we are copying into intermediate results file We have special logic to copy into intermediate results and we use a custom format for that, "result" copy format. Postgres internally does not know this format and if we use this locally it will error saying that it does not know this format. Files are visible to all transactions, which means that we can use any connection to access files. In order to use the existing logic, it makes sense that in case we have intermediate results, which means we will write the results to a file, we preserve the same behavior, which is opening connections to localhost. Therefore if we have intermediate results we return false in ShouldExecuteCopyLocally.	2020-03-18 09:35:20 +03:00
SaitTalhaNisanci	9d2f3c392a	enable local execution in INSERT..SELECT and add more tests We can use local copy in INSERT..SELECT, so the check that disables local execution is removed. Also a test for local copy where the data size > LOCAL_COPY_FLUSH_THRESHOLD is added. use local execution with insert..select	2020-03-18 09:34:39 +03:00
SaitTalhaNisanci	42cfc4c0e9	apply review items log shard id in local copy and add more comments	2020-03-18 09:33:55 +03:00
SaitTalhaNisanci	c22068e75a	use the right partition for partitioned tables	2020-03-18 09:28:59 +03:00
SaitTalhaNisanci	1df9601e13	not use local copy if current transaction is connected to local group If current transaction is connected to local group we should not use local copy, because we might not see some of the changes that are made over the connection to the local group.	2020-03-18 09:28:59 +03:00
SaitTalhaNisanci	39bbec0f30	add tests for local copy execution	2020-03-18 09:28:59 +03:00
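
The behaviour described in commit 2aabe3e2ef above (mark cached connections for shutdown when the connection settings change, let running transactions finish on them, and hand out fresh connections to new statements) can be illustrated with a small model. This is only a sketch in Python, not the Citus C implementation; `Connection`, `ConnectionCache` and all method names are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(eq=False)
class Connection:
    """Hypothetical stand-in for a cached node connection."""
    conninfo: str
    in_transaction: bool = False
    marked_for_shutdown: bool = False
    closed: bool = False


class ConnectionCache:
    """Toy model of a connection cache; not the real Citus data structure."""

    def __init__(self, conninfo):
        self.conninfo = conninfo
        self.connections = []

    def set_conninfo(self, conninfo):
        # Changing the settings does not close anything immediately: every
        # cached connection is only marked, so running transactions survive.
        self.conninfo = conninfo
        for conn in self.connections:
            conn.marked_for_shutdown = True

    def get_connection(self):
        # Never hand out a connection that is marked for shutdown (or busy);
        # create a new one with the current settings instead.
        for conn in self.connections:
            if not (conn.closed or conn.marked_for_shutdown or conn.in_transaction):
                return conn
        conn = Connection(self.conninfo)
        self.connections.append(conn)
        return conn

    def end_transaction(self, conn):
        # Marked connections are actually shut down at transaction end.
        conn.in_transaction = False
        if conn.marked_for_shutdown:
            conn.closed = True
            self.connections.remove(conn)
```

In this model a transaction that started before `set_conninfo` keeps its connection until `end_transaction`, while any later `get_connection` call returns a connection built from the new settings.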

3785 Commits (acdecc8fe5d2801e55c56d55bcc72e835965d45c)