citus


Author	SHA1	Message	Date
Matthew Wozniczka	4582a4b398	Fixed a typo	2018-03-27 22:51:36 -06:00
Brian Cloutier	f8f0d4aedc	Add Windows replacement for uname	2018-03-21 20:35:56 -07:00
Brian Cloutier	98ffafe16e	Fix error handling in connection_management	2018-03-21 20:05:00 -07:00
Murat Tuncer	224b0a8c14	Replace poll with select/poll. Windows does not have poll(), so fall back to select().	2018-03-21 20:05:00 -07:00
Metin Doslu	3b7b64a8b6	Remove skip_jsonb_validation_in_copy GUC	2018-03-13 10:33:27 +02:00
Murat Tuncer	1440caeef2	Fix incorrect limit pushdown when distinct clause is not superset of group by (#2035). Pushing down limit and order by into workers may produce wrong output when the distinct on() clause has expressions, aggregates, or window functions. This check allows pushing down limits only if the distinct clause is a superset of the group by clause, i.e., it contains all clauses in the group by.	2018-03-07 13:24:56 +03:00
Metin Doslu	e86d34256c	Change default to false for citus.skip_jsonb_validation_in_copy	2018-03-06 13:19:47 +02:00
Onder Kalaci	40b898b59f	Improve error messages for INSERT queries that have subqueries	2018-03-05 14:46:47 +02:00
Onder Kalaci	7dc9589b56	Handle failures during I/O. This commit checks the connection status right after any I/O happens on the socket. This is necessary because, before this commit, we didn't pass any information to the higher-level functions about whether we're done with the connection (e.g., no I/O required anymore) or an error happened during the I/O.	2018-03-02 08:33:53 +02:00
Onder Kalaci	da0048e0b7	ForgetResults() becomes a wrapper for ClearResults() ClearResults() is able to handle failures properly by checking the result status. So, relying on it makes error handling more generic in Citus.	2018-03-02 08:33:53 +02:00
Murat Tuncer	76f6883d5d	Add support for window functions that can be pushed down to worker (#2008). This is the first part of a series of window function work. We can now support window functions that can be pushed down to workers. A window function must have the distribution column in its partition clause to be pushed down.	2018-03-01 19:07:07 +03:00
Marco Slot	e79db17b91	Update comment in WorkerAggregateExpressionList	2018-02-27 23:48:25 +01:00
Murat Tuncer	e13c5beced	Fix worker query when order by avg aggregate is used (#2024). We push down order by to the worker query when a limit is specified (with some additional checks). If the query has an expression on an aggregate or an avg aggregate by itself, and there is an order by on this particular target, we may send the wrong order by to the worker query, potentially affecting the query result. The fix creates an auxiliary target entry in the worker query and uses that target entry for sorting.	2018-02-28 12:12:54 +03:00
Metin Doslu	bcf660475a	Add support for modifying CTEs	2018-02-27 15:08:32 +02:00
velioglu	78e6d990a2	Fix master plan of the query with distinct, aggregate and group by clauses. Before this PR, we trusted the group by columns to guarantee the uniqueness of the results. However, this assumption is correct only if the columns in the group by are a subset of the columns in the distinct clause. It can be wrong if the distinct clause has only part of the group by columns along with some aggregation columns. With this PR, we add a distinct plan on top of the aggregate plan when necessary.	2018-02-26 15:30:15 +03:00
Onder Kalaci	1c930c96a3	Support non-co-located joins between subqueries. With #1804 (and related PRs), Citus gained the ability to plan subqueries that are not safe to push down. There are two high-level requirements for pushing down subqueries: * Individual subqueries that require a merge step (i.e., GROUP BY on non-distribution key, or LIMIT in the subquery etc). We've handled such subqueries via #1876. * Combination of subqueries that are not joined on distribution keys. This commit aims to recursively plan some of such subqueries to make the whole query safe to push down. The main logic behind non-colocated subquery joins is that we pick an anchor range table entry and check for distribution key equality of any other subqueries in the given query. If, for a given subquery, we cannot find distribution key equality with the anchor RTE, we recursively plan that subquery. We also used a hacky solution for picking relations as the anchor range table entries. The hack is that we wrap them into a subquery. This is only necessary since some of the attribute equivalence checks are based on queries rather than range table entries.	2018-02-26 13:50:37 +02:00
Onder Kalaci	7b57e0562a	Add infrastructure for detecting non-colocated subqueries	2018-02-26 13:28:25 +02:00
Onder Kalaci	4d70c86645	Leaf level recursive planning for non colocated subqueries With this commit, we enable recursive planning for the subqueries that are not joined on the distribution keys.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e998703ff8	Enable restriction eq. checks for top level set operations We used to only support pushdownable set operations inside a subquery, however, we could easily expand the restriction checks to cover top level set operations as well.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e8aa532a90	Refactor checks for distribution key equality Change some function names, ensure we stick to Citus' function order rules etc.	2018-02-26 13:28:24 +02:00
Marco Slot	1e9186a3b5	Do not use new connection in table size functions	2018-02-23 07:07:55 +01:00
Markus Sintonen	6202e80d06	Implemented jsonb_agg, json_agg, jsonb_object_agg, json_object_agg	2018-02-18 00:19:18 +02:00
velioglu	195ac948d2	Recursively plan subqueries in WHERE clause when FROM recurs	2018-02-13 19:52:12 +03:00
Marco Slot	0cba4ab588	Refactor worker node hash initialisation	2018-02-12 23:36:43 +01:00
Marco Slot	40d715d494	Cache worker node array for faster iteration	2018-02-12 23:36:43 +01:00
Marco Slot	6e79a34c97	Do not check for cancellation in ClearResultsIfReady	2018-02-12 16:45:02 +01:00
Marco Slot	6051aae56e	Handle errors that are discovered during abort	2018-02-12 16:45:02 +01:00
Marco Slot	ee6a751798	Only copy distributed plan when modifying it	2018-02-12 16:30:55 +01:00
Onder Kalaci	94c5ac6ebb	Remove duplicate join restrictions. We use PostgreSQL hooks to accumulate the join restrictions and PostgreSQL gives us all the join paths it tries while deciding on the join order. Thus, for queries that have many joins, this function is likely to remove lots of duplicate join restrictions. This becomes relevant for the performance of the Citus query pushdown check.	2018-02-12 18:35:05 +02:00
Onder Kalaci	c228d8ff3d	Refactor equivalence generation related code. This commit changes the APIs for restriction generation to make future changes simpler.	2018-02-12 18:35:04 +02:00
Onder Kalaci	2f2d350924	Refactor relation restriction related code. This commit moves some of the functions to a more relevant source file.	2018-02-12 18:35:04 +02:00
Murat Tuncer	901b543e20	Fix count distinct using field select on top level query. We were allowing count distinct queries even if they were not directly on columns if the query is grouped on the distribution column. When performing these checks we were skipping subqueries because they also perform this check in a more concise manner. We relied on oid SUBQUERY_RELATION_ID (10000) to decide if a given RTE relation id denotes a subquery; however, we also use SUBQUERY_PUSHDOWN_RELATION_ID (10001) for some subqueries. We skip both types of subqueries with this change.	2018-02-06 13:16:10 +03:00
metdos	35f864bcaf	Respect enable_hashagg in the master planner	2018-02-05 15:06:00 +02:00
metdos	3d540d961c	Fix typo in grouping_is_sortable()	2018-02-05 12:10:19 +02:00
Marco Slot	6f7c3bd73b	Skip JSON validation on coordinator during COPY	2018-02-02 15:33:27 +01:00
Brian Cloutier	15511f6ba1	Dynamically allocate connection metadata in WaitForAllConnections	2018-02-01 10:30:41 -08:00
Brian Cloutier	e6ebfc1f53	Remove VLA from UpdateNodeLocation	2018-02-01 10:30:41 -08:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Brian Cloutier	2efe80ce55	CheckForDistributedDeadlocks no longer uses a VLA - variable length arrays (VLAs) do not work with Visual Studio - fix an off-by-one error. We incorrectly assumed there would always be at least as many edges as there were nodes. - refactor: reduce scope of transactionNodeStack by moving it into the function which uses it. - refactor: break up the distinct uses of currentStackDepth into separate variables.	2018-02-01 10:30:41 -08:00
Brian Cloutier	097fd15a89	small refactor, CheckDeadlockForTransactionNode builds its own array	2018-02-01 10:30:41 -08:00
Brian Cloutier	457f570b77	Small refactor, we were using incompatible types	2018-01-31 11:05:59 -08:00
Brian Cloutier	b864d014ab	GetNextNodeId() incorrectly called PG_RETURN_DATUM - Also stabilize the output of a multi_router_planner test	2018-01-29 15:32:36 -08:00
Brian Cloutier	61a6b846b9	Refactor: use a temporary timestamp variable It's against our coding convention to call functions inside parameter lists; when single-stepping with a debugger it's difficult to determine what the function returned. That wouldn't be good enough reason to change this code but while porting Citus to Windows I ran into this line of code. assign_distributed_transaction_id was called with a weird timestamp and I wasn't able to find the problem without first making this change.	2018-01-29 11:20:13 -08:00
Marco Slot	bd0ebac865	Skip call to ActiveReadableNodeList when there are no subplans	2018-01-29 16:05:10 +01:00
Hadi Moshayedi	ff26bcd5a5	Include sys/stat.h for S_IRUSR and S_IWUSR. (#1977)	2018-01-26 16:21:48 -05:00
Brian Cloutier	76d1edc3fd	Don't rely on gcc-specific features (#1963) * Don't use expressions inside compound statements * Don't depend on __builtin_constant_p * Remove reliance on S_ISLNK * Replace use of __func__: older MSVC doesn't support this builtin	2018-01-23 17:03:29 -08:00
Onder Kalaci	fbde87d2d0	Allocate enough space for transaction nodes This fix prevents any potential memory access that might occur while forming the deadlock path.	2018-01-22 08:45:48 +02:00
Onder Kalaci	9a89c0b425	Fix bug while traversing the distributed deadlock graph. With this fix, we traverse the graph with DFS, as originally intended. Before the fix, we traversed the graph with BFS, which might lead to killing some unrelated backend that is not involved in the distributed deadlock.	2018-01-22 08:45:48 +02:00
Dimitri Fontaine	c9760fbb64	Fix CREATE INDEX with storage options on distributed tables. By sharing the implementation of the function AppendOptionListToString on three call sites, we would expand an extra OPTIONS keyword in a create index statement, and omit other bits of the specific syntax here. This patch introduces an AppendStorageParametersToString() function that is very similar to AppendOptionListToString() but handles WITH(a="foo",...) syntax that is used in reloptions (aka Storage Parameters). Fixes #1747.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	952da72c55	Implement ALTER TABLE|INDEX ... SET|RESET (). PostgreSQL implements support for several relation kinds in a single statement, such as in the AlterTableStmt case, which supports both tables and indexes and more (see ATExecSetRelOptions in PostgreSQL source code file src/backend/commands/tablecmds.c for an example of that). As a consequence, this patch implements support for setting and resetting storage parameters on both relation kinds.	2018-01-17 21:56:40 +01:00
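
Commit 9a89c0b425 above concerns traversing the distributed deadlock graph depth-first. As a rough illustration only, the sketch below shows a depth-first search over a wait-for graph that reports a path only when it closes back on the starting transaction. It is not the Citus implementation (which is written in C); the function name find_deadlock_path, the dictionary-based graph, and the integer transaction ids are all assumptions made for this example.

```python
from typing import Dict, List, Optional


def find_deadlock_path(wait_for: Dict[int, List[int]], start: int) -> Optional[List[int]]:
    """Return the transactions on a wait-for cycle through `start`, or None.

    `wait_for` is a hypothetical adjacency map: wait_for[a] lists the
    transactions that transaction `a` is waiting on.
    """
    # Each stack entry carries the node and the path taken from `start` to it,
    # so a reported path contains only transactions that are on the cycle.
    stack = [(start, [start])]
    visited = set()
    while stack:
        node, path = stack.pop()  # LIFO pop makes this depth-first
        for nxt in wait_for.get(node, []):
            if nxt == start:
                return path  # the path closes back on `start`: a cycle
            if nxt not in visited:
                visited.add(nxt)
                stack.append((nxt, path + [nxt]))
    return None


# Transactions 1, 2 and 3 wait on each other; 4 merely waits on 1.
graph = {1: [2], 2: [3], 3: [1], 4: [1]}
cycle_from_1 = find_deadlock_path(graph, 1)
cycle_from_4 = find_deadlock_path(graph, 4)
```

In this sketch, transaction 4 is blocked by the cycle but is not part of it, so no path is reported for it. That is the kind of unrelated backend the commit message says should not be chosen for cancellation.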


894 Commits (4582a4b398182459be7deba13e433bf6975c7487)