citus

Commit Graph

Author	SHA1	Message	Date
Philip Dubé	84fe626378	multi_router_planner: refactor error propagation	2019-06-26 10:32:01 +02:00
Philip Dubé	9ed6dd5570	Ignore compile_commands.json, fix typo	2019-06-26 10:32:01 +02:00
Onder Kalaci	ad93d6feea	Change the order of placement access added to the list This is to make sure that the error messages related to foreign keys to reference tables shows the exact placement access name instead of SELECT.	2019-06-23 11:32:58 +02:00
Nils Dijk	eb98f2d13a	Fix null pointer caused by partial initialization of ConnParamsHashEntry (#2789 ) It has been reported a null pointer dereference could be triggered in FreeConnParamsHashEntryFields. Likely cause is an error in GetConnParams which will leave the cached ConnParamsHashEntry in a state that would cause the null pointer dereference in a subsequent connection establishment to the same server. This has been simulated by inserting ereport(ERROR, ...) at certain places in the code. Not only would ConnParamsHashEntry be in a state that would cause a crash, it was also leaking memory in the ConnectionContext due to the loss of pointers as they are only stored on the ConnParamsHashEntry at the end of the function. This patch rewrites both the GetConnParams to store pointers 'durably' at every point in the code so that an error would not lose the pointer as well as FreeConnParamsHashEntryFields in a way that it can clear half initialised ConnParamsHashEntry's in a safer manner.	2019-06-21 18:16:43 +02:00
Nils Dijk	5df1b49bed	Feature: optionally force master_update_node during failover (#2773 ) When `master_update_node` is called to update a node's location it waits for appropriate locks to become available. This is useful during normal operation as new operations will be blocked till after the metadata update while running operations have time to finish. When `master_update_node` is called after a node failure it is less useful to wait for running operations to finish as they can't. The lock being held indicates an operation that once attempted to commit will fail as the machine already failed. Now the downside is the failover is postponed till the termination point of the operation. This has been observed by users to take a significant amount of time causing the rest of the system to be observed unavailable. With this patch it is possible in such situations to invoke `master_update_node` with 2 optional arguments: - `force` (bool defaults to `false`): When called with true the update of the metadata will be forced to proceed by terminating conflicting backends. A cancel is not enough as the backend might be in idle time (eg. an interactive session, or going back and forth between an appliaction), therefore a more intrusive solution of termination is used here. - `lock_cooldown` (int defaults to `10000`): This is the time in milliseconds before conflicting backends are terminated. This is to allow the backends to finish cleanly before terminating them. This allows the user to set an upperbound to the expected time to complete the metadata update, eg. performing the failover. The functionality is implemented by spawning a background worker that has the task of helping a certain backend in acquiring its locks. The backend is either terminated on successful execution of the metadata update, or once the memory context of the expression gets reset, eg. on a cancel of the statement.	2019-06-21 12:03:15 +02:00
Jason Petersen	d4e1172247	Implement propagation of SET LOCAL commands Adds support for propagation of SET LOCAL commands to all workers involved in a query. For now, SET SESSION (i.e. plain SET) is not supported whatsoever, though this code is intended as somewhat of a base for implementing such support in the future. As SET LOCAL modifications are scoped to the body of a BEGIN/END xact block, queries wishing to use SET LOCAL propagation must be within such a block. In addition, subsequent modifications after e.g. any SAVEPOINT or ROLLBACK statements will correspondingly push or pop variable mod- ifications onto an internal stack such that the behavior of changed values across the cluster will be identical to such behavior on e.g. single-node PostgreSQL (or equivalently, what values are visible to the end user by running SHOW on such variables on the coordinator). If nodes enter the set of participants at some point after SET LOCAL modifications (or SAVEPOINT, ROLLBACK, etc.) have occurred, the SET variable state is eagerly propagated to them upon their entrance (this is identical to, and indeed just augments, the existing logic for the propagation of the SAVEPOINT "stack"). A new GUC (citus.propagate_set_commands) has been added to control this behavior. Though the code suggests the valid settings are 'none', 'local', 'session', and 'all', only 'none' (the default) and 'local' are presently implemented: attempting to use other values will result in an error.	2019-06-20 16:15:43 -07:00
Jason Petersen	1dec6c5163	Change BeginCoordinatedTransaction to internal linkage It's only ever called from a single file, so having it be extern didn't make a whole lot of sense.	2019-06-20 13:44:06 -07:00
Jason Petersen	2349e8e75c	Remove extraneous comments around PG header change	2019-06-20 13:37:53 -07:00
Hadi Moshayedi	4bbae02778	Make COPY compatible with unified executor.	2019-06-20 19:53:40 +02:00
Hadi Moshayedi	2e6d04df7b	Refactor ExecuteModifyTasksSequentially.	2019-06-20 18:38:57 +02:00
Hadi Moshayedi	83f6c7dab4	Fix subxact release crash	2019-06-19 17:43:10 +02:00
Onder Kalaci	2b0c4accda	Apply feedback	2019-06-19 10:03:58 +02:00
Onder Kalaci	3a04374a9e	Refactor relation shard list creation during placement creation This change is to make further refactoring even simpler such as using the executor for shard creation.	2019-06-19 10:03:58 +02:00
Onder Kalaci	4fd1fcbbef	Refactor shard creation logic This is a preperation for the new executor, where creating shards would go through the executor. So, explicitly generate the commands for further processing.	2019-06-19 10:03:58 +02:00
Philip Dubé	4bfcf5b665	Enable Werror for all warnings Changes to ruleutils match changes made upstream to silence gcc fallthrough warnings	2019-06-18 14:43:54 -07:00
Hadi Moshayedi	b240854b8c	Use SendCancelationRequest() in ShutdownConnection()	2019-06-18 12:10:05 +02:00
Philip Dubé	342d423725	Fix join alias resolution FROM (query) alias ignored renaming In nested subqueries the select list would rename, while the join alias would not respect that	2019-06-12 17:25:07 -07:00
Marco Slot	c1ac794b77	enable_statistics_collection defaults to off	2019-06-05 18:43:26 +02:00
Hadi Moshayedi	85325e0098	Refactor ScanStateGetExecutorState into its own function.	2019-06-05 09:16:43 -07:00
Hadi Moshayedi	0b01c59fa6	Refactor ScanStateGetTupleDescriptor() into a function.	2019-06-04 15:19:49 -07:00
Hadi Moshayedi	8e2d328530	Search all outer node levels for lateral join params.	2019-06-04 10:14:05 -07:00
Philip Dubé	b5ced403d8	Also check rewrittenQuery jointree for outer join	2019-06-04 07:47:35 -07:00
Hadi Moshayedi	dee5bc31b4	Refactor ShardIdForTuple() to a separate function.	2019-06-02 09:48:15 -07:00
Marco Slot	bb3a96eacb	Cache a configurable number of connections at xact end	2019-05-29 13:24:31 +02:00
Hadi Moshayedi	23207a43e0	Fix a typo: WITH CARDINALITY -> WITH ORDINALITY	2019-05-24 15:49:17 -07:00
Philip Dubé	b8871d9ff4	Propagate more ALTER FOREIGN TABLE to workers	2019-05-24 12:54:05 -07:00
Marco Slot	b3fcf2a48f	Deprecate master_modify_multiple_shards	2019-05-24 15:22:06 +02:00
Marco Slot	7fa5d36057	Stop using master_modify_multiple_shards in TRUNCATE	2019-05-24 14:35:46 +02:00
exialin	59e54de54d	Minor code clean-up	2019-05-24 14:26:26 +02:00
Hanefi Onaldi	4d737177e6	Remove redundant active placement filters and unneded sort operations If a query is router executable, it hits a single shard and therefore has a single task associated with it. Therefore there is no need to sort the task list that has a single element. Also we already have a list of active shard placements, sending it in param and reuse it.	2019-05-24 14:16:50 +03:00
Philip Dubé	16886b3c63	Fix misc typos	2019-05-23 17:23:27 -07:00
Hadi Moshayedi	8ae47e1244	Fix comments for RemoteFileDestReceiverStartup and CitusCopyDestReceiverStartup	2019-05-21 09:03:22 -07:00
Hadi Moshayedi	dce9260c0e	Fix an include in recusive_planning.c	2019-05-20 18:57:03 -07:00
Murat Tuncer	3fe482adbc	Fix DistShardCacheHash initialization InitializeCaches() method may prematurely set performedInitialization without actually creating DistShardCacheHash. Fix makes sure flag is set only if DistShardCacheHash is created successfully. Also introduced a new memory context to allocate aforementioned hash tables. If allocation/initialization fails for any reason we make sure memory is reclaimed by deleting the memory context.	2019-05-15 16:47:44 +03:00
Hanefi Onaldi	4030d603eb	Merge pull request #2691 from citusdata/update_changelog Add 8.1.2 and 8.2.1 changelog entries	2019-05-15 09:18:58 +03:00
Onder Kalaci	495b6e9b62	Refactor Parallel Relation Access Recording Instead of scattering the code around, we move all the logic into a single function. This will help supporting foreign keys to reference tables in the unified executor with a single line of change, just calling this function.	2019-05-02 18:12:33 +03:00
Hadi Moshayedi	32ecb6884c	Test ROLLBACK TO SAVEPOINT with multi-shard CTE failures	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	aafd22dffa	Fix savepoint rollback for INSERT INTO ... SELECT.	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	b69a762e0b	Fix savepoint rollback after multi-shard update failure.	2019-05-01 09:33:43 -07:00
Jason Petersen	71d5d1c865	Enable variable shadowing warnings; fix all Rather than wait for another place like the previous commit to bite us, I think we should turn on this warning.	2019-04-30 13:24:25 -06:00
Jason Petersen	1125fc9da0	Fix self-strncmp in ConstrIsFKToReferenceTable Make the function do what I assume was intended.	2019-04-30 13:24:25 -06:00
Hadi Moshayedi	c9b1d9c2d1	Check all placements aren't inactive	2019-04-26 10:04:55 -07:00
Hadi Moshayedi	7b1d03772d	Don't schedule tasks on inactive nodes.	2019-04-26 10:04:54 -07:00
Onder Kalaci	004f28e18c	Sort output of RETURNING The feature is only intended for getting consistent outputs for the regression tests. RETURNING does not have any ordering gurantees and with unified executor, the ordering of query executions on the shards are also becoming unpredictable. Thus, we're enforcing ordering when a GUC is set. We implicitly add an `ORDER BY` something equivalent of ` RETURNING expr1, expr2, .. ,exprN ORDER BY expr1, expr2, .. ,exprN ` As described in the code comments as well, this is probably not the most performant approach we could implement. However, since we're only targeting regression tests, I don't see any issues with that. If we decide to expand this to a feature to users, we should revisit the implementation and improve the performance.	2019-04-24 11:51:19 +03:00
Jason Petersen	4b9519e7d6	Check for non-extended constraint before extending This will only apply to DROP and VALIDATE commands; see the lengthy comment in multi_create_table_constraints.sql for more explanation.	2019-04-15 23:14:21 -06:00
Onder Kalaci	7d872a343a	Rename MultiConnectionState to MultiConnectionPollState	2019-04-05 11:50:11 +03:00
Onder Kalaci	fb38dc3136	Ensure that stack resizing logic works expected This commit has two goals: (a) Ensure to access both edges of the allocated stack (b) Ensure that any compiler optimizations to prevent the function optimized away. Stack size after the patch: sudo grep -A 1 stack /proc/2119/smaps 7ffe305a6000-7ffe307a9000 rw-p 00000000 00:00 0 [stack] Size: 2060 kB Stack size before the patch: sudo grep -A 1 stack /proc/3610/smaps 7fff09957000-7fff09978000 rw-p 00000000 00:00 0 [stack] Size: 132 kB	2019-04-03 10:58:19 +03:00
Murat Tuncer	1424f75ec9	Support columns referencing an aliased joins We used to rely on PG function flatten_join_alias_vars to resolve actual columns referenced in target entry list. The function goes deep and finds the actual relation. This logic usually works fine. However, when joins are given an alias, inner relation names are not visible to target entry entry. Thus relation resolving should stop when we the target entry column refers an rte of an aliased join. We stopped using PG function and provided our own flatten function.	2019-03-26 09:46:22 +03:00
Jason Petersen	4c7f78bd7e	Code review feedback	2019-03-25 22:07:27 -05:00
Jason Petersen	6a0dc7756e	Formatting fixes Noticed a lot of weird lines wrapped at 80; our standard is 90.	2019-03-22 20:32:19 -06:00
Jason Petersen	6acf52660c	Always coerce RHS of pruning op to part. key type Our assumption that strip_implicit_coercions would leave us with a bi- nary-compatible type to that of the partition key was wrong. Instead, we should ensure the RHS of the comparison we perform is proactively coerced into a compatible type (at least binary compatible).	2019-03-22 20:32:19 -06:00
Jason Petersen	5baa257c91	Add second assert to guard against future changes This isn't entirely necessary but I feel safer with it here.	2019-03-22 20:32:19 -06:00
Jason Petersen	69adb627c3	Add Assert that will crash before coercion fix is in	2019-03-22 20:32:19 -06:00
Nils Dijk	feaac69769	Implementation for asycn FinishConnectionListEstablishment (#2584 )	2019-03-22 17:30:42 +01:00
Marco Slot	e3b7e74f43	Allow rescan in DECLARE .. WITH HOLD	2019-03-22 11:25:55 +01:00
Jason Petersen	a2c6f596f9	Address code review comments	2019-03-21 11:59:52 -06:00
Jason Petersen	04aa34da68	Invalidate ConnParamsHash at config reload At configuration reload, we free all "global" (i.e. GUC-set) connection parameters, but these may still have live references in the connection parameters hash. By marking the entries as invalid, we can ensure they will not be used after free.	2019-03-21 00:03:35 -06:00
Jason Petersen	00d836e5a3	alloc non-global conn. params in provided context Having DATA-segment string literals made blindly freeing the keywords/ values difficult, so I've switched to allocating all in the provided context; because of this (and with the knowledge of the end point of the global parameters), we can safely pfree non-global parameters when we come across an invalid connection parameter entry.	2019-03-21 00:03:35 -06:00
Marco Slot	e8152d9b6d	Only look in top-level rtable in ExtractFirstDistributedTableId	2019-03-20 12:14:46 +03:00
Marco Slot	ee6a0b6943	Speed up RTE walkers Do it in two ways (a) re-use the rte list as much as possible instead of re-calculating over and over again (b) Limit the recursion to the relevant parts of the query tree	2019-03-20 12:14:46 +03:00
Marco Slot	5ff1821411	Cache the current database name Purely for performance reasons.	2019-03-20 12:14:46 +03:00
Marco Slot	0ea4e52df5	Add nodeId to shardPlacements and use it for shard placement comparisons Before this commit, shardPlacements were identified with shardId, nodeName and nodeport. Instead of using nodeName and nodePort, we now use nodeId since it apparently has performance benefits in several places in the code.	2019-03-20 12:14:46 +03:00
Onder Kalaci	ad5ff1d01a	Some queries lead to infinite recursion with recurisve planning The rule for infinite recursion is the following: - If the query contains a subquery which is recursively planned, and no other subqueries can be recursively planned due to correlation (e.g., LATERAL joins), the planner keeps recursing again and again. One interesting thing here is that even if a subquery contains only intermediate result(s), we re-recursively plan that. In the end, the logic in the code does the following: - Try recursive planning any of the subqueries in the query tree - If any subquery is recursively planned, call the planner again where the subquery is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. ......	2019-03-18 10:35:00 +03:00
Marco Slot	f2abf2b8e5	Functions are treated as transaction blocks	2019-03-15 16:34:08 -06:00
Marco Slot	4b9bd54ae0	Remove create_insert_proxy_for_table	2019-03-15 14:13:03 -06:00
exialin	84b853e1b5	Fix some typos (#2620 )	2019-03-14 16:48:31 -07:00
Hadi Moshayedi	a9e6d06a98	Skip execution of ALTER TABLE constraint checks on the coordinator	2019-03-14 15:40:56 -07:00
Hadi Moshayedi	cdd3b15ac8	Fix distributed deadlock for ALTER TABLE ... ATTACH PARTITION. Following scenario resulted in distributed deadlock before this commit: CREATE TABLE partitioning_test(id int, time date) PARTITION BY RANGE (time); CREATE TABLE partitioning_test_2009 (LIKE partitioning_test); CREATE TABLE partitioning_test_reference(id int PRIMARY KEY, subid int); SELECT create_distributed_table('partitioning_test_2009', 'id'), create_distributed_table('partitioning_test', 'id'), create_reference_table('partitioning_test_reference'); ALTER TABLE partitioning_test ADD CONSTRAINT partitioning_reference_fkey FOREIGN KEY (id) REFERENCES partitioning_test_reference(id) ON DELETE CASCADE; ALTER TABLE partitioning_test_2009 ADD CONSTRAINT partitioning_reference_fkey_2009 FOREIGN KEY (id) REFERENCES partitioning_test_reference(id) ON DELETE CASCADE; ALTER TABLE partitioning_test ATTACH PARTITION partitioning_test_2009 FOR VALUES FROM ('2009-01-01') TO ('2010-01-01');	2019-03-14 15:28:37 -07:00
Hadi Moshayedi	f19feb742c	Remove never assigned colocatedRelation from CreateDistributedTable (#2479 )	2019-03-12 14:50:18 -07:00
Murat Tuncer	2681231c98	Create column aliases for shard tables in worker queries when requested	2019-03-07 12:54:42 +03:00
Hadi Moshayedi	f4d3b94e22	Fix some of the casts for groupId (#2609 ) A small change which partially addresses #2608.	2019-03-05 12:06:44 -08:00
velioglu	faf50849d7	Enhance pushdown planning logic to handle full outer joins with using clause Since flattening query may flatten outer joins' columns into coalesce expr that is in the USING part, and that was not expected before this commit, these queries were erroring out. It is fixed by this commit with considering coalesce expression as well.	2019-03-05 11:49:30 +03:00
Onder Kalaci	26f569abd8	Make sure to clear PGresult on few places This leads to a memory leak otherwise.	2019-02-28 13:44:34 +03:00
Jason Petersen	3df2f51881	Turn on style-checking, fix lingering violations We'd been ignoring updating uncrustify for some time now because I'd thought these were misclassifications that would require an update in our rules to address. Turns out they're legit, so I'm checking them in.	2019-02-26 23:01:40 -07:00
Onder Kalaci	f706772b2f	Round-robin task assignment policy relies on local transaction id Before this commit, round-robin task assignment policy was relying on the taskId. Thus, even inside a transaction, the tasks were assigned to different nodes. This was especially problematic while reading from reference tables within transaction blocks. Because, we had to expand the distributed transaction to many nodes that are not necessarily already in the distributed transaction.	2019-02-22 19:26:38 +03:00
Onder Kalaci	e521e7e39c	Apply feedback	2019-02-22 18:14:30 +03:00
Onder Kalaci	407d0e30f5	Fix selectForUpdate bug	2019-02-21 18:21:41 +03:00
Onder Kalaci	f144bb4911	Introduce fast path router planning In this context, we define "Fast Path Planning for SELECT" as trivial queries where Citus can skip relying on the standard_planner() and handle all the planning. For router planner, standard_planner() is mostly important to generate the necessary restriction information. Later, the restriction information generated by the standard_planner is used to decide whether all the shards that a distributed query touches reside on a single worker node. However, standard_planner() does a lot of extra things such as cost estimation and execution path generations which are completely unnecessary in the context of distributed planning. There are certain types of queries where Citus could skip relying on standard_planner() to generate the restriction information. For queries in the following format, Citus does not need any information that the standard_planner() generates: SELECT ... FROM single_table WHERE distribution_key = X; or DELETE FROM single_table WHERE distribution_key = X; or UPDATE single_table SET value_1 = value_2 + 1 WHERE distribution_key = X; Note that the queries might not be as simple as the above such that GROUP BY, WINDOW FUNCIONS, ORDER BY or HAVING etc. are all acceptable. The only rule is that the query is on a single distributed (or reference) table and there is a "distribution_key = X;" in the WHERE clause. With that, we could use to decide the shard that a distributed query touches reside on a worker node.	2019-02-21 13:27:01 +03:00
Nils Dijk	1623c44fc7	Simplify make file for citus sql files	2019-02-19 21:29:20 -05:00
Hanefi Onaldi	148dcad0bb	More documentation and stale comments rewritten	2019-02-04 20:21:51 +03:00
Hanefi Onaldi	825666f912	Query samples in docs and better errors	2019-02-04 19:20:02 +03:00
Hanefi Onaldi	574b071113	Add wrapper function introduced in PG11 for compatibility	2019-02-04 19:20:02 +03:00
Hanefi Onaldi	1106e14385	Wrap functions in subqueries remove debug logs to fix travis tests Support RowType functions in joins Regression tests for a custom type function in join	2019-02-04 19:19:29 +03:00
Murat Tuncer	b36b59dd4f	Relax reference table restrictions in subquery union pushdowns We used to error out if there is a reference table in the query participating a union. This has caused pushdownable queries to be evaluated in coordinator. Now we let reference tables inside union queries as long as there is a distributed table in from clause. Existing join checks (reference table on the outer part) sufficient enought that we do not need check the join relation of reference tables.	2019-01-31 15:34:29 +03:00
Onder Kalaci	ec67381ba2	Queries with only intermediate results do not rely on task assignment policy Previously we allowed task assignment policy to have affect on router queries with only intermediate results. However, that is erroneous since the code-path that assigns placements relies on shardIds and placements, which doesn't exists for intermediate results. With this commit, we do not apply task assignment policies when a router query hits only intermediate results.	2019-01-28 17:59:17 +03:00
Murat Tuncer	cd5213abee	Set sequential mode execution GUC for alter partitioned table PG recently started propagating foreign key constraints to partition tables. This came with a select query to validate the the constaint. We are already setting sequential mode execution for this command. In order for validation select query to respect this setting we need to explicitly set the GUC. This commit also handles detach partition part.	2019-01-25 15:28:07 +03:00
velioglu	1bb0ec316a	Reset planner restriction context instead of popping with recursive planning	2019-01-17 14:35:16 +03:00
Jason Petersen	339e6e661e	Remove 9.6 (#2554 ) Removes support and code for PostgreSQL 9.6 cr: @velioglu	2019-01-16 13:11:24 -07:00
Marco Slot	1656b519c4	Plan outer joins through pushdown planning	2019-01-05 20:55:27 +01:00
Murat Tuncer	b389bebda1	Move repeated code to a function	2019-01-03 17:19:01 +03:00
Murat Tuncer	2ed7d24591	Fix having clause bug for complex joins We update column attributes of various clauses for a query inluding target columns, select clauses when we introduce new range table entries in the query. It seems having clause column attributes were not updated. This fix resolves the issue	2019-01-03 17:07:26 +03:00
Murat Tuncer	ec36030fae	Move functions calls that can fail to outside of spinlock We had recently fixed a spinlock issue due to functions failing, but spinlock is not being released. This is the continuation of that work to eliminate possible regression of the issue. Function calls that are moved out of spinlock scope are macros and plain type casting. However, depending on the configuration they have an alternate implementation in PG source that performs memory allocation. This commit moves last bit of codes to out of spinlock for completion purposes.	2019-01-03 15:59:56 +03:00
Murat Tuncer	3b95a03c3e	Merge branch 'master' into fix_spinlock_use	2018-12-25 14:41:21 +03:00
Hadi Moshayedi	38579d52d0	Speed-up run_command_on_shards(). (#2564 ) We were establishing connections synchronously. Establishing connections asynchronously results in some parallelization, saving hundreds of milliseconds. In a test I did, this decreased the query time from 150ms to 40ms.	2018-12-24 08:47:01 -05:00
Murat Tuncer	9671bc3cbb	Make sure spinlock is not left unreleased when an exception is thrown A spinlock is not released when an exception is thrown after spinlock is acquired. This has caused infinite wait and eventual crash in maintenance daemon. This work moves the code than can fail to the outside of spinlock scope so that in the case of failure spinlock is not left locked since it was not locked in the first place.	2018-12-24 15:47:21 +03:00
Hanefi Onaldi	fb497ddad1	Bump 8.2devel on master (#2567 )	2018-12-24 13:49:50 +03:00
Onder Kalaci	9fff7d28a7	Revert `4925521`	2018-12-21 15:36:40 -07:00
Marco Slot	1b1c6374f7	Execute CREATE INDEX CONCURRENTLY concurrently	2018-12-21 14:02:59 -07:00
Marco Slot	3ff2b47366	Restrict visibility of get_*_active_transactions functions to pg_monitor	2018-12-19 18:32:42 +01:00
Dimitri Fontaine	6a1a2b8458	Move an assert-only array-bound check to run-time. When the bound-check fails at run-time, better abort with an error message rather than trying to user memory we did not allocate.	2018-12-19 06:12:05 +01:00
Marco Slot	5b9376a7f8	Check ownership before taking locks in distributed table creation	2018-12-18 15:32:07 +01:00
Nils Dijk	694992e946	upgrade default ssl_ciphers to more restrictive on extension creation Show ssl_ciphers in ssl_by_default_test	2018-12-12 15:33:15 +01:00
Jason Petersen	92893e9601	Fix control file version	2018-12-11 18:50:20 -07:00
Jason Petersen	bd0d1f05e7	Bump SQL version Should have been done when the release-8.0 branch was created…	2018-12-11 10:40:15 -07:00
velioglu	90704d9a52	Fix getting function oid to get hll_add_agg id	2018-12-10 14:16:19 +03:00
velioglu	3e0cff94a6	Add FunctionOidExtended function	2018-12-10 11:59:41 +03:00
Nils Dijk	4af40eee76	Enable SSL by default during installation of citus	2018-12-07 11:23:19 -07:00
velioglu	8764a19464	Adds support for disabling hash agg with hll functions on coordinator query	2018-12-07 18:49:25 +03:00
Marco Slot	9cf91c438b	Only allow transmit from pgsql_job_cache directory	2018-12-05 10:18:27 +01:00
Marco Slot	70fb9c851b	Remove odd memcpy usag in BuildCachedShardList	2018-12-04 14:09:10 +01:00
Marco Slot	0388324fbe	Expand planner readme	2018-12-04 09:55:19 +01:00
Dimitri Fontaine	d1b182de7d	Replace calls to unsafe functions like memcpy and sscanf In answer to a security audit, we double check buffer sizes and avoid known-dangerous operations such as sscanf.	2018-12-04 08:54:43 +01:00
Onder Kalaci	621ccf3946	Ensure to use initialized MaxBackends Postgresql loads shared libraries before calculating MaxBackends. However, Citus relies on MaxBackends being set. Thus, with this commit we use the same steps to calculate MaxBackends while Citus is being loaded (e.g., PG_Init is called). Note that this is safe since all the elements that are used to calculate MaxBackends are PGC_POSTMASTER gucs and a constant value.	2018-12-03 13:25:51 +03:00
Onder Kalaci	b6ebd791a6	Sort task list for multi-task explain outputs This is purely for ensuring that regression tests do not randomly fail.	2018-11-30 11:19:37 -07:00
Marco Slot	8893cc141d	Support INSERT...SELECT with ON CONFLICT or RETURNING via coordinator Before this commit, Citus supported INSERT...SELECT queries with ON CONFLICT or RETURNING clauses only for pushdownable ones, since queries supported via coordinator were utilizing COPY infrastructure of PG to send selected tuples to the target worker nodes. After this PR, INSERT...SELECT queries with ON CONFLICT or RETURNING clauses will be performed in two phases via coordinator. In the first phase selected tuples will be saved to the intermediate table which is colocated with target table of the INSERT...SELECT query. Note that, a utility function to save results to the colocated intermediate result also implemented as a part of this commit. In the second phase, INSERT.. SELECT query is directly run on the worker node using the intermediate table as the source table.	2018-11-30 15:29:12 +03:00
Hanefi Onaldi	088a2ef66a	throw an error when a subquery has grouping set clause	2018-11-30 13:11:32 +03:00
Nils Dijk	9309e63156	create_distributed_table as user, change table ownership during create	2018-11-29 14:20:42 +01:00
Nils Dijk	6aa191f72c	remove table_ddl_command_array and test master_get_table_ddl_events	2018-11-29 14:20:42 +01:00
Murat Tuncer	fd868ec268	Fix citus_stat_statements view Join between pg_stat_statements and citus_query_stats should include queryid, dbid, userid instead of just queryid.	2018-11-29 14:49:16 +03:00
Dimitri Fontaine	5ae2d03881	Refrain from having a strong opinion on maxGroupId. When initializing a Citus formation automatically from an external piece of software such as Citus-HA, the following process process may be used: - decide on the groupId in the external software - SELECT * FROM master_add_inactive_node('localhost', 9701, groupid => X) When Citus checks for maxGroupId, it forbids other software to pick their own group Ids to ues with the master_add_inactive_node() API. This patch removes the extra testing around maxGroupId.	2018-11-28 04:29:15 +01:00
Marco Slot	aff37cf1bc	Control multi-shard modify locks with enable_deadlock_prevention	2018-11-28 02:59:50 +01:00
Marco Slot	1ec5b6c890	Remove old worker_hash_partition_table API	2018-11-26 14:40:37 +01:00
Marco Slot	5a63deab2e	Clean up UDFs and remove unnecessary permissions	2018-11-26 14:40:37 +01:00
Hanefi Onaldi	7db6991dc0	propagate validate queries to workers	2018-11-26 14:04:51 +03:00
Marco Slot	e8e956aa9f	Require superuser when using non-existent job schema in worker_merge_files_into_table	2018-11-24 02:57:16 +01:00
Marco Slot	c4ad899dd8	Check schema ownership in worker_merge_* functions	2018-11-23 11:05:09 +01:00
Marco Slot	e9a7295ead	Add multi-user tests for task-tracker protocol functions	2018-11-23 11:05:09 +01:00
Marco Slot	8e93fe5870	Check schema owner in task_tracker_assign_task	2018-11-23 11:05:09 +01:00
Marco Slot	ec957a833a	Check permission in task_tracker_task_status	2018-11-23 11:04:58 +01:00
Marco Slot	6aa5592e52	Add user ID suffix to intermediate files in re-partition jobs	2018-11-23 08:36:11 +01:00
Marco Slot	a59bf31c76	Use worker_execute_sql_task UDF in task-tracker executor	2018-11-22 18:15:33 +01:00
Marco Slot	30bad7e66f	Add worker_execute_sql_task UDF	2018-11-22 18:15:33 +01:00
Marco Slot	caf402d506	COPY to a task file no longer switches to superuser	2018-11-22 18:15:33 +01:00
Marco Slot	e17025e1d4	Check table ownership in mark_tables_colocated	2018-11-18 00:11:38 +01:00
Marco Slot	18acd00553	Check permissions in lock_relation_if_exists	2018-11-18 00:11:38 +01:00
Marco Slot	aab9f623eb	Check table ownership in upgrade_to_reference_table	2018-11-16 23:27:34 +01:00
Onder Kalaci	052ba21b19	Make sure to prevent unauthorized users to drop sequences in Citus MX	2018-11-15 18:08:04 +03:00
Onder Kalaci	7f0a57a153	Make sure to prevent unauthorized users to drop tables in Citus MX	2018-11-15 18:07:03 +03:00
Nils Dijk	f9520be011	Round robin queries to reference tables with task_assignment_policy set to `round-robin` (#2472 ) Description: Support round-robin `task_assignment_policy` for queries to reference tables. This PR allows users to query multiple placements of shards in a round robin fashion. When `citus.task_assignment_policy` is set to `'round-robin'` the planner will use a round robin scheduling feature when multiple shard placements are available. The primary use-case is spreading the load of reference table queries to all the nodes in the cluster instead of hammering only the first placement of the reference table. Since reference tables share the same path for selecting the shards with single shard queries that have multiple placements (`citus.shard_replication_factor > 1`) this setting also allows users to spread the query load on these shards. For modifying queries we do not apply a round-robin strategy. This would be negated by an extra reordering step in the executor for such queries where a `first-replica` strategy is enforced.	2018-11-15 15:11:15 +01:00
Marco Slot	2de8ef29c3	Revoke function permissions for node metadata functions	2018-11-15 06:25:07 +01:00
Marco Slot	f383e4f307	Description: Refactor code that handles DDL commands from one file into a module The file handling the utility functions (DDL) for citus organically grew over time and became unreasonably large. This refactor takes that file and refactored the functionality into separate files per command. Initially modeled after the directory and file layout that can be found in postgres. Although the size of the change is quite big there are barely any code changes. Only one two functions have been added for readability purposes: - PostProcessIndexStmt which is extracted from PostProcessUtility - PostProcessAlterTableStmt which is extracted from multi_ProcessUtility A README.md has been added to `src/backend/distributed/commands` describing the contents of the module and every file in the module. We need more documentation around the overloading of the COPY command, for now the boilerplate has been added for people with better knowledge to fill out.	2018-11-14 13:36:27 +01:00
Burak Yucesoy	f8e0d37ba1	Fix crashes caused by stack size increase under high memory load Each PostgreSQL backend starts with a predefined amount of stack and this stack size can be increased if there is a need. However, stack size increase during high memory load may cause unexpected crashes, because if there is not enough memory for stack size increase, there is nothing to do for process apart from crashing. An interesting thing is; the process would get OOM error instead of crash, if the process had an explicit memory request (with palloc) for example. However, in the case of stack size increase, there is no system call to get OOM error, so the process simply crashes. With this change, we are increasing the stack size explicitly by requesting extra memory from the stack, so that, even if there is not memory, we can at least get an OOM instead of a crash.	2018-11-14 01:27:53 +03:00
Murat Tuncer	cc401a2616	Create function_utils for pg function call related utilities	2018-11-07 15:29:38 +03:00
Hadi Moshayedi	d3e284dcd6	Use heap_deform_tuple() instead of calling heap_getattr(). (#2464 ) After Fast ALTER TABLE ADD COLUMN with a non-NULL default in PG11, physical heaps might not contain all attributes after a ALTER TABLE ADD COLUMN happens. heap_getattr() returns NULL when the physical tuple doesn't contain an attribute. So we should use heap_deform_tuple() in these cases, which fills in the missing attributes. Our catalog tables evolve over time, and an upgrade might involve some ALTER TABLE ADD COLUMN commands. Note that we don't need to worry about postgres catalog tables and we can use heap_getattr() for them, because they only change between major versions. This also fixes #2453.	2018-11-05 15:11:01 -05:00
Onder Kalaci	9e2e2a7300	Make sure to access PARAM_EXTERN accurately in PG 11 PG 11 has change the way that PARAM_EXTERN is processed. This commit ensures that Citus follows the same pattern. For details see the related Postgres commit: `6719b238e8`	2018-10-25 21:55:03 +03:00
Onder Kalaci	6e05921736	Processes that are blocked on advisory locks show up in wait edges Assign the distributed transaction id before trying to acquire the executor advisory locks. This is useful to show this backend in citus lock graphs (e.g., dump_global_wait_edges() and citus_lock_waits).	2018-10-24 13:32:13 +03:00
Hadi Moshayedi	3e00bf1c0d	Don't throw error for DROP DATABASE IF EXISTS	2018-10-23 09:45:03 -04:00
Jason Petersen	ae9a98c2d1	Attempt to address planner context crashes Both of these are a bit of a shot in the dark. In one case, we noticed a stack trace where a caller received a null pointer and attempted to dereference the memory context field (at 0x010). In the other, I saw that any error thrown from within AdjustParseTree could keep the stack from being cleaned up (presumably if we push we should always pop). Both stack traces were collected during times of high memory pressure and locally reproducing the problem locally or otherwise has been very tricky (i.e. it hasn't been reproduced reliably at all).	2018-10-18 08:41:51 -06:00
Hadi Moshayedi	431ac80563	Keep track of cached entries in case of interruption. (#2433 ) * Keep track of cached entries in case of interruption. Previously we set DistTableCacheEntry->sortedShardIntervalArray and DistTableCacheEntry->shardIntervalArrayLength after we entered all related shard entries into DistShardCacheHash. The drawback was that if populating DistShardCacheHash was interrupted, ResetDistTableCacheEntry() didn't see the shard hash entries created, so was unable to clean them up. This patch fixes that by setting sortedShardIntervalArray earlier, and incrementing shardIntervalArrayLength as we enter shards into the cache.	2018-10-15 14:06:56 -04:00
Jason Petersen	9fb951c312	Fix user-facing typos Lintian found these (presumably by looking in the text section and running them through e.g. aspell).	2018-10-09 16:54:03 -07:00
Onder Kalaci	73696a03e4	Make sure not to leak intermediate result folders on the workers	2018-10-09 22:47:56 +03:00
Marco Slot	d56baefe3d	Allow simple DML commands from hot standby	2018-10-06 10:54:44 +02:00
Murat Tuncer	4f8042085c	Fix drop schema in mx with partitioned tables Drop schema command fails in mx mode if there is a partitioned table with active partitions. This is due to fact that sql drop trigger receives all the dropped objects including partitions. When we call drop table on parent partition, it also drops the partitions on the mx node. This causes the drop table command on partitions to fail on mx node because they are already dropped when the partition parent was dropped. With this work we did not require the table to exist on worker_drop_distributed_table.	2018-10-08 17:01:54 -07:00
velioglu	512d23934f	Show router modify,select and real-time queries on MX views	2018-10-02 13:59:38 +03:00
Murat Tuncer	9bdef67bab	Do not create inherited constraints on worker shards PG now allows foreign keys on partitioned tables. Each foreign key constraint on partitioned table is propagated down to partitions. We used to create all constraints on shards when we are creating a new shard, or when just simply moving a shard from one worker to another. We also used the same logic when creating a copy of coordinator table in mx node. With this change we create the constraint on worker node only if it is not an inherited constraint.	2018-09-28 14:14:51 +03:00
Murat Tuncer	653c7e4ae0	Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 11:13:21 +03:00
Onder Kalaci	cdc0d1491c	Make sure to use correct execution mode for TRUNCATE We used to set the execution mode in the truncate trigger. However, when multiple tables are truncated with a single command, we could set the execution mode very late. Instead, now set the execution mode on the utility hook.	2018-09-25 15:35:27 +03:00
Marco Slot	1ca9a5b867	Do not allow unresolved parameters in INSERT...SELECT	2018-09-24 14:12:04 +02:00
Marco Slot	877d703ac5	Evaluate functions (and when applicable, parameters) anywhere in query	2018-09-21 12:57:50 -06:00
Onder Kalaci	abc443d7fa	Make sure that shard repair considers replication factor	2018-09-21 15:24:49 +03:00
Onder Kalaci	8520a5b432	worker_append_table_to_shard becomes aware of partitioned tables	2018-09-21 14:40:42 +03:00
Onder Kalaci	c1b5a04f6e	Allow partitioned tables with replication factor > 1 With this commit, we all partitioned distributed tables with replication factor > 1. However, we also have many restrictions. In summary, we disallow all kinds of modifications (including DDLs) on the partition tables. Instead, the user is allowed to run the modifications over the parent table. The necessity for such a restriction have two aspects: - We need to acquire shard resource locks appropriately - We need to handle marking partitions INVALID in case of any failures. Note that, in theory, the parent table should also become INVALID, which is too aggressive.	2018-09-21 14:40:41 +03:00
Murat Tuncer	b6930e3db9	Add distributed locking to truncated mx tables We acquire distributed lock on all mx nodes for truncated tables before actually doing truncate operation. This is needed for distributed serialization of the truncate command without causing a deadlock.	2018-09-21 14:23:19 +03:00
velioglu	d7f75e5b48	Add citus_lock_waits to show locked distributed queries	2018-09-20 14:13:51 +03:00
Murat Tuncer	0f6e514bfb	Fixes a bug on not being able to drop index on a partitioned table. Reason for the failure is that PG11 introduced a new relation kind RELKIND_PARTITIONED_INDEX to be used for partitioned indices. We expanded our check to cover that case.	2018-09-19 13:15:05 +03:00
Marco Slot	f34ab55389	Fix bug preventing rollback in stored procedure	2018-08-31 20:49:20 +02:00
Onder Kalaci	41d606b575	Use tree walker instad of mutator in relation visibility This commit uses _walker instead of _mutator for performance reasons. Given that we're only updating a functionId in the tree, the approach seems fine.	2018-09-18 09:33:01 +03:00
Onder Kalaci	4cae856846	Relax assertion on transaction abort on PREPARE step In case a failure happens when a transaction is failed on PREPARE, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-09-17 18:09:16 +03:00
Onder Kalaci	a94184fff8	Prevent overflow of memory accesses during deadlock detection In the distributed deadlock detection design, we concluded that prepared transactions cannot be part of a distributed deadlock. The idea is that (a) when the transaction is prepared it already acquires all the locks, so cannot be part of a deadlock (b) even if some other processes blocked on the prepared transaction, prepared transactions would eventually be committed (or rollbacked) and the system will continue operating. With the above in mind, we probably had a mistake in terms of memory allocations. For each backend initialized, we keep a `BackendData` struct. The bug we've introduced is that, we assumed there would only be `MaxBackend` number of backends. However, `MaxBackends` doesn't include the prepared transactions and axuliary processes. When you check Postgres' InitProcGlobal` you'd see that `TotalProcs = MaxBackends + NUM_AUXILIARY_PROCS + max_prepared_xacts;` This commit aligns with total procs processed with that.	2018-09-17 16:23:29 +03:00
Marco Slot	55f46acedf	Support TABLESAMPLE in router queries	2018-08-31 13:22:38 +02:00
velioglu	d1f005daac	Adds UDFs for testing MX functionalities with isolation tests	2018-09-12 07:04:16 +03:00
Onder Kalaci	d657759c97	Views to Provide some insight about the distributed transactions on Citus MX With this commit, we implement two views that are very similar to pg_stat_activity, but showing queries that are involved in distributed queries: - citus_dist_stat_activity: Shows all the distributed queries - citus_worker_stat_activity: Shows all the queries on the shards that are initiated by distributed queries. Both views have the same columns in the outputs. In very basic terms, both of the views are meant to provide some useful insights about the distributed transactions within the cluster. As the names reveal, both views are similar to pg_stat_activity. Also note that these views can be pretty useful on Citus MX clusters. Note that when the views are queried from the worker nodes, they'd not show the distributed transactions that are initiated from the coordinator node. The reason is that the worker nodes do not know the host/port of the coordinator. Thus, it is advisable to query the views from the coordinator. If we bucket the columns that the views returns, we'd end up with the following: - Hostnames and ports: - query_hostname, query_hostport: The node that the query is running - master_query_host_name, master_query_host_port: The node in the cluster initiated the query. Note that for citus_dist_stat_activity view, the query_hostname-query_hostport is always the same with master_query_host_name-master_query_host_port. The distinction is mostly relevant for citus_worker_stat_activity. For example, on Citus MX, a users starts a transaction on Node-A, which starts worker transactions on Node-B and Node-C. In that case, the query hostnames would be Node-B and Node-C whereas the master_query_host_name would Node-A. - Distributed transaction related things: This is mostly the process_id, distributed transactionId and distributed transaction number. - pg_stat_activity columns: These two views get all the columns from pg_stat_activity. We're basically joining pg_stat_activity with get_all_active_transactions on process_id.	2018-09-10 21:33:27 +03:00
Onder Kalaci	76aa6951c2	Properly send commands to other nodes We previously implemented OTHER_WORKERS_WITH_METADATA tag. However, that was wrong. See the related discussion: https://github.com/citusdata/citus/issues/2320 Instead, we switched using OTHER_WORKER_NODES and make the command that we're running optional such that even if the node is not a metadata node, we won't be in trouble.	2018-09-10 16:01:30 +03:00
Onder Kalaci	5cf8fbe7b6	Add infrastructure to relation if exists	2018-09-07 14:49:36 +03:00
Onder Kalaci	bf28dd0cff	Do not recover wrong distributed transactions in MX	2018-09-07 09:52:46 +03:00
Murat Tuncer	d8279569b8	Add support for INCLUDE option in index creation INCLUDE is a new feature in index creation in PG11. Included column/expression paramameters are now forwarded to shards	2018-09-06 19:41:06 +03:00
Onder Kalaci	1b3257816e	Make sure that table is dropped before shards are dropped This commit fixes a bug where a concurrent DROP TABLE deadlocks with SELECT (or DML) when the SELECT is executed from the workers. The problem was that Citus used to remove the metadata before droping the table on the workers. That creates a time window where the SELECT starts running on some of the nodes and DROP table on some of the other nodes.	2018-09-04 08:57:20 +03:00
Onder Kalaci	26e308bf2a	Support TRUNCATE from the MX worker nodes This commit enables support for TRUNCATE on both distributed table and reference tables. The basic idea is to acquire lock on the relation by sending the TRUNCATE command to all metedata worker nodes. We only skip sending the TRUNCATE command to the node that actually executus the command to prevent a self-distributed-deadlock.	2018-09-03 14:06:31 +03:00
Onder Kalaci	97ba7bf2eb	Add the option to skip the node that is executing the node	2018-09-03 14:01:24 +03:00
velioglu	bd30e3e908	Add support for writing to reference tables from MX nodes	2018-08-27 18:15:04 +03:00
velioglu	2639149bd8	Enterprise functions about metadata/resource locks	2018-08-27 16:32:20 +03:00
Onder Kalaci	b8af8c359b	Make sure that modifying CTEs always use the correct execution mode	2018-08-23 14:53:55 +03:00
Onder Kalaci	910ea392f5	Prevent multiple placements of a single shard to lead huge memory allocations	2018-08-22 19:25:01 +03:00
Onder Kalaci	cb481f55cf	Prevent excessive number of unnecessary range table traversal	2018-08-22 11:45:00 +03:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Nils Dijk	2a9d47e1a6	fix pg11 tests	2018-08-15 23:27:31 -06:00
mehmet furkan şahin	1a3b9f731e	Make master_disable/activate_node runnable when superuser	2018-08-15 00:43:35 -07:00
Onder Kalaci	85d418412d	Fix DDL execution problem on MX when search_path is used Make sure that the coordinator sends the commands when the search path synchronised with the coordinator's search_path. This is only important when Citus sends the commands that are directly relayed to the worker nodes. For example, the deparsed DLL commands or queries always adds schema qualifications to the queries. So, they do not require this change.	2018-08-13 16:34:50 +03:00
Onder Kalaci	974cbf11a5	Hide shard names on MX worker nodes This commit by default enables hiding shard names on MX workers by simple replacing `pg_table_is_visible()` calls with `citus_table_is_visible()` calls on the MX worker nodes. The latter function filters out tables that are known to be shards. The main motivation of this change is a better UX. The functionality can be opted out via a GUC. We also added two views, namely citus_shards_on_worker and citus_shard_indexes_on_worker such that users can query them to see the shards and their corresponding indexes. We also added debug messages such that the filtered tables can be interactively seen by setting the level to DEBUG1.	2018-08-07 14:21:45 +03:00
Onder Kalaci	e13da6a343	Add infrastructure to hide shards on MX worker nodes Add ability to understand whether a table is a known shard on MX workers. Note that this is only useful and applicable for hiding shards on MX worker nodes given that we can have metadata only there.	2018-08-04 09:03:37 +03:00
mehmet furkan şahin	bc757845eb	Citus versioning fix	2018-07-26 10:56:34 +03:00
mehmet furkan şahin	887aa8150d	Bump citus version to 8.0devel	2018-07-25 12:03:47 +03:00
velioglu	e23625bf5e	Use contype to check for FK constraint instead of reading catalog table	2018-07-24 15:53:05 +03:00
mehmet furkan şahin	6d0fbbace7	ALTER TABLE %s ADD COLUMN constraint check is added	2018-07-24 15:53:05 +03:00
Marco Slot	625816242a	Don't try to check unopened connection in EXEC_TASK_FAILED state	2018-07-23 11:41:02 -06:00
Nils Dijk	2d13900230	error on unsupported changing of distirbution column in ON CONFLICT for INSERT ... SELECT	2018-07-23 15:18:21 +02:00
Nils Dijk	6a15e1c9fc	extract ErrorIfOnConflictNotSupported function for reuse	2018-07-23 12:20:10 +02:00
Nils Dijk	df98900f80	fix missing space for tablein in error	2018-07-20 15:05:13 +02:00
Marco Slot	69a3ebea5f	Ensure StartPlacementListConnection connects with username supplied by the caller	2018-07-19 20:10:11 +02:00
Jason Petersen	318119910b	Add pg_dist_poolinfo table For storing nodes' pool host/port overrides.	2018-07-10 09:30:22 -07:00
mehmet furkan şahin	3afa7f425d	Topn aggregates are supported	2018-07-10 14:33:42 +03:00
Murat Tuncer	a7277526fd	Make citus_stat_statements_reset() super user function	2018-07-10 11:21:20 +03:00
Marco Slot	89870e76ce	Add a select_opens_transaction_block GUC	2018-07-08 03:50:39 +02:00
Murat Tuncer	f20258ef10	Expand count distinct support We can now support more complex count distinct operations by pulling necessary columns to coordinator and evalutating the aggreage at coordinator. It supports broad range of expression with the restriction that the expression must contain a column.	2018-07-06 09:44:20 +03:00
Onder Kalaci	7fb529aab9	Some stylistic improvements in the foreign keys to reference table changes.	2018-07-05 23:23:34 +03:00
Brian Cloutier	735218ee5d	Remove sslmode structs and add more helpful description	2018-07-05 14:12:36 +02:00
Nils Dijk	c1c8c38dc9	create placeholder for policy ddl	2018-07-05 11:07:01 +02:00
mehmet furkan şahin	06217be326	hll aggregate functions are supported natively	2018-07-04 16:41:09 +03:00
Murat Tuncer	901066a421	Move partition key logging related code from enterprise	2018-07-04 13:11:34 +03:00
mehmet furkan şahin	f7b901e3fd	CopyShardForeignConstraintCommandList API change for grouped constraints	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	35eac2318d	lock referenced reference table metadata is added For certain operations in enterprise, we need to lock the referenced reference table shard distribution metadata	2018-07-03 17:05:55 +03:00
Onder Kalaci	d83be3a33f	Enforce foreign key restrictions inside transaction blocks When a hash distributed table have a foreign key to a reference table, there are few restrictions we have to apply in order to prevent distributed deadlocks or reading wrong results. The necessity to apply the restrictions arise from cascading nature of foreign keys. When a foreign key on a reference table cascades to a distributed table, a single operation over a single connection can acquire locks on multiple shards of the distributed table. Thus, any parallel operation on that distributed table, in the same transaction should not open parallel connections to the shards. Otherwise, we'd either end-up with a self-distributed deadlock or read wrong results. As briefly described above, the restrictions that we apply is done by tracking the distributed/reference relation accesses inside transaction blocks, and act accordingly when necessary. The two main rules are as follows: - Whenever a parallel distributed relation access conflicts with a consecutive reference relation access, Citus errors out - Whenever a reference relation access is followed by a conflicting parallel relation access, the execution mode is switched to sequential mode. There are also some other notes to mention: - If the user does SET LOCAL citus.multi_shard_modify_mode TO 'sequential';, all the queries should simply work with using one connection per worker and sequentially executing the commands. That's obviously a slower approach than Citus' usual parallel execution. However, we've at least have a way to run all commands successfully. - If an unrelated parallel query executed on any distributed table, we cannot switch to sequential mode. Because, the essense of sequential mode is using one connection per worker. However, in the presence of a parallel connection, the connection manager picks those connections to execute the commands. That contradicts with our purpose, thus we error out. - COPY to a distributed table cannot be executed in sequential mode. Thus, if we switch to sequential mode and COPY is executed, the operation fails and there is currently no way of implementing that. Note that, when the local table is not empty and create_distributed_table is used, citus uses COPY internally. Thus, in those cases, create_distributed_table() will also fail. - There is a GUC called citus.enforce_foreign_key_restrictions to disable all the checks. We added that GUC since the restrictions we apply is sometimes a bit more restrictive than its necessary. The user might want to relax those. Similarly, if you don't have CASCADEing reference tables, you might consider disabling all the checks.	2018-07-03 17:05:55 +03:00
velioglu	6be6911ed9	Create foreign key relation graph and functions to query on it	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	4db72c99f6	Specific DDLs are sequentialized when there is FK -[x] drop constraint -[x] drop column -[x] alter column type -[x] truncate are sequentialized if there is a foreign constraint from a distributed table to a reference table on the affected relations by the above commands.	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	2c5d59f3a8	create_distributed_table in transaction is fixed	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	45f8017f42	create_distributed_table with fk to ref table is implemented	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	2fa4e38841	FK from dist to ref can be added with alter table	2018-07-03 17:05:01 +03:00
Murat Tuncer	23800f50f1	Update citus_stat_statements view and regression tests	2018-07-03 16:14:13 +03:00
Murat Tuncer	e532755a6e	Fix bug in partition column extraction added strip_implicit_coercion prior to checking if the expression is Const. This is important to find values for types like bigint.	2018-07-02 18:08:16 +03:00
Murat Tuncer	3fc7cdfe6d	Apply master_stage_protocol refactoring changes	2018-06-28 11:24:57 +03:00
Murat Tuncer	4d35b92016	Add groundwork for citus_stat_statements api	2018-06-27 14:20:03 +03:00
Brian Cloutier	5ce18327a7	Don't spinloop when trying to cleanup a failed connection	2018-06-26 13:13:34 -07:00
Onder Kalaci	7d0f7835e7	Improve relation accesses association to do less job	2018-06-25 18:40:40 +03:00
Onder Kalaci	8ccb8b679e	Real-time executor marks multi shard relation accesses before opening connections	2018-06-25 18:40:31 +03:00
Onder Kalaci	2890154420	Make sure that TRUNCATE always opens a DDL access	2018-06-25 18:40:31 +03:00
Onder Kalaci	21038f0d0e	Make sure that inter-shard DDL commands are always covers both tables	2018-06-25 18:40:30 +03:00
Onder Kalaci	2f01894589	Track relation accesses using the connection management infrastructure	2018-06-25 18:40:30 +03:00
Onder Kalaci	d5472614df	Use non-data connection for intermediate results Make sure that intermediate results use a connection that is not associated with any placement. That is useful in two ways: - More complex queries can be executed with CTEs - Safely use the same connections when there is a foreign key to reference table from a distributed table, which needs to use the same connection for modifications since the reference table might cascade to the distributed table.	2018-06-21 13:26:13 +03:00
Onder Kalaci	7762d81cba	Move test UDF under test folder	2018-06-21 08:42:44 +03:00
Jason Petersen	7a75c2ed31	Add connparam invalidation trigger creation logic This needs to live in Community, since we haven't yet added the com- plication of having divergent upgrade scripts in Enterprise.	2018-06-20 14:13:18 -06:00
mehmet furkan şahin	2b2ce036eb	create_distributed_table honors sequential mode	2018-06-19 17:33:45 +03:00
Onder Kalaci	8f5821493a	Implement C interface for setting GUC We need the ability to switch to sequential mode (e.g., SET LOCAL citus.multi_shard_modify_mode = 'sequential'). This commit enables that.	2018-06-19 10:23:43 +03:00
Marco Slot	f3f2805978	Fix use-after-free that may occur for INSERT..SELECT in prepared statements	2018-06-18 22:55:06 -06:00
velioglu	53b2e81d01	Adds SELECT ... FOR UPDATE support for router plannable queries	2018-06-18 13:55:17 +03:00
Marco Slot	0bbe778760	Rename failOnError to alwaysThrowErrorOnFailure	2018-06-14 23:37:47 +02:00
Marco Slot	0feb1f2eb1	Do not call CheckRemoteTransactionsHealth from commit handler	2018-06-14 23:33:07 +02:00
Marco Slot	4ab8e87090	Always throw errors on failure on critical connection in router executor	2018-06-14 23:33:07 +02:00
Nils Dijk	73efcb22c4	Extract RoleSpecString and resolve role references	2018-06-14 11:38:42 +02:00
Jason Petersen	5bf7bc64ba	Add pg_dist_authinfo schema and validation This table will be used by Citus Enterprise to populate authentication- related fields in outbound connections; Citus Community lacks support for this functionality.	2018-06-13 11:16:26 -06:00
Jason Petersen	57b3f253c5	Add node_conninfo GUC and related logic To support more flexible (i.e. not at compile-time) specification of libpq connection parameters, this change adds a new GUC, node_conninfo, which must be a space-separated string of key-value pairs suitable for parsing by libpq's connection establishment methods. To avoid rebuilding and parsing these values at connection time, this change also adds a cache in front of the configuration params to permit immediate use of any previously-calculated parameters.	2018-06-12 20:23:47 -06:00
mehmet furkan şahin	d1a3b20115	foreign_constraint_utils is created	2018-06-07 18:19:24 +03:00
Onder Kalaci	a5370f5bb0	Realtime executor honours multi_shard_modify_mode We're relying on multi_shard_modify_mode GUC for real-time SELECTs. The name of the GUC is unfortunate, but, adding one more GUC (or renaming the GUC) would make the UX even worse. Given that this mode is mostly important for transaction blocks that involve modification /DDL queries along with real-time SELECTs, we can live with the confusion.	2018-06-06 14:59:54 +03:00
Onder Kalaci	d918556dca	INSERT .. SELECT pushdown honors multi_shard_modification_mode	2018-06-06 12:42:23 +03:00
Onder Kalaci	336044f2a8	master_modify_multiple_shards() and TRUNCATE honors multi_shard_modification_mode	2018-06-06 12:29:05 +03:00
Onder Kalaci	df44956dc3	Make sure that sequential DDL opens a single connection to each node After this commit DDL commands honour `citus.multi_shard_modify_mode`. We preferred using the code-path that executes single task router queries (e.g., ExecuteSingleModifyTask()) in order not to invent a new executor that is only applicable for DDL commands that require sequential execution.	2018-06-05 17:52:17 +03:00
Marco Slot	fd4ff29f2f	Add a debug message with distribution column value	2018-06-05 15:09:17 +03:00
Murat Tuncer	ba50e3f33e	Add handling for grant/revoke all tables in schema	2018-05-31 13:47:02 +03:00
velioglu	20acee2cd4	Bump citus version to 7.5devel	2018-05-28 17:25:21 -06:00
Brian Cloutier	9667ee5ac9	Alleviate OOM failures in COMMIT callback Previously those failures caused us to crash, postgres abort()s when it notices a failure in the COMMIT callback.	2018-05-15 16:39:33 -07:00
Brian Cloutier	4c2bf5d2d6	Move call to RemoveIntermediateResultsDirectory Errors thrown in the COMMIT handler will cause Postgres to segfault, there's nothing it can do it abort the transaction by the time that handler is called! RemoveIntermediateResultsDirectory is problematic for two reasons: - It has calls to ereport(ERROR which have been known to trigger - It makes memory allocations which raise ERRORs when they fail Once the COMMIT process has begun we don't use the intermediate results, so it's safe to remove them a little earlier in the process. A failure here will abort the transaction. That's pretty unnecessary, it's not that important that we remove the results, but it's still better than a crash.	2018-05-10 19:28:41 -07:00
mehmet furkan şahin	d35f2725bf	valgrind tests fix	2018-05-10 10:20:14 +03:00
Dimitri Fontaine	8b258cbdb0	Lock reads and writes only to the node being updated in master_update_node Rather than locking out all the writes in the cluster, the function now only locks out writes that target shards hosted by the node we're updating.	2018-05-09 15:14:20 +02:00
Marco Slot	5f5f7b4fe0	Throw an error if placements cannot be found in router executor	2018-05-08 22:39:18 -04:00
Marco Slot	9438e5bde9	Ensure single-shard modifying CTEs are part of distributed transaction	2018-05-06 12:49:40 +02:00
velioglu	caa27161ca	Check volatile functions in modify queries	2018-05-08 11:16:40 +03:00
Hadi Moshayedi	86b12bc2d0	Always prefix operators with their namespace. (#2147 ) Previously we checked if an operator is in pg_catalog, and if it wasn't we prefixed it with namespace in worker queries. This can have a huge impact on performance of physical planner when using custom data types. This happened regardless of current search_path config, because Citus overrides the search path in get_query_def_extended(). When we do so, the check for existence of the operator in current search path in generate_operator_name() fails for any operators outside pg_catalog. This means that nothing gets cached, and in the following calls we will again recheck the system tables for existence of the operators, which took an additional 40-50ms for some of the usecases we were seeing. In this change we skip the pg_catalog check, and always prefix the operator with its namespace.	2018-05-05 13:27:26 -04:00
Marco Slot	2f9c8c6af0	Allow DML commands with unreferenced SELECT CTEs	2018-05-03 14:53:26 +02:00
Marco Slot	f8cfe07fd1	Support intermediate results in distributed INSERT..SELECT	2018-05-03 14:42:28 +02:00
Marco Slot	90cdfff602	Implement recursive planning for DML statements	2018-05-03 14:42:28 +02:00
Murat Tuncer	42a8082721	PG11 compatibility refresh adds a shim for a changed function api	2018-05-03 13:21:15 -06:00
Onder Kalaci	317dd02a2f	Implement single repartitioning on hash distributed tables * Change worker_hash_partition_table() such that the divergence between Citus planner's hashing and worker_hash_partition_table() becomes the same. * Rename single partitioning to single range partitioning. * Add single hash repartitioning. Basically, logical planner treats single hash and range partitioning almost equally. Physical planner, on the other hand, treats single hash and dual hash repartitioning almost equally (except for JoinPruning). * Add a new GUC to enable this feature	2018-05-02 18:50:55 +03:00
velioglu	32bcd610c1	Support modify queries with multiple tables With this commit we begin to support modify queries with multiple tables if these queries are pushdownable.	2018-05-02 16:22:26 +03:00
velioglu	d9fa69c031	Refactor query pushdown related logic	2018-05-02 15:03:09 +03:00
Brian Cloutier	f8fb7a27fb	Don't copyObject into the wrong memory context utilityStmt sometimes (such as when it's inside of a plpgsql function) comes from a cached plan, which is kept in a child of the CacheMemoryContext. When we naively call copyObject we're copying it into a statement-local context, which corrupts the cached plan when it's thrown away.	2018-05-01 15:34:32 -07:00
Marco Slot	2559b84049	Drop shards as current user instead of super user	2018-05-01 09:57:20 +02:00
velioglu	121ff39b26	Removes large_table_shard_count GUC	2018-04-29 10:34:50 +02:00
Onder Kalaci	832c91e28c	Move processing each part of the query into its own functions This commit doesn't change any of the logic at all. Instead, the goal is to: * Get rid of any code duplication * Incremental changes to the optimizer made it slightly hard to follow the code, improve that and make it easier to implement new features * Simplify the code by moving each part of query processing (e.g., DISTINCT, LIMIT etc) into its own function * Make the interaction between each part of the query more obvious (e.g., How DISTINCT affects LIMIT etc)	2018-04-27 17:32:38 +03:00
mehmet furkan şahin	f2555317b6	ProcessVacuumStmt update on names	2018-04-27 14:37:01 +03:00
mehmet furkan şahin	a4153c6ab1	notice handler is implemented	2018-04-27 14:37:01 +03:00
Marco Slot	304b3a41ba	Cache the partition column Var	2018-04-26 14:58:16 -06:00
Marco Slot	3d3c19a717	Improve messages for essential connection failures	2018-04-26 12:58:47 -06:00
Marco Slot	88f64d22db	Prevent connection pointer is NULL details	2018-04-26 12:49:57 -06:00
Marco Slot	394732b6be	Add a connection failure error code	2018-04-26 12:49:57 -06:00
Önder Kalacı	ebb8f902c8	Relax assertion on transaction rollback failure (#2052 ) In case a failure happens when a transaction is rollbacked, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-04-26 13:39:03 -04:00
Hadi Moshayedi	24659a97dc	Fail task in real-time executor if no placements found. (#2133 )	2018-04-26 12:05:24 -04:00
Murat Tuncer	a6fe5ca183	PG11 compatibility update - changes in ruleutils_11.c is reflected - vacuum statement api change is handled. We now allow multi-table vacuum commands. - some other function header changes are reflected - api conflicts between PG11 and earlier versions are handled by adding shims in version_compat.h - various regression tests are fixed due output and functionality in PG1 - no change is made to support new features in PG11 they need to be handled by new commit	2018-04-26 11:29:43 +03:00
Brian Cloutier	49255213d4	Configure appveyor to run regression tests - Add install.pl to instal .sql files on Windows - Remove a hack to PGDLLIMPORT some variables - Add citus_version.o to the Makefile - Fix pg_regress_multi's PATH generation on Windows - Output regression.diffs when the tests fail - Fix permissions in data directory, make sure postgres can play with it	2018-04-25 18:02:07 -07:00
Onder Kalaci	ac8f2f1e6d	Eliminate code duplication in WorkerExtendedOpNode() Before this commit, we had code duplication in the WorkerExtendedOpNode(). The duplication was noticeable and any change is prone to bugs. The PR consists of 4 commits. Each commit incrementally fixes the problem by moving certain parts of the duplicated code into smaller, better-documented functions.	2018-04-25 08:54:59 +03:00
Brian Cloutier	8d4c4d5c58	Close all files before trying to remove them	2018-04-24 14:35:20 -07:00
Brian Cloutier	c5f1235090	Turn the crashes on Windows into WARNINGs	2018-04-24 14:35:20 -07:00
Onder Kalaci	ee748d9140	Unify extendedOpNode Processing Before this commit, we had a divergence among the creation of master/worker extended op nodes. This commit moves the related parts into a single place and allows the creation of master/extended op nodes to share a common data structure.	2018-04-24 11:56:38 +03:00
Hadi Moshayedi	966f01fad3	Fix write and copy functions for TaskExecution. (#2120 ) We were missing criticalErrorOccurred from CopyNodeTaskExecution() and OutTaskExecution(). This PR fixes it.	2018-04-23 09:07:52 -04:00
Onder Kalaci	814f0e3acc	Ensure Citus never try to access a not planned subquery PostgreSQL might remove some of the subqueries when they do not contribute to the query result at all. Citus should not try to access such subqueries during planning.	2018-04-20 13:52:00 +03:00
Brian Cloutier	b0b130f064	Fix Windows crash in multi_copy test Without this change we crash on Windows with COPYing into a table with 62 shards, and we ERROR when COPYing into a table with >62 shards: ERROR: WaitForMutipleObjects() failed: error code 87	2018-04-17 15:48:02 -07:00
Brian Cloutier	a59c1c634e	Fix cancellation of real time queries Without this change multi_real_time_transaction blocks forever (on Windows) in the block where it repeatedly calls pg_advisory_lock(15). This happens because the deadlock detector tries to cancel the backend but the backend never processes that signal.	2018-04-17 14:26:22 -07:00
mehmet furkan şahin	00e786af00	Capital named schema support is added	2018-04-17 17:17:42 +03:00
mehmet furkan şahin	e5a5502b16	Adds support for multiple ANDs in Having This PR adds support for multiple AND expressions in Having for pushdown planner. We simply make a call to make_ands_explicit from MultiLogicalPlanOptimize for the having qual in workerExtendedOpNode.	2018-04-16 14:14:48 +03:00
Brian Cloutier	42ddfa176d	Fix crash on Windows where there is no detail	2018-04-13 12:54:22 -07:00
velioglu	82b2d21b0c	Convert broadcast join to reference join After this commit large_table_shard_count wont be used to check whether broadcast join, which is renamed as reference join, can be applied. Reference join can only be applied over reference tables.	2018-04-13 12:58:14 +03:00
velioglu	1b92812be2	Add co-placement check to CoPartition function	2018-04-13 12:13:08 +03:00
Marco Slot	9318aeee6b	Allow multiple size function calls per query	2018-04-12 14:16:17 +02:00
Marco Slot	ee132c5ead	Prune shards once per relation in subquery pushdown	2018-04-10 20:33:07 +02:00
Burak Yucesoy	b33b282030	Fix bug while DROPping partitioned table from worker We recently added partitionin support to Citus MX. We should not execute DROP table commands from MX workers but at the moment we try to execute such commands for partitioned tables. This PR fixes that problem by adding check.	2018-04-09 13:50:21 +03:00
Burak Yucesoy	0c283fa8a3	Add partitioning support to MX tables Previously, we prevented creation of partitioned tables on Citus MX. We decided to not focus on this feature until there is a need. Since now there are requests for this feature, we are implementing support for partitioned tables on Citus MX.	2018-04-06 12:47:06 +03:00
velioglu	72dfe4a289	Adds colocation check to local join	2018-04-04 22:49:27 +03:00
velioglu	698d585fb5	Remove broadcast join logic After this change all the logic related to shard data fetch logic will be removed. Planner won't plan any ShardFetchTask anymore. Shard fetch related steps in real time executor and task-tracker executor have been removed.	2018-03-30 11:45:19 +03:00
Matthew Wozniczka	4582a4b398	Fixed a typo	2018-03-27 22:51:36 -06:00
Brian Cloutier	f8f0d4aedc	Add Windows replacement for uname	2018-03-21 20:35:56 -07:00
Brian Cloutier	98ffafe16e	Fix error handling in connection_management	2018-03-21 20:05:00 -07:00
Murat Tuncer	224b0a8c14	Replace poll with select/poll Windows does not have poll(), so fall back to select()	2018-03-21 20:05:00 -07:00
Metin Doslu	3b7b64a8b6	Remove skip_jsonb_validation_in_copy GUC	2018-03-13 10:33:27 +02:00
Murat Tuncer	1440caeef2	Fix incorrect limit pushdown when distinct clause is not superset of group by (#2035 ) Pushing down limit and order by into workers may produce wrong output when distinct on() clause has expressions, aggregates, or window functions. This checking allows pushing down of limits only if distinct clause is a superset of group by clause. i.e. it contains all clauses in group by.	2018-03-07 13:24:56 +03:00
Metin Doslu	e86d34256c	Change default to false for citus.skip_jsonb_validation_in_copy	2018-03-06 13:19:47 +02:00
Onder Kalaci	40b898b59f	Improve error messages for INSERT queries that have subqueries	2018-03-05 14:46:47 +02:00
Onder Kalaci	7dc9589b56	Handle failures during I/O This commit checks the connection status right after any IO happens on the socket. This is necessary since before this commit we didn't pass any information to the higher level functions whether we're done with the connection (e.g., no IO required anymore) or an errors happened during the IO.	2018-03-02 08:33:53 +02:00
Onder Kalaci	da0048e0b7	ForgetResults() becomes a wrapper for ClearResults() ClearResults() is able to handle failures properly by checking the result status. So, relying on it makes error handling more generic in Citus.	2018-03-02 08:33:53 +02:00
Murat Tuncer	76f6883d5d	Add support for window functions that can be pushed down to worker (#2008 ) This is the first of series of window function work. We can now support window functions that can be pushed down to workers. Window function must have distribution column in the partition clause to be pushed down.	2018-03-01 19:07:07 +03:00
Marco Slot	e79db17b91	Update comment in WorkerAggregateExpressionList	2018-02-27 23:48:25 +01:00
Murat Tuncer	e13c5beced	Fix worker query when order by avg aggregate is used (#2024 ) We push down order by to worker query when limit is specified (with some other additional checks). If the query has an expression on an aggregate or avg aggregate by itself, and there is an order by on this particular target we may send wrong order by to worker query with potential to affect query result. The fix creates a auxilary target entry in the worker query and uses that target entry for sorting.	2018-02-28 12:12:54 +03:00
Metin Doslu	bcf660475a	Add support for modifying CTEs	2018-02-27 15:08:32 +02:00
velioglu	78e6d990a2	Fix master plan of the query with distinct, aggregate and group by clauses. Before this PR, we were trusting on the columns of group by about guaranteeing the uniqueness of the results. However, this assumption is correct only if the columns in the group by is subset of columns in the distinct clause. It can be wrong if we have part of group by columns and some aggregation columns in the distinct clause. With this PR, we add distinct plan on top of aggregate plan when necessary.	2018-02-26 15:30:15 +03:00
Onder Kalaci	1c930c96a3	Support non-co-located joins between subqueries With #1804 (and related PRs), Citus gained the ability to plan subqueries that are not safe to pushdown. There are two high-level requirements for pushing down subqueries: * Individual subqueries that require a merge step (i.e., GROUP BY on non-distribution key, or LIMIT in the subquery etc). We've handled such subqueries via #1876. * Combination of subqueries that are not joined on distribution keys. This commit aims to recursively plan some of such subqueries to make the whole query safe to pushdown. The main logic behind non colocated subquery joins is that we pick an anchor range table entry and check for distribution key equality of any other subqueries in the given query. If for a given subquery, we cannot find distribution key equality with the anchor rte, we recursively plan that subquery. We also used a hacky solution for picking relations as the anchor range table entries. The hack is that we wrap them into a subquery. This is only necessary since some of the attribute equivalance checks are based on queries rather than range table entries.	2018-02-26 13:50:37 +02:00
Onder Kalaci	7b57e0562a	Add infrastructure for detecting non-colocated subqueries	2018-02-26 13:28:25 +02:00
Onder Kalaci	4d70c86645	Leaf level recursive planning for non colocated subqueries With this commit, we enable recursive planning for the subqueries that are not joined on the distribution keys.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e998703ff8	Enable restriction eq. checks for top level set operations We used to only support pushdownable set operations inside a subquery, however, we could easily expand the restriction checks to cover top level set operations as well.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e8aa532a90	Refactor checks for distribution key equality Change some function names, ensure we stick to Citus' function order rules etc.	2018-02-26 13:28:24 +02:00
Marco Slot	1e9186a3b5	Do not use new connection in table size functions	2018-02-23 07:07:55 +01:00
Markus Sintonen	6202e80d06	Implemented jsonb_agg, json_agg, jsonb_object_agg, json_object_agg	2018-02-18 00:19:18 +02:00
velioglu	195ac948d2	Recursively plan subqueries in WHERE clause when FROM recurs	2018-02-13 19:52:12 +03:00
Marco Slot	0cba4ab588	Refactor worker node hash initialisation	2018-02-12 23:36:43 +01:00
Marco Slot	40d715d494	Cache worker node array for faster iteration	2018-02-12 23:36:43 +01:00
Marco Slot	6e79a34c97	Do not check for cancellation in ClearResultsIfReady	2018-02-12 16:45:02 +01:00
Marco Slot	6051aae56e	Handle errors that are discovered during abort	2018-02-12 16:45:02 +01:00
Marco Slot	ee6a751798	Only copy distributed plan when modifying it	2018-02-12 16:30:55 +01:00
Onder Kalaci	94c5ac6ebb	Remove duplicate join restrictions We use PostgreSQL hooks to accumulate the join restrictions and PostgreSQL gives us all the join paths it tries while deciding on the join order. Thus, for queries that have many joins, this function is likely to remove lots of duplicate join restrictions. This becomes relevant for Citus on query pushdown check peformance.	2018-02-12 18:35:05 +02:00
Onder Kalaci	c228d8ff3d	Refactor equivalance generation related codes This commit changes the APIs for restriction generation to make future changes simpler.	2018-02-12 18:35:04 +02:00
Onder Kalaci	2f2d350924	Refactor relation restriction related codes This commit moves some of the functions to a more relevant source file.	2018-02-12 18:35:04 +02:00
Murat Tuncer	901b543e20	Fix count distinct using field select on top level query We were allowing count distict queries even if they were not directly on columns if the query is grouped on distribution column. When performing these checks we were skipping subqueries because they also perform this check in a more concise manner. We relied on oid SUBQUERY_RELATION_ID (10000) to decide if a given RTE relation id denotes a subquery, however, we also use SUBQUERY_PUSHDOWN_RELATION_ID (10001) for some subqueries. We skip both type of subqueries with this change.	2018-02-06 13:16:10 +03:00
metdos	35f864bcaf	Respect enable_hashagg in the master planner	2018-02-05 15:06:00 +02:00
metdos	3d540d961c	Fix typo in grouping_is_sortable()	2018-02-05 12:10:19 +02:00
Marco Slot	6f7c3bd73b	Skip JSON validation on coordinator during COPY	2018-02-02 15:33:27 +01:00
Brian Cloutier	15511f6ba1	Dynamically allocate connection metadata in WaitForAllConnections	2018-02-01 10:30:41 -08:00
Brian Cloutier	e6ebfc1f53	Remove VLA from UpdateNodeLocation	2018-02-01 10:30:41 -08:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Brian Cloutier	2efe80ce55	CheckForDistributedDeadlocks no longer uses a VLA - variable length arrays (VLAs) do not work with Visual Studio - fix an off-by-one error. We incorrectly assumed there would always at least as many edges as there were nodes. - refactor: reduce scope of transactionNodeStack by moving it into the function which uses it. - refactor: break up the distinct uses of currentStackDepth into separate variables.	2018-02-01 10:30:41 -08:00
Brian Cloutier	097fd15a89	small refactor, CheckDeadlockForTransactionNode builds it's own array	2018-02-01 10:30:41 -08:00
Brian Cloutier	457f570b77	Small refactor, we were using incompatible types	2018-01-31 11:05:59 -08:00
Brian Cloutier	b864d014ab	GetNextNodeId() incorrectly called PG_RETURN_DATUM - Also stabilize the output of a multi_router_planner test	2018-01-29 15:32:36 -08:00
Brian Cloutier	61a6b846b9	Refactor: use a temporary timestamp variable It's against our coding convention to call functions inside parameter lists; when single-stepping with a debugger it's difficult to determine what the function returned. That wouldn't be good enough reason to change this code but while porting Citus to Windows I ran into this line of code. assign_distributed_transaction_id was called with a weird timestamp and I wasn't able to find the problem without first making this change.	2018-01-29 11:20:13 -08:00
Marco Slot	bd0ebac865	Skip call to ActiveReadableNodeList when there are no subplans	2018-01-29 16:05:10 +01:00
Hadi Moshayedi	ff26bcd5a5	Include sys/stat.h for S_IRUSR and S_IWUSR. (#1977 )	2018-01-26 16:21:48 -05:00
Brian Cloutier	76d1edc3fd	Don't rely on gcc-specific features (#1963 ) * Don't use expressions inside compound statements * Don't depend on __builtin_constant_p * Remove reliance on S_ISLNK * Replace use of __func__: older mcvs doesn't support this builtin	2018-01-23 17:03:29 -08:00
Onder Kalaci	fbde87d2d0	Allocate enough space for transaction nodes This fix prevents any potential memory access that might occur while forming the deadlock path.	2018-01-22 08:45:48 +02:00
Onder Kalaci	9a89c0b425	Fix bug while traversing the distributed deadlock graph With this fix, we traverse the graph with DFS which was originally intended. Note that, before the fix, we traverse the graph with BFS which might lead to killing some unrelated backend that is not involved in the distributed deadlock.	2018-01-22 08:45:48 +02:00
Dimitri Fontaine	c9760fbb64	Fix CREATE INDEX with storage options on distributed tables. By sharing the implementation of the function AppendOptionListToString on three call sites, we would expand an extra OPTIONS keyword in a create index statement, and omit other bits of the specific syntax here. This patch introduces an AppendStorageParametersToString() function that is very similar to AppendOptionListToString() but handles WITH(a="foo",...) syntax that is used in reloptions (aka Storage Parameters). Fixes #1747.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	952da72c55	Implement ALTER TABLE\|INDEX ... SET\|RESET (). PostgreSQL implements support for several relation kinds in a single statement, such as in the AlterTableStmt case, which supports both tables and indexes and more (see ATExecSetRelOptions in PostgreSQL source code file src/backend/commands/tablecmds.c for an example of that). As a consequence, this patch implements support for setting and resetting storage parameters on both relation kinds.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	17266e3301	Implement ALTER INDEX ... RENAME TO ... The command is now distributed among the shards when the table is distributed. To that effect, we fill in the DDLJob's targetRelationId with the OID of the table for which the index is defined, rather than the OID of the index itself.	2018-01-17 21:56:40 +01:00
velioglu	d357d2fccd	Bump citus version to 7.3devel	2018-01-16 11:50:28 +03:00
Dimitri Fontaine	e010238280	Implement ALTER TABLE ... RENAME TO ... The implementation was already mostly in place, but the code was protected by a principled check against the operation. Turns out there's a nasty concurrency bug though with long identifier names, much as in #1664. To prevent deadlocks from happening, we could either review the DDL transaction management in shards and placements, or we can simply reject names with (NAMEDATALEN - 1) chars or more — that's because of the PostgreSQL array types being created with a one-char prefix: '_'.	2018-01-11 13:21:24 +01:00
Hadi Moshayedi	5d7c52ffa6	Don't return in PG_TRY() block when cancellations happen in WaitForConnections(). (#1923 ) We shouldn't return in middle of a PG_TRY() block because if we do, we won't reset PG_exception_stack, and later when a re-throw tries to jump to the jump-point which was active in this PG_TRY() block, it seg-faults. We used to return in middle of PG_TRY() block in WaitForConnections() where we checked for cancellations. Whenever cancellations were caught here, Citus crashed. And example was reported by @onderkalaci at #1903.	2018-01-03 09:54:03 -05:00
Marco Slot	8f69973411	Fix cancellation issues in the real-time executor (#1905 )	2018-01-01 23:10:29 -05:00
Marco Slot	3fd65cb91b	Do not raise errors in the real-time executor (#1903 )	2018-01-01 22:26:31 -05:00
Onder Kalaci	a1bbdf2d44	Outer joins should also use subquery pushdown planner if join clause is not supported This change allows unsupported clauses to go through query pushdown planner instead of erroring out as we already do for non-outer joins.	2017-12-29 16:40:47 +02:00
Marco Slot	09c09f650f	Recursively plan set operations when leaf nodes recur	2017-12-26 13:46:55 +02:00
mehmet furkan şahin	446893234a	unsupported subquery error messages are fixed	2017-12-25 15:10:59 +03:00
mehmet furkan şahin	57bc86e23d	new debug output for subplans	2017-12-25 09:50:51 +03:00
Marco Slot	fa7fa2734b	Log remote commands sent via MultiClientSendQuery	2017-12-22 16:18:40 +01:00
Murat Tuncer	87c6f306f1	Fix join clause eq restrictions (#1884 ) We used to error out if the join clause includes filters like t1.a < t2.a even if other filter like t1.key = t2.key exists. Recently we lifted that restriction in subquery planning by not lifting that restriction and focusing on equivalance classes provided by postgres. This checkin forwards previously erroring out real-time queries due to join clauses to subquery planner and let it handle the join even if the query does not have a subquery. We are now pushing down queries that do not have any subqueries in it. Error message looked misleading, changed to a more descriptive one.	2017-12-22 12:16:14 +03:00
metdos	32b7e152a3	Get shard resource locks for only DMLs	2017-12-22 10:30:41 +02:00
Murat Tuncer	a9cf0c3e66	Fix CTE column alias issue (#1893 ) We were creating intermediate query result's target names from subquery target list. Now we also check if cte re-defines its column name aliases, and create intermediate result query accordingly.	2017-12-22 09:39:40 +03:00
Brian Cloutier	377b31dcf7	Remove enable_deadlock_prevention prevention warning	2017-12-21 14:47:52 +01:00
Brian Cloutier	fb7b86fa14	Replace strtoull with pg_strtouint64 The macro we were using to detect strtoull isn't set on Windows, and just in case there are differences use a portable function from PG instead of calling strtoull directly.	2017-12-21 14:28:51 +01:00
mehmet furkan şahin	fd546cf322	Intermediate result size limitation This commit introduces a new GUC to limit the intermediate result size which we handle when we use read_intermediate_result function for CTEs and complex subqueries.	2017-12-21 14:26:56 +03:00
Onder Kalaci	0d5a4b9c72	Recursively plan subqueries that are not safe to pushdown With this commit, Citus recursively plans subqueries that are not safe to pushdown, in other words, requires a merge step. The algorithm is simple: Recursively traverse the query from bottom up (i.e., bottom meaning the leaf queries). On each level, check whether the query is safe to pushdown (or a single repartition subquery). If the answer is yes, do not touch that subquery. If the answer is no, plan the subquery seperately (i.e., create a subPlan for it) and replace the subquery with a call to `read_intermediate_results(planId, subPlanId)`. During the the execution, run the subPlans first, and make them avaliable to the next query executions. Some of the queries hat this change allows us: * Subqueries with LIMIT * Subqueries with GROUP BY/DISTINCT on non-partition keys * Subqueries involving re-partition joins, router queries * Mixed usage of subqueries and CTEs (i.e., use CTEs in subqueries as well). Nested subqueries as long as we support the subquery inside the nested subquery. * Subqueries with local tables (i.e., those subqueries has the limitation that they have to be leaf subqueries) * VIEWs on the distributed tables just works (i.e., the limitations mentioned below still applies to views) Some of the queries that is still NOT supported: * Corrolated subqueries that are not safe to pushdown * Window function on non-partition keys * Recursively planned subqueries or CTEs on the outer side of an outer join * Only recursively planned subqueries and CTEs in the FROM (i.e., not any distributed tables in the FROM) and subqueries in WHERE clause * Subquery joins that are not on the partition columns (i.e., each subquery is individually joined on partition keys but not the upper level subquery.) * Any limitation that logical planner applies such as aggregate distincts (except for count) when GROUP BY is on non-partition key, or array_agg with ORDER BY	2017-12-21 08:37:40 +02:00
Onder Kalaci	e12ea914b9	Refactor ErrorIfQueryNotSupported to defer errors	2017-12-20 09:03:49 +02:00
Onder Kalaci	71ce42b936	Refactor RecursivelyPlanSubqueriesAndCTEs() to make it ready to work with subqueries	2017-12-20 09:03:47 +02:00
Marco Slot	5e0539efa3	Plan CTEs when subquery pushdown is on	2017-12-19 16:34:56 +01:00
Marco Slot	44a1ea631a	Show distributed subplan ID in EXPLAIN output	2017-12-19 16:34:56 +01:00
Marco Slot	35dbacdb69	Do not reinitialise MyBackendData	2017-12-19 15:56:26 +01:00
Marco Slot	af201a2f6d	Allow intermediate results to be used in parallel workers	2017-12-18 19:05:08 +01:00
Marco Slot	7dab078e67	Set cost estimates for read_intermediate_result	2017-12-18 16:23:44 +01:00
Marco Slot	74bd33d0cc	Revert "Plan CTEs when subquery pushdown is on" This reverts commit `e3b953b8e3`.	2017-12-17 22:34:20 +01:00
Marco Slot	aca5f35ab9	Revert "Show distributed subplan ID in EXPLAIN output" This reverts commit `686b079272`.	2017-12-17 22:34:04 +01:00
Marco Slot	e3b953b8e3	Plan CTEs when subquery pushdown is on	2017-12-17 21:49:36 +01:00
Marco Slot	686b079272	Show distributed subplan ID in EXPLAIN output	2017-12-16 11:32:01 +01:00
Marco Slot	ea6b98fda4	Allow count(distinct) in queries with a subquery	2017-12-15 15:24:26 +01:00
Marco Slot	9ee0e68882	Do not take extra access exclusive lock partitioned tables	2017-12-15 13:02:31 +01:00
Marco Slot	5a69fc1b17	Relax checks on recurring tuples in FROM with sublinks	2017-12-15 11:56:06 +01:00
Marco Slot	a64f0060ba	Reduce the frequency of FinishConnectionIO calls during COPY (#1864 )	2017-12-14 13:21:59 -05:00
Marco Slot	2e2b4e81fa	Add support for CTEs in distributed queries	2017-12-14 09:32:55 +01:00
Marco Slot	d0335ec818	Send BEGIN for SELECTs in the router executor	2017-12-14 09:32:55 +01:00
Marco Slot	cbbd418af2	Add citus.copy_format OIDs to metadata cache	2017-12-14 09:32:55 +01:00
Marco Slot	66f9f1d6cd	Make some intermediate results functions public	2017-12-14 09:32:55 +01:00
Marco Slot	36ee21c323	Make CanUseBinaryCopyFormatForType public	2017-12-14 09:32:55 +01:00
Marco Slot	7d1191954d	Add DistributedSubPlan node	2017-12-14 09:32:55 +01:00
Onder Kalaci	86b2d9420c	Treat recurring tuples as reference table for GROUP BY checks read_intermediate_results() and immutable functions are implemented. Empty join trees seems not applicable here.	2017-12-13 14:55:42 +02:00
Marco Slot	d1a470a52e	Fix issue with multiple ANALYZE in transaction block	2017-12-12 10:28:48 +01:00
mehmet furkan şahin	3c941aedf1	adds citus.enable_repartition_joins GUC The new GUC allows Citus to switch between task executors when necessary	2017-12-11 09:36:37 +03:00
Marco Slot	60a1e31671	Allow queries with local tables in NeedsDistributedPlanning	2017-12-07 16:20:23 +01:00
Marco Slot	f8550b8c85	Fix issues with read_intermediate_result signature	2017-12-07 13:47:56 +01:00
Marco Slot	d8fea4efb8	Revert "Allow queries with local tables in NeedsDistributedPlanning" This reverts commit `d2bac081e8`.	2017-12-07 11:19:11 +01:00
Marco Slot	d2bac081e8	Allow queries with local tables in NeedsDistributedPlanning	2017-12-07 11:02:16 +01:00
Onder Kalaci	c42a92afd2	Fix bug related to incrementing an index not properly	2017-12-07 08:50:57 +02:00
Marco Slot	eab15aa035	Avoid deadlock in ColocatedTableId	2017-12-06 11:49:34 +01:00
Marco Slot	7279d42849	Treat read_intermediate_result as recurring tuples	2017-12-04 14:50:11 +01:00
Marco Slot	4cdadfcab6	Add intermediate results infrastructure	2017-12-04 14:50:11 +01:00
Marco Slot	bfcc76df69	Make several COPY-related functions public	2017-12-04 13:12:03 +01:00
Marco Slot	73989b07eb	Refactor query execution functions	2017-12-04 13:12:03 +01:00
Murat Tuncer	2d66bf5f16	Fix hard coded formatting strings for 64 bit numbers (#1831 ) Postgres provides OS agnosting formatting macros for formatting 64 bit numbers. Replaced %ld %lu with INT64_FORMAT and UINT64_FORMAT respectively. Also found some incorrect usages of formatting flags and fixed them.	2017-12-04 14:11:06 +03:00
Hadi Moshayedi	ff706cf556	Test that COPY blocks UPDATE/DELETE/INSERT...SELECT when rep factor 2.	2017-11-30 14:52:29 -05:00
Marco Slot	acbc0fe0de	Use RowExclusiveLock shard resource lock in COPY	2017-11-30 09:15:45 -05:00
Onder Kalaci	a273711500	The common attribute equivalance class always includes the input relations We added the ability to filter out the planner restriction information for specific parts of the query. This might lead to situations where the common restriction includes some other relations that we're searching for. The reason is that while filtering for join restrictions, we add the restriction as soon as we find the relation. With this commit we make sure that the common attribute equivalance class always includes the input relations.	2017-11-30 16:00:26 +02:00
Marco Slot	d6dd0b3a81	Send BEGIN in the real-time executor when in a transaction	2017-11-30 12:59:09 +01:00
Marco Slot	3a4d5f8182	Remove filter checks on leaf queries	2017-11-30 12:25:14 +01:00
Marco Slot	3f03cb6a6a	Support UNION with joins in the subqueries	2017-11-30 10:37:56 +01:00
Marco Slot	a9933deac6	Make real time executor work in transactions	2017-11-30 09:59:32 +03:00
Jason Petersen	0eacf6bd95	Refactor VacuumStmt checker to be single-return Decided this would be safer for the future (defaults to unsupported).	2017-11-29 16:06:50 -07:00
Jason Petersen	b12e77ab0e	Ensure unsupported VACUUMs don't go to workers Apparently these two blocks have been incorrect for nearly a year…	2017-11-29 16:06:50 -07:00
Marco Slot	7ea718fd8d	Round-robin over worker nodes for 0-shard router queries	2017-11-29 15:52:22 +01:00
Onder Kalaci	05fb0dd020	Add infrastructure for filtering restriction contexts based on the input query In subquery pushdown, we first ensure that each relation is joined with at least on another relation on the partition keys. That's fine given that the decision is binary: pushdown the query at all or not. With recursive planning, we'd want to check whether any specific part of the query can be pushded down or not. Thus, we need the ability to understand which part(s) of the subquery is safe to pushdown. This commit adds the infrastructure for doing that.	2017-11-28 09:58:21 +02:00
Onder Kalaci	26d9b58e9e	Make sure that ExtractRangeTableRelationWalker never misses RTE_RELATION	2017-11-28 09:27:34 +02:00
Onder Kalaci	32def06ebd	Split assigning RTE identities and partitioning related query modifications Note that we used to iterate over the RTEs once for performance reasons. However, keeping an extra copy of original query seems more costly and hard to maintain/explain.	2017-11-28 09:27:34 +02:00
Marco Slot	feffe86440	Subqueries containing functions go through subquery pushdown	2017-11-27 22:13:02 +01:00
Onder Kalaci	48f96bf3e5	Enable non equi joins in subquery pushdown Subquery pushdown planning is based on relation restriction equivalnce. This brings us the opportuneatly to allow any other joins as long as there is an already equi join between the distributed tables. We already allow that for joins with reference tables and this commit allows that for joins among distributed tables.	2017-11-23 16:13:46 +02:00
Onder Kalaci	16421f089f	Register citus custom scan nodes	2017-11-23 11:38:33 +02:00
Onder Kalaci	83c1143505	Refactor custom scan related codes In this commit, we don't change any codes, only create a new file and move the related functions and types there.	2017-11-23 11:38:12 +02:00
Marco Slot	20a526d5c4	Fix memory leak in ListToHashSet	2017-11-22 11:26:58 +01:00
Marco Slot	f4ceea5a3d	Enable 2PC by default	2017-11-22 11:26:58 +01:00
Marco Slot	8486f76e15	Auto-recover 2PC transactions	2017-11-22 11:26:58 +01:00
Marco Slot	6ba3f42d23	Rename MultiPlan to DistributedPlan	2017-11-22 09:36:24 +01:00
Marco Slot	0ad39b36fe	Treat immutable table functions and constant subqueries as reference tables	2017-11-21 14:15:22 +01:00
Onder Kalaci	d558ebb923	Relax the checks on ensuring distribution columns for target entries With this commit, we allow pushing down subqueries with only reference tables where GROUP BY or DISTINCT clause or Window functions include only columns from reference tables.	2017-11-21 12:28:14 +02:00
Andres Freund	d063658d6d	Protect some initializations from being called during backend startup. On EXEC_BACKEND builds these functions shouldn't be called at every backend start.	2017-11-20 15:29:51 -08:00
Brian Cloutier	d267e0f9fa	EXEC_BACKEND: don't put pointers to shared hashes into shared memory Store pointers to shared hashes in process-local variables. Previously pointers to shared hashes were put into shared memory. This causes problems on EXEC_BACKEND because everybody calls execve and receives a brand new address space; the shared hash will be in a different place for every backend. (normally we call fork, which gives you a copy of the address space, so these pointers remain constant)	2017-11-20 15:29:51 -08:00
Brian Cloutier	30a2365d81	Rename CreateDirectory to CitusCreateDirectory	2017-11-20 14:38:26 -08:00
Brian Cloutier	aa2ab023a2	Rename RemoveDirectory -> CitusRemoveDirectory	2017-11-20 14:21:52 -08:00
Brian Cloutier	06f756b0a1	Rename DeleteFile -> CitusDeleteFile	2017-11-20 13:30:11 -08:00
Marco Slot	9793218122	Do not commit already-committed prepared transactions in recovery	2017-11-20 13:18:48 +01:00
Marco Slot	ae47df01ea	Observe prepared xacts twice in RecoverWorkerTransactions to avoid race condition	2017-11-20 11:44:08 +01:00
Marco Slot	2410c2e450	Rewrite recover_prepared_transactions to be fast, non-blocking	2017-11-20 11:27:40 +01:00
Onder Kalaci	5bea95009b	Skip autovacuum processes for distributed deadlock detection Autovacuum process cancels itself if any modification starts on the table in order to avoid blocking your regular Postgres sessions. That's normal and expected. Thus, any locks held by autovacuum process cannot involve in a distributed deadlock since it'll be released if needed.	2017-11-15 14:32:16 +02:00
Onder Kalaci	c65c153a46	Skip speculative locks for distributed deadlock detection These locks are held for a very short duration time and cannot contribute to a deadlock. Speculative locks are used by Postgres for internal notification mechanism among transactions.	2017-11-15 12:43:45 +02:00
Marco Slot	bbbadd6d1b	Bump Citus version to 7.2devel	2017-11-15 10:32:49 +01:00
Marco Slot	d3b634b301	Allow generating placement IDs without using the sequence	2017-11-15 10:12:06 +01:00
Marco Slot	c24a0875a5	Allow generating shard IDs without using the sequence	2017-11-15 10:12:05 +01:00
Brian Cloutier	0f3230170f	Pull in INT32_MAXINT and INT32_MININT	2017-11-14 14:03:46 -08:00
Brian Cloutier	0db8277266	remove unused errno import	2017-11-14 13:09:34 -08:00
Brian Cloutier	5d9f3ae7fd	Remove unused poll import from multi_real_time_executor	2017-11-14 13:09:34 -08:00
Marco Slot	533a533565	Only drop sequences on workers with metadata	2017-11-14 16:01:56 +01:00
velioglu	be28ba8e70	Add stub UDF to run pg_upgrade flawlessly	2017-11-13 16:14:45 +02:00
metdos	111c04c2bd	Warn on CLUSTER command for distributed tables	2017-11-10 12:14:45 +02:00
Burak Yücesoy	863df0b874	Merge branch 'master' into fix_partitioning_in_schema	2017-11-09 12:49:35 +02:00
Burak Yucesoy	17229ed7bd	Fix attaching partition to a distributed table in schema While attaching a partition to a distributed table in schema, we mistakenly used unqualified name to find partitioned table's oid. This caused problems while using partitioned tables with schemas. We are fixing this issue in this PR.	2017-11-09 13:20:29 +03:00
Onder Kalaci	94921a2be1	Skip page-level locks on distributed deadlock detection Short-term share/exclusive page-level locks are used for read/write access. Locks are released immediately after each index row is fetched or inserted. Since those locks may not lead to any deadlocks, it's safe to ignore them in the distributed deadlock detection.	2017-11-09 10:37:23 +02:00
Marco Slot	f71728f634	Add GUC for specifying sslmode in connections to workers	2017-11-08 14:15:58 +01:00
Murat Tuncer	4e3d633ebf	Add check for connection failures during multishard update (#1765 )	2017-11-07 12:33:25 +02:00
Hadi Moshayedi	6d79d25101	Fix a relcache reference leak in stats collection. In DistributedTablesSize() we didn't close the relations that had replication factor > 2. This caused relcache reference leaks, and warning messages like following in logs: WARNING: relcache reference leak: relation "researchers" not closed	2017-11-06 23:16:43 -05:00
metdos	c83edc36b5	Check connection status before using it	2017-11-06 14:53:35 +02:00
Brian Cloutier	7be1545843	Support implicit casts during INSERT/SELECT It's possible to build INSERT SELECT queries which include implicit casts, currently we attempt to support these by adding explicit casts to the SELECT query, but this sometimes crashes because we don't update all nodes with the new types. (SortClauses, for instance) This commit removes those explicit casts and passes an unmodified SELECT query to the COPY executor (how we implement INSERT SELECT under the scenes). In lieu of those cases, COPY has been given some extra logic to inspect queries, notice that the types don't line up with the table it's supposed to be inserting into, and "manually" casting every tuple before sending them to workers.	2017-11-03 22:27:15 -07:00
Marco Slot	6883a09cdd	Allow distributed partitioned table creation in Cloud	2017-11-03 10:09:18 +01:00
Marco Slot	6219186683	Allow distributed INSERT...SELECT via worker nodes in MX	2017-11-02 14:38:39 +01:00
Hadi Moshayedi	7280774cf4	Use list_length() != 1 in SingleReplicatedTable(). ShardPlacementList's implementation can return NIL. In previous implementation we got a segmentation fault in this case. The relation can be dropped after getting distributed table list but before calling SingleReplicatedTable().	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	7691991cb5	Do PG_TRY() inside a subtransaction block. If we don't propagate the errors we are catching in PG_CATCH(), database's internal state might not be clean. So we do PG_TRY() inside a subtransaction so we can rollback to it after catching errors.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	9bfbbf8a04	Make reports hostname configurable and enable stats collection in tests. This patch adds --with-reports-host configure option, which sets the REPORTS_BASE_URL constant. The default is reports.citusdata.com. It also enables stats collection in tests.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	acaf085a80	Add callback function for request by CollectBasicUsageStatistics(). Curl writes the received response to stdout if we don't specify a response callback or an output file. This can pollute the PostgreSQL log. In this change we add a callback function so the response messages aren't added to the log file.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	747e439601	Limit number of stats collection retries to once a day.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	78a2cd9052	Check for Citus updates. Sends a request to /v1/releases/latest?flavor=$CITUS_EDITION once a day, which returns a response similar to {"version": "7.1.0", "major": 7, "minor": 1, "patch": 0}. Then compares it with current Citus version, and if the latest release is newer, logs a LOG message.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	34f3ec0961	Call FlushDistTableCache() before stats collection.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	c18c6625d9	Lock relations before calling citus_table_size(). This is to make sure they don't get dropped.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	97d544b75c	Follow the patterns used in Deadlock Detection in Stats Collection. This includes: (1) Wrap everything inside a StartTransactionCommand()/CommitTransactionCommand(). This is so we can access the database. This also switches to a new memory context and releases it, so we don't have to do our own memory management. (2) LockCitusExtension() so the extension cannot be dropped or created concurrently. (3) Check CitusHasBeenLoaded() && CheckCitusVersion() before doing any work. (4) Do not PG_TRY() inside a loop.	2017-10-31 21:51:43 -04:00
Marco Slot	100aaeb3f5	Fix typo in distributed deadlock error message	2017-10-31 19:39:32 +01:00
metdos	8c356b2bc8	Don't try to add restrictions for reference tables in insert into select	2017-10-31 19:44:10 +02:00
mehmet furkan şahin	32fb19911c	Add Constraint %s Add Primary Key Using index %s support This commit makes a change in relay_event_utility.c to check if the Alter Table command adds a constraint using index. If this is the case, it appends the shard id to the index name.	2017-10-31 16:03:56 +03:00
Marco Slot	7e34348334	Add shard transfer mode parameter to shard copy functions	2017-10-31 13:30:48 +01:00
Marco Slot	2bb46bb5ee	Reset connectionReady flag after moving a connection in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	e6e6897499	Defer initial PQflush to main loop in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	d6dadb1b25	Use correct index for ModifyWaitEvent in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Furkan Sahin	2b39c52f0b	Replica identity on create_distributed_table By this commit, citus minds the replica identity of the table when we distribute the table. So the shards of the distributed table have the same replica identity with the local table.	2017-10-31 13:08:36 +03:00
Marco Slot	7f68f78ee9	Omit public schema from shard_name output	2017-10-31 00:22:07 +01:00
Murat Tuncer	e16805215d	Support count(distinct) for non-partition columns (#1692 ) Expands count distinct coverage by allowing more cases. We used to support count distinct only if we can push down distinct aggregate to worker query i.e. the count distinct clause was on the partition column of the table, or there was a grouping on the partition column. Now we can support - non-partition columns, with or without grouping on partition column - partition, and non partition column in the same query - having clause - single table subqueries - insert into select queries - join queries where count distinct is on partition, or non-partition column - filters on count distinct clauses (extends existing support) We first try to push down aggregate to worker query (original case), if we can't then we modify worker query to return distinct columns to coordinator node. We do that by adding distinct column targets to group by clauses. Then we perform count distinct operation on the coordinator node. This work should reduce the cases where HLL is used as it can address anything that HLL can. However, if we start having performance issues due to very large number rows, then we can recommend hll use.	2017-10-30 13:12:24 +02:00
Marco Slot	be46661bf7	Block only 2PCs instead of all writes in citus_create_restore_point	2017-10-27 00:07:32 +02:00
mehmet furkan şahin	61ae33dc7f	ALTER TABLE .. REPLICA IDENTITY support is implemented	2017-10-26 13:44:28 +03:00
Brian Cloutier	4a17d12d74	Replace uint with uint32	2017-10-25 19:32:12 -07:00
velioglu	0b5db5d826	Support multi shard update/delete queries	2017-10-25 15:52:38 +03:00
Marco Slot	4bde83e1d2	Relay error message if DML fails on worker	2017-10-25 14:23:21 +02:00
Hadi Moshayedi	9a04b78980	Send server_id for statistics reports. (#1698 ) This change introduces the `pg_dist_node_metadata` which has a single jsonb value. When creating the extension, a random server id is generated and stored in there. Everything in the metadata table is added as a nested objected to the json payload that is sent to the reports server.	2017-10-18 21:20:32 -04:00
Hadi Moshayedi	86bcd93a4a	Don't collect stats when there is a version mismatch. (#1712 ) The following scenario can cause an Assert() crash if we don't do this: - Install Citus v7.0-15 - Restart server & run a query to start maintenanced. - Install Citus v7.1 - Restart server & run a query. This will tell user to upgrade. - Type "UPDATE EXTENSION c" & press tab. maintenanced will start and crash with Assert(CitusHasBeenLoaded() && CheckCitusVersion(WARNING)); This change checks Citus version before calling metadata functions so the crash doesn't happen.	2017-10-17 14:01:14 -04:00
Jason Petersen	8544878c4b	Add citus_version(), analogous to PG's version() This will provide the full project name (i.e. Citus/Citus Enterprise), and the host system, compiler, and architecture word size. I wanted to limit the number of copied files in 'config', so I added only config.guess and call it manually, rather than using the macro AC_CANONICAL_HOST, which requires several other files.	2017-10-16 18:09:29 -06:00
Brian Cloutier	91ff8cd2d5	{*,}create_distributed_table doesn't emit OID (#1710 )	2017-10-16 18:08:51 -06:00
Brian Cloutier	ebcb2b65e9	Add master_move_node function	2017-10-16 10:51:28 -07:00
Brian Cloutier	58cf15ceca	DistributedTableSize doesn't emit oid when erring out	2017-10-14 02:42:57 +03:00
Hadi Moshayedi	2aec6eda49	Properly use #ifdef HAVE_LIBCURL.	2017-10-13 12:04:36 -06:00
Jason Petersen	01353cb7cb	Use header define rather than -D flag Eclipse apparently doesn't scan build output looking for -D flags, so having the value actually appear in a header is nicer for those of us using IDEs.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	946659aebe	Delete StatsCollection memory context after we are done with stats reporting. Previously we left the memory context untouched, which overtime leaked memory.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	873fd1e7ff	Fix compiling --without-libcurl. Previously <curl/curl.h> was included even if compiled --without-libcurl. This can fail when libcurl headers are not there. This commit guards this include by checks for HAVE_LIBCURL.	2017-10-13 11:00:09 -04:00
Murat Tuncer	4832abc7cb	Make multi_master_planner.c coding convention compliant Changed order of function definitions and added declarations in the beginning of the file	2017-10-13 14:59:48 +03:00
Murat Tuncer	f7ab901766	Add select distinct, and distinct on support Distinct, and distinct on() clauses are supported in simple selects, joins, subqueries, and insert into select queries.	2017-10-13 14:59:48 +03:00
Hadi Moshayedi	6879f92e23	Fix out of bound memeory access when getting HTTP response code. (#1699 )	2017-10-12 12:51:42 -04:00
Hadi Moshayedi	a1387f4aa8	Basic usage statistics collection. (#1656 ) Adds ```citus.enable_statistics_collection``` GUC variable, which ```true``` by default, unless built without libcurl. If statistics collection is enabled, sends basic usage data to Citus servers every 24 hours. The data that is collected consists of: - Citus version - OS name & release - Hardware Id - Number of tables, rounded to next power of 2 - Size of data, rounded to next power of 2 - Number of workers	2017-10-11 09:55:15 -04:00
Onder Kalaci	498ac80d8b	Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT This commit provides the support for window functions in subquery and insert into select queries. Note that our support for window functions is still limited because it must have a partition by clause on the distribution key. This commit makes changes in the files insert_select_planner and multi_logical_planner. The required tests are also added with files multi_subquery_window_functions.out and multi_insert_select_window.out.	2017-10-04 15:33:07 +03:00
Marco Slot	9e516513fc	Use local group ID when querying for prepared transactions	2017-10-03 16:36:53 +02:00
Hadi Moshayedi	11adb9b034	Push down LIMIT and HAVING when grouped by partition key. (#1641 ) We can do this because all rows belonging to a group are in the same shard when grouping by distribution column on a range/hash distributed table.	2017-10-02 20:17:51 -04:00
Marco Slot	394918f9d0	Invalidate worker and group ID cache in maintenance daemon	2017-10-02 18:14:29 +02:00
Marco Slot	43d5e79eaa	Execute transmit commands as superuser during task-tracker queries	2017-09-28 15:27:25 +02:00
Marco Slot	306c58d59b	Check for absolute paths in COPY with format transmit	2017-09-28 15:27:11 +02:00
Marco Slot	cb6b0e820c	Allow read-only users to run task-tracker queries	2017-09-28 13:52:36 +02:00
Marco Slot	da6b42a3e2	Use unique constraint index for transaction record deletion	2017-09-28 12:04:56 +02:00
Onder Kalaci	68ca8cb7f0	Skip relation extension locks We should skip if the process blocked on the relation extension since those locks are hold for a short duration while the relation is actually extended on the disk and released as soon as the extension is done. Thus, recording such waits on our lock graphs could yield detecting wrong distributed deadlocks.	2017-09-28 10:09:09 +03:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We sent multiple commands to worker when starting a transaction. Previously we only checked the result of the first command that is transaction 'BEGIN' which always succeeds. Any failure on following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Jason Petersen	b4d53423fa	Add adapter functions for OpenFile changes	2017-09-25 17:20:24 -07:00
Jason Petersen	d686123dae	Omit now-public Explain methods from PG11 build This copy-pasted code is no longer needed in PG11.	2017-09-25 17:20:24 -07:00
Jason Petersen	89d02c6115	Add ruleutils file for PostgreSQL 11	2017-09-25 17:20:24 -07:00
Jason Petersen	bbc15e0598	Handle HASHPROC changes PostgreSQL 11 now has "standard" and "extended" (64-bit) versions of hash functions.	2017-09-25 17:20:24 -07:00
Jason Petersen	6c9b19a954	Add version-compat header For polyfill macros, etc.	2017-09-25 17:20:23 -07:00
Jason Petersen	fbeaa2f9d0	Remove direct access to tupleDesc->attrs A level of indirection was removed from this field for PostgreSQL 11. By using the handy provided macro, we can be version agnostic.	2017-09-25 17:20:23 -07:00
Jason Petersen	6a020b5adc	Update CopyGetAttnums with latest from PostgreSQL This function was recently modified to use the TupleDescAttr wrapper, which abstracts away recent changes to TupleDesc.	2017-09-25 17:20:23 -07:00
Andres Freund	78716e5546	Fix possible shard cache incoherency. When a table and it's shards are dropped, and afterwards the same shard identifiers are reused, e.g. due to a DROP & CREATE EXTENSION, the old entry in the shard cache and the required entry in the shard cache might be for different tables. Force invalidation for both old and new table to fix.	2017-09-25 13:05:09 -07:00
velioglu	0a56ed910b	Change error message of queries with distributed and local table Citus can handle INSERT INTO ... SELECT queries if the query inserts into local table by reading data from distributed table. The opposite way is not correct. With this commit we warn the user if the latter option is used.	2017-09-22 13:46:19 -07:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Onder Kalaci	6736fd1682	Remove two obsolete functions Namely GetConnectionFromPGconn() and CloseConnectionByPGconn()	2017-09-21 00:36:23 -06:00
Onder Kalaci	33ec33c5b3	Ensure schema exists on reference table creation If the schema doesn't exists on the workers, create it.	2017-09-18 23:50:47 +03:00
Onder Kalaci	6116c8e93d	Allow pushing down GROUP BYs when at least there is one distribution column in the target list	2017-09-15 19:15:06 +03:00
Onder Kalaci	a5b66912d4	Expand reference table support in subquery pushdown With this commit, we relax the restrictions put on the reference tables with subquery pushdown. We did three notable improvements: 1) Relax equi-join restrictions Previously, we always expected that the non-reference tables are equi joined with reference tables on the partition key of the non-reference table. With this commit, we allow any column of non-reference tables joined using non-equi joins as well. 2) Relax OUTER JOIN restrictions Previously Citus errored out if any reference table exists at any point of the outer part of an outer join. For instance, See the below sketch where (h) denotes a hash distributed relation, (r) denotes a reference table, (L) denotes LEFT JOIN and (I) denotes INNER JOIN. (L) / \ (I) h / \ r h Before this commit Citus would error out since a reference table appears on the left most part of an left join. However, that was too restrictive so that we only error out if the reference table is directly below and in the outer part of an outer join. 3) Bug fixes We've done some minor bugfixes in the existing implementation.	2017-09-14 20:59:22 +03:00
Marco Slot	d1befa4df9	Wait for I/O to finish after PQputCopyData	2017-09-12 16:18:42 -07:00
Marco Slot	cbe16169b4	Free per-tuple COPY memory in INSERT...SELECT	2017-09-12 15:35:53 -07:00
Marco Slot	5fe0845d7e	Always copy MultiPlan in GetMultiPlan	2017-09-12 11:38:52 -07:00
Jason Petersen	8b2c3fcc15	Add clarifying comment to RngVarCallbackForDropIdx We don't need the PARTITION-related logic recently added in PostgreSQL.	2017-09-01 15:57:30 -06:00
Jason Petersen	ec30ad38ba	Update ruleutils_10 with latest PostgreSQL changes See: postgres/postgres@21d304dfed postgres/postgres@bb5d6e80b1 postgres/postgres@d363d42bb9 postgres/postgres@eb145fdfea postgres/postgres@decb08ebdf postgres/postgres@a3ca72ae9a postgres/postgres@bc2d716ad0 postgres/postgres@382ceffdf7 postgres/postgres@c7b8998ebb postgres/postgres@e3860ffa4d postgres/postgres@76a3df6e5e	2017-09-01 14:26:59 -06:00
Jason Petersen	ebecde8f6e	Update ruleutils_96 with latest PostgreSQL changes See: postgres/postgres@41ada83774 postgres/postgres@3b0c2dbed0 postgres/postgres@ff2d537223	2017-09-01 14:26:53 -06:00
Marco Slot	0aadbb1760	Convert multi-row INSERT target list to Vars	2017-08-25 10:55:56 +02:00
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Marco Slot	dbf18df995	Don't error out if BuildGlobalWaitGraph fails to connect	2017-08-23 19:08:03 +02:00
Onder Kalaci	c7bb29b69e	Prevent maintanince deamon crashes due to dead processes If after the distributed deadlock detection decides to cancel a backend, the backend has been terminated/killed/cancelled externally, we might be accessing to a NULL pointer. This commit prevents that case by ignoring the current distributed deadlock.	2017-08-23 15:44:09 +03:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Marco Slot	bd6bf29983	Don't add procs multiple times in BuildWaitGraphForSourceNode	2017-08-21 16:48:30 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Önder Kalacı	b82f886ad3	Merge branch 'master' into improve_deadlock_detection	2017-08-18 13:07:18 +03:00
Marco Slot	7523753a73	Clear metadata OID cache prior to deadlock detection	2017-08-18 11:20:24 +02:00
Andres Freund	b936bde936	Take AccessShareLock on the extension prior to running deadlock detection	2017-08-18 11:20:24 +02:00
Onder Kalaci	20679c9e8b	Relax assertion on deadlock detection considering self deadlocks.	2017-08-18 11:16:38 +03:00
Onder Kalaci	550a5578d8	Skip deadlock detection on the workers Do not run distributed deadlock detection on the worker nodes to prevent errornous decisions to kill the deadlocks.	2017-08-17 19:43:38 +03:00
Marco Slot	1eca53ad40	Exit maintenanced on database crash	2017-08-16 18:29:44 +02:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539 ) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493 .	2017-08-15 13:02:28 -04:00
Onder Kalaci	205501532a	Add version check to the maintenance daemon We should prevent running the deadlock detection if there is a major version change. Otherwise, the daemon may access to obsolete metadata catalog tables.	2017-08-15 18:47:13 +03:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Marco Slot	9232823070	Abort on failure on master connection during copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	df7723cde5	Should not commit on aborted non-critical connections	2017-08-15 13:44:20 +02:00
Eren Başak	77626c4238	Fix NULL nodeClusterString crush on pg_worker_list.conf migrations	2017-08-14 18:13:53 +03:00
Eren Başak	b3d2f9ba71	Fix pg_worker_list use-after-free bug This change fixes a use-after-free bug while renaming obsolete `pg_worker_list.conf` file, which causes Citus to crash during upgrade (or even extension creation) if `pg_worker_list.conf` exists.	2017-08-14 18:13:53 +03:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operation on parent table	2017-08-14 14:44:30 +03:00
Burak Yucesoy	52b9e35d50	Add relationIdList field to the Job struct	2017-08-14 14:06:22 +03:00
Onder Kalaci	5b48de7430	Improve deadlock detection for MX We added a new field to the transaction id that is set to true only for the transactions initialized on the coordinator. This is only useful for MX in order to distinguish the transaction that started the distributed transaction on the coordinator where we could have the same transactions' worker queries on the same node.	2017-08-12 13:28:37 +03:00
Onder Kalaci	59133415b0	Add logging infrasture for distributed deadlock detection We added a new GUC citus.log_distributed_deadlock_detection which is off by default. When set to on, we log some debug messages related to the distributed deadlock to the server logs.	2017-08-12 13:28:37 +03:00
Onder Kalaci	e5d5bdff51	Enable distributed deadlock detection on the maintenance deamon With this commit, the maintenance deamon starts to check for distributed deadlocks. We also introduced a GUC variable (distributed_deadlock_detection_factor) whose value is multiplied with Postgres' deadlock_timeout. Setting it to -1 disables the distributed deadlock detection.	2017-08-12 13:28:37 +03:00
Onder Kalaci	66936053a0	Improve error messages when a backend is cancelled by deadlock detection We send SIGINT to a backend that is cancelled due to a deadlock. That approach ends up being a very confusing error message. With this commit we intercept the error messages and show a more meaningful error message to the user.	2017-08-12 13:28:37 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag Now that we already have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned with distributed transaction ids introduced with `3369f3486f`. The dependency among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth first seach) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path that the cycle exists and cancel the youngest distributed transaction. Note that, we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	55992d4bc0	Disallow task-tracker queries on follower clusters	2017-08-12 11:47:31 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00
Marco Slot	53584affa8	Fix locking in create_distributed_table	2017-08-11 11:34:33 +03:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
velioglu	b0efffae1c	Correct planner and add more tests	2017-08-11 10:16:13 +03:00
velioglu	7550b8ad52	Fix anchor shard id selection when reference table exists	2017-08-11 10:09:47 +03:00
velioglu	ceba81ce35	Move physical planner checks to logical planner	2017-08-11 10:09:47 +03:00
velioglu	0359d03530	Add set operation check for reference tables	2017-08-11 10:09:47 +03:00
velioglu	c4e3b8b5e1	Add planner changes and tests for subquery on reference tables	2017-08-11 10:09:47 +03:00
velioglu	45717dd013	Check equivalence on reference tables for subquery pushdown	2017-08-11 10:09:47 +03:00
Marco Slot	0ae265c436	Add citus_create_restore_point for distributed snapshots	2017-08-11 07:36:20 +02:00
Marco Slot	fdff210ef7	Wait for commit/abort/prepare results asynchronously	2017-08-11 00:03:06 +02:00
Marco Slot	fca986f214	Add API for waiting for multiple connections	2017-08-11 00:03:06 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Brian Cloutier	0dee4f8418	Metadata sync syncs all nodes, not just primaries	2017-08-10 16:55:55 +03:00
Eren Başak	f9470329e5	Remove test_helper_functions.h inclusions	2017-08-10 12:42:46 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belong to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	dee66e3959	Final review feedback	2017-08-10 01:10:09 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively go through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing trans- actions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single- row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master- evaluated functions.	2017-08-10 00:32:46 -07:00
velioglu	7e436c0277	Add bool expression to pruning instance with a function	2017-08-10 08:56:36 +03:00
Andres Freund	e8b793c454	Support for IN (const, list) and = ANY(const, b, c) pruning.	2017-08-10 08:56:36 +03:00
Onder Kalaci	b5ea3ab6a3	Improve locking semantics for backend management We use the backend shared memory lock for preventing new backends to be part of a new distributed transaction or an existing backend to leave a distributed transaction while we're reading the all backends' data. The primary goal is to provide consistent view of the current distributed transactions while doing the deadlock detection.	2017-08-09 17:17:12 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	08ed6d8269	Prevent pg_dist_node changes during master_create_empty_shard	2017-08-09 14:22:09 +02:00
Marco Slot	3a0571e69b	Remove LockMetadataSnapshot	2017-08-09 14:09:54 +02:00
Marco Slot	c2f8bafa05	Fix shard creation vs. pg_dist_node change locking	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yucesoy	8455d1a4ef	Ensure we are allowing partitioned tables at all appropriate places	2017-08-09 10:01:35 +03:00
Burak Yucesoy	2eee556738	Add distributed partitioned table support for COPY For partitioned tables, PostgreSQL opens partition and its partitions in BeginCopyFrom and it expects its caller to close those relations. However, we do not have quick access to opened relations and performing special operations for partitioned tables isn't necessary in coordinator node. Therefore before calling BeginCopyFrom, we change relkind of those partitioned tables to RELKIND_RELATION. This prevents PostgreSQL to open its partitions as well.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	31f3221342	Add distributed partitioned table support to router plannable queries In standart_planner, PostgreSQL expands partitioned tables to their partitions and call our restriction hook for each partition. It also, for some queries, skips the partitioned table itself completely. This behaviour makes it difficult to prune shards and decide whether query is router plannable or not. To prevent this behaviour, we change inh flag of partitioned tables to false in the query tree. In this case, PostgreSQL treats those partitioned tables as regular relations and does not expand them. This behaviour is inline with our expectations, because we do not want to treat partitioned tables differently on coordinator. Although we are not entirely comfortable with modifying query tree, other solutions to this problem is overly complicated.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are; - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propogating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	d3e9746236	Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 18:32:34 +02:00
Brian Cloutier	7060ade6fe	GetNodeTuple returns NULL it node does not exist It never throws an error.	2017-08-08 13:12:06 +03:00
Brian Cloutier	a3e9bef685	All users of WorkerNodeHash take an AccessShareLock The metadata cache simulates a SELECT on pg_dist_node. Now the locks it takes also simulate that SELECT.	2017-08-08 13:12:06 +03:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	3151b52a0b	Add citus.cluster_name GUC - Nodes with a nodecluster which does not match citus.cluster_name are excluded from the metadata cache and never seen by another part of Citus.	2017-08-08 13:12:06 +03:00
Brian Cloutier	94947c0d54	Refactor: ReplicateShardToAllWorkers more explicitly locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	f87fefa323	Refactor: DistributedTableSize more explicitly only locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	3769381366	Fix inaccurate comment on SetNodeState	2017-08-08 13:12:06 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00

... 10 11 12 13 14 ...

1740 Commits (8f9ef63e8a993eb5a576aa7ca28463ee63f202b6)