citus

Commit Graph

Author	SHA1	Message	Date
Onur Tirtir	8f9ef63e8a	Implement get_relation_constraint_oid_compat helper (#3836)	2020-05-15 17:36:59 +03:00
MoYi	9e1f198155	Fix composite create type deparsing to preserve typmod	2020-05-15 13:12:54 +00:00
Onur Tirtir	249550b815	Refactor EnsureLocalTableEmptyIfNecessary (#3830)	2020-05-15 14:20:33 +03:00
Onur Tirtir	8f3373c702	Remove unused parameter from RecordDistributedRelationDependencies (#3831)	2020-05-15 10:34:35 +03:00
SaitTalhaNisanci	22c903b151	remove ExecuteUtilityTaskListWithoutResults (#3696) This PR removes ExecuteUtilityTaskListWithoutResults and uses the same path for local execution via ExecuteTaskListExtended. ExecuteUtilityTaskList is added. ExecuteLocalTaskListExtended now has a parameter for utility commands so that it can call the right method. In order not to change the existing calls, ExecuteTaskListExtendedInternal is added, which is the main method that runs the execution, via local and remote execution.	2020-05-07 13:30:50 +03:00
Nils Dijk	105de7beb8	Fix for pruned target list entries (#3818) DESCRIPTION: Ignore pruned target list entries in coordinator plan The postgres planner has the ability to prune target list entries that are proven not used in the output relation. When this happens at the `CitusCustomScan` boundary we need to _not_ return these pruned columns to not upset the rest of the planner. By using the target list the planner asks us to return we fix issues that lead to Assertion failures, and potentially could be runtime errors when they hit in a production build. Fixes #3809	2020-05-06 13:56:02 +02:00
Marco Slot	6ce2803777	Make sure we don't wrap GROUP BY expressions in any_value	2020-05-05 05:12:45 +02:00
Hadi Moshayedi	dbf509bbdd	Don't error out when cannot create maintenanced	2020-05-04 09:53:52 -07:00
Onder Kalaci	f9d4a9cf38	Remove assertion for subqueries in WHERE clause ANDed with FALSE In the code, we had the assumption that if restriction information is NULL, it means that we cannot have any distributed tables in the subquery. However, for subqueries in WHERE clause, that is not the case when the subquery is ANDed with FALSE. In that case, Citus operates on the originalQuery (which doesn't go through the standard_planner()), and relies on the restriction information generated by standard_planner(). As Postgres is smart enough to not generate restriction information for subqueries ANDed with FALSE, we hit the assertion.	2020-05-04 10:52:15 +02:00
Onder Kalaci	77c397e9ae	Rebuild wait event sets after PQconnectPoll() if socket changes The reason is that PQconnectPoll() may change the underlying socket. If we don't rebuild the wait event set, the low level APIs (such as epoll_ctl()) may fail due to invalid sockets. Instead, rebuilding ensures that we'll use accurate/active sockets.	2020-05-01 09:44:21 +02:00
Jelte Fennema	c6f5d5fe88	Add some asserts to pass static analysis (#3805)	2020-04-29 11:19:11 +02:00
SaitTalhaNisanci	cbda951395	Fix task copy and appending empty task in ExtractLocalAndRemoteTasks (#3802) * Not append empty task in ExtractLocalAndRemoteTasks ExtractLocalAndRemoteTasks extracts the local and remote tasks. If we do not have a local task the localTaskPlacementList will be NIL, in this case we should not append anything to local tasks. Previously we would first check if a task contains a single placement or not, now we first check if there is any local task before doing anything. * fix copy of node task Task node has task query, which might contain a list of strings in its fields. We were using postgres copyObject for these lists. Postgres assumes that each element of list will be a node type. If it is not a node type it will error. As a solution to that, a new macro is introduced to copy a list of strings.	2020-04-29 11:05:34 +03:00
Philip Dubé	b6b3c1bc17	Fix COPY TO's COPY (SELECT) with distributed table having generated columns It's necessary to omit generated columns from output	2020-04-28 14:40:47 +00:00
SaitTalhaNisanci	164c00cf08	Fix typo: longer visible -> no longer visible (#3803)	2020-04-27 16:32:46 +03:00
Onder Kalaci	bc54c5125f	Increase the default value of citus.node_connection_timeout The previous default was 5 seconds, and we change it to 30 seconds. The main motivation for this is that for busy clusters, 5 seconds can be too aggressive. Especially with connection throttling, the servers might be kept busy for a really long time, and users may see the connection errors more frequently. We've done some sanity checks, for really quick queries (like `SELECT count(*) from table`), 30 seconds is a decent value even if users execute 300 distributed queries on the coordinator. We've verified this on Hyperscale(Citus).	2020-04-24 15:16:42 +02:00
Onder Kalaci	0cb7ab2d05	Explicitly mark queries in physical planner for [not] having parameters Physical planner doesn't support parameters. If the parameters have already been resolved when the physical planner is handling the queries, mark it. The reason is that the executor is unaware of this, and sends the parameters along with the worker queries, which fails for composite types. (See `DissuadePlannerFromUsingPlan()` for the details of parameter resolving)	2020-04-24 12:49:43 +02:00
Onur Tirtir	2e927bd6b7	Bump Citus to 9.4devel (#3788)	2020-04-22 12:50:00 +03:00
Hanefi Önaldı	e85b835065	Skip dependency setup on coordinator node	2020-04-21 12:06:31 +03:00
Philip Dubé	9093d51a22	maintenanced: handle before_shmem_exit, assert workerPid == 0 on start	2020-04-20 14:41:40 +00:00
Onder Kalaci	e182215d96	Improve connection error message from the worker nodes We currently put the actual error message to the detail part. However, many drivers don't show the detail part. As connection errors are somewhat common, and hard to trace back, we've added the detail to the message itself. In addition to that, we changed the "connection error" message, as it was confusing to the users who think that the error was happening while connecting to the coordinator. In fact, this error is showing up when the coordinator fails to connect to remote nodes.	2020-04-20 13:32:55 +02:00
Hadi Moshayedi	1250d691d3	Replicate reference tables before master_create_empty_shard	2020-04-17 16:47:03 -07:00
Philip Dubé	8e79672839	Try copying shard intervals out of cache for long lived borrow	2020-04-17 22:00:41 +00:00
Philip Dubé	c00d57a955	CreateDistributedInsertSelectPlan: avoid calling GetCitusTableCacheEntry in a way that would invalidate live ShardInterval pointers	2020-04-17 14:44:23 +00:00
SaitTalhaNisanci	1d0f4bdcd2	invalidate plan cache in master_update_node (#3758) * invalidate plan cache in master_update_node If a plan is cached by postgres but a user uses master_update_node, then when the plan cache is used for the updated node, they will get the old nodename/nodeport in the plan. This is because the plan cache doesn't know about the master_update_node. This could be a problem in prepared statements or anything that goes into plancache. As a solution the plan cache is invalidated inside master_update_node. * add invalidate_inactive_shared_connections test function We introduce invalidate_inactive_shared_connections udf to be used in testing. It is possible that a connection count for an inactive node will be greater than 0 and in that case it will not be removed at the time of invalidation. However, later we don't have a mechanism to remove it, which means that it will stay in the hash. For this not to cause a problem, we use this udf in testing. * move invalidate_inactive_shared_connections to udfs from test as it will be used in mx * remove the test udf * remove the IsInactive check	2020-04-17 17:43:48 +03:00
Philip Dubé	c0a95a3adb	Copy data from CitusTableCacheEntry more often This copies over fixes from reference counting branch: all CitusTableCacheEntry data may be freed when a GetCitusTableCacheEntry call occurs for its relationId. This fix is not complete, but reference counting is being deferred until 9.4. CopyShardInterval: remove dest parameter, always return newly allocated object	2020-04-17 14:17:18 +00:00
Önder Kalacı	a919f09c96	Remove the entries from the shared connection counter hash when no connections remain (#3775) We initially considered removing entries just before any change to pg_dist_node. However, that ended up being very complex and making MX even more complex. Instead, we're switching to a simpler solution, where we remove entries when the counter gets to 0. With certain workloads, this may have some performance penalty. But, two notes on that: - When counter == 0, it implies that the cluster is not busy - With cached connections, that's not possible	2020-04-17 17:14:58 +03:00
Philip Dubé	e4a4707f4a	Avoid setting hasWindowFuncs true after window functions have been optimized out of query	2020-04-17 12:22:48 +00:00
SaitTalhaNisanci	a9a3be15cc	introduce TASK_QUERY_NULL task type (#3774) When we call SetTaskQueryString we would set the task type to TASK_QUERY_TEXT, and some parts of the codebase rely on the fact that if TASK_QUERY_TEXT is set, the data can be read safely. However if SetTaskQueryString is called with a NULL taskQueryString this can cause crashes. In that case taskQueryType will simply be set to TASK_QUERY_NULL.	2020-04-17 14:59:22 +03:00
Hanefi Önaldı	0c5d0cfee9	Notice message to help truncate local data after distribution	2020-04-17 13:21:34 +03:00
Hanefi Önaldı	d535121f8d	Introduce truncate_local_data_after_distributing_table()	2020-04-17 13:21:34 +03:00
Hadi Moshayedi	61198251fd	Use block_writes for replicate_reference_tables	2020-04-16 19:25:41 -07:00
Nils Dijk	1d6ba1d09e	Refactor alter role to work on distributed roles (#3739) DESCRIPTION: Alter role only works for citus managed roles Alter role was implemented before we implemented good role management that hooks into the object propagation framework. This is a refactor of all alter role commands that have been implemented to - be on by default - only work for supported roles - make the citus extension owner a supported role Instead of distributing the alter role commands for roles at the beginning of the node activation role it now _only_ executes the alter role commands for all users in all databases and in the current database. In preparation of full role support small refactors have been done in the deparser. Earlier tests targeting other roles than the citus extension owner have been either slightly changed or removed to be put back where we have full role support. Fixes #2549	2020-04-16 12:23:27 +02:00
Hadi Moshayedi	59b9a4e5a1	Detect deadlocks in replicate_reference_tables()	2020-04-15 11:06:18 -07:00
SaitTalhaNisanci	df9048ebaa	update outdated comments related to local_execution (#3759)	2020-04-15 16:15:43 +03:00
Marco Slot	8b83306a27	Issue worker messages with the same log level	2020-04-14 21:08:25 +02:00
SaitTalhaNisanci	132efdbc56	add execution params struct (#3747) We had 9+ parameters in some of the functions related to execution. Execution params is created to simplify this a bit so that we can set only the fields that we are interested in and it is easier to read.	2020-04-14 14:32:40 +03:00
Onder Kalaci	aa6b641828	Throttle connections to the worker nodes With this commit, we're introducing a new infrastructure to throttle connections to the worker nodes. This infrastructure is useful for multi-shard queries; router queries are not affected by this. The goal is to prevent establishing more than citus.max_shared_pool_size number of connections per worker node in total, across sessions. To do that, we've introduced a new connection flag OPTIONAL_CONNECTION. The idea is that some connections are optional, such as the second (and further connections) for the adaptive executor. A single connection is enough to finish the distributed execution, the others are useful to execute the query faster. Thus, they can be considered optional connections. When an optional connection is not allowed to the adaptive executor, it simply skips it and continues the execution with the already established connections. However, it'll keep retrying to establish optional connections, in case some slots are open again.	2020-04-14 10:27:48 +02:00
Onder Kalaci	38b8a9ad62	Add citus_remote_connection_stats() function This function is intended to be used for monitoring the remote connections.	2020-04-14 10:03:27 +02:00
Onder Kalaci	0dbfbe0c37	Add the necessary shared memory infrastructure - The hashmap in the shared memory - The lock to access the hashmap - The GUC to control the size	2020-04-14 10:03:26 +02:00
Hadi Moshayedi	f9de734329	Ensure metadata is synced on ReplicateColocatedShardPlacement	2020-04-13 11:45:21 -07:00
Hadi Moshayedi	2218b7e38d	Refactor ReplicateColocatedShardPlacement	2020-04-13 11:07:26 -07:00
SaitTalhaNisanci	2438e80a58	use CURSOR_OPT_PARALLEL_OK flag in local execution (#3745) We currently don't use any cursor flags in local execution, but we can use CURSOR_OPT_PARALLEL_OK flag to potentially benefit from parallelism when possible.	2020-04-12 19:49:22 +03:00
Philip Dubé	30f10984e1	Defer get_agg_clause_costs, it happens later & avoids errors	2020-04-10 13:26:05 +00:00
Philip Dubé	ab0b59ad3b	GetConnParams: Set runtimeParamStart before setting keywords/values to avoid out of bounds access	2020-04-10 13:14:06 +00:00
SaitTalhaNisanci	07f9a442b0	Refactor CopyLocalDataIntoShards (#3693) This PR: - Declares variables when they are needed. - Creates DoCopyFromLocalTableIntoShards for better readability. - Doesn't use a hardcoded value, instead uses a variable for better readability.	2020-04-10 09:25:26 +03:00
Marco Slot	a4b2197450	Correctly handle non-constant LIMIT/OFFSET clauses	2020-04-09 19:59:50 +00:00
SaitTalhaNisanci	3dc7cad754	use an enum for local execution status (#3733) We have two variables that are related to local execution status: TransactionAccessedLocalPlacement and TransactionConnectedToLocalGroup. Only one of these fields should be set, however we didn't have any check for this constraint and it was error prone. These two variables are used to decide whether we should execute the current query, and therefore its tasks, via local execution in the current session or over a connection. With the enum, it is now clearer what these variables mean. Also, now we have a method to change the local execution status. The method will error if we are trying to transition from a state to a wrong state. This will help us avoid problems.	2020-04-09 19:11:04 +03:00
SaitTalhaNisanci	24dcb02bca	enable local table join with reference table (#3697) * enable local table join with reference table * test different cases with local table and reference join	2020-04-09 15:25:54 +03:00
SaitTalhaNisanci	233e4a24d1	use local execution within transaction block (#3714) * use local execution when in a transaction block When we are inside a transaction block, there could be other methods that need local execution, therefore we will use local execution in a transaction block. * update test outputs with transaction block local execution * add a test to verify we don't leak intermediate schemas	2020-04-09 12:41:58 +03:00
SaitTalhaNisanci	fa88046ce1	test that we don't leak intermediate schemas (#3737) * test that we don't leak intermediate schemas We have tests to make sure that we don't leak any intermediate files, tables etc. but we don't test if we are leaking schemas. It makes sense to test this as well. * remove all repartition schemas in case of error This solution is not an ideal one but it seems to be doing the job. We should have a more generic solution for the cleanup but it seems that putting the cleanup in the abort handler is dangerous and it was crashing.	2020-04-09 12:17:41 +03:00
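
Note on aa6b641828 and a919f09c96: the throttling behaviour described in those two messages can be pictured with a small sketch. This is not the Citus implementation, which is written in C and keeps its counters in a lock-protected shared-memory hashmap (see 0dbfbe0c37). The class `SharedConnectionCounter` and its methods are hypothetical names chosen for illustration. The sketch also assumes that required (non-optional) connections are simply counted and never refused, since the messages only describe what happens to optional ones.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

NodeKey = Tuple[str, int]  # (hostname, port) of a worker node


@dataclass
class SharedConnectionCounter:
    """Hypothetical per-worker connection counter (illustration only)."""

    max_shared_pool_size: int
    counts: Dict[NodeKey, int] = field(default_factory=dict)

    def try_reserve(self, node: NodeKey, optional: bool) -> bool:
        """Count a new connection to `node`.

        An optional connection is refused once the node is at the limit,
        so the caller can continue with the connections it already has.
        Assumption: required connections are always counted here.
        """
        current = self.counts.get(node, 0)
        if optional and current >= self.max_shared_pool_size:
            return False
        self.counts[node] = current + 1
        return True

    def release(self, node: NodeKey) -> None:
        """Drop one connection; remove the entry when the count hits 0."""
        remaining = self.counts[node] - 1
        if remaining == 0:
            del self.counts[node]
        else:
            self.counts[node] = remaining
```

A caller in this sketch would request its first connection with `optional=False` and every further one with `optional=True`, retrying the optional ones later in case a slot has been released.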

1740 Commits (8f9ef63e8a993eb5a576aa7ca28463ee63f202b6)