citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	00b9338294	Pushdown only necessary projections in the recursive relation planning With this commit, we only pull & push the necessary columns. In this context, necessary columns means the columns that are required for the query to be executed when the relation is wrapped into a subquery. We could potentially optimize things further: (a) If a column only appears as a qual filter, we don't need to pull it to the coordinator (b) We currently pull unnecessary columns as NULL. However, we could potentially adjust the remainder of the query tree and not add columns of the relation to the target entry.	2018-11-27 13:07:32 +03:00
Onder Kalaci	3ac4c1c3a2	Pushdown filters on recursive relation planning PostgreSQL already keeps track of the restrictions that are on the relation. With this commit, Citus uses that information and pushes down the filters to the subquery that is recursively planned for the table that is in consideration.	2018-11-27 13:07:32 +03:00
Onder Kalaci	94ea93c0ae	Basic implementation of recursive planning of non-colocated relation joins	2018-11-27 13:07:32 +03:00
Marco Slot	1ec5b6c890	Remove old worker_hash_partition_table API	2018-11-26 14:40:37 +01:00
Marco Slot	5a63deab2e	Clean up UDFs and remove unnecessary permissions	2018-11-26 14:40:37 +01:00
Hanefi Onaldi	7db6991dc0	propagate validate queries to workers	2018-11-26 14:04:51 +03:00
Marco Slot	e8e956aa9f	Require superuser when using non-existent job schema in worker_merge_files_into_table	2018-11-24 02:57:16 +01:00
Marco Slot	c4ad899dd8	Check schema ownership in worker_merge_* functions	2018-11-23 11:05:09 +01:00
Marco Slot	e9a7295ead	Add multi-user tests for task-tracker protocol functions	2018-11-23 11:05:09 +01:00
Marco Slot	8e93fe5870	Check schema owner in task_tracker_assign_task	2018-11-23 11:05:09 +01:00
Marco Slot	ec957a833a	Check permission in task_tracker_task_status	2018-11-23 11:04:58 +01:00
Marco Slot	6aa5592e52	Add user ID suffix to intermediate files in re-partition jobs	2018-11-23 08:36:11 +01:00
Marco Slot	a59bf31c76	Use worker_execute_sql_task UDF in task-tracker executor	2018-11-22 18:15:33 +01:00
Marco Slot	30bad7e66f	Add worker_execute_sql_task UDF	2018-11-22 18:15:33 +01:00
Marco Slot	caf402d506	COPY to a task file no longer switches to superuser	2018-11-22 18:15:33 +01:00
Marco Slot	e17025e1d4	Check table ownership in mark_tables_colocated	2018-11-18 00:11:38 +01:00
Marco Slot	18acd00553	Check permissions in lock_relation_if_exists	2018-11-18 00:11:38 +01:00
Marco Slot	aab9f623eb	Check table ownership in upgrade_to_reference_table	2018-11-16 23:27:34 +01:00
Onder Kalaci	052ba21b19	Make sure to prevent unauthorized users from dropping sequences in Citus MX	2018-11-15 18:08:04 +03:00
Onder Kalaci	7f0a57a153	Make sure to prevent unauthorized users from dropping tables in Citus MX	2018-11-15 18:07:03 +03:00
Nils Dijk	f9520be011	Round robin queries to reference tables with task_assignment_policy set to `round-robin` (#2472 ) Description: Support round-robin `task_assignment_policy` for queries to reference tables. This PR allows users to query multiple placements of shards in a round robin fashion. When `citus.task_assignment_policy` is set to `'round-robin'` the planner will use a round robin scheduling feature when multiple shard placements are available. The primary use-case is spreading the load of reference table queries to all the nodes in the cluster instead of hammering only the first placement of the reference table. Since reference tables share the same path for selecting the shards with single shard queries that have multiple placements (`citus.shard_replication_factor > 1`) this setting also allows users to spread the query load on these shards. For modifying queries we do not apply a round-robin strategy. This would be negated by an extra reordering step in the executor for such queries where a `first-replica` strategy is enforced.	2018-11-15 15:11:15 +01:00
Marco Slot	2de8ef29c3	Revoke function permissions for node metadata functions	2018-11-15 06:25:07 +01:00
Marco Slot	f383e4f307	Description: Refactor code that handles DDL commands from one file into a module The file handling the utility functions (DDL) for citus organically grew over time and became unreasonably large. This refactor takes that file and splits the functionality into separate files per command. Initially modeled after the directory and file layout that can be found in postgres. Although the size of the change is quite big there are barely any code changes. Only two functions have been added for readability purposes: - PostProcessIndexStmt which is extracted from PostProcessUtility - PostProcessAlterTableStmt which is extracted from multi_ProcessUtility A README.md has been added to `src/backend/distributed/commands` describing the contents of the module and every file in the module. We need more documentation around the overloading of the COPY command, for now the boilerplate has been added for people with better knowledge to fill out.	2018-11-14 13:36:27 +01:00
Burak Yucesoy	f8e0d37ba1	Fix crashes caused by stack size increase under high memory load Each PostgreSQL backend starts with a predefined amount of stack and this stack size can be increased if there is a need. However, stack size increase during high memory load may cause unexpected crashes, because if there is not enough memory for stack size increase, there is nothing to do for the process apart from crashing. Interestingly, the process would get an OOM error instead of a crash if it had made an explicit memory request (with palloc, for example). However, in the case of stack size increase, there is no system call to get an OOM error, so the process simply crashes. With this change, we are increasing the stack size explicitly by requesting extra memory from the stack, so that, even if there is not enough memory, we can at least get an OOM instead of a crash.	2018-11-14 01:27:53 +03:00
Murat Tuncer	cc401a2616	Create function_utils for pg function call related utilities	2018-11-07 15:29:38 +03:00
Hadi Moshayedi	d3e284dcd6	Use heap_deform_tuple() instead of calling heap_getattr(). (#2464) After Fast ALTER TABLE ADD COLUMN with a non-NULL default in PG11, physical heaps might not contain all attributes after an ALTER TABLE ADD COLUMN happens. heap_getattr() returns NULL when the physical tuple doesn't contain an attribute. So we should use heap_deform_tuple() in these cases, which fills in the missing attributes. Our catalog tables evolve over time, and an upgrade might involve some ALTER TABLE ADD COLUMN commands. Note that we don't need to worry about postgres catalog tables and we can use heap_getattr() for them, because they only change between major versions. This also fixes #2453.	2018-11-05 15:11:01 -05:00
Onder Kalaci	9e2e2a7300	Make sure to access PARAM_EXTERN accurately in PG 11 PG 11 has changed the way that PARAM_EXTERN is processed. This commit ensures that Citus follows the same pattern. For details see the related Postgres commit: `6719b238e8`	2018-10-25 21:55:03 +03:00
Onder Kalaci	6e05921736	Processes that are blocked on advisory locks show up in wait edges Assign the distributed transaction id before trying to acquire the executor advisory locks. This is useful to show this backend in citus lock graphs (e.g., dump_global_wait_edges() and citus_lock_waits).	2018-10-24 13:32:13 +03:00
Hadi Moshayedi	3e00bf1c0d	Don't throw error for DROP DATABASE IF EXISTS	2018-10-23 09:45:03 -04:00
Jason Petersen	ae9a98c2d1	Attempt to address planner context crashes Both of these are a bit of a shot in the dark. In one case, we noticed a stack trace where a caller received a null pointer and attempted to dereference the memory context field (at 0x010). In the other, I saw that any error thrown from within AdjustParseTree could keep the stack from being cleaned up (presumably if we push we should always pop). Both stack traces were collected during times of high memory pressure and reproducing the problem locally or otherwise has been very tricky (i.e. it hasn't been reproduced reliably at all).	2018-10-18 08:41:51 -06:00
Hadi Moshayedi	431ac80563	Keep track of cached entries in case of interruption. (#2433 ) * Keep track of cached entries in case of interruption. Previously we set DistTableCacheEntry->sortedShardIntervalArray and DistTableCacheEntry->shardIntervalArrayLength after we entered all related shard entries into DistShardCacheHash. The drawback was that if populating DistShardCacheHash was interrupted, ResetDistTableCacheEntry() didn't see the shard hash entries created, so was unable to clean them up. This patch fixes that by setting sortedShardIntervalArray earlier, and incrementing shardIntervalArrayLength as we enter shards into the cache.	2018-10-15 14:06:56 -04:00
Jason Petersen	9fb951c312	Fix user-facing typos Lintian found these (presumably by looking in the text section and running them through e.g. aspell).	2018-10-09 16:54:03 -07:00
Onder Kalaci	73696a03e4	Make sure not to leak intermediate result folders on the workers	2018-10-09 22:47:56 +03:00
Marco Slot	d56baefe3d	Allow simple DML commands from hot standby	2018-10-06 10:54:44 +02:00
Murat Tuncer	4f8042085c	Fix drop schema in mx with partitioned tables Drop schema command fails in mx mode if there is a partitioned table with active partitions. This is due to the fact that the sql drop trigger receives all the dropped objects including partitions. When we call drop table on the parent partition, it also drops the partitions on the mx node. This causes the drop table command on partitions to fail on the mx node because they are already dropped when the partition parent was dropped. With this work we no longer require the table to exist on worker_drop_distributed_table.	2018-10-08 17:01:54 -07:00
velioglu	512d23934f	Show router modify, select and real-time queries on MX views	2018-10-02 13:59:38 +03:00
Murat Tuncer	9bdef67bab	Do not create inherited constraints on worker shards PG now allows foreign keys on partitioned tables. Each foreign key constraint on partitioned table is propagated down to partitions. We used to create all constraints on shards when we are creating a new shard, or when just simply moving a shard from one worker to another. We also used the same logic when creating a copy of coordinator table in mx node. With this change we create the constraint on worker node only if it is not an inherited constraint.	2018-09-28 14:14:51 +03:00
Murat Tuncer	653c7e4ae0	Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 11:13:21 +03:00
Onder Kalaci	cdc0d1491c	Make sure to use correct execution mode for TRUNCATE We used to set the execution mode in the truncate trigger. However, when multiple tables are truncated with a single command, we could set the execution mode very late. Instead, now set the execution mode on the utility hook.	2018-09-25 15:35:27 +03:00
Marco Slot	1ca9a5b867	Do not allow unresolved parameters in INSERT...SELECT	2018-09-24 14:12:04 +02:00
Marco Slot	877d703ac5	Evaluate functions (and when applicable, parameters) anywhere in query	2018-09-21 12:57:50 -06:00
Onder Kalaci	abc443d7fa	Make sure that shard repair considers replication factor	2018-09-21 15:24:49 +03:00
Onder Kalaci	8520a5b432	worker_append_table_to_shard becomes aware of partitioned tables	2018-09-21 14:40:42 +03:00
Onder Kalaci	c1b5a04f6e	Allow partitioned tables with replication factor > 1 With this commit, we allow partitioned distributed tables with replication factor > 1. However, we also have many restrictions. In summary, we disallow all kinds of modifications (including DDLs) on the partition tables. Instead, the user is allowed to run the modifications over the parent table. The necessity for such a restriction has two aspects: - We need to acquire shard resource locks appropriately - We need to handle marking partitions INVALID in case of any failures. Note that, in theory, the parent table should also become INVALID, which is too aggressive.	2018-09-21 14:40:41 +03:00
Murat Tuncer	b6930e3db9	Add distributed locking to truncated mx tables We acquire distributed lock on all mx nodes for truncated tables before actually doing truncate operation. This is needed for distributed serialization of the truncate command without causing a deadlock.	2018-09-21 14:23:19 +03:00
velioglu	d7f75e5b48	Add citus_lock_waits to show locked distributed queries	2018-09-20 14:13:51 +03:00
Murat Tuncer	0f6e514bfb	Fixes a bug on not being able to drop index on a partitioned table. Reason for the failure is that PG11 introduced a new relation kind RELKIND_PARTITIONED_INDEX to be used for partitioned indices. We expanded our check to cover that case.	2018-09-19 13:15:05 +03:00
Marco Slot	f34ab55389	Fix bug preventing rollback in stored procedure	2018-08-31 20:49:20 +02:00
Onder Kalaci	41d606b575	Use tree walker instead of mutator in relation visibility This commit uses _walker instead of _mutator for performance reasons. Given that we're only updating a functionId in the tree, the approach seems fine.	2018-09-18 09:33:01 +03:00
Onder Kalaci	4cae856846	Relax assertion on transaction abort on PREPARE step When a transaction failed on PREPARE, we used to hit an assertion ensuring that there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if any exists.	2018-09-17 18:09:16 +03:00

1072 Commits (00b93382940b4c27e5d0cedf176e5ed2de3596bf)