citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	6fc545c87b	Support temporary distributed tables	2017-08-25 11:56:46 +02:00
Marco Slot	0aadbb1760	Convert multi-row INSERT target list to Vars	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operating on parent table	2017-08-14 14:44:30 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag We now have the necessary infrastructure for detecting distributed deadlocks, so we no longer need enable_deadlock_prevention, which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Marco Slot	55992d4bc0	Disallow task-tracker queries on follower clusters	2017-08-12 11:47:31 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Jason Petersen	dee66e3959	Final review feedback	2017-08-10 01:10:09 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively goes through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing transactions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single-row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master-evaluated functions.	2017-08-10 00:32:46 -07:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are: - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propagating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	4cc7c36596	Simplify metadata lock acquisition for DML	2017-08-07 15:36:58 +02:00
Marco Slot	aa7ca81548	Execute UPDATE/DELETE statements with 0 shards	2017-08-07 15:36:58 +02:00
Hadi Moshayedi	8229a64fe8	Remove distributed tables' dependency on distribution key columns. (#1527) This change removes distributed tables' dependency on distribution key columns. We already check that we cannot drop distribution key columns in ErrorIfUnsupportedAlterTableStmt() at multi_utility.c, so we don't need to have distributed table to distribution key column dependency to avoid dropping of distribution key column. Furthermore, having this dependency causes some warnings in pg_dump --schema-only (See #866), which are not desirable. This change also adds check to disallow drop of distribution keys when citus.enable_ddl_propagation is set to false. Regression tests are updated accordingly.	2017-08-03 10:07:04 -04:00
Murat Tuncer	fa18899cf9	Remove serialization/deserialization of multiplan node (#1477) Introduces copy functions for Citus MultiPlan nodes. Uses ExtensibleNode mechanism to store MultiPlan data. Drops serialization of MultiPlans.	2017-08-02 08:24:00 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Marco Slot	d3785b97c0	Remove XactModificationLevel distinction between DML and multi-shard	2017-07-12 11:59:19 +02:00
Marco Slot	710fe8666b	Use GetPlacementListConnection for router DML	2017-07-12 11:26:23 +02:00
Marco Slot	29f21fea59	Use GetPlacementListConnection for multi-shard commands	2017-07-12 11:26:22 +02:00
Marco Slot	01c9b1f921	Use GetPlacementListConnection for router SELECTs	2017-07-12 11:26:22 +02:00
Burak Yucesoy	cb6070c720	Use ShareUpdateExclusiveLock instead of ShareLock in VACUUM Before this change, we used ShareLock to acquire lock on distributed tables while running VACUUM. This makes VACUUM and INSERT block each other. With this change we changed lock mode from ShareLock to ShareUpdateExclusiveLock, which does not conflict with the locks INSERT acquires.	2017-07-10 15:46:19 +03:00
Andres Freund	21c25abbb1	Move multi_client_executor to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	b96ba9b490	Fix code only enabled for 9.5. There's still supporting wrappers used, a subsequent commit will remove those. This also removes the already unused tuplecount_t define.	2017-06-26 08:46:32 -07:00
Jason Petersen	2204da19f0	Support PostgreSQL 10 (#1379) Adds support for PostgreSQL 10 by copying in the requisite ruleutils and updating all API usages to conform with changes in PostgreSQL 10. Most changes are fairly minor but they are numerous. One particular obstacle was the change in \d behavior in PostgreSQL 10's psql; I had to add SQL implementations (views, mostly) to mimic the pre-10 output.	2017-06-26 02:35:46 -06:00
Andres Freund	1691f780fd	Force cache invalidation machinery to be initialized earlier. Previously it was not guaranteed that invalidations were registered after creating the extension, only if the extension was used afterwards.	2017-06-23 11:20:10 -07:00
Marco Slot	2f8ac82660	Execute INSERT..SELECT via coordinator if it cannot be pushed down Add a second implementation of INSERT INTO distributed_table SELECT ... that is used if the query cannot be pushed down. The basic idea is to execute the SELECT query separately and pass the results into the distributed table using a CopyDestReceiver, which is also used for COPY and create_distributed_table. When planning the SELECT, we go through planner hooks again, which means the SELECT can also be a distributed query. EXPLAIN is supported, but EXPLAIN ANALYZE is not because preventing double execution was a lot more complicated in this case.	2017-06-22 15:46:30 +02:00
Murat Tuncer	0c4bf2d943	Remove fall back to select if poll is not available (#1466) poll is supported on all relevant systems, there is no need to have a fall back mechanism to use select()	2017-06-22 12:11:18 +03:00
Jason Petersen	294aeff2ed	Don't call PostProcessUtility for local commands It is intended only to aid in processing of distributed DDL commands, but as written could execute during local CREATE INDEX CONCURRENTLY commands.	2017-06-19 15:56:03 -06:00
Burak Yucesoy	c7bfa06cb9	Fix incorrect call to CheckInstalledVersion During version update, we indirectly called CheckInstalledVersion via CheckCitusVersions. This obviously fails because during version update it is expected to have version mismatch between installed version and binary version. Thus, we remove that CheckCitusVersions. We now only call CheckAvailableVersion.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	eea8c51e1f	Only error out on distributed queries when there is version mismatch Before this commit, we were erroring out at almost all queries if there is a version mismatch. With this commit, we started to error out only if the requested operation touches distributed tables. Normally we would need to use distributed cache to understand whether a table is distributed or not. However, it is not safe to read our metadata tables when there is a version mismatch, thus it is not safe to create distributed cache. Therefore for this specific occasion, we directly read from pg_dist_partition table. However, reading from the catalog is costly and we should not use this method in other places as much as possible.	2017-05-22 09:53:29 +03:00
Marco Slot	853f07dd33	Don't change query tree of DDL commands	2017-05-04 21:34:28 +02:00
Marco Slot	4ed093970a	Support expressions in the partition column in INSERTs	2017-04-21 14:05:52 +02:00
velioglu	24d24db25c	Implement ALTER TABLE ADD CONSTRAINT command	2017-04-20 15:02:33 +03:00
velioglu	8cbef819be	Log message of across shard queries according to the log level	2017-04-20 12:24:46 +03:00
Jason Petersen	5272c2c44b	Enable distributed ALTER TABLE ... RENAME COLUMN Pretty straightforward. Had some concerns about locking, but due to the fact that all distributed operations use either some level of deparsing or need to enumerate column names, they all block during any concurrent column renames (due to the AccessExclusive lock). In addition, I had some misgivings about permitting renames of the distribution column, but nothing bad comes from just allowing them. Finally, I tried to trigger any sort of error using prepared statements and could not trigger any errors not also exhibited by plain PostgreSQL tables.	2017-04-18 22:47:48 -06:00
Marco Slot	dfd7d86948	Stop using a sequence to generate unique job IDs	2017-04-18 11:31:51 +02:00
Marco Slot	5e58804d44	Support query parameters in combination with function evaluation	2017-04-17 15:40:55 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will be sent to that node. We also added two new UDFs: - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Jason Petersen	7e46f41c12	Add comments, use strncmp, clean up GUC desc. Good to go!	2017-04-04 16:16:49 -06:00
Jason Petersen	ef81b21a49	Clean up ErrorIfUnstableCreateOrAlterExtensionStmt Swaps an Assert in for an ereport, and adds details and hints to the error message to help users with a possibly confusing scenario.	2017-04-04 15:58:57 -06:00
Jason Petersen	ad3fbd9689	Refactor utility-skip/extn-check code This was getting pretty long and complex in the context of the main utility hook. Moved out the checks for what should skip Citus processing and what should have version checks performed.	2017-04-04 15:07:22 -06:00
Burak Yucesoy	a09614553f	Add enable_version_checks GUC and address feedback	2017-04-04 19:11:13 +03:00
Burak Yucesoy	087d8427e3	Error out if binary citus version does not match installed extension With this change, we start to error out if loaded citus binaries do not match the available major version or installed citus extension version. In this case we force the user to restart the server or run ALTER EXTENSION depending on the situation	2017-04-03 17:36:13 -06:00
Jason Petersen	4cdfc3a10f	Address review feedback Should just about do it.	2017-04-03 11:44:57 -06:00


191 Commits (6fc545c87b34145286a87722d726123d4457edf6)