Commit Graph

213 Commits (906dadddb738bf51010c443de83be9154dfa69fc)

Author SHA1 Message Date
Andres Freund b96ba9b490 Fix code only enabled for 9.5.
There are still supporting wrappers in use; a subsequent commit will
remove those.

This also removes the already unused tuplecount_t define.
2017-06-26 08:46:32 -07:00
Jason Petersen 2204da19f0 Support PostgreSQL 10 (#1379)
Adds support for PostgreSQL 10 by copying in the requisite ruleutils
and updating all API usages to conform with changes in PostgreSQL 10.
Most changes are fairly minor but they are numerous. One particular
obstacle was the change in \d behavior in PostgreSQL 10's psql; I had
to add SQL implementations (views, mostly) to mimic the pre-10 output.
2017-06-26 02:35:46 -06:00
Andres Freund 1691f780fd Force cache invalidation machinery to be initialized earlier.
Previously it was not guaranteed that invalidations were registered after
creating the extension; they were only registered if the extension was used
afterwards.
2017-06-23 11:20:10 -07:00
Marco Slot 2f8ac82660 Execute INSERT..SELECT via coordinator if it cannot be pushed down
Add a second implementation of INSERT INTO distributed_table SELECT ... that is used if
the query cannot be pushed down. The basic idea is to execute the SELECT query separately
and pass the results into the distributed table using a CopyDestReceiver, which is also
used for COPY and create_distributed_table. When planning the SELECT, we go through
planner hooks again, which means the SELECT can also be a distributed query.

EXPLAIN is supported, but EXPLAIN ANALYZE is not because preventing double execution was
a lot more complicated in this case.
2017-06-22 15:46:30 +02:00
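For illustration, a minimal sketch of a statement that takes this path; the
table and column names are hypothetical, and the LIMIT is one example of a
construct that prevents pushdown:

    -- The SELECT below cannot be pushed down to the shards as-is, so the
    -- coordinator plans and executes it (possibly as a distributed query)
    -- and streams the result into the target table via a CopyDestReceiver.
    INSERT INTO orders_summary (customer_id, order_count)
    SELECT customer_id, count(*)
    FROM orders
    GROUP BY customer_id
    ORDER BY count(*) DESC
    LIMIT 100;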
Murat Tuncer 0c4bf2d943 Remove fall back to select if poll is not available (#1466)
poll() is supported on all relevant systems, so there is no
need for a fallback mechanism that uses select().
2017-06-22 12:11:18 +03:00
Jason Petersen 294aeff2ed
Don't call PostProcessUtility for local commands
It is intended only to aid in processing of distributed DDL commands,
but as written could execute during local CREATE INDEX CONCURRENTLY
commands.
2017-06-19 15:56:03 -06:00
Burak Yucesoy c7bfa06cb9 Fix incorrect call to CheckInstalledVersion
During a version update, we indirectly called CheckInstalledVersion via
CheckCitusVersions. This obviously fails because during a version update a
mismatch between the installed version and the binary version is expected.
Thus, we remove that CheckCitusVersions call and now only call
CheckAvailableVersion.
2017-05-24 17:39:25 +03:00
Burak Yucesoy eea8c51e1f Only error out on distributed queries when there is version mismatch
Before this commit, we were erroring out on almost all queries if there was a
version mismatch. With this commit, we error out only if the requested
operation touches distributed tables.

Normally we would need to use the distributed cache to understand whether a
table is distributed or not. However, it is not safe to read our metadata
tables when there is a version mismatch, and thus it is not safe to create the
distributed cache. Therefore, for this specific occasion, we read directly
from the pg_dist_partition table. However, reading from the catalog is costly,
so we should avoid using this method in other places as much as possible.
2017-05-22 09:53:29 +03:00
Marco Slot 853f07dd33 Don't change query tree of DDL commands 2017-05-04 21:34:28 +02:00
Marco Slot 4ed093970a Support expressions in the partition column in INSERTs 2017-04-21 14:05:52 +02:00
velioglu 24d24db25c Implement ALTER TABLE ADD CONSTRAINT command 2017-04-20 15:02:33 +03:00
velioglu 8cbef819be Log message of across shard queries according to the log level 2017-04-20 12:24:46 +03:00
Jason Petersen 5272c2c44b
Enable distributed ALTER TABLE ... RENAME COLUMN
Pretty straightforward. Had some concerns about locking, but due to the
fact that all distributed operations use either some level of deparsing
or need to enumerate column names, they all block during any concurrent
column renames (due to the AccessExclusive lock).

In addition, I had some misgivings about permitting renames of the dis-
tribution column, but nothing bad comes from just allowing them.

Finally, I tried to trigger any sort of error using prepared statements
and could not trigger any errors not also exhibited by plain PostgreSQL
tables.
2017-04-18 22:47:48 -06:00
Marco Slot dfd7d86948 Stop using a sequence to generate unique job IDs 2017-04-18 11:31:51 +02:00
Marco Slot 5e58804d44 Support query parameters in combination with function evaluation 2017-04-17 15:40:55 +02:00
Burak Yucesoy e9095e62ec Decouple reference table replication
With this change we add an option to add a node without replicating all reference
tables to that node. If a node is added with this option, we mark the node as
inactive and no queries will be sent to that node.

We also added two new UDFs:
 - master_activate_node(host, port):
    - marks node as active and replicates all reference tables to that node
 - master_add_inactive_node(host, port):
    - only adds node to pg_dist_node
2017-04-17 13:33:31 +03:00
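For illustration, a sketch of the intended workflow using the UDF signatures
listed above; the host and port values are hypothetical:

    -- Add a node without replicating reference tables to it; the node is
    -- recorded in pg_dist_node but marked inactive, so no queries are
    -- routed to it yet.
    SELECT master_add_inactive_node('10.0.0.5', 5432);

    -- Later, replicate all reference tables to the node and mark it
    -- active so it starts receiving queries.
    SELECT master_activate_node('10.0.0.5', 5432);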
Jason Petersen 7e46f41c12
Add comments, use strncmp, clean up GUC desc.
Good to go!
2017-04-04 16:16:49 -06:00
Jason Petersen ef81b21a49
Clean up ErrorIfUnstableCreateOrAlterExtensionStmt
Swaps an Assert in for an ereport, and adds details and hints to the
error message to help users with a possibly confusing scenario.
2017-04-04 15:58:57 -06:00
Jason Petersen ad3fbd9689
Refactor utility-skip/extn-check code
This was getting pretty long and complex in the context of the main
utility hook. Moved out the checks for what should skip Citus process-
ing and what should have version checks performed.
2017-04-04 15:07:22 -06:00
Burak Yucesoy a09614553f Add enable_version_checks GUC and address feedback 2017-04-04 19:11:13 +03:00
Burak Yucesoy 087d8427e3
Error out if binary citus version does not match installed extension
With this change, we start to error out if the loaded citus binary does not
match the available major version or the installed citus extension version. In
this case we force the user to restart the server or run ALTER EXTENSION,
depending on the situation.
2017-04-03 17:36:13 -06:00
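For illustration, the extension-side remedy the error points to; the target
version string is hypothetical:

    -- When the installed extension lags behind the loaded binaries, update
    -- the extension (a binary mismatch instead requires restarting the
    -- server with the matching shared library):
    ALTER EXTENSION citus UPDATE;
    -- or pin a specific target version:
    ALTER EXTENSION citus UPDATE TO '6.2-1';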
Jason Petersen 4cdfc3a10f
Address review feedback
Should just about do it.
2017-04-03 11:44:57 -06:00
Jason Petersen cf775c4773
Improve CONCURRENTLY-related error messages
Thought this looked slightly nicer than the default behavior.

Changed preventTransaction to concurrent to be clearer that this code
path presently affects CONCURRENTLY code only.
2017-04-03 11:19:15 -06:00
Jason Petersen dd9365433e
Update documentation
Ensure all functions have comments, etc.
2017-04-03 11:19:15 -06:00
Jason Petersen d904e96c59
Address MX CONCURRENTLY problems
Adds a non-transactional multi-command method to propagate DDLs to all
MX/metadata-synced nodes.
2017-04-03 11:19:15 -06:00
Jason Petersen 32886e97a3
Add code to set index validity on failure
Coordinator code marks the index as invalid as a base, sets it valid in a
transactional layer atop that base, then proceeds with worker commands.
If a worker command has problems, the rollback results in an index with
isvalid = false. If everything succeeds, the user sees a valid index.
2017-04-03 11:19:15 -06:00
Jason Petersen dea6c44f75
Remove CONCURRENTLY checks, fix tests
Still pending failure testing, which broke with my recent changes.
2017-04-03 11:19:15 -06:00
Jason Petersen 0b6c4e756e
Change DropStmt to generate worker DDL on master
Because we can't execute DROP INDEX CONCURRENTLY during transactions,
worker_apply_shard_ddl_command is insufficient.
2017-04-03 11:19:15 -06:00
Jason Petersen 95d8d27c4f
Change IndexStmt to generate worker DDL on master
Because we can't execute CREATE INDEX CONCURRENTLY during transactions,
worker_apply_shard_ddl_command is insufficient.
2017-04-03 11:19:14 -06:00
Marco Slot 0f355a4a48 Batch task_tracker_status calls to reduce task-tracker query times 2017-03-31 11:54:11 +02:00
Metin Doslu 54a277ff01 Add disable/enable trigger all support 2017-03-29 22:00:14 +03:00
Jason Petersen 34a62abb7d
Address code review comments 2017-03-22 17:29:17 -06:00
Jason Petersen d95b5bbad3
Rework ReplicateGrantStmt to use new flow
This was the impetus for the previous commit that changed from using a
DDLJob * to a List * of them.
2017-03-22 17:29:16 -06:00
Jason Petersen 23f5e4282d
Change DDLJob usage to be wrapped in lists
To prepare for GRANT fixes.
2017-03-22 17:29:16 -06:00
Jason Petersen f181b24859
Move worker execution to after master, fix tests
Some tests relied on worker errors even though the local commands were invalid.
Fixed those by ensuring preconditions were met for the commands to work
correctly. Otherwise most test changes are related to slight changes
in local/remote error ordering.
2017-03-22 17:21:49 -06:00
Jason Petersen 419a4c3745
Remove execution from stmt-specific util functions
Now have a single Execute call in the main body.
2017-03-22 17:21:49 -06:00
Jason Petersen a64165767d
Rename Process*Stmt functions to Plan*Stmt
To reflect their new purpose: planning a DDLJob rather than fully
processing a distributed DDL statement.
2017-03-22 17:21:49 -06:00
Jason Petersen a02a2a90c7
Refactor ExecuteDistDDLCommand to expect struct
Will let us separate out the determination of what to execute from its
actual execution.
2017-03-22 17:21:49 -06:00
Metin Doslu 1f838199f8 Use CustomScan API for query execution
Custom Scan is a node in the planned statement that lets external providers
abstract the data scan, not just for foreign data wrappers but also for
regular relations, so they can benefit from their own caching or hardware
optimizations. This sounds like only an abstraction on the data scan layer,
but we can use it as an abstraction for our distributed queries. The only
thing we need to do is find the distributable parts of the query, plan them,
and replace them with a Citus Custom Scan. Then, whenever PostgreSQL hits this
custom scan node in its Volcano-style execution, it calls our callback
functions, which run the distributed plan and provide tuples to the upper node
as if scanning a regular relation. This means fewer code changes, fewer bugs,
and more supported features for us!

First, in the distributed query planner phase, we create a Custom Scan which
wraps the distributed plan. For the real-time and task-tracker executors, we
add this custom plan under the master query plan. For the router executor, we
directly pass the custom plan because there is no master query. Then, we
simply let the PostgreSQL executor run this plan. When it hits the custom scan
node, we call the related executor parts for the distributed plan, fill the
tuple store in the custom scan, and return results to the PostgreSQL executor
in Volcano style, one tuple per XXX_ExecScan() call.

* Modify planner to utilize Custom Scan node.
* Create different scan methods for different executors.
* Use native PostgreSQL Explain for master part of queries.
2017-03-14 12:17:51 +02:00
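For illustration, with a hypothetical distributed table, the change is visible
in ordinary EXPLAIN output:

    -- With this change, the printed plan tree contains a Citus Custom
    -- Scan node wrapping the distributed plan, and the master part of
    -- the query is explained using native PostgreSQL EXPLAIN.
    EXPLAIN SELECT customer_id, count(*) FROM orders GROUP BY customer_id;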
Andres Freund 52358fe891 Initial temp table removal implementation 2017-03-14 12:09:49 +02:00
Onder Kalaci 95f8382ca2 Bugfix for creating foreign key
This commit fixes a crash: adding a foreign key without
specifying the referenced column used to crash the backend.
2017-02-07 09:34:24 +02:00
Brian Cloutier e6e5f63d9d Utility hook does nothing if the extension is not loaded 2017-02-02 17:48:31 +02:00
Brian Cloutier 6843ad8e91 Fix bug where router executor sends query to failed connections 2017-01-27 09:40:30 +02:00
Brian Cloutier 1173f3f225 Refactor CheckShardPlacements
- Break CheckShardPlacements into multiple functions (The most important
  is MarkFailedShardPlacements), so that we can get rid of the global
  CoordinatedTransactionUses2PC.
- Call MarkFailedShardPlacements in the router executor, so we mark
  shards as invalid and stop using them while inside transaction blocks.
2017-01-26 13:20:45 +02:00
Marco Slot 85c1a87999 Short circuit in multi_ProcessUtility on ABORT/COMMIT 2017-01-25 11:57:00 +01:00
Marco Slot 2748660b1c Always skip foreign key validation when enable_ddl_propagation is off 2017-01-25 11:56:59 +01:00
Marco Slot ba940a1de9 Use coordinator instead of schema node in terminology 2017-01-25 11:07:23 +01:00
Marco Slot 1585c02322 Use placement connection API for multi-shard transactions 2017-01-23 18:34:50 +01:00
Andres Freund 6939cb8c56 Hack up PREPARE/EXECUTE for nearly all distributed queries.
All router, real-time, task-tracker plannable queries should now have
full prepared statement support (and even use router when possible),
unless they don't go through the custom plan interface (which
basically just affects LANGUAGE SQL (not plpgsql) functions).

This is achieved by forcing postgres' planner to always choose a
custom plan, by assigning very low costs to plans with bound
parameters (i.e. ones where the postgres planner replanned the query
upon EXECUTE with all parameter values provided), instead of the
generic one.

This requires some trickery, because for custom plans to work the
costs for a non-custom plan have to be known, which means we can't
error out when planning the generic plan.  Instead we have to return a
"faux" plan, that'd trigger an error message if executed.  But due to
the custom plan logic that plan will likely (unless called by an SQL
function, or because we can't support that query for some reason) not
be executed; instead the custom plan will be chosen.
2017-01-23 09:23:50 -08:00
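For illustration, a sketch of what now works, against a hypothetical
distributed table:

    -- Citus biases postgres toward custom plans, so each EXECUTE is
    -- planned with the bound parameter values and can use the router
    -- executor when possible.
    PREPARE count_by_customer (bigint) AS
        SELECT count(*) FROM orders WHERE customer_id = $1;
    EXECUTE count_by_customer(42);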
Marco Slot ea855ddf86 Add an enable_deadlock_prevention flag to allow router transactions to expand to multiple nodes 2017-01-22 17:31:24 +01:00
Andres Freund 78b085106a Remove connection_cache.[ch]. 2017-01-21 09:01:15 -08:00
Andres Freund 6ec34bed84 Remove remnants of commit_protocol.[ch]. 2017-01-21 09:01:15 -08:00
Andres Freund c390daed0f Use interrupt checking libpq wrappers in router executor. 2017-01-09 14:02:45 -08:00
Andres Freund 7320c17f00 Convert router executor to placement connection management infrastructure.
Remove the router specific transaction and shard management, and
replace it with the new placement connection API.  This mostly leaves
behaviour alone, except that it is now, inside a transaction, legal to
select from a shard to which no pre-existing connection exists.

To simplify the code, the handling of task executions for select and
modify has been split in two - the previous coding was starting to
get confusing due to the amount of only conditionally applicable code.

Modification connections & transactions are now always established in
parallel, not just for reference tables.
2017-01-09 13:13:02 -08:00
Onder Kalaci 6d050fd677 Use 2PC for reference table modification
With this commit, we ensure that the router executor always uses
2PC for reference table modifications and never marks their
placements as INVALID.
2017-01-04 12:46:35 +02:00
Eren Basak 7e09bd6836 Error on Unsupported Features on Workers
This change makes the metadata workers error out on unsupported commands.
2017-01-02 16:03:45 +03:00
Marco Slot 59bc5972fa
Use MultiConnection in multi-shard transactions 2016-12-30 14:43:21 -07:00
Burak Yucesoy bb9e95e134 Error out on foreign keys with reference tables
We have one replica of each reference table on every node. Therefore all
problems with replication factor > 1 also apply to reference tables. As a
solution we do not allow foreign keys on reference tables. It is not possible
to define a foreign key from, to, or between reference tables.
2016-12-28 10:58:26 +03:00
Marco Slot 92c7567008 Convert worker_transactions to new connection API 2016-12-23 16:14:29 +01:00
Eren Basak 71d73ec5ff Propagate DDL commands to metadata workers for MX tables 2016-12-23 15:43:32 +03:00
Marco Slot 11031bcf55 Enable evaluation of stable functions in INSERT..SELECT 2016-12-23 12:47:21 +01:00
Marco Slot d745d7bf70 Add explicit RelationShards mapping to tasks 2016-12-23 10:23:43 +01:00
Onder Kalaci 9f0bd4cb36 Reference Table Support - Phase 1
With this commit, we implemented some basic features of reference tables.

To start with, a reference table is
  * a distributed table without a distribution column defined on it
  * the distributed table is single sharded
  * and the shard is replicated to all nodes

Reference tables follow the same code path as single-sharded
tables. Thus, broadcast JOINs are applicable to reference tables.
But, since the table is replicated to all nodes, table fetching is
not required any more.

Reference tables support uniqueness constraints on any column.

Reference tables can be used in INSERT INTO .. SELECT queries with
the following rules:
  * If a reference table is in the SELECT part of the query, it is
    safe to join with another reference table and/or hash partitioned
    tables.
  * If a reference table is in the INSERT part of the query, all
    other participating tables should be reference tables.

Reference tables follow the regular co-location structure. Since
all reference tables are single sharded and replicated to all nodes,
they are always co-located with each other.

Queries involving only reference tables always follow the router planner
and executor.

Reference tables can have composite typed columns and there is no need
to create/define the necessary support functions.

All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE,
sequences, transactions, COPY, and schema support work on reference
tables as expected. Plus, all the pre-requisites associated with
distribution columns are dismissed.
2016-12-20 14:09:35 +02:00
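For illustration, a sketch with hypothetical tables; the creation UDF name is
an assumption, as this commit does not name one:

    -- 'countries' becomes a reference table: a single shard replicated
    -- to all nodes (the UDF name is an assumption).
    SELECT create_reference_table('countries');

    -- Broadcast JOIN with a hash-partitioned table; no table fetching is
    -- needed since every node holds a replica.
    SELECT o.order_id, c.region
    FROM orders o JOIN countries c ON (o.country_code = c.code);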
Eren Basak 296e0bd33a Add citus.node_connection_timeout GUC 2016-12-20 14:11:37 +03:00
Jason Petersen 6f95875191 Add targeted VACUUM/ANALYZE support
Adds support for VACUUM and ANALYZE commands which target a specific
distributed table. After grabbing the appropriate locks, this imple-
mentation sends VACUUM commands to each placement (using one connec-
tion per placement). These commands are sent in parallel, so users
with large tables will benefit from sharding. Except for VERBOSE, all
VACUUM and ANALYZE options are supported, including the explicit
column list used by ANALYZE.

As with many of our utility commands, the local command also runs. In
the VACUUM/ANALYZE case, the local command is executed before any re-
mote propagation. Because error handling is managed after local proc-
essing, this can result in a VACUUM completing locally but erroring
out when distributed processing commences: a minor technicality in all
cases, as there isn't really much reason to ever roll back a VACUUM (an
impossibility in any case, as VACUUM cannot run within a transaction).

Remote propagation of targeted VACUUM/ANALYZE is controlled by the
enable_ddl_propagation setting; warnings are emitted if such a command
is attempted when DDL propagation is disabled. Unqualified VACUUM or
ANALYZE is not handled, but a warning message informs the user of this.

Implementation note: this commit adds a "BARE" value to MultiShard-
CommitProtocol. When active, no BEGIN command is ever sent to remote
nodes, useful for commands such as VACUUM/ANALYZE which must not run in
a transaction block. This value is not user-facing and is reset at
transaction end.
2016-12-16 16:59:06 -07:00
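For illustration, the supported forms sketched against a hypothetical
distributed table:

    -- Runs locally first, then is sent to each placement in parallel
    -- (one connection per placement), with no BEGIN sent to remote nodes.
    VACUUM orders;
    -- Explicit column lists for ANALYZE are supported:
    ANALYZE orders (customer_id, order_date);
    -- All options except VERBOSE are supported; unqualified VACUUM or
    -- ANALYZE is not propagated, and a warning says so.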
Andres Freund 80b34a5d6b Integrate router executor into transaction management framework.
One less place managing remote transactions. It also makes it fairly
easy to use 2PC for certain modifications (e.g. reference tables). Just
issue a CoordinatedTransactionUse2PC(). If every placement failure
should cause the whole transaction to abort, additionally mark the
relevant transactions as critical.
2016-12-12 15:18:12 -08:00
Burak Yucesoy 8d7cd4d746 Add Foreign Key Support to ALTER TABLE commands
With this PR, we add foreign key support to ALTER TABLE commands. For now,
we only support foreign key constraint creation via an ALTER TABLE query, if
it is the only subcommand in the ALTER TABLE subcommand list.

We also only allow foreign key creation if replication factor is 1.
2016-12-08 15:03:25 +02:00
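For illustration, the single supported form, with hypothetical tables
(replication factor must be 1):

    -- Only allowed when the foreign key clause is the sole subcommand.
    ALTER TABLE orders
      ADD CONSTRAINT orders_customer_fkey
      FOREIGN KEY (customer_id) REFERENCES customers (customer_id);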
Andres Freund 2374905c89 Move multi_client_executor.[ch] ontop of connection_management.[ch].
That way connections can be automatically closed after errors and such,
and the connection management infrastructure gets wider testing.  It
also fixes a few issues around connection string building.
2016-12-07 11:44:24 -08:00
Andres Freund a77cf36778 Use connection_management.c from within connection_cache.c.
This is a temporary step towards removing connection_cache.c.
2016-12-07 11:44:24 -08:00
Andres Freund 3223b3c92d Centralized Connection Lifetime Management.
Connections are tracked and released by integrating into postgres'
transaction handling. That allows us to use connections without having to
disable interrupts or use PG_TRY/CATCH blocks to avoid leaking connections.

This is intended to eventually replace multi_client_executor.c and
connection_cache.c, and to provide the basis of a centralized
transaction management.

The newly introduced transaction hook should, in the future, be the only
one in citus, to allow for proper ordering between operations.  For now
this central handler is responsible for releasing connections and
resetting XactModificationLevel after a transaction.
2016-12-07 11:43:18 -08:00
Sumedh Pathak 0a0d4784b9 Change DDL error message to say "unsupported" instead of "supported" 2016-11-26 10:30:09 +01:00
Marco Slot b566c4815c Pass down the correct type for null parameters 2016-11-11 07:14:08 +01:00
Samay Sharma 82e5faa190 Avoid error during CREATE INDEX IF NOT EXISTS
Previously, we threw an error when we ran CREATE INDEX IF NOT EXISTS
with an already existing index. This change enables expected behavior by
checking if the statement has IF NOT EXISTS before throwing the error.
We also ensure that we don't execute the command on the workers, if an
index already exists on the master.
2016-11-01 14:51:19 -07:00
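For illustration, the fixed behavior with a hypothetical index:

    CREATE INDEX IF NOT EXISTS orders_customer_idx ON orders (customer_id);
    -- Re-running the statement is now a no-op with a notice instead of
    -- an error, and nothing is sent to the workers because the index
    -- already exists on the master.
    CREATE INDEX IF NOT EXISTS orders_customer_idx ON orders (customer_id);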
Brian Cloutier 50805f1e5c Copy raw_parse_tree before using it
Address citusdata/citus#922.

Fixes a segfault in PG's installcheck caused by our reuse of
raw_parse_tree when handling EXPLAIN EXECUTE.
2016-10-27 18:25:49 +03:00
Onder Kalaci a43e3bad56 Improve error semantics for INSERT..SELECT
With this commit, we error out if a worker query cannot be executed
on all placements of a target insert shard interval.
2016-10-27 14:09:05 +03:00
Marco Slot 275378aa45 Re-acquire metadata locks in RouterExecutorStart 2016-10-26 14:34:59 +02:00
Brian Cloutier 1e6d1ef67e Fix segfault during EXPLAIN EXECUTE
Fix citusdata/citus#886

The way postgres' explain hook is designed means that our hook is never
called during EXPLAIN EXECUTE. So, we special-case EXPLAIN EXECUTE by
catching it in the utility hook.  We then replace the EXECUTE with the
original query and pass it back to Citus.
2016-10-26 15:18:42 +03:00
Onder Kalaci 1673ea937c Feature: INSERT INTO ... SELECT
This commit adds INSERT INTO ... SELECT feature for distributed tables.

We implement INSERT INTO ... SELECT by pushing down the SELECT to
each shard. To compute that we use the router planner, by adding
an "uninstantiated" constraint that the partition column be equal to a
certain value. standard_planner() distributes that constraint to all
the tables where it knows how to push the restriction safely. An example
is tables that are connected via equi-joins.

The router planner then iterates over the target table's shards,
and for each we replace the "uninstantiated" restriction with one that
PruneShardList() handles. We do so by replacing the partitioning qual
parameter added in multi_planner() with the current shard's
actual boundary values. Also, add the current shard's boundary values to the
top level subquery to ensure that even if the partitioning qual is
not distributed to all the tables, we never run the queries on the shards
that don't match with the current shard boundaries. Finally, perform the
normal shard pruning to decide on whether to push the query to the
current shard or not.

We do not support certain SQL constructs in the subquery; these are
described/commented in ErrorIfInsertSelectQueryNotSupported().

We also added some locking on the router executor. When an INSERT/SELECT command
runs on a distributed table with replication factor >1, we need to ensure that
it sees the same result on each placement of a shard. So we added the ability
for the router executor to take exclusive locks on shards from which the
SELECT in an INSERT/SELECT reads, in order to prevent concurrent changes. This
is not an optimal solution, but it's simple and correct. The
citus.all_modifications_commutative setting can be used to avoid aggressive
locking.
An INSERT/SELECT whose filters are known to exclude any ongoing writes can be
marked as commutative. See RequiresConsistentSnapshot() for the details.

We also moved the decision of whether the multiPlan should be executed on
the router executor or not to the planning phase. This allowed us to
integrate multi-task router executor tasks into the router executor smoothly.
2016-10-26 10:01:00 +03:00
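For illustration, a pushdown-friendly sketch assuming hypothetical co-located
tables distributed on customer_id:

    -- The SELECT is pushed down to each shard of the target table; per
    -- shard, the "uninstantiated" partition-column restriction is
    -- replaced with that shard's boundary values before pruning.
    INSERT INTO customer_totals (customer_id, total)
    SELECT customer_id, sum(amount)
    FROM orders
    GROUP BY customer_id;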
Marco Slot 271b20a23e Parallelise DDL commands 2016-10-24 12:39:08 +02:00
Marco Slot 65f6d7c02a Follow consistent execution order in parallel commands 2016-10-19 08:33:08 +02:00
Marco Slot a497e7178c Parallelise master_modify_multiple_shards 2016-10-19 08:33:08 +02:00
Marco Slot 9d98acfb6d Move requiresMasterEvaluation from Task to Job 2016-10-19 08:23:06 +02:00
Marco Slot 213d8419c6 Refactor and redocument executor shard lock code 2016-10-19 08:13:35 +02:00
Andres Freund ac14b2edbc
Support PostgreSQL 9.6
Adds support for PostgreSQL 9.6 by copying in the requisite ruleutils
file and refactoring the out/readfuncs code to flexibly support the
old-style copy/pasted out/readfuncs (prior to 9.6) or use extensible
node APIs (in 9.6 and higher).

Most version-specific code within this change is only needed to set new
fields in the AggRef nodes we build for aggregations. Version-specific
test output files were added in certain cases, though in most they were
not necessary. Each such file begins by e.g. printing the major version
in order to clarify its purpose.

The comment atop citus_nodes.h details how to add support for new nodes
for when that becomes necessary.
2016-10-18 16:23:55 -06:00
Marco Slot 33b7723530 Use UpdateShardPlacementState where appropriate 2016-10-07 11:59:20 -07:00
Andres Freund 982ad66753 Introduce placement IDs.
So far placements were assigned an Oid, but that was just used to track
insertion order. It also did so incompletely, as it was not preserved
across changes of the shard state. The behaviour around oid wraparound
was also not entirely as intended.

The newly introduced, explicitly assigned, IDs are preserved across
shard-state changes.

The prime goal of this change is not to improve ordering of task
assignment policies, but to make it easier to reference shards.  The
newly introduced UpdateShardPlacementState() makes use of that, and so
will the in-progress connection and transaction management changes.
2016-10-07 11:59:20 -07:00
Andres Freund de32b7bbad Don't create hash-table of zero size in TaskHashCreate().
hash_create(), called by TaskHashCreate(), doesn't work correctly for a
zero-sized hash table. This triggers valgrind errors, and could
potentially cause crashes even without valgrind.

This currently happens for Jobs with 0 tasks. These probably should be
optimized away before reaching TaskHashCreate(), but that's a bigger
change.
2016-10-03 13:07:43 -07:00
Andres Freund a6150c2916 Lower "waiting for activity on tasks took longer than" log level.
It's perfectly normal to wait longer in several circumstances, and the
output can lead to spurious regression output changes.
2016-10-03 13:07:43 -07:00
Onder Kalaci a533b8e7c1 Differentiate worker and master job temporary folders
This commit enables creating different worker and master temporary folders.
This change is important for citus-mx on task-tracker execution. In simple
words, on citus-mx, the worker could actually be responsible for the master
tasks as well. Prior to this change, both master and worker logic on the
task-tracker executor accessed and used the same files for different purposes,
which was dangerous in certain cases (i.e., when task_tracker_delay is low).
2016-10-03 14:24:08 +03:00
Jason Petersen 5f6264105d
Directly register router xact callbacks in PG_init
Not entirely sure why we went with the shared memory hook approach, but
it causes problems (multiple registration) during crashes. Changing to
a simple direct registration call from PG_init.
2016-09-29 11:43:18 -06:00
Jason Petersen 0caf0d95f1
Fix unique-violation-in-xact segfault
An interaction between ReraiseRemoteError and DML transaction support
causes segfaults:

  * ReraiseRemoteError calls PurgeConnection, freeing a connection...
  * That connection is still in the xactParticipantHash

At transaction end, the memory in the freed connection might happen to
pass the "is this connection OK?" check, causing us to try to send an
ABORT over that connection. By removing it from the transaction hash
before calling ReraiseRemoteError, we avoid this possibility.
2016-09-27 16:44:03 -06:00
Metin Doslu c9dcad9b05 Pass text oid instead of invalid oid for null values
Passing invalid oids even for null values in PQsendQueryParams() causes worker
nodes to fail. Therefore, we pass text oid for null values.
2016-09-27 08:15:46 +03:00
Andres Freund 776b3868b9
Support NoMovement direction in router executor
This is mainly interesting because it allows the use of RETURN QUERY / RETURN
QUERY EXECUTE and FOR ... IN ... LOOP in plpgsql.
2016-09-26 18:28:36 -06:00
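For illustration, a hypothetical plpgsql function this enables over a
distributed table:

    CREATE FUNCTION customer_orders(cust bigint)
    RETURNS SETOF orders AS $$
    BEGIN
        -- RETURN QUERY fetches with the NoMovement direction, which the
        -- router executor now supports.
        RETURN QUERY SELECT * FROM orders WHERE customer_id = cust;
    END;
    $$ LANGUAGE plpgsql;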
Murat Tuncer 2eec0167be
Add support for truncate statement 2016-09-26 18:23:42 -06:00
Jason Petersen 74f4e0003b
Permit multiple DDL commands in a transaction
Three changes here to get to true multi-statement, multi-relation DDL
transactions (same functionality pre-5.2, with benefits of atomicity):

    1. Changed the multi-shard utility hook to always run (consistency
       with router executor hook, removes ad-hoc "installed" boolean)

    2. Change the global connection list in multi_shard_transaction to
       instead be a hash; update related functions to operate on global
       hash instead of local hash/global list

    3. Remove check within DDL code to prevent subsequent DDL commands;
       place unset/reset guard around call to ConnectToNode to permit
       connecting to additional nodes after DDL transaction has begun

In addition, code has been added to raise an error if a ROLLBACK TO
SAVEPOINT is attempted (similar to router executor), and comprehensive
tests execute all multi-DDL scenarios (full success, user ROLLBACK, any
actual errors (say, duplicate index), partial failure (duplicate index
on one node but not others), partial COMMIT (one node fails), and 2PC
partial PREPARE (one node fails)). Interleavings with other commands
(DML, \copy) are similarly all covered.
2016-09-08 22:35:55 -05:00
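For illustration, a now-permitted pattern with hypothetical tables:

    BEGIN;
    ALTER TABLE orders ADD COLUMN discount numeric;
    CREATE INDEX orders_discount_idx ON orders (discount);
    ALTER TABLE customers ADD COLUMN segment text;
    COMMIT;
    -- All statements commit or roll back together across shards;
    -- ROLLBACK TO SAVEPOINT raises an error.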
Jason Petersen 850c51947a
Re-permit DDL in transactions, selectively
Recent changes to DDL and transaction logic resulted in a "regression"
from the viewpoint of users. Previously, DDL commands were allowed in
multi-command transaction blocks, though they were not processed in any
actual transactional manner. We improved the atomicity of our DDL code,
but added a restriction that DDL commands themselves must not occur in
any BEGIN/END transaction block.

To give users back the original functionality (and improved atomicity)
we now keep track of whether a multi-command transaction has modified
data (DML) or schema (DDL). Interleaving the two modification types in
a single transaction is disallowed.

This first step simply permits a single DDL command in such a block,
admittedly an incomplete solution, but one which will permit us to add
full multi-DDL command support in a subsequent commit.
2016-08-30 20:37:19 -06:00
Metin Doslu 75618fc3fb Return false in MultiClientQueryResult() on failing query 2016-08-29 17:05:35 +03:00
Andres Freund 7fdb5fbe29 Skip over unreferenced parameters when router executing prepared statement.
When an unreferenced prepared statement parameter does not explicitly
have a type assigned, we cannot deserialize it to send to the remote
side.  That commonly happens inside plpgsql functions, where local
variables are passed in as unused prepared statement parameters.
2016-08-05 14:12:06 -07:00
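For illustration, a hypothetical plpgsql function exhibiting the pattern this
fixes:

    CREATE FUNCTION order_count(cust bigint) RETURNS bigint AS $$
    DECLARE
        scratch text;   -- never referenced by the query below; it shows
                        -- up as an unused prepared statement parameter,
                        -- which is now skipped rather than serialized
        result bigint;
    BEGIN
        SELECT count(*) INTO result FROM orders WHERE customer_id = cust;
        RETURN result;
    END;
    $$ LANGUAGE plpgsql;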
Jason Petersen eba8396501
Avoid attempting to lock invalid shard identifier
A recent change generates a "dummy" shard placement with its identifier
set to INVALID_SHARD_ID for SELECT queries against distributed tables
with no shards. Normally, no lock is acquired for SELECT statements,
but if all_modifications_commutative is set to true, we will acquire a
shared lock, triggering an assertion failure within LockShardResource
in the above case.

The "dummy" shard placement is actually necessary to ensure such empty
queries have somewhere to execute, and INVALID_SHARD_ID seems the most
appropriate value for the dummy's shard identifier field, so the most
straightforward fix is to just avoid locking invalid shard identifiers.
2016-08-04 13:49:51 -07:00
Marco Slot 9705cbcdf8 Rewrite WorkerShardStats to avoid invalid value bugs 2016-07-29 20:11:18 +02:00