citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	155db4d913	Simplify router planner call path	2017-06-22 15:45:57 +02:00
Murat Tuncer	0c4bf2d943	Remove fall back to select if poll is not available (#1466 ) poll is supported on all relevant systems, there is no need to have a fall back mechanism to use select()	2017-06-22 12:11:18 +03:00
Jason Petersen	294aeff2ed	Don't call PostProcessUtility for local commands It is intended only to aid in processing of distributed DDL commands, but as written could execute during local CREATE INDEX CONCURRENTLY commands.	2017-06-19 15:56:03 -06:00
velioglu	a17ab6408a	Delete ExecuteRemoteCommand function	2017-06-15 17:11:19 +03:00
velioglu	173fe137af	Convert DropShardsFromWorker to the new connection API	2017-06-15 15:24:06 +03:00
velioglu	d7b68e5647	Convert TableDDLCommandList function to the new connection API	2017-06-14 17:29:58 +03:00
velioglu	0aa9572e18	Convert RemoteTableOwner function to the new connection API	2017-06-14 17:29:58 +03:00
velioglu	7fe29aad4c	Convert worker_fetch_foreign_file to new connection API	2017-06-14 17:29:58 +03:00
velioglu	43d2cdbd35	Convert DistributedTableSizeOnWorker function to new connection API	2017-06-14 17:29:58 +03:00
Marco Slot	56876596d5	Add support for unlogged distributed tables	2017-06-14 13:50:00 +02:00
velioglu	a1ea29ec2b	Use placement connection to drop shards instead of node connection	2017-06-14 14:14:59 +03:00
Marco Slot	70abfd29d2	Allow COPY after a multi-shard command This change removes the XactModificationLevel check at the start of COPY that was made redundant by consistently using GetPlacementConnection.	2017-06-09 13:54:58 +02:00
jmunsch	1647d17a14	Clarify error message for local and distributed query plans.	2017-06-01 11:52:49 -07:00
Marco Slot	f1d804180b	Don't take a table lock in ForeignConstraintGetReferencedTableId	2017-05-31 11:15:21 +02:00
Burak Yucesoy	8c1bbf1417	Register cache invalidation callback before version checks With this commit we start to register InvalidateDistRelationCacheCallback function as cache invalidation callback function before version checks because during version checks we use cache to look up relation ids of some relations like pg_dist_relation or pg_dist_partition_logical_relid_index and we want to know about cache invalidation before accessing them.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	c7bfa06cb9	Fix incorrect call to CheckInstalledVersion During version update, we indirectly called CheckInstalledVersion via CheckCitusVersions. This obviously fails because a mismatch between the installed version and the binary version is expected during a version update. Thus, we remove that CheckCitusVersions call. We now only call CheckAvailableVersion.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	9fb15c439c	Add version checks to necessary UDFs	2017-05-22 09:53:29 +03:00
Burak Yucesoy	eea8c51e1f	Only error out on distributed queries when there is version mismatch Before this commit, we were erroring out at almost all queries if there is a version mismatch. With this commit, we started to error out only if the requested operation touches distributed tables. Normally we would need to use distributed cache to understand whether a table is distributed or not. However, it is not safe to read our metadata tables when there is a version mismatch, thus it is not safe to create distributed cache. Therefore for this specific occasion, we directly read from pg_dist_partition table. However, reading from catalog is costly and we should not use this method in other places as much as possible.	2017-05-22 09:53:29 +03:00
Burak Yucesoy	acb0d23717	Fix crash during upgrade from 5.2 to 6.2 This commit fixes the problem where we incorrectly try to reach distributed table cache when the extension is not loaded completely. We tried to reach the cache because we wanted to get reference table information to activate the node. However it is actually not necessary to explicitly activate the nodes which come from master_initialize_node_metadata. Because it only runs during extension creation and at that time there are no reference tables and all nodes are considered as active.	2017-05-19 00:01:36 +03:00
Jason Petersen	cc45712144	Bump extension and configure PACKAGE versions Actually getting this done before the next dev cycle begins.	2017-05-17 15:25:30 -06:00
Jason Petersen	489aa73257	Add missing CCI call in metadata seq sync Be explicit about the fact that we've made a modification: we need subsequent commands to see this sequence.	2017-05-16 11:05:34 -06:00
Jason Petersen	c9fa11b445	Use library and symbol name for bgw entry PostgreSQL 10 takes away the ability to directly assign a function pointer; the other approach (library and symbol name) is supported by all versions.	2017-05-16 11:05:33 -06:00
Jason Petersen	f86920f9d6	Add includes for missing standard headers We use symbols from each of these and were relying on them being included by other headers.	2017-05-16 11:05:33 -06:00
Jason Petersen	82b03d5cb6	Add explicit cast for argument to copyObject PostgreSQL 10 adds a call to typeof, if supported.	2017-05-16 11:05:33 -06:00
Burak Yucesoy	5a3a32d6df	Quote schema's owner name When we propagate the schema creation command to data nodes we add schema's owner name too. Before this patch, we did not quote the owner's name, which caused problems with names containing characters like '-'.	2017-05-15 16:26:32 +03:00
Burak Yucesoy	1b5560b2f7	Fix OwnerName function to work with schemas We incorrectly tried to use relation cache to find a particular schema's owner, and when we could not find the schema in the relation cache (i.e., always), we automatically used current user as the schema's owner. This means we always created schemas in the data nodes with current user. With this patch we started to use namespace cache to find schemas.	2017-05-15 16:26:32 +03:00
Önder Kalacı	e0257aecd9	Accept invalidation messages before accessing the metadata cache (#1406) * Accept invalidation messages before accessing the metadata cache This commit is crucial to prevent stale metadata reads from the cache. Without this commit, some of the operations may use stale metadata which could end up with various bugs such as crashes, inconsistent/lost data etc. As an example, consider that a COPY operation is blocked on shard metadata lock. Another concurrent session updates the metadata and invalidates the cache. However, since Citus doesn't accept invalidations, COPY continues with the stale metadata once it acquires the lock. With this commit, we make sure that invalidation messages are accepted just before accessing the metadata cache, preventing any operation from using stale metadata. * Add isolation tests for placement changes and concurrent operations - add node with reference table vs COPY/insert/update/DDL - repair shard vs COPY/insert/update/DDL - repair shard vs repair shard	2017-05-12 12:32:35 +03:00
Marco Slot	6f9e18de24	Ensure all preceding writes are visible in data migration	2017-05-11 09:42:12 +02:00
Önder Kalacı	3ec502b286	Add support for parametrized execution for subquery pushdown (#1356 ) Distributed query planning for subquery pushdown is done on the original query. This prevents the usage of external parameters on the execution. To overcome this, we manually replace the parameters on the original query.	2017-05-10 09:38:48 +03:00
Marco Slot	a8f368fced	Fix locking in master_drop_all_shards / master_apply_delete_command	2017-05-08 17:26:55 +02:00
Marco Slot	853f07dd33	Don't change query tree of DDL commands	2017-05-04 21:34:28 +02:00
Jason Petersen	f0c6c47c4e	Fix CREATE SEQUENCE generation bug Apparently we've had a typo all this time causing us to pass the cache value for the start value.	2017-05-03 21:47:06 -07:00
Önder Kalacı	ef6d3587b6	Skip exhaustive test in CoPartitionedTables() if declared colocated (#1376 ) That's considerably cheaper.	2017-05-02 03:33:21 +03:00
Önder Kalacı	b74ed3c8e1	Subqueries in where -- updated (#1372) * Support for subqueries in WHERE clause This commit enables subqueries in WHERE clause to be pushed down by the subquery pushdown logic. The support covers: - Correlated subqueries with IN, NOT IN, EXISTS, NOT EXISTS, operator expressions such as (>, <, =, ALL, ANY etc.) - Non-correlated subqueries with (partition_key) IN (SELECT partition_key ..) (partition_key) =ANY (SELECT partition_key ...) Note that this commit heavily utilizes the attribute equivalence logic introduced in the `1cb6a34ba8`. In general, this commit mostly adjusts the logical planner not to error out on the subqueries in WHERE clause. * Improve error checks for subquery pushdown and INSERT ... SELECT Since we allow subqueries in WHERE clause with the previous commit, we should apply the same limitations to those subqueries. With this commit, we do not iterate on each subquery one by one. Instead, we extract all the subqueries and apply the checks directly on those subqueries. The aim of this change is to (i) Simplify the code (ii) Make it close to the checks on INSERT .. SELECT code base. * Extend checks for unresolved parameters to include SubLinks With the presence of subqueries in where clause (i.e., SubPlans on the query) the existing way of checking unresolved parameters fails. The reason is that the parameters for SubPlans are kept on the parent plan not on the query itself (see primnodes.h for the details). With this commit, instead of checking SubPlans on the modified plans we start to use originalQuery, where SubLinks represent the subqueries in where clause. The unresolved parameters can be found on the SubLinks. * Apply code-review feedback * Remove unnecessary copying of shard interval list This commit removes unnecessary copying of shard interval list. Note that there is no copyObject function implemented for shard intervals.	2017-05-01 17:20:21 +03:00
Marco Slot	8edba5f309	Honour enable_ddl_propagation in truncate trigger	2017-04-29 03:32:52 +02:00
Brian Cloutier	22e7aa9a4f	Fix crash in isolation tests - There was a crash when the table a shardid belonged to changed during a session. Instead of crashing (a failed assert) we now throw an error - Update the isolation test which was crashing to no longer exercise that code path - Add a regression test to check that the error is thrown	2017-04-29 04:25:26 +03:00
Önder Kalacı	ad5cd326a4	Subquery pushdown - main branch (#1323) * Enabling physical planner for subquery pushdown changes This commit applies the logic that exists in INSERT .. SELECT planning to the subquery pushdown changes. The main algorithm is as follows: - pick an anchor relation (i.e., target relation) - per each target shard interval - add the target shard interval's shard range as a restriction to the relations (if all relations joined on the partition keys) - Check whether the query is router plannable per target shard interval. - If router plannable, create a task * Add union support within the JOINS This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query JOIN (QN) ... In other words, we currently do NOT support the queries that are in the following form where union query is not JOINed with other relations/subqueries : .... (Q1 UNION Q2 UNION ...) as union_query .... * Subquery pushdown planner uses original query With this commit, we change the input to the logical planner for subquery pushdown. Before this commit, the planner was relying on the query tree that is transformed by the postgresql planner. After this commit, the planner uses the original query. The main motivation behind this change is to simplify deparsing of subqueries. * Enable top level subquery join queries This work enables - Top level subquery joins - Joins between subqueries and relations - Joins involving more than 2 range table entries A new regression test file is added to reflect enabled test cases * Add top level union support This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query .... In other words, Citus allows top level unions to be wrapped in aggregation queries and/or simple projection queries that only select some fields from the lower level queries. 
* Disallow subqueries without a relation in the range table list for subquery pushdown This commit disallows subqueries without relation in the range table list. This commit is only applied for subquery pushdown. In other words, we do not add this limitation for single table re-partition subqueries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Disallow subqueries without a relation in the range table list for INSERT .. SELECT This commit disallows subqueries without relation in the range table list. This commit is only applied for INSERT.. SELECT queries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Change behaviour of subquery pushdown flag (#1315) This commit changes the behaviour of the citus.subquery_pushdown flag. Before this commit, the flag is used to enable subquery pushdown logic. But, with this commit, that behaviour is enabled by default. In other words, the flag is now useless. We prefer to keep the flag since we don't want to break the backward compatibility. Also, we may consider using that flag for other purposes in the next commits. * Require subquery_pushdown when limit is used in subquery Using limit in subqueries may cause returning incorrect results. Therefore we allow limits in subqueries only if user explicitly set subquery_pushdown flag. * Evaluate expressions on the LIMIT clause (#1333) Subquery pushdown uses the original query, so the LIMIT and OFFSET clauses are not evaluated. However, the logical optimizer expects these expressions to be already evaluated by the standard planner. This commit manually evaluates the functions on the logical planner for subquery pushdown. 
* Better format subquery regression tests (#1340) * Style fix for subquery pushdown regression tests With this commit we intended a more consistent style for the regression tests we've added in the - multi_subquery_union.sql - multi_subquery_complex_queries.sql - multi_subquery_behavioral_analytics.sql * Enable the tests that are temporarily commented This commit enables some of the regression tests that were commented out until all the development is done. * Fix merge conflicts (#1347) - Update regression tests to meet the changes in the regression test output. - Replace Ifs with Asserts given that the check is already done - Update shard pruning outputs * Add view regression tests for increased subquery coverage (#1348) - joins between views and tables - joins between views - union/union all queries involving views - views with limit - explain queries with view * Improve btree operators for the subquery tests This commit adds the missing comparison for subquery composite key btree comparator.	2017-04-29 04:09:48 +03:00
Andres Freund	90b211267d	Perform range based pruning if equality pruning has survivor. We previously dismissed this as unimportant, but it turns out to be very useful for the upcoming subquery pushdown, where a user might specify an equality constraint in a subquery, and the subquery pushdown machinery adds >= and <= restrictions on the shard boundary. Previously the latter restriction was ignored.	2017-04-28 17:35:18 -07:00
Andres Freund	6c08fe72f9	Use stricter qual for pruning if both >/< and >=/<= are present. Previously, if both <= and < (>= and > respectively) were specified, we always used the latter restriction. Instead use the stricter one.	2017-04-28 17:35:18 -07:00
Marco Slot	6e58067962	Fix list length lookup in WorkerGetLiveNodeCount	2017-04-29 02:13:20 +02:00
Burak Yucesoy	6599677902	Fix check-vanilla tests It seems that GEQO optimizations, when it is set to on, create their own memory context and free it when it is no longer necessary. In multi_join_restriction_hook we allocate our variables in the CurrentMemoryContext, which is GEQO's memory context if it is active. To prevent deallocation of our variables when GEQO's memory context is freed, we started to allocate memory for these variables in a separate MemoryContext.	2017-04-29 01:55:18 +02:00
Marco Slot	0b579d027a	Check whether relation ID exists in citus_relation_size	2017-04-29 01:39:39 +02:00
Andres Freund	d399f395f7	Faster shard pruning. So far citus used postgres' predicate proofing logic for shard pruning, except for INSERT and COPY which were already optimized for speed. That turns out to be too slow: * Shard pruning for SELECTs is currently O(#shards), because PruneShardList calls predicate_refuted_by() for every shard. Obviously using an O(N) type algorithm for general pruning isn't good. * predicate_refuted_by() is quite expensive on its own right. That's primarily because it's optimized for doing a single refutation proof, rather than performing the same proof over and over. * predicate_refuted_by() does not keep persistent state (see 2.) for function calls, which means that a lot of syscache lookups will be performed. That's particularly bad if the partitioning key is a composite key, because without a persistent FunctionCallInfo record_cmp() has to repeatedly look-up the type definition of the composite key. That's quite expensive. Thus replace this with custom-code that works in two phases: 1) Search restrictions for constraints that can be pruned upon 2) Use those restrictions to search for matching shards in the most efficient manner available: a) Binary search / Hash Lookup in case of hash partitioned tables b) Binary search for equal clauses in case of range or append tables without overlapping shards. c) Binary search for inequality clauses, searching for both lower and upper boundaries, again in case of range or append tables without overlapping shards. d) exhaustive search testing each ShardInterval My measurements suggest that we are considerably, often orders of magnitude, faster than the previous solution, even if we have to fall back to exhaustive pruning.	2017-04-28 14:40:41 -07:00
Andres Freund	6bd2e3ed30	Add DistTableCacheEntry->hasOverlappingShardInterval. This determines whether it's possible to perform binary search on sortedShardIntervalArray or not. If e.g. two shards have overlapping ranges, that'd be prohibitive. That'll be useful in later commit introducing faster shard pruning.	2017-04-28 14:40:38 -07:00
Andres Freund	105483ec56	Add DistTableCacheEntry->shardValueCompareFunction. That's useful when comparing values a hash-partitioned table is filtered by. The existing shardIntervalCompareFunction is about comparing hashed values, not unhashed ones. The added btree opclass function is so we can get a comparator back. This should be changed much more widely, but is not necessary so far.	2017-04-28 14:40:38 -07:00
Andres Freund	52571c00ad	Build DistTableCacheEntry->shardIntervalCompareFunction even for 0 shards. Previously we, unnecessarily, used the first shard's type information to look up the comparison function. But that information is already available, so use it. That's helpful because we sometimes want to access the comparator function even if there are no shards.	2017-04-28 14:40:38 -07:00
Andres Freund	ba93d32c8a	Fix: Make FindShardIntervalIndex robust against 0 shards.	2017-04-28 14:40:38 -07:00
Metin Doslu	b6659bec22	Send explain queries with savepoints With this commit, we started to send explain queries within a savepoint. After running explain query, we rollback to savepoint. This saves us from side effects of EXPLAIN ANALYZE on DML queries.	2017-04-28 12:13:48 -07:00
Jason Petersen	93e3afc25c	Remove FastShardPruning method With the other simplifications, it doesn't make sense to keep around.	2017-04-27 13:32:36 -06:00
Jason Petersen	42ee7c05f5	Refactor FindShardInterval to use cacheEntry All callers fetch a cache entry and extract/compute arguments for the eventual FindShardInterval call, so it makes more sense to refactor into that function itself; this solves the use-after-free bug, too.	2017-04-27 13:32:36 -06:00
Andres Freund	b7dfeb0bec	Boring regression test output adjustments. Soon shard pruning will be optimized not to generally work linearly anymore. Thus we can't print the pruned shard intervals as currently done anymore. The current printing of shard ids also prevents us from running tests in parallel, as otherwise shard ids aren't linearly numbered.	2017-04-26 11:33:56 -07:00
Andres Freund	71a7f39b05	Skip exhaustive test in CoPartitionedTables() if declared colocated. That's considerably cheaper.	2017-04-26 11:19:17 -07:00
Marco Slot	7f9e80db10	Only process error if not NULL in StoreErrorMessage	2017-04-21 17:01:01 +02:00
Marco Slot	7faf4657b7	Use right sizeof in UpdateRelationColocationGroup	2017-04-21 16:37:09 +02:00
Marco Slot	4ed093970a	Support expressions in the partition column in INSERTs	2017-04-21 14:05:52 +02:00
velioglu	24d24db25c	Implement ALTER TABLE ADD CONSTRAINT command	2017-04-20 15:02:33 +03:00
velioglu	8cbef819be	Log message of across shard queries according to the log level	2017-04-20 12:24:46 +03:00
velioglu	2327b63291	Change native hash function with worker_hash	2017-04-19 22:16:55 +03:00
Jason Petersen	5272c2c44b	Enable distributed ALTER TABLE ... RENAME COLUMN Pretty straightforward. Had some concerns about locking, but due to the fact that all distributed operations use either some level of deparsing or need to enumerate column names, they all block during any concurrent column renames (due to the AccessExclusive lock). In addition, I had some misgivings about permitting renames of the distribution column, but nothing bad comes from just allowing them. Finally, I tried to trigger any sort of error using prepared statements and could not trigger any errors not also exhibited by plain PostgreSQL tables.	2017-04-18 22:47:48 -06:00
Marco Slot	dfd7d86948	Stop using a sequence to generate unique job IDs	2017-04-18 11:31:51 +02:00
Burak Yucesoy	00747dc8c9	Set default value of isactive to true With this change, we set the default value of the isactive column to true so that, for upgrading users, all nodes are marked as active and their environment is not broken.	2017-04-18 09:40:44 +03:00
Burak Yucesoy	1a56b99f13	Fix node copy error Instead of directly returning heap tuple obtained from heap scan we return copied version of it.	2017-04-17 19:38:18 +03:00
Marco Slot	af0e462409	Support UPDATE/DELETE with parameterised partition column qual	2017-04-17 16:17:30 +02:00
Marco Slot	5e58804d44	Support query parameters in combination with function evaluation	2017-04-17 15:40:55 +02:00
Marco Slot	0bcc227a62	Create indexes after worker_append_table_to_shard during shard repair	2017-04-17 15:17:21 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will be sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Burak Yucesoy	7cfcb7d2f8	Error out on parameterized SQL functions Before this commit, we were erroring out for queries containing parameterized SQL functions like 'SELECT parameterized_sql_query(value)' as we should, however we were returning wrong results for queries like 'SELECT * FROM parameterized_sql_query(value)'. With this commit we started to error out on such queries too.	2017-04-13 16:36:24 +03:00
Onder Kalaci	1cb6a34ba8	Remove uninstantiated qual logic, use attribute equivalences In this PR, we aim to deduce whether each of the RTE_RELATION is joined with at least one other RTE_RELATION on their partition keys. If each RTE_RELATION follows the above rule, we can conclude that all RTE_RELATIONs are joined on their partition keys. In order to do that, we invented a new equivalence class namely: AttributeEquivalenceClass. In very simple words, an AttributeEquivalenceClass is identified by a unique id and consists of a list of AttributeEquivalenceMembers. Each AttributeEquivalenceMember is designed to identify attributes uniquely within the whole query. The necessity of this arises since varno attributes are defined within a single level of a query. Instead, here we want to identify each RTE_RELATION uniquely and try to find equality among each RTE_RELATION's partition key. Whenever we find an equality clause A = B, where both A and B originate from relation attributes (i.e., not random expressions), we create an AttributeEquivalenceClass to record this knowledge. If we later find another equivalence B = C, we create another AttributeEquivalenceClass. Finally, we can apply transitivity rules and generate a new AttributeEquivalenceClass which includes A, B and C. Note that equality among the members is identified by the varattno and rteIdentity. Each equality among RTE_RELATION is saved using an AttributeEquivalenceClass where each member attribute is identified by an AttributeEquivalenceMember. In the final step, we try to generate a common attribute equivalence class that holds as many AttributeEquivalenceMembers as possible whose attributes are partition keys.	2017-04-13 11:51:26 +03:00
velioglu	1fb11c738f	Check binary output function of type.	2017-04-10 16:28:09 +03:00
Jason Petersen	7e46f41c12	Add comments, use strncmp, clean up GUC desc. Good to go!	2017-04-04 16:16:49 -06:00
Jason Petersen	033fda9183	Clean up remaining error messages Added details and hints, based off of similar PostgreSQL scenarios.	2017-04-04 16:11:59 -06:00
Jason Petersen	ef81b21a49	Clean up ErrorIfUnstableCreateOrAlterExtensionStmt Swaps an Assert in for an ereport, and adds details and hints to the error message to help users with a possibly confusing scenario.	2017-04-04 15:58:57 -06:00
Jason Petersen	ad3fbd9689	Refactor utility-skip/extn-check code This was getting pretty long and complex in the context of the main utility hook. Moved out the checks for what should skip Citus processing and what should have version checks performed.	2017-04-04 15:07:22 -06:00
Burak Yucesoy	a09614553f	Add enable_version_checks GUC and address feedback	2017-04-04 19:11:13 +03:00
Burak Yucesoy	087d8427e3	Error out if binary citus version does not match installed extension With this change, we start to error out if the loaded citus binaries do not match the available major version or the installed citus extension version. In this case we force the user to restart the server or run ALTER EXTENSION depending on the situation	2017-04-03 17:36:13 -06:00
Jason Petersen	4cdfc3a10f	Address review feedback Should just about do it.	2017-04-03 11:44:57 -06:00
Jason Petersen	cf775c4773	Improve CONCURRENTLY-related error messages Thought this looked slightly nicer than the default behavior. Changed preventTransaction to concurrent to be clearer that this code path presently affects CONCURRENTLY code only.	2017-04-03 11:19:15 -06:00
Jason Petersen	dd9365433e	Update documentation Ensure all functions have comments, etc.	2017-04-03 11:19:15 -06:00
Jason Petersen	d904e96c59	Address MX CONCURRENTLY problems Adds a non-transactional multi-command method to propagate DDLs to all MX/metadata-synced nodes.	2017-04-03 11:19:15 -06:00
Jason Petersen	32886e97a3	Add code to set index validity on failure Coordinator code marks index as invalid as a base, sets it as valid in a transactional layer atop that base, then proceeds with worker commands. If a worker command has problems, the rollback results in an index with isvalid = false. If everything succeeds, the user sees a valid index.	2017-04-03 11:19:15 -06:00
Jason Petersen	dea6c44f75	Remove CONCURRENTLY checks, fix tests Still pending failure testing, which broke with my recent changes.	2017-04-03 11:19:15 -06:00
Jason Petersen	0b6c4e756e	Change DropStmt to generate worker DDL on master Because we can't execute DROP INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:15 -06:00
Jason Petersen	95d8d27c4f	Change IndexStmt to generate worker DDL on master Because we can't execute CREATE INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:14 -06:00
Marco Slot	0f355a4a48	Batch task_tracker_status calls to reduce task-tracker query times	2017-03-31 11:54:11 +02:00
Metin Doslu	54a277ff01	Add disable/enable trigger all support	2017-03-29 22:00:14 +03:00
Onder Kalaci	11665dbe3c	Fix pushing down wrong queries for INSERT ... SELECT queries Before this commit, in certain cases router planner allowed pushing down JOINs that are not on the partition keys. With @anarazel's suggestion, we change the logic to use uninstantiated parameter. Previously, the planner was traversing on the restriction information and once it finds the parameter, it was replacing it with the shard range. With this commit, instead of traversing the restrict infos, the planner explicitly checks for the equivalence of the relation partition key with the uninstantiated parameter. If it finds an equivalence, it adds the restrictions. In this way, we have more control over the queries that are pushed down.	2017-03-24 11:37:35 +02:00
Jason Petersen	34a62abb7d	Address code review comments	2017-03-22 17:29:17 -06:00
Jason Petersen	d95b5bbad3	Rework ReplicateGrantStmt to use new flow This was the impetus for the previous commit that changed from using a DDLJob * to a List * of them.	2017-03-22 17:29:16 -06:00
Jason Petersen	23f5e4282d	Change DDLJob usage to be wrapped in lists To prepare for GRANT fixes.	2017-03-22 17:29:16 -06:00
Jason Petersen	f181b24859	Move worker execution to after master, fix tests Some tests relied on worker errors though local commands were invalid. Fixed those by ensuring preconditions were met to have command work correctly. Otherwise most test changes are related to slight changes in local/remote error ordering.	2017-03-22 17:21:49 -06:00
Jason Petersen	419a4c3745	Remove execution from stmt-specific util functions Now have a single Execute call in the main body.	2017-03-22 17:21:49 -06:00
Jason Petersen	a64165767d	Rename ProcessStmt functions to PlanStmt To reflect their new purpose planning a DDLJob rather than fully processing a distributed DDL statement.	2017-03-22 17:21:49 -06:00
Jason Petersen	a02a2a90c7	Refactor ExecuteDistDDLCommand to expect struct Will let us separate out the determination of what to execute from its actual execution.	2017-03-22 17:21:49 -06:00
Metin Doslu	b1ee7ec93e	Fix access permission checks for distributed relations With this commit, we add the range table list of the original query to our custom plan. Therefore, PostgreSQL can check relations in the original query for access permissions and error out if the proper access is not granted.	2017-03-22 15:25:00 -06:00
Murat Tuncer	c4734d7d94	Rephrase router modify errors generic "distributed modifications must target exactly one shard" message is replaced by more context aware error messages.	2017-03-16 15:09:10 +03:00
velioglu	e32aff1a26	Size UDFs implemented citus_table_size, citus_relation_size and citus_total_relation_size UDFs are implemented.	2017-03-16 13:50:30 +03:00
Metin Doslu	1f838199f8	Use CustomScan API for query execution Custom Scan is a node in the planned statement which helps external providers to abstract data scan not just for foreign data wrappers but also for regular relations so you can benefit from your version of caching or hardware optimizations. This sounds like only an abstraction on the data scan layer, but we can use it as an abstraction for our distributed queries. The only thing we need to do is to find distributable parts of the query, plan for them and replace them with a Citus Custom Scan. Then, whenever PostgreSQL hits this custom scan node in its Volcano style execution, it will call our callback functions which run the distributed plan and provide tuples to the upper node as if it scans a regular relation. This means fewer code changes, fewer bugs and more supported features for us! First, in the distributed query planner phase, we create a Custom Scan which wraps the distributed plan. For real-time and task-tracker executors, we add this custom plan under the master query plan. For router executor, we directly pass the custom plan because there is not any master query. Then, we simply let the PostgreSQL executor run this plan. When it hits the custom scan node, we call the related executor parts for distributed plan, fill the tuple store in the custom scan and return results to PostgreSQL executor in Volcano style, a tuple per XXX_ExecScan() call. * Modify planner to utilize Custom Scan node. * Create different scan methods for different executors. * Use native PostgreSQL Explain for master part of queries.	2017-03-14 12:17:51 +02:00
Andres Freund	52358fe891	Initial temp table removal implementation	2017-03-14 12:09:49 +02:00
Jason Petersen	6f4886cd11	Revert "Remove unused SendCommandToWorker" This reverts commit `c8c308c109`.	2017-03-13 15:48:51 -06:00
Murat Tuncer	f657a744d5	Enable router planner for queries on range partitioned tables Router planner now supports queries using range partitioned tables. Queries on append partitioned tables are still not supported.	2017-03-09 16:39:15 +03:00
Brian Cloutier	c8c308c109	Remove unused SendCommandToWorker	2017-03-08 16:30:23 +03:00
Brian Cloutier	a2ba565a9e	Remove unused master_stage_shard_{placement_,}row	2017-03-07 11:59:26 +03:00
Brian Cloutier	95936ff481	Remove unused master_get_round_robin_candidate_nodes	2017-03-07 11:51:24 +03:00
Brian Cloutier	807beb7bc0	Remove master_get_local_first_candidate_nodes	2017-03-07 11:50:59 +03:00
Andres Freund	fa5b8fb39f	Fix SendRemoteCommandParams() handling of a NULL MultiConnection->pgConn. (#1271) Previously we'd segfault in PQisnonblocking() which, contrary to other libpq calls, doesn't handle a NULL PQconn (because there'd be no appropriate return value for that). cr: @jasonmp85	2017-03-03 12:02:15 -07:00
Murat Tuncer	72027f2eba	Remove default clause from shard DDL when sequences are used	2017-03-01 17:32:48 +03:00
Marco Slot	bab1b65491	Fix spelling in master_initialize_node_metadata comment	2017-03-01 12:27:50 +01:00
Jason Petersen	047825c6ca	Rename misleading allowEmpty parameter Last bit of PR feedback.	2017-02-28 22:48:00 -07:00
Marco Slot	56d4d375c2	Address review feedback in create_distributed_table data loading	2017-02-28 17:39:45 +01:00
Marco Slot	db98c28354	Address review feedback in COPY refactoring	2017-02-28 17:39:45 +01:00
Marco Slot	d74fb764b1	Use CitusCopyDestReceiver for regular COPY	2017-02-28 17:24:45 +01:00
Marco Slot	d11eca7d4a	Load data into distributed table on creation	2017-02-28 17:24:45 +01:00
Marco Slot	bf3541cb24	Add CitusCopyDestReceiver infrastructure	2017-02-28 17:24:45 +01:00
Burak Velioglu	e158c7ae67	Merge branch 'master' into disallow_master_appy_delete_on_hash	2017-02-24 10:40:23 +02:00
velioglu	4dbb69cfc3	Fix error message of start_metadata_sync_to_node Single quotation mark is added around nodename to make the error code consistent with master_add_node usage.	2017-02-22 18:03:58 +03:00
Metin Doslu	ee425871ee	Get reproducible costs between different PostgreSQL versions	2017-02-22 15:40:02 +02:00
Burak Velioglu	49812ddfa0	Disallow master_apply_delete_command on hash distributed table Delete operation is blocked for any table distributed by hash using master_apply_delete_command. Suggested master_modify_multiple_shards command as a hint.	2017-02-22 11:54:46 +03:00
Andres Freund	9721e80901	Use DEBUG2 instead of DEBUG4 in INSERT SELECT tests & debug message. During later work the transaction debug output will change (as it will in postgres 10), which makes it hard to see actual changes in the INSERT ... SELECT ... test. Reduce to DEBUG2 after changing a debug message to that log level.	2017-02-20 12:56:16 +02:00
Eren Basak	df9cf346ee	Enforce statement based replication on old APIs and non-hash tables This change ignores `citus.replication_model` setting and uses the statement based replication in - Tables distributed via the old `master_create_distributed_table` function - Append and range partitioned tables, even if created via `create_distributed_table` function This seems like the easiest solution to #1191, without changing the existing behavior and harming existing users with custom scripts. This change also prevents RF>1 on streaming replicated tables on `master_create_worker_shards` Prior to this change, `master_create_worker_shards` command was not checking the replication model of the target table, thus allowing RF>1 with streaming replicated tables. With this change, `master_create_worker_shards` errors out on the case.	2017-02-16 10:37:53 -08:00
Onder Kalaci	95f8382ca2	Bugfix for creating foreign key This commit fixes a backend crash when adding foreign keys without specifying the referenced column.	2017-02-07 09:34:24 +02:00
Brian Cloutier	e6e5f63d9d	Utility hook does nothing if the extension is not loaded	2017-02-02 17:48:31 +02:00
Brian Cloutier	a30b9b93a4	Set a memory context when throwing deferred errors	2017-02-02 15:14:21 +02:00
Brian Cloutier	e3c763c3f7	Start remote transactions in master_append_table_to_shard Add a call to RemoteTransactionBeginIfNecessary so that BEGIN is actually sent to the remote connections. This means that ROLLBACK and Ctrl-C are respected and don't leave the table in a partial state.	2017-02-01 18:12:19 +02:00
Eren Basak	ae0bfb1394	Allow dropping sequences on mx workers This change allows users to drop sequences on MX workers. Previously, Citus didn't allow dropping sequences on MX workers because it could cause shards to be dropped if `DROP SEQUENCE ... CASCADE` is used. We now allow that since allowing sequence creation but not dropping hurts user experience and also may cause problems with custom Citus solutions.	2017-01-31 14:51:44 -08:00
Brian Cloutier	6843ad8e91	Fix bug where router executor sends query to failed connections	2017-01-27 09:40:30 +02:00
Brian Cloutier	1173f3f225	Refactor CheckShardPlacements - Break CheckShardPlacements into multiple functions (The most important is MarkFailedShardPlacements), so that we can get rid of the global CoordinatedTransactionUses2PC. - Call MarkFailedShardPlacements in the router executor, so we mark shards as invalid and stop using them while inside transaction blocks.	2017-01-26 13:20:45 +02:00
Marco Slot	f56454360c	Mark failed placements as inactive immediately after COPY	2017-01-25 19:19:39 +03:00
Marco Slot	b1626887d5	Don't mark placements inactive in COPY after successful connection	2017-01-25 19:19:38 +03:00
Marco Slot	d0c76407b8	Set placement to inactive on connection failure in COPY	2017-01-25 19:19:38 +03:00
Marco Slot	85c1a87999	Short circuit in multi_ProcessUtility on ABORT/COMMIT	2017-01-25 11:57:00 +01:00
Marco Slot	2748660b1c	Always skip foreign key validation when enable_ddl_propagation is off	2017-01-25 11:56:59 +01:00
Marco Slot	ba940a1de9	Use coordinator instead of schema node in terminology	2017-01-25 11:07:23 +01:00
Marco Slot	72725ba30c	Use bigserial instead of BIGINT in sequence error	2017-01-25 11:07:23 +01:00
Burak Yucesoy	d80e7849a4	Convert DropShards to use new connection API With this change DropShards function started to use new connection API. DropShards function is used by DROP TABLE, master_drop_all_shards and master_apply_delete_command, therefore all of these functions now support transactional operations. In DropShards function, if we cannot reach a node, we mark shard state of related placements as FILE_TO_DELETE and continue to drop remaining shards; however if any error occurs after establishing the connection, we ROLLBACK whole operation.	2017-01-23 21:08:41 +03:00
Burak Yucesoy	2489c59c15	In case of failed transactions update shard state only if it is FILE_FINALIZED Before this change, when a transaction failed, we update related placements shard states to FILE_INACTIVE during XACT_EVENT_PRE_COMMIT. However that means if another code block changed shard state to something else (e.g. FILE_TO_DELETE) before XACT_EVENT_PRE_COMMIT we overwrite that. To prevent that problem, in case of failure we started to change shard state, only if its current shard state is FILE_FINALIZED.	2017-01-23 21:04:57 +03:00
Burak Yucesoy	484cb12cd0	Add LoadShardPlacement UDF This UDF returns a shard placement from cache given shard id and placement id. At the moment it iterates over all shard placements of given shard by ShardPlacementList and searches given placement id in that list, which is not a good solution performance-wise. However, currently, this function will be used only when there is a failed transaction. If a need arises we can optimize this function in the future.	2017-01-23 21:04:57 +03:00
Marco Slot	1585c02322	Use placement connection API for multi-shard transactions	2017-01-23 18:34:50 +01:00
Andres Freund	6939cb8c56	Hack up PREPARE/EXECUTE for nearly all distributed queries. All router, real-time, task-tracker plannable queries should now have full prepared statement support (and even use router when possible), unless they don't go through the custom plan interface (which basically just affects LANGUAGE SQL (not plpgsql) functions). This is achieved by forcing postgres' planner to always choose a custom plan, by assigning very low costs to plans with bound parameters (i.e. ones where the postgres planner replanned the query upon EXECUTE with all parameter values provided), instead of the generic one. This requires some trickery, because for custom plans to work the costs for a non-custom plan have to be known, which means we can't error out when planning the generic plan. Instead we have to return a "faux" plan, that'd trigger an error message if executed. But due to the custom plan logic that plan will likely (unless called by an SQL function, or because we can't support that query for some reason) not be executed; instead the custom plan will be chosen.	2017-01-23 09:23:50 -08:00
Andres Freund	c244b8ef4a	Make router planner error handling more flexible. So far router planner had encapsulated different functionality in MultiRouterPlanCreate. Modifications always go through router, selects sometimes. Modifications always error out if the query is unsupported, selects return NULL. Especially the error handling is a problem for the upcoming extension of prepared statement support. Split MultiRouterPlanCreate into CreateRouterPlan and CreateModifyPlan, and change them to not throw errors. Instead errors are now reported by setting the new MultiPlan->plannigError. Callers of router planner functionality now have to throw errors themselves if desired, but also can skip doing so. This is a pre-requisite for expanding prepared statement support. While touching all those lines, improve a number of error messages by getting them closer to the postgres error message guidelines.	2017-01-23 09:23:50 -08:00
Andres Freund	7681f6ab9d	Centralize more of distributed planning into CreateDistributedPlan(). The name CreatePhysicalPlan() hasn't been accurate for a while, and the split of work between multi_planner() and CreatePhysicalPlan() doesn't seem perfect. So rename to CreateDistributedPlan() and move a bit more logic in there.	2017-01-23 09:23:50 -08:00
Andres Freund	557ccc6fda	Support for deferred error messages. It can be useful, e.g. in the upcoming prepared statement support, to be able to return an error from a function that is not raised immediately, but can later be thrown. That allows e.g. to attempt to plan a statement using different methods and to create good error messages in each planner, but to only error out after all planners have been run. To enable that create support for deferred error messages that can be created (supporting errorcode, message, detail, hint) in one function, and then thrown in a different place.	2017-01-23 09:23:50 -08:00
Andres Freund	9a82e8f06b	Make usage of static a bit more consistent in multi_planner.c.	2017-01-23 09:23:50 -08:00
Jason Petersen	56197dbdba	Add replication_model GUC This adds a replication_model GUC which is used as the replication model for any new distributed table that is not a reference table. With this change, tables with replication factor 1 are no longer implicitly MX tables. The GUC is similarly respected during empty shard creation for e.g. existing append-partitioned tables. If the model is set to streaming while replication factor is greater than one, table and shard creation routines will error until this invalid combination is corrected. Changing this parameter requires superuser permissions.	2017-01-23 09:05:14 -07:00
Brian Cloutier	fe5465aa4e	Port master_append_table_to_shard to new connection API (#1149) If any placements fail it doesn't update shard statistics on those placements. A minor enabling refactor: Make CoordinatedTransactionUses2PC public (it used to be CoordinatedTransactionUse2PC but that symbol already existed, so renamed it as well)	2017-01-23 15:57:44 +02:00
Burak Yucesoy	2e1df4c910	Reword error message for outer joins requiring repartition We changed error message which appears when user tries to execute outer join command and that command requires repartitioning. Old error message mentioned about 1-to-1 shard partitioning which may not be clear to user.	2017-01-23 10:42:36 +03:00
Marco Slot	ea855ddf86	Add an enable_deadlock_prevention flag to allow router transactions to expand to multiple nodes	2017-01-22 17:31:24 +01:00
Marco Slot	87ae26aef3	Ensure job IDs are unique across workers	2017-01-22 16:55:14 +01:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	6ec34bed84	Remove remnants of commit_protocol.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	52c3369f79	Minimal citus tools conversion to new connection API.	2017-01-21 09:01:14 -08:00

618 Commits (353d2db9138c974355e8da125cc4aa5c2ee45314)