citus

Commit Graph

Author	SHA1	Message	Date
Jason Petersen	d904e96c59	Address MX CONCURRENTLY problems Adds a non-transactional multi-command method to propagate DDLs to all MX/metadata-synced nodes.	2017-04-03 11:19:15 -06:00
Jason Petersen	32886e97a3	Add code to set index validity on failure Coordinator code marks index as invalid as a base, sets it as valid in a transactional layer atop that base, then proceeds with worker commands. If a worker command has problems, the rollback results in an index with isvalid = false. If everything succeeds, the user sees a valid index.	2017-04-03 11:19:15 -06:00
Jason Petersen	dea6c44f75	Remove CONCURRENTLY checks, fix tests Still pending failure testing, which broke with my recent changes.	2017-04-03 11:19:15 -06:00
Jason Petersen	0b6c4e756e	Change DropStmt to generate worker DDL on master Because we can't execute DROP INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:15 -06:00
Jason Petersen	95d8d27c4f	Change IndexStmt to generate worker DDL on master Because we can't execute CREATE INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:14 -06:00
Marco Slot	0f355a4a48	Batch task_tracker_status calls to reduce task-tracker query times	2017-03-31 11:54:11 +02:00
Metin Doslu	54a277ff01	Add disable/enable trigger all support	2017-03-29 22:00:14 +03:00
Onder Kalaci	11665dbe3c	Fix pushing down wrong queries for INSERT ... SELECT queries Before this commit, in certain cases router planner allowed pushing down JOINs that are not on the partition keys. With @anarazel's suggestion, we change the logic to use an uninstantiated parameter. Previously, the planner traversed the restriction information and, once it found the parameter, replaced it with the shard range. With this commit, instead of traversing the restrict infos, the planner explicitly checks for the equivalence of the relation partition key with the uninstantiated parameter. If it finds an equivalence, it adds the restrictions. In this way, we have more control over the queries that are pushed down.	2017-03-24 11:37:35 +02:00
Jason Petersen	34a62abb7d	Address code review comments	2017-03-22 17:29:17 -06:00
Jason Petersen	d95b5bbad3	Rework ReplicateGrantStmt to use new flow This was the impetus for the previous commit that changed from using a DDLJob * to a List * of them.	2017-03-22 17:29:16 -06:00
Jason Petersen	23f5e4282d	Change DDLJob usage to be wrapped in lists To prepare for GRANT fixes.	2017-03-22 17:29:16 -06:00
Jason Petersen	42c799faee	Fix MX tests Missed some of these. One had a bad DDL statement to begin with (mixed up column type and column name) and the other was just master/worker order.	2017-03-22 17:21:49 -06:00
Jason Petersen	f181b24859	Move worker execution to after master, fix tests Some tests relied on worker errors though local commands were invalid. Fixed those by ensuring preconditions were met to have command work correctly. Otherwise most test changes are related to slight changes in local/remote error ordering.	2017-03-22 17:21:49 -06:00
Jason Petersen	419a4c3745	Remove execution from stmt-specific util functions Now have a single Execute call in the main body.	2017-03-22 17:21:49 -06:00
Jason Petersen	a64165767d	Rename ProcessStmt functions to PlanStmt To reflect their new purpose planning a DDLJob rather than fully processing a distributed DDL statement.	2017-03-22 17:21:49 -06:00
Jason Petersen	a02a2a90c7	Refactor ExecuteDistDDLCommand to expect struct Will let us separate out the determination of what to execute from its actual execution.	2017-03-22 17:21:49 -06:00
Jason Petersen	2cb34406d1	Minor permissions test fix When running under Enterprise, some of the GRANT commands and whatnot are propagated. Guarding that section with a call to disable DDL prop. fixes everything.	2017-03-22 17:07:05 -06:00
Metin Doslu	404e32cdb4	Add basic permission checking tests	2017-03-22 15:25:00 -06:00
Metin Doslu	bcff6aa96c	Update regression tests for changing explain output	2017-03-22 15:25:00 -06:00
Metin Doslu	b1ee7ec93e	Fix access permission checks for distributed relations With this commit, we add the range table list of the original query to our custom plan. Therefore, PostgreSQL can check relations in the original query for access permissions and error out if the proper access is not granted.	2017-03-22 15:25:00 -06:00
Murat Tuncer	c4734d7d94	Rephrase router modify errors generic "distributed modifications must target exactly one shard" message is replaced by more context aware error messages.	2017-03-16 15:09:10 +03:00
velioglu	e32aff1a26	Size UDFs implemented citus_table_size, citus_relation_size and citus_total_relation_size UDFs are implemented.	2017-03-16 13:50:30 +03:00
Metin Doslu	1f838199f8	Use CustomScan API for query execution Custom Scan is a node in the planned statement which helps external providers to abstract data scan not just for foreign data wrappers but also for regular relations so you can benefit from your own version of caching or hardware optimizations. This sounds like only an abstraction on the data scan layer, but we can use it as an abstraction for our distributed queries. The only thing we need to do is to find distributable parts of the query, plan for them and replace them with a Citus Custom Scan. Then, whenever PostgreSQL hits this custom scan node in its Volcano style execution, it will call our callback functions which run the distributed plan and provide tuples to the upper node as if it were scanning a regular relation. This means fewer code changes, fewer bugs and more supported features for us! First, in the distributed query planner phase, we create a Custom Scan which wraps the distributed plan. For real-time and task-tracker executors, we add this custom plan under the master query plan. For router executor, we directly pass the custom plan because there is not any master query. Then, we simply let the PostgreSQL executor run this plan. When it hits the custom scan node, we call the related executor parts for distributed plan, fill the tuple store in the custom scan and return results to PostgreSQL executor in Volcano style, a tuple per XXX_ExecScan() call. * Modify planner to utilize Custom Scan node. * Create different scan methods for different executors. * Use native PostgreSQL Explain for master part of queries.	2017-03-14 12:17:51 +02:00
Andres Freund	52358fe891	Initial temp table removal implementation	2017-03-14 12:09:49 +02:00
Jason Petersen	6f4886cd11	Revert "Remove unused SendCommandToWorker" This reverts commit `c8c308c109`.	2017-03-13 15:48:51 -06:00
Murat Tuncer	f657a744d5	Enable router planner for queries on range partitioned tables Router planner now supports queries using range partitioned tables. Queries on append partitioned tables are still not supported.	2017-03-09 16:39:15 +03:00
Brian Cloutier	c8c308c109	Remove unused SendCommandToWorker	2017-03-08 16:30:23 +03:00
Brian Cloutier	a2ba565a9e	Remove unused master_stage_shard_{placement_,}row	2017-03-07 11:59:26 +03:00
Brian Cloutier	95936ff481	Remove unused master_get_round_robin_candidate_nodes	2017-03-07 11:51:24 +03:00
Brian Cloutier	807beb7bc0	Remove master_get_local_first_candidate_nodes	2017-03-07 11:50:59 +03:00
Andres Freund	fa5b8fb39f	Fix SendRemoteCommandParams() handling of a NULL MultiConnection->pgConn. (#1271) Previously we'd segfault in PQisnonblocking() which, contrary to other libpq calls, doesn't handle a NULL PGconn (because there'd be no appropriate return value for that). cr: @jasonmp85	2017-03-03 12:02:15 -07:00
Murat Tuncer	72027f2eba	Remove default clause from shard DDL when sequences are used	2017-03-01 17:32:48 +03:00
Marco Slot	bab1b65491	Fix spelling in master_initialize_node_metadata comment	2017-03-01 12:27:50 +01:00
Jason Petersen	047825c6ca	Rename misleading allowEmpty parameter Last bit of PR feedback.	2017-02-28 22:48:00 -07:00
Marco Slot	56d4d375c2	Address review feedback in create_distributed_table data loading	2017-02-28 17:39:45 +01:00
Marco Slot	db98c28354	Address review feedback in COPY refactoring	2017-02-28 17:39:45 +01:00
Marco Slot	d74fb764b1	Use CitusCopyDestReceiver for regular COPY	2017-02-28 17:24:45 +01:00
Marco Slot	d11eca7d4a	Load data into distributed table on creation	2017-02-28 17:24:45 +01:00
Marco Slot	bf3541cb24	Add CitusCopyDestReceiver infrastructure	2017-02-28 17:24:45 +01:00
Burak Velioglu	e158c7ae67	Merge branch 'master' into disallow_master_appy_delete_on_hash	2017-02-24 10:40:23 +02:00
velioglu	4dbb69cfc3	Fix error message of start_metadata_sync_to_node Single quotation mark is added around nodename to make the error code consistent with master_add_node usage.	2017-02-22 18:03:58 +03:00
Metin Doslu	ee425871ee	Get reproducible costs between different PostgreSQL versions	2017-02-22 15:40:02 +02:00
Burak Velioglu	49812ddfa0	Disallow master_apply_delete_command on hash distributed table Delete operation is blocked for any table distributed by hash using master_apply_delete_command. Suggested master_modify_multiple_shards command as a hint.	2017-02-22 11:54:46 +03:00
Andres Freund	9721e80901	Use DEBUG2 instead of DEBUG4 in INSERT SELECT tests & debug message. During later work the transaction debug output will change (as it will in postgres 10), which makes it hard to see actual changes in the INSERT ... SELECT ... test. Reduce to DEBUG2 after changing a debug message to that log level.	2017-02-20 12:56:16 +02:00
Eren Basak	df9cf346ee	Enforce statement based replication on old APIs and non-hash tables This change ignores `citus.replication_model` setting and uses the statement based replication in - Tables distributed via the old `master_create_distributed_table` function - Append and range partitioned tables, even if created via `create_distributed_table` function This seems like the easiest solution to #1191, without changing the existing behavior and harming existing users with custom scripts. This change also prevents RF>1 on streaming replicated tables on `master_create_worker_shards` Prior to this change, `master_create_worker_shards` command was not checking the replication model of the target table, thus allowing RF>1 with streaming replicated tables. With this change, `master_create_worker_shards` errors out on the case.	2017-02-16 10:37:53 -08:00
Jason Petersen	ad19b668ac	Fix tests broken by new PostgreSQL patch releases (#1220) PostgreSQL 9.5.6 and 9.6.2 were released today and broke several tests by adding TABLESPACE pg_default output to some DDL commands. Fixed all occurrences. cr: @anarazel	2017-02-09 16:53:02 -07:00
Onder Kalaci	95f8382ca2	Bugfix for creating foreign key This commit fixes a backend crash that occurred when adding a foreign key without specifying the referenced column.	2017-02-07 09:34:24 +02:00
Brian Cloutier	e6e5f63d9d	Utility hook does nothing if the extension is not loaded	2017-02-02 17:48:31 +02:00
Brian Cloutier	a30b9b93a4	Set a memory context when throwing deferred errors	2017-02-02 15:14:21 +02:00
Brian Cloutier	e3c763c3f7	Start remote transactions in master_append_table_to_shard Add a call to RemoteTransactionBeginIfNecessary so that BEGIN is actually sent to the remote connections. This means that ROLLBACK and Ctrl-C are respected and don't leave the table in a partial state.	2017-02-01 18:12:19 +02:00
Eren Basak	005c533ee8	Fix Random Fails on Travis This change fixes the random failures on Travis, which is a bug introduced with citus/#1124. Before this fix, travis was failing randomly on `check_multi_mx` test schedule, specifically in the parallel group of `multi_mx_metadata`, `multi_mx_modifications` and `multi_mx_modifying_xacts` tests. This change fixes this by serializing these three test cases.	2017-01-31 15:23:06 -08:00
Eren Basak	ae0bfb1394	Allow dropping sequences on mx workers This change allows users to drop sequences on MX workers. Previously, Citus didn't allow dropping sequences on MX workers because it could cause shards to be dropped if `DROP SEQUENCE ... CASCADE` is used. We now allow that since allowing sequence creation but not dropping hurts user experience and also may cause problems with custom Citus solutions.	2017-01-31 14:51:44 -08:00
Brian Cloutier	6843ad8e91	Fix bug where router executor sends query to failed connections	2017-01-27 09:40:30 +02:00
Brian Cloutier	1173f3f225	Refactor CheckShardPlacements - Break CheckShardPlacements into multiple functions (The most important is MarkFailedShardPlacements), so that we can get rid of the global CoordinatedTransactionUses2PC. - Call MarkFailedShardPlacements in the router executor, so we mark shards as invalid and stop using them while inside transaction blocks.	2017-01-26 13:20:45 +02:00
Murat Tuncer	0e635b69f0	Add copy failure tests inside transactions	2017-01-26 11:54:40 +03:00
Murat Tuncer	1107439ade	Fix dependent tests	2017-01-25 19:19:39 +03:00
Murat Tuncer	5194111420	Add failure case for regression tests	2017-01-25 19:19:39 +03:00
Marco Slot	f56454360c	Mark failed placements as inactive immediately after COPY	2017-01-25 19:19:39 +03:00
Marco Slot	b1626887d5	Don't mark placements inactive in COPY after successful connection	2017-01-25 19:19:38 +03:00
Marco Slot	d0c76407b8	Set placement to inactive on connection failure in COPY	2017-01-25 19:19:38 +03:00
Marco Slot	85c1a87999	Short circuit in multi_ProcessUtility on ABORT/COMMIT	2017-01-25 11:57:00 +01:00
Marco Slot	2748660b1c	Always skip foreign key validation when enable_ddl_propagation is off	2017-01-25 11:56:59 +01:00
Marco Slot	ba940a1de9	Use coordinator instead of schema node in terminology	2017-01-25 11:07:23 +01:00
Marco Slot	72725ba30c	Use bigserial instead of BIGINT in sequence error	2017-01-25 11:07:23 +01:00
Burak Yucesoy	00dd42dc44	Add ORDER BY to some tests to have consistent output	2017-01-25 11:43:25 +02:00
Eren Basak	88e9a429e1	Add Regression Tests For Querying MX Tables from Workers	2017-01-24 10:36:59 +03:00
Burak Yucesoy	d80e7849a4	Convert DropShards to use new connection API With this change DropShards function started to use new connection API. DropShards function is used by DROP TABLE, master_drop_all_shards and master_apply_delete_command, therefore all of these functions now support transactional operations. In DropShards function, if we cannot reach a node, we mark shard state of related placements as FILE_TO_DELETE and continue to drop remaining shards; however if any error occurs after establishing the connection, we ROLLBACK whole operation.	2017-01-23 21:08:41 +03:00
Burak Yucesoy	2489c59c15	In case of failed transactions update shard state only if it is FILE_FINALIZED Before this change, when a transaction failed, we update related placements shard states to FILE_INACTIVE during XACT_EVENT_PRE_COMMIT. However that means if another code block changed shard state to something else (e.g. FILE_TO_DELETE) before XACT_EVENT_PRE_COMMIT we overwrite that. To prevent that problem, in case of failure we started to change shard state, only if its current shard state is FILE_FINALIZED.	2017-01-23 21:04:57 +03:00
Burak Yucesoy	484cb12cd0	Add LoadShardPlacement UDF This UDF returns a shard placement from cache given shard id and placement id. At the moment it iterates over all shard placements of given shard by ShardPlacementList and searches given placement id in that list, which is not a good solution performance-wise. However, currently, this function will be used only when there is a failed transaction. If a need arises we can optimize this function in the future.	2017-01-23 21:04:57 +03:00
Marco Slot	1585c02322	Use placement connection API for multi-shard transactions	2017-01-23 18:34:50 +01:00
Andres Freund	6939cb8c56	Hack up PREPARE/EXECUTE for nearly all distributed queries. All router, real-time, task-tracker plannable queries should now have full prepared statement support (and even use router when possible), unless they don't go through the custom plan interface (which basically just affects LANGUAGE SQL (not plpgsql) functions). This is achieved by forcing postgres' planner to always choose a custom plan, by assigning very low costs to plans with bound parameters (i.e. ones where the postgres planner replanned the query upon EXECUTE with all parameter values provided), instead of the generic one. This requires some trickery, because for custom plans to work the costs for a non-custom plan have to be known, which means we can't error out when planning the generic plan. Instead we have to return a "faux" plan, that'd trigger an error message if executed. But due to the custom plan logic that plan will likely (unless called by an SQL function, or because we can't support that query for some reason) not be executed; instead the custom plan will be chosen.	2017-01-23 09:23:50 -08:00
Andres Freund	c244b8ef4a	Make router planner error handling more flexible. So far router planner had encapsulated different functionality in MultiRouterPlanCreate. Modifications always go through router, selects sometimes. Modifications always error out if the query is unsupported, selects return NULL. Especially the error handling is a problem for the upcoming extension of prepared statement support. Split MultiRouterPlanCreate into CreateRouterPlan and CreateModifyPlan, and change them to not throw errors. Instead errors are now reported by setting the new MultiPlan->planningError. Callers of router planner functionality now have to throw errors themselves if desired, but also can skip doing so. This is a pre-requisite for expanding prepared statement support. While touching all those lines, improve a number of error messages by getting them closer to the postgres error message guidelines.	2017-01-23 09:23:50 -08:00
Andres Freund	7681f6ab9d	Centralize more of distributed planning into CreateDistributedPlan(). The name CreatePhysicalPlan() hasn't been accurate for a while, and the split of work between multi_planner() and CreatePhysicalPlan() doesn't seem perfect. So rename to CreateDistributedPlan() and move a bit more logic in there.	2017-01-23 09:23:50 -08:00
Andres Freund	557ccc6fda	Support for deferred error messages. It can be useful, e.g. in the upcoming prepared statement support, to be able to return an error from a function that is not raised immediately, but can later be thrown. That allows e.g. to attempt to plan a statement using different methods and to create good error messages in each planner, but to only error out after all planners have been run. To enable that, create support for deferred error messages that can be created (supporting errorcode, message, detail, hint) in one function, and then thrown in a different place.	2017-01-23 09:23:50 -08:00
Andres Freund	9a82e8f06b	Make usage of static a bit more consistent in multi_planner.c.	2017-01-23 09:23:50 -08:00
Jason Petersen	56197dbdba	Add replication_model GUC This adds a replication_model GUC which is used as the replication model for any new distributed table that is not a reference table. With this change, tables with replication factor 1 are no longer implicitly MX tables. The GUC is similarly respected during empty shard creation for e.g. existing append-partitioned tables. If the model is set to streaming while replication factor is greater than one, table and shard creation routines will error until this invalid combination is corrected. Changing this parameter requires superuser permissions.	2017-01-23 09:05:14 -07:00
Brian Cloutier	fe5465aa4e	Port master_append_table_to_shard to new connection API (#1149) If any placements fail it doesn't update shard statistics on those placements. A minor enabling refactor: Make CoordinatedTransactionUses2PC public (it used to be CoordinatedTransactionUse2PC but that symbol already existed, so renamed it as well)	2017-01-23 15:57:44 +02:00
Burak Yucesoy	2e1df4c910	Reword error message for outer joins requiring repartition We changed error message which appears when user tries to execute outer join command and that command requires repartitioning. Old error message mentioned about 1-to-1 shard partitioning which may not be clear to user.	2017-01-23 10:42:36 +03:00
Marco Slot	ea855ddf86	Add an enable_deadlock_prevention flag to allow router transactions to expand to multiple nodes	2017-01-22 17:31:24 +01:00
Marco Slot	87ae26aef3	Ensure job IDs are unique across workers	2017-01-22 16:55:14 +01:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	6ec34bed84	Remove remnants of commit_protocol.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	fd717d6da9	Consistently use libpq forward declaration in remote_commands.h.	2017-01-21 09:01:14 -08:00
Andres Freund	52c3369f79	Minimal citus tools conversion to new connection API.	2017-01-21 09:01:14 -08:00
Önder Kalacı	594fa761e1	Merge branch 'master' into fix_command_counter_increment	2017-01-21 09:21:19 +02:00
Murat Tuncer	d76f781ae4	Convert multi copy to use new connection api This enables proper transactional behaviour for copy and relaxes some restrictions like combining COPY with single-row modifications. It also provides the basis for relaxing restrictions further, and for optionally allowing connection caching.	2017-01-20 19:15:19 -08:00
Jason Petersen	4e7b23472c	Change default replication factor to one Took the quick-and-dirty approach of changing it back to two during test runs. Can update tests to expect one in due time.	2017-01-20 18:56:43 -07:00
Andres Freund	3a36d32c43	Mark some now unnecessarily exposed multi_planner.c functions static.	2017-01-20 12:31:56 -08:00
Andres Freund	608bed0387	Don't duplicate planning logic in citus' explain hook. Instead use pg_plan_query() like the normal explain does, and use that to explain the query. That's important because it allows to remove the duplicated planner logic from multi_explain - and that logic is about to get more complicated.	2017-01-20 12:31:28 -08:00
Andres Freund	0f28a11970	Remove citus.explain_multi_logical/physical_plan. They make fixing explain for prepared statement harder, and they don't really fit into EXPLAIN in the first place. Additionally they're currently not exercised in any tests.	2017-01-20 12:31:19 -08:00
Onder Kalaci	bd825be340	Improve heap access methods This commit improves heap access methods for reference table upgrade and colocation group modifications.	2017-01-20 14:53:29 +02:00
Metin Doslu	2bd8f8f12e	Add a function to delete shard metadata from MX nodes	2017-01-20 14:38:01 +02:00
Metin Doslu	93e626c896	Refactor get_shard_id_for_distribution_column() and other minor changes	2017-01-20 14:38:01 +02:00
Metin Doslu	ed77260aa1	Return a deep copy shard list from ColocatedShardIntervalList()	2017-01-20 14:38:01 +02:00
Metin Doslu	7cff8719c2	Add worker_hash() and a stub for isolate_tenant_to_new_shard()	2017-01-20 14:38:01 +02:00
Murat Tuncer	c12bd7b75e	Remove hint message from master_remove_node UDF Hint about master_disable_node was giving wrong impression to users. Removal is better than keeping it.	2017-01-18 22:33:00 -07:00
Eren Basak	4def1ca696	Prevent COPY to reference tables from worker nodes	2017-01-18 17:38:01 +03:00
Eren Basak	e7c15ecc1f	Make `upgrade_to_reference_table` function MX-compatible	2017-01-18 16:49:50 +03:00
Eren Basak	56ca590daa	Propagate metadata changes for deleted reference table placements on master_remove_node call	2017-01-18 16:00:07 +03:00
Eren Basak	be78769ae4	Propagate new reference table placement metadata on `master_add_node`	2017-01-18 15:59:06 +03:00
Eren Basak	23b2619412	Make reference table metadata synced to workers	2017-01-18 15:59:05 +03:00
Eren Basak	e44d226221	Propagate Metadata to Workers on `create_reference_table` call.	2017-01-18 11:05:24 +03:00
Eren Basak	b686d9a025	Add Sequence Support for MX Tables This change adds support for serial columns to be used with MX tables. Prior to this change, sequences of serial columns were created in all workers (for being able to create shards) but never used. With MX, we need to set the sequences so that sequences in each worker create unique values. This is done by setting the MINVALUE, MAXVALUE and START values of the sequence.	2017-01-18 09:43:38 +03:00
Eren Basak	b1ce8d61c0	Create Invalidation Trigger for pg_dist_local_group Table Updates	2017-01-18 09:43:38 +03:00
Andres Freund	bdef35ac14	Query placementId in RemoteFinalizedShardPlacementList(). Not having the id in the ShardPlacement struct causes issues while making copy use the placement aware connection management.	2017-01-17 13:27:26 -08:00
Brian Cloutier	67ee357d7f	Port WorkerShardStats to new connection API Part of the work in citusdata/citus#1101, this is a pretty direct port over to the new functions and shouldn't result in any behavior changes.	2017-01-17 17:04:37 +02:00
Brian Cloutier	b1b2b4fadf	Create ExecuteOptionalRemoteCommand A small refactor which pulls some code out of `RecoverWorkerTransactions` and into `remote_commands.c`. This code block currently only occurs in `RecoverWorkerTransactions` but will be useful to other functions shortly. Unfortunately we couldn't call it `ExecuteRemoteCommand`, that name was already taken.	2017-01-17 17:04:37 +02:00
Brian Cloutier	539a205462	Pass entire ShardPlacement into WorkerShardStats A small refactor so we'll be able to call the new connection API (which requires having a ShardPlacement) from within WorkerShardStats.	2017-01-17 17:04:37 +02:00
Andres Freund	b9385700ee	Make placement_connection.c colocation aware. Because of foreign keys and similar concerns there should only be a single modifying/DDL connection for a set of colocated placements to a node. To enforce that, placement_connection.c now has an additional hash-table keeping track of the connections to a set of colocated placements. In addition to enforcing per placement restrictions on connections, there are now very similar restrictions for sets of colocated placements.	2017-01-16 13:47:01 -08:00
Andres Freund	6972186652	Add ShardPlacement fields required for colocated placement connection mapping.	2017-01-16 13:42:54 -08:00
Andres Freund	1d79820b74	Fix use of wrong constant. This could potentially lead to spuriously shared connections if the first 63 characters of a hostname are the same.	2017-01-16 13:42:53 -08:00
Andres Freund	4b1d37b7be	Remove fields used in earlier revisions of placement_connection.c.	2017-01-16 13:42:53 -08:00
Onder Kalaci	a7ed49c16e	Improve error messages for INSERT INTO .. SELECT This commit is intended to improve the error messages while planning INSERT INTO .. SELECT queries. The main motivation for this change is that we used to map multiple cases into a single message. With this change, we added explicit error messages for many cases.	2017-01-16 12:16:14 -07:00
Burak Yucesoy	3315ae6142	Remove placement metadata of reference tables after master_remove_node With this change, we start to delete placement of reference tables at given worker node after master_remove_node UDF call. We remove placement metadata at master node but we do not drop actual shard from the worker node. There are two reasons for that decision, first, it is not critical to DROP the shards in the workers because Citus will ignore them as long as node is removed from cluster and if we add that node back to cluster we will DROP and recreate all reference tables. Second, if node is unreachable, it becomes complicated to cover failure cases and have a transaction support.	2017-01-16 11:24:56 +03:00
Murat Tuncer	e7935a3be4	Report error when original range table id is not found in NewTableId()	2017-01-13 09:39:43 +03:00
Murat Tuncer	77f8db6b14	Add view support Enables the use of views within distributed queries. Users can create and use a view on distributed tables/queries as they would with regular queries. After this change router queries will have full support for views, and insert into select queries will support reading from views but not writing into them. Outer joins have limited support and error out in certain cases, such as when a view is in the inner side of the outer join. Although PostgreSQL supports writing into views under certain circumstances, we disallowed that for distributed views.	2017-01-13 09:39:42 +03:00
Burak Yucesoy	b2c61be4a2	Add ORDER BY clause to shard state tests to have consistent output In tests related to automatic reference table creation and deletion, there were some tests whose output may change order thus creating inconsistent test results. With this change we add ORDER BY clause to related tests to have consistent output.	2017-01-13 02:42:28 +03:00
Onder Kalaci	aed5f817fa	Refactor CheckShardPlacements() and improve support for node removal This commit refactors CheckShardPlacements() so that it only considers modifyingConnection. Also, it skips nodes which are removed from the cluster.	2017-01-12 20:10:10 +02:00
Murat Tuncer	cb1dfd0a17	Add hint to errored real time queries	2017-01-12 11:33:35 +03:00
Onder Kalaci	1efa301ada	Copy on reference tables should never mark placements invalid This commit ensures that COPY does not mark the state of any reference table placement as INVALID in case of an error.	2017-01-12 02:43:41 +02:00
Eren Basak	859b920ba9	Fix escaping of workerrack in NodeListInsertCommand This change fixes a small bug about quoting of workerrack column in NodeListInsertCommand: Previous: `"..., '%s'", workerRack` Now: `"..., %s", quote_literal_cstr(workerRack)`	2017-01-11 10:18:48 +03:00
Andres Freund	b813b39241	Cache ShardPlacements in metadata cache. So far we've reloaded them frequently. Besides avoiding that cost - noticeable for some workloads with large shard counts - it makes it easier to add information to ShardPlacements that help us make placement_connection.c colocation aware.	2017-01-10 18:14:18 -08:00
Andres Freund	8cb47195ba	Make LoadShardInterval() backed by the metadata cache. Doing so requires adding a mapping from shardId to the cache entries. For that metadata_cache.c now maintains an additional hashtable. That hashtable only references shard intervals in the dist table cache.	2017-01-10 17:00:19 -08:00
Andres Freund	f6e8647337	Split DistTableCacheEntry() into separate functions. Previously the function was getting too large. Thus this splits the function into separate parts for looking up the cache entry and building the cache contents.	2017-01-10 15:23:18 -08:00
Onder Kalaci	cd8e41bb79	Fix CloseNodeConnections to actually close connections CloseNodeConnections() is supposed to close connections to a given node. However, before this commit it never actually called PQfinish() on the connections. Using CloseConnection() handles closing and all other necessary actions.	2017-01-11 01:13:58 +02:00
Murat Tuncer	739934cdcb	Fix diff option failure in regression test	2017-01-10 22:58:47 +03:00
Murat Tuncer	95862632de	Add citus tools to default configuration	2017-01-10 17:53:27 +03:00
Murat Tuncer	b93185d800	Add master_disable_node UDF We can now remove nodes from cluster regardless of them having an active shard placement.	2017-01-10 10:54:57 +03:00
Burak Yucesoy	59d3d05bc4	Error out on CTEs with data modifying statement With this change we start to error out on router planner queries where a common table expression with data-modifying statement is present. We already do not support if there is a data-modifying statement using result of the CTE, now we also error out if CTE itself is data-modifying statement.	2017-01-10 10:30:09 +02:00
Andres Freund	f2ee63d638	Change diff output to unified in pg_regress_multi.pl. Unified is better understood by a lot of people, and the default almost everywhere (including github).	2017-01-09 20:51:01 -08:00
Marco Slot	ef326b202a	PQclear in ReportResultError to prevent memory leaks	2017-01-10 02:51:39 +01:00
Marco Slot	31231ce196	Use GetNodeConnection to establish a connection in transaction recovery	2017-01-10 02:44:34 +01:00
Andres Freund	c390daed0f	Use interrupt checking libpq wrappers in router executor.	2017-01-09 14:02:45 -08:00
Andres Freund	7320c17f00	Convert router executor to placement connection management infrastructure. Remove the router specific transaction and shard management, and replace it with the new placement connection API. This mostly leaves behaviour alone, except that it is now, inside a transaction, legal to select from a shard to which no pre-existing connection exists. To simplify code the code handling task executions for select and modify has been split into two - the previous coding was starting to get confusing due to the amount of only conditionally applicable code. Modification connections & transactions are now always established in parallel, not just for reference tables.	2017-01-09 13:13:02 -08:00
Andres Freund	bfa742d794	Centralized shard/placement connection and state management. Currently there are several places in citus that map placements to connections and that manage placement health. Centralize this knowledge. Because of the centralized knowledge about which connection has previously been used for which shard/placement, this also provides the basis for relaxing restrictions around combining various forms of DDL/DML. Connections for a placement can now be acquired using GetPlacementConnection(). If the connection is used for DML or DDL the FOR_DDL/DML flags should be used respectively. If an individual remote transaction fails (but the transaction on the master succeeds) and FOR_DDL/DML have been specified, the placement is marked as invalid, unless that'd mark all placements for a shard as invalid.	2017-01-09 13:13:02 -08:00
Andres Freund	3286b99ff1	Remove useless changing of CurrentMemoryContext.	2017-01-06 09:16:45 -08:00
Andres Freund	6291998ae1	Use FinishConnectionListEstablishment() instead of manually iterating.	2017-01-06 09:16:01 -08:00
Andres Freund	d256f3fca9	Remove unused LogPreparedTransactions() function. This is unused since `92c7567008`.	2017-01-06 09:15:01 -08:00
Burak Yucesoy	9c9f479e4b	Replicate reference tables when new node is added With this change, we start to replicate all reference tables to the new node when new node is added to the cluster with master_add_node command. We also update replication factor of reference table's colocation group.	2017-01-05 14:30:41 +03:00
Burak Yucesoy	1d18950860	Modify tests to create clean workspace Since we will now replicate reference tables each time we add node, we need to ensure that test space is clean in terms of reference tables before any add node operation. For this purpose we had to change order of multi_drop_extension test which caused change of some of the colocation ids.	2017-01-05 12:22:44 +03:00
Onder Kalaci	6d050fd677	Use 2PC for reference table modification With this commit, we ensure that router executor always uses 2PC for reference table modifications and never mark the placements of it as INVALID.	2017-01-04 12:46:35 +02:00
Burak Yucesoy	31cd2357fe	Add upgrade_to_reference_table With this change we introduce new UDF, upgrade_to_reference_table, which can be used to upgrade existing broadcast tables to reference tables. For upgrading, we require that given table contains only one shard.	2017-01-02 17:54:42 +02:00
Eren Basak	7e09bd6836	Error on Unsupported Features on Workers This change makes the metadata workers error out on unsupported commands.	2017-01-02 16:03:45 +03:00
Marco Slot	59bc5972fa	Use MultiConnection in multi-shard transactions	2016-12-30 14:43:21 -07:00
Metin Doslu	1ddc70ca55	Add binary search capability to ShardIndex() Renamed FindShardIntervalIndex() to ShardIndex() and added binary search capability. It used to assume that hash partition tables are always uniformly distributed which is not true if upcoming tenant isolation feature is applied. This commit also reduces code duplication.	2016-12-30 18:55:34 +02:00
Murat Tuncer	fc01a47ea4	Add null clause test cases to router planner regression tests Router planner already handles cases when all shards are pruned out. This is about missing test cases. Notice that "column is null" and "column = null" have different shard pruning behavior.	2016-12-29 10:42:31 +03:00
Eren Basak	e43eed0f7a	Prevent Deadlock on Dropping MX Tables with Sequences This change prevents a deadlock situation during DROP TABLE on an mx table with sequences on workers with metadata.	2016-12-28 16:32:20 +03:00
Burak Yucesoy	88ee7802dd	Address Onder's comments	2016-12-28 12:26:16 +03:00
Burak Yucesoy	bb9e95e134	Error out on foreign keys with reference tables We have one replication of reference table for each node. Therefore all problems with replication factor > 1 also apply to reference tables. As a solution we will not allow foreign keys on reference tables. It is not possible to define foreign key from, to or between reference tables.	2016-12-28 10:58:26 +03:00
Murat Tuncer	2f76b4be99	Add error hint to failing modify query	2016-12-23 19:43:55 +03:00
Marco Slot	6cbc1945f9	Enable transaction recovery in connection API	2016-12-23 16:14:29 +01:00
Marco Slot	92c7567008	Convert worker_transactions to new connection API	2016-12-23 16:14:29 +01:00
Marco Slot	00d55ad957	Add a wrapper for PQsendQuery	2016-12-23 16:14:29 +01:00
Marco Slot	87c62d598e	Connectionapify SendCommandListToWorkerInSingleTransaction	2016-12-23 16:14:29 +01:00
Burak Yucesoy	0851fd2f0b	GRANT SELECT access for metadata tables to public Previously, we errored out if non-user tries to SELECT query for some metadata tables. It seems that we already GRANT SELECT access to some metadata tables but not others. With this change, we GRANT SELECT access to all existing Citus metadata tables.	2016-12-23 16:32:47 +03:00
Eren Basak	31af40cc26	Handle MX tables on workers during drop table commands	2016-12-23 15:43:32 +03:00
Eren Basak	bed2e353db	Propagate `mark_tables_colocated` changes in `pg_dist_partition` table to metadata workers.	2016-12-23 15:43:32 +03:00
Eren Basak	71d73ec5ff	Propagate DDL commands to metadata workers for MX tables	2016-12-23 15:43:32 +03:00
Eren Basak	048fddf4da	Propagate MX table and shard metadata on `create_distributed_table` call	2016-12-23 15:43:32 +03:00
Eren Basak	efcb1f9dd9	Rename multi_metadata_snapshot to multi_metadata_sync to make it include future mx metadata syncing regression tests	2016-12-23 15:43:32 +03:00
Eren Basak	61a1e487d0	Mark hash distributed tables with replication factor = 1 as streaming replicated tables (repmodel=s). This works only with `create_distributed_table` call.	2016-12-23 15:43:31 +03:00
Marco Slot	11031bcf55	Enable evaluation of stable functions in INSERT..SELECT	2016-12-23 12:47:21 +01:00
Marco Slot	d745d7bf70	Add explicit RelationShards mapping to tasks	2016-12-23 10:23:43 +01:00
Marco Slot	6852f8a951	Add shard locking UDFs	2016-12-22 11:04:34 +01:00
Burak Yücesoy	501a2ecead	Add get_distribution_value_shardid UDF (#1048) * Add get_distribution_value_shardid UDF With this UDF users can now map given distribution value to shard id. We mostly hide shardids from users to prevent unnecessary complexity but some power users might need to know about which entry/value is stored in which shard for maintenance purposes. Signature of this UDF is as follows: bigint get_distribution_value_shardid(table_name regclass, distribution_value anyelement)	2016-12-22 12:17:08 +03:00
Eren Basak	cfcb1260a2	Make git ignore multi_outer_join_reference test outputs	2016-12-21 15:58:22 +03:00
Onder Kalaci	2276e99347	Improve regression tests for multi_colocated_shard_transfer Ensure that regression tests outputs are consistent for multi_colocated_shard_transfer.	2016-12-20 14:09:35 +02:00
Onder Kalaci	9f0bd4cb36	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table without a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follow the same code-path as single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support the uniqueness constraints for any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe to join with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follow router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, schema support works on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Eren Basak	296e0bd33a	Add citus.node_connection_timeout GUC	2016-12-20 14:11:37 +03:00
Marco Slot	dd094bc372	Run copy commands in worker_merge_files_into_table as superuser	2016-12-20 10:15:42 +01:00
Marco Slot	42ff472721	Set user as pg_merge_job_* schema owner	2016-12-20 10:15:42 +01:00
Murat Tuncer	c3a60bff70	Make router planner active at all times We used to disable router planner and executor when task executor is set to task-tracker. This change enables router planning and execution at all times regardless of task execution mode. We are introducing a hidden flag enable_router_execution to enable/disable router execution. Its default value is true. User may disable router planning by setting it to false.	2016-12-20 11:24:01 +03:00
Jason Petersen	6f95875191	Add targeted VACUUM/ANALYZE support Adds support for VACUUM and ANALYZE commands which target a specific distributed table. After grabbing the appropriate locks, this implementation sends VACUUM commands to each placement (using one connection per placement). These commands are sent in parallel, so users with large tables will benefit from sharding. Except for VERBOSE, all VACUUM and ANALYZE options are supported, including the explicit column list used by ANALYZE. As with many of our utility commands, the local command also runs. In the VACUUM/ANALYZE case, the local command is executed before any remote propagation. Because error handling is managed after local processing, this can result in a VACUUM completing locally but erroring out when distributed processing commences: a minor technicality in all cases, as there isn't really much reason to ever roll back a VACUUM (an impossibility in any case, as VACUUM cannot run within a transaction). Remote propagation of targeted VACUUM/ANALYZE is controlled by the enable_ddl_propagation setting; warnings are emitted if such a command is attempted when DDL propagation is disabled. Unqualified VACUUM or ANALYZE is not handled, but a warning message informs the user of this. Implementation note: this commit adds a "BARE" value to MultiShardCommitProtocol. When active, no BEGIN command is ever sent to remote nodes, useful for commands such as VACUUM/ANALYZE which must not run in a transaction block. This value is not user-facing and is reset at transaction end.	2016-12-16 16:59:06 -07:00
Metin Doslu	20b8f1feeb	Refactor distribution column type check for colocation	2016-12-16 15:24:45 +02:00
Metin Doslu	e2d0bd38f2	Don't allow tables with different replication models to be colocated	2016-12-16 15:23:49 +02:00
Metin Doslu	86cca54857	Add colocate_with option to create_distributed_table() With this commit, we support three versions of colocate_with: i. default, ii. none and iii. a specific table name.	2016-12-16 14:53:35 +02:00
Metin Doslu	edbedbd744	Move colocation related functions to colocation_utils.c	2016-12-16 14:52:40 +02:00
Marco Slot	5714be0da5	Expose the column_to_column_name UDF to make partkey in pg_dist_partition human-readable	2016-12-14 10:46:33 +01:00
Eren Basak	afbb5ffb31	Add stop_metadata_sync_to_node UDF	2016-12-14 10:53:12 +03:00
Eren Basak	b94647c3bc	Propagate CREATE SCHEMA commands with the correct AUTHORIZATION clause in start_metadata_sync_to_node	2016-12-14 10:53:12 +03:00
Eren Basak	fb08093b00	Make start_metadata_sync_to_node UDF to propagate foreign-key constraints	2016-12-14 10:53:12 +03:00
Eren Basak	5e96e4f60e	Make truncate triggers propagated on start_metadata_sync_to_node call	2016-12-14 10:53:10 +03:00
Eren Basak	4fd086f0af	Prevent Transactions in start_metadata_sync_to_node	2016-12-13 10:48:03 +03:00
Eren Basak	c154a91621	Add Regression Tests For start_metadata_sync_to_node	2016-12-13 10:48:03 +03:00
Eren Basak	9eff968d1f	Add start_metadata_sync_to_node UDF This change adds `start_metadata_sync_to_node` UDF which copies the metadata about nodes and MX tables from master to the specified worker, sets its local group ID and marks its hasmetadata to true to allow it to receive future DDL changes.	2016-12-13 10:48:03 +03:00
Andres Freund	56aaa25cfa	Add support for running regression tests under valgrind. Note that this only provides infrastructure for running tests under valgrind - there's some spurious failures due to timeouts.	2016-12-12 15:42:11 -08:00
Andres Freund	80b34a5d6b	Integrate router executor into transaction management framework. One less place managing remote transactions. It also makes it fairly easy to use 2PC for certain modifications (e.g. reference tables). Just issue a CoordinatedTransactionUse2PC(). If every placement failure should cause the whole transaction to abort, additionally mark the relevant transactions as critical.	2016-12-12 15:18:12 -08:00
Andres Freund	fa5e202403	Convert multi_shard_transaction.[ch] to new framework.	2016-12-12 15:18:12 -08:00
Andres Freund	fc298ec095	Coordinated remote transaction management.	2016-12-12 15:18:12 -08:00
Andres Freund	6eeb43af15	Add PQgetResult() wrapper handling interrupts. This makes it possible to implement cancelling queries blocked on communication with remote nodes.	2016-12-12 15:18:12 -08:00
Andres Freund	b65814280f	Add support for running postgres tests against a database with citus loaded. Note that this is not with the citus extension loaded, just the shared library. The former runs (by adding --use-existing to the flags) but has a bunch of trivial test differences.	2016-12-12 14:25:06 -08:00
Andres Freund	7434fcc6df	Make prepared transactions available if not configured.	2016-12-08 19:57:22 -08:00
Burak Yucesoy	8d7cd4d746	Add Foreign Key Support to ALTER TABLE commands With this PR, we add foreign key support to ALTER TABLE commands. For now, we only support foreign constraint creation via ALTER TABLE query, if it is only subcommand in ALTER TABLE subcommand list. We also only allow foreign key creation if replication factor is 1.	2016-12-08 15:03:25 +02:00
Andres Freund	2374905c89	Move multi_client_executor.[ch] on top of connection_management.[ch]. That way connections can be automatically closed after errors and such, and the connection management infrastructure gets wider testing. It also fixes a few issues around connection string building.	2016-12-07 11:44:24 -08:00
Andres Freund	a77cf36778	Use connection_management.c from within connection_cache.c. This is a temporary step towards removing connection_cache.c.	2016-12-07 11:44:24 -08:00
Andres Freund	3505d431cd	Add initial helpers to make interactions with MultiConnection et al. easier. This includes basic infrastructure for logging of commands sent to remote/worker nodes. Note that this has no effect as of yet, since no callers are converted to the new infrastructure.	2016-12-07 11:44:24 -08:00
Andres Freund	3223b3c92d	Centralized Connection Lifetime Management. Connections are tracked and released by integrating into postgres' transaction handling. That allows us to use connections without having to resort to disabling interrupts or using PG_TRY/CATCH blocks to avoid leaking connections. This is intended to eventually replace multi_client_executor.c and connection_cache.c, and to provide the basis of a centralized transaction management. The newly introduced transaction hook should, in the future, be the only one in citus, to allow for proper ordering between operations. For now this central handler is responsible for releasing connections and resetting XactModificationLevel after a transaction.	2016-12-07 11:43:18 -08:00
Andres Freund	883af02b54	Add some basic helpers to make use of dynahash hashtables easier.	2016-12-06 14:15:36 -08:00
Marco Slot	3d09a2e5c2	Use READ_UINT64_FIELD for placement ID in ReadShardPlacement	2016-12-05 17:22:23 +01:00
Murat Tuncer	131ed8ca1f	Add new tests for non-relational filters in queries	2016-12-05 14:27:36 +03:00
Marco Slot	172bb457e6	Take shard metadata lock in master_append_table_to_shard	2016-12-02 15:56:30 +01:00
Eren Basak	fb88b167a7	Propagate node add/remove to the nodes with hasmetadata=true This change propagates the changes done by `master_add_node` and `master_remove_node` to the workers that contain metadata.	2016-12-02 14:43:32 +03:00
Brian Cloutier	a4096c9f45	Remove dead code: ResponsiveWorkerNodeList	2016-12-02 13:14:11 +03:00
Andres Freund	0a4889d0af	Use system psql if available, to fix travis build errors. On some systems a newer libpq is available than what we're compiling against, but until now we used psql in the version we're compiling against. That's a problem, because (quoting Jason): With 9.6, libpq's default handling of CONTEXT changed: it is hidden unless the level is ERROR or higher. We addressed this ourselves using the SHOW_CONTEXT variable (by setting "always" in pg_regress_multi): in 9.5, this is ignored (and unneeded), in 9.6, it ensures old behavior is preserved. For 9.6 we'd already worked around the problem by specifying that context should always be shown, but < 9.6 psql doesn't know how to do that. As there's no csql anymore, which strictly tied us to a specific version of psql/csql, we can now just use the system's psql if available. We still fall back to the psql of the installation we're compiling against, if there's no other psql in PATH.	2016-12-01 15:58:23 -08:00
Onder Kalaci	df974e15b8	Bugfix for deparsing INSERT..SELECT queries which involve constant values This commit fixes a bug when the SELECT target list includes a constant value. Previous behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we only considered to be DEFAULTs), generate a new SELECT target entry * If the processed target entry count in SELECT target list is less than the original SELECT target list (GROUP BY elements not included in the SELECT target entry), add them in the SELECT target list and update the resnos accordingly. * However, this step was leading to add the CONST SELECT target entries twice. The reason is that when CONST target list entries appear in the SELECT target list, the INSERT target list doesn't include a Var. Instead, it includes CONST as it does for DEFAULTs. New behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we consider to be DEFAULTs and CONSTs on the SELECT), generate a new SELECT target entry * If any target entry remains on the SELECT target list which are resjunk, (GROUP BY elements not included in the SELECT target entry), keep them in the SELECT target list by updating the resnos.	2016-12-01 10:41:56 +02:00
Murat Tuncer	45762006f3	Add support for filters Ensures filter clauses are stripped from master query, and pushed down to worker queries.	2016-12-01 08:53:46 +03:00
Sumedh Pathak	0a0d4784b9	Change DDL error message to say "unsupported" instead of "supported"	2016-11-26 10:30:09 +01:00
Murat Tuncer	b5c1ecb684	Fix failures during pg_upgrade - fix error in CitusHasBeenLoaded() - allow creation of pg_catalog tables during upgrade	2016-11-11 17:22:45 -08:00
Marco Slot	b566c4815c	Pass down the correct type for null parameters	2016-11-11 07:14:08 +01:00
Metin Doslu	a0c92b38cb	Use AccessShareLock on the source table while creating a colocated table While creating a colocated table, we don't want the source table to be dropped. However, using a ShareLock blocks DML statements on the source table, and using AccessShareLock is enough to prevent DROP. Therefore, we just loosened the lock to AccessShareLock.	2016-11-10 09:17:05 -08:00
Eren Basak	444f14d546	Add Column Definition List for Output Columns for master_add_node This change allows seeing the names of columns of `master_add_node`, using `SELECT * FROM master_add_node(...)` by specifying output columns in UDF definition.	2016-11-07 14:08:58 -08:00
Marco Slot	c157c3b419	Disallow SendCommandListToWorkerInSingleTransaction when modifications have occurred	2016-11-02 12:26:56 +01:00
Marco Slot	f6b3af7a49	Use co-located shard ID in multi_shard_transaction	2016-11-02 11:01:19 +01:00
Samay Sharma	82e5faa190	Avoid error during CREATE INDEX IF NOT EXISTS Previously, we threw an error when we ran CREATE INDEX IF NOT EXISTS with an already existing index. This change enables expected behavior by checking if the statement has IF NOT EXISTS before throwing the error. We also ensure that we don't execute the command on the workers, if an index already exists on the master.	2016-11-01 14:51:19 -07:00
Burak Yucesoy	b30b339f91	Fix typo in error message	2016-11-01 16:58:27 +02:00
Burak Yucesoy	6246702a4c	Change error message we displayed for foreign constraints if RF > 1 At the moment, we do not support foreign constraints if replication factor is greater than 1. However foreign constraints can be used in cloud with high availability option. Therefore we do not want to create an impression such that foreign constraints with high availability is not supported at all. We call users to action with this error message.	2016-11-01 15:47:19 +02:00
Önder Kalacı	83e1719541	Always CASCADE while dropping a shard	2016-11-01 10:16:34 +01:00
Brian Cloutier	50805f1e5c	Copy raw_parse_tree before using it Address citusdata/citus#922. Fixes a segfault in PG's installcheck caused by our reuse of raw_parse_tree when handling EXPLAIN EXECUTE.	2016-10-27 18:25:49 +03:00
Onder Kalaci	a43e3bad56	Improve error semantics for INSERT..SELECT With this commit, we error out if a worker query cannot be executed on all placements of a target insert shard interval.	2016-10-27 14:09:05 +03:00
Andres Freund	dfe7b357c5	Simple isolationtester dml vs. repair tests.	2016-10-27 00:31:41 -07:00
Andres Freund	121b868da5	Add very basic isolationtester infrastructure including a trivial test.	2016-10-27 00:31:41 -07:00
Andres Freund	c3e1d49e34	Don't try to shutdown servers that have not been started in regression tests. This avoids spurious output from failing shutdowns and uninitialized variable warnings if pg_regress_multi.pl fails before starting servers.	2016-10-27 00:31:41 -07:00
Metin Doslu	c6f5cabbe3	Error on different shard placement count In ErrorIfShardPlacementsNotColocated(), while checking if shards are colocated, error out if matching shard intervals have different number of shard placements.	2016-10-26 18:46:05 +03:00
Onder Kalaci	9cd549f21f	Add stub for Copy shard placement This commit does not change the current behaviour, but, helps to implement enterprise feature without any version changes.	2016-10-26 17:57:55 +03:00
Metin Doslu	4e555880b7	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Marco Slot	275378aa45	Re-acquire metadata locks in RouterExecutorStart	2016-10-26 14:34:59 +02:00
Brian Cloutier	1e6d1ef67e	Fix segfault during EXPLAIN EXECUTE Fix citusdata/citus#886 The way postgres' explain hook is designed means that our hook is never called during EXPLAIN EXECUTE. So, we special-case EXPLAIN EXECUTE by catching it in the utility hook. We then replace the EXECUTE with the original query and pass it back to Citus.	2016-10-26 15:18:42 +03:00
Burak Yucesoy	fc2fea839b	Only repair given shard Previously, when a repair is requested on a shard, we also repair all co-located shards of given shard, which may cause repairing already healthy shards. With this change, we only repair given shard.	2016-10-26 14:36:37 +03:00
Brian Cloutier	80c8cfeabe	Don't add a raw 32-bit int to tuples in create_distributed_table	2016-10-26 14:02:42 +03:00
Andres Freund	fcd150c7c8	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There are some locking issues remaining, but that's a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Onder Kalaci	1673ea937c	Feature: INSERT INTO ... SELECT This commit adds INSERT INTO ... SELECT feature for distributed tables. We implement INSERT INTO ... SELECT by pushing down the SELECT to each shard. To compute that we use the router planner, by adding an "uninstantiated" constraint that the partition column be equal to a certain value. standard_planner() distributes that constraint to all the tables where it knows how to push the restriction safely. An example is that the tables that are connected via equi joins. The router planner then iterates over the target table's shards, for each we replace the "uninstantiated" restriction, with one that PruneShardList() handles. Do so by replacing the partitioning qual parameter added in multi_planner() with the current shard's actual boundary values. Also, add the current shard's boundary values to the top level subquery to ensure that even if the partitioning qual is not distributed to all the tables, we never run the queries on the shards that don't match with the current shard boundaries. Finally, perform the normal shard pruning to decide on whether to push the query to the current shard or not. We do not support certain SQLs on the subquery, which are described/commented on ErrorIfInsertSelectQueryNotSupported(). We also added some locking on the router executor. When an INSERT/SELECT command runs on a distributed table with replication factor >1, we need to ensure that it sees the same result on each placement of a shard. So we added the ability such that router executor takes exclusive locks on shards from which the SELECT in an INSERT/SELECT reads in order to prevent concurrent changes. This is not a very optimal solution, but it's simple and correct. The citus.all_modifications_commutative can be used to avoid aggressive locking. An INSERT/SELECT whose filters are known to exclude any ongoing writes can be marked as commutative. See RequiresConsistentSnapshot() for the details. 
We also moved the decision of whether the multiPlan should be executed on the router executor or not to the planning phase. This allowed us to integrate multi task router executor tasks to the router executor smoothly.	2016-10-26 10:01:00 +03:00
Onder Kalaci	e0d83d65af	Add ability to reorder target list for INSERT/SELECT queries The necessity for this functionality comes from the fact that ruleutils.c is not supposed to be used on "rewritten" queries (i.e. ones that have been passed through QueryRewrite()). Query rewriting is the process in which views and such are expanded, and, INSERT/UPDATE targetlists are reordered to match the physical order, defaults etc. For the details of reordering, see transformInsertRow().	2016-10-26 10:00:03 +03:00
Jason Petersen	73f5b8b05f	Move all funcs to pg_catalog, add test to verify We'd been relying on a single SET search_path command in an earlier script, but a subsequent script RESET search_path, causing any further bare functions to be created in the first schema on the search path. However, starting with an older extension version and executing ALTER scripts one at a time DOES avoid putting any functions in the public namespace, so I wrote an upgrade script resilient to that, especially because PostgreSQL 9.5 will error out if a function is already in the schema it's being moved to.	2016-10-25 12:45:53 -06:00
Brian Cloutier	c6b74b023f	Treat nodePort as the 8byte number it is	2016-10-25 16:31:48 +03:00
Brian Cloutier	2e96f6ab27	Fix crash when upgrading to Citus 6 Between restart (running the new code) and ALTER EXTENSION citus UPGRADE there was an inconsistency where we assumed that pg_dist_partition had the repmodel column set. Now we give it a default value if the column doesn't exist yet.	2016-10-24 15:18:29 +03:00
Marco Slot	271b20a23e	Parallelise DDL commands	2016-10-24 12:39:08 +02:00
Burak Yucesoy	5a03acf2bf	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Marco Slot	02d2b86e68	Re-disable master evaluation for SELECT	2016-10-21 10:51:47 +02:00
Metin Doslu	405335fcee	Add create_reference_table() create_reference_table() creates a hash distributed table with shard count equal to 1 and replication factor equal to the shard_replication_factor configuration value.	2016-10-20 15:29:30 +03:00
Metin Doslu	d3e7d9dc8d	Final refactoring	2016-10-20 11:29:11 +03:00
Metin Doslu	58ac477ffb	Change return type of BuildDistributionKeyFromColumnName() to Var * BuildDistributionKeyFromColumnName() always returns a Var pointer, so there is no reason to return a Node pointer instead of a Var pointer.	2016-10-20 10:59:31 +03:00
Metin Doslu	161093908e	Convert colocationid to uint32	2016-10-20 10:59:31 +03:00
Metin Doslu	8334d853c0	Add local function GetNextShardId()	2016-10-20 10:59:31 +03:00
Metin Doslu	40bdafa8d1	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Metin Doslu	d04f4f5935	Add guc variable for shard count	2016-10-19 10:44:50 +03:00
Marco Slot	65f6d7c02a	Follow consistent execution order in parallel commands	2016-10-19 08:33:08 +02:00
Marco Slot	a497e7178c	Parallelise master_modify_multiple_shards	2016-10-19 08:33:08 +02:00
Marco Slot	9d98acfb6d	Move requiresMasterEvaluation from Task to Job	2016-10-19 08:23:06 +02:00
Marco Slot	213d8419c6	Refactor and redocument executor shard lock code	2016-10-19 08:13:35 +02:00
Andres Freund	ac14b2edbc	Support PostgreSQL 9.6 Adds support for PostgreSQL 9.6 by copying in the requisite ruleutils file and refactoring the out/readfuncs code to flexibly support the old-style copy/pasted out/readfuncs (prior to 9.6) or use extensible node APIs (in 9.6 and higher). Most version-specific code within this change is only needed to set new fields in the AggRef nodes we build for aggregations. Version-specific test output files were added in certain cases, though in most they were not necessary. Each such file begins by e.g. printing the major version in order to clarify its purpose. The comment atop citus_nodes.h details how to add support for new nodes for when that becomes necessary.	2016-10-18 16:23:55 -06:00
Murat Tuncer	b453f6c7ab	Add master_run_on_worker UDF	2016-10-18 17:59:54 +03:00
Eren Basak	cee7b54e7c	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Metin Doslu	27616cca52	Add regression tests for parameterized queries	2016-10-18 14:02:50 +03:00
Eren Basak	f3ede37c9f	Add hasmetadata column to pg_dist_node	2016-10-17 11:52:18 +03:00
Eren Basak	c7bf2021fa	Add metadata infrastructure for pg_dist_local_group table	2016-10-17 11:52:18 +03:00
Eren Basak	8f477d18f1	Add pg_dist_local_group Metadata Table This change adds the pg_dist_local_group metadata table, which indicates the group id of the current node. It is expected that this table contains one and only one row, which only contains the group id of the node as an integer.	2016-10-14 11:41:14 +03:00
Eren Basak	630f199d3c	Fix changing placement ids in metadata snapshot test	2016-10-14 11:13:16 +03:00
Brian Cloutier	6c3d79b4e7	Drop shardalias	2016-10-14 11:03:26 +03:00
Burak Yucesoy	6668d19a3b	Make shard transfer functions co-location aware With this change, master_copy_shard_placement and master_move_shard_placement functions start to copy/move given shard along with its co-located shards.	2016-10-13 18:16:40 +03:00
Metin Doslu	d03a2af778	Add HAVING support This commit completes having support in Citus by adding having support for real-time and task-tracker executors. Multiple tests are added to regression tests to cover new supported queries with having support.	2016-10-13 15:47:53 +03:00
Eren Basak	ed3af403fd	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Jason Petersen	d140d1c934	Use single-quote interpolation in partition test Noticed an old issue and this outdated comment. Figured I'd fix it.	2016-10-10 13:03:43 -06:00
Jason Petersen	bcfc58a7c7	Fix tests and tell Travis to run them all Two sets of tests are fixed by this change: * multi_agg_approximate_distinct * those in multi_task_tracker_extra_schedule The first broke when we renamed stage to load in many files and was never being run because the HyperLogLog extension wasn't easily available in Debian. Now it's in our repo, so we install it and run the test. I removed the distinct HLL target in favor of just always running it and providing an output variant to handle when the extension is absent. Basically, if PostgreSQL thinks HLL is available, the test installs it and runs normally, otherwise the absent variant is used. The second broke when I removed a test variant, erroneously believing it to be related to an older Citus version. I've added a line in that test to clarify why the variant is necessary (a practice we should widely adopt).	2016-10-07 17:32:54 -06:00
Marco Slot	33b7723530	Use UpdateShardPlacementState where appropriate	2016-10-07 11:59:20 -07:00
Andres Freund	982ad66753	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Metin Doslu	d94a65e0e9	Reduce minimum value of task_tracker_delay to 1ms	2016-10-07 09:55:56 +03:00
Brian Cloutier	9d6699b07c	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	32b2bd4ed8	Add replication model column to pg_dist_partition	2016-10-05 01:14:28 +02:00
Onder Kalaci	0993f2fb2c	Update ColocatedShardPlacementList() function name to ColocatedShardIntervalList() which was intended.	2016-10-04 09:51:42 +03:00
Marco Slot	fe3ffdb013	Avoid use of pnstrdup	2016-10-04 00:31:53 +02:00
Robin Thomas	f677fadbe6	Provides safe, idempotent shard-extended names to any object name related to a table that might be distributed, allowing any name that is within regular PostgreSQL length limits to be extended with a shard ID for use in shards on workers. Handles multi-byte character boundaries in identifiers when making prefixes for shard-extended names. Includes tests. Uses hash_any from PostgreSQL's access/hashfunc.c. Removes AppendShardIdToStringInfo() as it's used only once and arguably is best replaced there with a call to AppendShardIdToName(). Adds UDF shard_name(object_name, shard_id) to expose the shard-extended name logic to other PL/PGSQL, UDFs and scripts. Bumps version to 6.0-2 to allow for UDF to be created in migration script. Fixes citusdata/citus#781 and citusdata/citus#179.	2016-10-03 17:02:34 -04:00
Andres Freund	de32b7bbad	Don't create hash-table of zero size in TaskHashCreate(). hash_create(), called by TaskHashCreate(), doesn't work correctly for a zero sized hash table. This triggers valgrind errors, and could potentially cause crashes even without valgrind. This currently happens for Jobs with 0 tasks. These probably should be optimized away before reaching TaskHashCreate(), but that's a bigger change.	2016-10-03 13:07:43 -07:00
Andres Freund	6d050bc9f8	Initialize count_agg_clauses argument to 0. count_agg_clause adds the cost of the aggregates to the state variable, it doesn't reinitialize it. That is intentional, as it is used to incrementally add costs in some places.	2016-10-03 13:07:43 -07:00
Andres Freund	a6150c2916	Lower "waiting for activity on tasks took longer than" log level. It's perfectly normal to wait longer in several circumstances, and the output can lead to spurious regression output changes.	2016-10-03 13:07:43 -07:00
Marco Slot	a4efb60b54	Change logicalrelid type in pg_dist_partition and pg_dist_shard to regclass	2016-10-03 20:27:16 +02:00
Marco Slot	fc93974238	Remove EventInvokeTrigger from regression test output	2016-10-03 20:21:15 +02:00
Robin Thomas	c507a0df1c	During repartitions, the partitionColumnType argument sent to workers is now a `::regtype` using the qualified name of the column type, not the column type OID which may differ between master/worker nodes. Test coverage of a hash repartition using a UDT as the join column. Note that the UDFs `worker_hash_partition_table` and `worker_range_partition_table` are unchanged, and rightly expect an OID for the column type; but the planner code building the commands now allows for `::regtype` casting to do its magic. Fixes citusdata/citus#111.	2016-10-03 13:41:20 -04:00
Robin Thomas	b1493e299e	Added test coverage for partial unique indexes and exclude constraints.	2016-10-03 10:47:30 -04:00
Eren Basak	ac3a4eee21	Fix command counter increment bug Fixes citusdata/citus#714 On `InsertShardRow`, we previously called `CommandCounterIncrement()` before `CitusInvalidateRelcacheByRelid(relationId);`. This might prevent to skip invalidation of the distributed table in the next access within the same session.	2016-10-03 17:00:27 +03:00
Onder Kalaci	a533b8e7c1	Differentiate worker and master job temporary folders This commit makes it possible to create different worker and master temporary folders. This change is important for citus-mx on task-tracker execution. In simple words, on citus-mx, the worker could actually be responsible for the master tasks as well. Prior to this change, both master and worker logic on task-tracker executor was accessing and using the same files for different purposes, which was dangerous in certain cases (i.e., when task_tracker_delay is low).	2016-10-03 14:24:08 +03:00
Andres Freund	77efe7fcd4	Move task tracker lwlocks into their own tranche. RequestAddinLWLocks()/LWLockAssign() are gone in 9.6. Luckily all citus supported postgres versions support tranches, so use those.	2016-09-30 16:06:49 -06:00
Jason Petersen	f59cf2b818	Remove references to 9.4 Some still lingered.	2016-09-29 17:35:19 -06:00
Jason Petersen	37631cd132	Remove alternate multi_hash test file This was made irrelevant by Citus v5.1.0.	2016-09-29 16:43:19 -06:00
Jason Petersen	6671cf5171	Remove unused dumputils.h header Believe this was used by csql, which is now gone.	2016-09-29 15:54:38 -06:00
Jason Petersen	1c560dfa9c	Update ruleutils_95 with latest PostgreSQL changes Hand-applied changes from a diff I generated between 9.5.0 and 9.5.4.	2016-09-29 15:54:38 -06:00
Marco Slot	c4bc0742a7	Make count return 0 if all shards are pruned away Before this change, count on a distributed table returned NULL if all shards were pruned away, because on the master we replace the count(..) call with a sum(..) call to sum the counts from the shards. However, sum returns NULL when there are no rows, whereas count is expected to return 0.	2016-09-29 20:27:26 +02:00
Jason Petersen	5b80d4e8dd	Directly register multi-shard callbacks in PG_init I had changed these callbacks to use the same method I chose for the router executor (for consistency), but as that method is flawed, we now want to ensure we directly register them from PG_init as well.	2016-09-29 11:43:19 -06:00
Jason Petersen	5f6264105d	Directly register router xact callbacks in PG_init Not entirely sure why we went with the shared memory hook approach, but it causes problems (multiple registration) during crashes. Changing to a simple direct registration call from PG_init.	2016-09-29 11:43:18 -06:00
Burak Yucesoy	1ee39eb098	Internal co-location API With this commit we introduce internal API for co-location related operations.	2016-09-29 11:56:53 +03:00
Marco Slot	5cdbe2b86c	Remove copy_to_distributed_table	2016-09-28 11:27:54 -06:00
Murat Tuncer	5b42318ac4	Make where false queries router plannable	2016-09-28 18:49:26 +03:00
Murat Tuncer	c16dec88c3	Add UDF master_expire_table_cache	2016-09-28 12:08:37 +03:00
Jason Petersen	0caf0d95f1	Fix unique-violation-in-xact segfault An interaction between ReraiseRemoteError and DML transaction support causes segfaults: * ReraiseRemoteError calls PurgeConnection, freeing a connection... * That connection is still in the xactParticipantHash At transaction end, the memory in the freed connection might happen to pass the "is this connection OK?" check, causing us to try to send an ABORT over that connection. By removing it from the transaction hash before calling ReraiseRemoteError, we avoid this possibility.	2016-09-27 16:44:03 -06:00
Metin Doslu	c9dcad9b05	Pass text oid instead of invalid oid for null values Passing invalid oids even for null values in PQsendQueryParams() causes worker nodes to fail. Therefore, we pass text oid for null values.	2016-09-27 08:15:46 +03:00
Andres Freund	776b3868b9	Support NoMovement direction in router executor This is mainly interesting because it allows to use RETURN QUERY/RETURN QUERY EXECUTE and FOR ... IN .. LOOPs in plpgsql.	2016-09-26 18:28:36 -06:00
Murat Tuncer	32003c4aa1	Add tests with spaces in table names	2016-09-26 18:23:43 -06:00
Murat Tuncer	2f78fb8f1b	Remove extra space	2016-09-26 18:23:43 -06:00
Murat Tuncer	902e68c9ef	Refactor SendQueryToPlacements api	2016-09-26 18:23:43 -06:00
Murat Tuncer	6317bbe9a8	Address feedback	2016-09-26 18:23:42 -06:00
Murat Tuncer	877694296f	Fix regression test failures after rebase	2016-09-26 18:23:42 -06:00
Murat Tuncer	2eec0167be	Add support for truncate statement	2016-09-26 18:23:42 -06:00
Marco Slot	3318288d75	Fix segmentation fault in case of joins with WHERE 1=0	2016-09-26 15:12:29 +02:00
Robin Thomas	614c858375	Forbid EXCLUDE constraints on distributed tables just as we forbid UNIQUE or PRIMARY KEY constraints. Also, properly propagate valid EXCLUDE constraints to worker shard tables. If an EXCLUDE constraint includes the distribution column, the operator must be an equality operator. Tests in regression suite for exclusion constraints that include the partition column, omit it, and include it but with non-equality operator. Regression tests also verify that valid exclusion constraints are propagated to the shard tables. And the tests work in different timezones now. Fixes citusdata/citus#748 and citusdata/citus#778.	2016-09-21 14:02:42 -04:00
Metin Doslu	35eceb6cca	Remove pg_toast_* references from regression tests pg_toast_* oids are constantly changing, and this causes regression tests to fail time to time. With this commit, we remove all of the pg_toast_* references from regression test outputs.	2016-09-09 11:31:51 +03:00
Jason Petersen	74f4e0003b	Permit multiple DDL commands in a transaction Three changes here to get to true multi-statement, multi-relation DDL transactions (same functionality pre-5.2, with benefits of atomicity): 1. Changed the multi-shard utility hook to always run (consistency with router executor hook, removes ad-hoc "installed" boolean) 2. Change the global connection list in multi_shard_transaction to instead be a hash; update related functions to operate on global hash instead of local hash/global list 3. Remove check within DDL code to prevent subsequent DDL commands; place unset/reset guard around call to ConnectToNode to permit connecting to additional nodes after DDL transaction has begun In addition, code has been added to raise an error if a ROLLBACK TO SAVEPOINT is attempted (similar to router executor), and comprehensive tests execute all multi-DDL scenarios (full success, user ROLLBACK, any actual errors (say, duplicate index), partial failure (duplicate index on one node but not others), partial COMMIT (one node fails), and 2PC partial PREPARE (one node fails)). Interleavings with other commands (DML, \copy) are similarly all covered.	2016-09-08 22:35:55 -05:00
Eric B. Ridge	e80f1612a6	Add syscols in queries; extend relnames in indexes To permit use with ZomboDB (https://github.com/zombodb/zombodb), two changes were necessary: 1. Permit use of `tableoid` system column in queries 2. Extend relation names appearing in index expressions The first is accomplished by simply changing the deparse logic to allow system columns in queries destined for distributed tables. The latter was slightly more complex, given that DDL extension currently occurs on workers. But since indexes cannot reference tables other than the one being indexed, it is safe to look for any relation reference ending in a '*' character and extend their penultimate segments with a shard id. This change also adds an error to prevent users from distributing any relations using the WITH (OIDS) feature, which is unsupported.	2016-09-07 11:54:55 -05:00
Marco Slot	6f6cb1a0d6	Allow noop updates of the partition column	2016-09-07 14:22:41 +02:00
Jason Petersen	ed027f060e	Add sort call to shard placement test The comparator is kind of broken, but I think this is better than the current state of random failures.	2016-09-06 11:07:27 -05:00
Jason Petersen	b3684074f3	Fix CreateShardConnectionHash memory leak The call to hash_create specified HASH_CONTEXT without actually setting one using the provided HASHCTL. The hashes returned by this function are used locally, so simply using CurrentMemoryContext is sufficient.	2016-09-06 10:17:18 -05:00
Metin Doslu	5b50f2c333	Add complex subquery pushdown regression tests	2016-09-02 14:21:51 +03:00
Metin Doslu	7d212b847f	Add outer join clause list extraction for subquery pushdown logic In subquery pushdown, we allow outer joins if the join condition is on the partition columns. WhereClauseList() used to return all join conditions including outer joins. However, this has been changed with a commit related to outer join support on regular queries. With this commit, we refactored ExtractFromExpressionWalker() to return two lists of qualifiers. The first list is for inner join and filter clauses and the second list is for outer join clauses. Therefore, we can also use outer join clauses to check subquery pushdown prerequisites.	2016-09-02 11:54:44 +03:00
Burak Yucesoy	12d1aba1fc	Error out at master_create_distributed_table if the table has any rows Before this change, master_create_distributed_table did not check whether the given table already contained any data. If that table contains any data, making it distributed hides that data from the user. With this change, we now raise an error if the table contains data.	2016-09-01 17:42:47 +03:00
Jason Petersen	850c51947a	Re-permit DDL in transactions, selectively Recent changes to DDL and transaction logic resulted in a "regression" from the viewpoint of users. Previously, DDL commands were allowed in multi-command transaction blocks, though they were not processed in any actual transactional manner. We improved the atomicity of our DDL code, but added a restriction that DDL commands themselves must not occur in any BEGIN/END transaction block. To give users back the original functionality (and improved atomicity) we now keep track of whether a multi-command transaction has modified data (DML) or schema (DDL). Interleaving the two modification types in a single transaction is disallowed. This first step simply permits a single DDL command in such a block, admittedly an incomplete solution, but one which will permit us to add full multi-DDL command support in a subsequent commit.	2016-08-30 20:37:19 -06:00
Metin Doslu	75618fc3fb	Return false in MultiClientQueryResult() on failing query	2016-08-29 17:05:35 +03:00
Brian Cloutier	4ecd6b58fb	Remove csql, \stage is no longer needed	2016-08-26 10:41:59 +03:00
Brian Cloutier	640bb8863b	Remove check-multi-fdw tests, nobody uses Citus with fdws	2016-08-26 10:41:33 +03:00
Jason Petersen	e54d3f6d32	Rename test files with 'stage' in name Ignored FDW files as those test are being removed entirely, I believe.	2016-08-22 13:32:53 -06:00
Jason Petersen	b391abda3d	Replace verb 'stage' with 'load' in test comments "Staging table" will be the only valid use of 'stage' from now on, we will now say "load" when talking about data ingestion. If creation of shards is its own step, we'll just say "shard creation".	2016-08-22 13:24:18 -06:00
Jason Petersen	35e9f51348	Replace verb 'stage' with 'load' in schedules "Staging table" will be the only valid use of 'stage' from now on.	2016-08-22 11:48:41 -06:00
Eren Başak	0322916700	Lowercase \copy to match PostgreSQL's style for local/psql-level functions	2016-08-22 11:31:26 -06:00
Eren Basak	b513f1c911	Replace \stage With \copy on Regression Tests Fixes #547 This change removes all references to \stage in the regression tests and puts \COPY instead. Doing so changed shard counts, min/max values on some test tables (lineitem, orders, etc.).	2016-08-22 11:31:26 -06:00
Robin Thomas	010cbf16fc	Remove all usage of pg_dist_shard.shardalias in extension code. (#739 ) Remove regression test of non-null shardalias.	2016-08-19 17:06:22 +03:00
Jason Petersen	91578ff149	Remove HAVE_INTTYPES_H ifdefs I've been seeing warnings on OS X/clang for a while about these lines and finally got tired of it. The main problem is that PRIu64 expects a uint64_t but we were passing a uint64 (a PostgreSQL-defined type). In PostgreSQL 9.5, we now have INT64_MODIFIER, so can build our own zero-padded unsigned 64-bit int format modifier that expects a PostgreSQL-provided uint64 type. This simplifies the code slightly (no more ifdefs) and gets rid of the warning that's been annoying me since April (my TODO creation time).	2016-08-18 15:19:53 -06:00
Jason Petersen	900f7590ab	Fix Travis local_first_candidate_nodes failures A recent change to the image used in Travis causes some problems for the code we use here to ensure the local replica is first. Since this code is essentially dead in a post-stage world anyhow, we're OK with ripping out the tests to placate Travis.	2016-08-14 23:12:10 -06:00
Murat Tuncer	3a49cf830e	Remove a router planner test for materialized view PostgreSQL 9.5.4 stopped calling planner for materialized view create command when NO DATA option is provided. This causes our test to behave differently between pre-9.5.4 and 9.5.4.	2016-08-14 22:57:09 -06:00
Andres Freund	7fdb5fbe29	Skip over unreferenced parameters when router executing prepared statement. When an unreferenced prepared statement parameter does not explicitly have a type assigned, we cannot deserialize it, to send to the remote side. That commonly happens inside plpgsql functions, where local variables are passed in as unused prepared statement parameters.	2016-08-05 14:12:06 -07:00
Jason Petersen	eba8396501	Avoid attempting to lock invalid shard identifier A recent change generates a "dummy" shard placement with its identifier set to INVALID_SHARD_ID for SELECT queries against distributed tables with no shards. Normally, no lock is acquired for SELECT statements, but if all_modifications_commutative is set to true, we will acquire a shared lock, triggering an assertion failure within LockShardResource in the above case. The "dummy" shard placement is actually necessary to ensure such empty queries have somewhere to execute, and INVALID_SHARD_ID seems the most appropriate value for the dummy's shard identifier field, so the most straightforward fix is to just avoid locking invalid shard identifiers.	2016-08-04 13:49:51 -07:00
Metin Doslu	3ff1877108	Bump version numbers for 5.2 release	2016-08-01 13:48:24 -07:00
Marco Slot	9705cbcdf8	Rewrite WorkerShardStats to avoid invalid value bugs	2016-07-29 20:11:18 +02:00
Marco Slot	5e432449ba	Add MultiClientExecute and MultiClientValueIsNull for simple remote query execution	2016-07-29 20:07:18 +02:00
Andres Freund	63fb8311cb	Don't access pg_dist_partition->partkey directly, use heap_getattr(). Text datums can't be directly accessed via the struct equivalence trick used to access catalogs. That's because, as an optimization, they're sometimes aligned to 1 byte ("text"'s alignment), and sometimes to 4 bytes. That depends on it being a short varlena (cf. VARATT_NOT_PAD_BYTE) or not. In the case at hand here, partkey became longer than 127 characters - the boundary for short varlenas (cf. VARATT_CAN_MAKE_SHORT()). Thus it became 4 byte/int aligned. Which led to the direct struct access accessing the wrong data. The fix is simply to never access partkey that way - to enforce that, hide partkey behind the usual ifdef. Fixes: #674	2016-07-29 10:02:36 -07:00
Eren Başak	bb3893d0d8	Set 1PC as the Default Commit Protocol for DDL Commands Fixes #679 This change sets the default commit protocol for distributed DDL commands to '1pc'. If the user issues a distributed DDL command with this default setting, then once in a session, a NOTICE message is shown about using '2pc' being extra safe.	2016-07-29 16:42:55 +03:00
Jason Petersen	bedf53d566	Quick fix for possible segfault in PurgeConnection Now that connections can be acquired without going through the cache, we have to handle cases where functions assume the cache has been initialized.	2016-07-29 00:12:56 -06:00
Jason Petersen	abe7304898	Support SERIAL/BIGSERIAL non-partition columns This adds support for SERIAL/BIGSERIAL column types. Because we now can evaluate functions on the master (during execution), adding this is a matter of ensuring the table creation step works properly. To accomplish this, I've added some logic to detect sequences owned by a table (i.e. those related to its columns). Simply creating a sequence and using it in a default value is insufficient; users who do so must ensure the sequence is owned by the column using it. Fortunately, this is exactly what SERIAL and BIGSERIAL do, which is the use case we're targeting with this feature. While testing this, I found that worker_apply_shard_ddl_command actually adds shard identifiers to sequence names, though I found no places that use or test this path. I removed that code so that sequence names are not mutated and will match those used by a SERIAL default value expression. Our use of the new-to-9.5 CREATE SEQUENCE IF NOT EXISTS syntax means we are dropping support for 9.4 (which is being done regardless, but makes this change simpler). I've removed 9.4 from the Travis build matrix. Some edge cases are possible in ALTER SEQUENCE, COPY FROM (on workers), and CREATE SEQUENCE OWNED BY. I've added errors for each so that users understand when and why certain operations are prohibited.	2016-07-28 23:55:40 -06:00
Burak Yucesoy	6f20af9e38	Remove schema name parameter from API functions We remove schema name parameter from worker_fetch_foreign_file and worker_fetch_regular_table functions. We now send schema name concatenated with table name.	2016-07-28 20:41:05 +03:00
Burak Yucesoy	a649b47bac	Add old version(without schema name parameter) of api functions back Fixes #676 We added old versions (i.e. without schema name) of worker_apply_shard_ddl_command, worker_fetch_foreign_file and worker_fetch_regular_table back. During function call of one of these functions, we set schema name as public schema and call the newer version of the functions.	2016-07-28 20:40:38 +03:00
Eren Başak	8a590c9e5b	Remove AllFinalizedPlacementsAccessible Function This change removes the AllFinalizedPlacementsAccessible function since we open connections to all shard placements before any command is sent, so we immediately error out if a shard placement is not accessible.	2016-07-28 17:24:37 +03:00
Eren Başak	40f8149320	Allow Cancellation During Distributed DDL Commands This change allows users to interrupt long running DDL commands. Interrupt requests are handled after each DDL command being propagated to a shard placement, which means that generally the cancel request will be processed right after the execution of the DDL is finished in the current placement.	2016-07-28 17:12:07 +03:00
Murat Tuncer	cc33a450c4	Expand router planner coverage We can now support richer set of queries in router planner. This allow us to support CTEs, joins, window function, subqueries if they are known to be executed at a single worker with a single task (all tables are filtered down to a single shard and a single worker contains all table shards referenced in the query). Fixes : #501	2016-07-27 23:35:38 +03:00
Murat Tuncer	c20080992d	Remove PostgreSQL 9.4 support	2016-07-26 20:16:09 +03:00
Onder Kalaci	56c0b0825f	Fix bug related to poll timeout This commit fixes a bug on setting polling timeout. The code is updated to conform to the comment that is already in place.	2016-07-26 09:55:47 +03:00
Burak Yucesoy	c1a3478c3b	Remove warnings on schema creation Since now we support schema related operations, there is no need to warn user about schema usage.	2016-07-22 18:24:23 +03:00
Burak Yucesoy	bdff72ed75	Fix ALTER TABLE SET SCHEMA Fixes #132 We hook into ALTER ... SET SCHEMA and warn out if user tries to change schema of a distributed table. We also hook into ALTER TABLE ALL IN TABLE SPACE statements and warn out if citus has been loaded.	2016-07-22 17:52:40 +03:00
Murat Tuncer	5d996a6891	Fix outer join crash when subquery is flatten	2016-07-22 17:01:19 +03:00
Burak Yucesoy	b58872b441	Fix worker_fetch_regular_table with schema Fixes #504 Fixes #646 We changed signature of worker_fetch_regular_table to accept schema name as parameter to make it work with schemas.	2016-07-22 00:44:02 -06:00
Jason Petersen	5d525fba24	Permit "single-shard" transactions Allows the use of modification commands (INSERT/UPDATE/DELETE) within transaction blocks (delimited by BEGIN and ROLLBACK/COMMIT), so long as all modifications hit a subset of nodes involved in the first such command in the transaction. This does not circumvent the requirement that each individual modification command must still target a single shard. For instance, after sending BEGIN, a user might INSERT some rows to a shard replicated on two nodes. Subsequent modifications can hit other shards, so long as they are on one or both of these nodes. SAVEPOINTs are supported, though if the user actually attempts to send a ROLLBACK command that specifies a SAVEPOINT they will receive an ERROR at the end of the topmost transaction. Placements are only marked inactive if at least one replica succeeds in a transaction where others fail. Non-atomic behavior is possible if the shard targeted by the initial modification within a transaction has a higher replication factor than another shard within the same block and a node with the latter shard has a failure during the COMMIT phase. Other methods of denoting transaction blocks (multi-statement commands sent all at once and functions written in e.g. PL/pgSQL or other such languages) are not presently supported; their treatment remains the same as before.	2016-07-21 15:57:22 -06:00
Burak Yucesoy	20debfc0ee	Fix COUNT DISTINCT approximation with schema Fixes #555 Before this change, we were resolving HLL function and type Oid without qualified name. Now we find the schema name where HLL objects are stored and generate qualified names for each objects. Similar fix is also applied for cstore_table_size function call.	2016-07-21 17:29:18 +03:00
Burak Yucesoy	bca672e0a4	Fix master_apply_delete_command with schema Fixes #73	2016-07-21 15:09:20 +03:00
Burak Yucesoy	2f0158dde1	Change worker_apply_shard_ddl_command to accept schema name as parameter Fixes #565 Fixes #626 To add schema support to citus, we need to schema-prefix all table names, object names etc. in the queries sent to worker nodes. However, query deparsing is not available for most DDL commands, therefore it is not easy to generate the worker query in the master node. As a solution we are sending schema names along with shard id and query to run to worker nodes with worker_apply_shard_ddl_command. To not break the \STAGE command we pass public schema as parameter while calling worker_apply_shard_ddl_command from there. This will not cause a problem if the user uses \STAGE in a different schema because the passed schema name is used only if no schema name is given in the query.	2016-07-21 14:17:26 +03:00
Metin Doslu	a811e09dd4	Add support for prepared statements with parameterized non-partition columns in router executor	2016-07-21 11:09:28 +03:00

815 Commits (temp_tables)