citus

Commit Graph

Author	SHA1	Message	Date
Jason Petersen	56197dbdba	Add replication_model GUC This adds a replication_model GUC which is used as the replication model for any new distributed table that is not a reference table. With this change, tables with replication factor 1 are no longer implicitly MX tables. The GUC is similarly respected during empty shard creation for e.g. existing append-partitioned tables. If the model is set to streaming while replication factor is greater than one, table and shard creation routines will error until this invalid combination is corrected. Changing this parameter requires superuser permissions.	2017-01-23 09:05:14 -07:00
Brian Cloutier	fe5465aa4e	Port master_append_table_to_shard to new connection API (#1149 ) If any placements fail it doesn't update shard statistics on those placements. A minor enabling refactor: Make CoordinatedTransactionUses2PC public (it used to be CoordinatedTransactionUse2PC but that symbol already existed, so renamed it as well)	2017-01-23 15:57:44 +02:00
Burak Yucesoy	2e1df4c910	Reword error message for outer joins requiring repartition We changed error message which appears when user tries to execute outer join command and that command requires repartitioning. Old error message mentioned about 1-to-1 shard partitioning which may not be clear to user.	2017-01-23 10:42:36 +03:00
Marco Slot	ea855ddf86	Add an enable_deadlock_prevention flag to allow router transactions to expand to multiple nodes	2017-01-22 17:31:24 +01:00
Marco Slot	87ae26aef3	Ensure job IDs are unique across workers	2017-01-22 16:55:14 +01:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	6ec34bed84	Remove remnants of commit_protocol.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	52c3369f79	Minimal citus tools conversion to new connection API.	2017-01-21 09:01:14 -08:00
Önder Kalacı	594fa761e1	Merge branch 'master' into fix_command_counter_increment	2017-01-21 09:21:19 +02:00
Murat Tuncer	d76f781ae4	Convert multi copy to use new connection api This enables proper transactional behaviour for copy and relaxes some restrictions like combining COPY with single-row modifications. It also provides the basis for relaxing restrictions further, and for optionally allowing connection caching.	2017-01-20 19:15:19 -08:00
Jason Petersen	4e7b23472c	Change default replication factor to one Took the quick-and-dirty approach of changing it back to two during test runs. Can update tests to expect one in due time.	2017-01-20 18:56:43 -07:00
Andres Freund	3a36d32c43	Mark some now unnecessarily exposed multi_planner.c functions static.	2017-01-20 12:31:56 -08:00
Andres Freund	608bed0387	Don't duplicate planning logic in citus' explain hook. Instead use pg_plan_query() like the normal explain does, and use that to explain the query. That's important because it allows to remove the duplicated planner logic from multi_explain - and that logic is about to get more complicated.	2017-01-20 12:31:28 -08:00
Andres Freund	0f28a11970	Remove citus.explain_multi_logical/physical_plan. They make fixing explain for prepared statement harder, and they don't really fit into EXPLAIN in the first place. Additionally they're currently not exercised in any tests.	2017-01-20 12:31:19 -08:00
Onder Kalaci	bd825be340	Improve heap access methods This commit improves heap access methods for reference table upgrade and colocation group modifications.	2017-01-20 14:53:29 +02:00
Metin Doslu	2bd8f8f12e	Add a function to delete shard metadata from MX nodes	2017-01-20 14:38:01 +02:00
Metin Doslu	93e626c896	Refactor get_shard_id_for_distribution_column() and other minor changes	2017-01-20 14:38:01 +02:00
Metin Doslu	ed77260aa1	Return a deep copy shard list from ColocatedShardIntervalList()	2017-01-20 14:38:01 +02:00
Metin Doslu	7cff8719c2	Add worker_hash() and a stub for isolate_tenant_to_new_shard()	2017-01-20 14:38:01 +02:00
Murat Tuncer	c12bd7b75e	Remove hint message from master_remove_node UDF Hint about master_disable_node was giving wrong impression to users. Removal is better than keeping it.	2017-01-18 22:33:00 -07:00
Eren Basak	4def1ca696	Prevent COPY to reference tables from worker nodes	2017-01-18 17:38:01 +03:00
Eren Basak	e7c15ecc1f	Make `upgrade_to_reference_table` function MX-compatible	2017-01-18 16:49:50 +03:00
Eren Basak	56ca590daa	Propagate metadata changes for deleted reference table placements on master_remove_node call	2017-01-18 16:00:07 +03:00
Eren Basak	be78769ae4	Propagate new reference table placement metadata on `master_add_node`	2017-01-18 15:59:06 +03:00
Eren Basak	23b2619412	Make reference table metadata synced to workers	2017-01-18 15:59:05 +03:00
Eren Basak	e44d226221	Propagate Metadata to Workers on `create_reference_table` call.	2017-01-18 11:05:24 +03:00
Eren Basak	b686d9a025	Add Sequence Support for MX Tables This change adds support for serial columns to be used with MX tables. Prior to this change, sequences of serial columns were created in all workers (for being able to create shards) but never used. With MX, we need to set the sequences so that sequences in each worker create unique values. This is done by setting the MINVALUE, MAXVALUE and START values of the sequence.	2017-01-18 09:43:38 +03:00
Eren Basak	b1ce8d61c0	Create Invalidation Trigger for pg_dist_local_group Table Updates	2017-01-18 09:43:38 +03:00
Andres Freund	bdef35ac14	Query placementId in RemoteFinalizedShardPlacementList(). Not having the id in the ShardPlacement struct causes issues while making copy use the placement aware connection management.	2017-01-17 13:27:26 -08:00
Brian Cloutier	67ee357d7f	Port WorkerShardStats to new connection API Part of the work in citusdata/citus#1101, this is a pretty direct port over to the new functions and shouldn't result in any behavior changes.	2017-01-17 17:04:37 +02:00
Brian Cloutier	b1b2b4fadf	Create ExecuteOptionalRemoteCommand A small refactor which pulls some code out of `RecoverWorkerTransactions` and into `remote_commands.c`. This code block currently only occurs in `RecoverWorkerTransactions` but will be useful to other functions shortly. Unfortunately we couldn't call it `ExecuteRemoteCommand`, that name was already taken.	2017-01-17 17:04:37 +02:00
Brian Cloutier	539a205462	Pass entire ShardPlacement into WorkerShardStats A small refactor so we'll be able to call the new connection API (which requires having a ShardPlacement) from within WorkerShardStats.	2017-01-17 17:04:37 +02:00
Andres Freund	b9385700ee	Make placement_connection.c colocation aware. Because of foreign keys and similar concerns there should only be a single modifying/DDL connection for a set of colocated placements to a node. To enforce placement_connection.c now has an additional hash-table keeping track of the connections to a set of colocated placements. In addition to enforcing per placement restrictions on connections, there's now very similar restrictions for sets of colocated placements.	2017-01-16 13:47:01 -08:00
Andres Freund	6972186652	Add ShardPlacement fields required for colocated placement connection mapping.	2017-01-16 13:42:54 -08:00
Andres Freund	1d79820b74	Fix use of wrong constant. This could potentially lead to spuriously shared connections if the first 63 characters of a hostname are the same.	2017-01-16 13:42:53 -08:00
Andres Freund	4b1d37b7be	Remove fields used in earlier revisions of placement_connection.c.	2017-01-16 13:42:53 -08:00
Onder Kalaci	a7ed49c16e	Improve error messages for INSERT INTO .. SELECT This commit is intended to improve the error messages while planning INSERT INTO .. SELECT queries. The main motivation for this change is that we used to map multiple cases into a single message. With this change, we added explicit error messages for many cases.	2017-01-16 12:16:14 -07:00
Burak Yucesoy	3315ae6142	Remove placement metadata of reference tables after master_remove_node With this change, we start to delete placement of reference tables at given worker node after master_remove_node UDF call. We remove placement metadata at master node but we do not drop actual shard from the worker node. There are two reasons for that decision, first, it is not critical to DROP the shards in the workers because Citus will ignore them as long as node is removed from cluster and if we add that node back to cluster we will DROP and recreate all reference tables. Second, if node is unreachable, it becomes complicated to cover failure cases and have a transaction support.	2017-01-16 11:24:56 +03:00
Murat Tuncer	e7935a3be4	Report error when original range table id is not found in NewTableId()	2017-01-13 09:39:43 +03:00
Murat Tuncer	77f8db6b14	Add view support Enables use views within distributed queries. User can create and use a view on distributed tables/queries as he/she would use with regular queries. After this change router queries will have full support for views, insert into select queries will support reading from views, not writing into. Outer joins would have a limited support, and would error out at certain cases such as when a view is in the inner side of the outer join. Although PostgreSQL supports writing into views under certain circumstances. We disallowed that for distributed views.	2017-01-13 09:39:42 +03:00
Onder Kalaci	aed5f817fa	Refactor CheckShardPlacements() and improve support for node removal This commit refactors CheckShardPlacements() so that it only considers modifyingConnection. Also, it skips nodes which are removed from the cluster.	2017-01-12 20:10:10 +02:00
Murat Tuncer	cb1dfd0a17	Add hint to errored real time queries	2017-01-12 11:33:35 +03:00
Onder Kalaci	1efa301ada	Copy on reference tables should never mark placements invalid This commit ensures that COPY does not mark any placement of reference's state as INVALID in case of an error.	2017-01-12 02:43:41 +02:00
Eren Basak	859b920ba9	Fix escaping of workerrack in NodeListInsertCommand This change fixes a small bug about quoting of workerrack column in NodeListInsertCommand: Previous: `"..., '%s'", workerRack` Now: `"..., %s", quote_literal_cstr(workerRack)`	2017-01-11 10:18:48 +03:00
Andres Freund	b813b39241	Cache ShardPlacements in metadata cache. So far we've reloaded them frequently. Besides avoiding that cost - noticeable for some workloads with large shard counts - it makes it easier to add information to ShardPlacements that help us make placement_connection.c colocation aware.	2017-01-10 18:14:18 -08:00
Andres Freund	8cb47195ba	Make LoadShardInterval() backed by the metadata cache. Doing so requires adding a mapping from shardId to the cache entries. For that metadata_cache.c now maintains an additional hashtable. That hashtable only references shard intervals in the dist table cache.	2017-01-10 17:00:19 -08:00
Andres Freund	f6e8647337	Split DistTableCacheEntry() into separate functions. Previously the function was getting too large. Thus this splits the function into separate parts for looking up the cache entry and building the cache contents.	2017-01-10 15:23:18 -08:00
Onder Kalaci	cd8e41bb79	Fix CloseNodeConnections to actually close connections CloseNodeConnections() is supposed to close connections to a given node. However, before this commit it lacks to actually call PQFinish() on the connections. Using CloseConnection() handles closing and all other necessary actions.	2017-01-11 01:13:58 +02:00
Murat Tuncer	95862632de	Add citus tools to default configuration	2017-01-10 17:53:27 +03:00
Murat Tuncer	b93185d800	Add master_disable_node UDF We can now remove nodes from cluster regardless of them having an active shard placement.	2017-01-10 10:54:57 +03:00
Burak Yucesoy	59d3d05bc4	Error out on CTEs with data modifying statement With this change we start to error out on router planner queries where a common table expression with data-modifying statement is present. We already do not support if there is a data-modifying statement using result of the CTE, now we also error out if CTE itself is data-modifying statement.	2017-01-10 10:30:09 +02:00
Marco Slot	ef326b202a	PQclear in ReportResultError to prevent memory leaks	2017-01-10 02:51:39 +01:00
Marco Slot	31231ce196	Use GetNodeConnection to establish a connection in transaction recovery	2017-01-10 02:44:34 +01:00
Andres Freund	c390daed0f	Use interrupt checking libpq wrappers in router executor.	2017-01-09 14:02:45 -08:00
Andres Freund	7320c17f00	Convert router executor to placement connection management infrastructure. Remove the router specific transaction and shard management, and replace it with the new placement connection API. This mostly leaves behaviour alone, except that it is now, inside a transaction, legal to select from a shard to which no pre-existing connection exists. To simplify code the code handling task executions for select and modify has been split into two - the previous coding was starting to get confusing due to the amount of only conditionally applicable code. Modification connections & transactions are now always established in parallel, not just for reference tables.	2017-01-09 13:13:02 -08:00
Andres Freund	bfa742d794	Centralized shard/placement connection and state management. Currently there are several places in citus that map placements to connections and that manage placement health. Centralize this knowledge. Because of the centralized knowledge about which connection has previously been used for which shard/placement, this also provides the basis for relaxing restrictions around combining various forms of DDL/DML. Connections for a placement can now be acquired using GetPlacementConnection(). If the connection is used for DML or DDL the FOR_DDL/DML flags should be used respectively. If an individual remote transaction fails (but the transaction on the master succeeds) and FOR_DDL/DML have been specified, the placement is marked as invalid, unless that'd mark all placements for a shard as invalid.	2017-01-09 13:13:02 -08:00
Andres Freund	3286b99ff1	Remove useless changing of CurrentMemoryContext.	2017-01-06 09:16:45 -08:00
Andres Freund	6291998ae1	Use FinishConnectionListEstablishment() instead of manually iterating.	2017-01-06 09:16:01 -08:00
Andres Freund	d256f3fca9	Remove unused LogPreparedTransactions() function. This is unused since `92c7567008`.	2017-01-06 09:15:01 -08:00
Burak Yucesoy	9c9f479e4b	Replicate reference tables when new node is added With this change, we start to replicate all reference tables to the new node when new node is added to the cluster with master_add_node command. We also update replication factor of reference table's colocation group.	2017-01-05 14:30:41 +03:00
Onder Kalaci	6d050fd677	Use 2PC for reference table modification With this commit, we ensure that router executor always uses 2PC for reference table modifications and never mark the placements of it as INVALID.	2017-01-04 12:46:35 +02:00
Burak Yucesoy	31cd2357fe	Add upgrade_to_reference_table With this change we introduce new UDF, upgrade_to_reference_table, which can be used to upgrade existing broadcast tables reference tables. For upgrading, we require that given table contains only one shard.	2017-01-02 17:54:42 +02:00
Eren Basak	7e09bd6836	Error on Unsupported Features on Workers This change makes the metadata workers error out on unsupported commands.	2017-01-02 16:03:45 +03:00
Marco Slot	59bc5972fa	Use MultiConnection in multi-shard transactions	2016-12-30 14:43:21 -07:00
Metin Doslu	1ddc70ca55	Add binary search capability to ShardIndex() Renamed FindShardIntervalIndex() to ShardIndex() and added binary search capability. It used to assume that hash partition tables are always uniformly distributed which is not true if upcoming tenant isolation feature is applied. This commit also reduces code duplication.	2016-12-30 18:55:34 +02:00
Eren Basak	e43eed0f7a	Prevent Deadlock on Dropping MX Tables with Sequences This change prevents a deadlock situation during DROP TABLE on an mx table with sequences on workers with metadata.	2016-12-28 16:32:20 +03:00
Burak Yucesoy	88ee7802dd	Address Onder's comments	2016-12-28 12:26:16 +03:00
Burak Yucesoy	bb9e95e134	Error out on foreign keys with reference tables We have one replication of reference table for each node. Therefore all problems with replication factor > 1 also applies to reference table. As a solution we will not allow foreign keys on reference tables. It is not possible to define foreign key from, to or between reference tables.	2016-12-28 10:58:26 +03:00
Murat Tuncer	2f76b4be99	Add error hint to failing modify query	2016-12-23 19:43:55 +03:00
Marco Slot	6cbc1945f9	Enable transaction recovery in connection API	2016-12-23 16:14:29 +01:00
Marco Slot	92c7567008	Convert worker_transactions to new connection API	2016-12-23 16:14:29 +01:00
Marco Slot	00d55ad957	Add a wrapper for PQsendQuery	2016-12-23 16:14:29 +01:00
Marco Slot	87c62d598e	Connectionapify SendCommandListToWorkerInSingleTransaction	2016-12-23 16:14:29 +01:00
Burak Yucesoy	0851fd2f0b	GRANT SELECT access for metadata tables to public Previously, we errored out if non-user tries to SELECT query for some metadata tables. It seems that we already GRANT SELECT access to some metadata tables but not others. With this change, we GRANT SELECT access to all existing Citus metadata tables.	2016-12-23 16:32:47 +03:00
Eren Basak	31af40cc26	Handle MX tables on workers during drop table commands	2016-12-23 15:43:32 +03:00
Eren Basak	bed2e353db	Propagate `mark_tables_colocated` changes in `pg_dist_partition` table to metadata workers.	2016-12-23 15:43:32 +03:00
Eren Basak	71d73ec5ff	Propagate DDL commands to metadata workers for MX tables	2016-12-23 15:43:32 +03:00
Eren Basak	048fddf4da	Propagate MX table and shard metadata on `create_distributed_table` call	2016-12-23 15:43:32 +03:00
Eren Basak	61a1e487d0	Mark hash distributed tables with replication factor = 1 as streaming replicated tables (repmodel=s). This works only with `create_distributed_table` call.	2016-12-23 15:43:31 +03:00
Marco Slot	11031bcf55	Enable evaluation of stable functions in INSERT..SELECT	2016-12-23 12:47:21 +01:00
Marco Slot	d745d7bf70	Add explicit RelationShards mapping to tasks	2016-12-23 10:23:43 +01:00
Marco Slot	6852f8a951	Add shard locking UDFs	2016-12-22 11:04:34 +01:00
Burak Yücesoy	501a2ecead	Add get_distribution_value_shardid UDF (#1048 ) * Add get_distribution_value_shardid UDF With this UDF users can now map given distribution value to shard id. We mostly hide shardids from users to prevent unnecessary complexity but some power users might need to know about which entry/value is stored in which shard for maintanence purposes. Signature of this UDF is as follows; bigint get_distribution_value_shardid(table_name regclass, distribution_value anyelement)	2016-12-22 12:17:08 +03:00
Onder Kalaci	9f0bd4cb36	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table whithout a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follows the same code-path with a single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support the uniqueness constraints for any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe join with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follows router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, schema support works on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Eren Basak	296e0bd33a	Add citus.node_connection_timeout GUC	2016-12-20 14:11:37 +03:00
Marco Slot	dd094bc372	Run copy commands in worker_merge_files_into_table as superuser	2016-12-20 10:15:42 +01:00
Marco Slot	42ff472721	Set user as pg_merge_job_* schema owner	2016-12-20 10:15:42 +01:00
Murat Tuncer	c3a60bff70	Make router planner active at all times We used to disable router planner and executor when task executor is set to task-tracker. This change enables router planning and execution at all times regardless of task execution mode. We are introducing a hidden flag enable_router_execution to enable/disable router execution. Its default value is true. User may disable router planning by setting it to false.	2016-12-20 11:24:01 +03:00
Jason Petersen	6f95875191	Add targeted VACUUM/ANALYZE support Adds support for VACUUM and ANALYZE commands which target a specific distributed table. After grabbing the appropriate locks, this imple- mentation sends VACUUM commands to each placement (using one connec- tion per placement). These commands are sent in parallel, so users with large tables will benefit from sharding. Except for VERBOSE, all VACUUM and ANALYZE options are supported, including the explicit column list used by ANALYZE. As with many of our utility commands, the local command also runs. In the VACUUM/ANALYZE case, the local command is executed before any re- mote propagation. Because error handling is managed after local proc- essing, this can result in a VACUUM completing locally but erroring out when distributed processing commences: a minor technicality in all cases, as there isn't really much reason to ever roll back a VACUUM (an impossibility in any case, as VACUUM cannot run within a transaction). Remote propagation of targeted VACUUM/ANALYZE is controlled by the enable_ddl_propagation setting; warnings are emitted if such a command is attempted when DDL propagation is disabled. Unqualified VACUUM or ANALYZE is not handled, but a warning message informs the user of this. Implementation note: this commit adds a "BARE" value to MultiShard- CommitProtocol. When active, no BEGIN command is ever sent to remote nodes, useful for commands such as VACUUM/ANALYZE which must not run in a transaction block. This value is not user-facing and is reset at transaction end.	2016-12-16 16:59:06 -07:00
Metin Doslu	20b8f1feeb	Refactor distribution column type check for colocation	2016-12-16 15:24:45 +02:00
Metin Doslu	e2d0bd38f2	Don't allow tables with different replication models to be colocated	2016-12-16 15:23:49 +02:00
Metin Doslu	86cca54857	Add colocate_with option to create_distributed_table() With this commit, we support three versions of colocate_with: i.default, ii.none and iii. a specific table name.	2016-12-16 14:53:35 +02:00
Metin Doslu	edbedbd744	Move colocation related functions to colocation_utils.c	2016-12-16 14:52:40 +02:00
Marco Slot	5714be0da5	Expose the column_to_column_name UDF to make partkey in pg_dist_partition human-readable	2016-12-14 10:46:33 +01:00
Eren Basak	afbb5ffb31	Add stop_metadata_sync_to_node UDF	2016-12-14 10:53:12 +03:00
Eren Basak	b94647c3bc	Propagate CREATE SCHEMA commands with the correct AUTHORIZATION clause in start_metadata_sync_to_node	2016-12-14 10:53:12 +03:00
Eren Basak	fb08093b00	Make start_metadata_sync_to_node UDF to propagate foreign-key constraints	2016-12-14 10:53:12 +03:00
Eren Basak	5e96e4f60e	Make truncate triggers propagated on start_metadata_sync_to_node call	2016-12-14 10:53:10 +03:00
Eren Basak	4fd086f0af	Prevent Transactions in start_metadata_sync_to_node	2016-12-13 10:48:03 +03:00
Eren Basak	9eff968d1f	Add start_metadata_sync_to_node UDF This change adds `start_metadata_sync_to_node` UDF which copies the metadata about nodes and MX tables from master to the specified worker, sets its local group ID and marks its hasmetadata to true to allow it receive future DDL changes.	2016-12-13 10:48:03 +03:00
Andres Freund	80b34a5d6b	Integrate router executor into transaction management framework. One less place managing remote transactions. It also makes it fairly easy to use 2PC for certain modifications (e.g. reference tables). Just issue a CoordinatedTransactionUse2PC(). If every placement failure should cause the whole transaction to abort, additionally mark the relevant transactions as critical.	2016-12-12 15:18:12 -08:00
Andres Freund	fa5e202403	Convert multi_shard_transaction.[ch] to new framework.	2016-12-12 15:18:12 -08:00
Andres Freund	fc298ec095	Coordinated remote transaction management.	2016-12-12 15:18:12 -08:00
Andres Freund	6eeb43af15	Add PQgetResult() wrapper handling interrupts. This makes it possible to implement cancelling queries blocked on communication with remote nodes.	2016-12-12 15:18:12 -08:00
Andres Freund	7434fcc6df	Make prepared transactions available if not configured.	2016-12-08 19:57:22 -08:00
Burak Yucesoy	8d7cd4d746	Add Foreign Key Support to ALTER TABLE commands With this PR, we add foreign key support to ALTER TABLE commands. For now, we only support foreign constraint creation via ALTER TABLE query, if it is only subcommand in ALTER TABLE subcommand list. We also only allow foreign key creation if replication factor is 1.	2016-12-08 15:03:25 +02:00
Andres Freund	2374905c89	Move multi_client_executor.[ch] ontop of connection_management.[ch]. That way connections can be automatically closed after errors and such, and the connection management infrastructure gets wider testing. It also fixes a few issues around connection string building.	2016-12-07 11:44:24 -08:00
Andres Freund	a77cf36778	Use connection_management.c from within connection_cache.c. This is a temporary step towards removing connection_cache.c.	2016-12-07 11:44:24 -08:00
Andres Freund	3505d431cd	Add initial helpers to make interactions with MultiConnection et al. easier. This includes basic infrastructure for logging of commands sent to remote/worker nodes. Note that this has no effect as of yet, since no callers are converted to the new infrastructure.	2016-12-07 11:44:24 -08:00
Andres Freund	3223b3c92d	Centralized Connection Lifetime Management. Connections are tracked and released by integrating into postgres' transaction handling. That allows to to use connections without having to resort to having to disable interrupts or using PG_TRY/CATCH blocks to avoid leaking connections. This is intended to eventually replace multi_client_executor.c and connection_cache.c, and to provide the basis of a centralized transaction management. The newly introduced transaction hook should, in the future, be the only one in citus, to allow for proper ordering between operations. For now this central handler is responsible for releasing connections and resetting XactModificationLevel after a transaction.	2016-12-07 11:43:18 -08:00
Andres Freund	883af02b54	Add some basic helpers to make use of dynahash hashtables easier.	2016-12-06 14:15:36 -08:00
Marco Slot	3d09a2e5c2	Use READ_UINT64_FIELD for placement ID in ReadShardPlacement	2016-12-05 17:22:23 +01:00
Marco Slot	172bb457e6	Take shard metadata lock in master_append_table_to_shard	2016-12-02 15:56:30 +01:00
Eren Basak	fb88b167a7	Propagate node add/remove to the nodes with hasmetadata=true This change propagates the changes done by `master_add_node` and `master_remove_node` to the workers that contain metadata.	2016-12-02 14:43:32 +03:00
Brian Cloutier	a4096c9f45	Remove dead code: ResponsiveWorkerNodeList	2016-12-02 13:14:11 +03:00
Onder Kalaci	df974e15b8	Bugfix for deparsing INSERT..SELECT queries which involve constant values This commit fixes a bug when the SELECT target list includes a constant value. Previous behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we only considered to be DEFAULTs), generate a new SELECT target entry * If the processed target entry count in SELECT target list is less than the original SELECT target list (GROUP BY elements not included in the SELECT target entry), add them in the SELECT target list and update the resnos accordingly. * However, this step was leading to add the CONST SELECT target entries twice. The reason is that when CONST target list entries appear in the SELECT target list, the INSERT target list doesn't include a Var. Instead, it includes CONST as it does for DEFAULTs. New behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we consider to be DEFAULTs and CONSTs on the SELECT), generate a new SELECT target entry * If any target entry remains on the SELECT target list which are resjunk, (GROUP BY elements not included in the SELECT target entry), keep them in the SELECT target list by updating the resnos.	2016-12-01 10:41:56 +02:00
Murat Tuncer	45762006f3	Add support for filters Ensures filter clauses are stripped from master query, and pushed down to worker queries.	2016-12-01 08:53:46 +03:00
Sumedh Pathak	0a0d4784b9	Change DDL error message to say "unsupported" instead of "supported"	2016-11-26 10:30:09 +01:00
Murat Tuncer	b5c1ecb684	Fix failures during pg_upgrade - fix error in CitusHasBeenLoaded() - allow creation of pg_catalog tables during upgrade	2016-11-11 17:22:45 -08:00
Marco Slot	b566c4815c	Pass down the correct type for null parameters	2016-11-11 07:14:08 +01:00
Metin Doslu	a0c92b38cb	Use AccessShareLock on the source table while creating a colocated table While creating a colocated table, we don't want the source table to be dropped. However, using a ShareLock blocks DML statements on the source table, and using AccessShareLock is enough to prevent DROP. Therefore, we just loosened the lock to AccessShareLock.	2016-11-10 09:17:05 -08:00
Eren Basak	444f14d546	Add Column Definition List for Output Columns for master_add_node This change allows seeing the names of columns of `master_add_node`, using `SELECT * FROM master_add_node(...)` by specifying output columns in UDF definition.	2016-11-07 14:08:58 -08:00
Marco Slot	c157c3b419	Disallow SendCommandListToWorkerInSingleTransaction when modifications have occurred	2016-11-02 12:26:56 +01:00
Marco Slot	f6b3af7a49	Use co-located shard ID in multi_shard_transaction	2016-11-02 11:01:19 +01:00
Samay Sharma	82e5faa190	Avoid error during CREATE INDEX IF NOT EXISTS Previously, we threw an error when we ran CREATE INDEX IF NOT EXISTS with an already existing index. This change enables expected behavior by checking if the statement has IF NOT EXISTS before throwing the error. We also ensure that we don't execute the command on the workers, if an index already exists on the master.	2016-11-01 14:51:19 -07:00
Burak Yucesoy	b30b339f91	Fix typo in error message	2016-11-01 16:58:27 +02:00
Burak Yucesoy	6246702a4c	Change error message we displayed for foreign constraints if RF > 1 At the moment, we do not support foreign constraints if replication factor is greater than 1. However foreign constraints can be used in cloud with high availability option. Therefore we do not want to create an impression such that foreign constraints with high availability is not supported at all. We call users to action with this error message.	2016-11-01 15:47:19 +02:00
Önder Kalacı	83e1719541	Always CASCADE while dropping a shard	2016-11-01 10:16:34 +01:00
Brian Cloutier	50805f1e5c	Copy raw_parse_tree before using it Address citusdata/citus#922. Fixes a segfault in PG's installcheck caused by our reuse of raw_parse_tree when handling EXPLAIN EXECUTE.	2016-10-27 18:25:49 +03:00
Onder Kalaci	a43e3bad56	Improve error semantics for INSERT..SELECT With this commit, we error out if a worker query cannot be executed on all placements of a target insert shard interval.	2016-10-27 14:09:05 +03:00
Metin Doslu	c6f5cabbe3	Error on different shard placement count In ErrorIfShardPlacementsNotColocated(), while checking if shards are colocated, error out if matching shard intervals have different number of shard placements.	2016-10-26 18:46:05 +03:00
Onder Kalaci	9cd549f21f	Add stub for Copy shard placement This commit does not change the current behaviour, but, helps to implement enterprise feature without any version changes.	2016-10-26 17:57:55 +03:00
Metin Doslu	4e555880b7	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Marco Slot	275378aa45	Re-acquire metadata locks in RouterExecutorStart	2016-10-26 14:34:59 +02:00
Brian Cloutier	1e6d1ef67e	Fix segfault during EXPLAIN EXECUTE Fix citusdata/citus#886 The way postgres' explain hook is designed means that our hook is never called during EXPLAIN EXECUTE. So, we special-case EXPLAIN EXECUTE by catching it in the utility hook. We then replace the EXECUTE with the original query and pass it back to Citus.	2016-10-26 15:18:42 +03:00
Burak Yucesoy	fc2fea839b	Only repair given shard Previously, when a repair is requested on a shard, we also repair all co-located shards of given shard, which may cause repairing already healthy shards. With this change, we only repair given shard.	2016-10-26 14:36:37 +03:00
Brian Cloutier	80c8cfeabe	Don't add a raw 32-bit int to tuples in create_distributed_table	2016-10-26 14:02:42 +03:00
Andres Freund	fcd150c7c8	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There's some locking issues remaining, but that's a a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Onder Kalaci	1673ea937c	Feature: INSERT INTO ... SELECT This commit adds INSERT INTO ... SELECT feature for distributed tables. We implement INSERT INTO ... SELECT by pushing down the SELECT to each shard. To compute that we use the router planner, by adding an "uninstantiated" constraint that the partition column be equal to a certain value. standard_planner() distributes that constraint to all the tables where it knows how to push the restriction safely. An example is that the tables that are connected via equi joins. The router planner then iterates over the target table's shards, for each we replace the "uninstantiated" restriction, with one that PruneShardList() handles. Do so by replacing the partitioning qual parameter added in multi_planner() with the current shard's actual boundary values. Also, add the current shard's boundary values to the top level subquery to ensure that even if the partitioning qual is not distributed to all the tables, we never run the queries on the shards that don't match with the current shard boundaries. Finally, perform the normal shard pruning to decide on whether to push the query to the current shard or not. We do not support certain SQLs on the subquery, which are described/commented on ErrorIfInsertSelectQueryNotSupported(). We also added some locking on the router executor. When an INSERT/SELECT command runs on a distributed table with replication factor >1, we need to ensure that it sees the same result on each placement of a shard. So we added the ability such that router executor takes exclusive locks on shards from which the SELECT in an INSERT/SELECT reads in order to prevent concurrent changes. This is not a very optimal solution, but it's simple and correct. The citus.all_modifications_commutative can be used to avoid aggressive locking. An INSERT/SELECT whose filters are known to exclude any ongoing writes can be marked as commutative. See RequiresConsistentSnapshot() for the details. We also moved the decison of whether the multiPlan should be executed on the router executor or not to the planning phase. This allowed us to integrate multi task router executor tasks to the router executor smoothly.	2016-10-26 10:01:00 +03:00
Onder Kalaci	e0d83d65af	Add ability to reorder target list for INSERT/SELECT queries The necessity for this functionality comes from the fact that ruleutils.c is not supposed to be used on "rewritten" queries (i.e. ones that have been passed through QueryRewrite()). Query rewriting is the process in which views and such are expanded, and, INSERT/UPDATE targetlists are reordered to match the physical order, defaults etc. For the details of reordeing, see transformInsertRow().	2016-10-26 10:00:03 +03:00
Jason Petersen	73f5b8b05f	Move all funcs to pg_catalog, add test to verify We'd been relying on a single SET search_path command in an earlier script, but a subsequent script RESET search_path, causing any further bare functions to be created in the first schema on the search path. However, starting with an older extension version and executing ALTER scripts one at a time DOES avoid putting any functions in the public namespace, so I wrote an upgrade script resilient to that, especially because PostgreSQL 9.5 will error out if a function is already in the schema it's being moved to.	2016-10-25 12:45:53 -06:00
Brian Cloutier	c6b74b023f	Treat nodePort as the 8byte number it is	2016-10-25 16:31:48 +03:00
Brian Cloutier	2e96f6ab27	Fix crash when upgrading to Citus 6 Between restart (running the new code) and ALTER EXTENSION citus UPGRADE there was an inconsistency where we assumed that pg_dist_partition had the repmodel column set. Now we give it a default value if the column doesn't exist yet.	2016-10-24 15:18:29 +03:00
Marco Slot	271b20a23e	Parallelise DDL commands	2016-10-24 12:39:08 +02:00
Burak Yucesoy	5a03acf2bf	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Marco Slot	02d2b86e68	Re-disable master evaluation for SELECT	2016-10-21 10:51:47 +02:00
Metin Doslu	405335fcee	Add create_reference_table() create_reference_table() creates a hash distributed table with shard count equals to 1 and replication factor equals to shard_replication_factor configuration value.	2016-10-20 15:29:30 +03:00
Metin Doslu	d3e7d9dc8d	Final refactoring	2016-10-20 11:29:11 +03:00
Metin Doslu	58ac477ffb	Change return type of BuildDistributionKeyFromColumnName() to Var * BuildDistributionKeyFromColumnName() always returns a Var pointer, so there is no reason to return a Node pointer instead of a Var pointer.	2016-10-20 10:59:31 +03:00
Metin Doslu	161093908e	Convert colocationid to uint32	2016-10-20 10:59:31 +03:00
Metin Doslu	8334d853c0	Add local function GetNextShardId()	2016-10-20 10:59:31 +03:00
Metin Doslu	40bdafa8d1	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Metin Doslu	d04f4f5935	Add guc variable for shard count	2016-10-19 10:44:50 +03:00
Marco Slot	65f6d7c02a	Follow consistent execution order in parallel commands	2016-10-19 08:33:08 +02:00
Marco Slot	a497e7178c	Parallelise master_modify_multiple_shards	2016-10-19 08:33:08 +02:00
Marco Slot	9d98acfb6d	Move requiresMasterEvaluation from Task to Job	2016-10-19 08:23:06 +02:00
Marco Slot	213d8419c6	Refactor and redocument executor shard lock code	2016-10-19 08:13:35 +02:00
Andres Freund	ac14b2edbc	Support PostgreSQL 9.6 Adds support for PostgreSQL 9.6 by copying in the requisite ruleutils file and refactoring the out/readfuncs code to flexibly support the old-style copy/pasted out/readfuncs (prior to 9.6) or use extensible node APIs (in 9.6 and higher). Most version-specific code within this change is only needed to set new fields in the AggRef nodes we build for aggregations. Version-specific test output files were added in certain cases, though in most they were not necessary. Each such file begins by e.g. printing the major version in order to clarify its purpose. The comment atop citus_nodes.h details how to add support for new nodes for when that becomes necessary.	2016-10-18 16:23:55 -06:00
Murat Tuncer	b453f6c7ab	Add master_run_on_worker UDF	2016-10-18 17:59:54 +03:00
Eren Basak	cee7b54e7c	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Eren Basak	f3ede37c9f	Add hasmetadata column to pg_dist_node	2016-10-17 11:52:18 +03:00
Eren Basak	c7bf2021fa	Add metadata infrastructure for pg_dist_local_group table	2016-10-17 11:52:18 +03:00
Eren Basak	8f477d18f1	Add pg_dist_local_group Metadata Table This change adds the pg_dist_local_group metadata table, which indicates the group id of the current node. It is expected that this table contains one and only one row, which only contains the group id of the node as an integer.	2016-10-14 11:41:14 +03:00
Brian Cloutier	6c3d79b4e7	Drop shardalias	2016-10-14 11:03:26 +03:00
Burak Yucesoy	6668d19a3b	Make shard transfer functions co-location aware With this change, master_copy_shard_placement and master_move_shard_placement functions start to copy/move given shard along with its co-located shards.	2016-10-13 18:16:40 +03:00
Metin Doslu	d03a2af778	Add HAVING support This commit completes having support in Citus by adding having support for real-time and task-tracker executors. Multiple tests are added to regression tests to cover new supported queries with having support.	2016-10-13 15:47:53 +03:00
Eren Basak	ed3af403fd	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Marco Slot	33b7723530	Use UpdateShardPlacementState where appropriate	2016-10-07 11:59:20 -07:00
Andres Freund	982ad66753	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Metin Doslu	d94a65e0e9	Reduce minimum value of task_tracker_delay to 1ms	2016-10-07 09:55:56 +03:00
Brian Cloutier	9d6699b07c	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	32b2bd4ed8	Add replication model column to pg_dist_partition	2016-10-05 01:14:28 +02:00
Onder Kalaci	0993f2fb2c	Update ColocatedShardPlacementList() function name to ColocatedShardIntervalList() which was intented.	2016-10-04 09:51:42 +03:00
Marco Slot	fe3ffdb013	Avoid use of pnstrdup	2016-10-04 00:31:53 +02:00
Robin Thomas	f677fadbe6	Provides safe, idempotent shard-extended names to any object name related to a table that might be distributed, allowing any name that is within regular PostgreSQL length limits to be extended with a shard ID for use in shards on workers. Handles multi-byte character boundaries in identifiers when making prefixes for shard-extended names. Includes tests. Uses hash_any from PostgreSQL's access/hashfunc.c. Removes AppendShardIdToStringInfo() as it's used only once and arguably is best replaced there with a call to AppendShardIdToName(). Adds UDF shard_name(object_name, shard_id) to expose the shard-extended name logic to other PL/PGSQL, UDFs and scripts. Bumps version to 6.0-2 to allow for UDF to be created in migration script. Fixes citusdata/citus#781 and citusdata/citus#179.	2016-10-03 17:02:34 -04:00
Andres Freund	de32b7bbad	Don't create hash-table of zero size in TaskHashCreate(). hash_create(), called by TaskHashCreate(), doesn't work correctly for a zero sized hash table. This triggers valgrind errors, and could potentially cause crashes even without valgring. This currently happens for Jobs with 0 tasks. These probably should be optimized away before reaching TaskHashCreate(), but that's a bigger change.	2016-10-03 13:07:43 -07:00
Andres Freund	6d050bc9f8	Initialize count_agg_clauses argument to 0. count_agg_clause adds the cost of the aggregates to the state variable, it doesn't reinitialize it. That is intentional, as it is used to incrementally add costs in some places.	2016-10-03 13:07:43 -07:00
Andres Freund	a6150c2916	Lower "waiting for activity on tasks took longer than" log level. It's perfectly normal to wait longer in several circumstances, and the output can lead to spurious regression output changes.	2016-10-03 13:07:43 -07:00
Marco Slot	a4efb60b54	Change logicalrelid type in pg_dist_partition and pg_dist_shard to regclass	2016-10-03 20:27:16 +02:00
Robin Thomas	c507a0df1c	During repartitions, the partitionColumnType argument sent to workers is now a `::regtype` using the qualified name of the column type, not the column type OID which may differ between master/worker nodes. Test coverage of a hash reparitition using a UDT as the join column. Note that the UDFs `worker_hash_partition_table` and `worker_range_partition_table` are unchanged, and rightly expect an OID for the column type; but the planner code building the commands now allows for `::regtype` casting to do its magic. Fixes citusdata/citus#111.	2016-10-03 13:41:20 -04:00
Eren Basak	ac3a4eee21	Fix command counter increment bug Fixes citusdata/citus#714 On `InsertShardRow`, we previously called `CommandCounterIncrement()` before `CitusInvalidateRelcacheByRelid(relationId);`. This might prevent to skip invalidation of the distributed table in the next access within the same session.	2016-10-03 17:00:27 +03:00
Onder Kalaci	a533b8e7c1	Differentiate worker and master job temporary folders This commit enables to create different worker and master temporary folders. This change is important for citus-mx on task-tracker execution. In simple words, on citus-mx, the worker could actually be reponsible for the master tasks as well. Prior to this change, both master and worker logic on task-tracker executor was accessing and using the same files for different purposes which was dangerous on certain cases (i.e., when task_tracker_delay is low).	2016-10-03 14:24:08 +03:00
Andres Freund	77efe7fcd4	Move task tracker lwlocks into their own tranche. RequestAddinLWLocks()/LWLockAssign() are gone in 9.6. Luckily all citus supported postgres versions support tranches, so use those.	2016-09-30 16:06:49 -06:00
Jason Petersen	1c560dfa9c	Update ruleutils_95 with latest PostgreSQL changes Hand-applied changes from a diff I generated between 9.5.0 and 9.5.4.	2016-09-29 15:54:38 -06:00
Marco Slot	c4bc0742a7	Make count return 0 if all shards are pruned away Before this change, count on a distributed returned NULL if all shards were pruned away, because on the master we replace with count(..) call with a sum(..) call to sum the counts from the shards. However, sum returns NULL when there are no rows, whereas count is expected to return 0.	2016-09-29 20:27:26 +02:00
Jason Petersen	5b80d4e8dd	Directly register multi-shard callbacks in PG_init I had changed these callbacks to use the same method I chose for the router executor (for consistency), but as that method is flawed, we now want to ensure we directly register them from PG_init as well.	2016-09-29 11:43:19 -06:00
Jason Petersen	5f6264105d	Directly register router xact callbacks in PG_init Not entirely sure why we went with the shared memory hook approach, but it causes problems (multiple registration) during crashes. Changing to a simple direct registration call from PG_init.	2016-09-29 11:43:18 -06:00
Burak Yucesoy	1ee39eb098	Internal co-location API With this commit we introduce internal API for co-location related operations.	2016-09-29 11:56:53 +03:00
Marco Slot	5cdbe2b86c	Remove copy_to_distributed_table	2016-09-28 11:27:54 -06:00
Murat Tuncer	5b42318ac4	Make where false queries router plannable	2016-09-28 18:49:26 +03:00
Murat Tuncer	c16dec88c3	Add UDF master_expire_table_cache	2016-09-28 12:08:37 +03:00
Jason Petersen	0caf0d95f1	Fix unique-violation-in-xact segfault An interaction between ReraiseRemoteError and DML transaction support causes segfaults: * ReraiseRemoteError calls PurgeConnection, freeing a connection... * That connection is still in the xactParticipantHash At transaction end, the memory in the freed connection might happen to pass the "is this connection OK?" check, causing us to try to send an ABORT over that connection. By removing it from the transaction hash before calling ReraiseRemoteError, we avoid this possibility.	2016-09-27 16:44:03 -06:00
Metin Doslu	c9dcad9b05	Pass text oid inteads of invalid oid for null values Passing invalid oids even for null values in PQsendQueryParams() causes worker nodes to fail. Therefore, we pass text oid for null values.	2016-09-27 08:15:46 +03:00
Andres Freund	776b3868b9	Support NoMovement direction in router executor This is mainly interesting because it allows to use RETURN QUERY/RETURN QUERY EXECUTE and FOR ... IN .. LOOPs in plpgsql.	2016-09-26 18:28:36 -06:00
Murat Tuncer	2f78fb8f1b	Remove extra space	2016-09-26 18:23:43 -06:00
Murat Tuncer	902e68c9ef	Refactor SendQueryToPlacements api	2016-09-26 18:23:43 -06:00
Murat Tuncer	6317bbe9a8	Address feedback	2016-09-26 18:23:42 -06:00
Murat Tuncer	2eec0167be	Add support for truncate statement	2016-09-26 18:23:42 -06:00
Marco Slot	3318288d75	Fix segmentation fault in case of joins with WHERE 1=0	2016-09-26 15:12:29 +02:00
Robin Thomas	614c858375	Forbid EXCLUDE constraints on distributed tables just as we forbid UNIQUE or PRIMARY KEY constraints. Also, properly propagate valid EXCLUDE constraints to worker shard tables. If an EXCLUDE constraint includes the distribution column, the operator must be an equality operator. Tests in regression suite for exclusion constraints that include the partition column, omit it, and include it but with non-equality operator. Regression tests also verify that valid exclusion constraints are propagated to the shard tables. And the tests work in different timezones now. Fixes citusdata/citus#748 and citusdata/citus#778.	2016-09-21 14:02:42 -04:00

... 2 3 4 5 6 ...

526 Commits (2204da19f0c110afd5bd48863d0d25226ca83543)