Commit Graph

31 Commits (d1a3b20115fe1cc129f2d14daa62cdc13043b95d)

Author SHA1 Message Date
Brian Cloutier 0db8277266 remove unused errno import 2017-11-14 13:09:34 -08:00
Marco Slot c2f8bafa05 Fix shard creation vs. pg_dist_node change locking 2017-08-09 14:09:54 +02:00
Brian Cloutier ec99f8f983 Add nodeRole column
- master_add_node enforces that there is only one primary per group
- there's also a trigger on pg_dist_node to prevent multiple primaries
  per group
- functions in metadata cache only return primary nodes
- Rename ActiveWorkerNodeList -> ActivePrimaryNodeList
- Rename WorkerGetLive{Node->Group}Count()
- Refactor WorkerGetRandomCandidateNode
- master_remove_node only complains about active shard placements if the
  node being removed is a primary.
- master_remove_node only deletes all reference table placements in the
  group if the node being removed is the primary.
- Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it
  already had.
- Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also
  reflects the behavior it already had, but the new signature forces the
  caller to pass in a groupId
- Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count
2017-07-24 11:57:46 +03:00
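The behavior described above can be illustrated with a catalog query like the following sketch; it assumes pg_dist_node exposes the existing nodeid, groupid, nodename and nodeport columns alongside the new noderole column:

    -- list each node with its group and role; the trigger described above
    -- rejects a second 'primary' row for the same groupid
    SELECT nodeid, groupid, nodename, nodeport, noderole
    FROM pg_dist_node
    ORDER BY groupid, noderole;
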
velioglu 6ea15fbb25 Make create_distributed_table transactional 2017-07-18 12:35:40 +03:00
Brian Cloutier 7ad95b53d2 Rename pg_dist_shard_placement -> pg_dist_placement
Comes with a few changes:

- Change the signature of some functions to accept groupid
  - InsertShardPlacementRow
  - DeleteShardPlacementRow
  - UpdateShardPlacementState

- NodeHasActiveShardPlacements returns true if the group the node is a
  part of has any active shard placements

- TupleToShardPlacement now returns ShardPlacements which have NULL
  nodeName and nodePort.

- Populate (nodeName, nodePort) when creating ShardPlacements
- Disallow removing a node if it contains any shard placements

- DeleteAllReferenceTablePlacementsFromNode matches based on group. This
  doesn't change behavior for now (while there is only one node per
  group), but it means that in the future callers should be careful about
  calling it on a secondary node, since it will delete placements on the
  primary.

- Create concept of a GroupShardPlacement, which represents an actual
  tuple in pg_dist_placement and is distinct from a ShardPlacement,
  which has been resolved to a specific node. In the future
  ShardPlacement should be renamed to NodeShardPlacement.

- Create some triggers which allow existing code to continue to insert
  into and update pg_dist_shard_placement as if it still existed.
2017-07-12 14:17:31 +02:00
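Because pg_dist_placement stores a groupid instead of a (nodeName, nodePort) pair, resolving a placement to a concrete node now means joining against pg_dist_node, roughly as in this sketch (column names are assumptions based on the description above):

    -- resolve group-based placements back to node names and ports
    SELECT p.shardid, p.shardstate, n.nodename, n.nodeport
    FROM pg_dist_placement p
    JOIN pg_dist_node n USING (groupid)
    ORDER BY p.shardid;
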
Burak Yucesoy c8b9e4011b Remove LockRelationDistributionMetadata function 2017-07-10 15:46:37 +03:00
Burak Yucesoy 9fb15c439c Add version checks to necessary UDFs 2017-05-22 09:53:29 +03:00
Burak Yucesoy e9095e62ec Decouple reference table replication
With this change we add an option to add a node without replicating all reference
tables to that node. If a node is added with this option, we mark the node as
inactive and no queries will be sent to that node.

We also added two new UDFs:
 - master_activate_node(host, port):
    - marks node as active and replicates all reference tables to that node
 - master_add_inactive_node(host, port):
    - only adds node to pg_dist_node
2017-04-17 13:33:31 +03:00
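Using the two UDFs named above, adding a node without immediately replicating reference tables to it might look like this sketch (host and port are placeholders):

    -- register the node but leave it inactive; no queries are routed to it yet
    SELECT master_add_inactive_node('worker-101', 5432);

    -- later: replicate all reference tables to the node and mark it active
    SELECT master_activate_node('worker-101', 5432);
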
Murat Tuncer 72027f2eba Remove default clause from shard DDL when sequences are used 2017-03-01 17:32:48 +03:00
Eren Basak df9cf346ee Enforce statement based replication on old APIs and non-hash tables
This change ignores the `citus.replication_model` setting and uses
statement-based replication for

- Tables distributed via the old `master_create_distributed_table` function
- Append- and range-partitioned tables, even if created via the
`create_distributed_table` function

This seems like the easiest solution to #1191, without changing the existing
behavior and harming existing users with custom scripts.

This change also prevents RF > 1 on streaming-replicated tables in `master_create_worker_shards`.

Prior to this change, the `master_create_worker_shards` command did not check
the replication model of the target table, thus allowing RF > 1 with
streaming-replicated tables. With this change, `master_create_worker_shards`
errors out in that case.
2017-02-16 10:37:53 -08:00
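For example, even with the GUC set to streaming, a table distributed through the old API now uses statement-based replication, so a higher replication factor remains allowed (a sketch with hypothetical table and column names):

    SET citus.replication_model TO 'streaming';

    -- distributed via the old API: the streaming setting is ignored and
    -- statement-based replication is used instead
    SELECT master_create_distributed_table('events', 'tenant_id', 'hash');
    SELECT master_create_worker_shards('events', 8, 2);  -- RF 2 is allowed here
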
Marco Slot ba940a1de9 Use coordinator instead of schema node in terminology 2017-01-25 11:07:23 +01:00
Jason Petersen 56197dbdba Add replication_model GUC
This adds a replication_model GUC which is used as the replication
model for any new distributed table that is not a reference table.
With this change, tables with replication factor 1 are no longer
implicitly MX tables.

The GUC is similarly respected during empty shard creation for e.g.
existing append-partitioned tables. If the model is set to streaming
while replication factor is greater than one, table and shard creation
routines will error until this invalid combination is corrected.

Changing this parameter requires superuser permissions.
2017-01-23 09:05:14 -07:00
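A sketch of how the GUC interacts with table creation, assuming citus.shard_replication_factor is the companion setting controlling the replication factor (table names are placeholders):

    -- opt in to streaming replication for new distributed tables
    SET citus.replication_model TO 'streaming';
    SET citus.shard_replication_factor TO 1;
    SELECT create_distributed_table('events', 'tenant_id');

    -- with the streaming model and a replication factor above one,
    -- table and shard creation routines error out
    SET citus.shard_replication_factor TO 2;
    SELECT create_distributed_table('clicks', 'tenant_id');  -- errors
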
Andres Freund 78b085106a Remove connection_cache.[ch]. 2017-01-21 09:01:15 -08:00
Metin Doslu 93e626c896 Refactor get_shard_id_for_distribution_column() and other minor changes 2017-01-20 14:38:01 +02:00
Andres Freund b813b39241 Cache ShardPlacements in metadata cache.
So far we've reloaded them frequently. Besides avoiding that cost -
noticeable for some workloads with large shard counts - it makes it
easier to add information to ShardPlacements that helps us make
placement_connection.c colocation aware.
2017-01-10 18:14:18 -08:00
Eren Basak 7e09bd6836 Error on Unsupported Features on Workers
This change makes the metadata workers error out on unsupported commands.
2017-01-02 16:03:45 +03:00
Metin Doslu 1ddc70ca55 Add binary search capability to ShardIndex()
Renamed FindShardIntervalIndex() to ShardIndex() and added binary search
capability. It used to assume that hash-partitioned tables are always
uniformly distributed, which is not true once the upcoming tenant isolation
feature is applied. This commit also reduces code duplication.
2016-12-30 18:55:34 +02:00
Onder Kalaci 9f0bd4cb36 Reference Table Support - Phase 1
With this commit, we implemented some basic features of reference tables.

To start with, a reference table is
  * a distributed table without a distribution column defined on it
  * the distributed table is single sharded
  * and the shard is replicated to all nodes

Reference tables follow the same code path as single-sharded
tables. Thus, broadcast JOINs are applicable to reference tables.
But since the table is replicated to all nodes, table fetching is
no longer required.

Reference tables support uniqueness constraints on any column.

Reference tables can be used in INSERT INTO .. SELECT queries with
the following rules:
  * If a reference table is in the SELECT part of the query, it is
    safe to join with another reference table and/or hash-partitioned
    tables.
  * If a reference table is in the INSERT part of the query, all
    other participating tables should be reference tables.

Reference tables follow the regular co-location structure. Since
all reference tables are single sharded and replicated to all nodes,
they are always co-located with each other.

Queries involving only reference tables always follow the router planner
and executor.

Reference tables can have composite-typed columns and there is no need
to create/define the necessary support functions.

All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE,
sequences, transactions, COPY, and schema support work on reference
tables as expected. Plus, all the prerequisites associated with
distribution columns no longer apply.
2016-12-20 14:09:35 +02:00
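A brief usage sketch; the create_reference_table() UDF name and the table names are assumptions, since the message above does not spell out the creation function:

    -- a reference table: no distribution column, a single shard replicated to all nodes
    SELECT create_reference_table('countries');

    -- allowed: the reference table appears in the SELECT part and is joined
    -- with a hash-partitioned table
    INSERT INTO order_summaries
    SELECT o.tenant_id, c.name, count(*)
    FROM orders o JOIN countries c ON o.country_code = c.code
    GROUP BY o.tenant_id, c.name;
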
Metin Doslu a0c92b38cb Use AccessShareLock on the source table while creating a colocated table
While creating a colocated table, we don't want the source table to be dropped.
However, using a ShareLock blocks DML statements on the source table, and
using AccessShareLock is enough to prevent DROP. Therefore, we just loosened
the lock to AccessShareLock.
2016-11-10 09:17:05 -08:00
Metin Doslu 4e555880b7 Add mark_tables_colocated() to update colocation groups
Added a new UDF, mark_tables_colocated(), to colocate tables with the same
configuration (shard count, shard replication count and distribution column type).
2016-10-26 17:29:03 +03:00
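A usage sketch, assuming the function takes a source table followed by an array of tables to place into the same co-location group:

    -- mark 'orders' and 'payments' as co-located with 'customers'; all three
    -- must share shard count, replication factor and distribution column type
    SELECT mark_tables_colocated('customers', ARRAY['orders', 'payments']);
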
Burak Yucesoy 5a03acf2bf Foreign Constraint Support for create_distributed_table and shard move
With this change, we now push down foreign key constraints created during CREATE TABLE
statements. We also start sending foreign key constraints during shard moves along with
other DDL statements.
2016-10-21 15:38:55 +03:00
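A rough sketch of the first part (hypothetical tables; the referenced table must be distributed before the referencing one, and the foreign key must include the distribution key):

    CREATE TABLE customers (tenant_id bigint, id bigint, PRIMARY KEY (tenant_id, id));
    CREATE TABLE orders (
        tenant_id bigint,
        id bigint,
        customer_id bigint,
        FOREIGN KEY (tenant_id, customer_id) REFERENCES customers (tenant_id, id)
    );

    -- distributing both tables on tenant_id pushes the foreign key down to the shards
    SELECT create_distributed_table('customers', 'tenant_id');
    SELECT create_distributed_table('orders', 'tenant_id');
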
Metin Doslu d3e7d9dc8d Final refactoring 2016-10-20 11:29:11 +03:00
Metin Doslu 8334d853c0 Add local function GetNextShardId() 2016-10-20 10:59:31 +03:00
Metin Doslu 40bdafa8d1 Add create_distributed_table()
create_distributed_table() creates a hash distributed table with default values
of shard count and shard replication factor.
2016-10-20 10:58:25 +03:00
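A usage sketch; the table and column names are placeholders, and the defaults presumably come from the citus.shard_count and citus.shard_replication_factor settings:

    -- hash-distribute the table on repo_id using the default shard count
    -- and shard replication factor
    SELECT create_distributed_table('github_events', 'repo_id');
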
Burak Yucesoy 2f0158dde1 Change worker_apply_shard_ddl_command to accept schema name as parameter
Fixes #565
Fixes #626

To add schema support to Citus, we need to schema-prefix all table names, object names, etc.
in the queries sent to worker nodes. However, query deparsing is not available for most
DDL commands, therefore it is not easy to generate the worker query on the master node.

As a solution we send the schema name along with the shard id and the query to run to worker
nodes with worker_apply_shard_ddl_command.

To avoid breaking the \STAGE command we pass the public schema as a parameter when calling
worker_apply_shard_ddl_command from there. This does not cause problems if the user runs \STAGE
in a different schema, because the passed schema name is used only if no schema name is
given in the query.
2016-07-21 14:17:26 +03:00
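A sketch of the new call shape on a worker, assuming the argument order is (shard id, schema name, DDL command); the values are placeholders:

    -- the DDL command is rewritten against shard 102008 in schema 'my_schema'
    SELECT worker_apply_shard_ddl_command(
        102008,
        'my_schema',
        'ALTER TABLE my_schema.events ADD COLUMN payload jsonb'
    );
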
Burak Yücesoy 323f1151e0 Fix wrong storage type for foreign tables
Fixes #496

Previously we did not check whether a table was foreign while creating empty
shards, and set the storage type to 't' (standard table) or 'c' (columnar table). Now,
if the table is a foreign table (but not a CStore foreign table), we set its storage
type to 'f' (foreign table). If it is a CStore foreign table, we set its storage
type to 'c', i.e. columnar tables take priority over foreign tables.

Please note that 'c' is only used for CStore tables, not for other possible
columnar stores at the moment. A possible improvement would be to check for other
columnar stores, though I am not sure whether there is a way to check this for all
other columnar stores.
2016-06-08 04:12:01 +03:00
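The storage type lands in the shardstorage column of pg_dist_shard, so the fix can be observed with a query like this sketch (the table name is a placeholder):

    -- 't' = standard table, 'c' = columnar (CStore), 'f' = foreign table
    SELECT shardid, shardstorage
    FROM pg_dist_shard
    WHERE logicalrelid = 'my_foreign_table'::regclass;
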
Andres Freund 758a70a8ff Create new shards as owned by the distributed table's owner.
That's important because ownership of relations implies special
privileges. Without this change, a distributed table can be accessible
to the table's owner, but a shard created by another user might not be.
2016-04-27 10:28:33 -07:00
Andres Freund 12a246de37 Perform permission checks in functions manipulating distributed tables.
Previously several commands, amongst them commands like
master_create_distributed_table(), were allowed for everyone. That's not
good: Even though citus currently requires superuser permissions, we
shouldn't allow non-superusers to perform actions as sensitive as making
a table distributed.

There are no checks on the worker_* functions, as these usually just punt
the action to underlying Postgres functionality, which then performs the
necessary checks.
2016-04-27 10:22:20 -07:00
Jason Petersen 423e6c8ea0 Update copyright dates
Fixed configure variable and updated all end dates to 2016.
2016-03-23 17:14:37 -06:00
Jason Petersen fdb37682b2 First formatting attempt
Skipped csql, ruleutils, readfuncs, and functions obviously copied from
PostgreSQL. Seeing how this looks, then continuing.
2016-02-15 23:29:32 -07:00
Onder Kalaci 136306a1fe Initial commit of Citus 5.0 2016-02-11 04:05:32 +02:00