citus

Commit Graph

Author	SHA1	Message	Date
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belong to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Brian Cloutier	32e16ffe02	Give isolation tester ability to see locks on workers	2017-07-26 18:43:04 +03:00
Marco Slot	81198a1d02	Add function for dumping local wait edges	2017-07-25 16:52:32 +02:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Marco Slot	04fe3f03f6	Change implementation of shard_name UDF to get schema-qualified shard name	2017-07-04 10:49:40 +03:00
Jason Petersen	cc45712144	Bump extension and configure PACKAGE versions Actually getting this done before the next dev cycle begins.	2017-05-17 15:25:30 -06:00
Marco Slot	8edba5f309	Honour enable_ddl_propagation in truncate trigger	2017-04-29 03:32:52 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
velioglu	e32aff1a26	Size UDFs implemented citus_table_size, citus_relation_size and citus_total_relation_size UDFs are implemented.	2017-03-16 13:50:30 +03:00
Brian Cloutier	807beb7bc0	Remove master_get_local_first_candidate_nodes	2017-03-07 11:50:59 +03:00
Metin Doslu	7cff8719c2	Add worker_hash() and a stub for isolate_tenant_to_new_shard()	2017-01-20 14:38:01 +02:00
Eren Basak	b686d9a025	Add Sequence Support for MX Tables This change adds support for serial columns to be used with MX tables. Prior to this change, sequences of serial columns were created in all workers (for being able to create shards) but never used. With MX, we need to set the sequences so that sequences in each worker create unique values. This is done by setting the MINVALUE, MAXVALUE and START values of the sequence.	2017-01-18 09:43:38 +03:00
Eren Basak	b1ce8d61c0	Create Invalidation Trigger for pg_dist_local_group Table Updates	2017-01-18 09:43:38 +03:00
Murat Tuncer	95862632de	Add citus tools to default configuration	2017-01-10 17:53:27 +03:00
Murat Tuncer	b93185d800	Add master_disable_node UDF We can now remove nodes from cluster regardless of them having an active shard placement.	2017-01-10 10:54:57 +03:00
Burak Yucesoy	31cd2357fe	Add upgrade_to_reference_table With this change we introduce new UDF, upgrade_to_reference_table, which can be used to upgrade existing broadcast tables reference tables. For upgrading, we require that given table contains only one shard.	2017-01-02 17:54:42 +02:00
Eren Basak	e43eed0f7a	Prevent Deadlock on Dropping MX Tables with Sequences This change prevents a deadlock situation during DROP TABLE on an mx table with sequences on workers with metadata.	2016-12-28 16:32:20 +03:00
Burak Yucesoy	0851fd2f0b	GRANT SELECT access for metadata tables to public Previously, we errored out if non-user tries to SELECT query for some metadata tables. It seems that we already GRANT SELECT access to some metadata tables but not others. With this change, we GRANT SELECT access to all existing Citus metadata tables.	2016-12-23 16:32:47 +03:00
Eren Basak	31af40cc26	Handle MX tables on workers during drop table commands	2016-12-23 15:43:32 +03:00
Marco Slot	6852f8a951	Add shard locking UDFs	2016-12-22 11:04:34 +01:00
Burak Yücesoy	501a2ecead	Add get_distribution_value_shardid UDF (#1048 ) * Add get_distribution_value_shardid UDF With this UDF users can now map given distribution value to shard id. We mostly hide shardids from users to prevent unnecessary complexity but some power users might need to know about which entry/value is stored in which shard for maintanence purposes. Signature of this UDF is as follows; bigint get_distribution_value_shardid(table_name regclass, distribution_value anyelement)	2016-12-22 12:17:08 +03:00
Onder Kalaci	9f0bd4cb36	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table whithout a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follows the same code-path with a single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support the uniqueness constraints for any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe join with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follows router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, schema support works on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Metin Doslu	86cca54857	Add colocate_with option to create_distributed_table() With this commit, we support three versions of colocate_with: i.default, ii.none and iii. a specific table name.	2016-12-16 14:53:35 +02:00
Marco Slot	5714be0da5	Expose the column_to_column_name UDF to make partkey in pg_dist_partition human-readable	2016-12-14 10:46:33 +01:00
Eren Basak	afbb5ffb31	Add stop_metadata_sync_to_node UDF	2016-12-14 10:53:12 +03:00
Eren Basak	5e96e4f60e	Make truncate triggers propagated on start_metadata_sync_to_node call	2016-12-14 10:53:10 +03:00
Eren Basak	9eff968d1f	Add start_metadata_sync_to_node UDF This change adds `start_metadata_sync_to_node` UDF which copies the metadata about nodes and MX tables from master to the specified worker, sets its local group ID and marks its hasmetadata to true to allow it receive future DDL changes.	2016-12-13 10:48:03 +03:00
Eren Basak	444f14d546	Add Column Definition List for Output Columns for master_add_node This change allows seeing the names of columns of `master_add_node`, using `SELECT * FROM master_add_node(...)` by specifying output columns in UDF definition.	2016-11-07 14:08:58 -08:00
Onder Kalaci	9cd549f21f	Add stub for Copy shard placement This commit does not change the current behaviour, but, helps to implement enterprise feature without any version changes.	2016-10-26 17:57:55 +03:00
Metin Doslu	4e555880b7	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Andres Freund	fcd150c7c8	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There's some locking issues remaining, but that's a a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Jason Petersen	73f5b8b05f	Move all funcs to pg_catalog, add test to verify We'd been relying on a single SET search_path command in an earlier script, but a subsequent script RESET search_path, causing any further bare functions to be created in the first schema on the search path. However, starting with an older extension version and executing ALTER scripts one at a time DOES avoid putting any functions in the public namespace, so I wrote an upgrade script resilient to that, especially because PostgreSQL 9.5 will error out if a function is already in the schema it's being moved to.	2016-10-25 12:45:53 -06:00
Burak Yucesoy	5a03acf2bf	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Metin Doslu	405335fcee	Add create_reference_table() create_reference_table() creates a hash distributed table with shard count equals to 1 and replication factor equals to shard_replication_factor configuration value.	2016-10-20 15:29:30 +03:00
Metin Doslu	40bdafa8d1	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Eren Basak	cee7b54e7c	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Eren Basak	8f477d18f1	Add pg_dist_local_group Metadata Table This change adds the pg_dist_local_group metadata table, which indicates the group id of the current node. It is expected that this table contains one and only one row, which only contains the group id of the node as an integer.	2016-10-14 11:41:14 +03:00
Brian Cloutier	6c3d79b4e7	Drop shardalias	2016-10-14 11:03:26 +03:00
Burak Yucesoy	6668d19a3b	Make shard transfer functions co-location aware With this change, master_copy_shard_placement and master_move_shard_placement functions start to copy/move given shard along with its co-located shards.	2016-10-13 18:16:40 +03:00
Eren Basak	ed3af403fd	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Andres Freund	982ad66753	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Brian Cloutier	9d6699b07c	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	32b2bd4ed8	Add replication model column to pg_dist_partition	2016-10-05 01:14:28 +02:00
Robin Thomas	f677fadbe6	Provides safe, idempotent shard-extended names to any object name related to a table that might be distributed, allowing any name that is within regular PostgreSQL length limits to be extended with a shard ID for use in shards on workers. Handles multi-byte character boundaries in identifiers when making prefixes for shard-extended names. Includes tests. Uses hash_any from PostgreSQL's access/hashfunc.c. Removes AppendShardIdToStringInfo() as it's used only once and arguably is best replaced there with a call to AppendShardIdToName(). Adds UDF shard_name(object_name, shard_id) to expose the shard-extended name logic to other PL/PGSQL, UDFs and scripts. Bumps version to 6.0-2 to allow for UDF to be created in migration script. Fixes citusdata/citus#781 and citusdata/citus#179.	2016-10-03 17:02:34 -04:00
Marco Slot	a4efb60b54	Change logicalrelid type in pg_dist_partition and pg_dist_shard to regclass	2016-10-03 20:27:16 +02:00

1 2

65 Commits (353d2db9138c974355e8da125cc4aa5c2ee45314)