citus

Commit Graph

Author	SHA1	Message	Date
Eren Basak	93bc2c6c12	Handle MX tables on workers during drop table commands	2016-12-23 15:43:32 +03:00
Eren Basak	9876e253b7	Propagate DDL commands to metadata workers for MX tables	2016-12-23 15:43:32 +03:00
Onder Kalaci	807fc1cc28	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table whithout a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follows the same code-path with a single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support the uniqueness constraints for any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe join with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follows router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, schema support works on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Eren Basak	59b95a958d	Propagate CREATE SCHEMA commands with the correct AUTHORIZATION clause in start_metadata_sync_to_node	2016-12-14 10:53:12 +03:00
Burak Yucesoy	cdb725a561	Add Foreign Key Support to ALTER TABLE commands With this PR, we add foreign key support to ALTER TABLE commands. For now, we only support foreign constraint creation via ALTER TABLE query, if it is only subcommand in ALTER TABLE subcommand list. We also only allow foreign key creation if replication factor is 1.	2016-12-08 15:03:25 +02:00
Andres Freund	c7f19f6c83	Use connection_management.c from within connection_cache.c. This is a temporary step towards removing connection_cache.c.	2016-12-07 11:44:24 -08:00
Marco Slot	5b5bceb5ed	Take shard metadata lock in master_append_table_to_shard	2016-12-02 15:56:30 +01:00
Brian Cloutier	14b47b769c	Remove dead code: ResponsiveWorkerNodeList	2016-12-02 13:14:11 +03:00
Metin Doslu	670e2bc0dd	Use AccessShareLock on the source table while creating a colocated table While creating a colocated table, we don't want the source table to be dropped. However, using a ShareLock blocks DML statements on the source table, and using AccessShareLock is enough to prevent DROP. Therefore, we just loosened the lock to AccessShareLock.	2016-11-10 09:17:05 -08:00
Önder Kalacı	34fa711b14	Always CASCADE while dropping a shard	2016-11-01 10:16:34 +01:00
Onder Kalaci	dc1b8b3e18	Add stub for Copy shard placement This commit does not change the current behaviour, but, helps to implement enterprise feature without any version changes.	2016-10-26 17:57:55 +03:00
Metin Doslu	520e7e3cb2	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Burak Yucesoy	d837c8eb92	Only repair given shard Previously, when a repair is requested on a shard, we also repair all co-located shards of given shard, which may cause repairing already healthy shards. With this change, we only repair given shard.	2016-10-26 14:36:37 +03:00
Andres Freund	ebbef819f2	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There's some locking issues remaining, but that's a a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Brian Cloutier	e174ecaf58	Treat nodePort as the 8byte number it is	2016-10-25 16:31:48 +03:00
Marco Slot	914100cfbe	Parallelise DDL commands	2016-10-24 12:39:08 +02:00
Burak Yucesoy	c7414c3af2	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Metin Doslu	7baf72cc86	Final refactoring	2016-10-20 11:29:11 +03:00
Metin Doslu	7586e567b7	Add local function GetNextShardId()	2016-10-20 10:59:31 +03:00
Metin Doslu	31f08f8377	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Metin Doslu	39ddf36084	Add guc variable for shard count	2016-10-19 10:44:50 +03:00
Marco Slot	06e790f420	Parallelise master_modify_multiple_shards	2016-10-19 08:33:08 +02:00
Murat Tuncer	f76cb6672a	Add master_run_on_worker UDF	2016-10-18 17:59:54 +03:00
Eren Basak	e31830f3fb	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Brian Cloutier	84c4ba1073	Drop shardalias	2016-10-14 11:03:26 +03:00
Burak Yucesoy	55341815a8	Make shard transfer functions co-location aware With this change, master_copy_shard_placement and master_move_shard_placement functions start to copy/move given shard along with its co-located shards.	2016-10-13 18:16:40 +03:00
Eren Basak	6cb3ae93ba	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Marco Slot	cc8a30d033	Use UpdateShardPlacementState where appropriate	2016-10-07 11:59:20 -07:00
Andres Freund	5de52c3b04	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Brian Cloutier	62e7bdbdd6	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	b79c8657d6	Avoid use of pnstrdup	2016-10-04 00:31:53 +02:00
Robin Thomas	d23f11490a	Provides safe, idempotent shard-extended names to any object name related to a table that might be distributed, allowing any name that is within regular PostgreSQL length limits to be extended with a shard ID for use in shards on workers. Handles multi-byte character boundaries in identifiers when making prefixes for shard-extended names. Includes tests. Uses hash_any from PostgreSQL's access/hashfunc.c. Removes AppendShardIdToStringInfo() as it's used only once and arguably is best replaced there with a call to AppendShardIdToName(). Adds UDF shard_name(object_name, shard_id) to expose the shard-extended name logic to other PL/PGSQL, UDFs and scripts. Bumps version to 6.0-2 to allow for UDF to be created in migration script. Fixes citusdata/citus#781 and citusdata/citus#179.	2016-10-03 17:02:34 -04:00
Eren Basak	d169caf637	Fix command counter increment bug Fixes citusdata/citus#714 On `InsertShardRow`, we previously called `CommandCounterIncrement()` before `CitusInvalidateRelcacheByRelid(relationId);`. This might prevent to skip invalidation of the distributed table in the next access within the same session.	2016-10-03 17:00:27 +03:00
Murat Tuncer	3ef841c67b	Add UDF master_expire_table_cache	2016-09-28 12:08:37 +03:00
Murat Tuncer	680fab2b67	Remove extra space	2016-09-26 18:23:43 -06:00
Murat Tuncer	25a95d57f8	Refactor SendQueryToPlacements api	2016-09-26 18:23:43 -06:00
Murat Tuncer	a342cacfc4	Address feedback	2016-09-26 18:23:42 -06:00
Murat Tuncer	4416b77088	Add support for truncate statement	2016-09-26 18:23:42 -06:00
Robin Thomas	6880efce5b	Forbid EXCLUDE constraints on distributed tables just as we forbid UNIQUE or PRIMARY KEY constraints. Also, properly propagate valid EXCLUDE constraints to worker shard tables. If an EXCLUDE constraint includes the distribution column, the operator must be an equality operator. Tests in regression suite for exclusion constraints that include the partition column, omit it, and include it but with non-equality operator. Regression tests also verify that valid exclusion constraints are propagated to the shard tables. And the tests work in different timezones now. Fixes citusdata/citus#748 and citusdata/citus#778.	2016-09-21 14:02:42 -04:00
Jason Petersen	b00b15a718	Permit multiple DDL commands in a transaction Three changes here to get to true multi-statement, multi-relation DDL transactions (same functionality pre-5.2, with benefits of atomicity): 1. Changed the multi-shard utility hook to always run (consistency with router executor hook, removes ad-hoc "installed" boolean) 2. Change the global connection list in multi_shard_transaction to instead be a hash; update related functions to operate on global hash instead of local hash/global list 3. Remove check within DDL code to prevent subsequent DDL commands; place unset/reset guard around call to ConnectToNode to permit connecting to additional nodes after DDL transaction has begun In addition, code has been added to raise an error if a ROLLBACK TO SAVEPOINT is attempted (similar to router executor), and comprehensive tests execute all multi-DDL scenarios (full success, user ROLLBACK, any actual errors (say, duplicate index), partial failure (duplicate index on one node but not others), partial COMMIT (one node fails), and 2PC partial PREPARE (one node fails)). Interleavings with other commands (DML, \copy) are similarly all covered.	2016-09-08 22:35:55 -05:00
Robin Thomas	475a6245bf	Remove all usage of pg_dist_shard.shardalias in extension code. (#739 ) Remove regression test of non-null shardalias.	2016-08-19 17:06:22 +03:00
Marco Slot	f538ab7f62	Rewrite WorkerShardStats to avoid invalid value bugs	2016-07-29 20:11:18 +02:00
Jason Petersen	f19779b0ce	Support SERIAL/BIGSERIAL non-partition columns This adds support for SERIAL/BIGSERIAL column types. Because we now can evaluate functions on the master (during execution), adding this is a matter of ensuring the table creation step works properly. To accomplish this, I've added some logic to detect sequences owned by a table (i.e. those related to its columns). Simply creating a sequence and using it in a default value is insufficient; users who do so must ensure the sequence is owned by the column using it. Fortunately, this is exactly what SERIAL and BIGSERIAL do, which is the use case we're targeting with this feature. While testing this, I found that worker_apply_shard_ddl_command actually adds shard identifiers to sequence names, though I found no places that use or test this path. I removed that code so that sequence names are not mutated and will match those used by a SERIAL default value expression. Our use of the new-to-9.5 CREATE SEQUENCE IF NOT EXISTS syntax means we are dropping support for 9.4 (which is being done regardless, but makes this change simpler). I've removed 9.4 from the Travis build matrix. Some edge cases are possible in ALTER SEQUENCE, COPY FROM (on workers), and CREATE SEQUENCE OWNED BY. I've added errors for each so that users understand when and why certain operations are prohibited.	2016-07-28 23:55:40 -06:00
Burak Yucesoy	98025110f0	Add old version(without schema name parameter) of api functions back Fixes #676 We added old versions (i.e. without schema name) of worker_apply_shard_ddl_command, worker_fetch_foreign_file and worker_fetch_regular_table back. During function call of one of these functions, we set schema name as public schema and call the newer version of the functions.	2016-07-28 20:40:38 +03:00
Murat Tuncer	719e44d1f4	Remove PostgreSQL 9.4 support	2016-07-26 20:16:09 +03:00
Burak Yucesoy	444d4eb558	Fix worker_fetch_regular_table with schema Fixes #504 Fixes #646 We changed signature of worker_fetch_regular_table to accept schema name as parameter to make it work with schemas.	2016-07-22 00:44:02 -06:00
Burak Yucesoy	5a93a70e2d	Fix master_apply_delete_command with schema Fixes #73	2016-07-21 15:09:20 +03:00
Burak Yucesoy	d0beacc4e1	Change worker_apply_shard_ddl_command to accept schema name as parameter Fixes #565 Fixes #626 To add schema support to citus, we need to schema-prefix all table names, object names etc. in the queries sent to worker nodes. However; query deparsing is not available for most of DDL commands, therefore it is not easy to generate worker query in the master node. As a solution we are sending schema names along with shard id and query to run to worker nodes with worker_apply_shard_ddl_command. To not break \STAGE command we pass public schema as paramater while calling worker_apply_shard_ddl_command from there. This will not cause problem if user uses \STAGE in different schema because passes schema name is used only if there is no schema name is given in the query.	2016-07-21 14:17:26 +03:00
Burak Yucesoy	71bb558641	Always schema-prefix worker queries Fixes #215 Fixes #267 Fixes #502 Fixes #556 Fixes #557 Fixes #560 Fixes #568 Fixes #623 Fixes #624 With this change we schema-prefix table names, operator names and composite types.	2016-07-20 10:42:24 +03:00
Eren	692ef0964a	Propagate DDL Commands with 2PC Fixes #513 This change modifies the DDL Propagation logic so that DDL queries are propagated via 2-Phase Commit protocol. This way, failures during the execution of distributed DDL commands will not leave the table in an intermediate state and the pending prepared transactions can be commited manually. DDL commands are not allowed inside other transaction blocks or functions. DDL commands are performed with 2PC regardless of the value of `citus.multi_shard_commit_protocol` parameter. The workflow of the successful case is this: 1. Open individual connections to all shard placements and send `BEGIN` 2. Send `SELECT worker_apply_shard_ddl_command(<shardId>, <DDL Command>)` to all connections, one by one, in a serial manner. 3. Send `PREPARE TRANSCATION <transaction_id>` to all connections. 4. Sedn `COMMIT` to all connections. Failure cases: - If a worker problem occurs before sending of all DDL commands is finished, then all changes are rolled back. - If a worker problem occurs after all DDL commands are sent but not after `PREPARE TRANSACTION` commands are finished, then all changes are rolled back. However, if a worker node is failed, then the prepared transactions in that worker should be rolled back manually. - If a worker problem occurs during `COMMIT PREPARED` statements are being sent, then the prepared transactions on the failed workers should be commited manually. - If master fails before the first 'PREPARE TRANSACTION' is sent, then nothing is changed on workers. - If master fails during `PREPARE TRANSACTION` commands are being sent, then the prepared transactions on workers should be rolled back manually. - If master fails during `COMMIT PREPARED` or `ROLLBACK PREPARED` commands are being sent, then the remaining prepared transactions on the workers should be handled manually. This change also helps with #480, since failed DDL changes no longer mark failed placements as inactive.	2016-07-19 10:44:11 +03:00

1 2

84 Commits (8bd5229c8c49aef469891ad906d84245c3859c20)