citus

Commit Graph

Author	SHA1	Message	Date
Burak Velioglu	fa6866ed36	Start to propagate functions to worker nodes with CREATE FUNCTION command together with it's dependencies. If the function depends on any nondistributable object, function will be created only locally. Parameterless version of create_distributed_function becomes obsolete with this change, it will deprecated from the code with a subsequent PR.	2022-02-18 13:56:51 +03:00
Onder Kalaci	ff234fbfd2	Unify old GUCs into a single one Replaces citus.enable_object_propagation with citus.enable_metadata_sync Also, within Citus 11 release cycle, we added citus.enable_metadata_sync_by_default, that is also replaced with citus.enable_metadata_sync. In essence, when citus.enable_metadata_sync is set to true, all the objects and the metadata is send to the remote node. We strongly advice that the users never changes the value of this GUC.	2022-02-04 10:52:56 +01:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Halil Ozan Akgul	25755a7094	Turn ddl propagation off in worker on multi_copy	2021-12-17 15:54:20 +03:00
Halil Ozan Akgul	1d7dde2c4c	Fix metadata sync fails on multi_copy	2021-12-14 10:59:59 +03:00
Onder Kalaci	549edcabb6	Allow disabling node(s) when multiple failures happen As of master branch, Citus does all the modifications to replicated tables (e.g., reference tables and distributed tables with replication factor > 1), via 2PC and avoids any shardstate=3. As a side-effect of those changes, handling node failures for replicated tables change. With this PR, when one (or multiple) node failures happen, the users would see query errors on modifications. If the problem is intermitant, that's OK, once the node failure(s) recover by themselves, the modification queries would succeed. If the node failure(s) are permenant, the users should call `SELECT citus_disable_node(...)` to disable the node. As soon as the node is disabled, modification would start to succeed. However, now the old node gets behind. It means that, when the node is up again, the placements should be re-created on the node. First, use `SELECT citus_activate_node()`. Then, use `SELECT replicate_table_shards(...)` to replicate the missing placements on the re-activated node.	2021-12-01 10:19:48 +01:00
Halil Ozan Akgul	87a1c760d9	Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 12:09:53 +03:00
Marco Slot	56eae48daf	Stop updating shard range in citus_update_shard_statistics	2021-11-19 10:51:15 +01:00
Marco Slot	fba93df4b0	Remove copy into new append shard logic	2021-11-07 21:01:40 +01:00
Onder Kalaci	575bb6dde9	Drop support for Inactive Shard placements Given that we do all operations via 2PC, there is no way for any placement to be marked as INACTIVE.	2021-10-22 18:03:35 +02:00
Hanefi Onaldi	7e39c7ea83	Replace master with citus in logs and comments (#5210 ) I replaced - master_add_node, - master_add_inactive_node - master_activate_node with - citus_add_node, - citus_add_inactive_node - citus_activate_node respectively.	2021-08-26 11:31:17 +03:00
Onder Kalaci	2c349e6dfd	Use current user to sync metadata Before this commit, we always synced the metadata with superuser. However, that creates various edge cases such as visibility errors or self distributed deadlocks or complicates user access checks. Instead, with this commit, we use the current user to sync the metadata. Note that, `start_metadata_sync_to_node` still requires super user because accessing certain metadata (like pg_dist_node) always require superuser (e.g., the current user should be a superuser). However, metadata syncing operations regarding the distributed tables can now be done with regular users, as long as the user is the owner of the table. A table owner can still insert non-sense metadata, however it'd only affect its own table. So, we cannot do anything about that.	2021-07-16 13:25:27 +02:00
SaitTalhaNisanci	164c00cf08	Fix typo: longer visible -> no longer visible (#3803 )	2020-04-27 16:32:46 +03:00
Onder Kalaci	e182215d96	Improve connection error message from the worker nodes We currently put the actual error message to the detail part. However, many drivers don't show detail part. As connection errors are somehow common, and hard to trace back, can't we added the detail to the message itself. In addition to that, we changed "connection error" message, as it was confusing to the users who think that the error was happening while connecting to the coordinator. In fact, this error is showing up when the coordinator fails to connect remote nodes.	2020-04-20 13:32:55 +02:00
Hanefi Önaldı	0c5d0cfee9	Notice message to help truncate local data after distribution	2020-04-17 13:21:34 +03:00
Marco Slot	924cd7343a	Defer reference table replication to shard creation time	2020-04-08 12:41:36 -07:00
Marco Slot	90056f7d3c	Remove copy from worker for append-partitioned table	2020-01-13 23:03:40 -08:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used by how schema's get created right before we are going to create a table in that schema, for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is benificial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is; you have an existing postgres server that needs to scale out. By adding the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Lets assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 AS (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schema's on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types are depending them self on other objects they will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types, when later asked for the ddl commands for array types we return `NIL` (empty list) which makes that the object will not be recorded as distributed, (its an internal type, dependant on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after running locally makes that we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (eg. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulate changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulate `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. Enum types can not be changed transactionally. When the execution on a worker fails a warning will be shown to the user the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Philip Dubé	0e233c63a3	multi_colocation_utils: sort by nodeport, not placementid multi_copy: replace smgr with aclitem, smgr is removed in pg12	2019-07-25 14:33:43 +00:00
Hadi Moshayedi	4bbae02778	Make COPY compatible with unified executor.	2019-06-20 19:53:40 +02:00
Onder Kalaci	ed47e4e6b9	Remove placementId from the ORDER BY to make results consistent	2018-05-11 17:04:50 +03:00
mehmet furkan şahin	785a86ed0a	Tests are updated to use create_distributed_table	2018-05-10 11:18:59 +03:00
Brian Cloutier	0104790385	Fix hard-coded temp directory in multi_copy /tmp does not exist on Windows, use :temp_dir instead	2018-04-17 15:01:22 -07:00
velioglu	82b2d21b0c	Convert broadcast join to reference join After this commit large_table_shard_count wont be used to check whether broadcast join, which is renamed as reference join, can be applied. Reference join can only be applied over reference tables.	2018-04-13 12:58:14 +03:00
Brian Cloutier	9aff4384a1	Make tests platform independent - Force all platforms to use the same collation - Force all platforms to use the same locale - Use /dev/null or NUL, depending on platform - Use /tmp or %TEMP%, dpeending on platform	2018-03-27 14:18:48 -07:00
Metin Doslu	e86d34256c	Change default to false for citus.skip_jsonb_validation_in_copy	2018-03-06 13:19:47 +02:00
Marco Slot	6f7c3bd73b	Skip JSON validation on coordinator during COPY	2018-02-02 15:33:27 +01:00
Marco Slot	a64f0060ba	Reduce the frequency of FinishConnectionIO calls during COPY (#1864 )	2017-12-14 13:21:59 -05:00
Marco Slot	f4ceea5a3d	Enable 2PC by default	2017-11-22 11:26:58 +01:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Marco Slot	3ff46245b3	Make sure we don't use 2PC in copy from worker	2017-08-15 13:44:20 +02:00
Brian Cloutier	74ce4faab5	Make multi_cluster_management test more stable	2017-08-08 11:18:31 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
velioglu	a1ea29ec2b	Use placement connection to drop shards instead of node connection	2017-06-14 14:14:59 +03:00
Marco Slot	f838c83809	Remove redundant pg_dist_jobid_seq restarts in tests	2017-04-18 11:42:32 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
velioglu	19d0c66fa5	Change checks with built-in type	2017-04-11 14:41:37 +03:00
velioglu	1fb11c738f	Check binary output function of type.	2017-04-10 16:28:09 +03:00
Marco Slot	d74fb764b1	Use CitusCopyDestReceiver for regular COPY	2017-02-28 17:24:45 +01:00
Murat Tuncer	1107439ade	Fix dependent tests	2017-01-25 19:19:39 +03:00
Murat Tuncer	5194111420	Add failure case for regression tests	2017-01-25 19:19:39 +03:00
Murat Tuncer	d76f781ae4	Convert multi copy to use new connection api This enables proper transactional behaviour for copy and relaxes some restrictions like combining COPY with single-row modifications. It also provides the basis for relaxing restrictions further, and for optionally allowing connection caching.	2017-01-20 19:15:19 -08:00
Robin Thomas	614c858375	Forbid EXCLUDE constraints on distributed tables just as we forbid UNIQUE or PRIMARY KEY constraints. Also, properly propagate valid EXCLUDE constraints to worker shard tables. If an EXCLUDE constraint includes the distribution column, the operator must be an equality operator. Tests in regression suite for exclusion constraints that include the partition column, omit it, and include it but with non-equality operator. Regression tests also verify that valid exclusion constraints are propagated to the shard tables. And the tests work in different timezones now. Fixes citusdata/citus#748 and citusdata/citus#778.	2016-09-21 14:02:42 -04:00
Metin Doslu	75618fc3fb	Return false in MultiClientQueryResult() on failing query	2016-08-29 17:05:35 +03:00
Eren Başak	0322916700	Lowercase \copy to match PostgreSQL's style for local/psql-level functions	2016-08-22 11:31:26 -06:00
Eren Başak	bb3893d0d8	Set 1PC as the Default Commit Protocol for DDL Commands Fixes #679 This change sets the default commit protocol for distributed DDL commands to '1pc'. If the user issues a distributed DDL command with this default setting, then once in a session, a NOTICE message is shown about using '2pc' being extra safe.	2016-07-29 16:42:55 +03:00
Jason Petersen	abe7304898	Support SERIAL/BIGSERIAL non-partition columns This adds support for SERIAL/BIGSERIAL column types. Because we now can evaluate functions on the master (during execution), adding this is a matter of ensuring the table creation step works properly. To accomplish this, I've added some logic to detect sequences owned by a table (i.e. those related to its columns). Simply creating a sequence and using it in a default value is insufficient; users who do so must ensure the sequence is owned by the column using it. Fortunately, this is exactly what SERIAL and BIGSERIAL do, which is the use case we're targeting with this feature. While testing this, I found that worker_apply_shard_ddl_command actually adds shard identifiers to sequence names, though I found no places that use or test this path. I removed that code so that sequence names are not mutated and will match those used by a SERIAL default value expression. Our use of the new-to-9.5 CREATE SEQUENCE IF NOT EXISTS syntax means we are dropping support for 9.4 (which is being done regardless, but makes this change simpler). I've removed 9.4 from the Travis build matrix. Some edge cases are possible in ALTER SEQUENCE, COPY FROM (on workers), and CREATE SEQUENCE OWNED BY. I've added errors for each so that users understand when and why certain operations are prohibited.	2016-07-28 23:55:40 -06:00
Burak Yucesoy	c1a3478c3b	Remove warnings on schema creation Since now we support schema related operations, there is no need to warn user about schema usage.	2016-07-22 18:24:23 +03:00

1 2

62 Commits (3d044dc54394a2bd5b463f7f7cc61d2c833b3f12)