citus

Commit Graph

Author	SHA1	Message	Date
Jelte Fennema	1d8dde232f	Automatically convert useless declarations using regex replace (#3181 ) * Add declaration removal to CI * Convert declarations	2019-11-21 13:47:29 +01:00
SaitTalhaNisanci	b9b7fd7660	add IsLoggableLevel utility function (#3149 ) * add IsLoggableLevel utility function * add function comment for IsLoggableLevel * put ApplyLogRedaction to logutils	2019-11-15 14:59:13 +03:00
Jelte Fennema	1b2c438e69	Rename variables to not shadow globals in RHEL6 (#3194 ) Fixes #2839	2019-11-15 12:12:24 +01:00
SaitTalhaNisanci	94a7e6475c	Remove copyright years (#2918 ) * Update year as 2012-2019 * Remove copyright years	2019-10-15 17:44:30 +03:00
Hadi Moshayedi	76f3933b05	Add metadatasynced, and sync on master_update_node() Co-authored-by: pykello <hadi.moshayedi@microsoft.com> Co-authored-by: serprex <serprex@users.noreply.github.com>	2019-09-18 09:32:54 -07:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used by how schema's get created right before we are going to create a table in that schema, for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is benificial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is; you have an existing postgres server that needs to scale out. By adding the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Lets assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 AS (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schema's on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types are depending them self on other objects they will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types, when later asked for the ddl commands for array types we return `NIL` (empty list) which makes that the object will not be recorded as distributed, (its an internal type, dependant on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after running locally makes that we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (eg. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulate changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulate `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. Enum types can not be changed transactionally. When the execution on a worker fails a warning will be shown to the user the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Hadi Moshayedi	83f6c7dab4	Fix subxact release crash	2019-06-19 17:43:10 +02:00
Onder Kalaci	4fd1fcbbef	Refactor shard creation logic This is a preperation for the new executor, where creating shards would go through the executor. So, explicitly generate the commands for further processing.	2019-06-19 10:03:58 +02:00
Hadi Moshayedi	32ecb6884c	Test ROLLBACK TO SAVEPOINT with multi-shard CTE failures	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	aafd22dffa	Fix savepoint rollback for INSERT INTO ... SELECT.	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	b69a762e0b	Fix savepoint rollback after multi-shard update failure.	2019-05-01 09:33:43 -07:00
Jason Petersen	339e6e661e	Remove 9.6 (#2554 ) Removes support and code for PostgreSQL 9.6 cr: @velioglu	2019-01-16 13:11:24 -07:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Marco Slot	88f64d22db	Prevent connection pointer is NULL details	2018-04-26 12:49:57 -06:00
Marco Slot	394732b6be	Add a connection failure error code	2018-04-26 12:49:57 -06:00
Brian Cloutier	b0b130f064	Fix Windows crash in multi_copy test Without this change we crash on Windows with COPYing into a table with 62 shards, and we ERROR when COPYing into a table with >62 shards: ERROR: WaitForMutipleObjects() failed: error code 87	2018-04-17 15:48:02 -07:00
Murat Tuncer	224b0a8c14	Replace poll with select/poll Windows does not have poll(), so fall back to select()	2018-03-21 20:05:00 -07:00
Onder Kalaci	7dc9589b56	Handle failures during I/O This commit checks the connection status right after any IO happens on the socket. This is necessary since before this commit we didn't pass any information to the higher level functions whether we're done with the connection (e.g., no IO required anymore) or an errors happened during the IO.	2018-03-02 08:33:53 +02:00
Onder Kalaci	da0048e0b7	ForgetResults() becomes a wrapper for ClearResults() ClearResults() is able to handle failures properly by checking the result status. So, relying on it makes error handling more generic in Citus.	2018-03-02 08:33:53 +02:00
Marco Slot	6e79a34c97	Do not check for cancellation in ClearResultsIfReady	2018-02-12 16:45:02 +01:00
Marco Slot	6051aae56e	Handle errors that are discovered during abort	2018-02-12 16:45:02 +01:00
Brian Cloutier	15511f6ba1	Dynamically allocate connection metadata in WaitForAllConnections	2018-02-01 10:30:41 -08:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Hadi Moshayedi	5d7c52ffa6	Don't return in PG_TRY() block when cancellations happen in WaitForConnections(). (#1923 ) We shouldn't return in middle of a PG_TRY() block because if we do, we won't reset PG_exception_stack, and later when a re-throw tries to jump to the jump-point which was active in this PG_TRY() block, it seg-faults. We used to return in middle of PG_TRY() block in WaitForConnections() where we checked for cancellations. Whenever cancellations were caught here, Citus crashed. And example was reported by @onderkalaci at #1903.	2018-01-03 09:54:03 -05:00
Marco Slot	8f69973411	Fix cancellation issues in the real-time executor (#1905 )	2018-01-01 23:10:29 -05:00
Marco Slot	a64f0060ba	Reduce the frequency of FinishConnectionIO calls during COPY (#1864 )	2017-12-14 13:21:59 -05:00
metdos	c83edc36b5	Check connection status before using it	2017-11-06 14:53:35 +02:00
Marco Slot	2bb46bb5ee	Reset connectionReady flag after moving a connection in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	e6e6897499	Defer initial PQflush to main loop in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	d6dadb1b25	Use correct index for ModifyWaitEvent in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We sent multiple commands to worker when starting a transaction. Previously we only checked the result of the first command that is transaction 'BEGIN' which always succeeds. Any failure on following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Marco Slot	d1befa4df9	Wait for I/O to finish after PQputCopyData	2017-09-12 16:18:42 -07:00
Marco Slot	fca986f214	Add API for waiting for multiple connections	2017-08-11 00:03:06 +02:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
Andres Freund	be8677f926	Add NonblockingForgetResults(). This is very similar to ForgetResults() except that no network IO is performed. Primarily useful in error handling cases.	2017-07-04 14:46:03 -07:00
Andres Freund	75a7ddea0d	Always use connections in non-blocking mode. Now that there's no blocking libpq callers left, default to using non-blocking mode in connection_management.c. This has two advantages: 1) Blockiness doesn't have to frequently be reset, simplifying code 2) Prevents accidental use of blocking libpq functions, since they'll frequently return 'need IO'	2017-07-04 14:46:03 -07:00
Andres Freund	c674bc8640	Add interrupt aware PQputCopy{End,Data} wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	b7f9679ccc	Move interrupt handling code from GetRemoteCommandResult to FinishConnectionIO. Nearby commits will add additional interrupt handling functions, this way we don't have to duplicate the code.	2017-07-04 12:38:52 -07:00
Jason Petersen	2204da19f0	Support PostgreSQL 10 (#1379 ) Adds support for PostgreSQL 10 by copying in the requisite ruleutils and updating all API usages to conform with changes in PostgreSQL 10. Most changes are fairly minor but they are numerous. One particular obstacle was the change in \d behavior in PostgreSQL 10's psql; I had to add SQL implementations (views, mostly) to mimic the pre-10 output.	2017-06-26 02:35:46 -06:00
Metin Doslu	b6659bec22	Send explain queries with savepoints With this commit, we started to send explain queries within a savepoint. After running explain query, we rollback to savepoint. This saves us from side effects of EXPLAIN ANALYZE on DML queries.	2017-04-28 12:13:48 -07:00
Andres Freund	fa5b8fb39f	Fix SendRemoteCommandParams() handling of a NULL MultiConnection->pgConn. (#1271 ) Previously we'd segfault in PQisnonblocking() which, contrary to other libpq calls, doesn't handle a NULL PQconn (because there'd be no appropriate return value for that). cr: @jasonmp85	2017-03-03 12:02:15 -07:00
Brian Cloutier	67ee357d7f	Port WorkerShardStats to new connection API Part of the work in citusdata/citus#1101, this is a pretty direct port over to the new functions and shouldn't result in any behavior changes.	2017-01-17 17:04:37 +02:00
Brian Cloutier	b1b2b4fadf	Create ExecuteOptionalRemoteCommand A small refactor which pulls some code out of `RecoverWorkerTransactions` and into `remote_commands.c`. This code block currently only occurs in `RecoverWorkerTransactions` but will be useful to other functions shortly. Unfortunately we couldn't call it `ExecuteRemoteCommand`, that name was already taken.	2017-01-17 17:04:37 +02:00
Marco Slot	ef326b202a	PQclear in ReportResultError to prevent memory leaks	2017-01-10 02:51:39 +01:00
Marco Slot	92c7567008	Convert worker_transactions to new connection API	2016-12-23 16:14:29 +01:00
Marco Slot	00d55ad957	Add a wrapper for PQsendQuery	2016-12-23 16:14:29 +01:00
Marco Slot	87c62d598e	Connectionapify SendCommandListToWorkerInSingleTransaction	2016-12-23 16:14:29 +01:00
Andres Freund	6eeb43af15	Add PQgetResult() wrapper handling interrupts. This makes it possible to implement cancelling queries blocked on communication with remote nodes.	2016-12-12 15:18:12 -08:00
Andres Freund	3505d431cd	Add initial helpers to make interactions with MultiConnection et al. easier. This includes basic infrastructure for logging of commands sent to remote/worker nodes. Note that this has no effect as of yet, since no callers are converted to the new infrastructure.	2016-12-07 11:44:24 -08:00

50 Commits (1d8dde232fce6d354f3c472ce0507f52f1c82f36)