citus

Commit Graph

Author	SHA1	Message	Date
Nils Dijk	feaac69769	Implementation for asycn FinishConnectionListEstablishment (#2584 )	2019-03-22 17:30:42 +01:00
Marco Slot	f2abf2b8e5	Functions are treated as transaction blocks	2019-03-15 16:34:08 -06:00
Hadi Moshayedi	f4d3b94e22	Fix some of the casts for groupId (#2609 ) A small change which partially addresses #2608.	2019-03-05 12:06:44 -08:00
Onder Kalaci	f706772b2f	Round-robin task assignment policy relies on local transaction id Before this commit, round-robin task assignment policy was relying on the taskId. Thus, even inside a transaction, the tasks were assigned to different nodes. This was especially problematic while reading from reference tables within transaction blocks. Because, we had to expand the distributed transaction to many nodes that are not necessarily already in the distributed transaction.	2019-02-22 19:26:38 +03:00
Jason Petersen	339e6e661e	Remove 9.6 (#2554 ) Removes support and code for PostgreSQL 9.6 cr: @velioglu	2019-01-16 13:11:24 -07:00
Murat Tuncer	ec36030fae	Move functions calls that can fail to outside of spinlock We had recently fixed a spinlock issue due to functions failing, but spinlock is not being released. This is the continuation of that work to eliminate possible regression of the issue. Function calls that are moved out of spinlock scope are macros and plain type casting. However, depending on the configuration they have an alternate implementation in PG source that performs memory allocation. This commit moves last bit of codes to out of spinlock for completion purposes.	2019-01-03 15:59:56 +03:00
Murat Tuncer	9671bc3cbb	Make sure spinlock is not left unreleased when an exception is thrown A spinlock is not released when an exception is thrown after spinlock is acquired. This has caused infinite wait and eventual crash in maintenance daemon. This work moves the code than can fail to the outside of spinlock scope so that in the case of failure spinlock is not left locked since it was not locked in the first place.	2018-12-24 15:47:21 +03:00
Marco Slot	3ff2b47366	Restrict visibility of get_*_active_transactions functions to pg_monitor	2018-12-19 18:32:42 +01:00
Dimitri Fontaine	d1b182de7d	Replace calls to unsafe functions like memcpy and sscanf In answer to a security audit, we double check buffer sizes and avoid known-dangerous operations such as sscanf.	2018-12-04 08:54:43 +01:00
Onder Kalaci	621ccf3946	Ensure to use initialized MaxBackends Postgresql loads shared libraries before calculating MaxBackends. However, Citus relies on MaxBackends being set. Thus, with this commit we use the same steps to calculate MaxBackends while Citus is being loaded (e.g., PG_Init is called). Note that this is safe since all the elements that are used to calculate MaxBackends are PGC_POSTMASTER gucs and a constant value.	2018-12-03 13:25:51 +03:00
Jason Petersen	9fb951c312	Fix user-facing typos Lintian found these (presumably by looking in the text section and running them through e.g. aspell).	2018-10-09 16:54:03 -07:00
Onder Kalaci	73696a03e4	Make sure not to leak intermediate result folders on the workers	2018-10-09 22:47:56 +03:00
velioglu	512d23934f	Show router modify,select and real-time queries on MX views	2018-10-02 13:59:38 +03:00
Murat Tuncer	653c7e4ae0	Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 11:13:21 +03:00
Onder Kalaci	4cae856846	Relax assertion on transaction abort on PREPARE step In case a failure happens when a transaction is failed on PREPARE, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-09-17 18:09:16 +03:00
Onder Kalaci	a94184fff8	Prevent overflow of memory accesses during deadlock detection In the distributed deadlock detection design, we concluded that prepared transactions cannot be part of a distributed deadlock. The idea is that (a) when the transaction is prepared it already acquires all the locks, so cannot be part of a deadlock (b) even if some other processes blocked on the prepared transaction, prepared transactions would eventually be committed (or rollbacked) and the system will continue operating. With the above in mind, we probably had a mistake in terms of memory allocations. For each backend initialized, we keep a `BackendData` struct. The bug we've introduced is that, we assumed there would only be `MaxBackend` number of backends. However, `MaxBackends` doesn't include the prepared transactions and axuliary processes. When you check Postgres' InitProcGlobal` you'd see that `TotalProcs = MaxBackends + NUM_AUXILIARY_PROCS + max_prepared_xacts;` This commit aligns with total procs processed with that.	2018-09-17 16:23:29 +03:00
velioglu	d1f005daac	Adds UDFs for testing MX functionalities with isolation tests	2018-09-12 07:04:16 +03:00
Onder Kalaci	d657759c97	Views to Provide some insight about the distributed transactions on Citus MX With this commit, we implement two views that are very similar to pg_stat_activity, but showing queries that are involved in distributed queries: - citus_dist_stat_activity: Shows all the distributed queries - citus_worker_stat_activity: Shows all the queries on the shards that are initiated by distributed queries. Both views have the same columns in the outputs. In very basic terms, both of the views are meant to provide some useful insights about the distributed transactions within the cluster. As the names reveal, both views are similar to pg_stat_activity. Also note that these views can be pretty useful on Citus MX clusters. Note that when the views are queried from the worker nodes, they'd not show the distributed transactions that are initiated from the coordinator node. The reason is that the worker nodes do not know the host/port of the coordinator. Thus, it is advisable to query the views from the coordinator. If we bucket the columns that the views returns, we'd end up with the following: - Hostnames and ports: - query_hostname, query_hostport: The node that the query is running - master_query_host_name, master_query_host_port: The node in the cluster initiated the query. Note that for citus_dist_stat_activity view, the query_hostname-query_hostport is always the same with master_query_host_name-master_query_host_port. The distinction is mostly relevant for citus_worker_stat_activity. For example, on Citus MX, a users starts a transaction on Node-A, which starts worker transactions on Node-B and Node-C. In that case, the query hostnames would be Node-B and Node-C whereas the master_query_host_name would Node-A. - Distributed transaction related things: This is mostly the process_id, distributed transactionId and distributed transaction number. - pg_stat_activity columns: These two views get all the columns from pg_stat_activity. We're basically joining pg_stat_activity with get_all_active_transactions on process_id.	2018-09-10 21:33:27 +03:00
Onder Kalaci	76aa6951c2	Properly send commands to other nodes We previously implemented OTHER_WORKERS_WITH_METADATA tag. However, that was wrong. See the related discussion: https://github.com/citusdata/citus/issues/2320 Instead, we switched using OTHER_WORKER_NODES and make the command that we're running optional such that even if the node is not a metadata node, we won't be in trouble.	2018-09-10 16:01:30 +03:00
Onder Kalaci	bf28dd0cff	Do not recover wrong distributed transactions in MX	2018-09-07 09:52:46 +03:00
Onder Kalaci	26e308bf2a	Support TRUNCATE from the MX worker nodes This commit enables support for TRUNCATE on both distributed table and reference tables. The basic idea is to acquire lock on the relation by sending the TRUNCATE command to all metedata worker nodes. We only skip sending the TRUNCATE command to the node that actually executus the command to prevent a self-distributed-deadlock.	2018-09-03 14:06:31 +03:00
Onder Kalaci	97ba7bf2eb	Add the option to skip the node that is executing the node	2018-09-03 14:01:24 +03:00
velioglu	bd30e3e908	Add support for writing to reference tables from MX nodes	2018-08-27 18:15:04 +03:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Nils Dijk	2a9d47e1a6	fix pg11 tests	2018-08-15 23:27:31 -06:00
Onder Kalaci	7fb529aab9	Some stylistic improvements in the foreign keys to reference table changes.	2018-07-05 23:23:34 +03:00
Onder Kalaci	d83be3a33f	Enforce foreign key restrictions inside transaction blocks When a hash distributed table have a foreign key to a reference table, there are few restrictions we have to apply in order to prevent distributed deadlocks or reading wrong results. The necessity to apply the restrictions arise from cascading nature of foreign keys. When a foreign key on a reference table cascades to a distributed table, a single operation over a single connection can acquire locks on multiple shards of the distributed table. Thus, any parallel operation on that distributed table, in the same transaction should not open parallel connections to the shards. Otherwise, we'd either end-up with a self-distributed deadlock or read wrong results. As briefly described above, the restrictions that we apply is done by tracking the distributed/reference relation accesses inside transaction blocks, and act accordingly when necessary. The two main rules are as follows: - Whenever a parallel distributed relation access conflicts with a consecutive reference relation access, Citus errors out - Whenever a reference relation access is followed by a conflicting parallel relation access, the execution mode is switched to sequential mode. There are also some other notes to mention: - If the user does SET LOCAL citus.multi_shard_modify_mode TO 'sequential';, all the queries should simply work with using one connection per worker and sequentially executing the commands. That's obviously a slower approach than Citus' usual parallel execution. However, we've at least have a way to run all commands successfully. - If an unrelated parallel query executed on any distributed table, we cannot switch to sequential mode. Because, the essense of sequential mode is using one connection per worker. However, in the presence of a parallel connection, the connection manager picks those connections to execute the commands. That contradicts with our purpose, thus we error out. - COPY to a distributed table cannot be executed in sequential mode. Thus, if we switch to sequential mode and COPY is executed, the operation fails and there is currently no way of implementing that. Note that, when the local table is not empty and create_distributed_table is used, citus uses COPY internally. Thus, in those cases, create_distributed_table() will also fail. - There is a GUC called citus.enforce_foreign_key_restrictions to disable all the checks. We added that GUC since the restrictions we apply is sometimes a bit more restrictive than its necessary. The user might want to relax those. Similarly, if you don't have CASCADEing reference tables, you might consider disabling all the checks.	2018-07-03 17:05:55 +03:00
Onder Kalaci	7d0f7835e7	Improve relation accesses association to do less job	2018-06-25 18:40:40 +03:00
Onder Kalaci	8ccb8b679e	Real-time executor marks multi shard relation accesses before opening connections	2018-06-25 18:40:31 +03:00
Onder Kalaci	21038f0d0e	Make sure that inter-shard DDL commands are always covers both tables	2018-06-25 18:40:30 +03:00
Onder Kalaci	2f01894589	Track relation accesses using the connection management infrastructure	2018-06-25 18:40:30 +03:00
Marco Slot	0feb1f2eb1	Do not call CheckRemoteTransactionsHealth from commit handler	2018-06-14 23:33:07 +02:00
Marco Slot	4ab8e87090	Always throw errors on failure on critical connection in router executor	2018-06-14 23:33:07 +02:00
Brian Cloutier	9667ee5ac9	Alleviate OOM failures in COMMIT callback Previously those failures caused us to crash, postgres abort()s when it notices a failure in the COMMIT callback.	2018-05-15 16:39:33 -07:00
Brian Cloutier	4c2bf5d2d6	Move call to RemoveIntermediateResultsDirectory Errors thrown in the COMMIT handler will cause Postgres to segfault, there's nothing it can do it abort the transaction by the time that handler is called! RemoveIntermediateResultsDirectory is problematic for two reasons: - It has calls to ereport(ERROR which have been known to trigger - It makes memory allocations which raise ERRORs when they fail Once the COMMIT process has begun we don't use the intermediate results, so it's safe to remove them a little earlier in the process. A failure here will abort the transaction. That's pretty unnecessary, it's not that important that we remove the results, but it's still better than a crash.	2018-05-10 19:28:41 -07:00
Murat Tuncer	42a8082721	PG11 compatibility refresh adds a shim for a changed function api	2018-05-03 13:21:15 -06:00
mehmet furkan şahin	a4153c6ab1	notice handler is implemented	2018-04-27 14:37:01 +03:00
Marco Slot	3d3c19a717	Improve messages for essential connection failures	2018-04-26 12:58:47 -06:00
Önder Kalacı	ebb8f902c8	Relax assertion on transaction rollback failure (#2052 ) In case a failure happens when a transaction is rollbacked, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-04-26 13:39:03 -04:00
Murat Tuncer	a6fe5ca183	PG11 compatibility update - changes in ruleutils_11.c is reflected - vacuum statement api change is handled. We now allow multi-table vacuum commands. - some other function header changes are reflected - api conflicts between PG11 and earlier versions are handled by adding shims in version_compat.h - various regression tests are fixed due output and functionality in PG1 - no change is made to support new features in PG11 they need to be handled by new commit	2018-04-26 11:29:43 +03:00
Brian Cloutier	8d4c4d5c58	Close all files before trying to remove them	2018-04-24 14:35:20 -07:00
Brian Cloutier	c5f1235090	Turn the crashes on Windows into WARNINGs	2018-04-24 14:35:20 -07:00
Matthew Wozniczka	4582a4b398	Fixed a typo	2018-03-27 22:51:36 -06:00
Marco Slot	6051aae56e	Handle errors that are discovered during abort	2018-02-12 16:45:02 +01:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Brian Cloutier	2efe80ce55	CheckForDistributedDeadlocks no longer uses a VLA - variable length arrays (VLAs) do not work with Visual Studio - fix an off-by-one error. We incorrectly assumed there would always at least as many edges as there were nodes. - refactor: reduce scope of transactionNodeStack by moving it into the function which uses it. - refactor: break up the distinct uses of currentStackDepth into separate variables.	2018-02-01 10:30:41 -08:00
Brian Cloutier	097fd15a89	small refactor, CheckDeadlockForTransactionNode builds it's own array	2018-02-01 10:30:41 -08:00
Brian Cloutier	61a6b846b9	Refactor: use a temporary timestamp variable It's against our coding convention to call functions inside parameter lists; when single-stepping with a debugger it's difficult to determine what the function returned. That wouldn't be good enough reason to change this code but while porting Citus to Windows I ran into this line of code. assign_distributed_transaction_id was called with a weird timestamp and I wasn't able to find the problem without first making this change.	2018-01-29 11:20:13 -08:00
Onder Kalaci	fbde87d2d0	Allocate enough space for transaction nodes This fix prevents any potential memory access that might occur while forming the deadlock path.	2018-01-22 08:45:48 +02:00
Onder Kalaci	9a89c0b425	Fix bug while traversing the distributed deadlock graph With this fix, we traverse the graph with DFS which was originally intended. Note that, before the fix, we traverse the graph with BFS which might lead to killing some unrelated backend that is not involved in the distributed deadlock.	2018-01-22 08:45:48 +02:00

1 2 3

141 Commits (796141334dde262da3ebdf449423a2be5d207761)