citus

Commit Graph

Author	SHA1	Message	Date
Onur Tirtir	f80f4839ad	Remove unused functions that cppcheck found	2020-10-19 13:50:52 +03:00
Jelte Fennema	c6f5d5fe88	Add some asserts to pass static analysis (#3805 )	2020-04-29 11:19:11 +02:00
Jelte Fennema	1d8dde232f	Automatically convert useless declarations using regex replace (#3181 ) * Add declaration removal to CI * Convert declarations	2019-11-21 13:47:29 +01:00
Önder Kalacı	960cd02c67	Remove real time router executors (#3142 ) * Remove unused executor codes All of the codes of real-time executor. Some functions in router executor still remains there because there are common functions. We'll move them to accurate places in the follow-up commits. * Move GUCs to transaction mngnt and remove unused struct * Update test output * Get rid of references of real-time executor from code * Warn if real-time executor is picked * Remove lots of unused connection codes * Removed unused code for connection restrictions Real-time and router executors cannot handle re-using of the existing connections within a transaction block. Adaptive executor and COPY can re-use the connections. So, there is no reason to keep the code around for applying the restrictions in the placement connection logic.	2019-11-05 12:48:10 +01:00
SaitTalhaNisanci	94a7e6475c	Remove copyright years (#2918 ) * Update year as 2012-2019 * Remove copyright years	2019-10-15 17:44:30 +03:00
Hadi Moshayedi	b69a762e0b	Fix savepoint rollback after multi-shard update failure.	2019-05-01 09:33:43 -07:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Marco Slot	625816242a	Don't try to check unopened connection in EXEC_TASK_FAILED state	2018-07-23 11:41:02 -06:00
Onder Kalaci	a5370f5bb0	Realtime executor honours multi_shard_modify_mode We're relying on multi_shard_modify_mode GUC for real-time SELECTs. The name of the GUC is unfortunate, but, adding one more GUC (or renaming the GUC) would make the UX even worse. Given that this mode is mostly important for transaction blocks that involve modification /DDL queries along with real-time SELECTs, we can live with the confusion.	2018-06-06 14:59:54 +03:00
Murat Tuncer	224b0a8c14	Replace poll with select/poll Windows does not have poll(), so fall back to select()	2018-03-21 20:05:00 -07:00
Marco Slot	8f69973411	Fix cancellation issues in the real-time executor (#1905 )	2018-01-01 23:10:29 -05:00
Marco Slot	fa7fa2734b	Log remote commands sent via MultiClientSendQuery	2017-12-22 16:18:40 +01:00
mehmet furkan şahin	fd546cf322	Intermediate result size limitation This commit introduces a new GUC to limit the intermediate result size which we handle when we use read_intermediate_result function for CTEs and complex subqueries.	2017-12-21 14:26:56 +03:00
Murat Tuncer	2d66bf5f16	Fix hard coded formatting strings for 64 bit numbers (#1831 ) Postgres provides OS agnosting formatting macros for formatting 64 bit numbers. Replaced %ld %lu with INT64_FORMAT and UINT64_FORMAT respectively. Also found some incorrect usages of formatting flags and fixed them.	2017-12-04 14:11:06 +03:00
Marco Slot	a9933deac6	Make real time executor work in transactions	2017-11-30 09:59:32 +03:00
Marco Slot	43d5e79eaa	Execute transmit commands as superuser during task-tracker queries	2017-09-28 15:27:25 +02:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Andres Freund	21c25abbb1	Move multi_client_executor to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Murat Tuncer	0c4bf2d943	Remove fall back to select if poll is not available (#1466 ) poll is supported on all relevant systems, there is no need to have a fall back mechanism to use select()	2017-06-22 12:11:18 +03:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	2374905c89	Move multi_client_executor.[ch] ontop of connection_management.[ch]. That way connections can be automatically closed after errors and such, and the connection management infrastructure gets wider testing. It also fixes a few issues around connection string building.	2016-12-07 11:44:24 -08:00
Andres Freund	3223b3c92d	Centralized Connection Lifetime Management. Connections are tracked and released by integrating into postgres' transaction handling. That allows to to use connections without having to resort to having to disable interrupts or using PG_TRY/CATCH blocks to avoid leaking connections. This is intended to eventually replace multi_client_executor.c and connection_cache.c, and to provide the basis of a centralized transaction management. The newly introduced transaction hook should, in the future, be the only one in citus, to allow for proper ordering between operations. For now this central handler is responsible for releasing connections and resetting XactModificationLevel after a transaction.	2016-12-07 11:43:18 -08:00
Andres Freund	a6150c2916	Lower "waiting for activity on tasks took longer than" log level. It's perfectly normal to wait longer in several circumstances, and the output can lead to spurious regression output changes.	2016-10-03 13:07:43 -07:00
Jason Petersen	850c51947a	Re-permit DDL in transactions, selectively Recent changes to DDL and transaction logic resulted in a "regression" from the viewpoint of users. Previously, DDL commands were allowed in multi-command transaction blocks, though they were not processed in any actual transactional manner. We improved the atomicity of our DDL code, but added a restriction that DDL commands themselves must not occur in any BEGIN/END transaction block. To give users back the original functionality (and improved atomicity) we now keep track of whether a multi-command transaction has modified data (DML) or schema (DDL). Interleaving the two modification types in a single transaction is disallowed. This first step simply permits a single DDL command in such a block, admittedly an incomplete solution, but one which will permit us to add full multi-DDL command support in a subsequent commit.	2016-08-30 20:37:19 -06:00
Metin Doslu	75618fc3fb	Return false in MultiClientQueryResult() on failing query	2016-08-29 17:05:35 +03:00
Marco Slot	9705cbcdf8	Rewrite WorkerShardStats to avoid invalid value bugs	2016-07-29 20:11:18 +02:00
Marco Slot	5e432449ba	Add MultiClientExecute and MultiClientValueIsNull for simple remote query execution	2016-07-29 20:07:18 +02:00
Onder Kalaci	56c0b0825f	Fix bug related to poll timeout This commit fixes a bug on setting polling timeout. The code updated to comform to the comment that is already placed.	2016-07-26 09:55:47 +03:00
Jason Petersen	5d525fba24	Permit "single-shard" transactions Allows the use of modification commands (INSERT/UPDATE/DELETE) within transaction blocks (delimited by BEGIN and ROLLBACK/COMMIT), so long as all modifications hit a subset of nodes involved in the first such com- mand in the transaction. This does not circumvent the requirement that each individual modification command must still target a single shard. For instance, after sending BEGIN, a user might INSERT some rows to a shard replicated on two nodes. Subsequent modifications can hit other shards, so long as they are on one or both of these nodes. SAVEPOINTs are supported, though if the user actually attempts to send a ROLLBACK command that specifies a SAVEPOINT they will receive an ERROR at the end of the topmost transaction. Placements are only marked inactive if at least one replica succeeds in a transaction where others fail. Non-atomic behavior is possible if the shard targeted by the initial modification within a transaction has a higher replication factor than another shard within the same block and a node with the latter shard has a failure during the COMMIT phase. Other methods of denoting transaction blocks (multi-statement commands sent all at once and functions written in e.g. PL/pgSQL or other such languages) are not presently supported; their treatment remains the same as before.	2016-07-21 15:57:22 -06:00
Jason Petersen	9ba02928ac	Refactor ReportRemoteError to remove boolean arg Broke it into two explicitly-named functions instead: WarnRemoteError and ReraiseRemoteError.	2016-06-07 12:38:32 -06:00
Metin Doslu	7d0c90b398	Fail fast on constraint violations in router executor	2016-06-07 18:11:17 +03:00
Andres Freund	3dac0a4d14	Rely less on remote_task_check_interval. When executing queries with citus.task_executor = 'real-time', query execution could, so far, spend a significant amount of time sleeping. That's because we were a) sleeping after several phases of query execution, even if we're not waiting for network IO b) sleeping for a fixed amount of time when waiting for network IO; often a lot longer than actually required. Just reducing the amount of time slept isn't a real solution, because that just increases CPU usage. Instead have the real-time executor's ManageTaskExecution return whether a task is currently being processed, waiting for reads or writes, or failed. When all tasks are waiting for IO use poll() to wait for IO readyness. That requires to slightly redefine how connection timeouts are handled: before we counted the number of times ManageTaskExecution() was called, and compared that with the timeout divided by the task check interval. That, if processing of tasks took a while, could significantly increase the time till a timeout occurred. Because it was based on the ManageTaskExecution() being called on a constant interval, this approach isn't feasible anymore. Instead measure the actual time since connection establishment was started. That could in theory, if task processing takes a very long time, lead to few passes over PQconnectPoll(). The problem of sleeping too much also exists for the 'task-tracker' executor, but is generally less problematic there, as processing the individual tasks usually will take longer. That said, for e.g. the regression tests it'd be helpful to use a similar approach.	2016-06-02 12:11:16 -06:00
Jason Petersen	e774f22ed4	Fix formatting Checking in citus_indent output.	2016-05-27 15:13:28 -06:00
Amos Bird	92788a0d9c	Remove redundant implementations of error funcs. This patch does some basic cleaning jobs. It removes duplicated implementations of ReportRemoteError() and related ones and adjusts regression tests.	2016-05-27 15:12:59 -06:00
Andres Freund	42d232c0e8	Use the current session's username when connecting to worker nodes. So far we've always used libpq defaults when connecting to workers; bar special environment variables being set that'll always be the user that started the server. That's not desirable because it prevents using users with fewer privileges. Thus change the various APIs creating connections to workers to always use usernames. That means: 1) MultiClientConnect() needs to, optionally, accept a username 2) GetOrEstablishConnection(), including the underlying cache, need to use the current user as part of the connection cache key. That way connections for separate users are distinct, and we always use one with the correct authorization. 3) The task tracker needs to keep track of the username associated with a task, so it can use it when establishing connections outside the originating session.	2016-04-27 10:00:08 -07:00
Andres Freund	29b8576a33	Annotate variables only used for asserts with PG_USED_FOR_ASSERTS_ONLY. This avoids '-Wunused-but-set-variable' type warnings when compiling without assertions, e.g. against a system postgres.	2016-04-19 12:31:12 -06:00
Jason Petersen	423e6c8ea0	Update copyright dates Fixed configure variable and updated all end dates to 2016.	2016-03-23 17:14:37 -06:00
Jason Petersen	fdb37682b2	First formatting attempt Skipped csql, ruleutils, readfuncs, and functions obviously copied from PostgreSQL. Seeing how this looks, then continuing.	2016-02-15 23:29:32 -07:00
Onder Kalaci	136306a1fe	Initial commit of Citus 5.0	2016-02-11 04:05:32 +02:00

39 Commits (f80f4839ad3410c8ff76c3b4b84678480aa208ab)