citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	73696a03e4	Make sure not to leak intermediate result folders on the workers	2018-10-09 22:47:56 +03:00
Hadi Moshayedi	7509c6c8fb	Add tests which check we disallow writes to local tables.	2018-10-06 10:54:44 +02:00
Marco Slot	d56baefe3d	Allow simple DML commands from hot standby	2018-10-06 10:54:44 +02:00
Jason Petersen	1cb48416eb	Add reference table failure tests Fairly straightforward; verified that modifications fail atomically if a worker is down or fails mid-transaction (i.e. all workers need to ack modifications to reference tables in order to persist changes).	2018-10-09 09:39:30 -07:00
Jason Petersen	9bcf2873a7	Add single-shard router select failure tests Including several examples from #1926. I couldn't understand why the recover_prepared_transactions "should be an error", and EXPLAIN has changed since the original bug (so that it runs EXPLAINs in txns, I think for EXPLAIN ANALYZE to not have side effects); other than that, most of the reported bugs now error out rather than crash or return an empty result set.	2018-10-09 08:51:10 -07:00
Jason Petersen	8f2aa00951	Add failure tests for VACUUM/ANALYZE VACUUM runs outside of a transaction, so the failure modes for it are somewhat straightforward, though ANALYZE runs in a 1pc transaction and multi-table VACUUM can fail between statements (PG 11 and higher).	2018-10-09 08:50:37 -07:00
Jason Petersen	ee4114bc7a	Failure tests for modifying multiple shards in txn Tests various failure points during a multi-shard modification within a transaction with multiple statements. Verifies three cases: * Reference tables (single shard, many placements) * Normal table with replication factor two * Multi-shard table with no replication In the replication-factor case, we expect shard health to be affected in some transactions; most others fail the transaction entirely and all we need verify is that no effects of the transaction are visible. Had trouble testing the final PREPARE/COMMIT/ROLLBACK phase of the 2pc, in particular because the error message produced includes the PID of the backend, which is unpredictable.	2018-10-09 09:17:32 -06:00
Murat Tuncer	4f8042085c	Fix drop schema in mx with partitioned tables Drop schema command fails in mx mode if there is a partitioned table with active partitions. This is due to fact that sql drop trigger receives all the dropped objects including partitions. When we call drop table on parent partition, it also drops the partitions on the mx node. This causes the drop table command on partitions to fail on mx node because they are already dropped when the partition parent was dropped. With this work we did not require the table to exist on worker_drop_distributed_table.	2018-10-08 17:01:54 -07:00
Murat Tuncer	71a910d2fa	Add failure tests for insert/select via coordinator	2018-10-04 18:01:19 +03:00
Murat Tuncer	0a987e9c0e	Fix cte subquery failure test	2018-10-03 15:43:48 +03:00
Murat Tuncer	d26b312cad	Add failure test for coordinator pull/push for cte	2018-10-03 15:43:48 +03:00
Murat Tuncer	6c66033455	Add failure tests for multi-shard update/delete Failure tests for update/delete on hash distributed tables using 1PC and 2PC	2018-10-03 15:43:48 +03:00
velioglu	512d23934f	Show router modify,select and real-time queries on MX views	2018-10-02 13:59:38 +03:00
Murat Tuncer	9bdef67bab	Do not create inherited constraints on worker shards PG now allows foreign keys on partitioned tables. Each foreign key constraint on partitioned table is propagated down to partitions. We used to create all constraints on shards when we are creating a new shard, or when just simply moving a shard from one worker to another. We also used the same logic when creating a copy of coordinator table in mx node. With this change we create the constraint on worker node only if it is not an inherited constraint.	2018-09-28 14:14:51 +03:00
Murat Tuncer	653c7e4ae0	Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 11:13:21 +03:00
Onder Kalaci	cdc0d1491c	Make sure to use correct execution mode for TRUNCATE We used to set the execution mode in the truncate trigger. However, when multiple tables are truncated with a single command, we could set the execution mode very late. Instead, now set the execution mode on the utility hook.	2018-09-25 15:35:27 +03:00
Marco Slot	1ca9a5b867	Do not allow unresolved parameters in INSERT...SELECT	2018-09-24 14:12:04 +02:00
Jason Petersen	d7f10b0896	Rewrite parallel ID test to avoid costly JITting By setting the CPU tuple cost so high, we were triggering JIT. Instead, we should use parallel_tuple_cost. See: rhaas.blogspot.com/2018/06/using-forceparallelmode-correctly.html	2018-09-24 09:29:53 +03:00
Jason Petersen	e62a1ab43d	Revert "Disable JIT during PostgreSQL 11 test runs" This reverts commit `a2fb5a84f1`. JIT wasn't actually interfering with the operation of Citus, a test was just written in a way which caused JIT to run for a function on every row in a 150k-row table.	2018-09-24 09:29:53 +03:00
Marco Slot	877d703ac5	Evaluate functions (and when applicable, parameters) anywhere in query	2018-09-21 12:57:50 -06:00
Onder Kalaci	abc443d7fa	Make sure that shard repair considers replication factor	2018-09-21 15:24:49 +03:00
Onder Kalaci	8520a5b432	worker_append_table_to_shard becomes aware of partitioned tables	2018-09-21 14:40:42 +03:00
Onder Kalaci	c1b5a04f6e	Allow partitioned tables with replication factor > 1 With this commit, we all partitioned distributed tables with replication factor > 1. However, we also have many restrictions. In summary, we disallow all kinds of modifications (including DDLs) on the partition tables. Instead, the user is allowed to run the modifications over the parent table. The necessity for such a restriction have two aspects: - We need to acquire shard resource locks appropriately - We need to handle marking partitions INVALID in case of any failures. Note that, in theory, the parent table should also become INVALID, which is too aggressive.	2018-09-21 14:40:41 +03:00
Murat Tuncer	b6930e3db9	Add distributed locking to truncated mx tables We acquire distributed lock on all mx nodes for truncated tables before actually doing truncate operation. This is needed for distributed serialization of the truncate command without causing a deadlock.	2018-09-21 14:23:19 +03:00
velioglu	d7f75e5b48	Add citus_lock_waits to show locked distributed queries	2018-09-20 14:13:51 +03:00
Murat Tuncer	0f6e514bfb	Fixes a bug on not being able to drop index on a partitioned table. Reason for the failure is that PG11 introduced a new relation kind RELKIND_PARTITIONED_INDEX to be used for partitioned indices. We expanded our check to cover that case.	2018-09-19 13:15:05 +03:00
Marco Slot	f34ab55389	Fix bug preventing rollback in stored procedure	2018-08-31 20:49:20 +02:00
Onder Kalaci	41d606b575	Use tree walker instad of mutator in relation visibility This commit uses _walker instead of _mutator for performance reasons. Given that we're only updating a functionId in the tree, the approach seems fine.	2018-09-18 09:33:01 +03:00
Onder Kalaci	4cae856846	Relax assertion on transaction abort on PREPARE step In case a failure happens when a transaction is failed on PREPARE, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-09-17 18:09:16 +03:00
Onder Kalaci	a94184fff8	Prevent overflow of memory accesses during deadlock detection In the distributed deadlock detection design, we concluded that prepared transactions cannot be part of a distributed deadlock. The idea is that (a) when the transaction is prepared it already acquires all the locks, so cannot be part of a deadlock (b) even if some other processes blocked on the prepared transaction, prepared transactions would eventually be committed (or rollbacked) and the system will continue operating. With the above in mind, we probably had a mistake in terms of memory allocations. For each backend initialized, we keep a `BackendData` struct. The bug we've introduced is that, we assumed there would only be `MaxBackend` number of backends. However, `MaxBackends` doesn't include the prepared transactions and axuliary processes. When you check Postgres' InitProcGlobal` you'd see that `TotalProcs = MaxBackends + NUM_AUXILIARY_PROCS + max_prepared_xacts;` This commit aligns with total procs processed with that.	2018-09-17 16:23:29 +03:00
Marco Slot	55f46acedf	Support TABLESAMPLE in router queries	2018-08-31 13:22:38 +02:00
Brian Cloutier	2fae06056a	Attempt to stabilize packet dumps and add them back it	2018-09-12 22:10:39 -06:00
Brian Cloutier	5bde8626c5	Travis uses Pipfile instead of re-specifying deps	2018-09-12 17:37:14 -06:00
Brian Cloutier	e61e5d4980	Update mitmproxy version to remove vulnerability warnings	2018-09-12 17:17:22 -06:00
Murat Tuncer	ae0032dff8	Add regression tests for procedure calls PG11 introduced PROCEDURE concept similar to FUNCTION Procedure's allow committing/rolling back behavior. This commmit adds regression tests for procedure calls.	2018-09-12 10:28:50 +03:00
velioglu	d1f005daac	Adds UDFs for testing MX functionalities with isolation tests	2018-09-12 07:04:16 +03:00
Murat Tuncer	470ee0b4d9	Revert multi_partition test back to being required Test was marked as optional (ignore) by previous commit. Reverting that change to make test required	2018-09-11 12:39:44 -06:00
Onder Kalaci	d657759c97	Views to Provide some insight about the distributed transactions on Citus MX With this commit, we implement two views that are very similar to pg_stat_activity, but showing queries that are involved in distributed queries: - citus_dist_stat_activity: Shows all the distributed queries - citus_worker_stat_activity: Shows all the queries on the shards that are initiated by distributed queries. Both views have the same columns in the outputs. In very basic terms, both of the views are meant to provide some useful insights about the distributed transactions within the cluster. As the names reveal, both views are similar to pg_stat_activity. Also note that these views can be pretty useful on Citus MX clusters. Note that when the views are queried from the worker nodes, they'd not show the distributed transactions that are initiated from the coordinator node. The reason is that the worker nodes do not know the host/port of the coordinator. Thus, it is advisable to query the views from the coordinator. If we bucket the columns that the views returns, we'd end up with the following: - Hostnames and ports: - query_hostname, query_hostport: The node that the query is running - master_query_host_name, master_query_host_port: The node in the cluster initiated the query. Note that for citus_dist_stat_activity view, the query_hostname-query_hostport is always the same with master_query_host_name-master_query_host_port. The distinction is mostly relevant for citus_worker_stat_activity. For example, on Citus MX, a users starts a transaction on Node-A, which starts worker transactions on Node-B and Node-C. In that case, the query hostnames would be Node-B and Node-C whereas the master_query_host_name would Node-A. - Distributed transaction related things: This is mostly the process_id, distributed transactionId and distributed transaction number. - pg_stat_activity columns: These two views get all the columns from pg_stat_activity. We're basically joining pg_stat_activity with get_all_active_transactions on process_id.	2018-09-10 21:33:27 +03:00
Onder Kalaci	7de5e30432	Change flaky explain test to non-explain This test's output changes depending on which worker is picked for explain (e.g., worker port in the output changes). Given that the test is only aiming to ensure that CTEs inside CTEs work fine in DML queries, it should be fine to get rid of the EXPLAIN. The output is verified to be correct as well.	2018-09-10 16:01:30 +03:00
Onder Kalaci	76aa6951c2	Properly send commands to other nodes We previously implemented OTHER_WORKERS_WITH_METADATA tag. However, that was wrong. See the related discussion: https://github.com/citusdata/citus/issues/2320 Instead, we switched using OTHER_WORKER_NODES and make the command that we're running optional such that even if the node is not a metadata node, we won't be in trouble.	2018-09-10 16:01:30 +03:00
Onder Kalaci	5cf8fbe7b6	Add infrastructure to relation if exists	2018-09-07 14:49:36 +03:00
Onder Kalaci	bf28dd0cff	Do not recover wrong distributed transactions in MX	2018-09-07 09:52:46 +03:00
Murat Tuncer	65276311f7	Reflect changed index for constraint scans in PG11	2018-09-07 08:07:01 +03:00
Murat Tuncer	d8279569b8	Add support for INCLUDE option in index creation INCLUDE is a new feature in index creation in PG11. Included column/expression paramameters are now forwarded to shards	2018-09-06 19:41:06 +03:00
Murat Tuncer	7d3f7c2bf4	Add regression tests related to new PG11 partitioning features	2018-09-06 19:06:28 +03:00
Murat Tuncer	55cf3e321c	Add regression tests for new PG11 window functions - <offset> preceding/following - exclude	2018-09-04 10:48:04 +03:00
Onder Kalaci	1b3257816e	Make sure that table is dropped before shards are dropped This commit fixes a bug where a concurrent DROP TABLE deadlocks with SELECT (or DML) when the SELECT is executed from the workers. The problem was that Citus used to remove the metadata before droping the table on the workers. That creates a time window where the SELECT starts running on some of the nodes and DROP table on some of the other nodes.	2018-09-04 08:57:20 +03:00
Onder Kalaci	2ab0e63b30	Fix flaky test	2018-09-03 14:06:32 +03:00
Onder Kalaci	26e308bf2a	Support TRUNCATE from the MX worker nodes This commit enables support for TRUNCATE on both distributed table and reference tables. The basic idea is to acquire lock on the relation by sending the TRUNCATE command to all metedata worker nodes. We only skip sending the TRUNCATE command to the node that actually executus the command to prevent a self-distributed-deadlock.	2018-09-03 14:06:31 +03:00
Onder Kalaci	97ba7bf2eb	Add the option to skip the node that is executing the node	2018-09-03 14:01:24 +03:00
velioglu	bd30e3e908	Add support for writing to reference tables from MX nodes	2018-08-27 18:15:04 +03:00
velioglu	2639149bd8	Enterprise functions about metadata/resource locks	2018-08-27 16:32:20 +03:00
Onder Kalaci	b8af8c359b	Make sure that modifying CTEs always use the correct execution mode	2018-08-23 14:53:55 +03:00
Onder Kalaci	910ea392f5	Prevent multiple placements of a single shard to lead huge memory allocations	2018-08-22 19:25:01 +03:00
Onder Kalaci	cb481f55cf	Prevent excessive number of unnecessary range table traversal	2018-08-22 11:45:00 +03:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Jason Petersen	c3c0d62ca6	Add test showing poolinfo validation works In other words, that it errors out.	2018-08-16 20:14:18 -06:00
Jason Petersen	c4e2349b80	Mark failing PostgreSQL 11 test as ignored This commit should be reverted once a new PostgreSQL 11 beta is available: it's due to a bug in the partitioning code which has been fixed in REL_11_STABLE but (not yet) a released tag.	2018-08-16 19:37:37 -06:00
Jason Petersen	a2fb5a84f1	Disable JIT during PostgreSQL 11 test runs It's causing problems with one of our tests.	2018-08-16 19:37:14 -06:00
Nils Dijk	6cf4516fdb	fix \d change for indexes in pg11	2018-08-15 23:27:31 -06:00
Nils Dijk	2a9d47e1a6	fix pg11 tests	2018-08-15 23:27:31 -06:00
mehmet furkan şahin	1a3b9f731e	Make master_disable/activate_node runnable when superuser	2018-08-15 00:43:35 -07:00
Onder Kalaci	85d418412d	Fix DDL execution problem on MX when search_path is used Make sure that the coordinator sends the commands when the search path synchronised with the coordinator's search_path. This is only important when Citus sends the commands that are directly relayed to the worker nodes. For example, the deparsed DLL commands or queries always adds schema qualifications to the queries. So, they do not require this change.	2018-08-13 16:34:50 +03:00
velioglu	44fc9f46fc	Add create_distributed_table (without data) failure tests	2018-08-13 09:31:15 +03:00
Onder Kalaci	974cbf11a5	Hide shard names on MX worker nodes This commit by default enables hiding shard names on MX workers by simple replacing `pg_table_is_visible()` calls with `citus_table_is_visible()` calls on the MX worker nodes. The latter function filters out tables that are known to be shards. The main motivation of this change is a better UX. The functionality can be opted out via a GUC. We also added two views, namely citus_shards_on_worker and citus_shard_indexes_on_worker such that users can query them to see the shards and their corresponding indexes. We also added debug messages such that the filtered tables can be interactively seen by setting the level to DEBUG1.	2018-08-07 14:21:45 +03:00
Onder Kalaci	e13da6a343	Add infrastructure to hide shards on MX worker nodes Add ability to understand whether a table is a known shard on MX workers. Note that this is only useful and applicable for hiding shards on MX worker nodes given that we can have metadata only there.	2018-08-04 09:03:37 +03:00
mehmet furkan şahin	c1f7631f98	failure tests on create_distributed_table nonempty	2018-08-03 12:41:25 -07:00
velioglu	b21bd2d1a0	Add create_reference_table failure tests	2018-08-03 17:49:57 +03:00
velioglu	bc27651dd9	Add failure test for copy on hash distributed table	2018-08-03 17:11:09 +03:00
Brian Cloutier	82fa85fa5b	Add tests for 1PC COPY on append and hash-distributed tables Add tests for 1PC COPY on append and hash-distributed tables	2018-07-31 15:17:59 -07:00
Brian Cloutier	f0f7a691a3	Prevent failure tests from hanging by using a port outside the ephemeral port range - mitmdump now listens on port 9060 - Add some logging to fluent.py, making issues like this easier to debug in the future - Fail the tests if something is already running on the port mitmProxy tries to use - check-failure now works with VPATH builds	2018-07-31 14:30:56 -07:00
mehmet furkan şahin	dde86cb731	Copy to reference table failure tests are added	2018-07-30 11:48:12 +03:00
mehmet furkan şahin	bc757845eb	Citus versioning fix	2018-07-26 10:56:34 +03:00
Brian Cloutier	ace248d13c	Remove unnecessary calls to 'conn.allow()'	2018-07-25 17:45:00 -07:00
mehmet furkan şahin	887aa8150d	Bump citus version to 8.0devel	2018-07-25 12:03:47 +03:00
velioglu	e23625bf5e	Use contype to check for FK constraint instead of reading catalog table	2018-07-24 15:53:05 +03:00
mehmet furkan şahin	6d0fbbace7	ALTER TABLE %s ADD COLUMN constraint check is added	2018-07-24 15:53:05 +03:00
Marco Slot	625816242a	Don't try to check unopened connection in EXEC_TASK_FAILED state	2018-07-23 11:41:02 -06:00
Nils Dijk	2d13900230	error on unsupported changing of distirbution column in ON CONFLICT for INSERT ... SELECT	2018-07-23 15:18:21 +02:00
Nils Dijk	6a15e1c9fc	extract ErrorIfOnConflictNotSupported function for reuse	2018-07-23 12:20:10 +02:00
Nils Dijk	df98900f80	fix missing space for tablein in error	2018-07-20 15:05:13 +02:00
Marco Slot	69a3ebea5f	Ensure StartPlacementListConnection connects with username supplied by the caller	2018-07-19 20:10:11 +02:00
Murat Tuncer	a837dde1a0	Add failure tests for master add/remove/disable/active node	2018-07-13 18:06:24 +03:00
mehmet furkan şahin	f854420079	truncate failure tests are added	2018-07-13 13:20:50 +03:00
Murat Tuncer	2795494758	Added failure test for create index concurrently	2018-07-13 11:53:49 +03:00
Onder Kalaci	a446e71ee7	Add failure testing for DDL commands This commit adds an extensive failure testing, which covers quite a bit of things and their combinations: - 1PC vs 2PC - Replication factor 1 and Replication factor 2 - Network failures and query cancellations - Sequential vs Parallel query execution mode	2018-07-12 13:05:29 +03:00
Jason Petersen	318119910b	Add pg_dist_poolinfo table For storing nodes' pool host/port overrides.	2018-07-10 09:30:22 -07:00
mehmet furkan şahin	3afa7f425d	Topn aggregates are supported	2018-07-10 14:33:42 +03:00
Murat Tuncer	a7277526fd	Make citus_stat_statements_reset() super user function	2018-07-10 11:21:20 +03:00
Marco Slot	89870e76ce	Add a select_opens_transaction_block GUC	2018-07-08 03:50:39 +02:00
Brian Cloutier	a54f9a6d2c	network proxy-based failure testing - Lots of detail is in src/test/regress/mitmscripts/README - Create a new target, make check-failure, which runs tests - Tells travis how to install everything and run the tests	2018-07-06 12:38:53 -07:00
Murat Tuncer	f20258ef10	Expand count distinct support We can now support more complex count distinct operations by pulling necessary columns to coordinator and evalutating the aggreage at coordinator. It supports broad range of expression with the restriction that the expression must contain a column.	2018-07-06 09:44:20 +03:00
Onder Kalaci	7fb529aab9	Some stylistic improvements in the foreign keys to reference table changes.	2018-07-05 23:23:34 +03:00
Brian Cloutier	735218ee5d	Remove sslmode structs and add more helpful description	2018-07-05 14:12:36 +02:00
Nils Dijk	c1c8c38dc9	create placeholder for policy ddl	2018-07-05 11:07:01 +02:00
mehmet furkan şahin	df11dda750	hll aggregates are tested	2018-07-05 08:19:01 +03:00
mehmet furkan şahin	06217be326	hll aggregate functions are supported natively	2018-07-04 16:41:09 +03:00
Murat Tuncer	901066a421	Move partition key logging related code from enterprise	2018-07-04 13:11:34 +03:00
mehmet furkan şahin	f7b901e3fd	CopyShardForeignConstraintCommandList API change for grouped constraints	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	35eac2318d	lock referenced reference table metadata is added For certain operations in enterprise, we need to lock the referenced reference table shard distribution metadata	2018-07-03 17:05:55 +03:00
Onder Kalaci	d83be3a33f	Enforce foreign key restrictions inside transaction blocks When a hash distributed table have a foreign key to a reference table, there are few restrictions we have to apply in order to prevent distributed deadlocks or reading wrong results. The necessity to apply the restrictions arise from cascading nature of foreign keys. When a foreign key on a reference table cascades to a distributed table, a single operation over a single connection can acquire locks on multiple shards of the distributed table. Thus, any parallel operation on that distributed table, in the same transaction should not open parallel connections to the shards. Otherwise, we'd either end-up with a self-distributed deadlock or read wrong results. As briefly described above, the restrictions that we apply is done by tracking the distributed/reference relation accesses inside transaction blocks, and act accordingly when necessary. The two main rules are as follows: - Whenever a parallel distributed relation access conflicts with a consecutive reference relation access, Citus errors out - Whenever a reference relation access is followed by a conflicting parallel relation access, the execution mode is switched to sequential mode. There are also some other notes to mention: - If the user does SET LOCAL citus.multi_shard_modify_mode TO 'sequential';, all the queries should simply work with using one connection per worker and sequentially executing the commands. That's obviously a slower approach than Citus' usual parallel execution. However, we've at least have a way to run all commands successfully. - If an unrelated parallel query executed on any distributed table, we cannot switch to sequential mode. Because, the essense of sequential mode is using one connection per worker. However, in the presence of a parallel connection, the connection manager picks those connections to execute the commands. That contradicts with our purpose, thus we error out. - COPY to a distributed table cannot be executed in sequential mode. Thus, if we switch to sequential mode and COPY is executed, the operation fails and there is currently no way of implementing that. Note that, when the local table is not empty and create_distributed_table is used, citus uses COPY internally. Thus, in those cases, create_distributed_table() will also fail. - There is a GUC called citus.enforce_foreign_key_restrictions to disable all the checks. We added that GUC since the restrictions we apply is sometimes a bit more restrictive than its necessary. The user might want to relax those. Similarly, if you don't have CASCADEing reference tables, you might consider disabling all the checks.	2018-07-03 17:05:55 +03:00
velioglu	6be6911ed9	Create foreign key relation graph and functions to query on it	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	89a8d6ab95	FK from dist to ref is tested for partitioning, MX	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	4db72c99f6	Specific DDLs are sequentialized when there is FK -[x] drop constraint -[x] drop column -[x] alter column type -[x] truncate are sequentialized if there is a foreign constraint from a distributed table to a reference table on the affected relations by the above commands.	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	e37f76c276	tests are added	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	2c5d59f3a8	create_distributed_table in transaction is fixed	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	45f8017f42	create_distributed_table with fk to ref table is implemented	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	2fa4e38841	FK from dist to ref can be added with alter table	2018-07-03 17:05:01 +03:00
Murat Tuncer	3fc98e8225	Add pg_stat_statements to shared_preload_libraries if installed	2018-07-03 16:33:15 +03:00
Murat Tuncer	23800f50f1	Update citus_stat_statements view and regression tests	2018-07-03 16:14:13 +03:00
Murat Tuncer	e532755a6e	Fix bug in partition column extraction added strip_implicit_coercion prior to checking if the expression is Const. This is important to find values for types like bigint.	2018-07-02 18:08:16 +03:00
Murat Tuncer	3fc7cdfe6d	Apply master_stage_protocol refactoring changes	2018-06-28 11:24:57 +03:00
Murat Tuncer	4d35b92016	Add groundwork for citus_stat_statements api	2018-06-27 14:20:03 +03:00
Brian Cloutier	5ce18327a7	Don't spinloop when trying to cleanup a failed connection	2018-06-26 13:13:34 -07:00
Onder Kalaci	4ccabf9544	Increase timeout to keep appveyor happy	2018-06-25 18:40:40 +03:00
Onder Kalaci	7d0f7835e7	Improve relation accesses association to do less job	2018-06-25 18:40:40 +03:00
Onder Kalaci	8ccb8b679e	Real-time executor marks multi shard relation accesses before opening connections	2018-06-25 18:40:31 +03:00
Onder Kalaci	2890154420	Make sure that TRUNCATE always opens a DDL access	2018-06-25 18:40:31 +03:00
Onder Kalaci	21038f0d0e	Make sure that inter-shard DDL commands are always covers both tables	2018-06-25 18:40:30 +03:00
Onder Kalaci	2f01894589	Track relation accesses using the connection management infrastructure	2018-06-25 18:40:30 +03:00
Onder Kalaci	d5472614df	Use non-data connection for intermediate results Make sure that intermediate results use a connection that is not associated with any placement. That is useful in two ways: - More complex queries can be executed with CTEs - Safely use the same connections when there is a foreign key to reference table from a distributed table, which needs to use the same connection for modifications since the reference table might cascade to the distributed table.	2018-06-21 13:26:13 +03:00
Onder Kalaci	7762d81cba	Move test UDF under test folder	2018-06-21 08:42:44 +03:00
Jason Petersen	7a75c2ed31	Add connparam invalidation trigger creation logic This needs to live in Community, since we haven't yet added the com- plication of having divergent upgrade scripts in Enterprise.	2018-06-20 14:13:18 -06:00
mehmet furkan şahin	2b2ce036eb	create_distributed_table honors sequential mode	2018-06-19 17:33:45 +03:00
Onder Kalaci	8f5821493a	Implement C interface for setting GUC We need the ability to switch to sequential mode (e.g., SET LOCAL citus.multi_shard_modify_mode = 'sequential'). This commit enables that.	2018-06-19 10:23:43 +03:00
Marco Slot	f3f2805978	Fix use-after-free that may occur for INSERT..SELECT in prepared statements	2018-06-18 22:55:06 -06:00
velioglu	53b2e81d01	Adds SELECT ... FOR UPDATE support for router plannable queries	2018-06-18 13:55:17 +03:00
Marco Slot	28860b2469	Remove volatile explain plan from regression tests	2018-06-15 00:21:52 +02:00
Marco Slot	04da0cf9b1	Remove costs from explain plans in window_functions tests	2018-06-14 23:51:46 +02:00
Marco Slot	0bbe778760	Rename failOnError to alwaysThrowErrorOnFailure	2018-06-14 23:37:47 +02:00
Marco Slot	0feb1f2eb1	Do not call CheckRemoteTransactionsHealth from commit handler	2018-06-14 23:33:07 +02:00
Marco Slot	bc1cc419e1	Fix could not receive query results error in regression test ouput	2018-06-14 23:33:07 +02:00
Marco Slot	4ab8e87090	Always throw errors on failure on critical connection in router executor	2018-06-14 23:33:07 +02:00
Nils Dijk	73efcb22c4	Extract RoleSpecString and resolve role references	2018-06-14 11:38:42 +02:00
Jason Petersen	5bf7bc64ba	Add pg_dist_authinfo schema and validation This table will be used by Citus Enterprise to populate authentication- related fields in outbound connections; Citus Community lacks support for this functionality.	2018-06-13 11:16:26 -06:00
Jason Petersen	57b3f253c5	Add node_conninfo GUC and related logic To support more flexible (i.e. not at compile-time) specification of libpq connection parameters, this change adds a new GUC, node_conninfo, which must be a space-separated string of key-value pairs suitable for parsing by libpq's connection establishment methods. To avoid rebuilding and parsing these values at connection time, this change also adds a cache in front of the configuration params to permit immediate use of any previously-calculated parameters.	2018-06-12 20:23:47 -06:00
mehmet furkan şahin	d1a3b20115	foreign_constraint_utils is created	2018-06-07 18:19:24 +03:00
Onder Kalaci	a5370f5bb0	Realtime executor honours multi_shard_modify_mode We're relying on multi_shard_modify_mode GUC for real-time SELECTs. The name of the GUC is unfortunate, but, adding one more GUC (or renaming the GUC) would make the UX even worse. Given that this mode is mostly important for transaction blocks that involve modification /DDL queries along with real-time SELECTs, we can live with the confusion.	2018-06-06 14:59:54 +03:00
Onder Kalaci	d918556dca	INSERT .. SELECT pushdown honors multi_shard_modification_mode	2018-06-06 12:42:23 +03:00
Onder Kalaci	336044f2a8	master_modify_multiple_shards() and TRUNCATE honors multi_shard_modification_mode	2018-06-06 12:29:05 +03:00
Onder Kalaci	51cb24b39c	Increase timeout to make the appveyor tests happy	2018-06-05 17:52:18 +03:00
Onder Kalaci	df44956dc3	Make sure that sequential DDL opens a single connection to each node After this commit DDL commands honour `citus.multi_shard_modify_mode`. We preferred using the code-path that executes single task router queries (e.g., ExecuteSingleModifyTask()) in order not to invent a new executor that is only applicable for DDL commands that require sequential execution.	2018-06-05 17:52:17 +03:00
Marco Slot	fd4ff29f2f	Add a debug message with distribution column value	2018-06-05 15:09:17 +03:00
Murat Tuncer	ba50e3f33e	Add handling for grant/revoke all tables in schema	2018-05-31 13:47:02 +03:00
velioglu	20acee2cd4	Bump citus version to 7.5devel	2018-05-28 17:25:21 -06:00
Brian Cloutier	a7e09d777b	Increase deadlock timeout so we get fewer signals	2018-05-16 17:07:24 -07:00
Brian Cloutier	9667ee5ac9	Alleviate OOM failures in COMMIT callback Previously those failures caused us to crash, postgres abort()s when it notices a failure in the COMMIT callback.	2018-05-15 16:39:33 -07:00
Marco Slot	61d2c0f618	Stabilise output of multi_shard_update_delete test	2018-05-11 08:33:23 +02:00
Onder Kalaci	ed47e4e6b9	Remove placementId from the ORDER BY to make results consistent	2018-05-11 17:04:50 +03:00
Onder Kalaci	12e50d96dc	Fix tests where hll is not installed	2018-05-11 16:01:47 +03:00
Brian Cloutier	4c2bf5d2d6	Move call to RemoveIntermediateResultsDirectory Errors thrown in the COMMIT handler will cause Postgres to segfault, there's nothing it can do it abort the transaction by the time that handler is called! RemoveIntermediateResultsDirectory is problematic for two reasons: - It has calls to ereport(ERROR which have been known to trigger - It makes memory allocations which raise ERRORs when they fail Once the COMMIT process has begun we don't use the intermediate results, so it's safe to remove them a little earlier in the process. A failure here will abort the transaction. That's pretty unnecessary, it's not that important that we remove the results, but it's still better than a crash.	2018-05-10 19:28:41 -07:00
Brian Cloutier	b3e85e4f71	Fix the huge appveyor diffs	2018-05-10 18:18:43 -07:00
mehmet furkan şahin	b8c3197399	enterprise test fixes	2018-05-10 13:06:54 +03:00
Onder Kalaci	04d9e886fe	Run concurrent modification queries in tests sequentially	2018-05-10 11:59:18 +03:00
mehmet furkan şahin	785a86ed0a	Tests are updated to use create_distributed_table	2018-05-10 11:18:59 +03:00
mehmet furkan şahin	d35f2725bf	valgrind tests fix	2018-05-10 10:20:14 +03:00
Dimitri Fontaine	8b258cbdb0	Lock reads and writes only to the node being updated in master_update_node Rather than locking out all the writes in the cluster, the function now only locks out writes that target shards hosted by the node we're updating.	2018-05-09 15:14:20 +02:00
Marco Slot	5f5f7b4fe0	Throw an error if placements cannot be found in router executor	2018-05-08 22:39:18 -04:00
Marco Slot	a7e6689890	Run recursive_dml_queries_mx test on its own	2018-05-06 17:12:53 +02:00
Marco Slot	9438e5bde9	Ensure single-shard modifying CTEs are part of distributed transaction	2018-05-06 12:49:40 +02:00
velioglu	caa27161ca	Check volatile functions in modify queries	2018-05-08 11:16:40 +03:00
Hadi Moshayedi	86b12bc2d0	Always prefix operators with their namespace. (#2147 ) Previously we checked if an operator is in pg_catalog, and if it wasn't we prefixed it with namespace in worker queries. This can have a huge impact on performance of physical planner when using custom data types. This happened regardless of current search_path config, because Citus overrides the search path in get_query_def_extended(). When we do so, the check for existence of the operator in current search path in generate_operator_name() fails for any operators outside pg_catalog. This means that nothing gets cached, and in the following calls we will again recheck the system tables for existence of the operators, which took an additional 40-50ms for some of the usecases we were seeing. In this change we skip the pg_catalog check, and always prefix the operator with its namespace.	2018-05-05 13:27:26 -04:00
Marco Slot	2f9c8c6af0	Allow DML commands with unreferenced SELECT CTEs	2018-05-03 14:53:26 +02:00
Marco Slot	f8cfe07fd1	Support intermediate results in distributed INSERT..SELECT	2018-05-03 14:42:28 +02:00
Marco Slot	90cdfff602	Implement recursive planning for DML statements	2018-05-03 14:42:28 +02:00
Murat Tuncer	42a8082721	PG11 compatibility refresh adds a shim for a changed function api	2018-05-03 13:21:15 -06:00
mehmet furkan şahin	ef90122cd3	shard count for some of the tests are increased	2018-05-03 10:44:43 +03:00
Onder Kalaci	317dd02a2f	Implement single repartitioning on hash distributed tables * Change worker_hash_partition_table() such that the divergence between Citus planner's hashing and worker_hash_partition_table() becomes the same. * Rename single partitioning to single range partitioning. * Add single hash repartitioning. Basically, logical planner treats single hash and range partitioning almost equally. Physical planner, on the other hand, treats single hash and dual hash repartitioning almost equally (except for JoinPruning). * Add a new GUC to enable this feature	2018-05-02 18:50:55 +03:00
velioglu	32bcd610c1	Support modify queries with multiple tables With this commit we begin to support modify queries with multiple tables if these queries are pushdownable.	2018-05-02 16:22:26 +03:00
velioglu	d9fa69c031	Refactor query pushdown related logic	2018-05-02 15:03:09 +03:00
Brian Cloutier	f8fb7a27fb	Don't copyObject into the wrong memory context utilityStmt sometimes (such as when it's inside of a plpgsql function) comes from a cached plan, which is kept in a child of the CacheMemoryContext. When we naively call copyObject we're copying it into a statement-local context, which corrupts the cached plan when it's thrown away.	2018-05-01 15:34:32 -07:00
Marco Slot	2559b84049	Drop shards as current user instead of super user	2018-05-01 09:57:20 +02:00
velioglu	121ff39b26	Removes large_table_shard_count GUC	2018-04-29 10:34:50 +02:00
Onder Kalaci	832c91e28c	Move processing each part of the query into its own functions This commit doesn't change any of the logic at all. Instead, the goal is to: * Get rid of any code duplication * Incremental changes to the optimizer made it slightly hard to follow the code, improve that and make it easier to implement new features * Simplify the code by moving each part of query processing (e.g., DISTINCT, LIMIT etc) into its own function * Make the interaction between each part of the query more obvious (e.g., How DISTINCT affects LIMIT etc)	2018-04-27 17:32:38 +03:00
mehmet furkan şahin	f2555317b6	ProcessVacuumStmt update on names	2018-04-27 14:37:01 +03:00
mehmet furkan şahin	a4153c6ab1	notice handler is implemented	2018-04-27 14:37:01 +03:00
Marco Slot	304b3a41ba	Cache the partition column Var	2018-04-26 14:58:16 -06:00
Marco Slot	3d3c19a717	Improve messages for essential connection failures	2018-04-26 12:58:47 -06:00
Marco Slot	88f64d22db	Prevent connection pointer is NULL details	2018-04-26 12:49:57 -06:00
Marco Slot	394732b6be	Add a connection failure error code	2018-04-26 12:49:57 -06:00
Önder Kalacı	ebb8f902c8	Relax assertion on transaction rollback failure (#2052 ) In case a failure happens when a transaction is rollbacked, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-04-26 13:39:03 -04:00
Hadi Moshayedi	24659a97dc	Fail task in real-time executor if no placements found. (#2133 )	2018-04-26 12:05:24 -04:00
Murat Tuncer	a6fe5ca183	PG11 compatibility update - changes in ruleutils_11.c is reflected - vacuum statement api change is handled. We now allow multi-table vacuum commands. - some other function header changes are reflected - api conflicts between PG11 and earlier versions are handled by adding shims in version_compat.h - various regression tests are fixed due output and functionality in PG1 - no change is made to support new features in PG11 they need to be handled by new commit	2018-04-26 11:29:43 +03:00
Brian Cloutier	49255213d4	Configure appveyor to run regression tests - Add install.pl to instal .sql files on Windows - Remove a hack to PGDLLIMPORT some variables - Add citus_version.o to the Makefile - Fix pg_regress_multi's PATH generation on Windows - Output regression.diffs when the tests fail - Fix permissions in data directory, make sure postgres can play with it	2018-04-25 18:02:07 -07:00
Onder Kalaci	ac8f2f1e6d	Eliminate code duplication in WorkerExtendedOpNode() Before this commit, we had code duplication in the WorkerExtendedOpNode(). The duplication was noticeable and any change is prone to bugs. The PR consists of 4 commits. Each commit incrementally fixes the problem by moving certain parts of the duplicated code into smaller, better-documented functions.	2018-04-25 08:54:59 +03:00
Brian Cloutier	8d4c4d5c58	Close all files before trying to remove them	2018-04-24 14:35:20 -07:00
Brian Cloutier	c5f1235090	Turn the crashes on Windows into WARNINGs	2018-04-24 14:35:20 -07:00
Onder Kalaci	ee748d9140	Unify extendedOpNode Processing Before this commit, we had a divergence among the creation of master/worker extended op nodes. This commit moves the related parts into a single place and allows the creation of master/extended op nodes to share a common data structure.	2018-04-24 11:56:38 +03:00
Hadi Moshayedi	966f01fad3	Fix write and copy functions for TaskExecution. (#2120 ) We were missing criticalErrorOccurred from CopyNodeTaskExecution() and OutTaskExecution(). This PR fixes it.	2018-04-23 09:07:52 -04:00
Onder Kalaci	814f0e3acc	Ensure Citus never try to access a not planned subquery PostgreSQL might remove some of the subqueries when they do not contribute to the query result at all. Citus should not try to access such subqueries during planning.	2018-04-20 13:52:00 +03:00
Brian Cloutier	b0b130f064	Fix Windows crash in multi_copy test Without this change we crash on Windows with COPYing into a table with 62 shards, and we ERROR when COPYing into a table with >62 shards: ERROR: WaitForMutipleObjects() failed: error code 87	2018-04-17 15:48:02 -07:00
Brian Cloutier	d02f761d8e	Change intermediate_results test to not crash	2018-04-17 15:14:02 -07:00
Brian Cloutier	0104790385	Fix hard-coded temp directory in multi_copy /tmp does not exist on Windows, use :temp_dir instead	2018-04-17 15:01:22 -07:00
Brian Cloutier	a59c1c634e	Fix cancellation of real time queries Without this change multi_real_time_transaction blocks forever (on Windows) in the block where it repeatedly calls pg_advisory_lock(15). This happens because the deadlock detector tries to cancel the backend but the backend never processes that signal.	2018-04-17 14:26:22 -07:00
mehmet furkan şahin	00e786af00	Capital named schema support is added	2018-04-17 17:17:42 +03:00
mehmet furkan şahin	e5a5502b16	Adds support for multiple ANDs in Having This PR adds support for multiple AND expressions in Having for pushdown planner. We simply make a call to make_ands_explicit from MultiLogicalPlanOptimize for the having qual in workerExtendedOpNode.	2018-04-16 14:14:48 +03:00
Brian Cloutier	42ddfa176d	Fix crash on Windows where there is no detail	2018-04-13 12:54:22 -07:00
velioglu	82b2d21b0c	Convert broadcast join to reference join After this commit large_table_shard_count wont be used to check whether broadcast join, which is renamed as reference join, can be applied. Reference join can only be applied over reference tables.	2018-04-13 12:58:14 +03:00
velioglu	1b92812be2	Add co-placement check to CoPartition function	2018-04-13 12:13:08 +03:00
Marco Slot	9318aeee6b	Allow multiple size function calls per query	2018-04-12 14:16:17 +02:00

... 2 3 4 5 6 ...

1444 Commits (004f28e18cbe4c7353f6a31785bf60b15c0d8de7)