citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	7f68f78ee9	Omit public schema from shard_name output	2017-10-31 00:22:07 +01:00
Murat Tuncer	e16805215d	Support count(distinct) for non-partition columns (#1692 ) Expands count distinct coverage by allowing more cases. We used to support count distinct only if we can push down distinct aggregate to worker query i.e. the count distinct clause was on the partition column of the table, or there was a grouping on the partition column. Now we can support - non-partition columns, with or without grouping on partition column - partition, and non partition column in the same query - having clause - single table subqueries - insert into select queries - join queries where count distinct is on partition, or non-partition column - filters on count distinct clauses (extends existing support) We first try to push down aggregate to worker query (original case), if we can't then we modify worker query to return distinct columns to coordinator node. We do that by adding distinct column targets to group by clauses. Then we perform count distinct operation on the coordinator node. This work should reduce the cases where HLL is used as it can address anything that HLL can. However, if we start having performance issues due to very large number rows, then we can recommend hll use.	2017-10-30 13:12:24 +02:00
Marco Slot	be46661bf7	Block only 2PCs instead of all writes in citus_create_restore_point	2017-10-27 00:07:32 +02:00
mehmet furkan şahin	83ac84d594	order by and unnest are added to multi_colocation_utils tests	2017-10-26 13:44:28 +03:00
mehmet furkan şahin	61ae33dc7f	ALTER TABLE .. REPLICA IDENTITY support is implemented	2017-10-26 13:44:28 +03:00
velioglu	0b5db5d826	Support multi shard update/delete queries	2017-10-25 15:52:38 +03:00
Marco Slot	4bde83e1d2	Relay error message if DML fails on worker	2017-10-25 14:23:21 +02:00
Brian Cloutier	91ff8cd2d5	{*,}create_distributed_table doesn't emit OID (#1710 )	2017-10-16 18:08:51 -06:00
Brian Cloutier	ebcb2b65e9	Add master_move_node function	2017-10-16 10:51:28 -07:00
Brian Cloutier	58cf15ceca	DistributedTableSize doesn't emit oid when erring out	2017-10-14 02:42:57 +03:00
Murat Tuncer	f7ab901766	Add select distinct, and distinct on support Distinct, and distinct on() clauses are supported in simple selects, joins, subqueries, and insert into select queries.	2017-10-13 14:59:48 +03:00
Hadi Moshayedi	a1387f4aa8	Basic usage statistics collection. (#1656 ) Adds ```citus.enable_statistics_collection``` GUC variable, which ```true``` by default, unless built without libcurl. If statistics collection is enabled, sends basic usage data to Citus servers every 24 hours. The data that is collected consists of: - Citus version - OS name & release - Hardware Id - Number of tables, rounded to next power of 2 - Size of data, rounded to next power of 2 - Number of workers	2017-10-11 09:55:15 -04:00
Onder Kalaci	498ac80d8b	Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT This commit provides the support for window functions in subquery and insert into select queries. Note that our support for window functions is still limited because it must have a partition by clause on the distribution key. This commit makes changes in the files insert_select_planner and multi_logical_planner. The required tests are also added with files multi_subquery_window_functions.out and multi_insert_select_window.out.	2017-10-04 15:33:07 +03:00
Marco Slot	24915779d1	Remove separate citus.binary_worker_copy_format regression tests	2017-10-03 17:44:50 +02:00
Marco Slot	9e516513fc	Use local group ID when querying for prepared transactions	2017-10-03 16:36:53 +02:00
Hadi Moshayedi	11adb9b034	Push down LIMIT and HAVING when grouped by partition key. (#1641 ) We can do this because all rows belonging to a group are in the same shard when grouping by distribution column on a range/hash distributed table.	2017-10-02 20:17:51 -04:00
Marco Slot	bb50fc9cb5	Add multi-user re-partitioning regression tests	2017-09-28 15:27:26 +02:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We sent multiple commands to worker when starting a transaction. Previously we only checked the result of the first command that is transaction 'BEGIN' which always succeeds. Any failure on following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Jason Petersen	b4474fc0b0	Modify version-output tests for PostgreSQL 11 Basically we just care whether the running version is before or after PostgreSQL 10, so testing the major version against 9 and printing a boolean is sufficient.	2017-09-25 17:20:24 -07:00
velioglu	0a56ed910b	Change error message of queries with distributed and local table Citus can handle INSERT INTO ... SELECT queries if the query inserts into local table by reading data from distributed table. The opposite way is not correct. With this commit we warn the user if the latter option is used.	2017-09-22 13:46:19 -07:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Onder Kalaci	33ec33c5b3	Ensure schema exists on reference table creation If the schema doesn't exists on the workers, create it.	2017-09-18 23:50:47 +03:00
Onder Kalaci	6116c8e93d	Allow pushing down GROUP BYs when at least there is one distribution column in the target list	2017-09-15 19:15:06 +03:00
Onder Kalaci	a5b66912d4	Expand reference table support in subquery pushdown With this commit, we relax the restrictions put on the reference tables with subquery pushdown. We did three notable improvements: 1) Relax equi-join restrictions Previously, we always expected that the non-reference tables are equi joined with reference tables on the partition key of the non-reference table. With this commit, we allow any column of non-reference tables joined using non-equi joins as well. 2) Relax OUTER JOIN restrictions Previously Citus errored out if any reference table exists at any point of the outer part of an outer join. For instance, See the below sketch where (h) denotes a hash distributed relation, (r) denotes a reference table, (L) denotes LEFT JOIN and (I) denotes INNER JOIN. (L) / \ (I) h / \ r h Before this commit Citus would error out since a reference table appears on the left most part of an left join. However, that was too restrictive so that we only error out if the reference table is directly below and in the outer part of an outer join. 3) Bug fixes We've done some minor bugfixes in the existing implementation.	2017-09-14 20:59:22 +03:00
Marco Slot	27da0a29d7	Add volatile function in prepared statement regression test	2017-09-12 13:09:31 -07:00
Burak Yucesoy	273b034720	Bump Citus version	2017-08-28 17:56:39 +03:00
Marco Slot	1920390688	Multi-row INSERTs no longer throw errors in isolation tests	2017-08-25 10:55:56 +02:00
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Burak Yucesoy	5be6eb9ef6	Increase coverage of isolation tests - Part 2 With this PR we add isolation tests for COPY to reference table vs. other operations COPY to partitioned table vs. other operations Multi row INSERTs vs other operations INSERT/SELECT vs. other operations UPSERT vs. other operations DELETE vs. other operations TRUNCATE vs. other operations DROP vs. other operations DDL vs. other operations other operations consist of basic SQL operations (like SELECT, INSERT, DELETE, UPSERT, COPY TRUNCATE, CREATE INDEX) as well as some Citus functionalities (like master_modify_multiple_shards, master_apply_delete_command, citus_total_relation_size etc.)	2017-08-23 18:23:36 +03:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Burak Yucesoy	ae32d786cf	Add new isolation tests	2017-08-17 17:46:03 +03:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539 ) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493 .	2017-08-15 13:02:28 -04:00
Marco Slot	3ff46245b3	Make sure we don't use 2PC in copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Burak Yucesoy	45b273321f	Add tests for locking operations on partitioned tables	2017-08-14 14:55:45 +03:00
Onder Kalaci	4f668ad38b	Make the test outputs consistent by using VACUUM ANALYZE on the tables.	2017-08-12 13:29:25 +03:00
Onder Kalaci	0ba2f9e4e4	Add regression tests for distributed deadlock detection	2017-08-12 13:29:25 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag Now that we already have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned with distributed transaction ids introduced with `3369f3486f`. The dependency among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth first seach) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path that the cycle exists and cancel the youngest distributed transaction. Note that, we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	59e626d158	Add regression tests for follower clusters	2017-08-12 12:05:56 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00

1 2 3 4 5 ...

509 Commits (7f68f78ee93864a96bb2428a12b41f81bb3bda27)