citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	498ac80d8b	Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT This commit provides the support for window functions in subquery and insert into select queries. Note that our support for window functions is still limited because it must have a partition by clause on the distribution key. This commit makes changes in the files insert_select_planner and multi_logical_planner. The required tests are also added with files multi_subquery_window_functions.out and multi_insert_select_window.out.	2017-10-04 15:33:07 +03:00
Marco Slot	24915779d1	Remove separate citus.binary_worker_copy_format regression tests	2017-10-03 17:44:50 +02:00
Marco Slot	9e516513fc	Use local group ID when querying for prepared transactions	2017-10-03 16:36:53 +02:00
Hadi Moshayedi	11adb9b034	Push down LIMIT and HAVING when grouped by partition key. (#1641 ) We can do this because all rows belonging to a group are in the same shard when grouping by distribution column on a range/hash distributed table.	2017-10-02 20:17:51 -04:00
Marco Slot	bb50fc9cb5	Add multi-user re-partitioning regression tests	2017-09-28 15:27:26 +02:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We sent multiple commands to worker when starting a transaction. Previously we only checked the result of the first command that is transaction 'BEGIN' which always succeeds. Any failure on following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Jason Petersen	b4474fc0b0	Modify version-output tests for PostgreSQL 11 Basically we just care whether the running version is before or after PostgreSQL 10, so testing the major version against 9 and printing a boolean is sufficient.	2017-09-25 17:20:24 -07:00
velioglu	0a56ed910b	Change error message of queries with distributed and local table Citus can handle INSERT INTO ... SELECT queries if the query inserts into local table by reading data from distributed table. The opposite way is not correct. With this commit we warn the user if the latter option is used.	2017-09-22 13:46:19 -07:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Onder Kalaci	33ec33c5b3	Ensure schema exists on reference table creation If the schema doesn't exists on the workers, create it.	2017-09-18 23:50:47 +03:00
Onder Kalaci	6116c8e93d	Allow pushing down GROUP BYs when at least there is one distribution column in the target list	2017-09-15 19:15:06 +03:00
Onder Kalaci	a5b66912d4	Expand reference table support in subquery pushdown With this commit, we relax the restrictions put on the reference tables with subquery pushdown. We did three notable improvements: 1) Relax equi-join restrictions Previously, we always expected that the non-reference tables are equi joined with reference tables on the partition key of the non-reference table. With this commit, we allow any column of non-reference tables joined using non-equi joins as well. 2) Relax OUTER JOIN restrictions Previously Citus errored out if any reference table exists at any point of the outer part of an outer join. For instance, See the below sketch where (h) denotes a hash distributed relation, (r) denotes a reference table, (L) denotes LEFT JOIN and (I) denotes INNER JOIN. (L) / \ (I) h / \ r h Before this commit Citus would error out since a reference table appears on the left most part of an left join. However, that was too restrictive so that we only error out if the reference table is directly below and in the outer part of an outer join. 3) Bug fixes We've done some minor bugfixes in the existing implementation.	2017-09-14 20:59:22 +03:00
Marco Slot	27da0a29d7	Add volatile function in prepared statement regression test	2017-09-12 13:09:31 -07:00
Burak Yucesoy	273b034720	Bump Citus version	2017-08-28 17:56:39 +03:00
Marco Slot	1920390688	Multi-row INSERTs no longer throw errors in isolation tests	2017-08-25 10:55:56 +02:00
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Burak Yucesoy	5be6eb9ef6	Increase coverage of isolation tests - Part 2 With this PR we add isolation tests for COPY to reference table vs. other operations COPY to partitioned table vs. other operations Multi row INSERTs vs other operations INSERT/SELECT vs. other operations UPSERT vs. other operations DELETE vs. other operations TRUNCATE vs. other operations DROP vs. other operations DDL vs. other operations other operations consist of basic SQL operations (like SELECT, INSERT, DELETE, UPSERT, COPY TRUNCATE, CREATE INDEX) as well as some Citus functionalities (like master_modify_multiple_shards, master_apply_delete_command, citus_total_relation_size etc.)	2017-08-23 18:23:36 +03:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Burak Yucesoy	ae32d786cf	Add new isolation tests	2017-08-17 17:46:03 +03:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539 ) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493 .	2017-08-15 13:02:28 -04:00
Marco Slot	3ff46245b3	Make sure we don't use 2PC in copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Burak Yucesoy	45b273321f	Add tests for locking operations on partitioned tables	2017-08-14 14:55:45 +03:00
Onder Kalaci	4f668ad38b	Make the test outputs consistent by using VACUUM ANALYZE on the tables.	2017-08-12 13:29:25 +03:00
Onder Kalaci	0ba2f9e4e4	Add regression tests for distributed deadlock detection	2017-08-12 13:29:25 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag Now that we already have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned with distributed transaction ids introduced with `3369f3486f`. The dependency among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth first seach) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path that the cycle exists and cancel the youngest distributed transaction. Note that, we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	59e626d158	Add regression tests for follower clusters	2017-08-12 12:05:56 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
velioglu	b0efffae1c	Correct planner and add more tests	2017-08-11 10:16:13 +03:00
velioglu	ceba81ce35	Move physical planner checks to logical planner	2017-08-11 10:09:47 +03:00
velioglu	0359d03530	Add set operation check for reference tables	2017-08-11 10:09:47 +03:00
velioglu	c4e3b8b5e1	Add planner changes and tests for subquery on reference tables	2017-08-11 10:09:47 +03:00
velioglu	45717dd013	Check equivalence on reference tables for subquery pushdown	2017-08-11 10:09:47 +03:00
Marco Slot	0ae265c436	Add citus_create_restore_point for distributed snapshots	2017-08-11 07:36:20 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	c854d51cd8	make multi_reference_table test more stable	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Brian Cloutier	0dee4f8418	Metadata sync syncs all nodes, not just primaries	2017-08-10 16:55:55 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belong to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	a578506718	Add multi-row isolation tests	2017-08-10 01:10:09 -07:00
Jason Petersen	addde54464	Add some tests	2017-08-10 00:32:46 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively go through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing trans- actions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single- row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master- evaluated functions.	2017-08-10 00:32:46 -07:00
Andres Freund	e8b793c454	Support for IN (const, list) and = ANY(const, b, c) pruning.	2017-08-10 08:56:36 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	08ed6d8269	Prevent pg_dist_node changes during master_create_empty_shard	2017-08-09 14:22:09 +02:00
Murat Tuncer	5cb9466255	Rebase node metadata isolation tests	2017-08-09 14:22:09 +02:00
Marco Slot	ad0fdf57ca	Add add/remove node rollback isolation tests	2017-08-09 14:09:54 +02:00
Marco Slot	c2f8bafa05	Fix shard creation vs. pg_dist_node change locking	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yucesoy	ab5f97861b	Add regression tests for distributed partitioned tables	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are; - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propogating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	d3e9746236	Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 18:32:34 +02:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	bf197e9f0c	Add test for super-long cluster names	2017-08-08 11:18:31 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Brian Cloutier	74ce4faab5	Make multi_cluster_management test more stable	2017-08-08 11:18:31 +03:00
Brian Cloutier	e7846ba7d1	Allow metadata sync functions on secondaries {start,stop}_metadata_sync_to_node now toggle the hasMetadata flag when run on secondaries but don't attempt to actually sync any metadata.	2017-08-07 18:46:51 +03:00
Marco Slot	4cc7c36596	Simplify metadata lock acquisition for DML	2017-08-07 15:36:58 +02:00
Marco Slot	aa7ca81548	Execute UPDATE/DELETE statements with 0 shards	2017-08-07 15:36:58 +02:00
Marco Slot	bac60bb64f	Function evaluation descends into expression trees	2017-08-06 19:53:05 +02:00
Brian Cloutier	37985de85e	master_disable_node no longer crashes when given a non-existant node	2017-08-04 11:14:54 +03:00
Hadi Moshayedi	8229a64fe8	Remove distributed tables' dependency on distribution key columns. (#1527 ) This change removes distributed tables' dependency on distribution key columns. We already check that we cannot drop distribution key columns in ErrorIfUnsupportedAlterTableStmt() at multi_utility.c, so we don't need to have distributed table to distribution key column dependency to avoid dropping of distribution key column. Furthermore, having this dependency causes some warnings in pg_dump --schema-only (See #866), which are not desirable. This change also adds check to disallow drop of distribution keys when citus.enable_ddl_propagation is set to false. Regression tests are updated accordingly.	2017-08-03 10:07:04 -04:00
Burak Yucesoy	37b200a52e	Fix broken isolation tests We try to run our isolation tests paralles as much as possible. In some of those isolation tests we used same table name which causes problem while running them in paralles. This commit changes table names in those tests to ensure tests can run in parallel.	2017-07-31 11:11:49 +03:00
Burak Yucesoy	7769f1d012	Refactor distributed table creation logic This commit is preperation for introducing distributed partitioned table support. We want to clean and refactor some code in distributed table creation logic so that we can handle partitioned tables in more robust way.	2017-07-31 11:11:23 +03:00
Murat Tuncer	520d74b96d	Add a regression test for citus.max_task_string_size (#1524 )	2017-07-28 10:49:09 -07:00
Brian Cloutier	7d8bcb6a88	These tests sometimes deadlock on travis	2017-07-28 16:02:43 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Onder Kalaci	6132d17481	Convert global wait edges to adjacency list In this commit, we add ability to convert global wait edges into adjacency list with the following format: [transactionId] = [transactionNode->waitsFor {list of waiting transaction nodes}]	2017-07-27 19:53:51 +03:00
Brian Cloutier	32e16ffe02	Give isolation tester ability to see locks on workers	2017-07-26 18:43:04 +03:00
Eren Başak	a12f1980de	Add Progress Tracking Infrastructure This change adds a general purpose infrastructure to log and monitor process about long running progresses. It uses `pg_stat_get_progress_info` infrastructure, introduced with PostgreSQL 9.6 and used for tracking `VACUUM` commands. This patch only handles the creation of a memory space in dynamic shared memory, putting its info in `pg_stat_get_progress_info`, fetching the progress monitors on demand and finalizing the progress tracking.	2017-07-26 14:12:15 +03:00
Marco Slot	80ea233ec1	Add function for dumping global wait edges	2017-07-25 16:52:32 +02:00
Marco Slot	81198a1d02	Add function for dumping local wait edges	2017-07-25 16:52:32 +02:00
Marco Slot	5923334114	Add transaction recovery regression tests	2017-07-24 20:44:38 +02:00
Brian Cloutier	88702ca58a	node_metadata takes out more sane locks - Never release locks - AddNodeMetadata takes ShareRowExclusiveLock so it'll conflict with the trigger which prevents multiple primary nodes. - ActivateNode and SetNodeState used to take AccessShareLock, but they modify the table so they should take RowExclusiveLock. - DeleteNodeRow and InsertNodeRow used to take AccessExclusiveLock but only need RowExclusiveLock.	2017-07-24 11:57:46 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	7f1343103e	Fix PG 10 build, UNBOUNDED partitions now have different syntax Update code and tests to match the changes made in pg's d363d42	2017-07-21 14:30:11 +03:00
Brian Cloutier	74dd5bb281	Fix crash when removing an inactive node	2017-07-20 18:55:40 +03:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
Marco Slot	fd72cca6c8	Use predictable placement IDs in regression test output	2017-07-17 13:44:29 +03:00
Onder Kalaci	ce8edd88f7	Apply regression test changes that are due to PostgreSQL 10 changes that have recently changed	2017-07-14 13:22:12 +03:00
Brian Cloutier	72d8d2429b	Add a test for upgrading shard placements	2017-07-12 14:18:27 +02:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Brian Cloutier	fe53fd4a8e	Remove functions created just for unit testing These functions are holdovers from pg_shard and were created for unit testing c-level functions (like InsertShardPlacementRow) which our regression tests already test quite effectively. Removing because it makes refactoring the signatures of those c-level functions unnecessarily difficult. - create_healthy_local_shard_placement_row - update_shard_placement_row_state - delete_shard_placement_row	2017-07-12 14:16:24 +02:00
Brian Cloutier	385d9cbbb7	Ignore generated multi_behavioral_analytics_create_table test files	2017-07-12 14:16:24 +02:00
Marco Slot	bf8377082c	Use consistent placement IDs in mulity_modyfing_xactstest	2017-07-12 14:16:23 +02:00

1 2 3 4 5 ...

547 Commits (7b8f13cf353a98bd35b40b01e1a3c147739fbdc5)