citus

Author	SHA1	Message	Date
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Marco Slot	dbf18df995	Don't error out if BuildGlobalWaitGraph fails to connect	2017-08-23 19:08:03 +02:00
Burak Yucesoy	5be6eb9ef6	Increase coverage of isolation tests - Part 2 With this PR we add isolation tests for: COPY to reference table vs. other operations; COPY to partitioned table vs. other operations; multi-row INSERTs vs. other operations; INSERT/SELECT vs. other operations; UPSERT vs. other operations; DELETE vs. other operations; TRUNCATE vs. other operations; DROP vs. other operations; DDL vs. other operations. Other operations consist of basic SQL operations (like SELECT, INSERT, DELETE, UPSERT, COPY, TRUNCATE, CREATE INDEX) as well as some Citus functionalities (like master_modify_multiple_shards, master_apply_delete_command, citus_total_relation_size etc.)	2017-08-23 18:23:36 +03:00
Onder Kalaci	c7bb29b69e	Prevent maintenance daemon crashes due to dead processes If, after the distributed deadlock detection decides to cancel a backend, the backend has been terminated/killed/cancelled externally, we might access a NULL pointer. This commit prevents that case by ignoring the current distributed deadlock.	2017-08-23 15:44:09 +03:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Marco Slot	bd6bf29983	Don't add procs multiple times in BuildWaitGraphForSourceNode	2017-08-21 16:48:30 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Önder Kalacı	b82f886ad3	Merge branch 'master' into improve_deadlock_detection	2017-08-18 13:07:18 +03:00
Marco Slot	7523753a73	Clear metadata OID cache prior to deadlock detection	2017-08-18 11:20:24 +02:00
Andres Freund	b936bde936	Take AccessShareLock on the extension prior to running deadlock detection	2017-08-18 11:20:24 +02:00
Onder Kalaci	20679c9e8b	Relax assertion on deadlock detection considering self deadlocks.	2017-08-18 11:16:38 +03:00
Onder Kalaci	550a5578d8	Skip deadlock detection on the workers Do not run distributed deadlock detection on the worker nodes to prevent erroneous decisions to kill the deadlocks.	2017-08-17 19:43:38 +03:00
Burak Yucesoy	ae32d786cf	Add new isolation tests	2017-08-17 17:46:03 +03:00
Marco Slot	1eca53ad40	Exit maintenanced on database crash	2017-08-16 18:29:44 +02:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493.	2017-08-15 13:02:28 -04:00
Onder Kalaci	205501532a	Add version check to the maintenance daemon We should prevent running the deadlock detection if there is a major version change. Otherwise, the daemon may access obsolete metadata catalog tables.	2017-08-15 18:47:13 +03:00
Marco Slot	3ff46245b3	Make sure we don't use 2PC in copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Marco Slot	9232823070	Abort on failure on master connection during copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	df7723cde5	Should not commit on aborted non-critical connections	2017-08-15 13:44:20 +02:00
Eren Başak	77626c4238	Fix NULL nodeClusterString crash on pg_worker_list.conf migrations	2017-08-14 18:13:53 +03:00
Eren Başak	b3d2f9ba71	Fix pg_worker_list use-after-free bug This change fixes a use-after-free bug while renaming obsolete `pg_worker_list.conf` file, which causes Citus to crash during upgrade (or even extension creation) if `pg_worker_list.conf` exists.	2017-08-14 18:13:53 +03:00
Burak Yucesoy	45b273321f	Add tests for locking operations on partitioned tables	2017-08-14 14:55:45 +03:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operating on parent table	2017-08-14 14:44:30 +03:00
Burak Yucesoy	52b9e35d50	Add relationIdList field to the Job struct	2017-08-14 14:06:22 +03:00
Onder Kalaci	4f668ad38b	Make the test outputs consistent by using VACUUM ANALYZE on the tables.	2017-08-12 13:29:25 +03:00
Onder Kalaci	0ba2f9e4e4	Add regression tests for distributed deadlock detection	2017-08-12 13:29:25 +03:00
Onder Kalaci	5b48de7430	Improve deadlock detection for MX We added a new field to the transaction id that is set to true only for the transactions initialized on the coordinator. This is only useful for MX in order to distinguish the transaction that started the distributed transaction on the coordinator where we could have the same transactions' worker queries on the same node.	2017-08-12 13:28:37 +03:00
Onder Kalaci	59133415b0	Add logging infrastructure for distributed deadlock detection We added a new GUC citus.log_distributed_deadlock_detection which is off by default. When set to on, we log some debug messages related to the distributed deadlock to the server logs.	2017-08-12 13:28:37 +03:00
Onder Kalaci	e5d5bdff51	Enable distributed deadlock detection on the maintenance daemon With this commit, the maintenance daemon starts to check for distributed deadlocks. We also introduced a GUC variable (distributed_deadlock_detection_factor) whose value is multiplied with Postgres' deadlock_timeout. Setting it to -1 disables the distributed deadlock detection.	2017-08-12 13:28:37 +03:00
Onder Kalaci	66936053a0	Improve error messages when a backend is cancelled by deadlock detection We send SIGINT to a backend that is cancelled due to a deadlock. That approach ends up producing a very confusing error message. With this commit we intercept the error messages and show a more meaningful error message to the user.	2017-08-12 13:28:37 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag We now have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention, which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned a distributed transaction id, introduced with `3369f3486f`. The dependencies among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth-first search) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path on which the cycle exists and cancel the youngest distributed transaction. Note that we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	59e626d158	Add regression tests for follower clusters	2017-08-12 12:05:56 +02:00
Marco Slot	55992d4bc0	Disallow task-tracker queries on follower clusters	2017-08-12 11:47:31 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00
Marco Slot	53584affa8	Fix locking in create_distributed_table	2017-08-11 11:34:33 +03:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
velioglu	b0efffae1c	Correct planner and add more tests	2017-08-11 10:16:13 +03:00
velioglu	7550b8ad52	Fix anchor shard id selection when reference table exists	2017-08-11 10:09:47 +03:00
velioglu	ceba81ce35	Move physical planner checks to logical planner	2017-08-11 10:09:47 +03:00
velioglu	0359d03530	Add set operation check for reference tables	2017-08-11 10:09:47 +03:00
velioglu	c4e3b8b5e1	Add planner changes and tests for subquery on reference tables	2017-08-11 10:09:47 +03:00
velioglu	45717dd013	Check equivalence on reference tables for subquery pushdown	2017-08-11 10:09:47 +03:00
Marco Slot	0ae265c436	Add citus_create_restore_point for distributed snapshots	2017-08-11 07:36:20 +02:00
Marco Slot	fdff210ef7	Wait for commit/abort/prepare results asynchronously	2017-08-11 00:03:06 +02:00
Marco Slot	fca986f214	Add API for waiting for multiple connections	2017-08-11 00:03:06 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	c854d51cd8	make multi_reference_table test more stable	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Brian Cloutier	0dee4f8418	Metadata sync syncs all nodes, not just primaries	2017-08-10 16:55:55 +03:00
Eren Başak	deb89cb9ce	Delete test_helper_functions.h	2017-08-10 14:00:44 +03:00
Eren Başak	f9470329e5	Remove test_helper_functions.h inclusions	2017-08-10 12:42:46 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belonging to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	dee66e3959	Final review feedback	2017-08-10 01:10:09 -07:00
Jason Petersen	a578506718	Add multi-row isolation tests	2017-08-10 01:10:09 -07:00
Jason Petersen	addde54464	Add some tests	2017-08-10 00:32:46 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively goes through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing transactions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single-row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master-evaluated functions.	2017-08-10 00:32:46 -07:00
velioglu	7e436c0277	Add bool expression to pruning instance with a function	2017-08-10 08:56:36 +03:00
Andres Freund	e8b793c454	Support for IN (const, list) and = ANY(const, b, c) pruning.	2017-08-10 08:56:36 +03:00
Onder Kalaci	b5ea3ab6a3	Improve locking semantics for backend management We use the backend shared memory lock to prevent new backends from becoming part of a new distributed transaction, or an existing backend from leaving a distributed transaction, while we're reading all backends' data. The primary goal is to provide a consistent view of the current distributed transactions while doing the deadlock detection.	2017-08-09 17:17:12 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	08ed6d8269	Prevent pg_dist_node changes during master_create_empty_shard	2017-08-09 14:22:09 +02:00
Murat Tuncer	5cb9466255	Rebase node metadata isolation tests	2017-08-09 14:22:09 +02:00
Marco Slot	3a0571e69b	Remove LockMetadataSnapshot	2017-08-09 14:09:54 +02:00
Marco Slot	ad0fdf57ca	Add add/remove node rollback isolation tests	2017-08-09 14:09:54 +02:00
Marco Slot	c2f8bafa05	Fix shard creation vs. pg_dist_node change locking	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yucesoy	ab5f97861b	Add regression tests for distributed partitioned tables	2017-08-09 10:01:35 +03:00
Burak Yucesoy	8455d1a4ef	Ensure we are allowing partitioned tables at all appropriate places	2017-08-09 10:01:35 +03:00
Burak Yucesoy	2eee556738	Add distributed partitioned table support for COPY For partitioned tables, PostgreSQL opens partition and its partitions in BeginCopyFrom and it expects its caller to close those relations. However, we do not have quick access to opened relations and performing special operations for partitioned tables isn't necessary on the coordinator node. Therefore, before calling BeginCopyFrom, we change relkind of those partitioned tables to RELKIND_RELATION. This prevents PostgreSQL from opening its partitions as well.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	31f3221342	Add distributed partitioned table support to router plannable queries In standard_planner, PostgreSQL expands partitioned tables to their partitions and calls our restriction hook for each partition. It also, for some queries, skips the partitioned table itself completely. This behaviour makes it difficult to prune shards and decide whether a query is router plannable or not. To prevent this behaviour, we change the inh flag of partitioned tables to false in the query tree. In this case, PostgreSQL treats those partitioned tables as regular relations and does not expand them. This behaviour is in line with our expectations, because we do not want to treat partitioned tables differently on the coordinator. Although we are not entirely comfortable with modifying the query tree, other solutions to this problem are overly complicated.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support to distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are: - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propagating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	d3e9746236	Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 18:32:34 +02:00
Brian Cloutier	7060ade6fe	GetNodeTuple returns NULL if node does not exist It never throws an error.	2017-08-08 13:12:06 +03:00
Brian Cloutier	a3e9bef685	All users of WorkerNodeHash take an AccessShareLock The metadata cache simulates a SELECT on pg_dist_node. Now the locks it takes also simulate that SELECT.	2017-08-08 13:12:06 +03:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	3151b52a0b	Add citus.cluster_name GUC - Nodes with a nodecluster which does not match citus.cluster_name are excluded from the metadata cache and never seen by another part of Citus.	2017-08-08 13:12:06 +03:00
Brian Cloutier	94947c0d54	Refactor: ReplicateShardToAllWorkers more explicitly locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	f87fefa323	Refactor: DistributedTableSize more explicitly only locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	3769381366	Fix inaccurate comment on SetNodeState	2017-08-08 13:12:06 +03:00
Brian Cloutier	bf197e9f0c	Add test for super-long cluster names	2017-08-08 11:18:31 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Brian Cloutier	74ce4faab5	Make multi_cluster_management test more stable	2017-08-08 11:18:31 +03:00
Brian Cloutier	e7846ba7d1	Allow metadata sync functions on secondaries {start,stop}_metadata_sync_to_node now toggle the hasMetadata flag when run on secondaries but don't attempt to actually sync any metadata.	2017-08-07 18:46:51 +03:00
Marco Slot	4cc7c36596	Simplify metadata lock acquisition for DML	2017-08-07 15:36:58 +02:00
Marco Slot	aa7ca81548	Execute UPDATE/DELETE statements with 0 shards	2017-08-07 15:36:58 +02:00
Marco Slot	bac60bb64f	Function evaluation descends into expression trees	2017-08-06 19:53:05 +02:00
Brian Cloutier	37985de85e	master_disable_node no longer crashes when given a non-existent node	2017-08-04 11:14:54 +03:00
Hadi Moshayedi	8229a64fe8	Remove distributed tables' dependency on distribution key columns. (#1527) This change removes distributed tables' dependency on distribution key columns. We already check that we cannot drop distribution key columns in ErrorIfUnsupportedAlterTableStmt() at multi_utility.c, so we don't need to have distributed table to distribution key column dependency to avoid dropping of distribution key column. Furthermore, having this dependency causes some warnings in pg_dump --schema-only (See #866), which are not desirable. This change also adds a check to disallow dropping distribution keys when citus.enable_ddl_propagation is set to false. Regression tests are updated accordingly.	2017-08-03 10:07:04 -04:00
Murat Tuncer	fa18899cf9	Remove serialization/deserialization of multiplan node (#1477) Introduces copy functions for Citus MultiPlan nodes. Uses the ExtensibleNode mechanism to store MultiPlan data. Drops serialization of MultiPlans.	2017-08-02 08:24:00 +03:00
Burak Yucesoy	37b200a52e	Fix broken isolation tests We try to run our isolation tests in parallel as much as possible. In some of those isolation tests we used the same table name, which causes problems while running them in parallel. This commit changes table names in those tests to ensure tests can run in parallel.	2017-07-31 11:11:49 +03:00
Burak Yucesoy	7769f1d012	Refactor distributed table creation logic This commit is preparation for introducing distributed partitioned table support. We want to clean and refactor some code in distributed table creation logic so that we can handle partitioned tables in a more robust way.	2017-07-31 11:11:23 +03:00
Murat Tuncer	520d74b96d	Add a regression test for citus.max_task_string_size (#1524)	2017-07-28 10:49:09 -07:00
Brian Cloutier	7d8bcb6a88	These tests sometimes deadlock on travis	2017-07-28 16:02:43 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Murat Tuncer	26f020dc6e	Make maxTaskStringSize configurable (#1501) maxTaskStringSize determines the size of the worker query string. It was originally hard coded to a specific value. This has caused issues for some users. Since it determines initial shared memory allocation, we did not want to set it to an arbitrary higher number. Instead, we made it configurable. This commit introduces a new GUC variable, max_task_string_size. Changes to this variable require a restart to take effect.	2017-07-27 11:39:12 -07:00
Onder Kalaci	6132d17481	Convert global wait edges to adjacency list In this commit, we add ability to convert global wait edges into adjacency list with the following format: [transactionId] = [transactionNode->waitsFor {list of waiting transaction nodes}]	2017-07-27 19:53:51 +03:00
Murat Tuncer	8729b7d55a	Use cstore_table_size function to determine cstore table size (#1521) pg_table_size/pg_relation_size variants always return 0 for cstore tables. We should be using cstore_table_size function for cstore_tables.	2017-07-27 09:02:07 -07:00
Brian Cloutier	32e16ffe02	Give isolation tester ability to see locks on workers	2017-07-26 18:43:04 +03:00
Eren Başak	a12f1980de	Add Progress Tracking Infrastructure This change adds a general purpose infrastructure to log and monitor the progress of long-running processes. It uses `pg_stat_get_progress_info` infrastructure, introduced with PostgreSQL 9.6 and used for tracking `VACUUM` commands. This patch only handles the creation of a memory space in dynamic shared memory, putting its info in `pg_stat_get_progress_info`, fetching the progress monitors on demand and finalizing the progress tracking.	2017-07-26 14:12:15 +03:00
Marco Slot	80ea233ec1	Add function for dumping global wait edges	2017-07-25 16:52:32 +02:00
Marco Slot	81198a1d02	Add function for dumping local wait edges	2017-07-25 16:52:32 +02:00
Onder Kalaci	58faffa42b	Fix bug on error check for assigning distributed transaction id to a backend that has already been assigned a transaction.	2017-07-25 14:58:07 +03:00
Marco Slot	5923334114	Add transaction recovery regression tests	2017-07-24 20:44:38 +02:00
Marco Slot	3d7f79127d	Do not release locks in LogTransactionRecord	2017-07-24 20:44:38 +02:00
Brian Cloutier	88702ca58a	node_metadata takes out more sane locks - Never release locks - AddNodeMetadata takes ShareRowExclusiveLock so it'll conflict with the trigger which prevents multiple primary nodes. - ActivateNode and SetNodeState used to take AccessShareLock, but they modify the table so they should take RowExclusiveLock. - DeleteNodeRow and InsertNodeRow used to take AccessExclusiveLock but only need RowExclusiveLock.	2017-07-24 11:57:46 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	e6c375eb81	Tiny refactor to master_create_empty_shard	2017-07-24 11:57:46 +03:00
Brian Cloutier	ee270b65d7	make WorkerGetNodeWithName a static function	2017-07-24 11:57:46 +03:00
Marco Slot	601b17d544	Use distributed transaction number in 2PC identifiers	2017-07-21 17:36:33 +02:00
Marco Slot	18a6e478af	Fix typo in GetCurrentDistributedTransctionId	2017-07-21 17:36:33 +02:00
Brian Cloutier	7f1343103e	Fix PG 10 build, UNBOUNDED partitions now have different syntax Update code and tests to match the changes made in pg's d363d42	2017-07-21 14:30:11 +03:00
Brian Cloutier	74dd5bb281	Fix crash when removing an inactive node	2017-07-20 18:55:40 +03:00
Hadi Moshayedi	953df34d22	Explicit switch/case fall-throughs to avoid compiler warnings. GCC 7 added `-Wimplicit-fallthrough` to warn for not explicitly specified switch/case fall-throughs. According to https://gcc.gnu.org/gcc-7/changes.html, to suppress that warning we could either use `__attribute__(fallthrough)`, which didn't seem to work for earlier GCC versions, or a `/* fallthrough */` comment just before the following `case`. Previously Citus code had the fall-through comments inside the brackets, which didn't seem to suppress the warning. Putting a `/* fallthrough */` comment outside the brackets and right before the `case` fixes the problem.	2017-07-19 11:41:59 -04:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
Marco Slot	fd72cca6c8	Use predictable placement IDs in regression test output	2017-07-17 13:44:29 +03:00
Onder Kalaci	ce8edd88f7	Apply regression test changes that are due to PostgreSQL 10 changes that have recently changed	2017-07-14 13:22:12 +03:00
Brian Cloutier	72d8d2429b	Add a test for upgrading shard placements	2017-07-12 14:18:27 +02:00
Brian Cloutier	ee4edc498f	Don't release locks early in metadata functions	2017-07-12 14:18:27 +02:00
Brian Cloutier	f40f03270a	Fix locking in ReadWorkerNodes()	2017-07-12 14:18:27 +02:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Brian Cloutier	fe53fd4a8e	Remove functions created just for unit testing These functions are holdovers from pg_shard and were created for unit testing c-level functions (like InsertShardPlacementRow) which our regression tests already test quite effectively. Removing because it makes refactoring the signatures of those c-level functions unnecessarily difficult. - create_healthy_local_shard_placement_row - update_shard_placement_row_state - delete_shard_placement_row	2017-07-12 14:16:24 +02:00
Brian Cloutier	0b64bb1092	Fix typo in comment in CachedRelationLookup	2017-07-12 14:16:24 +02:00
Brian Cloutier	385d9cbbb7	Ignore generated multi_behavioral_analytics_create_table test files	2017-07-12 14:16:24 +02:00
Marco Slot	bf8377082c	Use consistent placement IDs in multi_modifying_xacts test	2017-07-12 14:16:23 +02:00
Brian Cloutier	fd8c142530	Remove unused line, @arguments was set but never used	2017-07-12 13:46:27 +02:00
Marco Slot	9f7e4769e2	Clarify placement connection error messages	2017-07-12 11:59:19 +02:00
Marco Slot	d3785b97c0	Remove XactModificationLevel distinction between DML and multi-shard	2017-07-12 11:59:19 +02:00
Marco Slot	710fe8666b	Use GetPlacementListConnection for router DML	2017-07-12 11:26:23 +02:00
Marco Slot	29f21fea59	Use GetPlacementListConnection for multi-shard commands	2017-07-12 11:26:22 +02:00
Marco Slot	01c9b1f921	Use GetPlacementListConnection for router SELECTs	2017-07-12 11:26:22 +02:00
Marco Slot	63676f5d65	Allow choosing a connection for multiple placements with GetPlacementListConnection	2017-07-12 11:26:22 +02:00
Jason Petersen	9018e698ec	Indentation cleanup Uncrustify 0.65 appears to have changed some defaults, resulting in breakages for those of us who have already upgraded; Travis still uses Uncrustify 0.64, but these changes work with both versions (assuming appropriately updated config), so this should permit use of either version for the time being.	2017-07-11 15:59:28 -06:00
Jason Petersen	d896fe7995	Add some test outputs to gitignore These were bothering me.	2017-07-11 15:37:32 -06:00
Burak Yucesoy	a15b3c6df2	Add tests for concurrent INSERT and VACUUM behaviour	2017-07-10 15:46:48 +03:00
Burak Yucesoy	c8b9e4011b	Remove LockRelationDistributionMetadata function	2017-07-10 15:46:37 +03:00
Burak Yucesoy	cb6070c720	Use ShareUpdateExclusiveLock instead of ShareLock in VACUUM Before this change, we used ShareLock to acquire a lock on distributed tables while running VACUUM. This makes VACUUM and INSERT block each other. With this change we changed the lock mode from ShareLock to ShareUpdateExclusiveLock, which does not conflict with the locks INSERT acquires.	2017-07-10 15:46:19 +03:00

912 Commits (86a70515c54a217fe246f055b1e02b42f73a11a4)