citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	a9933deac6	Make real time executor work in transactions	2017-11-30 09:59:32 +03:00
Jason Petersen	0eacf6bd95	Refactor VacuumStmt checker to be single-return Decided this would be safer for the future (defaults to unsupported).	2017-11-29 16:06:50 -07:00
Jason Petersen	b12e77ab0e	Ensure unsupported VACUUMs don't go to workers Apparently these two blocks have been incorrect for nearly a year…	2017-11-29 16:06:50 -07:00
Marco Slot	7ea718fd8d	Round-robin over worker nodes for 0-shard router queries	2017-11-29 15:52:22 +01:00
Onder Kalaci	05fb0dd020	Add infrastructure for filtering restriction contexts based on the input query In subquery pushdown, we first ensure that each relation is joined with at least on another relation on the partition keys. That's fine given that the decision is binary: pushdown the query at all or not. With recursive planning, we'd want to check whether any specific part of the query can be pushded down or not. Thus, we need the ability to understand which part(s) of the subquery is safe to pushdown. This commit adds the infrastructure for doing that.	2017-11-28 09:58:21 +02:00
Onder Kalaci	26d9b58e9e	Make sure that ExtractRangeTableRelationWalker never misses RTE_RELATION	2017-11-28 09:27:34 +02:00
Onder Kalaci	32def06ebd	Split assigning RTE identities and partitioning related query modifications Note that we used to iterate over the RTEs once for performance reasons. However, keeping an extra copy of original query seems more costly and hard to maintain/explain.	2017-11-28 09:27:34 +02:00
Marco Slot	feffe86440	Subqueries containing functions go through subquery pushdown	2017-11-27 22:13:02 +01:00
Onder Kalaci	48f96bf3e5	Enable non equi joins in subquery pushdown Subquery pushdown planning is based on relation restriction equivalnce. This brings us the opportuneatly to allow any other joins as long as there is an already equi join between the distributed tables. We already allow that for joins with reference tables and this commit allows that for joins among distributed tables.	2017-11-23 16:13:46 +02:00
Onder Kalaci	16421f089f	Register citus custom scan nodes	2017-11-23 11:38:33 +02:00
Onder Kalaci	83c1143505	Refactor custom scan related codes In this commit, we don't change any codes, only create a new file and move the related functions and types there.	2017-11-23 11:38:12 +02:00
Marco Slot	20a526d5c4	Fix memory leak in ListToHashSet	2017-11-22 11:26:58 +01:00
Marco Slot	f4ceea5a3d	Enable 2PC by default	2017-11-22 11:26:58 +01:00
Marco Slot	8486f76e15	Auto-recover 2PC transactions	2017-11-22 11:26:58 +01:00
Marco Slot	6ba3f42d23	Rename MultiPlan to DistributedPlan	2017-11-22 09:36:24 +01:00
Marco Slot	0ad39b36fe	Treat immutable table functions and constant subqueries as reference tables	2017-11-21 14:15:22 +01:00
Onder Kalaci	d558ebb923	Relax the checks on ensuring distribution columns for target entries With this commit, we allow pushing down subqueries with only reference tables where GROUP BY or DISTINCT clause or Window functions include only columns from reference tables.	2017-11-21 12:28:14 +02:00
Andres Freund	d063658d6d	Protect some initializations from being called during backend startup. On EXEC_BACKEND builds these functions shouldn't be called at every backend start.	2017-11-20 15:29:51 -08:00
Brian Cloutier	d267e0f9fa	EXEC_BACKEND: don't put pointers to shared hashes into shared memory Store pointers to shared hashes in process-local variables. Previously pointers to shared hashes were put into shared memory. This causes problems on EXEC_BACKEND because everybody calls execve and receives a brand new address space; the shared hash will be in a different place for every backend. (normally we call fork, which gives you a copy of the address space, so these pointers remain constant)	2017-11-20 15:29:51 -08:00
Brian Cloutier	30a2365d81	Rename CreateDirectory to CitusCreateDirectory	2017-11-20 14:38:26 -08:00
Brian Cloutier	aa2ab023a2	Rename RemoveDirectory -> CitusRemoveDirectory	2017-11-20 14:21:52 -08:00
Brian Cloutier	06f756b0a1	Rename DeleteFile -> CitusDeleteFile	2017-11-20 13:30:11 -08:00
Marco Slot	9793218122	Do not commit already-committed prepared transactions in recovery	2017-11-20 13:18:48 +01:00
Marco Slot	ae47df01ea	Observe prepared xacts twice in RecoverWorkerTransactions to avoid race condition	2017-11-20 11:44:08 +01:00
Marco Slot	2410c2e450	Rewrite recover_prepared_transactions to be fast, non-blocking	2017-11-20 11:27:40 +01:00
Onder Kalaci	5bea95009b	Skip autovacuum processes for distributed deadlock detection Autovacuum process cancels itself if any modification starts on the table in order to avoid blocking your regular Postgres sessions. That's normal and expected. Thus, any locks held by autovacuum process cannot involve in a distributed deadlock since it'll be released if needed.	2017-11-15 14:32:16 +02:00
Onder Kalaci	c65c153a46	Skip speculative locks for distributed deadlock detection These locks are held for a very short duration time and cannot contribute to a deadlock. Speculative locks are used by Postgres for internal notification mechanism among transactions.	2017-11-15 12:43:45 +02:00
Marco Slot	bbbadd6d1b	Bump Citus version to 7.2devel	2017-11-15 10:32:49 +01:00
Marco Slot	d3b634b301	Allow generating placement IDs without using the sequence	2017-11-15 10:12:06 +01:00
Marco Slot	c24a0875a5	Allow generating shard IDs without using the sequence	2017-11-15 10:12:05 +01:00
Brian Cloutier	0f3230170f	Pull in INT32_MAXINT and INT32_MININT	2017-11-14 14:03:46 -08:00
Brian Cloutier	0db8277266	remove unused errno import	2017-11-14 13:09:34 -08:00
Brian Cloutier	5d9f3ae7fd	Remove unused poll import from multi_real_time_executor	2017-11-14 13:09:34 -08:00
Marco Slot	533a533565	Only drop sequences on workers with metadata	2017-11-14 16:01:56 +01:00
velioglu	be28ba8e70	Add stub UDF to run pg_upgrade flawlessly	2017-11-13 16:14:45 +02:00
metdos	111c04c2bd	Warn on CLUSTER command for distributed tables	2017-11-10 12:14:45 +02:00
Burak Yücesoy	863df0b874	Merge branch 'master' into fix_partitioning_in_schema	2017-11-09 12:49:35 +02:00
Burak Yucesoy	17229ed7bd	Fix attaching partition to a distributed table in schema While attaching a partition to a distributed table in schema, we mistakenly used unqualified name to find partitioned table's oid. This caused problems while using partitioned tables with schemas. We are fixing this issue in this PR.	2017-11-09 13:20:29 +03:00
Onder Kalaci	94921a2be1	Skip page-level locks on distributed deadlock detection Short-term share/exclusive page-level locks are used for read/write access. Locks are released immediately after each index row is fetched or inserted. Since those locks may not lead to any deadlocks, it's safe to ignore them in the distributed deadlock detection.	2017-11-09 10:37:23 +02:00
Marco Slot	f71728f634	Add GUC for specifying sslmode in connections to workers	2017-11-08 14:15:58 +01:00
Murat Tuncer	4e3d633ebf	Add check for connection failures during multishard update (#1765 )	2017-11-07 12:33:25 +02:00
Hadi Moshayedi	6d79d25101	Fix a relcache reference leak in stats collection. In DistributedTablesSize() we didn't close the relations that had replication factor > 2. This caused relcache reference leaks, and warning messages like following in logs: WARNING: relcache reference leak: relation "researchers" not closed	2017-11-06 23:16:43 -05:00
metdos	c83edc36b5	Check connection status before using it	2017-11-06 14:53:35 +02:00
Brian Cloutier	7be1545843	Support implicit casts during INSERT/SELECT It's possible to build INSERT SELECT queries which include implicit casts, currently we attempt to support these by adding explicit casts to the SELECT query, but this sometimes crashes because we don't update all nodes with the new types. (SortClauses, for instance) This commit removes those explicit casts and passes an unmodified SELECT query to the COPY executor (how we implement INSERT SELECT under the scenes). In lieu of those cases, COPY has been given some extra logic to inspect queries, notice that the types don't line up with the table it's supposed to be inserting into, and "manually" casting every tuple before sending them to workers.	2017-11-03 22:27:15 -07:00
Marco Slot	6883a09cdd	Allow distributed partitioned table creation in Cloud	2017-11-03 10:09:18 +01:00
Marco Slot	6219186683	Allow distributed INSERT...SELECT via worker nodes in MX	2017-11-02 14:38:39 +01:00
Hadi Moshayedi	7280774cf4	Use list_length() != 1 in SingleReplicatedTable(). ShardPlacementList's implementation can return NIL. In previous implementation we got a segmentation fault in this case. The relation can be dropped after getting distributed table list but before calling SingleReplicatedTable().	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	7691991cb5	Do PG_TRY() inside a subtransaction block. If we don't propagate the errors we are catching in PG_CATCH(), database's internal state might not be clean. So we do PG_TRY() inside a subtransaction so we can rollback to it after catching errors.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	9bfbbf8a04	Make reports hostname configurable and enable stats collection in tests. This patch adds --with-reports-host configure option, which sets the REPORTS_BASE_URL constant. The default is reports.citusdata.com. It also enables stats collection in tests.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	acaf085a80	Add callback function for request by CollectBasicUsageStatistics(). Curl writes the received response to stdout if we don't specify a response callback or an output file. This can pollute the PostgreSQL log. In this change we add a callback function so the response messages aren't added to the log file.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	747e439601	Limit number of stats collection retries to once a day.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	78a2cd9052	Check for Citus updates. Sends a request to /v1/releases/latest?flavor=$CITUS_EDITION once a day, which returns a response similar to {"version": "7.1.0", "major": 7, "minor": 1, "patch": 0}. Then compares it with current Citus version, and if the latest release is newer, logs a LOG message.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	34f3ec0961	Call FlushDistTableCache() before stats collection.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	c18c6625d9	Lock relations before calling citus_table_size(). This is to make sure they don't get dropped.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	97d544b75c	Follow the patterns used in Deadlock Detection in Stats Collection. This includes: (1) Wrap everything inside a StartTransactionCommand()/CommitTransactionCommand(). This is so we can access the database. This also switches to a new memory context and releases it, so we don't have to do our own memory management. (2) LockCitusExtension() so the extension cannot be dropped or created concurrently. (3) Check CitusHasBeenLoaded() && CheckCitusVersion() before doing any work. (4) Do not PG_TRY() inside a loop.	2017-10-31 21:51:43 -04:00
Marco Slot	100aaeb3f5	Fix typo in distributed deadlock error message	2017-10-31 19:39:32 +01:00
metdos	8c356b2bc8	Don't try to add restrictions for reference tables in insert into select	2017-10-31 19:44:10 +02:00
mehmet furkan şahin	32fb19911c	Add Constraint %s Add Primary Key Using index %s support This commit makes a change in relay_event_utility.c to check if the Alter Table command adds a constraint using index. If this is the case, it appends the shard id to the index name.	2017-10-31 16:03:56 +03:00
Marco Slot	7e34348334	Add shard transfer mode parameter to shard copy functions	2017-10-31 13:30:48 +01:00
Marco Slot	2bb46bb5ee	Reset connectionReady flag after moving a connection in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	e6e6897499	Defer initial PQflush to main loop in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	d6dadb1b25	Use correct index for ModifyWaitEvent in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Furkan Sahin	2b39c52f0b	Replica identity on create_distributed_table By this commit, citus minds the replica identity of the table when we distribute the table. So the shards of the distributed table have the same replica identity with the local table.	2017-10-31 13:08:36 +03:00
Marco Slot	7f68f78ee9	Omit public schema from shard_name output	2017-10-31 00:22:07 +01:00
Murat Tuncer	e16805215d	Support count(distinct) for non-partition columns (#1692 ) Expands count distinct coverage by allowing more cases. We used to support count distinct only if we can push down distinct aggregate to worker query i.e. the count distinct clause was on the partition column of the table, or there was a grouping on the partition column. Now we can support - non-partition columns, with or without grouping on partition column - partition, and non partition column in the same query - having clause - single table subqueries - insert into select queries - join queries where count distinct is on partition, or non-partition column - filters on count distinct clauses (extends existing support) We first try to push down aggregate to worker query (original case), if we can't then we modify worker query to return distinct columns to coordinator node. We do that by adding distinct column targets to group by clauses. Then we perform count distinct operation on the coordinator node. This work should reduce the cases where HLL is used as it can address anything that HLL can. However, if we start having performance issues due to very large number rows, then we can recommend hll use.	2017-10-30 13:12:24 +02:00
Marco Slot	be46661bf7	Block only 2PCs instead of all writes in citus_create_restore_point	2017-10-27 00:07:32 +02:00
mehmet furkan şahin	61ae33dc7f	ALTER TABLE .. REPLICA IDENTITY support is implemented	2017-10-26 13:44:28 +03:00
Brian Cloutier	4a17d12d74	Replace uint with uint32	2017-10-25 19:32:12 -07:00
velioglu	0b5db5d826	Support multi shard update/delete queries	2017-10-25 15:52:38 +03:00
Marco Slot	4bde83e1d2	Relay error message if DML fails on worker	2017-10-25 14:23:21 +02:00
Hadi Moshayedi	9a04b78980	Send server_id for statistics reports. (#1698 ) This change introduces the `pg_dist_node_metadata` which has a single jsonb value. When creating the extension, a random server id is generated and stored in there. Everything in the metadata table is added as a nested objected to the json payload that is sent to the reports server.	2017-10-18 21:20:32 -04:00
Hadi Moshayedi	86bcd93a4a	Don't collect stats when there is a version mismatch. (#1712 ) The following scenario can cause an Assert() crash if we don't do this: - Install Citus v7.0-15 - Restart server & run a query to start maintenanced. - Install Citus v7.1 - Restart server & run a query. This will tell user to upgrade. - Type "UPDATE EXTENSION c" & press tab. maintenanced will start and crash with Assert(CitusHasBeenLoaded() && CheckCitusVersion(WARNING)); This change checks Citus version before calling metadata functions so the crash doesn't happen.	2017-10-17 14:01:14 -04:00
Jason Petersen	8544878c4b	Add citus_version(), analogous to PG's version() This will provide the full project name (i.e. Citus/Citus Enterprise), and the host system, compiler, and architecture word size. I wanted to limit the number of copied files in 'config', so I added only config.guess and call it manually, rather than using the macro AC_CANONICAL_HOST, which requires several other files.	2017-10-16 18:09:29 -06:00
Brian Cloutier	91ff8cd2d5	{*,}create_distributed_table doesn't emit OID (#1710 )	2017-10-16 18:08:51 -06:00
Brian Cloutier	ebcb2b65e9	Add master_move_node function	2017-10-16 10:51:28 -07:00
Brian Cloutier	58cf15ceca	DistributedTableSize doesn't emit oid when erring out	2017-10-14 02:42:57 +03:00
Hadi Moshayedi	2aec6eda49	Properly use #ifdef HAVE_LIBCURL.	2017-10-13 12:04:36 -06:00
Jason Petersen	01353cb7cb	Use header define rather than -D flag Eclipse apparently doesn't scan build output looking for -D flags, so having the value actually appear in a header is nicer for those of us using IDEs.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	946659aebe	Delete StatsCollection memory context after we are done with stats reporting. Previously we left the memory context untouched, which overtime leaked memory.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	873fd1e7ff	Fix compiling --without-libcurl. Previously <curl/curl.h> was included even if compiled --without-libcurl. This can fail when libcurl headers are not there. This commit guards this include by checks for HAVE_LIBCURL.	2017-10-13 11:00:09 -04:00
Murat Tuncer	4832abc7cb	Make multi_master_planner.c coding convention compliant Changed order of function definitions and added declarations in the beginning of the file	2017-10-13 14:59:48 +03:00
Murat Tuncer	f7ab901766	Add select distinct, and distinct on support Distinct, and distinct on() clauses are supported in simple selects, joins, subqueries, and insert into select queries.	2017-10-13 14:59:48 +03:00
Hadi Moshayedi	6879f92e23	Fix out of bound memeory access when getting HTTP response code. (#1699 )	2017-10-12 12:51:42 -04:00
Hadi Moshayedi	a1387f4aa8	Basic usage statistics collection. (#1656 ) Adds ```citus.enable_statistics_collection``` GUC variable, which ```true``` by default, unless built without libcurl. If statistics collection is enabled, sends basic usage data to Citus servers every 24 hours. The data that is collected consists of: - Citus version - OS name & release - Hardware Id - Number of tables, rounded to next power of 2 - Size of data, rounded to next power of 2 - Number of workers	2017-10-11 09:55:15 -04:00
Onder Kalaci	498ac80d8b	Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT This commit provides the support for window functions in subquery and insert into select queries. Note that our support for window functions is still limited because it must have a partition by clause on the distribution key. This commit makes changes in the files insert_select_planner and multi_logical_planner. The required tests are also added with files multi_subquery_window_functions.out and multi_insert_select_window.out.	2017-10-04 15:33:07 +03:00
Marco Slot	9e516513fc	Use local group ID when querying for prepared transactions	2017-10-03 16:36:53 +02:00
Hadi Moshayedi	11adb9b034	Push down LIMIT and HAVING when grouped by partition key. (#1641 ) We can do this because all rows belonging to a group are in the same shard when grouping by distribution column on a range/hash distributed table.	2017-10-02 20:17:51 -04:00
Marco Slot	394918f9d0	Invalidate worker and group ID cache in maintenance daemon	2017-10-02 18:14:29 +02:00
Marco Slot	43d5e79eaa	Execute transmit commands as superuser during task-tracker queries	2017-09-28 15:27:25 +02:00
Marco Slot	306c58d59b	Check for absolute paths in COPY with format transmit	2017-09-28 15:27:11 +02:00
Marco Slot	cb6b0e820c	Allow read-only users to run task-tracker queries	2017-09-28 13:52:36 +02:00
Marco Slot	da6b42a3e2	Use unique constraint index for transaction record deletion	2017-09-28 12:04:56 +02:00
Onder Kalaci	68ca8cb7f0	Skip relation extension locks We should skip if the process blocked on the relation extension since those locks are hold for a short duration while the relation is actually extended on the disk and released as soon as the extension is done. Thus, recording such waits on our lock graphs could yield detecting wrong distributed deadlocks.	2017-09-28 10:09:09 +03:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We sent multiple commands to worker when starting a transaction. Previously we only checked the result of the first command that is transaction 'BEGIN' which always succeeds. Any failure on following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Jason Petersen	b4d53423fa	Add adapter functions for OpenFile changes	2017-09-25 17:20:24 -07:00
Jason Petersen	d686123dae	Omit now-public Explain methods from PG11 build This copy-pasted code is no longer needed in PG11.	2017-09-25 17:20:24 -07:00
Jason Petersen	89d02c6115	Add ruleutils file for PostgreSQL 11	2017-09-25 17:20:24 -07:00
Jason Petersen	bbc15e0598	Handle HASHPROC changes PostgreSQL 11 now has "standard" and "extended" (64-bit) versions of hash functions.	2017-09-25 17:20:24 -07:00
Jason Petersen	6c9b19a954	Add version-compat header For polyfill macros, etc.	2017-09-25 17:20:23 -07:00
Jason Petersen	fbeaa2f9d0	Remove direct access to tupleDesc->attrs A level of indirection was removed from this field for PostgreSQL 11. By using the handy provided macro, we can be version agnostic.	2017-09-25 17:20:23 -07:00
Jason Petersen	6a020b5adc	Update CopyGetAttnums with latest from PostgreSQL This function was recently modified to use the TupleDescAttr wrapper, which abstracts away recent changes to TupleDesc.	2017-09-25 17:20:23 -07:00
Andres Freund	78716e5546	Fix possible shard cache incoherency. When a table and it's shards are dropped, and afterwards the same shard identifiers are reused, e.g. due to a DROP & CREATE EXTENSION, the old entry in the shard cache and the required entry in the shard cache might be for different tables. Force invalidation for both old and new table to fix.	2017-09-25 13:05:09 -07:00
velioglu	0a56ed910b	Change error message of queries with distributed and local table Citus can handle INSERT INTO ... SELECT queries if the query inserts into local table by reading data from distributed table. The opposite way is not correct. With this commit we warn the user if the latter option is used.	2017-09-22 13:46:19 -07:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which doesn't necessarly be in a writeable memory, is dangreous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Onder Kalaci	6736fd1682	Remove two obsolete functions Namely GetConnectionFromPGconn() and CloseConnectionByPGconn()	2017-09-21 00:36:23 -06:00
Onder Kalaci	33ec33c5b3	Ensure schema exists on reference table creation If the schema doesn't exists on the workers, create it.	2017-09-18 23:50:47 +03:00
Onder Kalaci	6116c8e93d	Allow pushing down GROUP BYs when at least there is one distribution column in the target list	2017-09-15 19:15:06 +03:00
Onder Kalaci	a5b66912d4	Expand reference table support in subquery pushdown With this commit, we relax the restrictions put on the reference tables with subquery pushdown. We did three notable improvements: 1) Relax equi-join restrictions Previously, we always expected that the non-reference tables are equi joined with reference tables on the partition key of the non-reference table. With this commit, we allow any column of non-reference tables joined using non-equi joins as well. 2) Relax OUTER JOIN restrictions Previously Citus errored out if any reference table exists at any point of the outer part of an outer join. For instance, See the below sketch where (h) denotes a hash distributed relation, (r) denotes a reference table, (L) denotes LEFT JOIN and (I) denotes INNER JOIN. (L) / \ (I) h / \ r h Before this commit Citus would error out since a reference table appears on the left most part of an left join. However, that was too restrictive so that we only error out if the reference table is directly below and in the outer part of an outer join. 3) Bug fixes We've done some minor bugfixes in the existing implementation.	2017-09-14 20:59:22 +03:00
Marco Slot	d1befa4df9	Wait for I/O to finish after PQputCopyData	2017-09-12 16:18:42 -07:00
Marco Slot	cbe16169b4	Free per-tuple COPY memory in INSERT...SELECT	2017-09-12 15:35:53 -07:00
Marco Slot	5fe0845d7e	Always copy MultiPlan in GetMultiPlan	2017-09-12 11:38:52 -07:00
Jason Petersen	8b2c3fcc15	Add clarifying comment to RngVarCallbackForDropIdx We don't need the PARTITION-related logic recently added in PostgreSQL.	2017-09-01 15:57:30 -06:00
Jason Petersen	ec30ad38ba	Update ruleutils_10 with latest PostgreSQL changes See: postgres/postgres@21d304dfed postgres/postgres@bb5d6e80b1 postgres/postgres@d363d42bb9 postgres/postgres@eb145fdfea postgres/postgres@decb08ebdf postgres/postgres@a3ca72ae9a postgres/postgres@bc2d716ad0 postgres/postgres@382ceffdf7 postgres/postgres@c7b8998ebb postgres/postgres@e3860ffa4d postgres/postgres@76a3df6e5e	2017-09-01 14:26:59 -06:00
Jason Petersen	ebecde8f6e	Update ruleutils_96 with latest PostgreSQL changes See: postgres/postgres@41ada83774 postgres/postgres@3b0c2dbed0 postgres/postgres@ff2d537223	2017-09-01 14:26:53 -06:00
Marco Slot	0aadbb1760	Convert multi-row INSERT target list to Vars	2017-08-25 10:55:56 +02:00
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Marco Slot	dbf18df995	Don't error out if BuildGlobalWaitGraph fails to connect	2017-08-23 19:08:03 +02:00
Onder Kalaci	c7bb29b69e	Prevent maintanince deamon crashes due to dead processes If after the distributed deadlock detection decides to cancel a backend, the backend has been terminated/killed/cancelled externally, we might be accessing to a NULL pointer. This commit prevents that case by ignoring the current distributed deadlock.	2017-08-23 15:44:09 +03:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Marco Slot	bd6bf29983	Don't add procs multiple times in BuildWaitGraphForSourceNode	2017-08-21 16:48:30 +02:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Önder Kalacı	b82f886ad3	Merge branch 'master' into improve_deadlock_detection	2017-08-18 13:07:18 +03:00
Marco Slot	7523753a73	Clear metadata OID cache prior to deadlock detection	2017-08-18 11:20:24 +02:00
Andres Freund	b936bde936	Take AccessShareLock on the extension prior to running deadlock detection	2017-08-18 11:20:24 +02:00
Onder Kalaci	20679c9e8b	Relax assertion on deadlock detection considering self deadlocks.	2017-08-18 11:16:38 +03:00
Onder Kalaci	550a5578d8	Skip deadlock detection on the workers Do not run distributed deadlock detection on the worker nodes to prevent errornous decisions to kill the deadlocks.	2017-08-17 19:43:38 +03:00
Marco Slot	1eca53ad40	Exit maintenanced on database crash	2017-08-16 18:29:44 +02:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539 ) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493 .	2017-08-15 13:02:28 -04:00
Onder Kalaci	205501532a	Add version check to the maintenance daemon We should prevent running the deadlock detection if there is a major version change. Otherwise, the daemon may access to obsolete metadata catalog tables.	2017-08-15 18:47:13 +03:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Marco Slot	9232823070	Abort on failure on master connection during copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	df7723cde5	Should not commit on aborted non-critical connections	2017-08-15 13:44:20 +02:00
Eren Başak	77626c4238	Fix NULL nodeClusterString crush on pg_worker_list.conf migrations	2017-08-14 18:13:53 +03:00
Eren Başak	b3d2f9ba71	Fix pg_worker_list use-after-free bug This change fixes a use-after-free bug while renaming obsolete `pg_worker_list.conf` file, which causes Citus to crash during upgrade (or even extension creation) if `pg_worker_list.conf` exists.	2017-08-14 18:13:53 +03:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operation on parent table	2017-08-14 14:44:30 +03:00
Burak Yucesoy	52b9e35d50	Add relationIdList field to the Job struct	2017-08-14 14:06:22 +03:00
Onder Kalaci	5b48de7430	Improve deadlock detection for MX We added a new field to the transaction id that is set to true only for the transactions initialized on the coordinator. This is only useful for MX in order to distinguish the transaction that started the distributed transaction on the coordinator where we could have the same transactions' worker queries on the same node.	2017-08-12 13:28:37 +03:00
Onder Kalaci	59133415b0	Add logging infrasture for distributed deadlock detection We added a new GUC citus.log_distributed_deadlock_detection which is off by default. When set to on, we log some debug messages related to the distributed deadlock to the server logs.	2017-08-12 13:28:37 +03:00
Onder Kalaci	e5d5bdff51	Enable distributed deadlock detection on the maintenance deamon With this commit, the maintenance deamon starts to check for distributed deadlocks. We also introduced a GUC variable (distributed_deadlock_detection_factor) whose value is multiplied with Postgres' deadlock_timeout. Setting it to -1 disables the distributed deadlock detection.	2017-08-12 13:28:37 +03:00
Onder Kalaci	66936053a0	Improve error messages when a backend is cancelled by deadlock detection We send SIGINT to a backend that is cancelled due to a deadlock. That approach ends up being a very confusing error message. With this commit we intercept the error messages and show a more meaningful error message to the user.	2017-08-12 13:28:37 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag Now that we already have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned with distributed transaction ids introduced with `3369f3486f`. The dependency among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth first seach) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path that the cycle exists and cancel the youngest distributed transaction. Note that, we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	55992d4bc0	Disallow task-tracker queries on follower clusters	2017-08-12 11:47:31 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00
Marco Slot	53584affa8	Fix locking in create_distributed_table	2017-08-11 11:34:33 +03:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
velioglu	b0efffae1c	Correct planner and add more tests	2017-08-11 10:16:13 +03:00
velioglu	7550b8ad52	Fix anchor shard id selection when reference table exists	2017-08-11 10:09:47 +03:00
velioglu	ceba81ce35	Move physical planner checks to logical planner	2017-08-11 10:09:47 +03:00
velioglu	0359d03530	Add set operation check for reference tables	2017-08-11 10:09:47 +03:00
velioglu	c4e3b8b5e1	Add planner changes and tests for subquery on reference tables	2017-08-11 10:09:47 +03:00
velioglu	45717dd013	Check equivalence on reference tables for subquery pushdown	2017-08-11 10:09:47 +03:00
Marco Slot	0ae265c436	Add citus_create_restore_point for distributed snapshots	2017-08-11 07:36:20 +02:00
Marco Slot	fdff210ef7	Wait for commit/abort/prepare results asynchronously	2017-08-11 00:03:06 +02:00
Marco Slot	fca986f214	Add API for waiting for multiple connections	2017-08-11 00:03:06 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Brian Cloutier	0dee4f8418	Metadata sync syncs all nodes, not just primaries	2017-08-10 16:55:55 +03:00
Eren Başak	f9470329e5	Remove test_helper_functions.h inclusions	2017-08-10 12:42:46 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belong to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	dee66e3959	Final review feedback	2017-08-10 01:10:09 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively go through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing trans- actions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single- row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master- evaluated functions.	2017-08-10 00:32:46 -07:00
velioglu	7e436c0277	Add bool expression to pruning instance with a function	2017-08-10 08:56:36 +03:00
Andres Freund	e8b793c454	Support for IN (const, list) and = ANY(const, b, c) pruning.	2017-08-10 08:56:36 +03:00
Onder Kalaci	b5ea3ab6a3	Improve locking semantics for backend management We use the backend shared memory lock for preventing new backends to be part of a new distributed transaction or an existing backend to leave a distributed transaction while we're reading the all backends' data. The primary goal is to provide consistent view of the current distributed transactions while doing the deadlock detection.	2017-08-09 17:17:12 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	08ed6d8269	Prevent pg_dist_node changes during master_create_empty_shard	2017-08-09 14:22:09 +02:00
Marco Slot	3a0571e69b	Remove LockMetadataSnapshot	2017-08-09 14:09:54 +02:00
Marco Slot	c2f8bafa05	Fix shard creation vs. pg_dist_node change locking	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yucesoy	8455d1a4ef	Ensure we are allowing partitioned tables at all appropriate places	2017-08-09 10:01:35 +03:00
Burak Yucesoy	2eee556738	Add distributed partitioned table support for COPY For partitioned tables, PostgreSQL opens partition and its partitions in BeginCopyFrom and it expects its caller to close those relations. However, we do not have quick access to opened relations and performing special operations for partitioned tables isn't necessary in coordinator node. Therefore before calling BeginCopyFrom, we change relkind of those partitioned tables to RELKIND_RELATION. This prevents PostgreSQL to open its partitions as well.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	31f3221342	Add distributed partitioned table support to router plannable queries In standart_planner, PostgreSQL expands partitioned tables to their partitions and call our restriction hook for each partition. It also, for some queries, skips the partitioned table itself completely. This behaviour makes it difficult to prune shards and decide whether query is router plannable or not. To prevent this behaviour, we change inh flag of partitioned tables to false in the query tree. In this case, PostgreSQL treats those partitioned tables as regular relations and does not expand them. This behaviour is inline with our expectations, because we do not want to treat partitioned tables differently on coordinator. Although we are not entirely comfortable with modifying query tree, other solutions to this problem is overly complicated.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are; - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propogating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	d3e9746236	Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 18:32:34 +02:00
Brian Cloutier	7060ade6fe	GetNodeTuple returns NULL it node does not exist It never throws an error.	2017-08-08 13:12:06 +03:00
Brian Cloutier	a3e9bef685	All users of WorkerNodeHash take an AccessShareLock The metadata cache simulates a SELECT on pg_dist_node. Now the locks it takes also simulate that SELECT.	2017-08-08 13:12:06 +03:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	3151b52a0b	Add citus.cluster_name GUC - Nodes with a nodecluster which does not match citus.cluster_name are excluded from the metadata cache and never seen by another part of Citus.	2017-08-08 13:12:06 +03:00
Brian Cloutier	94947c0d54	Refactor: ReplicateShardToAllWorkers more explicitly locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	f87fefa323	Refactor: DistributedTableSize more explicitly only locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	3769381366	Fix inaccurate comment on SetNodeState	2017-08-08 13:12:06 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Brian Cloutier	e7846ba7d1	Allow metadata sync functions on secondaries {start,stop}_metadata_sync_to_node now toggle the hasMetadata flag when run on secondaries but don't attempt to actually sync any metadata.	2017-08-07 18:46:51 +03:00
Marco Slot	4cc7c36596	Simplify metadata lock acquisition for DML	2017-08-07 15:36:58 +02:00
Marco Slot	aa7ca81548	Execute UPDATE/DELETE statements with 0 shards	2017-08-07 15:36:58 +02:00
Marco Slot	bac60bb64f	Function evaluation descends into expression trees	2017-08-06 19:53:05 +02:00
Brian Cloutier	37985de85e	master_disable_node no longer crashes when given a non-existant node	2017-08-04 11:14:54 +03:00
Hadi Moshayedi	8229a64fe8	Remove distributed tables' dependency on distribution key columns. (#1527 ) This change removes distributed tables' dependency on distribution key columns. We already check that we cannot drop distribution key columns in ErrorIfUnsupportedAlterTableStmt() at multi_utility.c, so we don't need to have distributed table to distribution key column dependency to avoid dropping of distribution key column. Furthermore, having this dependency causes some warnings in pg_dump --schema-only (See #866), which are not desirable. This change also adds check to disallow drop of distribution keys when citus.enable_ddl_propagation is set to false. Regression tests are updated accordingly.	2017-08-03 10:07:04 -04:00
Murat Tuncer	fa18899cf9	Remove serialization/deserialization of multiplan node (#1477 ) introduces copy functions for Citus MultiPlan nodes. uses ExtensibleNode mechanism to store MultiPlan data drops serialiazation of MultiPlans	2017-08-02 08:24:00 +03:00
Burak Yucesoy	7769f1d012	Refactor distributed table creation logic This commit is preperation for introducing distributed partitioned table support. We want to clean and refactor some code in distributed table creation logic so that we can handle partitioned tables in more robust way.	2017-07-31 11:11:23 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Murat Tuncer	26f020dc6e	Make maxTaskStringSize configurable (#1501 ) maxTaskStringSize determines the size of worker query string. It was originally hard coded to a specific value. This has caused issues at some users. Since it determines initial shared memory allocation, we did not want to set it to an arbitrary higher number. Instead made it configurable. This commit introduces a new GUC variable max_task_string_size Changes in this variable requires restart to be in effect.	2017-07-27 11:39:12 -07:00
Onder Kalaci	6132d17481	Convert global wait edges to adjacency list In this commit, we add ability to convert global wait edges into adjacency list with the following format: [transactionId] = [transactionNode->waitsFor {list of waiting transaction nodes}]	2017-07-27 19:53:51 +03:00
Murat Tuncer	8729b7d55a	Use cstore_table_size function to determine cstore table size (#1521 ) pg_table_size/pg_relation_size variants always return 0 for cstore tables. We should be using cstore_table_size function for cstore_tables.	2017-07-27 09:02:07 -07:00
Brian Cloutier	32e16ffe02	Give isolation tester ability to see locks on workers	2017-07-26 18:43:04 +03:00
Eren Başak	a12f1980de	Add Progress Tracking Infrastructure This change adds a general purpose infrastructure to log and monitor process about long running progresses. It uses `pg_stat_get_progress_info` infrastructure, introduced with PostgreSQL 9.6 and used for tracking `VACUUM` commands. This patch only handles the creation of a memory space in dynamic shared memory, putting its info in `pg_stat_get_progress_info`, fetching the progress monitors on demand and finalizing the progress tracking.	2017-07-26 14:12:15 +03:00
Marco Slot	80ea233ec1	Add function for dumping global wait edges	2017-07-25 16:52:32 +02:00
Marco Slot	81198a1d02	Add function for dumping local wait edges	2017-07-25 16:52:32 +02:00
Onder Kalaci	58faffa42b	Fix bug on error check for assigning distributed transaction id to a backend that has already been assigned a transaction.	2017-07-25 14:58:07 +03:00
Marco Slot	3d7f79127d	Do not release locks in LogTransactionRecord	2017-07-24 20:44:38 +02:00
Brian Cloutier	88702ca58a	node_metadata takes out more sane locks - Never release locks - AddNodeMetadata takes ShareRowExclusiveLock so it'll conflict with the trigger which prevents multiple primary nodes. - ActivateNode and SetNodeState used to take AccessShareLock, but they modify the table so they should take RowExclusiveLock. - DeleteNodeRow and InsertNodeRow used to take AccessExclusiveLock but only need RowExclusiveLock.	2017-07-24 11:57:46 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	e6c375eb81	Tiny refactor to master_create_empty_shard	2017-07-24 11:57:46 +03:00
Brian Cloutier	ee270b65d7	make WorkerGetNodeWithName a static function	2017-07-24 11:57:46 +03:00
Marco Slot	601b17d544	Use distributed transaction number in 2PC identifiers	2017-07-21 17:36:33 +02:00
Marco Slot	18a6e478af	Fix typo in GetCurrentDistributedTransctionId	2017-07-21 17:36:33 +02:00
Brian Cloutier	7f1343103e	Fix PG 10 build, UNBOUNDED partitions now have different syntax Update code and tests to match the changes made in pg's d363d42	2017-07-21 14:30:11 +03:00
Brian Cloutier	74dd5bb281	Fix crash when removing an inactive node	2017-07-20 18:55:40 +03:00
Hadi Moshayedi	953df34d22	Explicit switch/case fall-throughs to avoid compiler warnings. GCC 7 added `-Wimplicit-fallthrough` to warn for not explicitly specified switch/case fall-throughs. According to https://gcc.gnu.org/gcc-7/changes.html, to suppress that warning we could either use `__attribute__(fallthrough)`, which didn't seem to work for earlier GCC versions, or a `/* fallthrough /` comment just before the following `case`. Previously Citus code had the fall-through comments inside the brackets, which didn't seem to suppress the warning. Putting a `/ fallthrough */` comment outside the brackets and right before the `case` fixes the problem.	2017-07-19 11:41:59 -04:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
Brian Cloutier	72d8d2429b	Add a test for upgrading shard placements	2017-07-12 14:18:27 +02:00
Brian Cloutier	ee4edc498f	Don't release locks early in metadata functions	2017-07-12 14:18:27 +02:00
Brian Cloutier	f40f03270a	Fix locking in ReadWorkerNodes()	2017-07-12 14:18:27 +02:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Brian Cloutier	fe53fd4a8e	Remove functions created just for unit testing These functions are holdovers from pg_shard and were created for unit testing c-level functions (like InsertShardPlacementRow) which our regression tests already test quite effectively. Removing because it makes refactoring the signatures of those c-level functions unnecessarily difficult. - create_healthy_local_shard_placement_row - update_shard_placement_row_state - delete_shard_placement_row	2017-07-12 14:16:24 +02:00
Brian Cloutier	0b64bb1092	Fix typo in comment in CachedRelationLookup	2017-07-12 14:16:24 +02:00
Marco Slot	9f7e4769e2	Clarify placement connection error messages	2017-07-12 11:59:19 +02:00
Marco Slot	d3785b97c0	Remove XactModificationLevel distinction between DML and multi-shard	2017-07-12 11:59:19 +02:00
Marco Slot	710fe8666b	Use GetPlacementListConnection for router DML	2017-07-12 11:26:23 +02:00
Marco Slot	29f21fea59	Use GetPlacementListConnection for multi-shard commands	2017-07-12 11:26:22 +02:00
Marco Slot	01c9b1f921	Use GetPlacementListConnection for router SELECTs	2017-07-12 11:26:22 +02:00
Marco Slot	63676f5d65	Allow choosing a connection for multiple placements with GetPlacementListConnection	2017-07-12 11:26:22 +02:00
Jason Petersen	9018e698ec	Indentation cleanup Uncrustify 0.65 appears to have changed some defaults, resulting in breakages for those of us who have already upgraded; Travis still uses Uncrustify 0.64, but these changes work with both versions (assuming appropriately updated config), so this should permit use of either version for the time being.	2017-07-11 15:59:28 -06:00
Burak Yucesoy	c8b9e4011b	Remove LockRelationDistributionMetadata function	2017-07-10 15:46:37 +03:00
Burak Yucesoy	cb6070c720	Use ShareUpdateExclusiveLock instead ShareLock in VACUUM Before this change, we used ShareLock to acquire lock on distributed tables while running VACUUM. This makes VACUUM and INSERT block each other. With this change we changed lock mode from ShareLock to ShareUpdateExclusiveLock, which does not conflict with the locks INSERT acquire.	2017-07-10 15:46:19 +03:00
Murat Tuncer	2a4eada150	Replace duplicate code and call check_functions_in_node (#1478 ) MasterIrreducibleExpressionWalker has a copied code from function check_functions_in_node() which was available with PG 9.6+. Now PG 9.5 support is dropped we can remove duplicate code and directly call check_functions_in_node().	2017-07-07 10:19:33 +03:00
Marco Slot	31debc96e3	Handle implicit casts in prepared INSERTs	2017-07-06 16:17:35 +02:00
Andres Freund	3461244539	Don't wait for statement completion when aborting coordinated transaction. Previously we used ForgetResults() in StartRemoteTransactionAbort() - that's problematic because there might still be an ongoing statement, and this causes us to wait for its completion. That e.g. happens when a statement running on the coordinator is cancelled.	2017-07-04 14:46:03 -07:00
Andres Freund	0d791f6740	Cancel statements when closing connection at transaction end. That's important because the currently running statement on a worker might continue to hold locks and consume resources, even after the connection is closed. Unfortunately postgres will only notice closed connections when reading from / writing to the network. That might only happen much later.	2017-07-04 14:46:03 -07:00
Andres Freund	be8677f926	Add NonblockingForgetResults(). This is very similar to ForgetResults() except that no network IO is performed. Primarily useful in error handling cases.	2017-07-04 14:46:03 -07:00
Andres Freund	24153fae5d	Add ShutdownConnection() which cancels statement before closing connection. That's primarily useful in error cases, where we want to make sure locks etc held by commands running on workers are released promptly.	2017-07-04 14:46:03 -07:00
Andres Freund	75a7ddea0d	Always use connections in non-blocking mode. Now that there's no blocking libpq callers left, default to using non-blocking mode in connection_management.c. This has two advantages: 1) Blockiness doesn't have to frequently be reset, simplifying code 2) Prevents accidental use of blocking libpq functions, since they'll frequently return 'need IO'	2017-07-04 14:46:03 -07:00
Andres Freund	90a2d13a64	Move multi_copy.c to interrupt aware libpq wrappers.	2017-07-04 14:46:03 -07:00
Andres Freund	21c25abbb1	Move multi_client_executor to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	ddb0651967	Move citus tools to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	c674bc8640	Add interrupt aware PQputCopy{End,Data} wrappers.	2017-07-04 12:38:52 -07:00

... 3 4 5 6 7 ...

985 Commits (6656592b8e4daebf3b4795644794715517c8caff)