citus

Commit Graph

Author	SHA1	Message	Date
Brian Cloutier	62e7bdbdd6	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Andres Freund	9ebc46d15c	Initialize count_agg_clauses argument to 0. count_agg_clause adds the cost of the aggregates to the state variable, it doesn't reinitialize it. That is intentional, as it is used to incrementally add costs in some places.	2016-10-03 13:07:43 -07:00
Robin Thomas	1e80d27585	During repartitions, the partitionColumnType argument sent to workers is now a `::regtype` using the qualified name of the column type, not the column type OID which may differ between master/worker nodes. Test coverage of a hash repartition using a UDT as the join column. Note that the UDFs `worker_hash_partition_table` and `worker_range_partition_table` are unchanged, and rightly expect an OID for the column type; but the planner code building the commands now allows for `::regtype` casting to do its magic. Fixes citusdata/citus#111.	2016-10-03 13:41:20 -04:00
Onder Kalaci	727bed9d69	Differentiate worker and master job temporary folders This commit makes it possible to create different worker and master temporary folders. This change is important for citus-mx on task-tracker execution. In simple words, on citus-mx, the worker could actually be responsible for the master tasks as well. Prior to this change, both master and worker logic on task-tracker executor was accessing and using the same files for different purposes, which was dangerous in certain cases (i.e., when task_tracker_delay is low).	2016-10-03 14:24:08 +03:00
Marco Slot	2dfe17b75e	Make count return 0 if all shards are pruned away Before this change, count on a distributed table returned NULL if all shards were pruned away, because on the master we replace the count(..) call with a sum(..) call to sum the counts from the shards. However, sum returns NULL when there are no rows, whereas count is expected to return 0.	2016-09-29 20:27:26 +02:00
Murat Tuncer	ba3d035b23	Make where false queries router plannable	2016-09-28 18:49:26 +03:00
Marco Slot	a2276adcd2	Fix segmentation fault in case of joins with WHERE 1=0	2016-09-26 15:12:29 +02:00
Marco Slot	575bc99be5	Allow noop updates of the partition column	2016-09-07 14:22:41 +02:00
Metin Doslu	60d67a39f1	Add outer join clause list extraction for subquery pushdown logic In subquery pushdown, we allow outer joins if the join condition is on the partition columns. WhereClauseList() used to return all join conditions including outer joins. However, this has been changed with a commit related to outer join support on regular queries. With this commit, we refactored ExtractFromExpressionWalker() to return two lists of qualifiers. The first list is for inner join and filter clauses and the second list is for outer join clauses. Therefore, we can also use outer join clauses to check subquery pushdown prerequisites.	2016-09-02 11:54:44 +03:00
Robin Thomas	475a6245bf	Remove all usage of pg_dist_shard.shardalias in extension code. (#739) Remove regression test of non-null shardalias.	2016-08-19 17:06:22 +03:00
Burak Yucesoy	0a2c940ae5	Remove schema name parameter from API functions We remove schema name parameter from worker_fetch_foreign_file and worker_fetch_regular_table functions. We now send schema name concatenated with table name.	2016-07-28 20:41:05 +03:00
Burak Yucesoy	98025110f0	Add old version(without schema name parameter) of api functions back Fixes #676 We added old versions (i.e. without schema name) of worker_apply_shard_ddl_command, worker_fetch_foreign_file and worker_fetch_regular_table back. During function call of one of these functions, we set schema name as public schema and call the newer version of the functions.	2016-07-28 20:40:38 +03:00
Murat Tuncer	992997b8ad	Expand router planner coverage We can now support a richer set of queries in router planner. This allows us to support CTEs, joins, window functions, subqueries if they are known to be executed at a single worker with a single task (all tables are filtered down to a single shard and a single worker contains all table shards referenced in the query). Fixes : #501	2016-07-27 23:35:38 +03:00
Murat Tuncer	719e44d1f4	Remove PostgreSQL 9.4 support	2016-07-26 20:16:09 +03:00
Murat Tuncer	461fefbdb2	Fix outer join crash when subquery is flattened	2016-07-22 17:01:19 +03:00
Burak Yucesoy	444d4eb558	Fix worker_fetch_regular_table with schema Fixes #504 Fixes #646 We changed signature of worker_fetch_regular_table to accept schema name as parameter to make it work with schemas.	2016-07-22 00:44:02 -06:00
Burak Yucesoy	7df5a265c7	Fix COUNT DISTINCT approximation with schema Fixes #555 Before this change, we were resolving HLL function and type Oid without qualified name. Now we find the schema name where HLL objects are stored and generate qualified names for each object. Similar fix is also applied for cstore_table_size function call.	2016-07-21 17:29:18 +03:00
Murat Tuncer	eae7f79a8b	Make router planner use original query	2016-07-18 18:23:04 +03:00
Eren	c92c81b550	Add LIMIT/OFFSET Support Fixes #394 This change adds LIMIT/OFFSET support for non router-plannable distributed queries. In cases that we can push the LIMIT down, we add the OFFSET value to that LIMIT in the worker queries. When a query with LIMIT x OFFSET y is issued, the query is propagated to the workers as LIMIT (x+y) OFFSET 0, and on the master table, the original LIMIT and OFFSET values are used. With this change, we can use OFFSET wherever we can use LIMIT.	2016-07-18 12:00:24 +03:00
Andres Freund	bafafcd1bf	citus_indent fixups	2016-07-13 11:45:51 -07:00
Brian Cloutier	728eefcf2b	Simplify code and fix include guards in citus_clauses	2016-07-13 11:45:51 -07:00
Brian Cloutier	9a5e529f6f	cosmetic changes	2016-07-13 11:45:51 -07:00
Brian Cloutier	c46cb19cda	Only reparse queries if the planner flags them for reparsing	2016-07-13 11:45:51 -07:00
Brian Cloutier	d792c0af4d	citus_indent and some renaming	2016-07-13 11:45:51 -07:00
Brian Cloutier	e73b4ac026	Evaluate functions on the master - Enables using VOLATILE functions (like nextval()) in INSERT queries - Enables using STABLE functions (like now()) in targetLists and joinTrees UPDATE and INSERT can now contain non-immutable functions. INSERT can contain any kind of expression, while UPDATE can contain any STABLE function, so long as a Var is not passed into the STABLE function, even indirectly. UPDATE TargetEntry's can now also include Vars. There's an exception, CASE/COALESCE statements may not contain mutable functions. Function calls in master_modify_multiple_shards are also evaluated.	2016-07-13 11:45:51 -07:00
Jason Petersen	9157ac9f10	Remove hash-pruning logic for NULL values It turns out some tests exercised this behavior, but removing it should have no ill effects. Besides, both copy and INSERT disallow NULLs in a table's partition column. Fixes a bug where anti-joins on hash-partitioned distributed tables would incorrectly prune shards early, resulting in incorrect results (test included).	2016-07-06 17:04:21 -06:00
Andres Freund	586f738bc7	Support RETURNING for modification commands. Fixes: #242	2016-07-01 13:07:12 -07:00
Andres Freund	c9505a47ab	Remember original targetlist in MultiQueryContainerNode(). The old targetlist wasn't used so far, but the upcoming RETURNING support relies on it. This also allows to get rid of some crufty code in multi_executor.c:multi_ExecutorStart(), which used the worker query's targetlist instead of the main statement's (which didn't have one up to now).	2016-07-01 12:50:12 -07:00
Andres Freund	63fcd4a505	Fix definition of faux targetlist element inserted to prevent backward scans. The targetlist contains TargetEntrys containing expressions, not expressions directly. That didn't matter so far, but with the upcoming RETURNING support, the targetlist is inspected to build a TupleDesc. ExecCleanTypeFromTL hits an assert when looking at something that's not a TargetEntry. Mark the entry as resjunk, so it's not actually used.	2016-07-01 12:50:12 -07:00
Murat Tuncer	e86b4b397c	Refactor multi_planner to create router plan directly If router plan creation fails, it falls back to normal planner	2016-06-21 12:50:21 +03:00
Andres Freund	acb36b4505	Store ShardInterval instead of shardId in RangeTableFragments. For CITUS_RTE_RELATION type fragments, reloading shardIntervals from the database is rather expensive. So store a pointer to the full shard interval, instead of just the shard id. There are no new memory lifetime hazards here, because we already passed a pointer to the shardInterval's ->shardId field around. The plan time for the query in issue #607 goes from 2889 ms to 106 ms with this change.	2016-06-16 17:31:35 -07:00
Andres Freund	1e07a94435	Use cached comparator in ShardIntervalsOverlap(). By far the most expensive part of ShardIntervalsOverlap() is computing the function to use to determine overlap. Luckily we already have that computed and cached. The plan time for the query in issue #607 goes from 8764 ms to 2889 ms with this change.	2016-06-16 17:21:19 -07:00
Marco Slot	f15ec5554c	Do not copy outer join clauses into WHERE	2016-06-16 16:42:32 -07:00
Eren	ae5687e726	Eliminate compile time warnings in multi_logical_optimizer.c This change removes some issues about mixed declarations and code in TablePartitioningSupportsDistinct() and WorkerExtendedOpNode() functions.	2016-06-10 12:27:12 +03:00
Murat Tuncer	315b7f3e4c	Fix crash in count distinct with filters in repartition subqueries now copies all column references in count distinct aggregate to worker target list and group by. Master target list is also updated to reflect changes in attribute order. Fixes 569	2016-06-09 11:47:24 +03:00
Murat Tuncer	41096f2076	Change equality operator check for operator expressions	2016-06-06 12:34:16 +03:00
Burak Yucesoy	15f55cb675	Remove ONLY clause from worker queries Fixes #475 With this change we prevent addition of ONLY clause to queries prepared for worker nodes. When we add ONLY clause we may miss the inherited tables in worker nodes created by users manually.	2016-06-03 11:42:43 +03:00
Murat Tuncer	9167373f54	Add complex distinct count support for repartitioned subqueries Single table repartition subqueries now support count(distinct column) and count(distinct (case when ...)) expressions. Repartition query extracts columns used in the aggregate expression and adds them to target list and group by list, master query stays the same (count (distinct ...)) but attribute numbers inside the aggregate expression are modified to reflect changes in repartition query.	2016-05-27 15:43:05 +03:00
eren	793cb2d004	ADD master_modify_multiple_shards UDF Fixes #10 This change creates a new UDF: master_modify_multiple_shards Parameters: modify_query: A simple DELETE or UPDATE query as a string. The UDF is similar to the existing master_apply_delete_command UDF. Basically, given the modify query, it prunes the shard list, re-constructs the query for each shard and sends the query to the placements. Depending on the value of citus.multi_shard_commit_protocol, the commit can be done in one-phase or two-phase manner. Limitations: * It cannot be called inside a transaction block * It can only be called with simple operator expressions (like Single Shard Modify) Sample Usage: ``` SELECT master_modify_multiple_shards( 'DELETE FROM customer_delete_protocol WHERE c_custkey > 500 AND c_custkey < 500'); ```	2016-05-26 17:30:35 +03:00
Marco Slot	d333c49280	Add JSON/XML validation to EXPLAIN regression tests and fix issues	2016-05-06 11:30:07 +02:00
Lukas Fittl	19e71b5271	Distributed EXPLAIN: Generate valid JSON output. This modifies the EXPLAIN output functions to actually generate valid JSON output when (FORMAT JSON) is being used. Fixes #494.	2016-05-05 12:48:01 +02:00
Onder Kalaci	38a1092687	Fix compile time warning This change fixes a compile time warning related to definition/declaration order of the code.	2016-05-04 09:42:10 +03:00
Brian Cloutier	5962c9b7c8	Query Planning Performance Improvements (#474) - Only look at pruned shards when determining AnchorTable - Use cached shardIntervalCompareFunction during copartition check	2016-05-03 10:48:46 +03:00
Marco Slot	cfbdbe29a9	Add EXPLAIN for simple distributed queries	2016-04-30 00:11:02 +02:00
eren	9dc6f6b2e2	FIX "mixed declarations and code" Warning in multi_physical_planner.c Fixes #477 This change fixes the compile time warning message in BuildMapMergeJob in multi_physical_planner.c about mixed declarations and code. Basically, the problematic declaration is moved up so that no expression is before it.	2016-04-29 11:18:04 +03:00
Brian Cloutier	38fdb01b91	Allow references to columns in UPDATE statements (#472) Queries like "UPDATE tbl SET column = column + 1" are now allowed, so long as you don't use any IMMUTABLE functions.	2016-04-28 05:45:16 -07:00
Andres Freund	99e983433f	Run some commands as superuser to allow normal users to execute queries. Some small parts of citus currently require superuser privileges; which is obviously not desirable for production scenarios. Run these small parts under superuser privileges (we use the extension owner) to avoid that. This does not yet coordinate grants between master and workers. Thus it allows to create shards, load data, and run queries as a non-superuser, but it is not easily possible to allow differentiated accesses to several users.	2016-04-27 10:28:22 -07:00
Andres Freund	3dae284bbe	Use the current session's username when connecting to worker nodes. So far we've always used libpq defaults when connecting to workers; bar special environment variables being set that'll always be the user that started the server. That's not desirable because it prevents using users with fewer privileges. Thus change the various APIs creating connections to workers to always use usernames. That means: 1) MultiClientConnect() needs to, optionally, accept a username 2) GetOrEstablishConnection(), including the underlying cache, need to use the current user as part of the connection cache key. That way connections for separate users are distinct, and we always use one with the correct authorization. 3) The task tracker needs to keep track of the username associated with a task, so it can use it when establishing connections outside the originating session.	2016-04-27 10:00:08 -07:00
Onder Kalaci	c763d7492c	Apply final code review feedback - Fix o(n^2) loop to o(n) - Collapse two if statements into a single one - Some coding conventions feedback	2016-04-27 10:36:03 +03:00
Onder Kalaci	876730ad73	Fix Merge Conflict This commit fixes merge conflicts.	2016-04-26 11:18:47 +03:00


74 Commits (62e7bdbdd67a64002877d73746c974373e4d3be7)
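
The LIMIT/OFFSET rewrite described in commit c92c81b550 can be illustrated with a small simulation. This is only a sketch under stated assumptions, not Citus code: the function names (`worker_limit`, `run_limit_offset`) and the list-of-lists shard model are hypothetical, and the query is assumed to be ordered so that pushing the LIMIT down is valid. Each worker is asked for `LIMIT (x + y) OFFSET 0`, and the original `LIMIT x OFFSET y` is applied once to the merged result on the master.

```python
from typing import List


def worker_limit(limit: int, offset: int) -> int:
    """Limit pushed to each worker: LIMIT (x + y) OFFSET 0 (hypothetical helper)."""
    return limit + offset


def run_limit_offset(shards: List[List[int]], limit: int, offset: int) -> List[int]:
    """Simulate ORDER BY value LIMIT x OFFSET y over several shards."""
    pushed = worker_limit(limit, offset)
    # Each worker sorts its own rows and returns at most (x + y) of them.
    partials = [sorted(rows)[:pushed] for rows in shards]
    # The master merges the partial results and applies the original
    # LIMIT and OFFSET values exactly once.
    merged = sorted(row for part in partials for row in part)
    return merged[offset:offset + limit]


# Three shards holding the values 1..30, split round-robin.
shards = [list(range(start, 31, 3)) for start in (1, 2, 3)]
result = run_limit_offset(shards, limit=3, offset=4)
print(result)
```

Applying the OFFSET on the workers would be incorrect, because the rows to skip are only known after merging; asking every worker for `x + y` rows guarantees that the master has enough rows to skip `y` and still return `x`.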