Fixes #463
OIDs of user-defined types may differ between the master and worker nodes. This causes errors
when sending data between nodes in binary copy format, because the binary copy format adds the
OID of the element type when it appears in an array. The code adding the OID is in PostgreSQL
itself, therefore we cannot change it. Instead, we decided to use the text format when sending
an array of a user-defined type.
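A minimal sketch of the affected case (type, table, and file names are hypothetical):
```
-- An array of a user-defined type on a hash-distributed table.
CREATE TYPE order_status AS ENUM ('new', 'shipped', 'done');
CREATE TABLE orders (order_id bigint, statuses order_status[]);
SELECT master_create_distributed_table('orders', 'order_id', 'hash');
SELECT master_create_worker_shards('orders', 4, 1);
-- Loading data previously used the binary copy format, which embeds the
-- master's OID for order_status inside each array value; if a worker had
-- assigned a different OID to the type, the copy failed. Arrays of
-- user-defined types are now sent in text format instead.
COPY orders FROM '/tmp/orders.csv' WITH (FORMAT csv);
```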
It turns out some tests exercised this behavior, but removing it should
have no ill effects. Besides, both COPY and INSERT disallow NULLs in a
table's partition column.
Fixes a bug where anti-joins on hash-partitioned distributed tables
would incorrectly prune shards early, resulting in incorrect results (test
included).
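For illustration, an anti-join of the affected shape, assuming two hash-distributed tables named orders and lineitem:
```
-- Find orders with no matching lineitem rows; shard pruning for this shape
-- previously happened too early and dropped shards it should have kept.
SELECT o.o_orderkey
FROM orders o
LEFT JOIN lineitem l ON (l.l_orderkey = o.o_orderkey)
WHERE l.l_orderkey IS NULL;
```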
The upcoming RETURNING support would otherwise require too much
duplication. This contains most of the pieces required for RETURNING
support, except removing the planner checks and adjusting regression
test output.
The old targetlist hasn't been used so far, but the upcoming RETURNING
support relies on it.
This also allows us to get rid of some crufty code in
multi_executor.c:multi_ExecutorStart(), which used the worker query's
targetlist instead of the main statement's (which didn't have one up to
now).
The targetlist contains TargetEntry nodes wrapping expressions, not
expressions directly. That didn't matter so far, but with the upcoming
RETURNING support, the targetlist is inspected to build a TupleDesc.
ExecCleanTypeFromTL hits an assert when looking at something that's not
a TargetEntry.
Mark the entry as resjunk, so it's not actually used.
As postgres's generic .l -> .c Makefile rule uses ifdef - which is
evaluated early, not during rule evaluation - we have to override the
rule, in addition to the detection of FLEX in the previous commit.
Fixes: #439
The only way we re-raise an error is if the raiseError flag is true, so
we might as well purge the connection in that block rather than
independently checking errorLevel.
Fixes #78
With this change, it is possible to append a table in any schema to a shard. The
master_append_table_to_shard function now supports schema-qualified table names.
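An illustrative call, with a hypothetical shard id, schema, and worker node:
```
-- Append the contents of a table in a non-public schema to an existing shard.
SELECT master_append_table_to_shard(102010, 'staging.daily_events', 'worker-node-1', 5432);
```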
For CITUS_RTE_RELATION type fragments, reloading shardIntervals from the
database is rather expensive. So store a pointer to the full shard
interval, instead of just the shard id. There are no new memory lifetime
hazards here, because we already passed around a pointer to the
shardInterval's ->shardId field.
The plan time for the query in issue #607 goes from 2889 ms to 106 ms
with this change.
By far the most expensive part of ShardIntervalsOverlap() is computing
the function to use to determine overlap. Luckily we already have that
computed and cached.
The plan time for the query in issue #607 goes from 8764 ms to 2889 ms
with this change.
- Added 2 more schedules for task-tracker and multi-binary
  instead of running multi_schedule 3 times
- Set task-tracker-delay for each long-running schedule
Fixes #550, fixes #545
If a table name contains special characters, it needs to be escaped. However, in some
cases we escaped the table name before appending the shardId, which caused syntax errors
in the queries sent to worker nodes. With this change we now append the shardId before
escaping table names.
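A hypothetical illustration of the ordering:
```
-- A table whose name needs quoting when referenced in worker queries.
CREATE TABLE "local-events" (key int, value text);
-- Escaping first and appending the shard id afterwards produced fragments like
--   "local-events"_102010   -- rejected by the worker's parser
-- Appending the shard id before escaping now yields
--   "local-events_102010"   -- a valid quoted identifier
```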
This checkin removes variant files we needed
due to differences in outputs of pg94 and pg95 runs.
However, the variant file for the multi_upsert test stays,
since that file tests a feature that does not exist in pg94
and the outputs are drastically different.
Now copies all column references in the count distinct aggregate
to the worker target list and group by. The master target list is
also updated to reflect changes in attribute order.
Fixes #569
Fixes #496
Previously we did not check whether a table was foreign while creating empty
shards, and set the storage type to 't' (standard table) or 'c' (columnar table). Now,
if the table is a foreign table (but not a CStore foreign table) we set its storage
type to 'f' (foreign table). If it is a CStore foreign table, we set its storage
type to 'c', i.e. columnar tables take priority over foreign tables.
Please note that 'c' is only used for CStore tables, not for other possible
columnar stores at the moment. A possible improvement could be checking for other
columnar stores, though I am not sure whether there is a way to do that for all
of them.
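A sketch of the resulting metadata, assuming file_fdw and hypothetical object names:
```
-- A plain (non-CStore) foreign table, distributed by append.
CREATE EXTENSION file_fdw;
CREATE SERVER file_server FOREIGN DATA WRAPPER file_fdw;
CREATE FOREIGN TABLE events_fdw (key int, value text)
    SERVER file_server OPTIONS (filename '/tmp/events.csv', format 'csv');
SELECT master_create_distributed_table('events_fdw', 'key', 'append');
SELECT master_create_empty_shard('events_fdw');
-- The new shard's storage type is now recorded as 'f'; CStore foreign tables
-- keep 'c', and regular tables keep 't'.
SELECT shardid, shardstorage FROM pg_dist_shard
WHERE logicalrelid = 'events_fdw'::regclass;
```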
Since we now short-circuit on certain remote errors, we want to ensure
we preserve the old behavior of not modifying any placement states if
a non-short-circuiting error occurs on all placements.
There's not a ton of documentation about what CONTEXT lines should look
like, but this seems to be the dominant pattern. Similarly, users
should expect strings that are lowercase and do not end with a period.
Fixes #271
This change sets ShardIds and JobIds for each test case. Before this change,
when a new test that somehow incremented Job or Shard IDs was added,
the tests after the new test had to be updated.
ShardID and JobID sequences are set at the beginning of each file with the
following commands:
```
ALTER SEQUENCE pg_catalog.pg_dist_shardid_seq RESTART 290000;
ALTER SEQUENCE pg_catalog.pg_dist_jobid_seq RESTART 290000;
```
ShardIds and JobIds are multiples of 10000. Exceptions are:
- multi_large_shardid: shardid and jobid sequences are set to much larger values
- multi_fdw_large_shardid: same as above
- multi_join_pruning: set differently because it would otherwise cause a race
  condition with multi_hash_pruning, since the two are run in parallel.
Fixes #302
Since our previous syntax did not allow creating hash-partitioned tables,
some of the earlier tests manually changed the partition method to hash to
be able to test it. With this change we remove the unnecessary workaround and
create hash-distributed tables instead. In some tests the metadata was also
created manually; this change fixes that as well.
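For example, instead of distributing a table and then flipping the partition method in the metadata by hand, tests can now create hash-distributed tables directly (table name hypothetical):
```
-- Old workaround (sketch):
--   UPDATE pg_dist_partition SET partmethod = 'h'
--   WHERE logicalrelid = 'events'::regclass;
-- New approach:
SELECT master_create_distributed_table('events', 'key', 'hash');
SELECT master_create_worker_shards('events', 16, 2);
```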
Fixes #475
With this change we prevent adding the ONLY clause to queries prepared for
worker nodes. When we add the ONLY clause, we may miss inherited tables that
users created manually on the worker nodes.
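For illustration, assume a shard relation events_102010 on a worker that a user has extended with an inheritance child:
```
-- Run on the worker by the user:
CREATE TABLE events_102010_archive () INHERITS (events_102010);
-- Queries Citus sends for that shard no longer read
--   SELECT ... FROM ONLY events_102010
-- but
--   SELECT ... FROM events_102010
-- so rows from events_102010_archive are included as well.
```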
When executing queries with citus.task_executor = 'real-time', query
execution could, so far, spend a significant amount of time
sleeping. That's because we were
a) sleeping after several phases of query execution, even if we're not
waiting for network IO
b) sleeping for a fixed amount of time when waiting for network IO;
often a lot longer than actually required.
Just reducing the amount of time slept isn't a real solution, because
that just increases CPU usage.
Instead have the real-time executor's ManageTaskExecution return whether
a task is currently being processed, waiting for reads or writes, or
failed. When all tasks are waiting for IO, use poll() to wait for IO
readiness.
That requires slightly redefining how connection timeouts are handled:
before, we counted the number of times ManageTaskExecution() was called
and compared that with the timeout divided by the task check
interval. If processing of tasks took a while, that could significantly
increase the time until a timeout occurred. Because it was based on
ManageTaskExecution() being called at a constant interval, this approach
isn't feasible anymore. Instead, measure the actual time since
connection establishment was started. That could in theory, if task
processing takes a very long time, lead to only a few passes over
PQconnectPoll().
The problem of sleeping too much also exists for the 'task-tracker'
executor, but is generally less problematic there, as processing the
individual tasks usually will take longer. That said, for e.g. the
regression tests it'd be helpful to use a similar approach.
Single table repartition subqueries now support count(distinct column)
and count(distinct (case when ...)) expressions. The repartition query
extracts the columns used in the aggregate expression and adds them to its
target list and group by list; the master query stays the same (count(distinct ...)),
but the attribute numbers inside the aggregate expression are modified to
reflect the changes in the repartition query.
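Illustrative queries of the newly supported shapes, assuming the standard lineitem test table and a plan that goes through repartitioning:
```
-- count(distinct column)
SELECT l_partkey, count(DISTINCT l_orderkey)
FROM lineitem
GROUP BY l_partkey;

-- count(distinct (case when ...))
SELECT l_partkey,
       count(DISTINCT CASE WHEN l_quantity > 10 THEN l_orderkey END)
FROM lineitem
GROUP BY l_partkey;
```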
Now, master_create_empty_shard() will place shards according to the
value of citus.shard_placement_policy; this also makes the default
placement round-robin instead of random.
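An illustrative use, assuming an existing append-distributed table named events:
```
-- Placements for the new shard follow the placement policy; the default is
-- now round-robin rather than random.
SET citus.shard_placement_policy TO 'round-robin';
SELECT master_create_empty_shard('events');
```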
Fixes #10
This change creates a new UDF: master_modify_multiple_shards
Parameters:
modify_query: A simple DELETE or UPDATE query as a string.
The UDF is similar to the existing master_apply_delete_command UDF.
Basically, given the modify query, it prunes the shard list, reconstructs
the query for each shard, and sends the query to the placements.
Depending on the value of citus.multi_shard_commit_protocol, the commit
can be done in a one-phase or two-phase manner.
Limitations:
* It cannot be called inside a transaction block
* It can only be called with simple operator expressions (like Single Shard Modify)
Sample Usage:
```
SELECT master_modify_multiple_shards(
'DELETE FROM customer_delete_protocol WHERE c_custkey > 500 AND c_custkey < 600');
```
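For example, to have the UDF commit with two-phase commit (using the same sample table):
```
-- Use 2PC across the modified shard placements for this call.
SET citus.multi_shard_commit_protocol TO '2pc';
SELECT master_modify_multiple_shards(
'UPDATE customer_delete_protocol SET c_acctbal = 0 WHERE c_custkey <= 500');
```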