citus

Commit Graph

Author	SHA1	Message	Date
Amos Bird	fd1b088208	Add overflow checks.	2016-06-08 10:30:03 +08:00
Amos Bird	107b926262	Eliminates the possibilities of counter overflows. This patch uses scanint8 instead of pg_atoi to make sure the affected tuples counter never gets overflow.	2016-06-08 10:30:03 +08:00
Burak Yücesoy	323f1151e0	Fix wrong storage type for foreign tables Fixes #496 Previously we do not check whether table is foreign or not while creating empty shards, and set storage type to 't'(Standard table) or 'c'(Columnar table). Now if the table is foreign table(but not CStore foreign table) we set storage type to 'f'(Foreign table). If it is CStore foreign table, we set its storage type to 'c', i.e. columnar table have priority over foreign table. Please note that 'c' is only used for CStore tables not for other possible columnar stores at the moment. Possible improvement could be checking for other columnar stores, though I am not sure if there is a way to check it for all other columnar stores.	2016-06-08 04:12:01 +03:00
Jason Petersen	a19520b9bd	Add back test for INSERT where all placements fail Since we now short-circuit on certain remote errors, we want to ensure we preserve the old behavior of not modifying any placement states if a non-short-circuiting error occurs on all placements.	2016-06-07 13:21:23 -06:00
Jason Petersen	48f4e5d1a5	Make ReportRemoteError's CONTEXT style-compliant There's not a ton of documentation about what CONTEXT lines should look like, but this seems like the most dominant pattern. Similarly, users should expect lowercase, non-period strings.	2016-06-07 12:47:16 -06:00
Jason Petersen	9ba02928ac	Refactor ReportRemoteError to remove boolean arg Broke it into two explicitly-named functions instead: WarnRemoteError and ReraiseRemoteError.	2016-06-07 12:38:32 -06:00
Metin Doslu	7d0c90b398	Fail fast on constraint violations in router executor	2016-06-07 18:11:17 +03:00
Metin Doslu	15eed396b3	Update ereport format	2016-06-07 15:58:32 +03:00
Metin Doslu	28a16beba7	Update only shard length on statistics update for hash-partitioned Update only the shard length on master_update_shard_statistics() call for hash-partitioned tables. Fixes #519.	2016-06-07 15:04:29 +03:00
Eren	5512bb359a	Set Explicit ShardId/JobId In Regression Tests Fixes #271 This change sets ShardIds and JobIds for each test case. Before this change, when a new test that somehow increments Job or Shard IDs is added, then the tests after the new test should be updated. ShardID and JobID sequences are set at the beginning of each file with the following commands: ``` ALTER SEQUENCE pg_catalog.pg_dist_shardid_seq RESTART 290000; ALTER SEQUENCE pg_catalog.pg_dist_jobid_seq RESTART 290000; ``` ShardIds and JobIds are multiples of 10000. Exceptions are: - multi_large_shardid: shardid and jobid sequences are set to much larger values - multi_fdw_large_shardid: same as above - multi_join_pruning: Causes a race condition with multi_hash_pruning since they are run in parallel.	2016-06-07 14:32:44 +03:00
Murat Tuncer	360e884de1	Add enable_ddl_propagation flag to control automatic ddl propagation	2016-06-06 13:42:46 +03:00
Murat Tuncer	20ba0f72a6	Change equality operator check for operator expressions	2016-06-06 12:34:16 +03:00
Burak Yücesoy	2f096cad74	Update regression tests where metadata edited manually Fixes #302 Since our previous syntax did not allow creating hash partitioned tables, some of the previous tests manually changed partition method to hash to be able to test it. With this change we remove unnecessary workaround and create hash distributed tables instead. Also in some tests metadata was created manually. With this change we also fixed this issue.	2016-06-04 13:50:42 +00:00
Burak Yucesoy	5db357eb1a	Remove ONLY clause from worker queries Fixes #475 With this change we prevent addition of ONLY clause to queries prepared for worker nodes. When we add ONLY clause we may miss the inherited tables in worker nodes created by users manually.	2016-06-03 11:42:43 +03:00
Andres Freund	3dac0a4d14	Rely less on remote_task_check_interval. When executing queries with citus.task_executor = 'real-time', query execution could, so far, spend a significant amount of time sleeping. That's because we were a) sleeping after several phases of query execution, even if we're not waiting for network IO b) sleeping for a fixed amount of time when waiting for network IO; often a lot longer than actually required. Just reducing the amount of time slept isn't a real solution, because that just increases CPU usage. Instead have the real-time executor's ManageTaskExecution return whether a task is currently being processed, waiting for reads or writes, or failed. When all tasks are waiting for IO use poll() to wait for IO readyness. That requires to slightly redefine how connection timeouts are handled: before we counted the number of times ManageTaskExecution() was called, and compared that with the timeout divided by the task check interval. That, if processing of tasks took a while, could significantly increase the time till a timeout occurred. Because it was based on the ManageTaskExecution() being called on a constant interval, this approach isn't feasible anymore. Instead measure the actual time since connection establishment was started. That could in theory, if task processing takes a very long time, lead to few passes over PQconnectPoll(). The problem of sleeping too much also exists for the 'task-tracker' executor, but is generally less problematic there, as processing the individual tasks usually will take longer. That said, for e.g. the regression tests it'd be helpful to use a similar approach.	2016-06-02 12:11:16 -06:00
Metin Doslu	d4c4eaa9ff	Move master_update_shard_statistics() to pg_catalog Fixes #546	2016-06-02 10:52:47 +03:00
Jason Petersen	e774f22ed4	Fix formatting Checking in citus_indent output.	2016-05-27 15:13:28 -06:00
Amos Bird	92788a0d9c	Remove redundant implementations of error funcs. This patch does some basic cleaning jobs. It removes duplicated implementations of ReportRemoteError() and related ones and adjusts regression tests.	2016-05-27 15:12:59 -06:00
Jason Petersen	c0c71cb8c5	Merge branch credativ:reproducible cr: @jasonmp85	2016-05-27 12:45:55 -06:00
Matthew Seaman	332c322b4f	Add inet includes for htonl and htons funtions Needed to fix FreeBSD builds.	2016-05-27 12:36:12 -06:00
Murat Tuncer	2b0d6473b9	Add complex distinct count support for repartitioned subqueries Single table repartition subqueries now support count(distinct column) and count(distinct (case when ...)) expressions. Repartition query extracts column used in aggregate expression and adds them to target list and group by list, master query stays the same (count (distinct ...)) but attribute numbers inside the aggregate expression is modified to reflect changes in repartition query.	2016-05-27 15:43:05 +03:00
Metin Doslu	afa74ce5ca	Make master_create_empty_shard() aware of the shard placement policy Now, master_create_empty_shard() will create shards according to the value of citus.shard_placement_policy which also makes default round-robin instead of random.	2016-05-27 15:05:53 +03:00
eren	132d9212d0	ADD master_modify_multiple_shards UDF Fixes #10 This change creates a new UDF: master_modify_multiple_shards Parameters: modify_query: A simple DELETE or UPDATE query as a string. The UDF is similar to the existing master_apply_delete_command UDF. Basically, given the modify query, it prunes the shard list, re-constructs the query for each shard and sends the query to the placements. Depending on the value of citus.multi_shard_commit_protocol, the commit can be done in one-phase or two-phase manner. Limitations: * It cannot be called inside a transaction block * It only be called with simple operator expressions (like Single Shard Modify) Sample Usage: ``` SELECT master_modify_multiple_shards( 'DELETE FROM customer_delete_protocol WHERE c_custkey > 500 AND c_custkey < 500'); ```	2016-05-26 17:30:35 +03:00
Burak Yucesoy	0e71ffd937	Fix #469 This change renames one of the ReceiveRegularFile functions with more descriptive name.	2016-05-26 12:03:36 +03:00
Christoph Berg	7df82baf46	Sort list of objects in src/backend/distributed/Makefile Make's $(wildcard) does not sort the glob result, but returns filenames in filesystem ordering. This makes the build result vary and hence unreproducible on the binary level. Fix by adding $(sort). Spotted by Debian's reproducible builds project.	2016-05-18 10:42:20 +02:00
Jason Petersen	4ca4f10966	Add multi_copy test outputs to gitignore	2016-05-10 13:36:56 -06:00
Jason Petersen	61b6394e4b	Add gitignore rules for latest install files Got tired of dirty git tree.	2016-05-10 11:57:11 -06:00
Marco Slot	1b4fbc76e2	Add JSON/XML validation to EXPLAIN regression tests and fix issues	2016-05-06 11:30:07 +02:00
Lukas Fittl	2f694f7af3	Distributed EXPLAIN: Generate valid JSON output. This modifies the EXPLAIN output functions to actually generate valid JSON output when (FORMAT JSON) is being used. Fixes #494.	2016-05-05 12:48:01 +02:00
Onder Kalaci	d7fd56df89	Fix check-full failures This commit fixes failures happen during check-full. The change does make clean seperation of executor types in certain places to keep the outputs stable.	2016-05-05 12:28:22 +03:00
Andres Freund	5f282dd241	Stamp 5.1 release.	2016-05-04 18:05:41 -07:00
Andres Freund	4d7bcfdd35	Generate extension versions from the previous one.	2016-05-04 18:05:41 -07:00
Onder Kalaci	38da3c826b	Fix compile time warning This change fixes a compile time warning related to definition/declaration order of the code.	2016-05-04 09:42:10 +03:00
Marco Slot	845aebfe19	Remove costs from explain regression tests	2016-05-03 22:11:23 +02:00
Metin Doslu	866271b765	Add COPY support on worker nodes for append partitioned relations Now, we can copy to an append-partitioned distributed relation from any worker node by providing master options such as; COPY relation_name FROM file_path WITH (delimiter '\|', master_host 'localhost', master_port 5432); where master_port is optional and default is 5432.	2016-05-03 16:00:00 +03:00
Marco Slot	0c140cf333	Add deprecation warning to copy_to_distributed_table	2016-05-03 14:08:42 +02:00
Brian Cloutier	58535eb337	Query Planning Performance Improvments (#474 ) - Only look at pruned shards when determining AnchorTable - Use cached shardIntervalCompareFunction during copartition check	2016-05-03 10:48:46 +03:00
Marco Slot	24a74fb0ae	Remove spurious intermediate regression test files	2016-05-02 12:30:15 +02:00
Jason Petersen	510783f84f	Force bad connections in tests by closing sockets Based on Andres' suggestion, I removed SetConnectionStatus, moving its functionality directly into set_connection_status_bad, which now simply shuts down the socket underlying a particular connection. This keeps the functionality as-is while removing our questionable use of internal libpq headers.	2016-04-29 15:56:04 -07:00
Marco Slot	fc4f23065a	Add EXPLAIN for simple distributed queries	2016-04-30 00:11:02 +02:00
eren	7e19ebe679	FIX "mixed declarations and code" Warning in multi_physical_planner.c Fixes #477 This change fixes the compile time warning message in BuildMapMergeJob in multi_physical_planner.c about mixed declarations and code. Basically, the problematic declaration is moved up so that no expression is before it.	2016-04-29 11:18:04 +03:00
Brian Cloutier	0036eb3253	Allow references to columns in UPDATE statements (#472 ) Allow references to columns in UPDATE statements Queries like "UPDATE tbl SET column = column + 1" are now allowed, so long as you don't use any IMMUTABLE functions.	2016-04-28 05:45:16 -07:00
eren	ab240a7d4c	Rename copy_transaction_manager This change renames the distributed transaction manager parameter from citus.copy_transaction_manager to citus.multi_shard_commit_protocol. Distributed transaction manager has been used only by the COPY on hash partitioned tables but it can be used by upcoming features so, we needed to rename so that its name do not contain a reference to COPY. The change also includes renames like transaction_manager_options to commit_protocol_options and TRANSACTION_MANAGER_1PC to COMMIT_PROTOCOL_1PC. With this change, declaration of MultiShardCommitProtocol (was CopyTransactionManager) is moved from multi_copy.c to multi_transaction.c.	2016-04-28 15:12:50 +03:00
Andres Freund	0ce1e3ddaf	Perform permission checks on operations re-implemented by citus. Currently that's just COPY FROM. There's other places where we could check for permissions earlier (to fail less verbosely), but since there's other pending changes in the whole DDL area, which is affected by this, I'm just adding a note to those places.	2016-04-27 10:28:36 -07:00
Andres Freund	758a70a8ff	Create new shards as owned the distributed table's owner. That's important because ownership of relations implies special privileges. Without this change, a distributed table can be accessible by a table's owner, but a shard created by another user might not.	2016-04-27 10:28:33 -07:00
Andres Freund	3a264db2fe	Add ReplicateGrantStmt(). This is the basis for coordinating GRANT/REVOKE across nodes.	2016-04-27 10:28:25 -07:00
Andres Freund	7c281fbe07	Add pg_get_table_grants() function and support extending GRANTs.	2016-04-27 10:28:25 -07:00
Andres Freund	eae65404d0	Grant SELECT for pg_catalog.pg_dist* to PUBLIC. Given pg_class et al. are readable by everyone there's little point in restricting read only access to citus catalogs.	2016-04-27 10:28:25 -07:00
Andres Freund	a5b3dcddb3	Run some commands as superuser to allow normal users to execute queries. Some small parts of citus currently require superuser privileges; which is obviously not desirable for production scenarios. Run these small parts under superuser privileges (we use the extension owner) to avoid that. This does not yet coordinate grants between master and workers. Thus it allows to create shards, load data, and run queries as a non-superuser, but it is not easily possible to allow differentiated accesses to several users.	2016-04-27 10:28:22 -07:00
Andres Freund	25615ee9d7	Add CitusExtensionOwner(), to execute some priviledged operations under. There exist some operations we have to execute with elevated privileges. The most expedient user for that is the user owning the citusdb extension.	2016-04-27 10:26:08 -07:00

1 2 3

126 Commits (fd1b088208ca48d88ed6520df7023330bdee1461)