citus

Commit Graph

Author	SHA1	Message	Date
Jason Petersen	41ed433b0e	Remove hash-pruning logic for NULL values It turns out some tests exercised this behavior, but removing it should have no ill effects. Besides, both copy and INSERT disallow NULLs in a table's partition column. Fixes a bug where anti-joins on hash-partitioned distributed tables would incorrectly prune shards early, resulting in incorrect results (test included).	2016-07-06 17:04:21 -06:00
Andres Freund	38f4722f6f	Add tests for LEFT JOIN ON clauses preventing matches left/right.	2016-06-16 16:53:02 -07:00
Marco Slot	52bc209c37	Do not copy outer join clauses into WHERE	2016-06-16 16:42:32 -07:00
Burak Yucesoy	4a718d293b	Append shardId before escaping the table name Fixes #550, fixes #545 If table name contains special characters, it needs to be escaped. However in some cases, we escape table name before appending shardId, which causes syntax error in the queries sent to worker nodes. With this change we now append shardId before escaping table names.	2016-06-15 04:15:40 +03:00
Murat Tuncer	0db413491c	Fix crash in count distinct with filters in repartition subqueries now copies all column references in count distinct aggregate to worker target list and group by. Master target list is also updated to reflect changes in attribute order. Fixes 569	2016-06-09 11:47:24 +03:00
Metin Doslu	28a16beba7	Update only shard length on statistics update for hash-partitioned Update only the shard length on master_update_shard_statistics() call for hash-partitioned tables. Fixes #519.	2016-06-07 15:04:29 +03:00
Eren	5512bb359a	Set Explicit ShardId/JobId In Regression Tests Fixes #271 This change sets ShardIds and JobIds for each test case. Before this change, adding a new test that incremented Job or Shard IDs meant that every test after it had to be updated. ShardID and JobID sequences are set at the beginning of each file with the following commands: ``` ALTER SEQUENCE pg_catalog.pg_dist_shardid_seq RESTART 290000; ALTER SEQUENCE pg_catalog.pg_dist_jobid_seq RESTART 290000; ``` ShardIds and JobIds are multiples of 10000. Exceptions are: - multi_large_shardid: shardid and jobid sequences are set to much larger values - multi_fdw_large_shardid: same as above - multi_join_pruning: Causes a race condition with multi_hash_pruning since they are run in parallel.	2016-06-07 14:32:44 +03:00
Murat Tuncer	360e884de1	Add enable_ddl_propagation flag to control automatic ddl propagation	2016-06-06 13:42:46 +03:00
Burak Yücesoy	2f096cad74	Update regression tests where metadata edited manually Fixes #302 Since our previous syntax did not allow creating hash partitioned tables, some of the previous tests manually changed partition method to hash to be able to test it. With this change we remove unnecessary workaround and create hash distributed tables instead. Also in some tests metadata was created manually. With this change we also fixed this issue.	2016-06-04 13:50:42 +00:00
Murat Tuncer	2b0d6473b9	Add complex distinct count support for repartitioned subqueries Single table repartition subqueries now support count(distinct column) and count(distinct (case when ...)) expressions. Repartition query extracts the columns used in the aggregate expression and adds them to the target list and group by list; the master query stays the same (count (distinct ...)), but the attribute numbers inside the aggregate expression are modified to reflect the changes in the repartition query.	2016-05-27 15:43:05 +03:00
Metin Doslu	afa74ce5ca	Make master_create_empty_shard() aware of the shard placement policy Now, master_create_empty_shard() will create shards according to the value of citus.shard_placement_policy, which also makes round-robin the default instead of random.	2016-05-27 15:05:53 +03:00
eren	132d9212d0	Add master_modify_multiple_shards UDF Fixes #10 This change creates a new UDF: master_modify_multiple_shards Parameters: modify_query: A simple DELETE or UPDATE query as a string. The UDF is similar to the existing master_apply_delete_command UDF. Basically, given the modify query, it prunes the shard list, re-constructs the query for each shard and sends the query to the placements. Depending on the value of citus.multi_shard_commit_protocol, the commit can be done in a one-phase or two-phase manner. Limitations: * It cannot be called inside a transaction block * It can only be called with simple operator expressions (like Single Shard Modify) Sample Usage: ``` SELECT master_modify_multiple_shards( 'DELETE FROM customer_delete_protocol WHERE c_custkey > 500 AND c_custkey < 500'); ```	2016-05-26 17:30:35 +03:00
Metin Doslu	866271b765	Add COPY support on worker nodes for append partitioned relations Now, we can copy to an append-partitioned distributed relation from any worker node by providing master options such as: COPY relation_name FROM file_path WITH (delimiter '|', master_host 'localhost', master_port 5432); where master_port is optional and defaults to 5432.	2016-05-03 16:00:00 +03:00
Marco Slot	fc4f23065a	Add EXPLAIN for simple distributed queries	2016-04-30 00:11:02 +02:00
Brian Cloutier	7b1dc0d511	Support count(distinct) on hash partitioned tables Also add test to ensure we get the same results when running count(distinct) on range and hash partitioned tables.	2016-04-20 04:54:07 -07:00
eren	448527c3af	Fix JOINs on varchar columns with subquery pushdown Fixes #379 Varchar VAR struct is wrapped in RELABELTYPE struct inside PostgreSQL code and IsPartitionColumnRecursive function considers only VAR types, so it returns false for varchar. This change adds a strip_implicit_coercions() call to the columnExpression in IsPartitionColumnRecursive function so that implicit coercions like RELABELTYPE are stripped to VAR.	2016-04-19 21:55:50 -06:00
eren	1ffc30d7f5	Fix Shard Pruning Problem With Subqueries on VARCHAR Partition Columns Fixes #375 Prior to this change, shard pruning couldn't be done if: - Table is hash-distributed - Partition column is VARCHAR - Query to be pruned is a subquery There were two problems: - A bug in left-side/right-side checks for the partition column - We were not considering relabeled types (VARCHAR was relabeled as TEXT)	2016-04-19 21:55:50 -06:00
Metin Doslu	132a77f992	Add COPY support on master node for append partitioned relations	2016-04-19 21:57:59 +03:00
Metin Doslu	1150ce6414	Send COPY rows in binary format	2016-04-12 20:22:31 +02:00
Marco Slot	d25ee8fbd8	Support for COPY FROM, based on pg_shard PR by Postgres Pro	2016-04-12 20:22:31 +02:00
Andres Freund	53309461cb	Improve DDL replication related regression tests. The previous form of the test, utilizing DEBUG2, included too much output dependent on the specific system and version. Reformulate it to explicitly connect to workers and show the schema there, when necessary. The only remaining difference in some of the remaining alternate regression test files was due to an older minor version release change. Remove those as well.	2016-03-17 16:05:54 -07:00
Marco Slot	75a141a7c6	Merge remote-tracking branch 'origin/master' into feature/drop_shards_on_drop_table	2016-02-17 22:52:58 +01:00
Jason Petersen	ec1e74e7f9	Change tests to use default staging policy The default staging policy is now round-robin, though tests were still configured to use local-first. Testing with the shipping default seems like the best option, correctness-wise, and since local-first has some issues with OSes where connecting from localhost doesn't always resolve to 'localhost', just going with the default is a win-win.	2016-02-17 11:03:17 -07:00
Marco Slot	52f11223e5	Drop shards when a distributed table is dropped After this change, shards and associated metadata are automatically dropped when running DROP TABLE on a distributed table, which fixes #230. It also adds schema support for master_apply_delete_command, which fixes #73. Dropping the shards happens in the master_drop_all_shards UDF, which is called from the SQL_DROP trigger. Inside the trigger, the table is no longer visible and calling master_apply_delete_command directly wouldn't work and oid <-> name mappings are not available. The master_drop_all_shards function therefore takes the relation id, schema name, and table name as parameters, which can be obtained from pg_event_trigger_dropped_objects() in the SQL_DROP trigger. If the user calls master_drop_all_shards while the table still exists, the schema name and table name are ignored. Author: Marco Slot Reviewed-By: Andres Freund	2016-02-16 10:54:29 +01:00
Murat Tuncer	55c44b48dd	Changed product name to citus All citusdb references in - extension, binary names - file headers - all configuration name prefixes - error/warning messages - some functions names - regression tests are changed to be citus.	2016-02-15 16:04:31 +02:00
Onder Kalaci	136306a1fe	Initial commit of Citus 5.0	2016-02-11 04:05:32 +02:00
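
Commit 4a718d293b above describes appending the shardId before escaping the table name. A minimal Python sketch of why the order matters, assuming shard names take the form `<table>_<shardId>`. The helper `quote_identifier` and both `shard_name_*` functions are hypothetical names for illustration, not taken from the Citus source, and the helper always double-quotes, whereas PostgreSQL's own quoting only adds quotes when needed:

```python
def quote_identifier(name: str) -> str:
    """Hypothetical helper: wrap a SQL identifier in double quotes,
    doubling any embedded double quote."""
    return '"' + name.replace('"', '""') + '"'


def shard_name_escape_first(table_name: str, shard_id: int) -> str:
    # Order described as buggy: escape, then append the shard suffix.
    # The suffix ends up outside the quotes, which is not a valid identifier.
    return f"{quote_identifier(table_name)}_{shard_id}"


def shard_name_append_first(table_name: str, shard_id: int) -> str:
    # Order described as the fix: append the shard suffix, then escape
    # the whole name so the suffix stays inside the quotes.
    return quote_identifier(f"{table_name}_{shard_id}")


if __name__ == "__main__":
    print(shard_name_escape_first("my table", 102008))
    print(shard_name_append_first("my table", 102008))
```

With a table name containing a space, the first function yields a quoted name followed by a bare suffix, while the second yields a single quoted identifier that a worker node could parse.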

126 Commits (6f1a8dfdbe9323aa7fcd3d34696828a277eed23e)