citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	053f10d91c	Fix list length lookup in WorkerGetLiveNodeCount	2017-04-29 02:13:20 +02:00
Burak Yucesoy	edd69310fd	Fix check-vanilla tests It semms that GEQO optimizations, when it is set to on, create their own memory context and free it after when it is no longer necessary. In join multi_join_restriction_hook we allocate our variables in the CurrentMemoryContext, which is GEQO's memory context if it is active. To prevent deallocation of our variables when GEQO's memory context is freed, we started to allocate memory fo these variables in separate MemoryContext.	2017-04-29 01:55:18 +02:00
Marco Slot	97d36b7dfe	Check whether relation ID exists in citus_relation_size	2017-04-29 01:39:39 +02:00
Andres Freund	f6ef7f2c03	Faster shard pruning. So far citus used postgres' predicate proofing logic for shard pruning, except for INSERT and COPY which were already optimized for speed. That turns out to be too slow: * Shard pruning for SELECTs is currently O(#shards), because PruneShardList calls predicate_refuted_by() for every shard. Obviously using an O(N) type algorithm for general pruning isn't good. * predicate_refuted_by() is quite expensive on its own right. That's primarily because it's optimized for doing a single refutation proof, rather than performing the same proof over and over. * predicate_refuted_by() does not keep persistent state (see 2.) for function calls, which means that a lot of syscache lookups will be performed. That's particularly bad if the partitioning key is a composite key, because without a persistent FunctionCallInfo record_cmp() has to repeatedly look-up the type definition of the composite key. That's quite expensive. Thus replace this with custom-code that works in two phases: 1) Search restrictions for constraints that can be pruned upon 2) Use those restrictions to search for matching shards in the most efficient manner available: a) Binary search / Hash Lookup in case of hash partitioned tables b) Binary search for equal clauses in case of range or append tables without overlapping shards. c) Binary search for inequality clauses, searching for both lower and upper boundaries, again in case of range or append tables without overlapping shards. d) exhaustive search testing each ShardInterval My measurements suggest that we are considerably, often orders of magnitude, faster than the previous solution, even if we have to fall back to exhaustive pruning.	2017-04-28 14:40:41 -07:00
Andres Freund	2013090a77	Add DistTableCacheEntry->hasOverlappingShardInterval. This determines whether it's possible to perform binary search on sortedShardIntervalArray or not. If e.g. two shards have overlapping ranges, that'd be prohibitive. That'll be useful in later commit introducing faster shard pruning.	2017-04-28 14:40:38 -07:00
Andres Freund	15d427f931	Add DistTableCacheEntry->shardValueCompareFunction. That's useful when comparing values a hash-partitioned table is filtered by. The existing shardIntervalCompareFunction is about comparing hashed values, not unhashed ones. The added btree opclass function is so we can get a comparator back. This should be changed much more widely, but is not necessary so far.	2017-04-28 14:40:38 -07:00
Andres Freund	f3172e9719	Build DistTableCacheEntry->shardIntervalCompareFunction even for 0 shards. Previously we, unnecessarily, used a the first shard's type information to to look up the comparison function. But that information is already available, so use it. That's helpful because we sometimes want to access the comparator function even if there's no shards.	2017-04-28 14:40:38 -07:00
Andres Freund	99642306ed	Fix: Make FindShardIntervalIndex robust against 0 shards.	2017-04-28 14:40:38 -07:00
Metin Doslu	d411892fe6	Send explain queries with savepoints With this commit, we started to send explain queries within a savepoint. After running explain query, we rollback to savepoint. This saves us from side effects of EXPLAIN ANALYZE on DML queries.	2017-04-28 12:13:48 -07:00
Jason Petersen	1c353e68aa	Remove FastShardPruning method With the other simplifications, it doesn't make sense to keep around.	2017-04-27 13:32:36 -06:00
Jason Petersen	06497d74f5	Refactor FindShardInterval to use cacheEntry All callers fetch a cache entry and extract/compute arguments for the eventual FindShardInterval call, so it makes more sense to refactor into that function itself; this solves the use-after-free bug, too.	2017-04-27 13:32:36 -06:00
Andres Freund	4fe14bdeda	Some cleanup in multi_subquery test. Remove trailing whitespace and use of EXPLAIN instead of EXPLAIN (COSTS OFF).	2017-04-26 11:33:56 -07:00
Andres Freund	f064c33d5c	Add back pruning coverage lost in last commit. Because we can't rely on the debuggin message anymore, add a bunch of explain statements that roughly fulfill the same purpose.	2017-04-26 11:33:56 -07:00
Andres Freund	5b389eb6d7	Boring regression test output adjustments. Soon shard pruning will be optimized not to generally work linearly anymore. Thus we can't print the pruned shard intervals as currently done anymore. The current printing of shard ids also prevents us from running tests in parallel, as otherwise shard ids aren't linearly numbered.	2017-04-26 11:33:56 -07:00
Andres Freund	9e4ec991d8	Skip exhaustive test in CoPartitionedTables() if declared colocated. That's considerably cheaper.	2017-04-26 11:19:17 -07:00
Marco Slot	1b4ebd490d	Only process error if not NULL in StoreErrorMessage	2017-04-21 17:01:01 +02:00
Marco Slot	326f8d9d61	Use right sizeof in UpdateRelationColocationGroup	2017-04-21 16:37:09 +02:00
Burak Yucesoy	a35d0cd8af	Configure valgrind command line arguments	2017-04-21 16:30:12 +03:00
Burak Yucesoy	9312ef8bcf	Stabilize test outputs	2017-04-21 16:08:52 +03:00
Eren Basak	71d99b72ce	Add support for proper valgrind tests This change allows valgrind tests (`make check-multi-vg`) to be run seamlessly without test output errors and timeout problems.	2017-04-21 16:08:52 +03:00
Marco Slot	7d1f7b8923	Support expressions in the partition column in INSERTs	2017-04-21 14:05:52 +02:00
velioglu	a26edd2249	Implement ALTER TABLE ADD CONSTRAINT command	2017-04-20 15:02:33 +03:00
velioglu	5b3e47de7a	Log message of across shard queries according to the log level	2017-04-20 12:24:46 +03:00
velioglu	be3cdb14ea	Change native hash function with worker_hash	2017-04-19 22:16:55 +03:00
Jason Petersen	f999bcd7ca	Enable distributed ALTER TABLE ... RENAME COLUMN Pretty straightforward. Had some concerns about locking, but due to the fact that all distributed operations use either some level of deparsing or need to enumerate column names, they all block during any concurrent column renames (due to the AccessExclusive lock). In addition, I had some misgivings about permitting renames of the dis- tribution column, but nothing bad comes from just allowing them. Finally, I tried to trigger any sort of error using prepared statements and could not trigger any errors not also exhibited by plain PostgreSQL tables.	2017-04-18 22:47:48 -06:00
Marco Slot	0f63edc5b4	Add basic read-only transaction tests	2017-04-18 11:42:33 +02:00
Marco Slot	53899946e7	Remove redundant pg_dist_jobid_seq restarts in tests	2017-04-18 11:42:32 +02:00
Marco Slot	d7a5f6997c	Set citus.enable_unique_job_ids in tests with job ID in output	2017-04-18 11:42:32 +02:00
Marco Slot	c7603215dd	Stop using a sequence to generate unique job IDs	2017-04-18 11:31:51 +02:00
Burak Yucesoy	58a809b0e8	Set default value of isactive to true With this change, we set to default value of isactive column to true so that upgrading users all nodes will be marked as active to not break their environment.	2017-04-18 09:40:44 +03:00
Burak Yucesoy	5aefe20725	Fix node copy error Instead of directly returning heap tuple obtained from heap scan we return copied version of it.	2017-04-17 19:38:18 +03:00
Metin Doslu	f45c2c43b5	Fix table in name in prepared statement regression tests	2017-04-17 16:17:30 +02:00
Marco Slot	a4c98727be	Support UPDATE/DELETE with parameterised partition column qual	2017-04-17 16:17:30 +02:00
Marco Slot	2fbe546ddd	Support query parameters in combination with function evaluation	2017-04-17 15:40:55 +02:00
Marco Slot	ccc796cf66	Create indexes after worker_append_table_to_shard during shard repair	2017-04-17 15:17:21 +02:00
Burak Yucesoy	d58cb416a4	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Burak Yucesoy	cd5dc2693d	Error out on parameterized SQL functions Before this commit, we were erroring out for queries containing parameterized SQL functions like 'SELECT parameterized_sql_query(value)' as we should, however we were returning wrong results for queries like 'SELECT * FROM parameterized_sql_query(value)'. With this commit we started to error out on such queries too.	2017-04-13 16:36:24 +03:00
Onder Kalaci	6c9296aca0	Remove uninstantiated qual logic, use attribute equivalences In this PR, we aim to deduce whether each of the RTE_RELATION is joined with at least on another RTE_RELATION on their partition keys. If each RTE_RELATION follows the above rule, we can conclude that all RTE_RELATIONs are joined on their partition keys. In order to do that, we invented a new equivalence class namely: AttributeEquivalenceClass. In very simple words, a AttributeEquivalenceClass is identified by an unique id and consists of a list of AttributeEquivalenceMembers. Each AttributeEquivalenceMember is designed to identify attributes uniquely within the whole query. The necessity of this arise since varno attributes are defined within a single level of a query. Instead, here we want to identify each RTE_RELATION uniquely and try to find equality among each RTE_RELATION's partition key. Whenever we find an equality clause A = B, where both A and B originates from relation attributes (i.e., not random expressions), we create an AttributeEquivalenceClass to record this knowledge. If we later find another equivalence B = C, we create another AttributeEquivalenceClass. Finally, we can apply transitity rules and generate a new AttributeEquivalenceClass which includes A, B and C. Note that equality among the members are identified by the varattno and rteIdentity. Each equality among RTE_RELATION is saved using an AttributeEquivalenceClass where each member attribute is identified by a AttributeEquivalenceMember. In the final step, we try generate a common attribute equivalence class that holds as much as AttributeEquivalenceMembers whose attributes are a partition keys.	2017-04-13 11:51:26 +03:00
velioglu	584c0c34a3	Change checks with built-in type	2017-04-11 14:41:37 +03:00
velioglu	5ba77d8abb	Check binary output function of type.	2017-04-10 16:28:09 +03:00
Jason Petersen	fc2c23f15a	Use RESET for GUC test, not reconnect More limited in what it does, better test.	2017-04-04 16:40:17 -06:00
Jason Petersen	b0a8d9da34	Add comments, use strncmp, clean up GUC desc. Good to go!	2017-04-04 16:16:49 -06:00
Jason Petersen	2ce82abb04	Clean up remaining error messages Added details and hints, based off of similar PostgreSQL scenarios.	2017-04-04 16:11:59 -06:00
Jason Petersen	41612177be	Clean up ErrorIfUnstableCreateOrAlterExtensionStmt Swaps an Assert in for an ereport, and adds details and hints to the error message to help users with a possibly confusing scenario.	2017-04-04 15:58:57 -06:00
Jason Petersen	0e6e42c59a	Refactor utility-skip/extn-check code This was getting pretty long and complex in the context of the main utility hook. Moved out the checks for what should skip Citus process- ing and what should have version checks performed.	2017-04-04 15:07:22 -06:00
Burak Yucesoy	66a801dd4e	Add enable_version_checks GUC and address feedback	2017-04-04 19:11:13 +03:00
Jason Petersen	0707f262b6	Self-implemented review feedback The use of a bare src/ rather than $srcdir caused configure to fail during VPATH builds. With our additional dependency upon AWK, we need to call AC_PROG_AWK, otherwise environments may not have $AWK set. Finally, citus_version.h should be in .gitignore.	2017-04-03 22:55:12 -06:00
Burak Yucesoy	63b232e4ba	Error out if binary citus version does not match installed extension With this change, we start to error out if loaded citus binaries does not match the available major version or installed citus extension version. In this case we force user to restart the server or run ALTER EXTENSION depending on the situation	2017-04-03 17:36:13 -06:00
Jason Petersen	963090fe05	Address review feedback Should just about do it.	2017-04-03 11:44:57 -06:00
Jason Petersen	afe6908e26	Improve CONCURRENTLY-related error messages Thought this looked slightly nicer than the default behavior. Changed preventTransaction to concurrent to be clearer that this code path presently affects CONCURRENTLY code only.	2017-04-03 11:19:15 -06:00

1 2 3 4 5 ...

566 Commits (053f10d91c9e1bf166d6837f285a0c48c5b09104)