citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	e58d76c5f6	Fix assert failure in bare SELECT FROM reference table FOR UPDATE in MX	2019-09-23 17:00:09 +02:00
Hadi Moshayedi	d24cefd055	Set active snapshot before SyncMetadataToNodes().	2019-09-19 09:00:25 -07:00
Hadi Moshayedi	d2f2acc4b2	Make master_update_node citus-ha friendly.	2019-09-18 09:32:54 -07:00
Hadi Moshayedi	76f3933b05	Add metadatasynced, and sync on master_update_node() Co-authored-by: pykello <hadi.moshayedi@microsoft.com> Co-authored-by: serprex <serprex@users.noreply.github.com>	2019-09-18 09:32:54 -07:00
Philip Dubé	492d1b2cba	ActivePrimaryNodeList: add lockMode parameter	2019-09-13 17:44:56 +00:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used by how schema's get created right before we are going to create a table in that schema, for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is benificial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is; you have an existing postgres server that needs to scale out. By adding the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Lets assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 AS (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schema's on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types are depending them self on other objects they will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types, when later asked for the ddl commands for array types we return `NIL` (empty list) which makes that the object will not be recorded as distributed, (its an internal type, dependant on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after running locally makes that we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (eg. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulate changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulate `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. Enum types can not be changed transactionally. When the execution on a worker fails a warning will be shown to the user the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Jelte Fennema	e4cfea3751	Correctly add schema when distributing sequence definitons Fixes 2958	2019-09-13 17:19:35 +02:00
Jelte Fennema	389086102a	Refactor 9 argument function to use a struct (#2952 ) For another PR I needed to add another column which would require to add another argument to an already 9 argument function signature. In this case it would be a boolean flag and there were already two boolean flags in there. In my experience it becomes really easy to mess up the order of these flags at that point. Especially because the type system doesn't distinguish between the 3 different booleans with completely different meanings. So I refactored these signatures to receive a struct containing most of these arguments. Like that you don't mess up orderening, because the meaning of the boolean is not order dependent but fieldname dependent. It also makes it possible to set good shared defaults for this struct.	2019-09-13 15:49:53 +02:00
Hadi Moshayedi	48ff4691a0	Return nodeid instead of record in some UDFs	2019-09-12 12:46:21 -07:00
Jelte Fennema	eb7e45d556	Make LookupNodeForGroup extern	2019-09-12 16:40:25 +02:00
Philip Dubé	41dca121e2	Support GENERATE ALWAYS AS STORED	2019-09-04 14:50:17 +00:00
Nils Dijk	936d546a3c	Refactor Ensure Schema Exists to Ensure Dependecies Exists (#2882 ) DESCRIPTION: Refactor ensure schema exists to dependency exists Historically we only supported schema's as table dependencies to be created on the workers before a table gets distributed. This PR puts infrastructure in place to walk pg_depend to figure out which dependencies to create on the workers. Currently only schema's are supported as objects to create before creating a table. We also keep track of dependencies that have been created in the cluster. When we add a new node to the cluster we use this catalog to know which objects need to be created on the worker. Side effect of knowing which objects are already distributed is that we don't have debug messages anymore when creating schema's that are already created on the workers.	2019-09-04 14:10:20 +02:00
Philip Dubé	28d964240f	Remove CheckForUpdates https://reports.citusdata.com/v1/releases/latest We haven't updated the version CheckForUpdates sees since 7.1.0	2019-09-03 21:11:25 +00:00
Philip Dubé	da00c62eea	create_distributed_table: include COLLATE on columns	2019-08-29 14:22:54 +00:00
Jelte Fennema	cbecf97c84	Move tuplestore setup to a helper function (#2898 ) * Add tuplestore helpers * More detailed error messages in tuplestore * Add CreateTupleDescCopy to SetupTuplestore * Use new SetupTuplestore helper function * Remove unnecessary copy * Remove comment about undefined behaviour	2019-08-27 09:11:08 +02:00
Philip Dubé	eba3828ef7	ColocatedShardIntervalList: sort	2019-08-26 17:42:41 +00:00
Philip Dubé	e5cd298a98	pg12 revised layout of FunctionCallInfoData See `a9c35cf85c` clang raises a warning due to FunctionCall2InfoData technically being variable sized This is fine, as the struct is the size we want it to be. So silence the warning	2019-08-22 19:02:35 +00:00
Philip Dubé	be3285828f	Collations matter for hashing strings in pg12 See https://www.postgresql.org/docs/12/collation.html#COLLATION-NONDETERMINISTIC	2019-08-22 18:58:37 +00:00
Philip Dubé	018ad1c58e	pg12: version_compat.h, tuples, oids, misc	2019-08-22 18:57:23 +00:00
Philip Dubé	68c4b71f93	Fix up includes with pg12 changes	2019-08-22 18:56:21 +00:00
Philip Dubé	fbc3e346e8	ruleutils_12.c Produced this file by copying ruleutils_11.c, then comparing postgres ruleutils.c changes between REL_11_STABLE & REL_12_STABLE	2019-08-22 18:56:05 +00:00
Hadi Moshayedi	6be1bacddd	Fix distributed deadlock for TRUNCATE	2019-08-22 11:03:53 -07:00
Hadi Moshayedi	a5b087c89b	Support FKs between reference tables	2019-08-21 16:11:27 -07:00
Hadi Moshayedi	c582eb89c8	Add some missing locks.	2019-08-15 12:34:31 -07:00
Philip Dubé	cd951fa9ca	Avoid multiple pg_dist_colocation records being created for reference tables master_deactivate_node is updated to decrement the replication factor Otherwise deactivation could have create_reference_table produce a second record UpdateColocationGroupReplicationFactor is renamed UpdateColocationGroupReplicationFactorForReferenceTables & the implementation looks up the record based on distributioncolumntype == InvalidOid, rather than by id Otherwise the record's replication factor fails to be maintained when there are no reference tables	2019-08-13 17:21:02 +00:00
Philip Dubé	3982b4635f	CompareShardIntervals: if intervals are equal, compare id. Works around sort being unstable	2019-07-26 16:13:36 +00:00
Philip Dubé	0915027389	DistributedPlan: replace operation with modLevel This causes no behaviorial changes, only organizes better to implement modifying CTEs Also rename ExtactInsertRangeTableEntry to ExtractResultRelationRTE, as the source of this function didn't match the documentation Remove Task's upsertQuery in favor of ROW_MODIFY_NONCOMMUTATIVE Split up AcquireExecutorShardLock into more internal functions Tests: Normalize multi_reference_table multi_create_table_constraints	2019-07-16 13:58:18 -07:00
Nils Dijk	791cc26a86	Fix an issue with subquery map merge jobs as non-root Also automated all manual tests around multi user isolation for internal citus udf's automate upgrade_to_reference_table tests add negative tests for lock_relation_if_exists add tests for permissions on worker_cleanup_job_schema_cache add tests for worker_fetch_partition_file add tests for worker_merge_files_into_table fix problem with worker_merge_files_and_run_query when run as non-super user and add tests for behaviour	2019-07-10 12:40:05 +02:00
Marco Slot	97334ff1ec	Copy WorkerNode before returning in FindWorkerNode	2019-07-05 09:35:53 +02:00
Hadi Moshayedi	47aa95d00d	Fix a NULL dereference.	2019-07-03 16:26:49 -07:00
Önder Kalacı	40da78c6fd	Introduce the adaptive executor (#2798 ) With this commit, we're introducing the Adaptive Executor. The commit message consists of two distinct sections. The first part explains how the executor works. The second part consists of the commit messages of the individual smaller commits that resulted in this commit. The readers can search for the each of the smaller commit messages on https://github.com/citusdata/citus and can learn more about the history of the change. /------------------------------------------------------------------------- * adaptive_executor.c * * The adaptive executor executes a list of tasks (queries on shards) over * a connection pool per worker node. The results of the queries, if any, * are written to a tuple store. * * The concepts in the executor are modelled in a set of structs: * * - DistributedExecution: * Execution of a Task list over a set of WorkerPools. * - WorkerPool * Pool of WorkerSessions for the same worker which opportunistically * executes "unassigned" tasks from a queue. * - WorkerSession: * Connection to a worker that is used to execute "assigned" tasks * from a queue and may execute unasssigned tasks from the WorkerPool. * - ShardCommandExecution: * Execution of a Task across a list of placements. * - TaskPlacementExecution: * Execution of a Task on a specific placement. * Used in the WorkerPool and WorkerSession queues. * * Every connection pool (WorkerPool) and every connection (WorkerSession) * have a queue of tasks that are ready to execute (readyTaskQueue) and a * queue/set of pending tasks that may become ready later in the execution * (pendingTaskQueue). The tasks are wrapped in a ShardCommandExecution, * which keeps track of the state of execution and is referenced from a * TaskPlacementExecution, which is the data structure that is actually * added to the queues and describes the state of the execution of a task * on a particular worker node. * * When the task list is part of a bigger distributed transaction, the * shards that are accessed or modified by the task may have already been * accessed earlier in the transaction. We need to make sure we use the * same connection since it may hold relevant locks or have uncommitted * writes. In that case we "assign" the task to a connection by adding * it to the task queue of specific connection (in * AssignTasksToConnections). Otherwise we consider the task unassigned * and add it to the task queue of a worker pool, which means that it * can be executed over any connection in the pool. * * A task may be executed on multiple placements in case of a reference * table or a replicated distributed table. Depending on the type of * task, it may not be ready to be executed on a worker node immediately. * For instance, INSERTs on a reference table are executed serially across * placements to avoid deadlocks when concurrent INSERTs take conflicting * locks. At the beginning, only the "first" placement is ready to execute * and therefore added to the readyTaskQueue in the pool or connection. * The remaining placements are added to the pendingTaskQueue. Once * execution on the first placement is done the second placement moves * from pendingTaskQueue to readyTaskQueue. The same approach is used to * fail over read-only tasks to another placement. * * Once all the tasks are added to a queue, the main loop in * RunDistributedExecution repeatedly does the following: * * For each pool: * - ManageWorkPool evaluates whether to open additional connections * based on the number unassigned tasks that are ready to execute * and the targetPoolSize of the execution. * * Poll all connections: * - We use a WaitEventSet that contains all (non-failed) connections * and is rebuilt whenever the set of active connections or any of * their wait flags change. * * We almost always check for WL_SOCKET_READABLE because a session * can emit notices at any time during execution, but it will only * wake up WaitEventSetWait when there are actual bytes to read. * * We check for WL_SOCKET_WRITEABLE just after sending bytes in case * there is not enough space in the TCP buffer. Since a socket is * almost always writable we also use WL_SOCKET_WRITEABLE as a * mechanism to wake up WaitEventSetWait for non-I/O events, e.g. * when a task moves from pending to ready. * * For each connection that is ready: * - ConnectionStateMachine handles connection establishment and failure * as well as command execution via TransactionStateMachine. * * When a connection is ready to execute a new task, it first checks its * own readyTaskQueue and otherwise takes a task from the worker pool's * readyTaskQueue (on a first-come-first-serve basis). * * In cases where the tasks finish quickly (e.g. <1ms), a single * connection will often be sufficient to finish all tasks. It is * therefore not necessary that all connections are established * successfully or open a transaction (which may be blocked by an * intermediate pgbouncer in transaction pooling mode). It is therefore * essential that we take a task from the queue only after opening a * transaction block. * * When a command on a worker finishes or the connection is lost, we call * PlacementExecutionDone, which then updates the state of the task * based on whether we need to run it on other placements. When a * connection fails or all connections to a worker fail, we also call * PlacementExecutionDone for all queued tasks to try the next placement * and, if necessary, mark shard placements as inactive. If a task fails * to execute on all placements, the execution fails and the distributed * transaction rolls back. * * For multi-row INSERTs, tasks are executed sequentially by * SequentialRunDistributedExecution instead of in parallel, which allows * a high degree of concurrency without high risk of deadlocks. * Conversely, multi-row UPDATE/DELETE/DDL commands take aggressive locks * which forbids concurrency, but allows parallelism without high risk * of deadlocks. Note that this is unrelated to SEQUENTIAL_CONNECTION, * which indicates that we should use at most one connection per node, but * can run tasks in parallel across nodes. This is used when there are * writes to a reference table that has foreign keys from a distributed * table. * * Execution finishes when all tasks are done, the query errors out, or * the user cancels the query. * ------------------------------------------------------------------------- / All the commits involved here: * Initial unified executor prototype * Latest changes * Fix rebase conflicts to master branch * Add missing variable for assertion * Ensure that master_modify_multiple_shards() returns the affectedTupleCount * Adjust intermediate result sizes The real-time executor uses COPY command to get the results from the worker nodes. Unified executor avoids that which results in less data transfer. Simply adjust the tests to lower sizes. * Force one connection per placement (or co-located placements) when requested The existing executors (real-time and router) always open 1 connection per placement when parallel execution is requested. That might be useful under certain circumstances: (a) User wants to utilize as much as CPUs on the workers per distributed query (b) User has a transaction block which involves COPY command Also, lots of regression tests rely on this execution semantics. So, we'd enable few of the tests with this change as well. * For parameters to be resolved before using them For the details, see PostgreSQL's copyParamList() * Unified executor sorts the returning output * Ensure that unified executor doesn't ignore sequential execution of DDLJob's Certain DDL commands, mainly creating foreign keys to reference tables, should be executed sequentially. Otherwise, we'd end up with a self distributed deadlock. To overcome this situaiton, we set a flag `DDLJob->executeSequentially` and execute it sequentially. Note that we have to do this because the command might not be called within a transaction block, and we cannot call `SetLocalMultiShardModifyModeToSequential()`. This fixes at least two test: multi_insert_select_on_conflit.sql and multi_foreign_key.sql Also, I wouldn't mind scattering local `targetPoolSize` variables within the code. The reason is that we'll soon have a GUC (or a global variable based on a GUC) that'd set the pool size. In that case, we'd simply replace `targetPoolSize` with the global variables. * Fix 2PC conditions for DDL tasks * Improve closing connections that are not fully established in unified execution * Support foreign keys to reference tables in unified executor The idea for supporting foreign keys to reference tables is simple: Keep track of the relation accesses within a transaction block. - If a parallel access happens on a distributed table which has a foreign key to a reference table, one cannot modify the reference table in the same transaction. Otherwise, we're very likely to end-up with a self-distributed deadlock. - If an access to a reference table happens, and then a parallel access to a distributed table (which has a fkey to the reference table) happens, we switch to sequential mode. Unified executor misses the function calls that marks the relation accesses during the execution. Thus, simply add the necessary calls and let the logic kick in. * Make sure to close the failed connections after the execution * Improve comments * Fix savepoints in unified executor. * Rebuild the WaitEventSet only when necessary * Unclaim connections on all errors. * Improve failure handling for unified executor - Implement the notion of errorOnAnyFailure. This is similar to Critical Connections that the connection managament APIs provide - If the nodes inside a modifying transaction expand, activate 2PC - Fix few bugs related to wait event sets - Mark placement INACTIVE during the execution as much as possible as opposed to we do in the COMMIT handler - Fix few bugs related to scheduling next placement executions - Improve decision on when to use 2PC Improve the logic to start a transaction block for distributed transactions - Make sure that only reference table modifications are always executed with distributed transactions - Make sure that stored procedures and functions are executed with distributed transactions * Move waitEventSet to DistributedExecution This could also be local to RunDistributedExecution(), but in that case we had to mark it as "volatile" to avoid PG_TRY()/PG_CATCH() issues, and cast it to non-volatile when doing WaitEventSetFree(). We thought that would make code a bit harder to read than making this non-local, so we move it here. See comments for PG_TRY() in postgres/src/include/elog.h and "man 3 siglongjmp" for more context. * Fix multi_insert_select test outputs Two things: 1) One complex transaction block is now supported. Simply update the test output 2) Due to dynamic nature of the unified executor, the orders of the errors coming from the shards might change (e.g., all of the queries on the shards would fail, but which one appears on the error message?). To fix that, we simply added it to our shardId normalization tool which happens just before diff. * Fix subeury_and_cte test The error message is updated from: failed to execute task To: more than one row returned by a subquery or an expression which is a lot clearer to the user. * Fix intermediate_results test outputs Simply update the error message from: could not receive query results to result "squares" does not exist which makes a lot more sense. * Fix multi_function_in_join test The error messages update from: Failed to execute task XXX To: function f(..) does not exist * Fix multi_query_directory_cleanup test The unified executor does not create any intermediate files. * Fix with_transactions test A test case that just started to work fine * Fix multi_router_planner test outputs The error message is update from: Could not receive query results To: Relation does not exists which is a lot more clearer for the users * Fix multi_router_planner_fast_path test The error message is update from: Could not receive query results To: Relation does not exists which is a lot more clearer for the users * Fix isolation_copy_placement_vs_modification by disabling select_opens_transaction_block * Fix ordering in isolation_multi_shard_modify_vs_all * Add executor locks to unified executor * Make sure to allocate enought WaitEvents The previous code was missing the waitEvents for the latch and postmaster death. * Fix rebase conflicts for master rebase * Make sure that TRUNCATE relies on unified executor * Implement true sequential execution for multi-row INSERTS Execute the individual tasks executed one by one. Note that this is different than MultiShardConnectionType == SEQUENTIAL_CONNECTION case (e.g., sequential execution mode). In that case, running the tasks across the nodes in parallel is acceptable and implemented in that way. However, the executions that are qualified here would perform poorly if the tasks across the workers are executed in parallel. We currently qualify only one class of distributed queries here, multi-row INSERTs. If we do not enforce true sequential execution, concurrent multi-row upserts could easily form a distributed deadlock when the upserts touch the same rows. * Remove SESSION_LIFESPAN flag in unified_executor * Apply failure test updates We've changed the failure behaviour a bit, and also the error messages that show up to the user. This PR covers majority of the updates. * Unified executor honors citus.node_connection_timeout With this commit, unified executor errors out if even a single connection cannot be established within citus.node_connection_timeout. And, as a side effect this fixes failure_connection_establishment test. * Properly increment/decrement pool size variables Before this commit, the idle and active connection counts were not properly calculated. * insert_select_executor goes through unified executor. * Add missing file for task tracker * Modify ExecuteTaskListExtended()'s signature * Sort output of INSERT ... SELECT ... RETURNING * Take partition locks correctly in unified executor * Alternative implementation for force_max_query_parallelization * Fix compile warnings in unified executor * Fix style issues * Decrement idleConnectionCount when idle connection is lost * Always rebuild the wait event sets In the previous implementation, on waitFlag changes, we were only modifying the wait events. However, we've realized that it might be an over optimization since (a) we couldn't see any performance benefits (b) we see some errors on failures and because of (a) we prefer to disable it now. * Make sure to allocate enough sized waitEventSet With multi-row INSERTs, we might have more sessions than taskworkerCount after few calls of RunDistributedExecution() because the previous sessions would also be alive. Instead, re-allocate events when the connectino set changes. Implement SELECT FOR UPDATE on reference tables On master branch, we do two extra things on SELECT FOR UPDATE queries on reference tables: - Acquire executor locks - Execute the query on all replicas With this commit, we're implementing the same logic on the new executor. * SELECT FOR UPDATE opens transaction block even if SelectOpensTransactionBlock disabled Otherwise, users would be very confused and their logic is very likely to break. * Fix build error * Fix the newConnectionCount calculation in ManageWorkerPool * Fix rebase conflicts * Fix minor test output differences * Fix citus indent * Remove duplicate sorts that is added with rebase * Create distributed table via executor * Fix wait flags in CheckConnectionReady * failure_savepoints output for unified executor. * failure_vacuum output (pg 10) for unified executor. * Fix WaitEventSetWait timeout in unified executor * Stabilize failure_truncate test output * Add an ORDER BY to multi_upsert * Fix regression test outputs after rebase to master * Add executor.c comment * Rename executor.c to adaptive_executor.c * Do not schedule tasks if the failed placement is not ready to execute Before the commit, we were blindly scheduling the next placement executions even if the failed placement is not on the ready queue. Now, we're ensuring that if failed placement execution is on a failed pool or session where the execution is on the pendingQueue, we do not schedule the next task. Because the other placement execution should be already running. * Implement a proper custom scan node for adaptive executor - Switch between the executors, add GUC to set the pool size - Add non-adaptive regression test suites - Enable CIRCLE CI for non-adaptive tests - Adjust test output files * Add slow start interval to the executor * Expose max_cached_connection_per_worker to user * Do not start slow when there are cached connections * Consider ExecutorSlowStartInterval in NextEventTimeout * Fix memory issues with ReceiveResults(). * Disable executor via TaskExecutorType * Make sure to execute the tests with the other executor * Use task_executor_type to enable-disable adaptive executor * Remove useless code * Adjust the regression tests * Add slow start regression test * Rebase to master * Fix test failures in adaptive executor. * Rebase to master - 2 * Improve comments & debug messages * Set force_max_query_parallelization in isolation_citus_dist_activity * Force max parallelization for creating shards when asked to use exclusive connection. * Adjust the default pool size * Expand description of max_adaptive_executor_pool_size GUC * Update warnings in FinishRemoteTransactionCommit() * Improve session clean up at the end of execution Explicitly list all the states that the execution might end, otherwise warn. * Remove MULTI_CONNECTION_WAIT_RETRY which is not used at all * Add more ORDER BYs to multi_mx_partitioning	2019-06-28 14:04:40 +02:00
Hanefi Onaldi	7e8fd49b94	Create Schemas as superuser on all shard/table creation UDFs - All the schema creations on the workers will now be via superuser connections - If a shard is being repaired or a shard is replicated, we will create the schema only in the relevant worker; and in all the other cases where a schema creation is needed, we will block operations until we ensure the schema exists in all the workers	2019-06-26 17:12:28 +02:00
Philip Dubé	84fe626378	multi_router_planner: refactor error propagation	2019-06-26 10:32:01 +02:00
Philip Dubé	9ed6dd5570	Ignore compile_commands.json, fix typo	2019-06-26 10:32:01 +02:00
Nils Dijk	5df1b49bed	Feature: optionally force master_update_node during failover (#2773 ) When `master_update_node` is called to update a node's location it waits for appropriate locks to become available. This is useful during normal operation as new operations will be blocked till after the metadata update while running operations have time to finish. When `master_update_node` is called after a node failure it is less useful to wait for running operations to finish as they can't. The lock being held indicates an operation that once attempted to commit will fail as the machine already failed. Now the downside is the failover is postponed till the termination point of the operation. This has been observed by users to take a significant amount of time causing the rest of the system to be observed unavailable. With this patch it is possible in such situations to invoke `master_update_node` with 2 optional arguments: - `force` (bool defaults to `false`): When called with true the update of the metadata will be forced to proceed by terminating conflicting backends. A cancel is not enough as the backend might be in idle time (eg. an interactive session, or going back and forth between an appliaction), therefore a more intrusive solution of termination is used here. - `lock_cooldown` (int defaults to `10000`): This is the time in milliseconds before conflicting backends are terminated. This is to allow the backends to finish cleanly before terminating them. This allows the user to set an upperbound to the expected time to complete the metadata update, eg. performing the failover. The functionality is implemented by spawning a background worker that has the task of helping a certain backend in acquiring its locks. The backend is either terminated on successful execution of the metadata update, or once the memory context of the expression gets reset, eg. on a cancel of the statement.	2019-06-21 12:03:15 +02:00
Philip Dubé	4bfcf5b665	Enable Werror for all warnings Changes to ruleutils match changes made upstream to silence gcc fallthrough warnings	2019-06-18 14:43:54 -07:00
Philip Dubé	342d423725	Fix join alias resolution FROM (query) alias ignored renaming In nested subqueries the select list would rename, while the join alias would not respect that	2019-06-12 17:25:07 -07:00
exialin	59e54de54d	Minor code clean-up	2019-05-24 14:26:26 +02:00
Murat Tuncer	3fe482adbc	Fix DistShardCacheHash initialization InitializeCaches() method may prematurely set performedInitialization without actually creating DistShardCacheHash. Fix makes sure flag is set only if DistShardCacheHash is created successfully. Also introduced a new memory context to allocate aforementioned hash tables. If allocation/initialization fails for any reason we make sure memory is reclaimed by deleting the memory context.	2019-05-15 16:47:44 +03:00
Jason Petersen	71d5d1c865	Enable variable shadowing warnings; fix all Rather than wait for another place like the previous commit to bite us, I think we should turn on this warning.	2019-04-30 13:24:25 -06:00
Nils Dijk	feaac69769	Implementation for asycn FinishConnectionListEstablishment (#2584 )	2019-03-22 17:30:42 +01:00
Jason Petersen	a2c6f596f9	Address code review comments	2019-03-21 11:59:52 -06:00
Marco Slot	5ff1821411	Cache the current database name Purely for performance reasons.	2019-03-20 12:14:46 +03:00
Marco Slot	0ea4e52df5	Add nodeId to shardPlacements and use it for shard placement comparisons Before this commit, shardPlacements were identified with shardId, nodeName and nodeport. Instead of using nodeName and nodePort, we now use nodeId since it apparently has performance benefits in several places in the code.	2019-03-20 12:14:46 +03:00
exialin	84b853e1b5	Fix some typos (#2620 )	2019-03-14 16:48:31 -07:00
Murat Tuncer	2681231c98	Create column aliases for shard tables in worker queries when requested	2019-03-07 12:54:42 +03:00
Hadi Moshayedi	f4d3b94e22	Fix some of the casts for groupId (#2609 ) A small change which partially addresses #2608.	2019-03-05 12:06:44 -08:00
Jason Petersen	3df2f51881	Turn on style-checking, fix lingering violations We'd been ignoring updating uncrustify for some time now because I'd thought these were misclassifications that would require an update in our rules to address. Turns out they're legit, so I'm checking them in.	2019-02-26 23:01:40 -07:00
Jason Petersen	339e6e661e	Remove 9.6 (#2554 ) Removes support and code for PostgreSQL 9.6 cr: @velioglu	2019-01-16 13:11:24 -07:00
Nils Dijk	694992e946	upgrade default ssl_ciphers to more restrictive on extension creation Show ssl_ciphers in ssl_by_default_test	2018-12-12 15:33:15 +01:00
velioglu	3e0cff94a6	Add FunctionOidExtended function	2018-12-10 11:59:41 +03:00
Nils Dijk	4af40eee76	Enable SSL by default during installation of citus	2018-12-07 11:23:19 -07:00
Marco Slot	70fb9c851b	Remove odd memcpy usag in BuildCachedShardList	2018-12-04 14:09:10 +01:00
Marco Slot	8893cc141d	Support INSERT...SELECT with ON CONFLICT or RETURNING via coordinator Before this commit, Citus supported INSERT...SELECT queries with ON CONFLICT or RETURNING clauses only for pushdownable ones, since queries supported via coordinator were utilizing COPY infrastructure of PG to send selected tuples to the target worker nodes. After this PR, INSERT...SELECT queries with ON CONFLICT or RETURNING clauses will be performed in two phases via coordinator. In the first phase selected tuples will be saved to the intermediate table which is colocated with target table of the INSERT...SELECT query. Note that, a utility function to save results to the colocated intermediate result also implemented as a part of this commit. In the second phase, INSERT.. SELECT query is directly run on the worker node using the intermediate table as the source table.	2018-11-30 15:29:12 +03:00
Dimitri Fontaine	5ae2d03881	Refrain from having a strong opinion on maxGroupId. When initializing a Citus formation automatically from an external piece of software such as Citus-HA, the following process process may be used: - decide on the groupId in the external software - SELECT * FROM master_add_inactive_node('localhost', 9701, groupid => X) When Citus checks for maxGroupId, it forbids other software to pick their own group Ids to ues with the master_add_inactive_node() API. This patch removes the extra testing around maxGroupId.	2018-11-28 04:29:15 +01:00
Marco Slot	e17025e1d4	Check table ownership in mark_tables_colocated	2018-11-18 00:11:38 +01:00
Marco Slot	18acd00553	Check permissions in lock_relation_if_exists	2018-11-18 00:11:38 +01:00
Marco Slot	aab9f623eb	Check table ownership in upgrade_to_reference_table	2018-11-16 23:27:34 +01:00
Marco Slot	2de8ef29c3	Revoke function permissions for node metadata functions	2018-11-15 06:25:07 +01:00
Marco Slot	f383e4f307	Description: Refactor code that handles DDL commands from one file into a module The file handling the utility functions (DDL) for citus organically grew over time and became unreasonably large. This refactor takes that file and refactored the functionality into separate files per command. Initially modeled after the directory and file layout that can be found in postgres. Although the size of the change is quite big there are barely any code changes. Only one two functions have been added for readability purposes: - PostProcessIndexStmt which is extracted from PostProcessUtility - PostProcessAlterTableStmt which is extracted from multi_ProcessUtility A README.md has been added to `src/backend/distributed/commands` describing the contents of the module and every file in the module. We need more documentation around the overloading of the COPY command, for now the boilerplate has been added for people with better knowledge to fill out.	2018-11-14 13:36:27 +01:00
Murat Tuncer	cc401a2616	Create function_utils for pg function call related utilities	2018-11-07 15:29:38 +03:00
Hadi Moshayedi	d3e284dcd6	Use heap_deform_tuple() instead of calling heap_getattr(). (#2464 ) After Fast ALTER TABLE ADD COLUMN with a non-NULL default in PG11, physical heaps might not contain all attributes after a ALTER TABLE ADD COLUMN happens. heap_getattr() returns NULL when the physical tuple doesn't contain an attribute. So we should use heap_deform_tuple() in these cases, which fills in the missing attributes. Our catalog tables evolve over time, and an upgrade might involve some ALTER TABLE ADD COLUMN commands. Note that we don't need to worry about postgres catalog tables and we can use heap_getattr() for them, because they only change between major versions. This also fixes #2453.	2018-11-05 15:11:01 -05:00
Hadi Moshayedi	431ac80563	Keep track of cached entries in case of interruption. (#2433 ) * Keep track of cached entries in case of interruption. Previously we set DistTableCacheEntry->sortedShardIntervalArray and DistTableCacheEntry->shardIntervalArrayLength after we entered all related shard entries into DistShardCacheHash. The drawback was that if populating DistShardCacheHash was interrupted, ResetDistTableCacheEntry() didn't see the shard hash entries created, so was unable to clean them up. This patch fixes that by setting sortedShardIntervalArray earlier, and incrementing shardIntervalArrayLength as we enter shards into the cache.	2018-10-15 14:06:56 -04:00
Marco Slot	d56baefe3d	Allow simple DML commands from hot standby	2018-10-06 10:54:44 +02:00
Marco Slot	877d703ac5	Evaluate functions (and when applicable, parameters) anywhere in query	2018-09-21 12:57:50 -06:00
Onder Kalaci	abc443d7fa	Make sure that shard repair considers replication factor	2018-09-21 15:24:49 +03:00
Onder Kalaci	c1b5a04f6e	Allow partitioned tables with replication factor > 1 With this commit, we all partitioned distributed tables with replication factor > 1. However, we also have many restrictions. In summary, we disallow all kinds of modifications (including DDLs) on the partition tables. Instead, the user is allowed to run the modifications over the parent table. The necessity for such a restriction have two aspects: - We need to acquire shard resource locks appropriately - We need to handle marking partitions INVALID in case of any failures. Note that, in theory, the parent table should also become INVALID, which is too aggressive.	2018-09-21 14:40:41 +03:00
Murat Tuncer	b6930e3db9	Add distributed locking to truncated mx tables We acquire distributed lock on all mx nodes for truncated tables before actually doing truncate operation. This is needed for distributed serialization of the truncate command without causing a deadlock.	2018-09-21 14:23:19 +03:00
Marco Slot	55f46acedf	Support TABLESAMPLE in router queries	2018-08-31 13:22:38 +02:00
Onder Kalaci	5cf8fbe7b6	Add infrastructure to relation if exists	2018-09-07 14:49:36 +03:00
Murat Tuncer	d8279569b8	Add support for INCLUDE option in index creation INCLUDE is a new feature in index creation in PG11. Included column/expression paramameters are now forwarded to shards	2018-09-06 19:41:06 +03:00
velioglu	bd30e3e908	Add support for writing to reference tables from MX nodes	2018-08-27 18:15:04 +03:00
velioglu	2639149bd8	Enterprise functions about metadata/resource locks	2018-08-27 16:32:20 +03:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
mehmet furkan şahin	1a3b9f731e	Make master_disable/activate_node runnable when superuser	2018-08-15 00:43:35 -07:00
Onder Kalaci	974cbf11a5	Hide shard names on MX worker nodes This commit by default enables hiding shard names on MX workers by simple replacing `pg_table_is_visible()` calls with `citus_table_is_visible()` calls on the MX worker nodes. The latter function filters out tables that are known to be shards. The main motivation of this change is a better UX. The functionality can be opted out via a GUC. We also added two views, namely citus_shards_on_worker and citus_shard_indexes_on_worker such that users can query them to see the shards and their corresponding indexes. We also added debug messages such that the filtered tables can be interactively seen by setting the level to DEBUG1.	2018-08-07 14:21:45 +03:00
Jason Petersen	318119910b	Add pg_dist_poolinfo table For storing nodes' pool host/port overrides.	2018-07-10 09:30:22 -07:00
mehmet furkan şahin	35eac2318d	lock referenced reference table metadata is added For certain operations in enterprise, we need to lock the referenced reference table shard distribution metadata	2018-07-03 17:05:55 +03:00
Onder Kalaci	d83be3a33f	Enforce foreign key restrictions inside transaction blocks When a hash distributed table have a foreign key to a reference table, there are few restrictions we have to apply in order to prevent distributed deadlocks or reading wrong results. The necessity to apply the restrictions arise from cascading nature of foreign keys. When a foreign key on a reference table cascades to a distributed table, a single operation over a single connection can acquire locks on multiple shards of the distributed table. Thus, any parallel operation on that distributed table, in the same transaction should not open parallel connections to the shards. Otherwise, we'd either end-up with a self-distributed deadlock or read wrong results. As briefly described above, the restrictions that we apply is done by tracking the distributed/reference relation accesses inside transaction blocks, and act accordingly when necessary. The two main rules are as follows: - Whenever a parallel distributed relation access conflicts with a consecutive reference relation access, Citus errors out - Whenever a reference relation access is followed by a conflicting parallel relation access, the execution mode is switched to sequential mode. There are also some other notes to mention: - If the user does SET LOCAL citus.multi_shard_modify_mode TO 'sequential';, all the queries should simply work with using one connection per worker and sequentially executing the commands. That's obviously a slower approach than Citus' usual parallel execution. However, we've at least have a way to run all commands successfully. - If an unrelated parallel query executed on any distributed table, we cannot switch to sequential mode. Because, the essense of sequential mode is using one connection per worker. However, in the presence of a parallel connection, the connection manager picks those connections to execute the commands. That contradicts with our purpose, thus we error out. - COPY to a distributed table cannot be executed in sequential mode. Thus, if we switch to sequential mode and COPY is executed, the operation fails and there is currently no way of implementing that. Note that, when the local table is not empty and create_distributed_table is used, citus uses COPY internally. Thus, in those cases, create_distributed_table() will also fail. - There is a GUC called citus.enforce_foreign_key_restrictions to disable all the checks. We added that GUC since the restrictions we apply is sometimes a bit more restrictive than its necessary. The user might want to relax those. Similarly, if you don't have CASCADEing reference tables, you might consider disabling all the checks.	2018-07-03 17:05:55 +03:00
velioglu	6be6911ed9	Create foreign key relation graph and functions to query on it	2018-07-03 17:05:55 +03:00
Murat Tuncer	3fc7cdfe6d	Apply master_stage_protocol refactoring changes	2018-06-28 11:24:57 +03:00
Murat Tuncer	4d35b92016	Add groundwork for citus_stat_statements api	2018-06-27 14:20:03 +03:00
Onder Kalaci	8ccb8b679e	Real-time executor marks multi shard relation accesses before opening connections	2018-06-25 18:40:31 +03:00
Onder Kalaci	2f01894589	Track relation accesses using the connection management infrastructure	2018-06-25 18:40:30 +03:00
Jason Petersen	7a75c2ed31	Add connparam invalidation trigger creation logic This needs to live in Community, since we haven't yet added the com- plication of having divergent upgrade scripts in Enterprise.	2018-06-20 14:13:18 -06:00
velioglu	53b2e81d01	Adds SELECT ... FOR UPDATE support for router plannable queries	2018-06-18 13:55:17 +03:00
Jason Petersen	5bf7bc64ba	Add pg_dist_authinfo schema and validation This table will be used by Citus Enterprise to populate authentication- related fields in outbound connections; Citus Community lacks support for this functionality.	2018-06-13 11:16:26 -06:00
Jason Petersen	57b3f253c5	Add node_conninfo GUC and related logic To support more flexible (i.e. not at compile-time) specification of libpq connection parameters, this change adds a new GUC, node_conninfo, which must be a space-separated string of key-value pairs suitable for parsing by libpq's connection establishment methods. To avoid rebuilding and parsing these values at connection time, this change also adds a cache in front of the configuration params to permit immediate use of any previously-calculated parameters.	2018-06-12 20:23:47 -06:00
mehmet furkan şahin	d1a3b20115	foreign_constraint_utils is created	2018-06-07 18:19:24 +03:00
Onder Kalaci	df44956dc3	Make sure that sequential DDL opens a single connection to each node After this commit DDL commands honour `citus.multi_shard_modify_mode`. We preferred using the code-path that executes single task router queries (e.g., ExecuteSingleModifyTask()) in order not to invent a new executor that is only applicable for DDL commands that require sequential execution.	2018-06-05 17:52:17 +03:00
Marco Slot	fd4ff29f2f	Add a debug message with distribution column value	2018-06-05 15:09:17 +03:00
Dimitri Fontaine	8b258cbdb0	Lock reads and writes only to the node being updated in master_update_node Rather than locking out all the writes in the cluster, the function now only locks out writes that target shards hosted by the node we're updating.	2018-05-09 15:14:20 +02:00
Hadi Moshayedi	86b12bc2d0	Always prefix operators with their namespace. (#2147 ) Previously we checked if an operator is in pg_catalog, and if it wasn't we prefixed it with namespace in worker queries. This can have a huge impact on performance of physical planner when using custom data types. This happened regardless of current search_path config, because Citus overrides the search path in get_query_def_extended(). When we do so, the check for existence of the operator in current search path in generate_operator_name() fails for any operators outside pg_catalog. This means that nothing gets cached, and in the following calls we will again recheck the system tables for existence of the operators, which took an additional 40-50ms for some of the usecases we were seeing. In this change we skip the pg_catalog check, and always prefix the operator with its namespace.	2018-05-05 13:27:26 -04:00
Onder Kalaci	317dd02a2f	Implement single repartitioning on hash distributed tables * Change worker_hash_partition_table() such that the divergence between Citus planner's hashing and worker_hash_partition_table() becomes the same. * Rename single partitioning to single range partitioning. * Add single hash repartitioning. Basically, logical planner treats single hash and range partitioning almost equally. Physical planner, on the other hand, treats single hash and dual hash repartitioning almost equally (except for JoinPruning). * Add a new GUC to enable this feature	2018-05-02 18:50:55 +03:00
velioglu	32bcd610c1	Support modify queries with multiple tables With this commit we begin to support modify queries with multiple tables if these queries are pushdownable.	2018-05-02 16:22:26 +03:00
Marco Slot	304b3a41ba	Cache the partition column Var	2018-04-26 14:58:16 -06:00
Murat Tuncer	a6fe5ca183	PG11 compatibility update - changes in ruleutils_11.c is reflected - vacuum statement api change is handled. We now allow multi-table vacuum commands. - some other function header changes are reflected - api conflicts between PG11 and earlier versions are handled by adding shims in version_compat.h - various regression tests are fixed due output and functionality in PG1 - no change is made to support new features in PG11 they need to be handled by new commit	2018-04-26 11:29:43 +03:00
Hadi Moshayedi	966f01fad3	Fix write and copy functions for TaskExecution. (#2120 ) We were missing criticalErrorOccurred from CopyNodeTaskExecution() and OutTaskExecution(). This PR fixes it.	2018-04-23 09:07:52 -04:00
Marco Slot	ee132c5ead	Prune shards once per relation in subquery pushdown	2018-04-10 20:33:07 +02:00
Burak Yucesoy	0c283fa8a3	Add partitioning support to MX tables Previously, we prevented creation of partitioned tables on Citus MX. We decided to not focus on this feature until there is a need. Since now there are requests for this feature, we are implementing support for partitioned tables on Citus MX.	2018-04-06 12:47:06 +03:00
velioglu	698d585fb5	Remove broadcast join logic After this change all the logic related to shard data fetch logic will be removed. Planner won't plan any ShardFetchTask anymore. Shard fetch related steps in real time executor and task-tracker executor have been removed.	2018-03-30 11:45:19 +03:00
Brian Cloutier	f8f0d4aedc	Add Windows replacement for uname	2018-03-21 20:35:56 -07:00
Metin Doslu	bcf660475a	Add support for modifying CTEs	2018-02-27 15:08:32 +02:00
Marco Slot	0cba4ab588	Refactor worker node hash initialisation	2018-02-12 23:36:43 +01:00
Marco Slot	40d715d494	Cache worker node array for faster iteration	2018-02-12 23:36:43 +01:00
Marco Slot	6f7c3bd73b	Skip JSON validation on coordinator during COPY	2018-02-02 15:33:27 +01:00
Brian Cloutier	e6ebfc1f53	Remove VLA from UpdateNodeLocation	2018-02-01 10:30:41 -08:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Brian Cloutier	457f570b77	Small refactor, we were using incompatible types	2018-01-31 11:05:59 -08:00
Brian Cloutier	b864d014ab	GetNextNodeId() incorrectly called PG_RETURN_DATUM - Also stabilize the output of a multi_router_planner test	2018-01-29 15:32:36 -08:00
Brian Cloutier	76d1edc3fd	Don't rely on gcc-specific features (#1963 ) * Don't use expressions inside compound statements * Don't depend on __builtin_constant_p * Remove reliance on S_ISLNK * Replace use of __func__: older mcvs doesn't support this builtin	2018-01-23 17:03:29 -08:00
Dimitri Fontaine	c9760fbb64	Fix CREATE INDEX with storage options on distributed tables. By sharing the implementation of the function AppendOptionListToString on three call sites, we would expand an extra OPTIONS keyword in a create index statement, and omit other bits of the specific syntax here. This patch introduces an AppendStorageParametersToString() function that is very similar to AppendOptionListToString() but handles WITH(a="foo",...) syntax that is used in reloptions (aka Storage Parameters). Fixes #1747.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	952da72c55	Implement ALTER TABLE\|INDEX ... SET\|RESET (). PostgreSQL implements support for several relation kinds in a single statement, such as in the AlterTableStmt case, which supports both tables and indexes and more (see ATExecSetRelOptions in PostgreSQL source code file src/backend/commands/tablecmds.c for an example of that). As a consequence, this patch implements support for setting and resetting storage parameters on both relation kinds.	2018-01-17 21:56:40 +01:00
Brian Cloutier	fb7b86fa14	Replace strtoull with pg_strtouint64 The macro we were using to detect strtoull isn't set on Windows, and just in case there are differences use a portable function from PG instead of calling strtoull directly.	2017-12-21 14:28:51 +01:00
mehmet furkan şahin	fd546cf322	Intermediate result size limitation This commit introduces a new GUC to limit the intermediate result size which we handle when we use read_intermediate_result function for CTEs and complex subqueries.	2017-12-21 14:26:56 +03:00
Marco Slot	cbbd418af2	Add citus.copy_format OIDs to metadata cache	2017-12-14 09:32:55 +01:00
Marco Slot	7d1191954d	Add DistributedSubPlan node	2017-12-14 09:32:55 +01:00
Marco Slot	f8550b8c85	Fix issues with read_intermediate_result signature	2017-12-07 13:47:56 +01:00
Marco Slot	eab15aa035	Avoid deadlock in ColocatedTableId	2017-12-06 11:49:34 +01:00
Marco Slot	7279d42849	Treat read_intermediate_result as recurring tuples	2017-12-04 14:50:11 +01:00
Murat Tuncer	2d66bf5f16	Fix hard coded formatting strings for 64 bit numbers (#1831 ) Postgres provides OS agnosting formatting macros for formatting 64 bit numbers. Replaced %ld %lu with INT64_FORMAT and UINT64_FORMAT respectively. Also found some incorrect usages of formatting flags and fixed them.	2017-12-04 14:11:06 +03:00
Marco Slot	20a526d5c4	Fix memory leak in ListToHashSet	2017-11-22 11:26:58 +01:00
Marco Slot	8486f76e15	Auto-recover 2PC transactions	2017-11-22 11:26:58 +01:00
Marco Slot	6ba3f42d23	Rename MultiPlan to DistributedPlan	2017-11-22 09:36:24 +01:00
Andres Freund	d063658d6d	Protect some initializations from being called during backend startup. On EXEC_BACKEND builds these functions shouldn't be called at every backend start.	2017-11-20 15:29:51 -08:00
Brian Cloutier	d267e0f9fa	EXEC_BACKEND: don't put pointers to shared hashes into shared memory Store pointers to shared hashes in process-local variables. Previously pointers to shared hashes were put into shared memory. This causes problems on EXEC_BACKEND because everybody calls execve and receives a brand new address space; the shared hash will be in a different place for every backend. (normally we call fork, which gives you a copy of the address space, so these pointers remain constant)	2017-11-20 15:29:51 -08:00
Marco Slot	ae47df01ea	Observe prepared xacts twice in RecoverWorkerTransactions to avoid race condition	2017-11-20 11:44:08 +01:00
Marco Slot	2410c2e450	Rewrite recover_prepared_transactions to be fast, non-blocking	2017-11-20 11:27:40 +01:00
Brian Cloutier	0f3230170f	Pull in INT32_MAXINT and INT32_MININT	2017-11-14 14:03:46 -08:00
Hadi Moshayedi	6d79d25101	Fix a relcache reference leak in stats collection. In DistributedTablesSize() we didn't close the relations that had replication factor > 2. This caused relcache reference leaks, and warning messages like following in logs: WARNING: relcache reference leak: relation "researchers" not closed	2017-11-06 23:16:43 -05:00
Marco Slot	6883a09cdd	Allow distributed partitioned table creation in Cloud	2017-11-03 10:09:18 +01:00
Hadi Moshayedi	7280774cf4	Use list_length() != 1 in SingleReplicatedTable(). ShardPlacementList's implementation can return NIL. In previous implementation we got a segmentation fault in this case. The relation can be dropped after getting distributed table list but before calling SingleReplicatedTable().	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	7691991cb5	Do PG_TRY() inside a subtransaction block. If we don't propagate the errors we are catching in PG_CATCH(), database's internal state might not be clean. So we do PG_TRY() inside a subtransaction so we can rollback to it after catching errors.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	9bfbbf8a04	Make reports hostname configurable and enable stats collection in tests. This patch adds --with-reports-host configure option, which sets the REPORTS_BASE_URL constant. The default is reports.citusdata.com. It also enables stats collection in tests.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	acaf085a80	Add callback function for request by CollectBasicUsageStatistics(). Curl writes the received response to stdout if we don't specify a response callback or an output file. This can pollute the PostgreSQL log. In this change we add a callback function so the response messages aren't added to the log file.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	747e439601	Limit number of stats collection retries to once a day.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	78a2cd9052	Check for Citus updates. Sends a request to /v1/releases/latest?flavor=$CITUS_EDITION once a day, which returns a response similar to {"version": "7.1.0", "major": 7, "minor": 1, "patch": 0}. Then compares it with current Citus version, and if the latest release is newer, logs a LOG message.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	34f3ec0961	Call FlushDistTableCache() before stats collection.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	c18c6625d9	Lock relations before calling citus_table_size(). This is to make sure they don't get dropped.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	97d544b75c	Follow the patterns used in Deadlock Detection in Stats Collection. This includes: (1) Wrap everything inside a StartTransactionCommand()/CommitTransactionCommand(). This is so we can access the database. This also switches to a new memory context and releases it, so we don't have to do our own memory management. (2) LockCitusExtension() so the extension cannot be dropped or created concurrently. (3) Check CitusHasBeenLoaded() && CheckCitusVersion() before doing any work. (4) Do not PG_TRY() inside a loop.	2017-10-31 21:51:43 -04:00
Furkan Sahin	2b39c52f0b	Replica identity on create_distributed_table By this commit, citus minds the replica identity of the table when we distribute the table. So the shards of the distributed table have the same replica identity with the local table.	2017-10-31 13:08:36 +03:00
Brian Cloutier	4a17d12d74	Replace uint with uint32	2017-10-25 19:32:12 -07:00
velioglu	0b5db5d826	Support multi shard update/delete queries	2017-10-25 15:52:38 +03:00
Hadi Moshayedi	9a04b78980	Send server_id for statistics reports. (#1698 ) This change introduces the `pg_dist_node_metadata` which has a single jsonb value. When creating the extension, a random server id is generated and stored in there. Everything in the metadata table is added as a nested objected to the json payload that is sent to the reports server.	2017-10-18 21:20:32 -04:00
Hadi Moshayedi	86bcd93a4a	Don't collect stats when there is a version mismatch. (#1712 ) The following scenario can cause an Assert() crash if we don't do this: - Install Citus v7.0-15 - Restart server & run a query to start maintenanced. - Install Citus v7.1 - Restart server & run a query. This will tell user to upgrade. - Type "UPDATE EXTENSION c" & press tab. maintenanced will start and crash with Assert(CitusHasBeenLoaded() && CheckCitusVersion(WARNING)); This change checks Citus version before calling metadata functions so the crash doesn't happen.	2017-10-17 14:01:14 -04:00
Jason Petersen	8544878c4b	Add citus_version(), analogous to PG's version() This will provide the full project name (i.e. Citus/Citus Enterprise), and the host system, compiler, and architecture word size. I wanted to limit the number of copied files in 'config', so I added only config.guess and call it manually, rather than using the macro AC_CANONICAL_HOST, which requires several other files.	2017-10-16 18:09:29 -06:00
Brian Cloutier	ebcb2b65e9	Add master_move_node function	2017-10-16 10:51:28 -07:00
Hadi Moshayedi	2aec6eda49	Properly use #ifdef HAVE_LIBCURL.	2017-10-13 12:04:36 -06:00
Jason Petersen	01353cb7cb	Use header define rather than -D flag Eclipse apparently doesn't scan build output looking for -D flags, so having the value actually appear in a header is nicer for those of us using IDEs.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	946659aebe	Delete StatsCollection memory context after we are done with stats reporting. Previously we left the memory context untouched, which overtime leaked memory.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	873fd1e7ff	Fix compiling --without-libcurl. Previously <curl/curl.h> was included even if compiled --without-libcurl. This can fail when libcurl headers are not there. This commit guards this include by checks for HAVE_LIBCURL.	2017-10-13 11:00:09 -04:00
Murat Tuncer	f7ab901766	Add select distinct, and distinct on support Distinct, and distinct on() clauses are supported in simple selects, joins, subqueries, and insert into select queries.	2017-10-13 14:59:48 +03:00
Hadi Moshayedi	6879f92e23	Fix out of bound memeory access when getting HTTP response code. (#1699 )	2017-10-12 12:51:42 -04:00
Hadi Moshayedi	a1387f4aa8	Basic usage statistics collection. (#1656 ) Adds ```citus.enable_statistics_collection``` GUC variable, which ```true``` by default, unless built without libcurl. If statistics collection is enabled, sends basic usage data to Citus servers every 24 hours. The data that is collected consists of: - Citus version - OS name & release - Hardware Id - Number of tables, rounded to next power of 2 - Size of data, rounded to next power of 2 - Number of workers	2017-10-11 09:55:15 -04:00
Marco Slot	394918f9d0	Invalidate worker and group ID cache in maintenance daemon	2017-10-02 18:14:29 +02:00
Marco Slot	da6b42a3e2	Use unique constraint index for transaction record deletion	2017-09-28 12:04:56 +02:00
Jason Petersen	89d02c6115	Add ruleutils file for PostgreSQL 11	2017-09-25 17:20:24 -07:00
Jason Petersen	6c9b19a954	Add version-compat header For polyfill macros, etc.	2017-09-25 17:20:23 -07:00
Jason Petersen	fbeaa2f9d0	Remove direct access to tupleDesc->attrs A level of indirection was removed from this field for PostgreSQL 11. By using the handy provided macro, we can be version agnostic.	2017-09-25 17:20:23 -07:00
Andres Freund	78716e5546	Fix possible shard cache incoherency. When a table and it's shards are dropped, and afterwards the same shard identifiers are reused, e.g. due to a DROP & CREATE EXTENSION, the old entry in the shard cache and the required entry in the shard cache might be for different tables. Force invalidation for both old and new table to fix.	2017-09-25 13:05:09 -07:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Jason Petersen	ec30ad38ba	Update ruleutils_10 with latest PostgreSQL changes See: postgres/postgres@21d304dfed postgres/postgres@bb5d6e80b1 postgres/postgres@d363d42bb9 postgres/postgres@eb145fdfea postgres/postgres@decb08ebdf postgres/postgres@a3ca72ae9a postgres/postgres@bc2d716ad0 postgres/postgres@382ceffdf7 postgres/postgres@c7b8998ebb postgres/postgres@e3860ffa4d postgres/postgres@76a3df6e5e	2017-09-01 14:26:59 -06:00
Jason Petersen	ebecde8f6e	Update ruleutils_96 with latest PostgreSQL changes See: postgres/postgres@41ada83774 postgres/postgres@3b0c2dbed0 postgres/postgres@ff2d537223	2017-09-01 14:26:53 -06:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Marco Slot	7523753a73	Clear metadata OID cache prior to deadlock detection	2017-08-18 11:20:24 +02:00
Andres Freund	b936bde936	Take AccessShareLock on the extension prior to running deadlock detection	2017-08-18 11:20:24 +02:00
Marco Slot	1eca53ad40	Exit maintenanced on database crash	2017-08-16 18:29:44 +02:00
Onder Kalaci	205501532a	Add version check to the maintenance daemon We should prevent running the deadlock detection if there is a major version change. Otherwise, the daemon may access to obsolete metadata catalog tables.	2017-08-15 18:47:13 +03:00
Eren Başak	77626c4238	Fix NULL nodeClusterString crush on pg_worker_list.conf migrations	2017-08-14 18:13:53 +03:00
Eren Başak	b3d2f9ba71	Fix pg_worker_list use-after-free bug This change fixes a use-after-free bug while renaming obsolete `pg_worker_list.conf` file, which causes Citus to crash during upgrade (or even extension creation) if `pg_worker_list.conf` exists.	2017-08-14 18:13:53 +03:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operation on parent table	2017-08-14 14:44:30 +03:00
Burak Yucesoy	52b9e35d50	Add relationIdList field to the Job struct	2017-08-14 14:06:22 +03:00
Onder Kalaci	e5d5bdff51	Enable distributed deadlock detection on the maintenance deamon With this commit, the maintenance deamon starts to check for distributed deadlocks. We also introduced a GUC variable (distributed_deadlock_detection_factor) whose value is multiplied with Postgres' deadlock_timeout. Setting it to -1 disables the distributed deadlock detection.	2017-08-12 13:28:37 +03:00
Marco Slot	53584affa8	Fix locking in create_distributed_table	2017-08-11 11:34:33 +03:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belong to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively go through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing trans- actions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single- row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master- evaluated functions.	2017-08-10 00:32:46 -07:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	3a0571e69b	Remove LockMetadataSnapshot	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yucesoy	8455d1a4ef	Ensure we are allowing partitioned tables at all appropriate places	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are; - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propogating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Brian Cloutier	7060ade6fe	GetNodeTuple returns NULL it node does not exist It never throws an error.	2017-08-08 13:12:06 +03:00
Brian Cloutier	a3e9bef685	All users of WorkerNodeHash take an AccessShareLock The metadata cache simulates a SELECT on pg_dist_node. Now the locks it takes also simulate that SELECT.	2017-08-08 13:12:06 +03:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	3151b52a0b	Add citus.cluster_name GUC - Nodes with a nodecluster which does not match citus.cluster_name are excluded from the metadata cache and never seen by another part of Citus.	2017-08-08 13:12:06 +03:00
Brian Cloutier	94947c0d54	Refactor: ReplicateShardToAllWorkers more explicitly locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	3769381366	Fix inaccurate comment on SetNodeState	2017-08-08 13:12:06 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Marco Slot	bac60bb64f	Function evaluation descends into expression trees	2017-08-06 19:53:05 +02:00
Brian Cloutier	37985de85e	master_disable_node no longer crashes when given a non-existant node	2017-08-04 11:14:54 +03:00
Murat Tuncer	fa18899cf9	Remove serialization/deserialization of multiplan node (#1477 ) introduces copy functions for Citus MultiPlan nodes. uses ExtensibleNode mechanism to store MultiPlan data drops serialiazation of MultiPlans	2017-08-02 08:24:00 +03:00
Burak Yucesoy	7769f1d012	Refactor distributed table creation logic This commit is preperation for introducing distributed partitioned table support. We want to clean and refactor some code in distributed table creation logic so that we can handle partitioned tables in more robust way.	2017-07-31 11:11:23 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Brian Cloutier	88702ca58a	node_metadata takes out more sane locks - Never release locks - AddNodeMetadata takes ShareRowExclusiveLock so it'll conflict with the trigger which prevents multiple primary nodes. - ActivateNode and SetNodeState used to take AccessShareLock, but they modify the table so they should take RowExclusiveLock. - DeleteNodeRow and InsertNodeRow used to take AccessExclusiveLock but only need RowExclusiveLock.	2017-07-24 11:57:46 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	7f1343103e	Fix PG 10 build, UNBOUNDED partitions now have different syntax Update code and tests to match the changes made in pg's d363d42	2017-07-21 14:30:11 +03:00
Brian Cloutier	74dd5bb281	Fix crash when removing an inactive node	2017-07-20 18:55:40 +03:00
Hadi Moshayedi	953df34d22	Explicit switch/case fall-throughs to avoid compiler warnings. GCC 7 added `-Wimplicit-fallthrough` to warn for not explicitly specified switch/case fall-throughs. According to https://gcc.gnu.org/gcc-7/changes.html, to suppress that warning we could either use `__attribute__(fallthrough)`, which didn't seem to work for earlier GCC versions, or a `/* fallthrough /` comment just before the following `case`. Previously Citus code had the fall-through comments inside the brackets, which didn't seem to suppress the warning. Putting a `/ fallthrough */` comment outside the brackets and right before the `case` fixes the problem.	2017-07-19 11:41:59 -04:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
Brian Cloutier	f40f03270a	Fix locking in ReadWorkerNodes()	2017-07-12 14:18:27 +02:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Brian Cloutier	0b64bb1092	Fix typo in comment in CachedRelationLookup	2017-07-12 14:16:24 +02:00
Marco Slot	01c9b1f921	Use GetPlacementListConnection for router SELECTs	2017-07-12 11:26:22 +02:00
Burak Yucesoy	c8b9e4011b	Remove LockRelationDistributionMetadata function	2017-07-10 15:46:37 +03:00
Marco Slot	31debc96e3	Handle implicit casts in prepared INSERTs	2017-07-06 16:17:35 +02:00
Marco Slot	da47a03b18	Move INSERT ... SELECT planning logic into one place	2017-06-29 15:03:14 +02:00
Onder Kalaci	5f3f1d75a3	Add some utility functions for partitioned tables This commit is intended to be a base for supporting declarative partitioning on distributed tables. Here we add the following utility functions and their unit tests: * Very basic functions including differnentiating partitioned tables and partitions, listing the partitions * Generating the PARTITION BY (expr) and adding this to the DDL events of partitioned tables * Ability to generate text representations of the ranges for partitions * Ability to generate the `ALTER TABLE parent_table ATTACH PARTITION partition_table FOR VALUES value_range` * Ability to apply add shard ids to the above command using `worker_apply_inter_shard_ddl_command()` * Ability to generate `ALTER TABLE parent_table DETACH PARTITION`	2017-06-28 09:39:55 +03:00
Andres Freund	dc3997c3b8	Remove 9.5 related node wrappers. Now that all branches support the extensible node infrastructure, we don't need our wrappers anymore.	2017-06-26 08:46:32 -07:00
Andres Freund	b96ba9b490	Fix code only enabled for 9.5. There's still supporting wrappers used, a subsequent commit will remove those. This also removes the already unused tuplecount_t define.	2017-06-26 08:46:32 -07:00
Andres Freund	60c28ce7a6	Remove 9.5 specific C files.	2017-06-26 08:46:32 -07:00
Jason Petersen	2204da19f0	Support PostgreSQL 10 (#1379 ) Adds support for PostgreSQL 10 by copying in the requisite ruleutils and updating all API usages to conform with changes in PostgreSQL 10. Most changes are fairly minor but they are numerous. One particular obstacle was the change in \d behavior in PostgreSQL 10's psql; I had to add SQL implementations (views, mostly) to mimic the pre-10 output.	2017-06-26 02:35:46 -06:00
Andres Freund	c3b7c5dc33	Introduce per-database maintenance process. This will be used for deadlock detection, prepared transaction recovery amongst others, but currently is just idling around.	2017-06-23 11:53:39 -07:00
Andres Freund	3483bb99eb	Minimal infrastructure for per-backend citus initialization.	2017-06-23 11:20:10 -07:00
Andres Freund	1691f780fd	Force cache invalidation machinery to be initialized earlier. Previously it was not guaranteed that invalidations were registered after creating the extension, only if the extension was used afterwards.	2017-06-23 11:20:10 -07:00
Andres Freund	f645dca593	Centralized metadata_cache cache variables into one struct, to avoid missing resets. E.g. extensionOwner was already missed.	2017-06-23 11:20:10 -07:00
Marco Slot	2f8ac82660	Execute INSERT..SELECT via coordinator if it cannot be pushed down Add a second implementation of INSERT INTO distributed_table SELECT ... that is used if the query cannot be pushed down. The basic idea is to execute the SELECT query separately and pass the results into the distributed table using a CopyDestReceiver, which is also used for COPY and create_distributed_table. When planning the SELECT, we go through planner hooks again, which means the SELECT can also be a distributed query. EXPLAIN is supported, but EXPLAIN ANALYZE is not because preventing double execution was a lot more complicated in this case.	2017-06-22 15:46:30 +02:00
Marco Slot	56876596d5	Add support for unlogged distributed tables	2017-06-14 13:50:00 +02:00
Burak Yucesoy	8c1bbf1417	Register cache invalidation callback before version checks With this commit we start to register InvalidateDistRelationCacheCallback function as cache invalidation callback function before version checks because during version checks we use cache to look up relation ids of some relations like pg_dist_relation or pg_dist_partition_logical_relid_index and we want to know about cache invalidation before accessing them.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	c7bfa06cb9	Fix incorrect call to CheckInstalledVersion During version update, we indirectly calld CheckInstalledVersion via ChackCitusVersions. This obviously fails because during version update it is expected to have version mismatch between installed version and binary version. Thus, we remove that ChackCitusVersions. We now only call ChackAvailableVersion.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	9fb15c439c	Add version checks to necessary UDFs	2017-05-22 09:53:29 +03:00
Burak Yucesoy	eea8c51e1f	Only error out on distributed queries when there is version mismatch Before this commit, we were erroring out at almost all queries if there is a version mismatch. With this commit, we started to error out only requested operation touches distributed tables. Normally we would need to use distributed cache to understand whether a table is distributed or not. However, it is not safe to read our metadata tables when there is a version mismatch, thus it is not safe to create distributed cache. Therefore for this specific occasion, we directly read from pg_dist_partition table. However; reading from catalog is costly and we should not use this method in other places as much as possible.	2017-05-22 09:53:29 +03:00
Burak Yucesoy	acb0d23717	Fix crash during upgrade from 5.2 to 6.2 This commit fixes the problem where we incorrectly try to reach distributed table cache when the extension is not loaded completely. We tried to reach the cache because we wanted to get reference table information to activate the node. However it is actually not necessary to explicitly activate the nodes which come from master_initialize_node_metadata. Because it only runs during extension creation and at that time there are no reference tables and all nodes are considered as active.	2017-05-19 00:01:36 +03:00
Önder Kalacı	e0257aecd9	Accept invalidation messages before accessing the metadata cache (#1406 ) * Accept invalidation messages before accessing the metadata cache This commit is crucial to prevent stale metadata reads from the cache. Without this commit, some of the operations may use stale metadata which could end up with various bugs such as crashes, inconsistent/lost data etc. As an example, consider that a COPY operation is blocked on shard metadata lock. Another concurrent session updates the metadata and invalidates the cache. However, since Citus doesn't accept invalidations, COPY continues with the stale metadata once it acquires the lock. With this commit, we make sure that invalidation messages are accepted just before accessing the metadata cache and preventing any operation to use stale metadata. * Add isolation tests for placement changes and conccurrent operations - add node with reference table vs COPY/insert/update/DDL - repair shard vs COPY/insert/update/DDL - repair shard vs repair shard	2017-05-12 12:32:35 +03:00
Jason Petersen	f0c6c47c4e	Fix CREATE SEQUENCE generation bug Apparently we've had a typo all this time causing us to pass the cache value for the start value.	2017-05-03 21:47:06 -07:00
Brian Cloutier	22e7aa9a4f	Fix crash in isolation tests - There was a crash when the table a shardid belonged to changed during a session. Instead of crashing (a failed assert) we now throw an error - Update the isolation test which was crashing to no longer exercise that code path - Add a regression test to check that the error is thrown	2017-04-29 04:25:26 +03:00
Önder Kalacı	ad5cd326a4	Subquery pushdown - main branch (#1323 ) * Enabling physical planner for subquery pushdown changes This commit applies the logic that exists in INSERT .. SELECT planning to the subquery pushdown changes. The main algorithm is followed as : - pick an anchor relation (i.e., target relation) - per each target shard interval - add the target shard interval's shard range as a restriction to the relations (if all relations joined on the partition keys) - Check whether the query is router plannable per target shard interval. - If router plannable, create a task * Add union support within the JOINS This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query JOIN (QN) ... In other words, we currently do NOT support the queries that are in the following form where union query is not JOINed with other relations/subqueries : .... (Q1 UNION Q2 UNION ...) as union_query .... * Subquery pushdown planner uses original query With this commit, we change the input to the logical planner for subquery pushdown. Before this commit, the planner was relying on the query tree that is transformed by the postgresql planner. After this commit, the planner uses the original query. The main motivation behind this change is the simplify deparsing of subqueries. * Enable top level subquery join queries This work enables - Top level subquery joins - Joins between subqueries and relations - Joins involving more than 2 range table entries A new regression test file is added to reflect enabled test cases * Add top level union support This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query .... In other words, Citus supports allow top level unions being wrapped into aggregations queries and/or simple projection queries that only selects some fields from the lower level queries. * Disallow subqueries without a relation in the range table list for subquery pushdown This commit disallows subqueries without relation in the range table list. This commit is only applied for subquery pushdown. In other words, we do not add this limitation for single table re-partition subqueries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Disallow subqueries without a relation in the range table list for INSERT .. SELECT This commit disallows subqueries without relation in the range table list. This commit is only applied for INSERT.. SELECT queries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Change behaviour of subquery pushdown flag (#1315) This commit changes the behaviour of the citus.subquery_pushdown flag. Before this commit, the flag is used to enable subquery pushdown logic. But, with this commit, that behaviour is enabled by default. In other words, the flag is now useless. We prefer to keep the flag since we don't want to break the backward compatibility. Also, we may consider using that flag for other purposes in the next commits. * Require subquery_pushdown when limit is used in subquery Using limit in subqueries may cause returning incorrect results. Therefore we allow limits in subqueries only if user explicitly set subquery_pushdown flag. * Evaluate expressions on the LIMIT clause (#1333) Subquery pushdown uses orignal query, the LIMIT and OFFSET clauses are not evaluated. However, logical optimizer expects these expressions are already evaluated by the standard planner. This commit manually evaluates the functions on the logical planner for subquery pushdown. * Better format subquery regression tests (#1340) * Style fix for subquery pushdown regression tests With this commit we intented a more consistent style for the regression tests we've added in the - multi_subquery_union.sql - multi_subquery_complex_queries.sql - multi_subquery_behavioral_analytics.sql * Enable the tests that are temporarily commented This commit enables some of the regression tests that were commented out until all the development is done. * Fix merge conflicts (#1347) - Update regression tests to meet the changes in the regression test output. - Replace Ifs with Asserts given that the check is already done - Update shard pruning outputs * Add view regression tests for increased subquery coverage (#1348) - joins between views and tables - joins between views - union/union all queries involving views - views with limit - explain queries with view * Improve btree operators for the subquery tests This commit adds the missing comprasion for subquery composite key btree comparator.	2017-04-29 04:09:48 +03:00
Andres Freund	d399f395f7	Faster shard pruning. So far citus used postgres' predicate proofing logic for shard pruning, except for INSERT and COPY which were already optimized for speed. That turns out to be too slow: * Shard pruning for SELECTs is currently O(#shards), because PruneShardList calls predicate_refuted_by() for every shard. Obviously using an O(N) type algorithm for general pruning isn't good. * predicate_refuted_by() is quite expensive on its own right. That's primarily because it's optimized for doing a single refutation proof, rather than performing the same proof over and over. * predicate_refuted_by() does not keep persistent state (see 2.) for function calls, which means that a lot of syscache lookups will be performed. That's particularly bad if the partitioning key is a composite key, because without a persistent FunctionCallInfo record_cmp() has to repeatedly look-up the type definition of the composite key. That's quite expensive. Thus replace this with custom-code that works in two phases: 1) Search restrictions for constraints that can be pruned upon 2) Use those restrictions to search for matching shards in the most efficient manner available: a) Binary search / Hash Lookup in case of hash partitioned tables b) Binary search for equal clauses in case of range or append tables without overlapping shards. c) Binary search for inequality clauses, searching for both lower and upper boundaries, again in case of range or append tables without overlapping shards. d) exhaustive search testing each ShardInterval My measurements suggest that we are considerably, often orders of magnitude, faster than the previous solution, even if we have to fall back to exhaustive pruning.	2017-04-28 14:40:41 -07:00
Andres Freund	6bd2e3ed30	Add DistTableCacheEntry->hasOverlappingShardInterval. This determines whether it's possible to perform binary search on sortedShardIntervalArray or not. If e.g. two shards have overlapping ranges, that'd be prohibitive. That'll be useful in later commit introducing faster shard pruning.	2017-04-28 14:40:38 -07:00
Andres Freund	105483ec56	Add DistTableCacheEntry->shardValueCompareFunction. That's useful when comparing values a hash-partitioned table is filtered by. The existing shardIntervalCompareFunction is about comparing hashed values, not unhashed ones. The added btree opclass function is so we can get a comparator back. This should be changed much more widely, but is not necessary so far.	2017-04-28 14:40:38 -07:00
Andres Freund	52571c00ad	Build DistTableCacheEntry->shardIntervalCompareFunction even for 0 shards. Previously we, unnecessarily, used a the first shard's type information to to look up the comparison function. But that information is already available, so use it. That's helpful because we sometimes want to access the comparator function even if there's no shards.	2017-04-28 14:40:38 -07:00
Andres Freund	ba93d32c8a	Fix: Make FindShardIntervalIndex robust against 0 shards.	2017-04-28 14:40:38 -07:00
Jason Petersen	93e3afc25c	Remove FastShardPruning method With the other simplifications, it doesn't make sense to keep around.	2017-04-27 13:32:36 -06:00
Jason Petersen	42ee7c05f5	Refactor FindShardInterval to use cacheEntry All callers fetch a cache entry and extract/compute arguments for the eventual FindShardInterval call, so it makes more sense to refactor into that function itself; this solves the use-after-free bug, too.	2017-04-27 13:32:36 -06:00
Marco Slot	7faf4657b7	Use right sizeof in UpdateRelationColocationGroup	2017-04-21 16:37:09 +02:00
Marco Slot	4ed093970a	Support expressions in the partition column in INSERTs	2017-04-21 14:05:52 +02:00
velioglu	2327b63291	Change native hash function with worker_hash	2017-04-19 22:16:55 +03:00
Burak Yucesoy	1a56b99f13	Fix node copy error Instead of directly returning heap tuple obtained from heap scan we return copied version of it.	2017-04-17 19:38:18 +03:00
Marco Slot	5e58804d44	Support query parameters in combination with function evaluation	2017-04-17 15:40:55 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Jason Petersen	033fda9183	Clean up remaining error messages Added details and hints, based off of similar PostgreSQL scenarios.	2017-04-04 16:11:59 -06:00
Burak Yucesoy	a09614553f	Add enable_version_checks GUC and address feedback	2017-04-04 19:11:13 +03:00
Burak Yucesoy	087d8427e3	Error out if binary citus version does not match installed extension With this change, we start to error out if loaded citus binaries does not match the available major version or installed citus extension version. In this case we force user to restart the server or run ALTER EXTENSION depending on the situation	2017-04-03 17:36:13 -06:00
Jason Petersen	dd9365433e	Update documentation Ensure all functions have comments, etc.	2017-04-03 11:19:15 -06:00
Jason Petersen	95d8d27c4f	Change IndexStmt to generate worker DDL on master Because we can't execute CREATE INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:14 -06:00
Metin Doslu	1f838199f8	Use CustomScan API for query execution Custom Scan is a node in the planned statement which helps external providers to abstract data scan not just for foreign data wrappers but also for regular relations so you can benefit your version of caching or hardware optimizations. This sounds like only an abstraction on the data scan layer, but we can use it as an abstraction for our distributed queries. The only thing we need to do is to find distributable parts of the query, plan for them and replace them with a Citus Custom Scan. Then, whenever PostgreSQL hits this custom scan node in its Vulcano style execution, it will call our callback functions which run distributed plan and provides tuples to the upper node as it scans a regular relation. This means fewer code changes, fewer bugs and more supported features for us! First, in the distributed query planner phase, we create a Custom Scan which wraps the distributed plan. For real-time and task-tracker executors, we add this custom plan under the master query plan. For router executor, we directly pass the custom plan because there is not any master query. Then, we simply let the PostgreSQL executor run this plan. When it hits the custom scan node, we call the related executor parts for distributed plan, fill the tuple store in the custom scan and return results to PostgreSQL executor in Vulcano style, a tuple per XXX_ExecScan() call. * Modify planner to utilize Custom Scan node. * Create different scan methods for different executors. * Use native PostgreSQL Explain for master part of queries.	2017-03-14 12:17:51 +02:00
Andres Freund	52358fe891	Initial temp table removal implementation	2017-03-14 12:09:49 +02:00
Murat Tuncer	72027f2eba	Remove default clause from shard DDL when sequences are used	2017-03-01 17:32:48 +03:00
Marco Slot	bab1b65491	Fix spelling in master_initialize_node_metadata comment	2017-03-01 12:27:50 +01:00
Brian Cloutier	a30b9b93a4	Set a memory context when throwing deferred errors	2017-02-02 15:14:21 +02:00
Marco Slot	ba940a1de9	Use coordinator instead of schema node in terminology	2017-01-25 11:07:23 +01:00
Burak Yucesoy	484cb12cd0	Add LoadShardPlacement UDF This UDF returns a shard placement from cache given shard id and placement id. At the moment it iterates over all shard placements of given shard by ShardPlacementList and searches given placement id in that list, which is not a good solution performance-wise. However, currently, this function will be used only when there is a failed transaction. If a need arises we can optimize this function in the future.	2017-01-23 21:04:57 +03:00
Andres Freund	c244b8ef4a	Make router planner error handling more flexible. So far router planner had encapsulated different functionality in MultiRouterPlanCreate. Modifications always go through router, selects sometimes. Modifications always error out if the query is unsupported, selects return NULL. Especially the error handling is a problem for the upcoming extension of prepared statement support. Split MultiRouterPlanCreate into CreateRouterPlan and CreateModifyPlan, and change them to not throw errors. Instead errors are now reported by setting the new MultiPlan->plannigError. Callers of router planner functionality now have to throw errors themselves if desired, but also can skip doing so. This is a pre-requisite for expanding prepared statement support. While touching all those lines, improve a number of error messages by getting them closer to the postgres error message guidelines.	2017-01-23 09:23:50 -08:00
Andres Freund	557ccc6fda	Support for deferred error messages. It can be useful, e.g. in the upcoming prepared statement support, to be able to return an error from a function that is not raised immediately, but can later be thrown. That allows e.g. to attempt to plan a statment using different methods and to create good error messages in each planner, but to only error out after all planners have been run. To enable that create support for deferred error messages that can be created (supporting errorcode, message, detail, hint) in one function, and then thrown in different place.	2017-01-23 09:23:50 -08:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Onder Kalaci	bd825be340	Improve heap access methods This commit improves heap access methods for reference table upgrade and colocation group modifications.	2017-01-20 14:53:29 +02:00
Metin Doslu	93e626c896	Refactor get_shard_id_for_distribution_column() and other minor changes	2017-01-20 14:38:01 +02:00
Metin Doslu	ed77260aa1	Return a deep copy shard list from ColocatedShardIntervalList()	2017-01-20 14:38:01 +02:00
Murat Tuncer	c12bd7b75e	Remove hint message from master_remove_node UDF Hint about master_disable_node was giving wrong impression to users. Removal is better than keeping it.	2017-01-18 22:33:00 -07:00
Eren Basak	e7c15ecc1f	Make `upgrade_to_reference_table` function MX-compatible	2017-01-18 16:49:50 +03:00
Eren Basak	56ca590daa	Propagate metadata changes for deleted reference table placements on master_remove_node call	2017-01-18 16:00:07 +03:00
Eren Basak	be78769ae4	Propagate new reference table placement metadata on `master_add_node`	2017-01-18 15:59:06 +03:00
Eren Basak	b686d9a025	Add Sequence Support for MX Tables This change adds support for serial columns to be used with MX tables. Prior to this change, sequences of serial columns were created in all workers (for being able to create shards) but never used. With MX, we need to set the sequences so that sequences in each worker create unique values. This is done by setting the MINVALUE, MAXVALUE and START values of the sequence.	2017-01-18 09:43:38 +03:00
Andres Freund	6972186652	Add ShardPlacement fields required for colocated placement connection mapping.	2017-01-16 13:42:54 -08:00
Burak Yucesoy	3315ae6142	Remove placement metadata of reference tables after master_remove_node With this change, we start to delete placement of reference tables at given worker node after master_remove_node UDF call. We remove placement metadata at master node but we do not drop actual shard from the worker node. There are two reasons for that decision, first, it is not critical to DROP the shards in the workers because Citus will ignore them as long as node is removed from cluster and if we add that node back to cluster we will DROP and recreate all reference tables. Second, if node is unreachable, it becomes complicated to cover failure cases and have a transaction support.	2017-01-16 11:24:56 +03:00
Murat Tuncer	77f8db6b14	Add view support Enables use views within distributed queries. User can create and use a view on distributed tables/queries as he/she would use with regular queries. After this change router queries will have full support for views, insert into select queries will support reading from views, not writing into. Outer joins would have a limited support, and would error out at certain cases such as when a view is in the inner side of the outer join. Although PostgreSQL supports writing into views under certain circumstances. We disallowed that for distributed views.	2017-01-13 09:39:42 +03:00
Onder Kalaci	aed5f817fa	Refactor CheckShardPlacements() and improve support for node removal This commit refactors CheckShardPlacements() so that it only considers modifyingConnection. Also, it skips nodes which are removed from the cluster.	2017-01-12 20:10:10 +02:00
Andres Freund	b813b39241	Cache ShardPlacements in metadata cache. So far we've reloaded them frequently. Besides avoiding that cost - noticeable for some workloads with large shard counts - it makes it easier to add information to ShardPlacements that help us make placement_connection.c colocation aware.	2017-01-10 18:14:18 -08:00
Andres Freund	8cb47195ba	Make LoadShardInterval() backed by the metadata cache. Doing so requires adding a mapping from shardId to the cache entries. For that metadata_cache.c now maintains an additional hashtable. That hashtable only references shard intervals in the dist table cache.	2017-01-10 17:00:19 -08:00
Andres Freund	f6e8647337	Split DistTableCacheEntry() into separate functions. Previously the function was getting too large. Thus this splits the function into separate parts for looking up the cache entry and building the cache contents.	2017-01-10 15:23:18 -08:00
Murat Tuncer	b93185d800	Add master_disable_node UDF We can now remove nodes from cluster regardless of them having an active shard placement.	2017-01-10 10:54:57 +03:00
Burak Yucesoy	9c9f479e4b	Replicate reference tables when new node is added With this change, we start to replicate all reference tables to the new node when new node is added to the cluster with master_add_node command. We also update replication factor of reference table's colocation group.	2017-01-05 14:30:41 +03:00
Onder Kalaci	6d050fd677	Use 2PC for reference table modification With this commit, we ensure that router executor always uses 2PC for reference table modifications and never mark the placements of it as INVALID.	2017-01-04 12:46:35 +02:00
Burak Yucesoy	31cd2357fe	Add upgrade_to_reference_table With this change we introduce new UDF, upgrade_to_reference_table, which can be used to upgrade existing broadcast tables reference tables. For upgrading, we require that given table contains only one shard.	2017-01-02 17:54:42 +02:00
Eren Basak	7e09bd6836	Error on Unsupported Features on Workers This change makes the metadata workers error out on unsupported commands.	2017-01-02 16:03:45 +03:00
Metin Doslu	1ddc70ca55	Add binary search capability to ShardIndex() Renamed FindShardIntervalIndex() to ShardIndex() and added binary search capability. It used to assume that hash partition tables are always uniformly distributed which is not true if upcoming tenant isolation feature is applied. This commit also reduces code duplication.	2016-12-30 18:55:34 +02:00
Marco Slot	92c7567008	Convert worker_transactions to new connection API	2016-12-23 16:14:29 +01:00
Eren Basak	bed2e353db	Propagate `mark_tables_colocated` changes in `pg_dist_partition` table to metadata workers.	2016-12-23 15:43:32 +03:00
Eren Basak	71d73ec5ff	Propagate DDL commands to metadata workers for MX tables	2016-12-23 15:43:32 +03:00
Marco Slot	11031bcf55	Enable evaluation of stable functions in INSERT..SELECT	2016-12-23 12:47:21 +01:00
Marco Slot	d745d7bf70	Add explicit RelationShards mapping to tasks	2016-12-23 10:23:43 +01:00
Marco Slot	6852f8a951	Add shard locking UDFs	2016-12-22 11:04:34 +01:00
Burak Yücesoy	501a2ecead	Add get_distribution_value_shardid UDF (#1048 ) * Add get_distribution_value_shardid UDF With this UDF users can now map given distribution value to shard id. We mostly hide shardids from users to prevent unnecessary complexity but some power users might need to know about which entry/value is stored in which shard for maintanence purposes. Signature of this UDF is as follows; bigint get_distribution_value_shardid(table_name regclass, distribution_value anyelement)	2016-12-22 12:17:08 +03:00
Onder Kalaci	9f0bd4cb36	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table whithout a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follows the same code-path with a single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support the uniqueness constraints for any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe join with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follows router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, schema support works on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Metin Doslu	20b8f1feeb	Refactor distribution column type check for colocation	2016-12-16 15:24:45 +02:00
Metin Doslu	e2d0bd38f2	Don't allow tables with different replication models to be colocated	2016-12-16 15:23:49 +02:00
Metin Doslu	86cca54857	Add colocate_with option to create_distributed_table() With this commit, we support three versions of colocate_with: i.default, ii.none and iii. a specific table name.	2016-12-16 14:53:35 +02:00
Metin Doslu	edbedbd744	Move colocation related functions to colocation_utils.c	2016-12-16 14:52:40 +02:00
Andres Freund	a77cf36778	Use connection_management.c from within connection_cache.c. This is a temporary step towards removing connection_cache.c.	2016-12-07 11:44:24 -08:00
Andres Freund	3505d431cd	Add initial helpers to make interactions with MultiConnection et al. easier. This includes basic infrastructure for logging of commands sent to remote/worker nodes. Note that this has no effect as of yet, since no callers are converted to the new infrastructure.	2016-12-07 11:44:24 -08:00
Andres Freund	3223b3c92d	Centralized Connection Lifetime Management. Connections are tracked and released by integrating into postgres' transaction handling. That allows to to use connections without having to resort to having to disable interrupts or using PG_TRY/CATCH blocks to avoid leaking connections. This is intended to eventually replace multi_client_executor.c and connection_cache.c, and to provide the basis of a centralized transaction management. The newly introduced transaction hook should, in the future, be the only one in citus, to allow for proper ordering between operations. For now this central handler is responsible for releasing connections and resetting XactModificationLevel after a transaction.	2016-12-07 11:43:18 -08:00
Andres Freund	883af02b54	Add some basic helpers to make use of dynahash hashtables easier.	2016-12-06 14:15:36 -08:00
Marco Slot	3d09a2e5c2	Use READ_UINT64_FIELD for placement ID in ReadShardPlacement	2016-12-05 17:22:23 +01:00
Eren Basak	fb88b167a7	Propagate node add/remove to the nodes with hasmetadata=true This change propagates the changes done by `master_add_node` and `master_remove_node` to the workers that contain metadata.	2016-12-02 14:43:32 +03:00
Murat Tuncer	b5c1ecb684	Fix failures during pg_upgrade - fix error in CitusHasBeenLoaded() - allow creation of pg_catalog tables during upgrade	2016-11-11 17:22:45 -08:00
Marco Slot	f6b3af7a49	Use co-located shard ID in multi_shard_transaction	2016-11-02 11:01:19 +01:00
Metin Doslu	c6f5cabbe3	Error on different shard placement count In ErrorIfShardPlacementsNotColocated(), while checking if shards are colocated, error out if matching shard intervals have different number of shard placements.	2016-10-26 18:46:05 +03:00
Metin Doslu	4e555880b7	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Marco Slot	275378aa45	Re-acquire metadata locks in RouterExecutorStart	2016-10-26 14:34:59 +02:00
Andres Freund	fcd150c7c8	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There's some locking issues remaining, but that's a a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Onder Kalaci	1673ea937c	Feature: INSERT INTO ... SELECT This commit adds INSERT INTO ... SELECT feature for distributed tables. We implement INSERT INTO ... SELECT by pushing down the SELECT to each shard. To compute that we use the router planner, by adding an "uninstantiated" constraint that the partition column be equal to a certain value. standard_planner() distributes that constraint to all the tables where it knows how to push the restriction safely. An example is that the tables that are connected via equi joins. The router planner then iterates over the target table's shards, for each we replace the "uninstantiated" restriction, with one that PruneShardList() handles. Do so by replacing the partitioning qual parameter added in multi_planner() with the current shard's actual boundary values. Also, add the current shard's boundary values to the top level subquery to ensure that even if the partitioning qual is not distributed to all the tables, we never run the queries on the shards that don't match with the current shard boundaries. Finally, perform the normal shard pruning to decide on whether to push the query to the current shard or not. We do not support certain SQLs on the subquery, which are described/commented on ErrorIfInsertSelectQueryNotSupported(). We also added some locking on the router executor. When an INSERT/SELECT command runs on a distributed table with replication factor >1, we need to ensure that it sees the same result on each placement of a shard. So we added the ability such that router executor takes exclusive locks on shards from which the SELECT in an INSERT/SELECT reads in order to prevent concurrent changes. This is not a very optimal solution, but it's simple and correct. The citus.all_modifications_commutative can be used to avoid aggressive locking. An INSERT/SELECT whose filters are known to exclude any ongoing writes can be marked as commutative. See RequiresConsistentSnapshot() for the details. We also moved the decison of whether the multiPlan should be executed on the router executor or not to the planning phase. This allowed us to integrate multi task router executor tasks to the router executor smoothly.	2016-10-26 10:01:00 +03:00
Brian Cloutier	2e96f6ab27	Fix crash when upgrading to Citus 6 Between restart (running the new code) and ALTER EXTENSION citus UPGRADE there was an inconsistency where we assumed that pg_dist_partition had the repmodel column set. Now we give it a default value if the column doesn't exist yet.	2016-10-24 15:18:29 +03:00
Burak Yucesoy	5a03acf2bf	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Metin Doslu	58ac477ffb	Change return type of BuildDistributionKeyFromColumnName() to Var * BuildDistributionKeyFromColumnName() always returns a Var pointer, so there is no reason to return a Node pointer instead of a Var pointer.	2016-10-20 10:59:31 +03:00
Metin Doslu	161093908e	Convert colocationid to uint32	2016-10-20 10:59:31 +03:00
Metin Doslu	40bdafa8d1	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Marco Slot	a497e7178c	Parallelise master_modify_multiple_shards	2016-10-19 08:33:08 +02:00
Marco Slot	9d98acfb6d	Move requiresMasterEvaluation from Task to Job	2016-10-19 08:23:06 +02:00
Andres Freund	ac14b2edbc	Support PostgreSQL 9.6 Adds support for PostgreSQL 9.6 by copying in the requisite ruleutils file and refactoring the out/readfuncs code to flexibly support the old-style copy/pasted out/readfuncs (prior to 9.6) or use extensible node APIs (in 9.6 and higher). Most version-specific code within this change is only needed to set new fields in the AggRef nodes we build for aggregations. Version-specific test output files were added in certain cases, though in most they were not necessary. Each such file begins by e.g. printing the major version in order to clarify its purpose. The comment atop citus_nodes.h details how to add support for new nodes for when that becomes necessary.	2016-10-18 16:23:55 -06:00
Eren Basak	cee7b54e7c	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Eren Basak	f3ede37c9f	Add hasmetadata column to pg_dist_node	2016-10-17 11:52:18 +03:00
Eren Basak	c7bf2021fa	Add metadata infrastructure for pg_dist_local_group table	2016-10-17 11:52:18 +03:00
Metin Doslu	d03a2af778	Add HAVING support This commit completes having support in Citus by adding having support for real-time and task-tracker executors. Multiple tests are added to regression tests to cover new supported queries with having support.	2016-10-13 15:47:53 +03:00
Eren Basak	ed3af403fd	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Andres Freund	982ad66753	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Brian Cloutier	9d6699b07c	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	32b2bd4ed8	Add replication model column to pg_dist_partition	2016-10-05 01:14:28 +02:00
Onder Kalaci	0993f2fb2c	Update ColocatedShardPlacementList() function name to ColocatedShardIntervalList() which was intented.	2016-10-04 09:51:42 +03:00
Jason Petersen	1c560dfa9c	Update ruleutils_95 with latest PostgreSQL changes Hand-applied changes from a diff I generated between 9.5.0 and 9.5.4.	2016-09-29 15:54:38 -06:00
Burak Yucesoy	1ee39eb098	Internal co-location API With this commit we introduce internal API for co-location related operations.	2016-09-29 11:56:53 +03:00
Jason Petersen	0caf0d95f1	Fix unique-violation-in-xact segfault An interaction between ReraiseRemoteError and DML transaction support causes segfaults: * ReraiseRemoteError calls PurgeConnection, freeing a connection... * That connection is still in the xactParticipantHash At transaction end, the memory in the freed connection might happen to pass the "is this connection OK?" check, causing us to try to send an ABORT over that connection. By removing it from the transaction hash before calling ReraiseRemoteError, we avoid this possibility.	2016-09-27 16:44:03 -06:00
Murat Tuncer	6317bbe9a8	Address feedback	2016-09-26 18:23:42 -06:00
Jason Petersen	74f4e0003b	Permit multiple DDL commands in a transaction Three changes here to get to true multi-statement, multi-relation DDL transactions (same functionality pre-5.2, with benefits of atomicity): 1. Changed the multi-shard utility hook to always run (consistency with router executor hook, removes ad-hoc "installed" boolean) 2. Change the global connection list in multi_shard_transaction to instead be a hash; update related functions to operate on global hash instead of local hash/global list 3. Remove check within DDL code to prevent subsequent DDL commands; place unset/reset guard around call to ConnectToNode to permit connecting to additional nodes after DDL transaction has begun In addition, code has been added to raise an error if a ROLLBACK TO SAVEPOINT is attempted (similar to router executor), and comprehensive tests execute all multi-DDL scenarios (full success, user ROLLBACK, any actual errors (say, duplicate index), partial failure (duplicate index on one node but not others), partial COMMIT (one node fails), and 2PC partial PREPARE (one node fails)). Interleavings with other commands (DML, \copy) are similarly all covered.	2016-09-08 22:35:55 -05:00
Eric B. Ridge	e80f1612a6	Add syscols in queries; extend relnames in indexes To permit use with ZomboDB (https://github.com/zombodb/zombodb), two changes were necessary: 1. Permit use of `tableoid` system column in queries 2. Extend relation names appearing in index expressions The first is accomplished by simply changing the deparse logic to allow system columns in queries destined for distributed tables. The latter was slightly more complex, given that DDL extension currently occurs on workers. But since indexes cannot reference tables other than the one being indexed, it is safe to look for any relation reference ending in a '*' character and extend their penultimate segments with a shard id. This change also adds an error to prevent users from distributing any relations using the WITH (OIDS) feature, which is unsupported.	2016-09-07 11:54:55 -05:00
Jason Petersen	850c51947a	Re-permit DDL in transactions, selectively Recent changes to DDL and transaction logic resulted in a "regression" from the viewpoint of users. Previously, DDL commands were allowed in multi-command transaction blocks, though they were not processed in any actual transactional manner. We improved the atomicity of our DDL code, but added a restriction that DDL commands themselves must not occur in any BEGIN/END transaction block. To give users back the original functionality (and improved atomicity) we now keep track of whether a multi-command transaction has modified data (DML) or schema (DDL). Interleaving the two modification types in a single transaction is disallowed. This first step simply permits a single DDL command in such a block, admittedly an incomplete solution, but one which will permit us to add full multi-DDL command support in a subsequent commit.	2016-08-30 20:37:19 -06:00
Andres Freund	63fb8311cb	Don't access pg_dist_partition->partkey directly, use heap_getattr(). Text datums can't be directly accessed via the struct equivalence trick used to access catalogs. That's because, as an optimization, they're sometimes aligned to 1 byte ("text"'s alignment), and sometimes to 4 bytes. That depends on it being a short varlena (cf. VARATT_NOT_PAD_BYTE) or not. In the case at hand here, partkey became longer than 127 characters - the boundary for short varlenas (cf. VARATT_CAN_MAKE_SHORT()). Thus it became 4 byte/int aligned. Which lead to the direct struct access accessing the wrong data. The fix is simply to never access partkey that way - to enforce that, hide partkey ehind the usual ifdef. Fixes: #674	2016-07-29 10:02:36 -07:00
Jason Petersen	bedf53d566	Quick fix for possible segfault in PurgeConnection Now that connections can be acquired without going through the cache, we have to handle cases where functions assume the cache has been ini- tialized.	2016-07-29 00:12:56 -06:00
Jason Petersen	abe7304898	Support SERIAL/BIGSERIAL non-partition columns This adds support for SERIAL/BIGSERIAL column types. Because we now can evaluate functions on the master (during execution), adding this is a matter of ensuring the table creation step works properly. To accomplish this, I've added some logic to detect sequences owned by a table (i.e. those related to its columns). Simply creating a sequence and using it in a default value is insufficient; users who do so must ensure the sequence is owned by the column using it. Fortunately, this is exactly what SERIAL and BIGSERIAL do, which is the use case we're targeting with this feature. While testing this, I found that worker_apply_shard_ddl_command actually adds shard identifiers to sequence names, though I found no places that use or test this path. I removed that code so that sequence names are not mutated and will match those used by a SERIAL default value expression. Our use of the new-to-9.5 CREATE SEQUENCE IF NOT EXISTS syntax means we are dropping support for 9.4 (which is being done regardless, but makes this change simpler). I've removed 9.4 from the Travis build matrix. Some edge cases are possible in ALTER SEQUENCE, COPY FROM (on workers), and CREATE SEQUENCE OWNED BY. I've added errors for each so that users understand when and why certain operations are prohibited.	2016-07-28 23:55:40 -06:00
Murat Tuncer	c20080992d	Remove PostgreSQL 9.4 support	2016-07-26 20:16:09 +03:00
Jason Petersen	5d525fba24	Permit "single-shard" transactions Allows the use of modification commands (INSERT/UPDATE/DELETE) within transaction blocks (delimited by BEGIN and ROLLBACK/COMMIT), so long as all modifications hit a subset of nodes involved in the first such com- mand in the transaction. This does not circumvent the requirement that each individual modification command must still target a single shard. For instance, after sending BEGIN, a user might INSERT some rows to a shard replicated on two nodes. Subsequent modifications can hit other shards, so long as they are on one or both of these nodes. SAVEPOINTs are supported, though if the user actually attempts to send a ROLLBACK command that specifies a SAVEPOINT they will receive an ERROR at the end of the topmost transaction. Placements are only marked inactive if at least one replica succeeds in a transaction where others fail. Non-atomic behavior is possible if the shard targeted by the initial modification within a transaction has a higher replication factor than another shard within the same block and a node with the latter shard has a failure during the COMMIT phase. Other methods of denoting transaction blocks (multi-statement commands sent all at once and functions written in e.g. PL/pgSQL or other such languages) are not presently supported; their treatment remains the same as before.	2016-07-21 15:57:22 -06:00
Burak Yucesoy	20debfc0ee	Fix COUNT DISTINCT approximation with schema Fixes #555 Before this change, we were resolving HLL function and type Oid without qualified name. Now we find the schema name where HLL objects are stored and generate qualified names for each objects. Similar fix is also applied for cstore_table_size function call.	2016-07-21 17:29:18 +03:00
Burak Yucesoy	a0e8f9eb64	Always schema-prefix worker queries Fixes #215 Fixes #267 Fixes #502 Fixes #556 Fixes #557 Fixes #560 Fixes #568 Fixes #623 Fixes #624 With this change we schema-prefix table names, operator names and composite types.	2016-07-20 10:42:24 +03:00
Eren	3eaff48114	Propagate DDL Commands with 2PC Fixes #513 This change modifies the DDL Propagation logic so that DDL queries are propagated via 2-Phase Commit protocol. This way, failures during the execution of distributed DDL commands will not leave the table in an intermediate state and the pending prepared transactions can be commited manually. DDL commands are not allowed inside other transaction blocks or functions. DDL commands are performed with 2PC regardless of the value of `citus.multi_shard_commit_protocol` parameter. The workflow of the successful case is this: 1. Open individual connections to all shard placements and send `BEGIN` 2. Send `SELECT worker_apply_shard_ddl_command(<shardId>, <DDL Command>)` to all connections, one by one, in a serial manner. 3. Send `PREPARE TRANSCATION <transaction_id>` to all connections. 4. Sedn `COMMIT` to all connections. Failure cases: - If a worker problem occurs before sending of all DDL commands is finished, then all changes are rolled back. - If a worker problem occurs after all DDL commands are sent but not after `PREPARE TRANSACTION` commands are finished, then all changes are rolled back. However, if a worker node is failed, then the prepared transactions in that worker should be rolled back manually. - If a worker problem occurs during `COMMIT PREPARED` statements are being sent, then the prepared transactions on the failed workers should be commited manually. - If master fails before the first 'PREPARE TRANSACTION' is sent, then nothing is changed on workers. - If master fails during `PREPARE TRANSACTION` commands are being sent, then the prepared transactions on workers should be rolled back manually. - If master fails during `COMMIT PREPARED` or `ROLLBACK PREPARED` commands are being sent, then the remaining prepared transactions on the workers should be handled manually. This change also helps with #480, since failed DDL changes no longer mark failed placements as inactive.	2016-07-19 10:44:11 +03:00
Andres Freund	4cf0a4e48e	citus_indent fixups	2016-07-13 11:45:51 -07:00
Brian Cloutier	0cad3b22cc	Simplify code and fix include guards in citus_clauses	2016-07-13 11:45:51 -07:00
Brian Cloutier	af9515f669	Only reparse queries if the planner flags them for reparsing	2016-07-13 11:45:51 -07:00
Brian Cloutier	4820366a6f	citus_indent and some renaming	2016-07-13 11:45:51 -07:00
Brian Cloutier	ae91768c96	Evaluate functions on the master - Enables using VOLATILE functions (like nextval()) in INSERT queries - Enables using STABLE functions (like now()) targetLists and joinTrees UPDATE and INSERT can now contain non-immutable functions. INSERT can contain any kind of expression, while UPDATE can contain any STABLE function, so long as a Var is not passed into the STABLE function, even indirectly. UPDATE TagetEntry's can now also include Vars. There's an exception, CASE/COALESCE statements may not contain mutable functions. Functions calls in master_modify_multiple_shards are also evaluated.	2016-07-13 11:45:51 -07:00
Metin Doslu	e5ecf92328	Add null check to SqlStateMatchesCategory() Fixes #634	2016-07-01 12:28:46 -07:00
Jason Petersen	16fc92bf6b	Purge connection if re-raising error The only way we re-raise an error is if the raiseError flag is true, so might as well purge connection in that block rather than independently checking errorLevel.	2016-06-21 09:51:12 -06:00
Jason Petersen	48f4e5d1a5	Make ReportRemoteError's CONTEXT style-compliant There's not a ton of documentation about what CONTEXT lines should look like, but this seems like the most dominant pattern. Similarly, users should expect lowercase, non-period strings.	2016-06-07 12:47:16 -06:00
Jason Petersen	9ba02928ac	Refactor ReportRemoteError to remove boolean arg Broke it into two explicitly-named functions instead: WarnRemoteError and ReraiseRemoteError.	2016-06-07 12:38:32 -06:00
Metin Doslu	7d0c90b398	Fail fast on constraint violations in router executor	2016-06-07 18:11:17 +03:00
Metin Doslu	15eed396b3	Update ereport format	2016-06-07 15:58:32 +03:00
eren	132d9212d0	ADD master_modify_multiple_shards UDF Fixes #10 This change creates a new UDF: master_modify_multiple_shards Parameters: modify_query: A simple DELETE or UPDATE query as a string. The UDF is similar to the existing master_apply_delete_command UDF. Basically, given the modify query, it prunes the shard list, re-constructs the query for each shard and sends the query to the placements. Depending on the value of citus.multi_shard_commit_protocol, the commit can be done in one-phase or two-phase manner. Limitations: * It cannot be called inside a transaction block * It only be called with simple operator expressions (like Single Shard Modify) Sample Usage: ``` SELECT master_modify_multiple_shards( 'DELETE FROM customer_delete_protocol WHERE c_custkey > 500 AND c_custkey < 500'); ```	2016-05-26 17:30:35 +03:00
eren	ab240a7d4c	Rename copy_transaction_manager This change renames the distributed transaction manager parameter from citus.copy_transaction_manager to citus.multi_shard_commit_protocol. Distributed transaction manager has been used only by the COPY on hash partitioned tables but it can be used by upcoming features so, we needed to rename so that its name do not contain a reference to COPY. The change also includes renames like transaction_manager_options to commit_protocol_options and TRANSACTION_MANAGER_1PC to COMMIT_PROTOCOL_1PC. With this change, declaration of MultiShardCommitProtocol (was CopyTransactionManager) is moved from multi_copy.c to multi_transaction.c.	2016-04-28 15:12:50 +03:00
Andres Freund	7c281fbe07	Add pg_get_table_grants() function and support extending GRANTs.	2016-04-27 10:28:25 -07:00
Andres Freund	25615ee9d7	Add CitusExtensionOwner(), to execute some priviledged operations under. There exist some operations we have to execute with elevated privileges. The most expedient user for that is the user owning the citusdb extension.	2016-04-27 10:26:08 -07:00
Andres Freund	12a246de37	Perform permission checks in functions manipulating distributed tables. Previously several commands, amongst them commands like master_create_distributed_table(), were allowed for everyone. That's not good: Even though citus currently requires superuser permissions, we shouldn't allow non-superusers to perform actions as sensitive as making a table distributed. There's no checks on the worker_* functions, as these usually just punt the action to underlying postgres functionality, which then perform the necessary checks.	2016-04-27 10:22:20 -07:00
Andres Freund	42d232c0e8	Use the current session's username when connecting to worker nodes. So far we've always used libpq defaults when connecting to workers; bar special environment variables being set that'll always be the user that started the server. That's not desirable because it prevents using users with fewer privileges. Thus change the various APIs creating connections to workers to always use usernames. That means: 1) MultiClientConnect() needs to, optionally, accept a username 2) GetOrEstablishConnection(), including the underlying cache, need to use the current user as part of the connection cache key. That way connections for separate users are distinct, and we always use one with the correct authorization. 3) The task tracker needs to keep track of the username associated with a task, so it can use it when establishing connections outside the originating session.	2016-04-27 10:00:08 -07:00
Onder Kalaci	108114ab99	Apply final code review feedback - Fix o(n^2) loop to o(n) - Collapse two if statements into a single one - Some coding conventions feedback	2016-04-27 10:36:03 +03:00
Onder Kalaci	c4b783b70b	Fix Merge Conflict This commit fixes merge conflicts.	2016-04-26 11:18:47 +03:00
Onder Kalaci	6c7abc2ba5	Add fast shard pruning path for INSERTs on hash partitioned tables This commit adds a fast shard pruning path for INSERTs on hash-partitioned tables. The rationale behind this change is that if there exists a sorted shard interval array, a single index lookup on the array allows us to find the corresponding shard interval. As mentioned above, we need a sorted (wrt shardminvalue) shard interval array. Thus, this commit updates shardIntervalArray to sortedShardIntervalArray in the metadata cache. Then uses the low-level API that is defined in multi_copy to handle the fast shard pruning. The performance impact of this change is more apparent as more shards exist for a distributed table. Previous implementation was relying on linear search through the shard intervals. However, this commit relies on constant lookup time on shard interval array. Thus, the shard pruning becomes less dependent on the shard count.	2016-04-26 11:16:00 +03:00
Brian Cloutier	7a6d689259	Clear metadata_cache upon DROP EXTENSION When we notice that pg_dist_partition is being invalidated we assume that the citus extension is being dropped and drop state such as extensionLoaded and the cached oids of all the metadata tables. This frees the user from needing to reconnect after running DROP EXTENSION, so we also no longer send a warning message.	2016-04-22 07:25:49 -07:00
Metin Doslu	132a77f992	Add COPY support on master node for append partitioned relations	2016-04-19 21:57:59 +03:00
eren	64aefed46f	Fix SELECT problem with no target list Prior to this change, performing a SELECT query without a target list caused backend to crash. Sample Query: SELECT FROM github_events; (without any * before FROM) PostgreSQL: ``` -- (39599 rows) ``` Citus: ``` server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. The connection to the server was lost. Attempting reset: Failed. !> ``` The problem was an unnecessary Assert on column list in SetRangeTblExtraData(citus_nodefuncs.c)	2016-04-13 11:08:14 +03:00
Metin Doslu	1150ce6414	Send COPY rows in binary format	2016-04-12 20:22:31 +02:00
Marco Slot	d25ee8fbd8	Support for COPY FROM, based on pg_shard PR by Postres Pro	2016-04-12 20:22:31 +02:00
Jason Petersen	423e6c8ea0	Update copyright dates Fixed configure variable and updated all end dates to 2016.	2016-03-23 17:14:37 -06:00
Marco Slot	75a141a7c6	Merge remote-tracking branch 'origin/master' into feature/drop_shards_on_drop_table	2016-02-17 22:52:58 +01:00
Murat Tuncer	3528d7ce85	Merge from master branch into feature/citusdb-to-citus	2016-02-17 14:49:01 +02:00
Jason Petersen	8ad5b09251	Merge pull request #344 from citusdata/fix_shard_lock_acquisition#342 Ensure router executor acquires proper shard lock cr: @onderkalaci	2016-02-16 16:43:39 -07:00
Jason Petersen	0d196d1bf4	Ensure router executor acquires proper shard lock Though Citus' Task struct has a shardId field, it doesn't have the same semantics as the one previously used in pg_shard code. The analogous field in the Citus Task is anchorShardId. I've also added an argument check to the relevant locking function to catch future locking attempts which pass an invalid argument.	2016-02-16 11:20:18 -07:00
Marco Slot	37f580f9c7	Trim comment about invalidating dropped relations	2016-02-16 14:04:12 +01:00
Marco Slot	2af6797c04	Perform relcache invalidation in CitusInvalidateRelcacheByRelid	2016-02-16 12:59:38 +01:00
Jason Petersen	74372f70e0	Omit get_extension_schema from formatting It exactly matches the implementation in extension.c.	2016-02-15 23:29:33 -07:00
Jason Petersen	fdb37682b2	First formatting attempt Skipped csql, ruleutils, readfuncs, and functions obviously copied from PostgreSQL. Seeing how this looks, then continuing.	2016-02-15 23:29:32 -07:00
Murat Tuncer	55c44b48dd	Changed product name to citus All citusdb references in - extension, binary names - file headers - all configuration name prefixes - error/warning messages - some functions names - regression tests are changed to be citus.	2016-02-15 16:04:31 +02:00
Jason Petersen	4494e57bbd	Rename GetConnection to address name conflict The postgres_fdw extension has an extern function with an identical signature, which can cause problems when both extensions are loaded. A simple rename can fix this for now (this is the only function with) such a conflict.	2016-02-12 13:35:02 -07:00
Onder Kalaci	136306a1fe	Initial commit of Citus 5.0	2016-02-11 04:05:32 +02:00

... 6 7 8 9 10 ...

725 Commits (62a00b1b3495d635b7af320802329a8f12b9299d)