citus

Commit Graph

Author	SHA1	Message	Date
Philip Dubé	363409a0c2	Propagate REINDEX TABLE & REINDEX INDEX	2019-09-27 18:14:53 +00:00
Hanefi Onaldi	66b9f2e887	Deparsing and qualifiying for FUNCTION/PROCEDURE statements (#3014 ) This PR aims to add all the necessary logic to qualify and deparse all possible `{ALTER\|DROP} .. {FUNCTION\|PROCEDURE}` queries. As Procedures are introduced in PG11, the code contains many PG version checks. I tried my best to make it easy to clean up once we drop PG10 support. Here are some caveats: - I assumed that the parse tree is a valid one. There are some queries that are not allowed, but still are parsed successfully by postgres planner. Such queries will result in errors in execution time. (e.g. `ALTER PROCEDURE p STRICT` -> `STRICT` action is valid for functions but not procedures. Postgres decides to parse them nevertheless.)	2019-09-27 19:02:52 +02:00
Marco Slot	2868e02a3d	Implement SELECT function call delegation. When a function is marked as colocated with a distributed table, we try delegating queries of kind "SELECT func(...)" to workers. We currently only support this simple form, and don't delegate forms like "SELECT f1(...), f2(...)", "SELECT f1(...) FROM ...", or function calls inside transactions. As a side effect, we also fix the transactional semantics of DO blocks. Previously we didn't consider a DO block a multi-statement transaction. Now we do. Co-authored-by: Marco Slot <marco@citusdata.com> Co-authored-by: serprex <serprex@users.noreply.github.com> Co-authored-by: pykello <hadi.moshayedi@microsoft.com>	2019-09-27 09:13:25 -07:00
Halil Ozan Akgul	824a69587c	Created isolation tests for insert select on MX	2019-09-26 17:40:36 +03:00
Halil Ozan Akgul	d56ab6274c	Created isolation tests for drop, alter, index and select for update on MX.	2019-09-26 10:47:14 +03:00
Halil Ozan Akgul	d426fb2159	Created isolation tests for truncate on MX.	2019-09-25 16:51:20 +03:00
Halil Ozan Akgul	62b6852923	Created isolation tests for copy on MX.	2019-09-25 15:36:05 +03:00
Onder Kalaci	219f3676a0	Improve some tests around local execution and CTE inlining on pg 12	2019-09-25 10:53:19 +02:00
Philip Dubé	4f60e3a149	Feedback	2019-09-24 17:31:09 +00:00
Marco Slot	c1e43b25da	Use the new create_distributed_function API in some call tests	2019-09-24 17:31:09 +00:00
Philip Dubé	90e1f1442a	Annotated tests for multi_mx_call. Co-authored-by: pykello <hadi.moshayedi@microsoft.com>	2019-09-24 17:31:09 +00:00
Philip Dubé	c95d46b4f3	Extend multi_mx_call with some of Hadi's suggestions for better test coverage	2019-09-24 17:31:09 +00:00
Philip Dubé	16b8d17aba	Test: multi_mx_call	2019-09-24 17:31:09 +00:00
Onder Kalaci	18de78f386	Relax the colocation checks for distributed functions As long as the types can be coerced, it is safe to pushdown functions.	2019-09-24 16:31:08 +02:00
Jelte Fennema	0f90c2497e	Use synchronous replication for follower tests	2019-09-24 15:51:49 +02:00
Jelte Fennema	78ccc323d1	Remove stuff needed only for PG 9.6 from test runner	2019-09-24 15:51:49 +02:00
Jelte Fennema	bd2103e597	Remove flappy test	2019-09-24 14:15:33 +02:00
Jelte Fennema	897ec1bdeb	Revert "Temporarily disable flappy test" This reverts commit `4b4459ee62`.	2019-09-24 14:15:33 +02:00
Marco Slot	42be8afd74	Swap pg_dist_node groupid and nodeid sequences	2019-09-24 12:03:44 +02:00
Marco Slot	0dea485c68	Fix misspelling in multi_colocation_utils	2019-09-24 11:27:30 +02:00
Marco Slot	4b4459ee62	Temporarily disable flappy test	2019-09-24 11:02:34 +02:00
Hadi Moshayedi	48078a30e6	Fix wait_until_metadata_sync() for postgres 12. Postgres 12 now has an assertion that the calls to WaitLatchOrSocket handle postmaster death.	2019-09-23 14:15:35 -07:00
Philip Dubé	06faba91c0	Include ifdefs for pg12 API changes, update local_shard_executiuon test to avoid CTE inlining	2019-09-23 20:22:35 +00:00
Onder Kalaci	d37745bfc7	Sync metadata to worker nodes after create_distributed_function Since the distributed functions are useful when the workers have metadata, we automatically sync it. Also, after master_add_node(). We do it lazily and let the deamon sync it. That's mainly because the metadata syncing cannot be done in transaction blocks, and we don't want to add lots of transactional limitations to master_add_node() and create_distributed_function().	2019-09-23 18:30:53 +02:00
Marco Slot	5f23b951c7	Support serial and smallserial when syncing metadata	2019-09-23 17:39:21 +02:00
Marco Slot	e58d76c5f6	Fix assert failure in bare SELECT FROM reference table FOR UPDATE in MX	2019-09-23 17:00:09 +02:00
SaitTalhaNisanci	71e7047e65	Enhance pg upgrade tests (#3002 ) * Enhance pg upgrade tests * Add a specific upgrade test for pg_dist_partition We store the index of distribution column, and when a column with an index that is smaller than distribution column index is dropped before an upgrade, the index should still match the distribution column after an upgrade	2019-09-23 17:37:14 +03:00
Marco Slot	d85d77634d	Handle anonymous composite types on the target list	2019-09-23 14:53:02 +02:00
Halil Ozan Akgul	b55b275a30	Created isolation tests for update, delete and upsert on MX	2019-09-23 14:13:29 +03:00
Onder Kalaci	d7e2968120	Add parameters to create_distributed_function() With this commit, we're changing the API for create_distributed_function() such that users can provide the distribution argument and the colocation information.	2019-09-22 21:53:33 +02:00
Onder Kalaci	e1fe8d60b4	Make sure that functions are also listed in SupportedDependencyByCitus We've recently merged two commits, `db5d03931d` and `eccba1d4c3`, which actually operates on the very similar places. It turns out that we've an integration issue, where master_add_node() fails to replicate the functions to newly added node.	2019-09-20 11:02:50 +02:00
Nils Dijk	72015faeb2	fix disable_object_propagation test for pg12	2019-09-19 17:40:24 +02:00
Hanefi Onaldi	ed11b9590c	Add distributed func creation queries in dependency replication logic	2019-09-18 20:07:45 +03:00
Hadi Moshayedi	d2f2acc4b2	Make master_update_node citus-ha friendly.	2019-09-18 09:32:54 -07:00
Hadi Moshayedi	76f3933b05	Add metadatasynced, and sync on master_update_node() Co-authored-by: pykello <hadi.moshayedi@microsoft.com> Co-authored-by: serprex <serprex@users.noreply.github.com>	2019-09-18 09:32:54 -07:00
Nils Dijk	db5d03931d	Feature disable object propagation (#2986 ) DESCRIPTION: Provide a GUC to turn of the new dependency propagation functionality In the case the dependency propagation functionality introduced in 9.0 causes issues to a cluster of a user they can turn it off almost completely. The only dependency that will still be propagated and kept track of is the schema to emulate the old behaviour. GUC to change is `citus.enable_object_propagation`. When set to `false` the functionality will be mostly turned off. Be aware that objects marked as distributed in `pg_dist_object` will still be kept in the catalog as a distributed object. Alter statements to these objects will not be propagated to workers and may cause desynchronisation.	2019-09-18 17:16:22 +02:00
Philip Dubé	ac14f1dd49	pg12 doesn't support client_min_messages as 'fatal'	2019-09-17 20:37:06 +00:00
Nils Dijk	2b7f5552c8	Fix: rename remote type on conflict (#2983 ) DESCRIPTION: Rename remote types during type propagation To prevent data to be destructed when a remote type differs from the type on the coordinator during type propagation we wanted to rename the type instead of `DROP CASCADE`. This patch removes the `DROP` logic and adds the creation of a rename statement to a free name.	2019-09-17 18:54:10 +02:00
Nils Dijk	0a3152d09c	Add feature flag to turn off create type propagation (#2982 ) DESCRIPTION: Add feature flag to turn off create type propagation When `citus.enable_create_type_propagation` is set to `false` citus will not propagate `CREATE TYPE` statements to the workers. Types are still distributed when tables that depend on these types are distributed.	2019-09-17 15:50:06 +02:00
Halil Ozan Akgul	5333296a54	Created isolation tests for select on MX	2019-09-17 12:44:45 +03:00
Halil Ozan Akgul	7cde785031	Added the MX isolation tests for insert	2019-09-16 15:49:43 +03:00
Hanefi Onaldi	8f2a3a0604	Introduce create_distributed_function(regproc) UDF (#2961 ) This PR aims to add the minimal set of changes required to start distributing functions. You can use create_distributed_function(regproc) UDF to distribute a function. SELECT create_distributed_function('add(int,int)'); The function definition should include the param types to properly identify the correct function that we wish to distribute	2019-09-13 23:27:46 +03:00
Philip Dubé	fb10edcb9d	isolation_add_node_vs_reference_table_operations: test add in parallel with create_reference_table	2019-09-13 18:13:58 +00:00
Philip Dubé	5e5f4628a0	Fix pg12 compile	2019-09-13 17:25:30 +00:00
Jelte Fennema	4bbf65d913	Change SQL migration build process for easier reviews (#2951 ) @thanodnl told me it was a bit of a problem that it's impossible to see the history of a UDF in git. The only way to do so is by reading all the sql migration files from new to old. Another problem is that it's also hard to review the changed UDF during code review, because to find out what changed you have to do the same. I thought of a IMHO better (but not perfect) way to handle this. We keep the definition of a UDF in sql/udfs/{name_of_udf}/latest.sql. That file we change whenever we need to make a change to the the UDF. On top of that you also make a snapshot of the file in sql/udfs/{name_of_udf}/{migration-version}.sql (e.g. 9.0-1.sql) by copying the contents. This way you can easily view what the actual changes were by looking at the latest.sql file. There's still the question on how to use these files then. Sadly postgres doesn't allow inclusion of other sql files in the migration sql file (it does in psql using \i). So instead I used the C preprocessor+ make to compile a sql/xxx.sql to a build/sql/xxx.sql file. This final build/sql/xxx.sql file has every occurence of #include "somefile.sql" in sql/xxx.sql replaced by the contents of somefile.sql.	2019-09-13 18:44:27 +02:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used by how schema's get created right before we are going to create a table in that schema, for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is benificial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is; you have an existing postgres server that needs to scale out. By adding the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Lets assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 AS (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schema's on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types are depending them self on other objects they will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types, when later asked for the ddl commands for array types we return `NIL` (empty list) which makes that the object will not be recorded as distributed, (its an internal type, dependant on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after running locally makes that we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (eg. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulate changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulate `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. Enum types can not be changed transactionally. When the execution on a worker fails a warning will be shown to the user the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Jelte Fennema	e4cfea3751	Correctly add schema when distributing sequence definitons Fixes 2958	2019-09-13 17:19:35 +02:00
Jelte Fennema	579a40dfa5	Add make check-base-mx	2019-09-13 17:19:35 +02:00
Halil Ozan Akgul	4d34b79b87	There were two multi insert - single insert tests but no multi insert - multi insert test. Fixed it.	2019-09-13 16:09:11 +03:00
Nils Dijk	05f0668cdc	Fix: schema leak onto create index statement cache (#2964 ) DESCRIPTION: Fix schema leak on CREATE INDEX statement When a CREATE INDEX is cached between execution we might leak the schema name onto the cached statement of an earlier execution preventing the right index to be created. Even though the cache is cleared when the search_path changes we can trigger this behaviour by having the schema already on the search path before a colliding table is created in a schema earlier on the `search_path`. When calling an unqualified create index via a function (used to trigger the caching behaviour) we see that the index is created on the wrong table after the schema leaked onto the statement. By copying the complete `PlannedStmt` and `utilityStmt` during our planning phase for distributed ddls we make sure we are not leaking the schema name onto a cached data structure. Caveat; COPY statements already have a lot of parsestree copying ongoing without directly putting it back on the `pstmt`. We should verify that copies modify the statement and potentially copy the complete `pstmt` there already.	2019-09-13 14:04:23 +02:00

1 2 3 4 5 ...

1017 Commits (363409a0c2aec722e3311f032e5d4137de58ec85)