Commit Graph

2842 Commits (86a4c0925bbbe9fdfb27103f9d223c0a2bc0dbf8)

Author SHA1 Message Date
Jelte Fennema af9fb9f785
Fix depend arguments for OSX clang cpp (#2978)
A better fix for #2975. Apparently, for OSX cpp, -MF and -MT must not have a
space between the flag and its value. Without the space it still works for
gcc as well.
2019-09-16 15:22:07 +02:00
Halil Ozan Akgül 301febbd2c
Merge pull request #2967 from citusdata/mx_isolation_test_insert
MX isolation test insert
2019-09-16 15:56:09 +03:00
Halil Ozan Akgul 7cde785031 Added the MX isolation tests for insert 2019-09-16 15:49:43 +03:00
Jelte Fennema 31fac3b90e
Don't generate SQL files twice by not making directories a target (#2977) 2019-09-16 12:53:17 +02:00
Önder Kalacı 13947a63ce Don't use flags that mac clang doesn't support as it does on other platforms (#2975) 2019-09-16 11:44:06 +02:00
Hanefi Onaldi 8f2a3a0604
Introduce create_distributed_function(regproc) UDF (#2961)
This PR aims to add the minimal set of changes required to start
distributing functions. You can use the create_distributed_function(regproc)
UDF to distribute a function.

    SELECT create_distributed_function('add(int,int)');

The function definition should include the parameter types to properly
identify the correct function that we wish to distribute.
2019-09-13 23:27:46 +03:00
Philip Dubé 012595da11
Merge pull request #2927 from citusdata/fix_2909
ActivePrimaryNodeList: Lock DistNodeRelationId()
2019-09-13 18:22:23 +00:00
Philip Dubé fb10edcb9d isolation_add_node_vs_reference_table_operations: test add in parallel with create_reference_table 2019-09-13 18:13:58 +00:00
Philip Dubé 492d1b2cba ActivePrimaryNodeList: add lockMode parameter 2019-09-13 17:44:56 +00:00
Philip Dubé 482f3b1474
Merge pull request #2971 from citusdata/fix-pg12
Fix pg12 compile
2019-09-13 17:34:57 +00:00
Philip Dubé 5e5f4628a0 Fix pg12 compile 2019-09-13 17:25:30 +00:00
Jelte Fennema 4bbf65d913
Change SQL migration build process for easier reviews (#2951)
@thanodnl told me it was a bit of a problem that it's impossible to see
the history of a UDF in git. The only way to do so is by reading all the
sql migration files from new to old. Another problem is that it's also
hard to review a changed UDF during code review, because to find out
what changed you have to do the same. I thought of an IMHO better (but
not perfect) way to handle this.

We keep the definition of a UDF in sql/udfs/{name_of_udf}/latest.sql.
We change that file whenever we need to change the UDF. On top of that,
you also make a snapshot of the file in
sql/udfs/{name_of_udf}/{migration-version}.sql (e.g. 9.0-1.sql) by
copying its contents. This way you can easily view the actual changes
by looking at the git history of the latest.sql file.

There's still the question of how to use these files. Sadly, postgres
doesn't allow inclusion of other sql files in the migration sql file
(psql does, using \i). So instead I used the C preprocessor + make to
compile a sql/xxx.sql file to a build/sql/xxx.sql file. This final
build/sql/xxx.sql file has every occurrence of #include "somefile.sql" in
sql/xxx.sql replaced by the contents of somefile.sql.
2019-09-13 18:44:27 +02:00
Nils Dijk 2879689441
Distribute Types to worker nodes (#2893)
DESCRIPTION: Distribute Types to worker nodes

When to propagate
==============

There are two logical moments at which types could be distributed to the worker nodes:
 - when they get used (just-in-time distribution)
 - when they get created (proactive distribution)

Just-in-time distribution follows the model of how schemas get created right before we create a table in that schema; for types, this is when a table uses the type as a column type.

Proactive distribution is suitable for situations where it is beneficial to have the type on the worker nodes directly. The type can later be used in queries where an intermediate result gets created with a cast to this type.

Just-in-time creation is always the last resort: you cannot create a distributed table before the type gets created. A good example use case: you have an existing postgres server that needs to scale out. You add the citus extension, add some nodes to the cluster, and distribute the table. The type was created before citus existed, so there was no moment at which citus could have propagated the creation of the type.

Proactive distribution is almost always a good option. Types are not resource-intensive objects; there is no performance overhead to having hundreds of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite), they just work.

There is, however, a moment when proactive type distribution is not beneficial: in transactions where the type is used in a distributed table.

Let's assume the following transaction:

```sql
BEGIN;
CREATE TYPE tt1 AS (a int, b int);
CREATE TABLE t1 (a int PRIMARY KEY, b tt1);
SELECT create_distributed_table('t1', 'a');
\copy t1 FROM bigdata.csv
COMMIT;
```

Types are node-scoped objects, meaning the type exists once per worker. Shards, however, perform best when they are created over their own connections. For the type to be visible on all connections, it needs to be created and committed before we try to create the shards. Here the just-in-time approach is most beneficial, and it follows how we create schemas on the workers. Outside of a transaction block we just use one connection to propagate the creation.

How propagation works
=================

Just in time
-----------

Just-in-time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This makes sure that any object being distributed by citus that depends on a type will now cascade into types. When types themselves depend on other objects, those objects get created first.

Creation works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands`, which dispatches types to `CreateTypeDDLCommandsIdempotent`.

For correct walking of the graph we follow array types; when later asked for the ddl commands for an array type, we return `NIL` (the empty list), which means the object will not be recorded as distributed (it's an internal type, dependent on the user type).

Proactive distribution
---------------------

When the user creates a type (composite or enum), a hook runs in `multi_ProcessUtility` after the command has been applied locally. Running after the local application means we already have an `ObjectAddress` for the type, which is required to mark the type as distributed.

Keeping the type up to date
====================

For types that are recorded in `pg_dist_object` (i.e. `IsObjectDistributed` returns true for the `ObjectAddress`) we intercept the utility commands that alter the type.
 - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulates changes to the fields of a composite type.
 - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulates `DROP TYPE`.
 - `AlterEnumStmt` encapsulates changes to enum values.
    Enum types cannot be changed transactionally. When the execution on a worker fails, a warning is shown telling the user that the propagation was incomplete due to a worker communication failure, together with an idempotent command to re-execute once worker communication is fixed.

Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan for how to apply it on the workers; this plan is executed after we have applied the statement locally.

For types that have already been distributed, all changes need to be done in the same transaction, and they will fail with an error if parallel queries have already been executed in that transaction, much like foreign keys to reference tables.
2019-09-13 17:46:07 +02:00
Önder Kalacı 6e4fbeb8b9
Merge pull request #2962 from citusdata/fix-2958
Correctly add schema when distributing sequence definitions
2019-09-13 17:26:51 +02:00
Jelte Fennema e4cfea3751 Correctly add schema when distributing sequence definitions
Fixes #2958
2019-09-13 17:19:35 +02:00
Jelte Fennema 579a40dfa5 Add make check-base-mx 2019-09-13 17:19:35 +02:00
Jelte Fennema 389086102a
Refactor 9 argument function to use a struct (#2952)
For another PR I needed to add another column, which would have required
adding another argument to an already 9-argument function signature. In
this case it would be a boolean flag, and there were already two boolean
flags in there. In my experience it becomes really easy to mess up the
order of these flags at that point, especially because the type system
doesn't distinguish between the three booleans with completely different
meanings.

So I refactored these signatures to receive a struct containing most of
these arguments. That way you can't mess up the ordering, because the
meaning of a boolean is not order-dependent but field-name-dependent. It
also makes it possible to set good shared defaults for this struct.
2019-09-13 15:49:53 +02:00
Önder Kalacı 48b7fbb9e5
Merge pull request #2968 from citusdata/insert_isolation_duplicate_test
Changed the duplicate test into missing test
2019-09-13 15:31:54 +02:00
Halil Ozan Akgul 4d34b79b87 There were two multi-insert/single-insert tests but no multi-insert/multi-insert test. Fixed it. 2019-09-13 16:09:11 +03:00
Nils Dijk 05f0668cdc
Fix: schema leak onto create index statement cache (#2964)
DESCRIPTION: Fix schema leak on CREATE INDEX statement

When a CREATE INDEX statement is cached between executions, we might leak the schema name from an earlier execution onto the cached statement, preventing the right index from being created.

Even though the cache is cleared when the search_path changes, we can trigger this behaviour by having the schema already on the search path before a colliding table is created in a schema earlier on the `search_path`. When calling an unqualified CREATE INDEX via a function (used to trigger the caching behaviour), we see that the index is created on the wrong table after the schema leaks onto the statement.

By copying the complete `PlannedStmt` and `utilityStmt` during our planning phase for distributed DDL, we make sure we are not leaking the schema name onto a cached data structure.

Caveat: COPY statements already do a lot of parse tree copying without directly putting the copy back on the `pstmt`. We should verify whether those copies modify the statement and potentially copy the complete `pstmt` there as well.
2019-09-13 14:04:23 +02:00
Hadi Moshayedi 1f84056b83
Merge pull request #2963 from citusdata/update_udfs
Return nodeid instead of record in some UDFs
2019-09-12 14:54:16 -07:00
Hadi Moshayedi 48ff4691a0 Return nodeid instead of record in some UDFs 2019-09-12 12:46:21 -07:00
Philip Dubé d23185d077
Merge pull request #2957 from citusdata/dont-distribute-aggregate-named-invalid
Begin searching AggregateNames from 1, not 0
2019-09-12 17:02:06 +00:00
Philip Dubé ae1171a373 Test invalid aggregate 2019-09-12 16:55:05 +00:00
Philip Dubé 2aa6852dea Begin searching AggregateNames from 1, not 0 2019-09-12 16:55:05 +00:00
Jelte Fennema d6deb062aa Add shard rebalancer stubs 2019-09-12 16:40:25 +02:00
Jelte Fennema 58012054c9 Add an extra advisory lock tag class 2019-09-12 16:40:25 +02:00
Jelte Fennema eb7e45d556 Make LookupNodeForGroup extern 2019-09-12 16:40:25 +02:00
Jelte Fennema 257406fda7 Fix ArrayObjectCount for zero sized arrays 2019-09-12 16:40:25 +02:00
Jelte Fennema de5174f763 include postgres.h into some of our .h files to silence warnings 2019-09-12 16:40:25 +02:00
Jelte Fennema ea2e010d42 Better editorconfig 2019-09-12 16:40:25 +02:00
Jelte Fennema 4ebdf5989b Add check-minimal to test Makefile 2019-09-12 16:40:25 +02:00
Önder Kalacı 07cca85227
Merge pull request #2938 from citusdata/local_execution_2
Introduce the concept of Local Execution
2019-09-12 12:18:43 +02:00
Onder Kalaci 0b0c779c77 Introduce the concept of Local Execution
/*
 * local_executor.c
 *
 * The scope of the local execution is locally executing the queries on the
 * shards. In other words, local execution does not deal with any local tables
 * that are not shards on the node on which the query is being executed. In
 * that sense, the local executor is only triggered if the node has both the
 * metadata and the shards (e.g., only Citus MX worker nodes).
 *
 * The goal of the local execution is to skip the unnecessary network round-trip
 * happening on the node itself. Instead, we identify the locally executable tasks
 * and simply call PostgreSQL's planner and executor.
 *
 * The local executor is an extension of the adaptive executor, so it uses the
 * adaptive executor's custom scan nodes.
 *
 * One thing to note is that Citus MX is only supported with replication
 * factor = 1, so keep that in mind while reading the comments below.
 *
 * At a high level, there are 3 slightly different ways of utilizing local execution:
 *
 * (1) Execution of local single-shard queries of a distributed table
 *
 *      This is the simplest case. The executor kicks in at the start of the
 *      adaptive executor, and since the query is only a single task the
 *      execution finishes without going to the network at all.
 *
 *      Even if there is a transaction block (or recursively planned CTEs), as
 *      long as the queries hit the shards on the same node, the local
 *      execution will kick in.
 *
 * (2) Execution of local single-shard queries and remote multi-shard queries
 *
 *      The rule is simple. If a transaction block starts with a local query
 *      execution, all the other queries in the same transaction block that
 *      touch any local shard have to use local execution. Although this sounds
 *      restrictive, we prefer to implement it this way; otherwise we'd end up
 *      with scenarios as complex as the ones we have in connection management
 *      due to foreign keys.
 *
 *      See the following example:
 *      BEGIN;
 *          -- assume that the query is executed locally
 *          SELECT count(*) FROM test WHERE key = 1;
 *
 *          -- at this point, all the shards that reside on the
 *          -- node are executed locally one-by-one. After those finish,
 *          -- the remaining tasks are handled by the adaptive executor
 *          SELECT count(*) FROM test;
 *
 *
 * (3) Modifications of reference tables
 *
 *      Modifications to reference tables have to be executed on all nodes. So,
 *      after the local execution, the adaptive executor continues the execution
 *      on the other nodes.
 *
 *      Note that for read-only queries, after the local execution, there is
 *      no need for the adaptive executor to kick in.
 *
 *  There are also a few limitations/trade-offs worth mentioning. First, the
 *  local execution on multiple shards might be slow because the execution has
 *  to happen one task at a time (e.g., no parallelism). Second, if a transaction
 *  block/CTE starts with a multi-shard command, we do not use local query
 *  execution, since local execution is sequential; basically, we do not want to
 *  lose parallelism across local tasks by switching to local execution. Third,
 *  the local execution currently only supports queries; in other words, any
 *  utility command like TRUNCATE fails if it is executed after a local execution
 *  inside a transaction block. Fourth, the local execution cannot be mixed with
 *  executors other than the adaptive executor, namely the task-tracker,
 *  real-time, and router executors. Finally, related to the previous item, the
 *  COPY command cannot be mixed with local execution in a transaction; the
 *  implication is that no part of INSERT..SELECT via the coordinator can happen
 *  via local execution.
 */
2019-09-12 11:51:25 +02:00
Marco Slot d69be38932
Merge pull request #2933 from citusdata/drop_poolinfo_fk
Drop foreign key from pg_dist_poolinfo to pg_dist_node
2019-09-12 11:50:05 +02:00
SaitTalhaNisanci e132d579f2
Change --new-bindir flag description to be consistent (#2950) 2019-09-11 15:36:39 +03:00
SaitTalhaNisanci 0f170cb75f
Use variables instead of hardcoded tmp dirs (#2944) 2019-09-11 13:25:18 +03:00
Jelte Fennema c591a135f1
Update ubuntu dependencies in CONTRIBUTING (#2941) 2019-09-11 09:49:43 +02:00
Önder Kalacı dd4e767702
Merge pull request #2942 from citusdata/fix_adaptive_bug
Make sure that lost connections are handled properly in adaptive executor
2019-09-10 18:01:17 +02:00
Onder Kalaci 485189c0b6 Make sure that lost connections are handled properly
Before this patch, when a connection is lost, we'd have the following
situation:

    - Pop a task execution from readyQueue
    - Lost connection
    - Fail the session/pool. -> This step was not acting properly
      because we had popped the task but had not yet assigned it to
      session->currentTask

After the patch:

    - Pop a task execution from readyQueue
    - Immediately set it to session->currentTask
    - Lost connection
    - Fail the session/pool. -> At this step, failing the
      session would trigger query failures (or failovers)
      properly.
2019-09-10 17:54:27 +02:00
SaitTalhaNisanci d99deab7d9
Add upgrade postgres version test (#2940)
* Add a script for creating a citus cluster

Creating a citus cluster is automated.
Before running this script:
- Citus should be installed and its control file should be added to postgres (make install).
- Postgres should be installed.

* Initialize upgrade test table and fill

* Finalize the layout of upgrade tests

The postgres upgrade function is added.
The newly added UDFs (citus_prepare_pg_upgrade, citus_finish_pg_upgrade) are
used to perform the upgrade.

* Refactor upgrade test and add config file

* Add schedules for upgrade testing

* Use pg_regress for upgrade tests

pg_regress is used for creating a simple distributed table in the
upgrade tests. After upgrading, another schedule is used to verify
that the distributed table still exists. Router and realtime queries
are used for verification.

* Run upgrade tests as a postgres user in a temp dir

The postgres user is used for psql so that tests run consistently.
A temp dir is created and its permissions are changed so that the
postgres user can access it. All psql commands are now run as the
postgres user.

"Select * from t" is changed to "Select * from t order by a"
so that the result is always in the same order.

* Add docopt and arguments for the upgrade script

The docopt dependency is added to parse flags in the script.
Some refactoring of variable names is done.

* Add readme for upgrade tests

* Refactor upgrade tests

Use a relative data path instead of an absolute one, assuming that this
script will always be run from 'src/test/regress'.
Remove the 'citus-path' flag.
Use a specific version for docopt instead of *.
Use named args in string formatting.

* Resolve a security problem

Instead of using string formatting in subprocess.call, an argument
list is used; otherwise users could perform shell injection.
shell=True is removed from the subprocess call, as its use is not
recommended.

* Add how the test works to readme

* Refactor some variables to be consistent

* Update upgrade script based on the reviews

It was possible that the postgres server would stay running even when the
script crashed; the atexit library is used to ensure that we always do a
teardown in which we stop the databases.

Some formatting is done in the code for better readability.

A Config class is used instead of a dictionary.

A target for upgrade test is added to makefile.

Unused flags/functions/variables are removed.

* Format commands and remove unnecessary flag from readme
2019-09-10 17:56:04 +03:00
Marco Slot 810aca8d41 Drop foreign key from pg_dist_poolinfo to pg_dist_node 2019-09-10 09:52:19 +02:00
Philip Dubé b4a1a0fb80
Merge pull request #2911 from citusdata/test_merge_files_and_query_more
Extend tests from release testing
2019-09-05 16:57:54 +00:00
Philip Dubé b301cf628a Test worker_cleanup_job_schema_cache actually drops schemas 2019-09-05 16:52:24 +00:00
Philip Dubé 8979fd038b worker_check_invalid_arguments: invalid task/job ids 2019-09-05 16:52:24 +00:00
Philip Dubé 5f9e88b260 multi_multiuser: test that worker_merge_files_and_query doesn't allow privilege escalation 2019-09-05 16:52:24 +00:00
Philip Dubé 60dc42a3ae
Merge pull request #2929 from citusdata/fix_pg12_distobject
get_catalog_object_by_oid requires an extra parameter in pg12
2019-09-05 16:46:04 +00:00
Philip Dubé a28b82d67d get_catalog_object_by_oid requires an extra parameter in pg12 2019-09-05 16:38:07 +00:00
Nils Dijk 511e715ee3
Remove early escape in walking pg_depend (#2930)
This is a bug that got in when we inlined the body of a function into this loop. Earlier revisions had two loops, hence a function that could be reused.

With a return instead of a continue, the list of dependencies being walked depends on the order in which we find them in pg_depend. This became apparent during pg12 compatibility work: the order of entries in pg12 was luckily different, causing a random test to fail due to this return.

By changing it to a continue, we only skip the entries that we don't want to follow instead of skipping all entries that happen to be found later.

Side fix: more stable isolation tests around ensuring dependencies.
2019-09-05 18:03:34 +02:00
Philip Dubé f90fb10b5f
Merge pull request #2879 from citusdata/pg12_generatedcolumns
Pg12 generated columns
2019-09-04 15:07:18 +00:00