citus

Commit Graph

Author	SHA1	Message	Date
Jelte Fennema	389086102a	Refactor 9 argument function to use a struct (#2952 ) For another PR I needed to add another column which would require to add another argument to an already 9 argument function signature. In this case it would be a boolean flag and there were already two boolean flags in there. In my experience it becomes really easy to mess up the order of these flags at that point. Especially because the type system doesn't distinguish between the 3 different booleans with completely different meanings. So I refactored these signatures to receive a struct containing most of these arguments. Like that you don't mess up orderening, because the meaning of the boolean is not order dependent but fieldname dependent. It also makes it possible to set good shared defaults for this struct.	2019-09-13 15:49:53 +02:00
Önder Kalacı	48b7fbb9e5	Merge pull request #2968 from citusdata/insert_isolation_duplicate_test Changed the duplicate test into missing test	2019-09-13 15:31:54 +02:00
Halil Ozan Akgul	4d34b79b87	There were two multi insert - single insert tests but no multi insert - multi insert test. Fixed it.	2019-09-13 16:09:11 +03:00
Nils Dijk	05f0668cdc	Fix: schema leak onto create index statement cache (#2964 ) DESCRIPTION: Fix schema leak on CREATE INDEX statement When a CREATE INDEX is cached between execution we might leak the schema name onto the cached statement of an earlier execution preventing the right index to be created. Even though the cache is cleared when the search_path changes we can trigger this behaviour by having the schema already on the search path before a colliding table is created in a schema earlier on the `search_path`. When calling an unqualified create index via a function (used to trigger the caching behaviour) we see that the index is created on the wrong table after the schema leaked onto the statement. By copying the complete `PlannedStmt` and `utilityStmt` during our planning phase for distributed ddls we make sure we are not leaking the schema name onto a cached data structure. Caveat; COPY statements already have a lot of parsestree copying ongoing without directly putting it back on the `pstmt`. We should verify that copies modify the statement and potentially copy the complete `pstmt` there already.	2019-09-13 14:04:23 +02:00
Hadi Moshayedi	1f84056b83	Merge pull request #2963 from citusdata/update_udfs Return nodeid instead of record in some UDFs	2019-09-12 14:54:16 -07:00
Hadi Moshayedi	48ff4691a0	Return nodeid instead of record in some UDFs	2019-09-12 12:46:21 -07:00
Philip Dubé	d23185d077	Merge pull request #2957 from citusdata/dont-distribute-aggregate-named-invalid Begin searching AggregateNames from 1, not 0	2019-09-12 17:02:06 +00:00
Philip Dubé	ae1171a373	Test invalid aggregate	2019-09-12 16:55:05 +00:00
Philip Dubé	2aa6852dea	Begin searching AggregateNames from 1, not 0	2019-09-12 16:55:05 +00:00
Jelte Fennema	d6deb062aa	Add shard rebalancer stubs	2019-09-12 16:40:25 +02:00
Jelte Fennema	58012054c9	Add an extra advisory lock tag class	2019-09-12 16:40:25 +02:00
Jelte Fennema	eb7e45d556	Make LookupNodeForGroup extern	2019-09-12 16:40:25 +02:00
Jelte Fennema	257406fda7	Fix ArrayObjectCount for zero sized arrays	2019-09-12 16:40:25 +02:00
Jelte Fennema	de5174f763	include postgres.h into some of our .h files to silence warnings	2019-09-12 16:40:25 +02:00
Jelte Fennema	ea2e010d42	Better editorconfig	2019-09-12 16:40:25 +02:00
Jelte Fennema	4ebdf5989b	Add check-minimal to test Makefile	2019-09-12 16:40:25 +02:00
Önder Kalacı	07cca85227	Merge pull request #2938 from citusdata/local_execution_2 Introduce the concept of Local Execution	2019-09-12 12:18:43 +02:00
Onder Kalaci	0b0c779c77	Introduce the concept of Local Execution /* * local_executor.c * * The scope of the local execution is locally executing the queries on the * shards. In other words, local execution does not deal with any local tables * that are not shards on the node that the query is being executed. In that sense, * the local executor is only triggered if the node has both the metadata and the * shards (e.g., only Citus MX worker nodes). * * The goal of the local execution is to skip the unnecessary network round-trip * happening on the node itself. Instead, identify the locally executable tasks and * simply call PostgreSQL's planner and executor. * * The local executor is an extension of the adaptive executor. So, the executor uses * adaptive executor's custom scan nodes. * * One thing to note that Citus MX is only supported with replication factor = 1, so * keep that in mind while continuing the comments below. * * On the high level, there are 3 slightly different ways of utilizing local execution: * * (1) Execution of local single shard queries of a distributed table * * This is the simplest case. The executor kicks at the start of the adaptive * executor, and since the query is only a single task the execution finishes * without going to the network at all. * * Even if there is a transaction block (or recursively planned CTEs), as long * as the queries hit the shards on the same, the local execution will kick in. * * (2) Execution of local single queries and remote multi-shard queries * * The rule is simple. If a transaction block starts with a local query execution, * all the other queries in the same transaction block that touch any local shard * have to use the local execution. Although this sounds restrictive, we prefer to * implement in this way, otherwise we'd end-up with as complex scenarious as we * have in the connection managements due to foreign keys. * * See the following example: * BEGIN; * -- assume that the query is executed locally * SELECT count() FROM test WHERE key = 1; * -- at this point, all the shards that reside on the * -- node is executed locally one-by-one. After those finishes * -- the remaining tasks are handled by adaptive executor * SELECT count() FROM test; * * (3) Modifications of reference tables * * Modifications to reference tables have to be executed on all nodes. So, after the * local execution, the adaptive executor keeps continuing the execution on the other * nodes. * * Note that for read-only queries, after the local execution, there is no need to * kick in adaptive executor. * * There are also few limitations/trade-offs that is worth mentioning. First, the * local execution on multiple shards might be slow because the execution has to * happen one task at a time (e.g., no parallelism). Second, if a transaction * block/CTE starts with a multi-shard command, we do not use local query execution * since local execution is sequential. Basically, we do not want to lose parallelism * across local tasks by switching to local execution. Third, the local execution * currently only supports queries. In other words, any utility commands like TRUNCATE, * fails if the command is executed after a local execution inside a transaction block. * Forth, the local execution cannot be mixed with the executors other than adaptive, * namely task-tracker, real-time and router executors. Finally, related with the * previous item, COPY command cannot be mixed with local execution in a transaction. * The implication of that any part of INSERT..SELECT via coordinator cannot happen * via the local execution. */	2019-09-12 11:51:25 +02:00
Marco Slot	d69be38932	Merge pull request #2933 from citusdata/drop_poolinfo_fk Drop foreign key from pg_dist_poolinfo to pg_dist_node	2019-09-12 11:50:05 +02:00
SaitTalhaNisanci	e132d579f2	Change --new-bindir flag description to be consistent (#2950 )	2019-09-11 15:36:39 +03:00
SaitTalhaNisanci	0f170cb75f	Use variables instead of hardcoded tmp dirs (#2944 )	2019-09-11 13:25:18 +03:00
Jelte Fennema	c591a135f1	Update ubuntu dependencies in CONTRIBUTING (#2941 )	2019-09-11 09:49:43 +02:00
Önder Kalacı	dd4e767702	Merge pull request #2942 from citusdata/fix_adaptive_bug Make sure that lost connections are handled properly in adaptive executor	2019-09-10 18:01:17 +02:00
Onder Kalaci	485189c0b6	Make sure that lost connections are handled properly Before this patch, when a connection is lost, we'd have the following situation: - Pop a task execution from readyQueue - Lost connection - Fail the session/pool. -> This step was not acting properly because we've popped the task, but not set to session->currentTask yet After the patch: - Pop a task execution from readyQueue - Immediately set it to session->currentTask - Lost connection - Fail the session/pool. -> At this step, failing the session would trigger query failures (or failovers) properly.	2019-09-10 17:54:27 +02:00
SaitTalhaNisanci	d99deab7d9	Add upgrade postgres version test (#2940 ) * Add creating a citus cluster script Creating a citus cluster is automated. Before running this script: - Citus should be installed and its control file should be added to postgres. (make install) - Postgres should be installed. * Initialize upgrade test table and fill * Finalize the layout of upgrade tests Postgres upgrade function is added. The newly added UDFs(citus_prepare_pg_upgrade, citus_finish_pg_upgrade) are used to perform upgrade. * Refactor upgrade test and add config file * Add schedules for upgrade testing * Use pg_regress for upgrade tests pg_regress is used for creating a simple distributed table in upgrade tests. After upgrading another schedule is used to verify that the distributed table exists. Router and realtime queries are used for verifying. * Run upgrade tests as a postgres user in a temp dir postgres user is used for psql to be consistent at running tests. A temp dir is created and the temp dir's permissions are changed so that postgres user can access it. All psql commands are now run with postgres user. "Select * from t" query is changed as "Select * from t order by a" so that the result is always in the same order. * Add docopt and arguments for the upgrade script Docopt dependency is added to parse flags in script. Some refactoring in variable names is done. * Add readme for upgrade tests * Refactor upgrade tests Use relative data path instead of absolute assuming that this script will always be run from 'src/test/regress' Remove 'citus-path' flag Use specific version for docopt instead of * Use named args in string formatting * Resolve a security problem Instead of using string formatting in subprocess.call, arguments list is used. Otherwise users could do shell injection. Shell = True is removed from subprocess call as it is not recommended to use this. * Add how the test works to readme * Refactor some variables to be consistent * Update upgrade script based on the reviews It was possible that postgres server would stay running even when the script crashes, atexit library is used to ensure that we always do a teardown where we stop the databases. Some formatting is done in the code for better readability. Config class is used instead of a dictonary. A target for upgrade test is added to makefile. Unused flags/functions/variables are removed. * Format commands and remove unnecessary flag from readme	2019-09-10 17:56:04 +03:00
Marco Slot	810aca8d41	Drop foreign key from pg_dist_poolinfo to pg_dist_node	2019-09-10 09:52:19 +02:00
Philip Dubé	b4a1a0fb80	Merge pull request #2911 from citusdata/test_merge_files_and_query_more Extend tests from release testing	2019-09-05 16:57:54 +00:00
Philip Dubé	b301cf628a	Test worker_cleanup_job_schema_cache actually drops schemas	2019-09-05 16:52:24 +00:00
Philip Dubé	8979fd038b	worker_check_invalid_arguments: invalid task/job ids	2019-09-05 16:52:24 +00:00
Philip Dubé	5f9e88b260	multi_multiuser: test that worker_merge_files_and_query doesn't allow privilege escalation	2019-09-05 16:52:24 +00:00
Philip Dubé	60dc42a3ae	Merge pull request #2929 from citusdata/fix_pg12_distobject get_catalog_object_by_oid requires an extra parameter in pg12	2019-09-05 16:46:04 +00:00
Philip Dubé	a28b82d67d	get_catalog_object_by_oid requires an extra parameter in pg12	2019-09-05 16:38:07 +00:00
Nils Dijk	511e715ee3	Remove early escape in walking pg_depend (#2930 ) This is a bug that got in when we inlined the body of a function into this loop. Earlier revisions had two loops, hence a function that would be reused. With a return instead of a continue the list of dependencies being walked is dependent on the order in which we find them in pg_depend. This became apparent during pg12 compatibility. The order of entries in pg12 was luckily different causing a random test to fail due to this return. By changing it to a continue we only skip the entries that we don’t want to follow instead of skipping all entries that happen to be found later. sidefix for more stable isolation tests around ensure dependency	2019-09-05 18:03:34 +02:00
Philip Dubé	f90fb10b5f	Merge pull request #2879 from citusdata/pg12_generatedcolumns Pg12 generated columns	2019-09-04 15:07:18 +00:00
Philip Dubé	bdd30bb181	Don't allow distributing by a generated column	2019-09-04 14:50:17 +00:00
Philip Dubé	41dca121e2	Support GENERATE ALWAYS AS STORED	2019-09-04 14:50:17 +00:00
Nils Dijk	936d546a3c	Refactor Ensure Schema Exists to Ensure Dependecies Exists (#2882 ) DESCRIPTION: Refactor ensure schema exists to dependency exists Historically we only supported schema's as table dependencies to be created on the workers before a table gets distributed. This PR puts infrastructure in place to walk pg_depend to figure out which dependencies to create on the workers. Currently only schema's are supported as objects to create before creating a table. We also keep track of dependencies that have been created in the cluster. When we add a new node to the cluster we use this catalog to know which objects need to be created on the worker. Side effect of knowing which objects are already distributed is that we don't have debug messages anymore when creating schema's that are already created on the workers.	2019-09-04 14:10:20 +02:00
Philip Dubé	bc97523940	Merge pull request #2925 from citusdata/remove_check_for_updates Remove CheckForUpdates	2019-09-03 21:28:17 +00:00
Philip Dubé	28d964240f	Remove CheckForUpdates https://reports.citusdata.com/v1/releases/latest We haven't updated the version CheckForUpdates sees since 7.1.0	2019-09-03 21:11:25 +00:00
Philip Dubé	077f5e26af	Merge pull request #2926 from citusdata/normalize_all_the_tests Normalize all tests	2019-09-03 21:10:40 +00:00
Philip Dubé	4d26829d50	Remove normalized_tests.lst, don't normalize check-vanilla	2019-09-03 17:25:00 +00:00
Philip Dubé	169d2f193f	Merge pull request #2914 from citusdata/propagate_column_collate create_distributed_table: include COLLATE on columns	2019-08-29 14:31:21 +00:00
Philip Dubé	da00c62eea	create_distributed_table: include COLLATE on columns	2019-08-29 14:22:54 +00:00
Philip Dubé	dd57232ba3	Merge pull request #2912 from citusdata/MaxBackends_max_wal_senders Update TotalProcCount to match update in InitializeMaxBackends in pg12	2019-08-29 14:16:38 +00:00
Philip Dubé	32ef459025	backend_data.c: include max_wal_senders in calculating maxBackend, matches changes in pg12's InitializeMaxBackends	2019-08-28 21:24:33 +00:00
Jelte Fennema	cbecf97c84	Move tuplestore setup to a helper function (#2898 ) * Add tuplestore helpers * More detailed error messages in tuplestore * Add CreateTupleDescCopy to SetupTuplestore * Use new SetupTuplestore helper function * Remove unnecessary copy * Remove comment about undefined behaviour	2019-08-27 09:11:08 +02:00
Philip Dubé	b354644c56	Merge pull request #2908 from citusdata/sort_colocatedshardintervallist Sort ColocatedShardIntervalList	2019-08-26 17:53:47 +00:00
Philip Dubé	eba3828ef7	ColocatedShardIntervalList: sort	2019-08-26 17:42:41 +00:00
Philip Dubé	c1587cc00a	Merge pull request #2906 from citusdata/add-rls-SET-LOCAL-GUC-test Test SET LOCAL propagation when GUC is used in RLS policy	2019-08-22 20:36:05 +00:00
Matthias Kurz	fc069dc611	Test SET LOCAL propagation when GUC is used in RLS policy	2019-08-22 20:29:52 +00:00

1 2 3 4 5 ...

2726 Commits (5f23b951c7acdc9d520b86dba18f223c61978b81) All Branches Search

2726 Commits (5f23b951c7acdc9d520b86dba18f223c61978b81)

All Branches