citus

Commit Graph

Author	SHA1	Message	Date
Onder Kalaci	629ecc3dee	Add the infrastructure to count the number of client backends Considering the adaptive connection management improvements that we plan to roll soon, it makes it very helpful to know the number of active client backends. We are doing this addition to simplify yhe adaptive connection management for single node Citus. In single node Citus, both the client backends and Citus parallel queries would compete to get slots on Postgres' `max_connections` on the same Citus database. With adaptive connection management, we have the counters for Citus parallel queries. That helps us to adaptively decide on the remote executions pool size (e.g., throttle connections if necessary). However, we do not have any counters for the total number of client backends on the database. For single node Citus, we should consider all the client backends, not only the remote connections that Citus does. Of course Postgres internally knows how many client backends are active. However, to get that number Postgres iterates over all the backends. For examaple, see [pg_stat_get_db_numbackends](`8e90ec5580/src/backend/utils/adt/pgstatfuncs.c (L1240)`) where Postgres iterates over all the backends. For our purpuses, we need this information on every connection establishment. That's why we cannot affort to do this kind of iterattion.	2020-11-25 19:19:24 +01:00
Ahmet Gedemenli	a64dc8a72b	Fixes a bug preventing INSERT SELECT .. ON CONFLICT with a constraint name on local shards Separate search relation shard function Add tests	2020-11-25 15:10:46 +03:00
Onder Kalaci	7accbff3f6	Do not cache all the distributed table metadata during CitusTableTypeIdList() CitusTableTypeIdList() function iterates on all the entries of pg_dist_partition and loads all the metadata in to the cache. This can be quite memory intensive especially when there are lots of distributed tables. When partitioned tables are used, it is common to have many distributed tables given that each partition also becomes a distributed table. CitusTableTypeIdList() is used on every CREATE TABLE .. PARTITION OF.. command as well. It means that, anytime a partition is created, Citus loads all the metadata to the cache. Note that Citus typically only loads the accessed table's metadata to the cache.	2020-11-24 17:44:06 +01:00
Önder Kalacı	c760cd3470	Move local execution after remote execution (#4301 ) * Move local execution after the remote execution Before this commit, when both local and remote tasks exist, the executor was starting the execution with local execution. There is no strict requirements on this. Especially considering the adaptive connection management improvements that we plan to roll soon, moving the local execution after to the remote execution makes more sense. The adaptive connection management for single node Citus would look roughly as follows: - Try to connect back to the coordinator for running parallel queries. - If succeeds, go on and execute tasks in parallel - If fails, fallback to the local execution So, we'll use local execution as a fallback mechanism. And, moving it after to the remote execution allows us to implement such further scenarios.	2020-11-24 13:43:38 +01:00
Hadi Moshayedi	40b52ab757	Fix memory leaks in column store	2020-11-23 11:26:12 -08:00
Jeff Davis	ba6ec610e2	address review comment	2020-11-20 10:03:12 -08:00
Jeff Davis	8cee2b092b	remove columnar FDW code	2020-11-20 10:03:12 -08:00
Onder Kalaci	c433c66f2b	Do not execute subplans multiple times with cursors Before this commit, we let AdaptiveExecutorPreExecutorRun() to be effective multiple times on every FETCH on cursors. That does not affect the correctness of the query results, but adds significant overhead.	2020-11-20 10:43:56 +01:00
Hadi Moshayedi	b182a95389	Fix ALTER COLUMN ... SET TYPE for columnar	2020-11-19 15:36:45 -08:00
Jeff Davis	cef1d0e915	fixup test output	2020-11-19 12:45:52 -08:00
Jeff Davis	91015deb9d	rename UDFs also	2020-11-19 12:27:40 -08:00
Jeff Davis	a2b698a766	rename cstore_tableam -> columnar	2020-11-19 12:15:51 -08:00
SaitTalhaNisanci	9c44911226	Improve error messages in shard pruning (#4324 )	2020-11-18 17:16:06 +03:00
Hadi Moshayedi	2747fd80ff	Add prepared materialized view tests for columnar	2020-11-17 20:13:20 -08:00
Hadi Moshayedi	6711340ea6	Add prepared xact & stmt tests for columnar	2020-11-17 20:00:57 -08:00
Hadi Moshayedi	97cba2d5b6	Implements write state management for tuple inserts. TableAM API doesn't allow us to pass around a state variable along all of the tuple inserts belonging to the same command. We require this in columnar store, since we batch them, and when we have enough rows we flush them as stripes. To do that, we keep a (relfilenode) -> stack of (subxact id, TableWriteState) global mapping. Inserts Whenever we want to insert a tuple, we look up for the relation's relfilenode in this mapping. If top of the stack matches current subtransaction, we us the existing TableWriteState. Otherwise, we allocate a new TableWriteState and push it on top of stack. (Sub)Transaction Commit/Aborts When the subtransaction or transaction is committed, we flush and pop all entries matching current SubTransactionId. When the subtransaction or transaction is committed, we pop all entries matching current SubTransactionId and discard them without flushing. Reads Since we might have unwritten rows which needs to be read by a table scan, we flush write states on SELECTs. Since flushing the write state of upper transactions in a subtransaction will cause metadata being written in wrong subtransaction, we ERROR out if any of the upper subtransactions have unflushed rows. Table Drops We record in which subtransaction the table was dropped. When committing a subtransaction in which table was dropped, we propagate the drop to upper transaction. When aborting a subtransaction in which table was dropped, we mark table as not deleted.	2020-11-17 12:07:16 -08:00
Nils Dijk	725f4a37d0	change configure to not have options	2020-11-17 19:01:54 +01:00
Nils Dijk	22df8027b0	add extra output for multi_extension targeting pg11	2020-11-17 19:01:54 +01:00
Nils Dijk	7c891a01a9	create missing objects during upgrade path	2020-11-17 19:01:51 +01:00
Nils Dijk	2987535172	add pg upgrade tests verifying table am is created	2020-11-17 18:55:36 +01:00
Nils Dijk	d065bb495d	Prepare downgrade script and bump development version to 10.0-1	2020-11-17 18:55:35 +01:00
Nils Dijk	b6d4a1bbe2	fix style	2020-11-17 18:55:35 +01:00
Nils Dijk	3bb6554976	make tests run	2020-11-17 18:55:35 +01:00
Nils Dijk	f89bd3eeb5	move columnar test files	2020-11-17 18:55:34 +01:00
SaitTalhaNisanci	34de1f645c	Update failure test dependencies (#4284 ) * Update failure test dependencies There was a security alert for cryptography. The vulnerability was fixed in 3.2.0. The vulnebarility: "RSA decryption was vulnerable to Bleichenbacher timing vulnerabilities, which would impact people using RSA decryption in online scenarios." The fix: `58494b41d6` It wasn't enough to only update crpytography because mitm was incompatible with the new version, so mitm is also upgraded. The steps to do in local: python -m pip install -U cryptography python -m pip install -U mitmproxy	2020-11-17 19:16:08 +03:00
Onur Tirtir	5e3dc9d707	Bump citus version to 10.0devel	2020-11-09 13:16:54 +03:00
Onur Tirtir	5d5966f700	Fix a flaky test in mixed_relkind_tests (#4300 )	2020-11-06 14:53:30 +03:00
Onder Kalaci	e0d2ac7620	Do not rely on set_rel_pathlist_hook for finding local relations When a relation is used on an OUTER JOIN with FALSE filters, set_rel_pathlist_hook may not be called for the table. There might be other cases as well, so do not rely on the hook for classification of the tables.	2020-11-06 11:14:30 +01:00
Onur Tirtir	0556952607	Normalize partitioned table aliases in explain output (#4295 ) Aliases that postgres choose for partitioned tables in explain output might change in different pg versions, so normalize them and remove the alternative test output	2020-11-06 10:44:01 +03:00
Onur Tirtir	d912d4bc38	Print full file path in valgrind testing (#4299 )	2020-11-06 10:26:53 +03:00
Onur Tirtir	cc8be422ce	Fix relkind checks in planner for relkinds other than RELKIND_RELATION (#4294 ) We were qualifying relations with relkind != RELKIND_RELATION as non-relations due to the strict checks around RangeTblEntry->relkind in planner.	2020-11-05 14:21:02 +03:00
Hanefi Önaldı	d6f19e2298	Honor error message conventions	2020-11-03 18:11:18 +03:00
Hanefi Önaldı	85a4b61a0e	Prevent undistribute_table calls for partitions	2020-11-03 18:10:20 +03:00
Hanefi Önaldı	5db380f33a	Prevent undistribute_table calls for foreign tables	2020-11-03 17:33:29 +03:00
Halil Ozan Akgul	77b3be8b6d	Turn RelOptInfos to only used field of them, relids, to be able to copy	2020-10-22 13:42:28 +03:00
Onur Tirtir	790beea59f	Add intermediate result tests with unsupported outer joins (#4262 )	2020-10-20 12:11:18 +03:00
SaitTalhaNisanci	0f209377c4	Fix incorrect join related fields (#4242 ) * Fix incorrect join related fields Ruleutils expect to give the original index of join columns hence we should consider the dropped columns while setting the fields in SetJoinRelatedFieldsCompat. * add some more tests for joins * Move tests to join.sql and create a utility function	2020-10-19 18:28:39 +03:00
Onur Tirtir	c49077d594	Disallow outer joins `ON TRUE` with ref & dist tables when ref table is outer relation (#4255 ) Disallow `ON TRUE` outer joins with reference & distributed tables when reference table is outer relation by fixing the logic bug made when calling `LeftListIsSubset` function. Also, be more defensive when removing duplicate join restrictions when join clause is empty for non-inner joins as they might still contain useful information for non-inner joins.	2020-10-19 16:58:11 +03:00
Onder Kalaci	bbedfca761	Improve the relation restriction counters It seems like Postgres could call set_rel_pathlist() for the same relation multiple times. This breaks the logic where we assume relationCount eqauls to the number of entries in relationRestrictionList. In summary, relationRestrictionList may contain duplicate entries.	2020-10-19 08:51:16 +02:00
Hadi Moshayedi	663549db33	Set explicit transfer_mode in tableam tests	2020-10-16 12:40:37 -07:00
Nils Dijk	caabbf4b84	Table access method support for distributed tables	2020-10-16 12:02:25 -07:00
Marco Slot	8976f245ab	Support reference table view in reference table modification	2020-10-16 11:31:24 +02:00
Onder Kalaci	596f7bf4a9	Add more regression test for single node Citus Tests on commands with SCHEMA.	2020-10-15 17:32:32 +02:00
Onder Kalaci	fe3caf3bc8	Local execution considers intermediate result size limit With this commit, we make sure that local execution adds the intermediate result size as the distributed execution adds. Plus, it enforces the citus.max_intermediate_result_size value.	2020-10-15 17:18:55 +02:00
Marco Slot	31858c8a29	Check table existence in EnsureRelationKindSupported	2020-10-15 17:05:06 +02:00
Onder Kalaci	15e724c073	Add regression tests for outer/cross JOINs	2020-10-14 15:17:30 +02:00
Onder Kalaci	de33079065	Improve outer join checks Before this commit, the logic was: - As long as the outer side of the JOIN is not a JOIN (e.g., relation or subquery etc.), we check for the existence of any recurring tuples. There were two implications of this decision. First, even if a subquery which is on the outer side contains distributed table JOIN reference table, Citus would unnecessarily throw an error. Note that, the JOIN inside the subquery would already be going to be tested recursively. But, as long as that check passes, there is no reason for the upper JOIN to fail. An example, which used to fail and now works: SELECT * FROM (SELECT * FROM dist JOIN ref) as foo LEFT JOIN dist; Second, certain JOINs, especially with ON (true) conditions were not represented as Citus expects the JOINs to be in the format DeferredErrorIfUnsupportedRecurringTuplesJoin().	2020-10-14 15:17:30 +02:00
Onur Tirtir	1a28858c47	Disallow field indirection in INSERT/UPDATE queries (#4241 )	2020-10-14 14:11:59 +03:00
Onur Tirtir	8efca3b60a	Fix a crash with inserting domain composite types in coord. evaluation (#4231 ) Use short lived per-tuple context in citus_evaluate_expr like (pg) evaluate_expr does. We should not use planState->ExprContext when evaluating expressions as it might lead to freeing the same executor twice (first one happens in citus_evaluate_expr itself and the other one happens when postgres doing clean-up for the top level executor state), which in turn might cause seg.faults. However, now as we don't have necessary planState info to evaluate prepared statements, we also add planState->es_param_list_info to per-tuple ExprContext.	2020-10-13 14:19:59 +03:00
Halil Ozan Akgul	e2736c25bd	Adds support for WITH TIES option	2020-10-12 19:34:18 +03:00
Sait Talha Nisanci	dc40758355	Return early if there is no citus table in VACUUM	2020-10-09 11:10:00 +03:00
Sait Talha Nisanci	99bb79745a	Commit transaction for VACUUM on shell table With postgres 13, there is a global lock that prevents multiple VACUUMs happening in the current database. This global lock is taken for a short time but this creates a problem because of the following: - We execute the VACUUM for the shell table through the standard process utility. In this step the global lock is taken for the current database. - If the current node has shard placements then it tries to execute VACUUM over a connection to localhost with ExecuteUtilityTaskList. - the VACUUM on shard placements cannot proceed because it is waiting for the global lock for the current database to be released. - The acquired lock from the VACUUM for shell table will not be released until the transaction is committed. - So there is a deadlock. As a solution, we commit the current transaction in case of VACUUM after the VACUUM is executed for the shell table. Executing the VACUUM on a shell table is not important because the data there will probably be truncated. PostprocessVacuumStmt takes the necessary locks on the shell table so we don't need to take any extra locks after we commit the current transaction.	2020-10-09 10:57:44 +03:00
Marco Slot	881e5df780	Fix a bug that could lead to multiple maintenance daemons	2020-10-08 16:18:14 +02:00
Marco Slot	18219843d0	Add maintenance daemon error tests	2020-10-08 16:17:33 +02:00
Marco Slot	dbc348b7e0	Create sequence dependency during metadata syncing	2020-10-06 10:57:39 +02:00
Marco Slot	9bba8bb4e8	Remove master_drop_sequences	2020-10-06 10:57:33 +02:00
Sait Talha Nisanci	078dcae18c	Write settings to postgres configuration file directly In our test structure, we have been passing postgres configurations from the terminal, which causes problems after it hits to a certain length hence it cannot start the server and understanding why it failed is not easy because there isn't a nice error message. This commit changes this to write the settings directly to the postgres configuration file. This way we can add as many postgres settings as we want to without needing to worry about the length problem.	2020-10-05 22:09:08 +03:00
Onur Tirtir	2cd0a69dfb	Fix multi-row & router INSERT crash with local exec. when def. cols not specified (#4197 ) Multi-row & router INSERT's were crashing with local execution if at least one of the DEFAULT columns were not specified in VALUES list. This was because, the changes we make on query->values_lists and query->targetList was sufficient for deparsing given INSERT for remote execution but not sufficient for local execution. With this commit, DEFAULT value normalization for multi-row & router INSERT's is fixed by adding dummy column references for unspecified DEFAULT columns.	2020-10-05 10:45:17 +03:00
Hanefi Önaldı	6d8e83d24f	Replace worker_hash calls with partkey IS NOT NULL filters	2020-10-02 18:16:24 +03:00
Önder Kalacı	df5aa0f0cc	Switch to sequential execution if the index name is long (#4209 ) Citus has the logic to truncate the long shard names to prevent various issues, including self-deadlocks. However, for partitioned tables, when index is created on the parent table, the index names on the partitions are auto-generated by Postgres. We use the same Postgres function to generate the index names on the shards of the partitions. If the length exceeds the limit, we switch to sequential execution mode.	2020-10-02 13:39:34 +03:00
Ahmet Gedemenli	70e9edb4f2	Add subplan test with insert	2020-10-01 13:58:55 +03:00
Jelte Fennema	13ef8252e7	Add broken distributed subplan test	2020-10-01 13:52:42 +03:00
Ahmet Gedemenli	3357eea46b	Add regression tests for PG13 WAL	2020-10-01 13:52:42 +03:00
Hanefi Önaldı	9ec85f1283	Remove some pgoptions to prevent hitting bash command character limits	2020-09-30 15:04:40 +03:00
Hanefi Önaldı	b0a2c1ee5c	Disallow volatile functions on single shard update queries We currently do not support volatile functions in update/delete statements because the function evaluation logic does not know how to distinguish volatile functions (that need to be evaluated per row) from stable functions (that need to be evaluated per query), and it is also not safe to push the volatile functions down on replicated tables.	2020-09-29 15:40:21 +03:00
Marco Slot	b905c8043d	Fix create index concurrently crash with local execution	2020-09-25 11:49:09 +02:00
Ahmet Gedemenli	abfb79bda6	Sort explain analyze output by task time Add sort method parameter for regression tests Fix check-style Change sorting method parameters to enum Polish Add task fields to OutTask Add test into multi_explain Fix isolation test	2020-09-24 11:38:40 +03:00
Onur Tirtir	64d5ac6a10	Do not downgrade if a citus local table exists (#4174 ) As the previous versions of Citus don't know how to handle citus local tables, we should prevent downgrading from 9.5 to older versions if any citus local tables exists.	2020-09-22 14:19:50 +03:00
Onder Kalaci	5d017cd123	Improve node matedata when coordinator is added Coordinator should always be always active, hasmetadata and metadasynced. Prevent changing those fields.	2020-09-21 14:53:41 +02:00
Onder Kalaci	6fc1dea85c	Improve the robustness of function call delegation Pushing down the CALLs to the node that the CALL is executed is dangerous and could lead to infinite recursion. When the coordinator added as worker, Citus was by chance preventing this. The coordinator was marked as "not metadatasynced" node in pg_dist_node, which prevented CALL/function delegation to happen. With this commit, we do the following: - Fix metadatasynced column for the coordinator on pg_dist_node - Prevent pushdown of function/procedure to the same node that the function/procedure is being executed. Today, we do not sync pg_dist_object (e.g., distributed functions metadata) to the worker nodes. But, even if we do it now, the function call delegation would prevent the infinite recursion.	2020-09-21 14:53:30 +02:00
Onur Tirtir	1b31b22635	Refactor the functions that return OID lists for citus tables	2020-09-18 16:42:46 +03:00
SaitTalhaNisanci	dae2c69fd7	Not allow removing a single node with ref tables (#4127 ) * Not allow removing a single node with ref tables We should not allow removing a node if it is the only node in the cluster and there is a data on it. We have this check for distributed tables but we didn't have it for reference tables. * Update src/test/regress/expected/single_node.out Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> * Update src/test/regress/sql/single_node.sql Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2020-09-18 15:35:59 +03:00
Ahmet Gedemenli	1cf11b4632	Shorten insert_select_connection_leak_test	2020-09-18 10:07:15 +03:00
Önder Kalacı	8d3f353746	Add more tests for single node citus - distributetd tables (#4166 )	2020-09-17 17:50:35 +02:00
Marco Slot	c9d46c618b	Fix EXPLAIN ANALYZE truncation	2020-09-17 14:42:21 +02:00
Onur Tirtir	d81559b7f8	Use "table" instead of "reference table" in sequential truncate log (#4164 ) We might get this debug message for citus local tables as well	2020-09-17 14:37:36 +03:00
Onur Tirtir	4118560b75	Prevent citus local table creation from a catalog table (#4158 )	2020-09-15 14:30:48 +03:00
Önder Kalacı	e7079d1384	Add orderbys to some tests (#4162 )	2020-09-14 16:59:22 +02:00
Marco Slot	b82f6ee163	Add tests for distributing catalog tables	2020-09-10 04:46:11 +02:00
Marco Slot	bd12555b16	Fix distributing tables owned by extensions	2020-09-10 04:46:11 +02:00
Onur Tirtir	9a56c22917	Add udf tests with citus local tables (#4154 )	2020-09-11 12:36:53 +03:00
Onur Tirtir	3a73fba810	Apply planner changes for citus local tables	2020-09-09 11:51:18 +03:00
Onur Tirtir	0b1cc118a9	Adapt other cache entry changes for citus local tables	2020-09-09 11:50:55 +03:00
Onur Tirtir	a58a4395ab	Extend citus local table utility command support This commit brings following features: Foreign key support from citus local tables to reference tables * Foreign key support from reference tables to citus local tables (only with RESTRICT & NO ACTION behavior) * ALTER TABLE ENABLE/DISABLE trigger command support * CREATE/DROP/ALTER trigger command support and disallows: * ALTER TABLE ATTACH/DETACH PARTITION commands * CREATE TABLE <postgres table> ATTACH PARTITION <citus local table> commands * Foreign keys from postgres tables to citus local tables (the other way was already disallowed) for citus local tables.	2020-09-09 11:50:55 +03:00
Onur Tirtir	17cc810372	Implement "citus local table" creation logic	2020-09-09 11:50:48 +03:00
Onur Tirtir	ba208eae4d	Record non-distributed table accesses in local executor (#4139 )	2020-09-07 18:19:08 +03:00
Hanefi Önaldı	024d398cd7	Allow distribution of functions that read from reference tables create_distributed_function(function_name, distribution_arg_name, colocate_with text) This UDF did not allow colocate_with parameters when there were no disttribution_arg_name supplied. This commit changes the behaviour to allow missing distribution_arg_name parameters when the function should be colocated with a reference table.	2020-09-01 07:28:34 +03:00
Önder Kalacı	983206c5e1	Hide `citus.subquery_pushdown` flag and NOTICE when enabled (#4124 ) * Hide citus.subquery_pushdown flag This flag is dangerous and could likely to let queries return wrong results. The flag has a very specific purpose for a very specific data distribution and query structure. In those cases, when the flag is set, the user can skip recursive planning altogether at their own risk. The meaning of the flag is that "I know what I'm doing such that the query structure/data distribution is on my control, so Citus can skip many correctness checks". For regular users, enabling this flag is discouraged. We have to keep the support only for backward compatibility for some users. In addition to that, give a NOTICE to discourage new users to use it.	2020-08-28 14:53:09 +02:00
SaitTalhaNisanci	2459ba6eca	Update docker images (#4122 ) * Update and separate test images The build image was a single one and it would contain pg11, pg12 and pg13. Now it is separated so that we can build each pg major independently. Tags are used as full postgres versions so that we can know which version we use by looking at the tag. For example exttester:11.9 would mean we are using pg11.9. pg11 is updated from 11.5 to 11.9. pg12 is updated from 12rc to 12.4. * Ignore memory usage in pg13 explain * Use citus instead of personal repo	2020-08-26 16:23:59 +03:00
SaitTalhaNisanci	20c39fae9a	Loosen the requirement to pushdown a subquery with ref tables (#4110 ) AllTargetExpressionsAreColumnReferences would return false if a query had an entry that is referencing the outer query. It seems safe to not have this for non-distributed tables, such as reference tables. We already have separate checks for other cases such as having limits.	2020-08-14 12:11:15 +03:00
Hadi Moshayedi	7b74eca22d	Support EXPLAIN EXECUTE ANALYZE.	2020-08-10 13:44:30 -07:00
Philip Dubé	212ae7163f	Fix non deterministic collation test to work with ancient libicu versions CentOS 7's libicu is too old for und-u-ks-level2 @colStrength=secondary works with both older & newer versions of libicu	2020-08-07 12:34:32 +00:00
Halil Ozan Akgul	375310b7f1	Adds support for table undistribution	2020-08-05 14:36:03 +03:00
Sait Talha Nisanci	fe4ac51d8c	Normalize Output:.. since it changes with pg13 Fix indentation for better readability	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	d68bfc5687	Improve error for index operator class parameters The error message when index has opclassopts is improved and the commit from postgres side is also included for future reference. Also some minor style related changes are applied.	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	288aa58603	add alternative out for pg13 test	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	d0b0c88920	Changelog: error out if index has opclassopts Error out if index has opclassopts. Changelog entry on PG13: Allow CREATE INDEX to specify the GiST signature length and maximum number of integer ranges (Nikita Glukhov)	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	f7a1971361	Changelog: Alter type options It seems that we don't support propagating commands related to base types. Therefore Alter TYPE options doesn't seem to apply to us. I have added a test to verify that we don't propagate them. Changelog entry on pg13: Add ALTER TYPE options useful for extensions, like TOAST and I/O functions control (Tomas Vondra, Tom Lane)	2020-08-04 15:38:11 +03:00
Sait Talha Nisanci	00633165fc	Changelog: Test unicode escapes Unicode escapes work as expected, related tests are added. Changelog entry on PG13: Allow Unicode escapes, e.g., E'\u####', U&'\####', to specify any character available in the database encoding, even when the database encoding is not UTF-8 (Tom Lane)	2020-08-04 15:36:30 +03:00
Sait Talha Nisanci	79dcb80140	Changelog: Test IS NORMALIZED for pg13 Tests for is_normalized and normalized ar eadded. One thing that seems to be because of existent bug is that when we don't give the second argument to normalize or is_normalized, which is optional, it crashes. Because in the executor part, in the expression we don't have the default argument. Changelog entry in PG-13: Add SQL functions NORMALIZE() to normalize Unicode strings, and IS NORMALIZED to check for normalization (Peter Eisentraut) Commit on Postgres: 2991ac5fc9b3904ca4582be6d323497d7c3d17c9	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	ebabca16b7	Changelog: Test row suffix notation It seems that row suffix notation is working fine with our code, a test is added. Changelog entry in PG13: Allow ROW values values to have their members extracted with suffix notation (Tom Lane)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	275ccd0400	Changelog: Test that alter view rename column works Changelog entry in PG13: Add ALTER VIEW syntax to rename view columns (Fujii Masao)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	920d7211e4	Changelog: Test that we error out for DROP EXPRESSION PG13 now supports dropping expression from a column such as generated columns. We error out with this currently. Changelog entry in postgres: Add ALTER TABLE clause DROP EXPRESSION to remove generated properties from columns (Peter Eisentraut)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	87088d92bc	Changelog: handle VACUUM PARALLEL option Postgres 13 added a new VACUUM option, PARALLEL. It is now supported in our code as well. Relevant changelog message on postgres: Allow VACUUM to process indexes in parallel (Masahiko Sawada, Amit Kapila)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	1070828465	update cte inline output for pg13 Make some macros in version_compat more robust Remove commented code in ruleutils Remove unnecessary variable assignments	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	157af140e4	ignore concurrent root page split debugs	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	ff7a563c57	decrease log level to debug1 to prevent flaky debug	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	6ff4e42706	Add alternative output for multi_function_in_join With pg13, constants functions from "FROM" clause are replaced. This means that in citus side, we will see the constraints in restriction info, instead of the function call. For example: SELECT * FROM table1 JOIN add(3,5) sum ON (id = sum) ORDER BY id ASC; Assuming that the function `add` returns constant, it will be evaluated on postgres side. This means that this query will be routable because there will be only one shard after pruning with the restrictions. However before pg13, this would be multi shard query. And it would go into recursive planning, the function would be evaluated on the coordinator because it can be. This means that with pg13, users will need to distribute the function because when it is routable executable, it will currently also send the function call to the worker in the query. So the function should exist in the worker. It could be better to replace the constant in the query tree as well so that the query string sent to the worker has the constant value and therefore it doesn't need the function. However I feel like users would already have the function in workers if they have any multi shard query. Commit on Postgres side: 7266d0997dd2a0632da38a594c78e25ff21df67e	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	a34a1126ec	add alternative output for pg13 in some tests	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	c5c9ec288f	fix multi_mx_create_table test	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	76c7b3d1c6	Remove unused steps in isolation tests PG13 gives a warning for unused steps therefore we should remove the unused steps in isolation tests.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	17388e2e91	update some tests	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	de82d0ff79	add output for pg13 for propagate extension commands CREATE EXTENSION <name> FROM <old_version> is not supported anymore with postgres 13. An alternative output is added for pg13 where we basically error for that statement.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	80d2bc2317	normalize some output and sort test result	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	0f6c21d418	sort result in ch_bench_having_mx test	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	70f27c10e5	Add some normalization rules for tests The not-null constraint message changed with pg13 slightly hence a normalization rule is added for that, which converts it to pg < 13 output. Commit on postgres: 05f18c6b6b6e4b44302ee20a042cedc664532aa2 An extra debug message is added related to indexes on postgres, these are safe to be ignored, so we can delete them from tests. Commit on Postgres side: 612a1ab76724aa1514b6509269342649f8cab375 varnoold is renamed as varnosyn and varoattno is renamed as varattnosyn so in the output we normalize the values as the old ones to simply pass the tests.	2020-08-04 15:10:22 +03:00
Onder Kalaci	eeb8c81de2	Implement shared connection count reservation & enable `citus.max_shared_pool_size` for COPY With this patch, we introduce `locally_reserved_shared_connections.c/h` files which are responsible for reserving some space in shared memory counters upfront. We sometimes need to reserve connections, but not necessarily establish them. For example: - COPY command should reserve connections as it cannot know which connections it needs in which order. COPY establishes connections as any input data hits the workers. For example, for router COPY command, it only establishes 1 connection. As discussed here (https://github.com/citusdata/citus/pull/3849#pullrequestreview-431792473), COPY needs to reserve connections up-front, otherwise we can end up with resource starvation/un-detected deadlocks.	2020-08-03 18:51:40 +02:00
nukoyluoglu	38987431e7	propagation of CHECK statements to workers with parentheses (#4039 ) * ensure propagation of CHECK statements to workers with parantheses & adjust regression test outputs * add tests for distributing tables with simple CHECK constraints * added test for CHECK on bool variable	2020-07-27 15:08:37 +03:00
Benjamin Satzger	a35a15a513	Distribute custom aggregates with multiple arguments (#4047 ) Enable custom aggregates with multiple parameters to be executed on workers. #2921 introduces distributed execution of custom aggregates. One of the limitations of this feature is that only aggregate functions with a single aggregation parameter can be pushed to worker nodes. Aim of this change is to remove that limitation and support handling of multi-parameter aggregates. Resolves: #3997 See also: #2921	2020-07-24 15:16:00 -07:00
Halil Ozan Akgul	38b72ddd66	Fixes create index concurrently bug	2020-07-24 12:14:14 +03:00
Halil Ozan Akgül	e9f89ed651	Fixes the non existing table bug (#4058 )	2020-07-23 18:01:21 +03:00
Sait Talha Nisanci	01c23b0df2	update test outputs with task-tracker removal	2020-07-21 16:25:08 +03:00
Sait Talha Nisanci	1dbd545cf4	replace task-tracker with adaptive in tests	2020-07-21 16:21:01 +03:00
Sait Talha Nisanci	4308d867d9	remove task-tracker in comments, documentation	2020-07-21 16:21:01 +03:00
Hanefi Önaldı	e534dbae4a	Accept list of values in a supported ALTER ROLE .. SET statement Some GUCs support a list of values which is indicated by GUC_LIST_INPUT flag. When an ALTER ROLE .. SET statement is executed, the new configuration default for affected users and databases are stored in the setconfig(text[]) column in a pg_db_role_setting record. If a GUC that supports a list of values is used in an ALTER ROLE .. SET statement, we need to split the text into items delimited by commas.	2020-07-21 03:49:57 +03:00
Nils Dijk	00a4a15d95	fix sorting on string litteral (#4045 ) As noted by Talha https://github.com/citusdata/citus/pull/4029#issuecomment-660466972 there was still some sort order flappiness in the test. The root cause is that sorting on `1::text` sorts on the literal `'1'` which causes sorting to be indeterministic. This behaviour is consistent with Postgres' behaviour, so no bug on Citus' side.	2020-07-20 17:39:27 +02:00
Onder Kalaci	c25de2cf22	Remove flag from As it doesn't make any sense anymore	2020-07-20 12:45:05 +02:00
SaitTalhaNisanci	b3af63c8ce	Remove task tracker executor (#3850 ) * use adaptive executor even if task-tracker is set * Update check-multi-mx tests for adaptive executor Basically repartition joins are enabled where necessary. For parallel tests max adaptive executor pool size is decresed to 2, otherwise we would get too many clients error. * Update limit_intermediate_size test It seems that when we use adaptive executor instead of task tracker, we exceed the intermediate result size less in the test. Therefore updated the tests accordingly. * Update multi_router_planner It seems that there is one problem with multi_router_planner when we use adaptive executor, we should fix the following error: +ERROR: relation "authors_range_840010" does not exist +CONTEXT: while executing command on localhost:57637 * update repartition join tests for check-multi * update isolation tests for repartitioning * Error out if shard_replication_factor > 1 with repartitioning As we are removing the task tracker, we cannot switch to it if shard_replication_factor > 1. In that case, we simply error out. * Remove MULTI_EXECUTOR_TASK_TRACKER * Remove multi_task_tracker_executor Some utility methods are moved to task_execution_utils.c. * Remove task tracker protocol methods * Remove task_tracker.c methods * remove unused methods from multi_server_executor * fix style * remove task tracker specific tests from worker_schedule * comment out task tracker udf calls in tests We were using task tracker udfs to test permissions in multi_multiuser.sql. We should find some other way to test them, then we should remove the commented out task tracker calls. * remove task tracker test from follower schedule * remove task tracker tests from multi mx schedule * Remove task-tracker specific functions from worker functions * remove multi task tracker extra schedule * Remove unused methods from multi physical planner * remove task_executor_type related things in tests * remove LoadTuplesIntoTupleStore * Do initial cleanup for repartition leftovers During startup, task tracker would call TrackerCleanupJobDirectories and TrackerCleanupJobSchemas to clean up leftover directories and job schemas. With adaptive executor, while doing repartitions it is possible to leak these things as well. We don't retry cleanups, so it is possible to have leftover in case of errors. TrackerCleanupJobDirectories is renamed as RepartitionCleanupJobDirectories since it is repartition specific now, however TrackerCleanupJobSchemas cannot be used currently because it is task tracker specific. The thing is that this function is a no-op currently. We should add cleaning up intermediate schemas to DoInitialCleanup method when that problem is solved(We might want to solve it in this PR as well) * Revert "remove task tracker tests from multi mx schedule" This reverts commit `03ecc0a681`. * update multi mx repartition parallel tests * not error with task_tracker_conninfo_cache_invalidate * not run 4 repartition queries in parallel It seems that when we run 4 repartition queries in parallel we get too many clients error on CI even though we don't get it locally. Our guess is that, it is because we open/close many connections without doing some work and postgres has some delay to close the connections. Hence even though connections are removed from the pg_stat_activity, they might still not be closed. If the above assumption is correct, it is unlikely for it to happen in practice because: - There is some network latency in clusters, so this leaves some times for connections to be able to close - Repartition joins return some data and that also leaves some time for connections to be fully closed. As we don't get this error in our local, we currently assume that it is not a bug. Ideally this wouldn't happen when we get rid of the task-tracker repartition methods because they don't do any pruning and might be opening more connections than necessary. If this still gives us "too many clients" error, we can try to increase the max_connections in our test suite(which is 100 by default). Also there are different places where this error is given in postgres, but adding some backtrace it seems that we get this from ProcessStartupPacket. The backtraces can be found in this link: https://circleci.com/gh/citusdata/citus/138702 * Set distributePlan->relationIdList when it is needed It seems that we were setting the distributedPlan->relationIdList after JobExecutorType is called, which would choose task-tracker if replication factor > 1 and there is a repartition query. However, it uses relationIdList to decide if the query has a repartition query, and since it was not set yet, it would always think it is not a repartition query and would choose adaptive executor when it should choose task-tracker. * use adaptive executor even with shard_replication_factor > 1 It seems that we were already using adaptive executor when replication_factor > 1. So this commit removes the check. * remove multi_resowner.c and deprecate some settings * remove TaskExecution related leftovers * change deprecated API error message * not recursively plan single relatition repartition subquery * recursively plan single relation repartition subquery * test depreceated task tracker functions * fix overlapping shard intervals in range-distributed test * fix error message for citus_metadata_container * drop task-tracker deprecated functions * put the implemantation back to worker_cleanup_job_schema_cachesince citus cloud uses it * drop some functions, add downgrade script Some deprecated functions are dropped. Downgrade script is added. Some gucs are deprecated. A new guc for repartition joins bucket size is added. * order by a test to fix flappiness	2020-07-18 13:11:36 +03:00
Marco Slot	9cb8dc9d12	Improve error message when creating a foreign key to a local table	2020-07-13 13:57:22 +02:00
SaitTalhaNisanci	bc011a6286	Add IsCitusTable check to citus table utilities (#4028 )	2020-07-14 18:29:33 +03:00
Nils Dijk	23d44eba9f	fix flappy tests due to undeterministic order of test output (#4029 ) As reported on #4011 https://github.com/citusdata/citus/pull/4011/files#r453804702 some of the tests were flapping due to an indeterministic order for test outputs. This PR makes the test output ordered for all tests returning non-zero rows. Needs to be backported to 9.2, 9.3, 9.4	2020-07-14 15:47:29 +02:00
SaitTalhaNisanci	ab5be77709	test coordinator reference-distributed table join (#3698 )	2020-07-14 11:43:03 +03:00
Sait Talha Nisanci	1b5ed45a58	add multi follower repartition tests	2020-07-13 19:50:50 +03:00
Sait Talha Nisanci	510535f558	address feedback	2020-07-13 19:45:02 +03:00
Sait Talha Nisanci	41ec76a6ad	use ActiveReadableNodeList in JobExecutorType and task tracker The reason we should use ActiveReadableNodeList instead of ActiveReadableNonCoordinatorNodeList is that if coordinator is added to cluster as a worker, it should be counted as well. Otherwise if there is only coordinator in the cluster, the count will be 0, hence we get a warning. In MultiTaskTrackerExecute, we should connect to coordinator if it is added to the cluster because it will also be assigned tasks.	2020-07-13 19:45:02 +03:00
Sait Talha Nisanci	d97d03ec65	use ActivePrimaryNodeList to include coordinator ActiveReadableWorkerNodeList doesn't include coordinator, however if coordinator is added as a worker, we should also include that while planning. The current methods are very easily misusable and this requires a refactoring to make the distinction between methods that include coordinator and that don't very explicit as they can introduce subtle/major bugs pretty easily.	2020-07-13 19:20:15 +03:00
Sait Talha Nisanci	db1b78148c	send schema creation/cleanup to coordinator in repartitions We were using ALL_WORKERS TargetWorkerSet while sending temporary schema creation and cleanup. We(well mostly I) thought that ALL_WORKERS would also include coordinator when it is added as a worker. It turns out that it was FILTERING OUT the coordinator even if it is added as a worker to the cluster. So to have some context here, in repartitions, for each jobId we create (at least we were supposed to) a schema in each worker node in the cluster. Then we partition each shard table into some intermediate files, which is called the PARTITION step. So after this partition step each node has some intermediate files having tuples in those nodes. Then we fetch the partition files to necessary worker nodes, which is called the FETCH step. Then from the files we create intermediate tables in the temporarily created schemas, which is called a MERGE step. Then after evaluating the result, we remove the temporary schemas(one for each job ID in each node) and files. If node 1 has file1, and node 2 has file2 after PARTITION step, it is enough to either move file1 from node1 to node2 or vice versa. So we prune one of them. In the MERGE step, if the schema for a given jobID doesn't exist, the node tries to use the `public` schema if it is a superuser, which is actually added for testing in the past. So when we were not sending schema creation comands for each job ID to the coordinator(because we were using ALL_WORKERS flag, and it doesn't include the coordinator), we would basically not have any schemas for repartitions in the coordinator. The PARTITION step would be executed on the coordinator (because the tasks are generated in the planner part) and it wouldn't give us any error because it doesn't have anything to do with the temporary schemas(that we didn't create). But later two things would happen: - If by chance the fetch is pruned on the coordinator side, we the other nodes would fetch the partitioned files from the coordinator and execute the query as expected, because it has all the information. - If the fetch tasks are not pruned in the coordinator, in the MERGE step, the coordinator would either error out saying that the necessary schema doesn't exist, or it would try to create the temporary tables under public schema ( if it is a superuser). But then if we had the same task ID with different jobID it would fail saying that the table already exists, which is an error we were getting. In the first case, the query would work okay, but it would still not do the cleanup, hence we would leave the partitioned files from the PARTITION step there. Hence ensure_no_intermediate_data_leak would fail. To make things more explicit and prevent such bugs in the future, ALL_WORKERS is named as ALL_NON_COORD_WORKERS. And a new flag to return all the active nodes is added as ALL_DATA_NODES. For repartition case, we don't use the only-reference table nodes but this version makes the code simpler and there shouldn't be any significant performance issue with that.	2020-07-13 19:20:15 +03:00
SaitTalhaNisanci	76ddb85545	improve error message in secondaries (#4025 )	2020-07-13 19:18:57 +03:00
Nils Dijk	449d1f0e91	force aliases in deparsing for queries with anonymous column references (#4011 ) DESCRIPTION: Force aliases in deparsing for queries with anonymous column references Fixes: #3985 The root cause has todo with discrepancies in the query tree we create. I think in the future we should spend some time on categorising all changes we made to ruleutils and see if we can change the data structure `query` we pass to the deparser to have an actual valid postgres query for the deparser to render. For now the fix is to keep track, besides changing the names of the entries in the target list, also if we have a reference to an anonymous columns. If there are anonymous columns we set the `printaliases` flag to true which forces the deparser to add the aliases.	2020-07-13 16:29:24 +02:00
Hadi Moshayedi	3651fc64ee	Fix Subtransaction memory leak	2020-07-09 12:33:39 -07:00
Jelte Fennema	16242d5264	Fix write queries with const expressions and COLLATE in various places (#3973 )	2020-07-08 18:19:53 +02:00
Jelte Fennema	ab01571c9e	Fix crash with single node dummy placement (#3993 ) Static analysis found an issue where we could dereference `NULL`, because `CreateDummyPlacement` could return `NULL` when there were no workers. This PR changes it so that it never returns `NULL`, which was intended by @marcocitus when doing this change: https://github.com/citusdata/citus/pull/3887/files#r438136433 While adding tests for citus on a single node I also added some more basic tests and it turns out we error out on repartition joins. This has been present since `shouldhaveshards` was introduced and is not trivial to fix. So I created a separate issue for this: https://github.com/citusdata/citus/issues/3996	2020-07-08 17:11:25 +02:00
Philip Dubé	444472ffc6	ruleutils: use get_rtable_name for deparsing resultRelation	2020-07-07 12:20:41 +00:00
Marco Slot	b4fec63bc0	Rename master evaluation to coordinator evaluation	2020-07-07 10:37:41 +02:00
Jelte Fennema	8ab47f4f37	Add a CI check to see if all tests are part of a schedule (#3959 ) I recently forgot to add tests to a schedule in two of my PRs. One of these was caught by review, but the other one was not. This adds a script to causes CI to ensure that each test in the repo is included in at least one schedule. Three tests were found that were currently not part of a schedule. This PR adds those three tests to a schedule as well and it also fixes some small issues with these tests.	2020-07-03 11:34:55 +02:00
Jelte Fennema	9311978487	Add README for CI scripts We keep accumulating more and more scripts to flag issues in CI. This is good, but we are currently missing consistent documentation for them. This commit moves all these scripts to the `ci` directory and adds some documentation for all of them in the README. It also makes sure that the last line of output of a failed script points to this documentation.	2020-07-03 10:22:48 +02:00
Onur Tirtir	be17ebb334	Bump citus version to 9.5devel	2020-07-01 14:46:55 +03:00
Hanefi Önaldı	ca2ececb3b	Downgrade path from 9.4 to 9.3 to 9.2	2020-07-01 10:38:11 +03:00
Sait Talha Nisanci	e5a21f07cb	test aggregates with expressions	2020-06-30 11:41:16 -07:00
Jelte Fennema	392c5e2c34	Fix wrong cancellation message about distributed deadlocks (#3956 )	2020-06-30 14:57:46 +02:00

1 2 3 4 5 ...

1695 Commits (04aeb6938b452da03a03b7c9a47bee39bc5340b4)