citus

Commit Graph

Author	SHA1	Message	Date
Sait Talha Nisanci	8cabd2e822	Decrease memory usage with rebalancer We decrease memory usage by: - Freeing temporary buffers - Using separate memory context for blocks that uses "small" amount of memory but can be repeated many times such as loops	2021-04-29 13:40:47 +03:00
Hanefi Onaldi	2f90ce931b	Fix minor issues with makefile targets (#4717 )	2021-04-28 15:46:55 +03:00
Marco Slot	4b49cb112f	Fix FROM ONLY queries on partitioned tables	2021-04-27 16:10:07 +02:00
Ahmet Gedemenli	fe65be993e	Sort GUCs in alphabetic order	2021-04-26 15:05:42 +03:00
Ahmet Gedemenli	332c5ce4ad	Fix worker partitioned size functions (#4922 )	2021-04-26 10:29:46 +03:00
Onder Kalaci	918838e488	Allow constant VALUES clauses in pushdown queries As long as the VALUES clause contains constant values, we should not recursively plan the queries/CTEs. This is a follow-up work of #1805. So, we can easily apply OUTER join checks as if VALUES clause is a reference table/immutable function.	2021-04-21 14:28:08 +02:00
SaitTalhaNisanci	93c2dcf3d2	Fix data-race with concurrent calls of DropMarkedShards (#4909 ) * Fix problews with concurrent calls of DropMarkedShards When trying to enable `citus.defer_drop_after_shard_move` by default it turned out that DropMarkedShards was not safe to call concurrently. This could especially cause big problems when also moving shards at the same time. During tests it was possible to trigger a state where a shard that was moved would not be available on any of the nodes anymore after the move. Currently DropMarkedShards is only called in production by the maintenaince deamon. Since this is only a single process triggering such a race is currently impossible in production settings. In future changes we will want to call DropMarkedShards from other places too though. * Add some isolation tests Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2021-04-21 10:59:48 +03:00
Ahmet Gedemenli	33c620f232	Optimize partitioned disk size calculation (#4905 ) * Optimize partitioned disk size calculation * Polish * Fix test for citus_shard_cost_by_disk_size Try optimizing if not CSTORE	2021-04-19 13:30:56 +03:00
Onder Kalaci	5482d5822f	Keep more statistics about connection establishment times When DEBUG4 enabled, Citus now prints per connection establishment time.	2021-04-16 14:56:31 +02:00
Onder Kalaci	5b78f6cd63	Keep more execution statistics When DEBUG4 enabled, Citus now prints per task execution times.	2021-04-16 14:45:00 +02:00
Hanefi Onaldi	9919fbe3f8	Switch to sequential mode on long partition names This commit adds support for long partition names for distributed tables: - ALTER TABLE dist_table ATTACH PARTITION .. - CREATE TABLE .. PARTITION OF dist_table .. Note: create_distributed_table UDF does not support long table and partition names, and is not covered in this commit	2021-04-14 15:27:50 +03:00
Ahmet Gedemenli	e445e3d39c	Introduce 3 partitioned size udfs (#4899 ) * Introduce 3 partitioned size udfs * Add tests for new partition size udfs * Fix type incompatibilities * Convert UDFs into pure sql functions * Fix function comment	2021-04-13 17:36:27 +03:00
Onur Tirtir	fe5c985e1d	Remove HAS_TABLEAM config since we dropped pg11 support (#4862 ) * Remove HAS_TABLEAM config * Drop columnar_ensure_objects_exist * Not call columnar_ensure_objects_exist in citus_finish_pg_upgrade	2021-04-13 10:51:26 +03:00
Ahmet Gedemenli	d74d358a45	Refactor size queries with new enum SizeQueryType (#4898 ) * Refactor size queries with new enum SizeQueryType * Polish	2021-04-12 17:14:29 +03:00
SaitTalhaNisanci	b453563e88	Warm up connections params hash (#4872 ) ConnParams(AuthInfo and PoolInfo) gets a snapshot, which will block the remote connectinos to localhost. And the release of snapshot will be blocked by the snapshot. This leads to a deadlock. We warm up the conn params hash before starting a new transaction so that the entries will already be there when we start a new transaction. Hence GetConnParams will not get a snapshot.	2021-04-12 13:08:38 +03:00
Ahmet Gedemenli	caef0463b0	Update func comment for PostprocessCreateTableStmt	2021-04-09 13:41:59 +03:00
Ahmet Gedemenli	52e467a9a0	Error out if inheriting a distributed table (#4871 ) * Error out if inheriting a distributed table * Add test inheriting a distirbuted table	2021-04-07 11:21:06 +03:00
Ahmet Gedemenli	e4c4a9b683	Fix error message for local table joins (#4870 ) * Fix error message for local table joins * Fix error messages for regression tests expected outputs	2021-04-06 16:18:28 +03:00
Ahmet Gedemenli	840c879572	Remove redundant if statement for schema name	2021-04-06 10:29:17 +03:00
Halil Ozan Akgul	a5038046f9	Adds shard_count parameter to create_distributed_table	2021-03-29 16:22:49 +03:00
SaitTalhaNisanci	03832f353c	Drop postgres 11 support	2021-03-25 09:20:28 +03:00
Önder Kalacı	b5f4320164	Make sure that single task local executions start coordinated transaction (#4831 ) With https://github.com/citusdata/citus/pull/4806 we enabled 2PC for any non-read-only local task. However, if the execution is a single task, enabling 2PC (CoordinatedTransactionShouldUse2PC) hits an assertion as we are not in a coordinated transaction. There is no downside of using a coordinated transaction for single task local queries.	2021-03-17 12:20:57 +01:00
Ahmet Gedemenli	5e5db9eefa	Add udf citus_get_active_worker_nodes	2021-03-17 13:15:59 +03:00
Marco Slot	fbc2147e11	Replace MAX_PUT_COPY_DATA_BUFFER_SIZE by citus.remote_copy_flush_threshold GUC	2021-03-16 06:00:38 +01:00
Marco Slot	1646fca445	Add GUC to set maximum connection lifetime	2021-03-16 01:57:57 +01:00
Marco Slot	6c5d263b7a	Remove unnecessary AtEOXact_Files call	2021-03-15 09:34:02 +01:00
Onder Kalaci	e65e72130d	Rename use -> shouldUse Because setting the flag doesn't necessarily mean that we'll use 2PC. If connections are read-only, we will not use 2PC. In other words, we'll use 2PC only for connections that modified any placements.	2021-03-12 08:29:43 +00:00
Onder Kalaci	6a7ed7b309	Do not trigger 2PC for reads on local execution Before this commit, Citus used 2PC no matter what kind of local query execution happens. For example, if the coordinator has shards (and the workers as well), even a simple SELECT query could start 2PC: ```SQL WITH cte_1 AS (SELECT * FROM test LIMIT 10) SELECT count(*) FROM cte_1; ``` In this query, the local execution of the shards (and also intermediate result reads) triggers the 2PC. To prevent that, Citus now distinguishes local reads and local writes. And, Citus switches to 2PC only if a modification happens. This may still lead to unnecessary 2PCs when there is a local modification and remote SELECTs only. Though, we handle that separately via #4587.	2021-03-12 08:29:43 +00:00
Onur Tirtir	874d5fd962	Remove foreign keys between columnar metadata tables (#4791 ) Postgres keeps AFTER trigger state for each transaction, because we can have deferred AFTER triggers which will be fired at the end of a transaction. Postgres cleans up this state at the end of transaction. Postgres processes ON COMMIT triggers after cleaning-up the AFTER trigger states. So if we fire any triggers in ON COMMIT, the AFTER trigger state won't be cleaned-up properly and the transaction state will be left in an inconsistent state, which might result in assertion failure. So with this commit, we remove foreign keys between columnar metadata tables and enforce constraints between them manually when dropping columnar tables.	2021-03-12 11:28:17 +03:00
Naisila Puka	71a9f45513	Fix upgrade and downgrade paths for master/citus_update_table_statistics (#4805 )	2021-03-11 14:52:40 +03:00
Naisila Puka	196064836c	Skip 2PC for readonly connections in a transaction (#4587 ) * Skip 2PC for readonly connections in a transaction * Use ConnectionModifiedPlacement() function * Remove the second check of ConnectionModifiedPlacement() * Add order by to prevent flaky output * Test using pg_dist_transaction	2021-03-10 20:01:37 +03:00
Marco Slot	58f85f55c0	Fixes a crash in queries with a modifying CTE and a SELECT without FROM	2021-03-09 10:39:33 +01:00
Philip Dubé	4e22f02997	Fix various typos due to zealous repetition	2021-03-04 19:28:15 +00:00
Marco Slot	f25de6a0e3	Try to return earlier in idempotent master_add_node	2021-03-02 21:22:47 +01:00
Hadi Moshayedi	affe38eac6	Populate DATABASEOID cache before CREATE INDEX CONCURRENTLY	2021-03-03 12:59:46 -08:00
Onder Kalaci	54ee96470e	Pass pointer of AttributeEquivalenceClass instead of pointer of pointer AttributeEquivalenceClass seems to be unnecessarily used with multiple pointers. Just use a single pointer for ease of read.	2021-03-03 12:27:26 +01:00
Onder Kalaci	d1cd198655	Prevent infinite recursion for queries that involve UNION ALL and JOIN With this commit, we make sure to prevent infinite recursion for queries in the format: [subquery with a UNION ALL] JOIN [table or subquery] Also, fixes a bug where we pushdown UNION ALL below a JOIN even if the UNION ALL is not safe to pushdown.	2021-03-03 12:27:26 +01:00
Naisila Puka	2f30614fe3	Reimplement citus_update_table_statistics to detect dist. deadlocks (#4752 ) * Reimplement citus_update_table_statistics * Update stats for the given table not colocation group * Add tests for reimplemented citus_update_table_statistics * Use coordinated transaction, merge with citus_shard_sizes functions * Update the old master_update_table_statistics as well	2021-03-03 04:12:30 +03:00
Marco Slot	dca615c5aa	Normalize the ConvertTable notices	2021-03-01 10:36:12 +01:00
SaitTalhaNisanci	feee25dfbd	Use translated vars in postgres 13 as well (#4746 ) * Use translated vars in postgres 13 as well Postgres 13 removed translated vars with pg 13 so we had a special logic for pg 13. However it had some bug, so now we copy the translated vars before postgres deletes it. This also simplifies the logic. * fix rtoffset with pg >= 13	2021-02-26 19:41:29 +03:00
Halil Ozan Akgul	5c5cb200f7	Adds GRANT for public to citus_tables	2021-02-26 16:24:33 +03:00
Önder Kalacı	0fe26a216c	Prevent cross join without any target list entries (#4750 ) /* * The physical planner assumes that all worker queries would have * target list entries based on the fact that at least the column * on the JOINs have to be on the target list. However, there is * an exception to that if there is a cartesian product join and * there is no additional target list entries belong to one side * of the JOIN. Once we support cartesian product join, we should * remove this error. */	2021-02-26 11:04:21 +01:00
Onur Tirtir	54ac924bef	Grant read access for columnar metadata tables to unprivileged user	2021-02-26 12:31:09 +03:00
Onur Tirtir	dcc0207605	Add 10.0-2 schema version	2021-02-26 12:31:09 +03:00
Naisila Puka	5ebd4eac7f	Preserve colocation with procedures in alter_distributed_table (#4743 )	2021-02-25 19:52:47 +03:00
Hanefi Onaldi	9a792ef841	Remove length limitations for table renames	2021-02-24 03:35:27 +03:00
Naisila Puka	dbb88f6f8b	Fix insert query with CTEs/sublinks/subqueries etc (#4700 ) * Fix insert query with CTE * Add more cases with deferred pruning but false fast path * Add more tests * Better readability with if statements	2021-02-23 18:00:47 +03:00
SaitTalhaNisanci	dcf54eaf2a	Use PROCESS_UTILITY_QUERY in utility calls When we use PROCESS_UTILITY_TOPLEVEL it causes some problems when combined with other extensions such as pg_audit. With this commit we use PROCESS_UTILITY_QUERY in the codebase to fix those problems.	2021-02-19 13:55:59 +03:00
Sait Talha Nisanci	bbf6132226	Revert "wip (#4730 )" This reverts commit `62e6d54a4e`.	2021-02-19 13:55:59 +03:00
SaitTalhaNisanci	62e6d54a4e	wip (#4730 )	2021-02-19 13:42:19 +03:00
Marco Slot	972a8bc0b7	Rewrite time_partitions join clause to avoid smallint[] operator	2021-02-18 12:01:18 +01:00
Ahmet Gedemenli	1f345f65b4	Support dropping local table indexes along with a distributed index	2021-02-18 13:30:12 +03:00
Onur Tirtir	676d9a9726	Bump Citus to 10.1devel	2021-02-17 11:54:33 +03:00
Onur Tirtir	d61fd6e478	Decide changing sequence dependencies on MX nodes according to resulting relation (#4713 ) When executing alter_table / undistribute_table udf's, we should not try to change sequence dependencies on MX workers if new table wouldn't require syncing metadata. Previously, we were checking that for input table. But in some cases, the fact that input table requires syncing metadata doesn't imply the same for resulting table (e.g when undistributing a Citus table). Even more, doing that was giving an unexpected error when undistributing a Citus table so this commit actually fixes that.	2021-02-15 19:20:26 +03:00
SaitTalhaNisanci	bcbd24f8de	Only consider pseudo constants for shortcuts (#4712 ) It seems that we need to consider only pseudo constants while doing some shortcuts in planning. For example there could be a false clause but it can contribute to the result in which case it will not be a pseudo constant.	2021-02-15 18:39:37 +03:00
SaitTalhaNisanci	0f1ce7a913	Not skip relation in conversion if it doesn't have RelationRestriction (#4685 ) We would exclude tables without relationRestriction from conversion candidates in local-distributed table joins. This could leave a leftover local table which should have been converted to a subquery. Ideally I would expect that in each call to CreateDistributedPlan we would pass a new plan id, but that seems like a bigger change.	2021-02-12 12:33:55 +03:00
Onder Kalaci	f297c96ec5	Add regression tests for COPY into colocated intermediate results To add the tests without too much data, make the copy switchover configurable.	2021-02-11 15:41:06 +01:00
Onder Kalaci	5d5a357487	Do not connection re-use for intermediate results /* * Colocated intermediate results are just files and not required to use * the same connections with their co-located shards. So, we are free to * use any connection we can get. * * Also, the current connection re-use logic does not know how to handle * intermediate results as the intermediate results always truncates the * existing files. That's why, we use one connection per intermediate * result. */	2021-02-11 15:41:06 +01:00
Ahmet Gedemenli	c8e83d1f26	Fix dropping fkey when distributing table	2021-02-11 15:48:35 +03:00
SaitTalhaNisanci	847b79078f	Not consider subplans in restriction list (#4679 ) * Not consider subplans in restriction list * Not consider sublink, alternative subplan in restrictions	2021-02-11 15:04:07 +03:00
Hadi Moshayedi	c3dcd6b9f8	Columnar: don't include stripe reservation locks in lock graph.	2021-02-10 10:20:20 -08:00
Onur Tirtir	9f619a85d6	Fix EXPLAIN ANALYZE exec when query returns no cols (#4672 ) We do not include dummy column if original task didn't return any columns. Otherwise, number of columns that original task returned wouldn't match number of columns returned by worker_save_query_explain_analyze.	2021-02-10 17:59:47 +03:00
Onder Kalaci	c804c9aa21	Allow local execution for intermediate results in COPY When COPY is used for copying into co-located files, it was not allowed to use local execution. The primary reason was Citus treating co-located intermediate results as co-located shards, and COPY into the distributed table was done via "format result". And, local execution of such COPY commands was not implemented. With this change, we implement support for local execution with "format result". To do that, we use the buffer for every file on shardState->copyOutState, similar to how local copy on shards are implemented. In fact, the logic is similar to local copy on shards, but instead of writing to the shards, Citus writes the results to a file. The logic relies on LOCAL_COPY_FLUSH_THRESHOLD, and flushes only when the size exceeds the threshold. But, unlike local copy on shards, in this case we write the headers and footers just once.	2021-02-09 15:00:06 +01:00
Hanefi Onaldi	353b080474	Fix Semmle errors (#4636 ) Co-authored-by: Halil Ozan Akgül <hozanakgul@gmail.com>	2021-02-08 18:37:44 +03:00
SaitTalhaNisanci	e96da4886f	Sort results in citus_shards and give raw size (#4649 ) * Sort results in citus_shards and give raw size Sort results so that it is consistent and also similar to citus_tables. Use raw size in the output so that doing operations on the size is easier. * Change column ordering	2021-02-08 15:29:42 +03:00
Ahmet Gedemenli	5dd2a3da03	Convert RelabelTypes into CollateExprs in get_rule_expr function	2021-02-05 12:06:46 +03:00
Ahmet Gedemenli	503171d2f2	Merge branch 'master' into rename-master-parameter-for-dist-stat-activity	2021-02-04 15:37:13 +03:00
Ahmet Gedemenli	2443b20b2c	Rename master to distributed for worker stat activity	2021-02-04 12:20:06 +03:00
Onder Kalaci	fc9a23792c	COPY uses adaptive connection management on local node With #4338, the executor is smart enough to failover to local node if there is not enough space in max_connections for remote connections. For COPY, the logic is different. With #4034, we made COPY work with the adaptive connection management slightly differently. The cause of the difference is that COPY doesn't know which placements are going to be accessed hence requires to get connections up-front. Similarly, COPY decides to use local execution up-front. With this commit, we change the logic for COPY on local nodes: Try to reserve a connection to local host. This logic follows the same logic (e.g., citus.local_shared_pool_size) as the executor because COPY also relies on TryToIncrementSharedConnectionCounter(). If reservation to local node fails, switch to local execution Apart from this, if local execution is disabled, we follow the exact same logic for multi-node Citus. It means that if we are out of the connection, we'd give an error.	2021-02-04 09:45:07 +01:00
Ahmet Gedemenli	34840ddc5c	Rename master to citus for dist stat activity cols	2021-02-04 11:12:23 +03:00
Sait Talha Nisanci	ff82e85ea2	Replace workerNodeCount -> nodeCount	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	eb5be579e3	Set previous cell inside a for loop	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	9ba3f70420	Remove unused method	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	24e60b44a1	Consider coordinator in intermediate result optimization It seems that we were not considering the case where coordinator was added to the cluster as a worker in the optimization of intermediate results. This could lead to errors when coordinator was added as a worker.	2021-02-03 20:02:03 +03:00
Onur Tirtir	c0f2817b70	Disallow using alter_table udfs with tables having any identity cols (#4635 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating table from scratch. For this reason, we don't allow using alter_table udfs with tables having any identity cols.	2021-02-03 19:33:54 +03:00
Onur Tirtir	3a403090fd	Disallow adding local table with identity column to metadata (#4633 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating shell relation. For this reason, we don't allow adding local tables -having identity cols- to metadata.	2021-02-03 19:05:17 +03:00
Onur Tirtir	5efb742f8a	Skip copying GENERATED ALWAYS AS STORED cols in ReplaceTable (#4616 ) Postgres doesn't allow inserting into columns having GENERATED ALWAYS AS (...) STORED expressions. For this reason, when executing undistribute_table or an alter_* udf, we should skip copying such columns. This is not bad since Postgres would already generate such columns.	2021-02-03 17:55:16 +03:00
Onur Tirtir	53b1888cac	Rename DropAndMoveDefaultSequenceOwnerships	2021-02-02 18:17:42 +03:00
Onur Tirtir	93c3f30024	Rename ExtractColumnsOwningSequences	2021-02-02 18:17:42 +03:00
Onur Tirtir	912d829757	Skip GENERATED AS ALWAYS STORED cols when processing cols owning sequences When finding columns owning sequences, we shouldn't rely on atthasdef since it might be true when column has GENERATED ALWAYS AS (...) STORED expression.	2021-02-02 18:17:42 +03:00
Onur Tirtir	c8a48c6eee	Not try to sync metadata for local tables (#4625 )	2021-02-02 15:12:12 +03:00
Onur Tirtir	c5d4e7081b	Fix invalid read issue in deprecated create_citus_local_table udf (#4611 ) Since create_citus_local_table doesn't specify cascadeViaForeignKeys option, we can't directly call citus_add_local_table_to_metadata from create_citus_local_table. Instead, implement an internal method and call it from deprecated udf too.	2021-02-02 12:53:27 +03:00
Brian Bergeron	1253eeb9ff	Don't propagate ALTER ROLE SET when scoped to a different database (#4471 ) Co-authored-by: brberger <brberger@microsoft.com>	2021-02-01 15:49:26 +03:00
Hanefi Önaldı	cab17afce9	Introduce UDFs for fixing partitioned table constraint names	2021-01-29 17:32:20 +03:00
Hanefi Önaldı	92cf49b7e9	Limit shardId in partitioned table constraint names to only CHECK	2021-01-29 17:29:53 +03:00
SaitTalhaNisanci	738825cc38	Fix partition column index issue (#4591 ) * Fix partition column index issue We send column names to worker_hash/range_partition_table methods, and in these methods we check the column name index from tuple descriptor. Then this index is used to decide the bucket that the current row will be sent for the repartition. This becomes a problem when there are the same column names in the tupleDescriptor. Then we can choose the wrong index. Hence the partitioned data will be put to wrong workers. Then the result could miss some data because workers might contain different range of data. An example: TupleDescriptor contains "trip_id", "car_id", "car_id" for one table. It contains only "car_id" for the other table. And assuming that the tables will be partitioned by car_id, it is not certain what should be used for deciding the bucket number for the first table. Assuming value 2 goes to bucket 2 and value 3 goes to bucket 3, it is not certain which bucket "1 2 3" (trip_id, car_id, car_id) row will go to. As a solution we send the index of partition column in targetList instead of the column name. The old API is kept so that if workers upgrade work, it still works (though it will have the same bug) * Use the same method so that backporting is easier	2021-01-29 14:40:40 +03:00
Onder Kalaci	04fcd73eb6	When reaches to shared pool size, COPY sets the placement access It looks like we forgot to set the placement accesses, and this could lead to self-deadlocks on complex transaction blocks.	2021-01-28 12:45:57 +01:00
Onder Kalaci	36bdeef1bb	When reaches to executor pool size, COPY sets the placement access It looks like we forgot to set the placement accesses, and this could lead to self-deadlocks on complex transaction blocks.	2021-01-28 12:45:57 +01:00
Onur Tirtir	bb5962ee79	Early error out when creating citus local from a temp table (#4592 )	2021-01-28 14:18:06 +03:00
Halil Ozan Akgul	913aa91449	Adds error message to AlterTableSetAccessMethod for below PG12	2021-01-28 11:32:02 +03:00
Onur Tirtir	b20615cbbe	Advise dropping foreign key in addition to create_reference_table hint (#4590 )	2021-01-27 17:59:06 +03:00
Onur Tirtir	8151c4b443	Merge remote-tracking branch 'origin/master' into rename-create_citus_local_table	2021-01-27 17:08:58 +03:00
Ahmet Gedemenli	b2c1bbddd4	Merge branch 'master' into fix-dropping-mat-views-when-alter-table	2021-01-27 16:33:10 +03:00
Ahmet Gedemenli	35043c56f1	Fix dropping materialized views while doing alter table	2021-01-27 16:32:09 +03:00
Onur Tirtir	93a83d5472	Rename create_citus_local_table.c to citus_add_local_table_to_metadata.c	2021-01-27 15:52:37 +03:00
Onur Tirtir	1a4482a37c	Get rid of the sql dir for new udf	2021-01-27 15:52:37 +03:00
Onur Tirtir	2f30be823e	Rename create_citus_local_table to citus_add_local_table_to_metadata For simplicity in downgrade test in multi_extension, didn't actually remove create_citus_local_table udf.	2021-01-27 15:52:36 +03:00
Onur Tirtir	c06fcc26e5	Hide notice messages when implicitly undistributing citus local tables	2021-01-27 13:42:06 +03:00
Onur Tirtir	458a81f93d	Add suppressNoticeMessages to TableConversionState	2021-01-27 12:53:58 +03:00
Onur Tirtir	cacb76d2c6	Not mention citus local tables in error messages (#4579 )	2021-01-27 12:36:53 +03:00

1 2 3 4 5 ...

2284 Commits (a7e4d6c94a96a9013ad23fb6b6c44dc83da15d32)