citus

Commit Graph

Author	SHA1	Message	Date
Marco Slot	666696c01c	Deprecate citus.replicate_reference_tables_on_activate, make it always off (#6474 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-04 16:21:10 +01:00
Jelte Fennema	737e2bb1bb	Don't leak search_path to workers on DDL (#6444 ) DESCRIPTION: Don't leak search_path to workers on DDL For DDL we have to set the `search_path` on workers to the same as on the coordinator for some DDL to work. Previously this search_path would leak outside of the transaction that was used for the DDL. This fixes that by using `SET LOCAL` instead of `SET`. The only place where we still use plain `SET` is for DDL commands that are not allowed within transactions, such as `CREATE INDEX CONCURRENLTY`. This fixes this flaky test: ```diff CONTEXT: SQL statement "SELECT change_id FROM distributed_triggers.data_changes WHERE shard_key_value = NEW.shard_key_value AND object_id = NEW.object_id ORDER BY change_id DESC LIMIT 1" -PL/pgSQL function record_change() line XX at SQL statement +PL/pgSQL function distributed_triggers.record_change() line 17 at SQL statement while executing command on localhost:57638 DELETE FROM data_ref_table where shard_key_value = 'hello'; ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27849/workflows/75ae5f1a-100b-4b7a-b991-7de069f39ee1/jobs/831429 I had tried to fix this flaky test in #5894 and then I tried implementing a better fix in #5896, where @marcocitus suggested this better fix. This change reverts the fix from #5894 and implements the fix suggested by Marco. Our multi_mx_alter_distributed_table test actually depended on the old buggy search_path leaking behavior. After fixing the bug that test would fail like this: ```diff CALL proc_0(1.0); DEBUG: pushing down the procedure -NOTICE: Res: 3 -DETAIL: from localhost:xxxxx +ERROR: relation "test_proc_colocation_0" does not exist +CONTEXT: PL/pgSQL function mx_alter_distributed_table.proc_0(double precision) line 5 at SQL statement +while executing command on localhost:57637 RESET client_min_messages; ``` I fixed this test by fully qualifying the table names used in the procedure. I think it's quite unlikely that actual users depend on this behavior though. Since it would require first doing DDL before calling a procedure in a session where the search_path was changed after connecting.	2022-10-19 16:47:35 +02:00
Onder Kalaci	766f340ce0	Prevent failures on partitioned distributed tables with statistics objects on PG 15 Comment from the code is clear on this: /* * The statistics objects of the distributed table are not relevant * for the distributed planning, so we can override it. * * Normally, we should not need this. However, the combination of * Postgres commit 269b532aef55a579ae02a3e8e8df14101570dfd9 and * Citus function AdjustPartitioningForDistributedPlanning() * forces us to do this. The commit expects statistics objects * of partitions to have "inh" flag set properly. Whereas, the * function overrides "inh" flag. To avoid Postgres to throw error, * we override statlist such that Postgres does not try to process * any statistics objects during the standard_planner() on the * coordinator. In the end, we do not need the standard_planner() * on the coordinator to generate an optimized plan. We call * into standard_planner() for other purposes, such as generating the * relationRestrictionContext here. * * AdjustPartitioningForDistributedPlanning() is a hack that we use * to prevent Postgres' standard_planner() to expand all the partitions * for the distributed planning when a distributed partitioned table * is queried. It is required for both correctness and performance * reasons. Although we can eliminate the use of the function for * the correctness (e.g., make sure that rest of the planner can handle * partitions), it's performance implication is hard to avoid. Certain * planning logic of Citus (such as router or query pushdown) relies * heavily on the relationRestrictionList. If * AdjustPartitioningForDistributedPlanning() is removed, all the * partitions show up in the, causing high planning times for * such queries. */	2022-09-15 14:36:05 +03:00
Naisila Puka	7d6410c838	Drop postgres 12 support (#6040 ) * Remove if conditions with PG_VERSION_NUM < 13 * Remove server_above_twelve(&eleven) checks from tests * Fix tests * Remove pg12 and pg11 alternative test output files * Remove pg12 specific normalization rules * Some more if conditions in the code * Change RemoteCollationIdExpression and some pg12/pg13 comments * Remove some more normalization rules	2022-07-20 17:49:36 +03:00
Burak Velioglu	1460452442	Introduce CREATE/DROP VIEW Adds support for propagating create/drop view commands and views to worker node while scaling out the cluster. Since views are dropped while converting the table type, metadata connection will be used while propagating view commands to not switch to sequential mode.	2022-05-10 13:07:14 +03:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Halil Ozan Akgul	87a1c760d9	Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 12:09:53 +03:00
naisila	385ba94d15	Run fix_partition_shard_index_names after each wrong naming command	2021-11-08 10:43:34 +01:00
Ahmet Gedemenli	67dca4363d	Dont auto-undistribute user-added citus local tables (#5314 ) * Disable auto-undistribute for user-added citus local tables	2021-10-28 12:10:26 +03:00
Philip Dubé	cc50682158	Fix typos. Spurred spotting "connectios" in logs	2021-10-25 13:54:09 +00:00
Halil Ozan Akgul	b710e0064d	Fix tests that fail with MX in multi_schedule	2021-10-15 12:58:38 +03:00
Ahmet Gedemenli	35f6fe5f9f	Refactor/Improve PreprocessAlterTableStmtAttachPartition (#5366 ) * Refactor/Improve PreprocessAlterTableStmtAttachPartition	2021-10-14 11:39:39 +03:00
Ahmet Gedemenli	d19793c174	Add partitioning support for citus local tables Add/fix tests Fix creating partitions Add test for mx - partition creating case Enable cascading to partitioned tables Fix mx partition adding test Fix cascading through fkeys Style Disable converting with non-inherited fkeys Fix detach bug Early return in case of cascade & Add tests Style Fix undistribute_table bug & Fix test outputs Remove RemovePartitionRelationIds Test with undistribute_table Add test for mx+convert+undistribute Remove redundant usage of CreatePartitionedCitusLocalTable Add some comments Introduce bulk functions for generating attach/detach partition commands Fix: Convert partitioned tables after adding fkey Change the error message for partitions Introduce function ErrorIfPartitionTableAddedToMetadata Polish attach/detach command generation functions Use time_partitions for testing Move mx tests to citus_local_tables_mx Add new partitioned table to cascade test Add test with time series management UDFs Fix test output Fix: Assertion fail on relation access tracking Style Refactor creating partitioned citus local tables Remove CreatePartitionedCitusLocalTable Style Error out if converting multi-level table Revert some old tests Error out adding partitioned partition Polish Polish/address Fix create table partition of case Use CascadeOperationForRelationIdList if no cascade needed Fix create partition bug Revert / Add new tests to mx Style Fix dropping fkey bug Add test with IF NOT EXISTS Convert to CLT when doing ATTACH PARTITION Add comments Add more tests with time series management Edit the error message for converting the child Use OR instead of AND in ErrorIfUnsupportedAlterTableStmt Edit/improve tests Disable ddl prop when dropping default column definitions Disable/enable ddl prop just before/after the command Add comment Add sequence test Add trigger test Remove NeedCascadeViaForeignKeys Add one more insert to sequence test Add comment Style Fix test output shard ids Update comments Disable creating fkey on partitions Move partition check to CreateCitusLocalTable Add comment Add check for attachingmulti-level partition Add test for pg_constraint Check pg_dist_partition in tests Add test inserting on the worker	2021-10-11 10:45:07 +03:00
Halil Ozan Akgul	9c9d4b5eeb	Turn MX on by default	2021-10-08 18:17:21 +03:00
Naisila Puka	d0390af72d	Add fix_partition_shard_index_names udf to fix currently broken names (#5291 ) * Add udf to include shardId in broken partition shard index names * Address reviews: rename index such that operations can be done on it * More comprehensive index tests * Final touches and formatting	2021-10-07 19:34:52 +03:00
tejeswarm	a1604a87e6	Parition shards to be colocated with the parent shards	2021-09-22 14:47:04 -07:00
Burak Velioglu	c3895f35cd	Add helper UDFs for easy time partition management - get_missing_time_partition_ranges: Gets the ranges of missing partitions for the given table, interval and range unless any existing partition conflicts with calculated missing ranges. - create_time_partitions: Creates partitions by getting range values from get_missing_time_partition_ranges. - drop_old_time_partitions: Drops partitions of the table older than given threshold.	2021-09-03 23:03:13 +03:00
Sait Talha Nisanci	6b65dbc492	Add partition_wise_join to avoid big alternative output There was a small part in multi_partitioning that would need an alternative output for pg14. Instead of adding an alternative for the whole file, we created a new file, called partition_wise_join.sql and added the alternative output for that.	2021-09-03 15:41:28 +03:00
Naisila Puka	4fb05efabb	Distributes partition-to-be table before ProcessUtility (#5191 ) * Skip ALTER TABLE constraint checks while planning * Revert previous commit's solution, keep tests * Distribute partition-to-be table before ProcessUtility * Acquire locks in PreprocessAlterTableStmtAttachPartition	2021-09-02 13:07:42 +03:00
Naisila Puka	acb5ae6ab6	Skip dropping shards when we know it's a partition (#5176 )	2021-08-31 17:41:37 +03:00
Ahmet Gedemenli	0fbddc740d	Fix shard id difference for enterprise	2021-06-01 17:17:46 +03:00
Ahmet Gedemenli	69d39c0e8b	Fix relname null bug when parallel execution	2021-06-01 14:14:35 +03:00
Hanefi Onaldi	878513f325	Remove all occurences of replication_model GUC	2021-05-21 16:14:59 +03:00
Ahmet Gedemenli	bc818e76e2	Add notice log message for skipping child tables for optimization	2021-05-06 16:49:37 +03:00
Hanefi Onaldi	23a505d41f	Bump PG versions in CI (#4941 ) Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com> Co-authored-by: Sait Talha Nisanci <s.talhanisanci@gmail.com>	2021-05-03 13:51:20 +03:00
Marco Slot	4b49cb112f	Fix FROM ONLY queries on partitioned tables	2021-04-27 16:10:07 +02:00
Ahmet Gedemenli	332c5ce4ad	Fix worker partitioned size functions (#4922 )	2021-04-26 10:29:46 +03:00
Ahmet Gedemenli	33c620f232	Optimize partitioned disk size calculation (#4905 ) * Optimize partitioned disk size calculation * Polish * Fix test for citus_shard_cost_by_disk_size Try optimizing if not CSTORE	2021-04-19 13:30:56 +03:00
Hanefi Onaldi	9919fbe3f8	Switch to sequential mode on long partition names This commit adds support for long partition names for distributed tables: - ALTER TABLE dist_table ATTACH PARTITION .. - CREATE TABLE .. PARTITION OF dist_table .. Note: create_distributed_table UDF does not support long table and partition names, and is not covered in this commit	2021-04-14 15:27:50 +03:00
Ahmet Gedemenli	e445e3d39c	Introduce 3 partitioned size udfs (#4899 ) * Introduce 3 partitioned size udfs * Add tests for new partition size udfs * Fix type incompatibilities * Convert UDFs into pure sql functions * Fix function comment	2021-04-13 17:36:27 +03:00
Ahmet Gedemenli	52e467a9a0	Error out if inheriting a distributed table (#4871 ) * Error out if inheriting a distributed table * Add test inheriting a distirbuted table	2021-04-07 11:21:06 +03:00
Hanefi Önaldı	cab17afce9	Introduce UDFs for fixing partitioned table constraint names	2021-01-29 17:32:20 +03:00
Naisila Puka	7124a7715d	Skip 'already exists' in CREATE TABLE IF NOT EXISTS PARTITION OF (#4507 ) * Just skip 'already exists' in CT IF NOT EXISTS PARTITION OF * Generalize to tables that are not already distributed partitions	2021-01-18 15:56:02 +03:00
Marco Slot	e7f13978b5	Add a view for simple (time) partitions and their access methods	2021-01-08 11:28:15 +01:00
Halil Ozan Akgül	a8626d1944	Fixes the table used in the error message (#4449 )	2020-12-25 16:48:50 +03:00
Sait Talha Nisanci	5618f3a3fc	Use BaseRestrictInfo for finding equality columns Baseinfo also has pushed down filters etc, so it makes more sense to use BaseRestrictInfo to determine what columns have constant equality filters. Also RteIdentity is used for removing conversion candidates instead of rteIndex.	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	69992d58f9	Add broken local-dist table modifications tests It seems that most of the updates were broken, we weren't aware of it because there wasn't any data in the tables. They are broken mostly because local tables do not have a shard id and some code paths should be updated with that information, currently when there is an invalid shard id, it is assumed to be pruned. Consider local tables in router planner In case there is a local table, the shard id will not be valid and there are some checks that rely on shard id, we should skip these in case of local tables, which is handled with a dummy placement. Add citus local table dist table join tests add local-dist table mixed joins tests	2020-12-15 18:18:36 +03:00
Onur Tirtir	0556952607	Normalize partitioned table aliases in explain output (#4295 ) Aliases that postgres choose for partitioned tables in explain output might change in different pg versions, so normalize them and remove the alternative test output	2020-11-06 10:44:01 +03:00
Onur Tirtir	a58a4395ab	Extend citus local table utility command support This commit brings following features: Foreign key support from citus local tables to reference tables * Foreign key support from reference tables to citus local tables (only with RESTRICT & NO ACTION behavior) * ALTER TABLE ENABLE/DISABLE trigger command support * CREATE/DROP/ALTER trigger command support and disallows: * ALTER TABLE ATTACH/DETACH PARTITION commands * CREATE TABLE <postgres table> ATTACH PARTITION <citus local table> commands * Foreign keys from postgres tables to citus local tables (the other way was already disallowed) for citus local tables.	2020-09-09 11:50:55 +03:00
Sait Talha Nisanci	a34a1126ec	add alternative output for pg13 in some tests	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	01c23b0df2	update test outputs with task-tracker removal	2020-07-21 16:25:08 +03:00
Sait Talha Nisanci	1dbd545cf4	replace task-tracker with adaptive in tests	2020-07-21 16:21:01 +03:00
SaitTalhaNisanci	b3af63c8ce	Remove task tracker executor (#3850 ) * use adaptive executor even if task-tracker is set * Update check-multi-mx tests for adaptive executor Basically repartition joins are enabled where necessary. For parallel tests max adaptive executor pool size is decresed to 2, otherwise we would get too many clients error. * Update limit_intermediate_size test It seems that when we use adaptive executor instead of task tracker, we exceed the intermediate result size less in the test. Therefore updated the tests accordingly. * Update multi_router_planner It seems that there is one problem with multi_router_planner when we use adaptive executor, we should fix the following error: +ERROR: relation "authors_range_840010" does not exist +CONTEXT: while executing command on localhost:57637 * update repartition join tests for check-multi * update isolation tests for repartitioning * Error out if shard_replication_factor > 1 with repartitioning As we are removing the task tracker, we cannot switch to it if shard_replication_factor > 1. In that case, we simply error out. * Remove MULTI_EXECUTOR_TASK_TRACKER * Remove multi_task_tracker_executor Some utility methods are moved to task_execution_utils.c. * Remove task tracker protocol methods * Remove task_tracker.c methods * remove unused methods from multi_server_executor * fix style * remove task tracker specific tests from worker_schedule * comment out task tracker udf calls in tests We were using task tracker udfs to test permissions in multi_multiuser.sql. We should find some other way to test them, then we should remove the commented out task tracker calls. * remove task tracker test from follower schedule * remove task tracker tests from multi mx schedule * Remove task-tracker specific functions from worker functions * remove multi task tracker extra schedule * Remove unused methods from multi physical planner * remove task_executor_type related things in tests * remove LoadTuplesIntoTupleStore * Do initial cleanup for repartition leftovers During startup, task tracker would call TrackerCleanupJobDirectories and TrackerCleanupJobSchemas to clean up leftover directories and job schemas. With adaptive executor, while doing repartitions it is possible to leak these things as well. We don't retry cleanups, so it is possible to have leftover in case of errors. TrackerCleanupJobDirectories is renamed as RepartitionCleanupJobDirectories since it is repartition specific now, however TrackerCleanupJobSchemas cannot be used currently because it is task tracker specific. The thing is that this function is a no-op currently. We should add cleaning up intermediate schemas to DoInitialCleanup method when that problem is solved(We might want to solve it in this PR as well) * Revert "remove task tracker tests from multi mx schedule" This reverts commit `03ecc0a681`. * update multi mx repartition parallel tests * not error with task_tracker_conninfo_cache_invalidate * not run 4 repartition queries in parallel It seems that when we run 4 repartition queries in parallel we get too many clients error on CI even though we don't get it locally. Our guess is that, it is because we open/close many connections without doing some work and postgres has some delay to close the connections. Hence even though connections are removed from the pg_stat_activity, they might still not be closed. If the above assumption is correct, it is unlikely for it to happen in practice because: - There is some network latency in clusters, so this leaves some times for connections to be able to close - Repartition joins return some data and that also leaves some time for connections to be fully closed. As we don't get this error in our local, we currently assume that it is not a bug. Ideally this wouldn't happen when we get rid of the task-tracker repartition methods because they don't do any pruning and might be opening more connections than necessary. If this still gives us "too many clients" error, we can try to increase the max_connections in our test suite(which is 100 by default). Also there are different places where this error is given in postgres, but adding some backtrace it seems that we get this from ProcessStartupPacket. The backtraces can be found in this link: https://circleci.com/gh/citusdata/citus/138702 * Set distributePlan->relationIdList when it is needed It seems that we were setting the distributedPlan->relationIdList after JobExecutorType is called, which would choose task-tracker if replication factor > 1 and there is a repartition query. However, it uses relationIdList to decide if the query has a repartition query, and since it was not set yet, it would always think it is not a repartition query and would choose adaptive executor when it should choose task-tracker. * use adaptive executor even with shard_replication_factor > 1 It seems that we were already using adaptive executor when replication_factor > 1. So this commit removes the check. * remove multi_resowner.c and deprecate some settings * remove TaskExecution related leftovers * change deprecated API error message * not recursively plan single relatition repartition subquery * recursively plan single relation repartition subquery * test depreceated task tracker functions * fix overlapping shard intervals in range-distributed test * fix error message for citus_metadata_container * drop task-tracker deprecated functions * put the implemantation back to worker_cleanup_job_schema_cachesince citus cloud uses it * drop some functions, add downgrade script Some deprecated functions are dropped. Downgrade script is added. Some gucs are deprecated. A new guc for repartition joins bucket size is added. * order by a test to fix flappiness	2020-07-18 13:11:36 +03:00
SaitTalhaNisanci	164c00cf08	Fix typo: longer visible -> no longer visible (#3803 )	2020-04-27 16:32:46 +03:00
Hanefi Önaldı	0c5d0cfee9	Notice message to help truncate local data after distribution	2020-04-17 13:21:34 +03:00
Marco Slot	5b1d1dd413	Remove unnecessary use of max_parallel_workers_per_gather	2020-03-06 13:18:58 +01:00
Marco Slot	d0fead6691	Disable Postgres parallelism by default in tests	2020-03-06 13:18:58 +01:00
Jelte Fennema	e76281500c	Replace shardId lock with lock on colocation+shardIntervalIndex (#3374 ) This new locking pattern makes sure that some deadlocks that could happend during rebalancing cannot occur anymore.	2020-01-16 13:14:01 +01:00
Jelte Fennema	0c6983a80e	Normalize tests: pg12 changes	2020-01-06 09:32:03 +01:00
Jelte Fennema	7730bd449c	Normalize tests: Remove trailing whitespace	2020-01-06 09:32:03 +01:00

1 2

78 Commits (321fcfcdb5cefc02feb0f13e760211511c7d91c1)