citus

Commit Graph

Author	SHA1	Message	Date
Jelte Fennema	abffa6c3b9	Use shard split copy code for blocking shard moves (#6098 ) The new shard copy code that was created for shard splits has some advantages over the old shard copy code. The old code was using worker_append_table_to_shard, which wrote to disk twice. And it also didn't use binary copy when that was possible. Both of these issues were fixed in the new copy code. This PR starts using this new copy logic also for shard moves, not just for shard splits. On my local machine I created a single shard table like this. ```sql set citus.shard_count = 1; create table t(id bigint, a bigint); select create_distributed_table('t', 'id'); INSERT into t(id, a) SELECT i, i from generate_series(1, 100000000) i; ``` I then turned `fsync` off to make sure I wasn't bottlenecked by disk. Finally I moved this shard between nodes with `citus_move_shard_placement` with `block_writes`. Before this PR a move took ~127s, after this PR it took only ~38s. So for this small test this resulted in spending ~70% less time. And I also tried the same test for a table that contained large strings: ```sql set citus.shard_count = 1; create table t(id bigint, a bigint, content text); select create_distributed_table('t', 'id'); INSERT into t(id, a, content) SELECT i, i, 'aunethautnehoautnheaotnuhetnohueoutnehotnuhetncouhaeohuaeochgrhgd.athbetndairgexdbuhaobulrhdbaetoausnetohuracehousncaoehuesousnaceohuenacouhancoexdaseohusnaetobuetnoduhasneouhaceohusnaoetcuhmsnaetohuacoeuhebtokteaoshetouhsanetouhaoug.lcuahesonuthaseauhcoerhuaoecuh.lg;rcydabsnetabuesabhenth' from generate_series(1, 20000000) i; ```	2022-08-01 20:10:36 +03:00
Naisila Puka	85324f3acc	Clean up multi_shard_commit_protocol guc leftovers (#6110 )	2022-08-01 15:22:02 +03:00
Onder Kalaci	149771792b	Remove useless version compats most likely leftover from earlier versions	2022-07-29 10:31:55 +02:00
Onder Kalaci	5bc8a81aa7	Add colocation checks for shard splits	2022-07-27 10:01:19 +02:00
Onder Kalaci	12fa3aaf6b	Concurrent shard move/copy and colocated table creation fix It turns out that create_distributed_table and citus_move/copy_shard_placement does not work well concurrently. To fix that, we need to acquire a lock, which sounds like a good use of colocation lock. However, the current usage of colocation lock is limited to higher level UDFs like rebalance_table_shards etc. Those usage of lock is still useful, but we cannot acquire the same lock on citus_move_shard_placement etc. because the coordinator connects to itself to acquire the lock. Hence, the high level UDF blocks itself. To fix that, we use one more colocation lock, with the placements are the main objects to consider.	2022-07-27 10:01:19 +02:00
Nitish Upreti	3d569cc49a	Shard Split support for Columnar and Partitioned Table (#6067 ) DESCRIPTION: This PR extends support for Partitioned and Columnar tables in blocking 'citus_split_shard_by_split_points' workflow. Columnar Support : No special handling required. Just removing checks that fails split for columnar table and adding test coverage. Partitioned Table Support : Skip copying of parent table as they are empty, The partitions contain data and are treated as co-located shards that will be copied separately. Attach partitions to parent on destination after inserting new shard metadata and before creating foreign key constraints. MISC: Fix Bug #4949 where Blocking shard moves fails if there is a foreign key between partitioned distributed tables (from child to parent). TEST: Added new test 'citus_split_shards_columnar_partitioned' for splitting 'partitioned' and 'columnar + partitioned' table. Added new test 'shard_move_constraints_blocking' to add coverage for shard move bug fix. Updated test 'citus_split_shard_by_split_points_negative' to allow columnar and partitioned table.	2022-07-20 12:24:50 -07:00
Naisila Puka	7d6410c838	Drop postgres 12 support (#6040 ) * Remove if conditions with PG_VERSION_NUM < 13 * Remove server_above_twelve(&eleven) checks from tests * Fix tests * Remove pg12 and pg11 alternative test output files * Remove pg12 specific normalization rules * Some more if conditions in the code * Change RemoteCollationIdExpression and some pg12/pg13 comments * Remove some more normalization rules	2022-07-20 17:49:36 +03:00
aykutbozkurt	ebb6d1c8c0	refactor code where GetObjectAddressFromParseTree is called because it returns list of addresses now	2022-07-19 18:13:12 +03:00
Nitish Upreti	5b3537cdff	Shard Split for Citus (#6029 ) * Blocking split setup * Add missing type * Missing API from Metadata Sync * Shard Split e2e code * Worker Split Copy DestReceiver skeleton * Basic destreceiver code * worker_split_copy UDF * UDF calling * Split points are text * Isolate Tenant and Split Shard Unification * Fixing executor and misc * Reindent code * Fixing UDF definitions * Hello World Local Copy works * Remote copy hello world works * Local and Remote binary test * Fixing text local copy and adding tests * Hello World shard split works * Negative tests * Blocking Split workflow works * Refactor * Bug fix * Reindent * Cleaning up and adding comments * Basic test for shard split workflow * ReIndent * Circle CI integration * Removing include causing circle-ci build failure * Remove SplitCopyDestReceiver and use PartitionedResultDestReceiver * Add support for citus.enable_binary_protocol * Reindent * Fix build break * Update Test * Cleanup on catch * Addressing open comments * Update downgrade script and quote schema/table in COPY statement * Fix metadata sync issue. Update regression test * Isolation test and bug fix * Add Isolation test, fix foreign constraint deadlock issue * Misc code review comments * Test name needing to be quoted * Refactor code from review comments * Explaining shardGroupSplitIntervalListList * Fix upgrade & downgrade * Fix broken test * Test fix Round 2 * Fixing bug and modifying test appropriately * Fully qualify copy udf name. Run Reindent * Address PR comments * Fix null handling when creating AuxiliaryStructures * Ensure local copy is triggered in tests * Limit max shards that can be created with split * Test failure fix * Remove split_mode and use shard_transfer_mode instead' * Fix test failure * Fix test failure * Fixing permission issue when splitting non-superuser owned tables * Fix test expected output * Remove extra space * Fix test * attempt to fix test * Addressing Marco's PR comment * Only clean shards created by workflow * Remove from merge * Update test	2022-07-18 02:54:15 -07:00
Onder Kalaci	b2e9a5baf1	Make sure citus_is_coordinator works on read replicas	2022-07-13 14:11:18 +02:00
Jelte Fennema	184c7c0bce	Make enterprise features open source (#6008 ) This PR makes all of the features open source that were previously only available in Citus Enterprise. Features that this adds: 1. Non blocking shard moves/shard rebalancer (`citus.logical_replication_timeout`) 2. Propagation of CREATE/DROP/ALTER ROLE statements 3. Propagation of GRANT statements 4. Propagation of CLUSTER statements 5. Propagation of ALTER DATABASE ... OWNER TO ... 6. Optimization for COPY when loading JSON to avoid double parsing of the JSON object (`citus.skip_jsonb_validation_in_copy`) 7. Support for row level security 8. Support for `pg_dist_authinfo`, which allows storing different authentication options for different users, e.g. you can store passwords or certificates here. 9. Support for `pg_dist_poolinfo`, which allows using connection poolers in between coordinator and workers 10. Tracking distributed query execution times using citus_stat_statements (`citus.stat_statements_max`, `citus.stat_statements_purge_interval`, `citus.stat_statements_track`). This is disabled by default. 11. Blocking tenant_isolation 12. Support for `sslkey` and `sslcert` in `citus.node_conninfo`	2022-06-16 00:23:46 -07:00
jeff-davis	a9f8a60007	Columnar: support relation options with ALTER TABLE. (#5935 ) Columnar: support relation options with ALTER TABLE. Use ALTER TABLE ... SET/RESET to specify relation options rather than alter_columnar_table_set() and alter_columnar_table_reset(). Not only is this more ergonomic, but it also allows better integration because it can be treated like DDL on a regular table. For instance, citus can use its own ProcessUtility_hook to distribute the new settings to the shards. DESCRIPTION: Columnar: support relation options with ALTER TABLE.	2022-05-20 08:35:00 -07:00
Marco Slot	ad5214b50c	Allow distributed execution from run_command_on_* functions	2022-05-20 15:26:47 +02:00
Onder Kalaci	b4dbd84743	Prevent distributed queries while disabling first worker node First worker node has a special meaning for modifications on the replicated tables It is used to acquire a remote lock, such that the modifications are serialized. With this commit, we make sure that we do not let any distributed query to see a different 'first worker node' while first worker node is disabled. Note that, maybe implicitly mentioned above, when first worker node is disabled, the first worker node changes, that's why we have to handle the situation.	2022-05-18 17:21:12 +02:00
Jeff Davis	26f5e20580	PG15: update integer parsing APIs. Account for PG commits 3c6f8c011f and cfc7191dfe.	2022-05-02 10:12:03 -07:00
Marco Slot	9476f377b5	Remove old re-partitioning functions	2022-04-04 18:11:52 +02:00
jeff-davis	c485a04139	Separate build of citus.so and citus_columnar.so. (#5805 ) * Separate build of citus.so and citus_columnar.so. Because columnar code is statically-linked to both modules, it doesn't make sense to load them both at once. A subsequent commit will make the modules entirely separate and allow loading them both simultaneously. Author: Yanwen Jin * Separate citus and citus_columnar modules. Now the modules are independent. Columnar can be loaded by itself, or along with citus. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2022-03-31 19:47:17 -07:00
Gledis Zeneli	b825232ecb	Handle rebalance / replication when a node is disabled (Fix #5664 ) (#5729 ) The issue in question is caused when rebalance / replication call `FullShardPlacementList` which returns all shard placements (including those in disabled nodes with `citus_disable_node`). Eventually, `FindFillStateForPlacement` looks for the state across active workers and fails to find a state for the placements which are in the disabled workers causing a seg fault shortly after. Approach: * `ActivePlacementHash` was not using the status of the shard placement's node to determine if the node it is active. Initially, I just fixed that. * Additionally, I refactored the code which handles active shards in replication / rebalance to: * use a single function to determine if a shard placement is active. * do the shard active shard filtering before calling `RebalancePlacementUpdates` and `ReplicationPlacementUpdates`, so test methods like `shard_placement_rebalance_array` and `shard_placement_replication_array` which have different shard placement active requirements can do their own filtering while using the same rebalance / replicate logic that `rebalance_table_shards` and `replicate_table_shards` use. Fix #5664	2022-02-25 19:54:30 +03:00
Halil Ozan Akgul	f6cd4d0f07	Overrides pg_cancel_backend and pg_terminate_backend to accept global pid	2022-02-21 16:41:35 +03:00
Marco Slot	872f0a79db	Remove random shard placement policy	2022-02-06 21:55:58 +01:00
Marco Slot	0cae8e7d6b	Remove local-node-first shard placement	2022-02-06 21:36:34 +01:00
Onder Kalaci	72d7d92611	Apply code review feedback	2022-02-04 10:52:57 +01:00
Teja Mupparti	f31bce5b48	Fixes the issue seen in https://github.com/citusdata/citus-enterprise/issues/745 With this commit, rebalancer backends are identified by application_name = citus_rebalancer and the regular internal backends are identified by application_name = citus_internal	2022-02-03 09:40:46 -08:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Hanefi Onaldi	9edfbe7718	Fix the default value for DeferShardDeleteOnMove The default for GUC citus.defer_drop_after_shard_move is true. However we initialize the global variable with a false value.	2022-01-07 11:01:49 +03:00
Ahmet Gedemenli	45e423136c	Support foreign tables in MX (#5461 )	2022-01-06 18:50:34 +03:00
jeff-davis	2e03efd91e	Columnar: move DDL hooks to citus to remove dependency. (#5547 ) Add a new hook ColumnarTableSetOptions_hook so that citus can get control when the columnar table options change.	2022-01-04 23:26:46 -08:00
jeff-davis	c9292cfad1	Make pg_version_compat.h and listutils.c dependency-free. (#5548 ) Split distributed/version_compat.h into dependency-free pg_version_compat.h, and the original which still has dependencies. The original doesn't have much purpose, but until other files have better discipline about including the correct header files, then it's still needed. Also make distributed/listutils.h dependency-free. Should be moved outside of 'distributed' subdirectory, but that will cause significant code churn, so leave for another cleanup patch. Now both files can be included in columnar without creating a dependency on citus.	2022-01-04 23:02:08 -08:00
Hanefi Onaldi	9d4d73898a	Move healthcheck logic into new file (#5531 ) and add a missing `CheckCitusVersion(ERROR)` call	2021-12-15 15:58:20 -08:00
Hanefi Onaldi	29e4516642	Introduce citus_check_cluster_node_health UDF This UDF coordinates connectivity checks accross the whole cluster. This UDF gets the list of active readable nodes in the cluster, and coordinates all connectivity checks in sequential order. The algorithm is: for sourceNode in activeReadableWorkerList: c = connectToNode(sourceNode) for targetNode in activeReadableWorkerList: result = c.execute( "SELECT citus_check_connection_to_node(targetNode.name, targetNode.port") emit sourceNode.name, sourceNode.port, targetNode.name, targetNode.port, result - result -> true -> connection attempt from source to target succeeded - result -> false -> connection attempt from source to target failed - result -> NULL -> connection attempt from the current node to source node failed I suggest you use the following query to get an overview on the connectivity: SELECT bool_and(COALESCE(result, false)) FROM citus_check_cluster_node_health(); Whenever this query returns false, there is a connectivity issue, check in detail.	2021-12-15 01:41:51 +03:00
Hanefi Onaldi	13fff9c37a	Remove NOOP tuplestore_donestoring calls PostgreSQL does not need calling this function since 7.4 release, and it is a NOOP. For more details, check PostgreSQL commit below : commit dd04e958c8b03c0f0512497651678c7816af3198 Author: Tom Lane <tgl@sss.pgh.pa.us> Date: Sun Mar 9 03:34:10 2003 +0000 tuplestore_donestoring() isn't needed anymore, but provide a no-op macro definition so as not to create compatibility problems. diff --git a/src/include/utils/tuplestore.h b/src/include/utils/tuplestore.h index b46babacd1..76fe9fb428 100644 --- a/src/include/utils/tuplestore.h +++ b/src/include/utils/tuplestore.h @@ -17,7 +17,7 @@ * Portions Copyright (c) 1996-2002, PostgreSQL Global Development Group * Portions Copyright (c) 1994, Regents of the University of California * - * $Id: tuplestore.h,v 1.8 2003/03/09 02:19:13 tgl Exp $ + * $Id: tuplestore.h,v 1.9 2003/03/09 03:34:10 tgl Exp $ * ------------------------------------------------------------------------- / @@ -41,6 +41,9 @@ extern Tuplestorestate tuplestore_begin_heap(bool randomAccess, extern void tuplestore_puttuple(Tuplestorestate state, void tuple); +/ tuplestore_donestoring() used to be required, but is no longer used / +#define tuplestore_donestoring(state) ((void) 0) + / backwards scan is only allowed if randomAccess was specified 'true' / extern void tuplestore_gettuple(Tuplestorestate state, bool forward, bool should_free);	2021-12-14 18:55:02 +03:00
Hanefi Onaldi	56e9b1b968	Introduce UDF to check worker connectivity citus_check_connection_to_node runs a simple query on a remote node and reports whether this attempt was successful. This UDF will be used to make sure each worker node can connect to all the worker nodes in the cluster. parameters: nodename: required nodeport: optional (default: 5432) return value: boolean success	2021-12-03 02:30:28 +03:00
Marco Slot	56eae48daf	Stop updating shard range in citus_update_shard_statistics	2021-11-19 10:51:15 +01:00
Hanefi Onaldi	c0d43d4905	Prevent cache usage on citus_drop_trigger codepaths	2021-11-18 20:24:51 +03:00
Marco Slot	9e6ca23286	Remove cstore_fdw-related logic	2021-11-16 13:59:03 +01:00
Marco Slot	78866df13c	Remove master_append_table_to_shard UDF	2021-11-08 10:43:24 +01:00
Philip Dubé	cc50682158	Fix typos. Spurred spotting "connectios" in logs	2021-10-25 13:54:09 +00:00
Onder Kalaci	575bb6dde9	Drop support for Inactive Shard placements Given that we do all operations via 2PC, there is no way for any placement to be marked as INACTIVE.	2021-10-22 18:03:35 +02:00
Önder Kalacı	b3299de81c	Drop support for citus.multi_shard_commit_protocol (#5380 ) In the past, we allowed users to manually switch to 1PC (e.g., one phase commit). However, with this commit, we don't. All multi-shard modifications are done via 2PC.	2021-10-21 14:01:28 +02:00
Marco Slot	dafba6c242	Deprecate master_get_table_metadata UDF	2021-10-21 12:08:05 +02:00
Marco Slot	096660d61d	Remove master_apply_delete_command	2021-10-18 22:29:37 +02:00
Halil Ozan Akgul	43d5853b6d	Fixes function names in comments	2021-10-06 09:24:43 +03:00
Naisila Puka	a69abe3be0	Fixes bug about int and smallint sequences on MX (#5254 ) * Introduce worker_nextval udf for int&smallint column defaults * Fix current tests and add new ones for worker_nextval	2021-09-09 23:41:07 +03:00
Sait Talha Nisanci	0b67fcf81d	Fix style	2021-09-03 16:09:59 +03:00
Halil Ozan Akgul	b01e7e884c	Pass NULL for plannerInfo as we don't generate PlaceHolderVars	2021-09-03 15:27:25 +03:00
Naisila Puka	acb5ae6ab6	Skip dropping shards when we know it's a partition (#5176 )	2021-08-31 17:41:37 +03:00
Jelte Fennema	481f8be084	Fix crash in shard rebalancer when no distributed tables exist (#5205 ) The logging of the amount of ignored moves crashed when no distributed tables existed in a cluster. This also fixes in passing that the logging of ignored moves logs the correct number of ignored moves if there exist multiple colocation groups and all are rebalanced at the same time.	2021-08-31 14:15:24 +02:00
Onur Tirtir	549ca4de6d	Use RelationGetIndexList instead of scanning pg_index	2021-08-18 17:50:57 +03:00
Naisila Puka	e5b32b2c3c	Acquire AccessShareLock before updating table statistics (#5155 )	2021-08-12 13:58:15 +03:00
Onder Kalaci	5f02d18ef8	transactional metadata sync for maintanince daemon As we use the current user to sync the metadata to the nodes with #5105 (and many other PRs), there is no reason that prevents us to use the coordinated transaction for metadata syncing. This commit also renames few functions to reflect their actual implementation.	2021-08-09 10:34:55 +02:00

1 2 3

126 Commits (3ddc0896510db59d339e7404094af588fb670edf)