citus

Commit Graph

Author	SHA1	Message	Date
Onur Tirtir	a830862717	Not undistribute Citus local table when converting it to a reference table / single-shard table	2023-08-29 12:57:28 +03:00
Emel Şimşek	3fda2c3254	Change test files in multi and multi-1 schedules to accommodate coordinator in the metadata. (#6939 ) Changes test files in multi and multi-1 schedules such that they accomodate coordinator in metadata. Changes fall into the following buckets: 1. When coordinator is in metadata, reference table shards are present in coordinator too. This changes test outputs checking the table size, shard numbers etc. for reference tables. 2. When coordinator is in metadata, postgres tables are converted to citus local tables whenever a foreign key relationship to them is created. This changes some test cases which tests it should not be possible to create foreign keys to postgres tables. 3. Remove lines that add/remove coordinator for testing purposes.	2023-06-05 10:37:48 +03:00
Marco Slot	666696c01c	Deprecate citus.replicate_reference_tables_on_activate, make it always off (#6474 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-04 16:21:10 +01:00
Marco Slot	b79111527e	Avoid blocking writes in create_distributed_table_concurrently (#6324 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 12:09:37 -07:00
Marco Slot	ba2fe3e3c4	Remove do_repair option from citus_copy_shard_placement (#6299 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-09 15:44:30 +02:00
Marco Slot	e6b1845931	Change split logic to avoid EnsureReferenceTablesExistOnAllNodesExtended (#6208 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-05 22:02:18 +02:00
Jelte Fennema	5c0205ce10	Fix flakyness in multi_replicate_reference_table (#6235 ) In CI multi_replicate_reference_table would sometimes fail like this: ```diff -- detects correctly that referecence table doesn't have replica identity SELECT replicate_reference_tables(); -ERROR: cannot use logical replication to transfer shards of the relation initially_not_replicated_reference_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY +ERROR: cannot use logical replication to transfer shards of the relation ref_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY DETAIL: UPDATE and DELETE commands on the shard will error out during logical replication unless there is a REPLICA IDENTITY or PRIMARY KEY. HINT: If you wish to continue without a replica identity set the shard_transfer_mode to 'force_logical' or 'block_writes'. ``` Because `CitusTableTypeIdList` returns tables in heap order so it's a bit random which one is first in the list. And the test contained multiple tables that didn't have a primary key or replica identity. So it made sense that the error could be for either one of these tables. This PR makes the test output consistent by changing one of the tables to have a primary key. Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26387/workflows/fc3196e7-ddf2-4000-a70b-5ac71c836321/jobs/748940	2022-08-24 13:34:10 +03:00
Marco Slot	bad8196da3	Verify that we can replicate reference tables using rebalancer (#6232 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-24 00:34:21 +02:00
Jelte Fennema	8017693b2f	Allow specifying the shard_transfer_mode when replicating reference tables (#6070 ) When using `citus.replicate_reference_tables_on_activate = off`, reference tables need to be replicated later. This can be done using the `replicate_reference_tables()` UDF. However, this function only allowed blocking replication. This changes the function to default to logical replication instead, and allows choosing any of our existing shard transfer modes.	2022-08-09 13:21:31 +03:00
Marco Slot	3b57ff2867	Fix crash in citus_copy_shard_placement	2022-08-09 09:31:05 +02:00
Marco Slot	055bbd6212	Use coordinated transaction when there are multiple queries per task	2022-03-18 15:04:27 +01:00
Onder Kalaci	650243927c	Relax some transactional limications on activate node We already enforce EnsureSequentialModeMetadataOperations(), and given that all activate node is transaction, we should be fine	2022-02-01 15:56:55 +01:00
Halil Ozan Akgul	df8d0f3db1	Turn metadata sync on in multi_replicate_reference_table and multi_citus_tools	2021-12-17 10:25:57 +03:00
Ahmet Gedemenli	8bae58fdb7	Add parameter to cleanup metadata (#5055 ) * Add parameter to cleanup metadata * Set clear metadata default to true * Add test for clearing metadata * Separate test file for start/stop metadata syncing * Fix stop_sync bug for secondary nodes * Use PreventInTransactionBlock * DRemovedebuggiing logs * Remove relation not found logs from mx test * Revert localGroupId when doing stop_sync * Move metadata sync test to mx schedule * Add test with name that needs to be quoted * Add test for views and matviews * Add test for distributed table with custom type * Add comments to test * Add test with stats, indexes and constraints * Fix matview test * Add test for dropped column * Add notice messages to stop_metadata_sync * Add coordinator check to stop metadat sync * Revert local_group_id only if clearMetadata is true * Add a final check to see the metadata is sane * Remove the drop verbosity in test * Remove table description tests from sync test * Add stop sync to coordinator test * Change the order in stop_sync * Add test for hybrid (columnar+heap) partitioned table * Change error to notice for stop sync to coordinator * Sync at the end of the test to prevent any failures * Add test case in a transaction block * Remove relation not found tests	2021-07-01 16:23:53 +03:00
Hanefi Onaldi	878513f325	Remove all occurences of replication_model GUC	2021-05-21 16:14:59 +03:00
Marco Slot	d900a7336e	Automatically add placeholder record for coordinator	2021-01-08 15:09:53 +01:00
Marco Slot	e3dcc278e0	Remove upgrade_to_reference_table UDF	2020-12-23 00:40:14 +01:00
Ahmet Gedemenli	936775e8e3	Delete transactions when removing node With this commit, we delete entries in pg_dist_transaction for the primary nodes that are removed by `master_remove_node`.	2020-12-07 11:35:20 +03:00
Onder Kalaci	e182215d96	Improve connection error message from the worker nodes We currently put the actual error message to the detail part. However, many drivers don't show detail part. As connection errors are somehow common, and hard to trace back, can't we added the detail to the message itself. In addition to that, we changed "connection error" message, as it was confusing to the users who think that the error was happening while connecting to the coordinator. In fact, this error is showing up when the coordinator fails to connect remote nodes.	2020-04-20 13:32:55 +02:00
Hadi Moshayedi	1250d691d3	Replicate reference tables before master_create_empty_shard	2020-04-17 16:47:03 -07:00
Hadi Moshayedi	59b9a4e5a1	Detect deadlocks in replicate_reference_tables()	2020-04-15 11:06:18 -07:00
Hadi Moshayedi	f9de734329	Ensure metadata is synced on ReplicateColocatedShardPlacement	2020-04-13 11:45:21 -07:00
Hadi Moshayedi	dda53a0bba	GUC for replicate reference tables on activate.	2020-04-08 12:42:45 -07:00
Hadi Moshayedi	c168a53ebc	Tests for replicate_reference_tables	2020-04-08 12:41:36 -07:00
Hadi Moshayedi	acfa850c38	Make multi_replicate_reference_table check-base friendly	2020-04-08 12:41:36 -07:00
Marco Slot	924cd7343a	Defer reference table replication to shard creation time	2020-04-08 12:41:36 -07:00
SaitTalhaNisanci	39bbec0f30	add tests for local copy execution	2020-03-18 09:28:59 +03:00
Jelte Fennema	7730bd449c	Normalize tests: Remove trailing whitespace	2020-01-06 09:32:03 +01:00
Jelte Fennema	7f3de68b0d	Normalize tests: header separator length	2020-01-06 09:32:03 +01:00
Jelte Fennema	7b833466ba	Normalize tests: s/shard [0-9]+/shard xxxxx/g	2020-01-03 11:44:30 +01:00
Jelte Fennema	8c5c0dd74c	Normalize tests: s/localhost:[0-9]+/localhost:xxxxx/g	2020-01-03 11:40:50 +01:00
Philip Dubé	fcf2fd819b	Add distributioncolumncollation to to pg_dist_colocation Use partition column's collation for range distributed tables Don't allow non deterministic collations for hash distributed tables CoPartitionedTables: don't compare unequal types	2019-12-09 19:51:40 +00:00
Hadi Moshayedi	15af1637aa	Replicate reference tables to coordinator.	2019-11-15 05:50:19 -08:00
Hadi Moshayedi	e00d1546f3	Don't maintain replicationfactor of reference tables	2019-11-05 07:23:14 -08:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used by how schema's get created right before we are going to create a table in that schema, for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is benificial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is; you have an existing postgres server that needs to scale out. By adding the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Lets assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 AS (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schema's on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types are depending them self on other objects they will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types, when later asked for the ddl commands for array types we return `NIL` (empty list) which makes that the object will not be recorded as distributed, (its an internal type, dependant on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after running locally makes that we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (eg. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulate changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulate `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. Enum types can not be changed transactionally. When the execution on a worker fails a warning will be shown to the user the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Hadi Moshayedi	a5b087c89b	Support FKs between reference tables	2019-08-21 16:11:27 -07:00
Philip Dubé	cd951fa9ca	Avoid multiple pg_dist_colocation records being created for reference tables master_deactivate_node is updated to decrement the replication factor Otherwise deactivation could have create_reference_table produce a second record UpdateColocationGroupReplicationFactor is renamed UpdateColocationGroupReplicationFactorForReferenceTables & the implementation looks up the record based on distributioncolumntype == InvalidOid, rather than by id Otherwise the record's replication factor fails to be maintained when there are no reference tables	2019-08-13 17:21:02 +00:00
mehmet furkan şahin	785a86ed0a	Tests are updated to use create_distributed_table	2018-05-10 11:18:59 +03:00
mehmet furkan şahin	ef90122cd3	shard count for some of the tests are increased	2018-05-03 10:44:43 +03:00
Marco Slot	3d3c19a717	Improve messages for essential connection failures	2018-04-26 12:58:47 -06:00
Marco Slot	f4ceea5a3d	Enable 2PC by default	2017-11-22 11:26:58 +01:00
Marco Slot	89eb833375	Use citus.next_shard_id where practical in regression tests	2017-11-15 10:12:05 +01:00
metdos	c83edc36b5	Check connection status before using it	2017-11-06 14:53:35 +02:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
Brian Cloutier	74ce4faab5	Make multi_cluster_management test more stable	2017-08-08 11:18:31 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Marco Slot	f838c83809	Remove redundant pg_dist_jobid_seq restarts in tests	2017-04-18 11:42:32 +02:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Burak Yucesoy	d80e7849a4	Convert DropShards to use new connection API With this change DropShards function started to use new connection API. DropShards function is used by DROP TABLE, master_drop_all_shards and master_apply_delete_command, therefore all of these functions now support transactional operations. In DropShards function, if we cannot reach a node, we mark shard state of related placements as FILE_TO_DELETE and continue to drop remaining shards; however if any error occurs after establishing the connection, we ROLLBACK whole operation.	2017-01-23 21:08:41 +03:00
Jason Petersen	56197dbdba	Add replication_model GUC This adds a replication_model GUC which is used as the replication model for any new distributed table that is not a reference table. With this change, tables with replication factor 1 are no longer implicitly MX tables. The GUC is similarly respected during empty shard creation for e.g. existing append-partitioned tables. If the model is set to streaming while replication factor is greater than one, table and shard creation routines will error until this invalid combination is corrected. Changing this parameter requires superuser permissions.	2017-01-23 09:05:14 -07:00

1 2

54 Commits (af88c37b56ea2dcfedfb14e1cff3312a4cace369)