citus

Commit Graph

Author	SHA1	Message	Date
gindibay	a1ab60f5ad	Merge branch 'main' into alter_database_additional_options	2023-11-20 14:40:52 +03:00
Gürkan İndibay	3b556cb5ed	Adds create / drop database propagation support (#7240 ) DESCRIPTION: Adds support for propagating `CREATE`/`DROP` database In this PR, create and drop database support is added. For CREATE DATABASE: * "oid" option is not supported * specifying "strategy" to be different than "wal_log" is not supported * specifying "template" to be different than "template1" is not supported The last two are because those are not saved in `pg_database` and when activating a node, we cannot assume what parameters were provided when creating the database. And "oid" is not supported because whether user specified an arbitrary oid when creating the database is not saved in pg_database and we want to avoid from oid collisions that might arise from attempting to use an auto-assigned oid on workers. Finally, in case of node activation, GRANTs for the database are also propagated. --------- Co-authored-by: Jelte Fennema-Nio <github-tech@jeltef.nl> Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-21 16:43:51 +03:00
gindibay	8443ae36e7	Fixes indentation	2023-11-16 02:59:32 +03:00
gindibay	95572c5ada	Adds EnableCreateDatabasePropagation flag check	2023-11-16 02:58:56 +03:00
gindibay	a68587391c	Fixes indentation	2023-11-15 21:32:28 +03:00
gindibay	772a13dc7c	Removes SendDatabaseGrantSyncCommands	2023-11-15 21:32:01 +03:00
gindibay	9b6511ad10	Fixes indentation	2023-11-15 20:57:27 +03:00
gindibay	144ede3bf0	Adds GUC check for db grant sync	2023-11-15 20:45:13 +03:00
gindibay	6f838e6e88	Fixes indentation	2023-11-15 16:34:02 +03:00
gindibay	9a558bdece	Adds datacl propagation	2023-11-15 16:04:26 +03:00
Gürkan İndibay	32df3313d4	Update src/backend/distributed/metadata/metadata_sync.c Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-13 18:59:36 +03:00
gindibay	fcdea98edd	Removes drop in citus_internal_db_command udf	2023-11-13 15:58:09 +03:00
Onur Tirtir	2ca6e2c358	indent	2023-11-13 11:43:31 +03:00
Onur Tirtir	5b446b1137	make tests passing	2023-11-13 11:27:29 +03:00
Onur Tirtir	240313e286	Support role commands from any node (#7278 ) DESCRIPTION: Adds support from issuing role management commands from worker nodes It's unlikely to get into a distributed deadlock with role commands, we don't care much about them at the moment. There were several attempts to reduce the chances of a deadlock but we didn't any of them merged into main branch yet, see: #7325 #7016 #7009	2023-11-10 09:58:51 +00:00
gindibay	f8b3f322aa	Fixed review items	2023-11-10 08:33:51 +03:00
Gürkan İndibay	3067d1ef08	Apply suggestions from code review Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-10 17:34:09 +03:00
gindibay	8df30a2a2b	Fixes EnsureCitusInitiatedOperation name change	2023-11-10 05:52:14 +03:00
Gürkan İndibay	0a73cb31b0	Merge branch 'main' into create_alter_database	2023-11-10 14:12:48 +03:00
gindibay	65660db10d	Fixes review items	2023-11-08 02:02:00 +03:00
gindibay	641e413676	Adds metadatasync for create database	2023-10-26 20:36:43 +03:00
Onur Tirtir	a830862717	Not undistribute Citus local table when converting it to a reference table / single-shard table	2023-08-29 12:57:28 +03:00
Naisila Puka	a282953274	Fix ScanKeyInit RegProcedure and Datum arguments (#7072 ) Index scans in PG16 return empty sets because of extra compatibility enforcement for `ScanKeyInit` arguments. Could be one of the relevant PG commits: `c8b2ef05f4` This PR fixes all incompatible `RegProcedure` and `Datum` arguments in all `ScanKeyInit` functions used throughout the codebase. Helpful for https://github.com/citusdata/citus/pull/6952	2023-07-21 14:11:10 +03:00
Naisila Puka	69af3e8509	Drop PG13 Support Phase 2 - Remove PG13 specific paths/tests (#7007 ) This commit is the second and last phase of dropping PG13 support. It consists of the following: - Removes all PG_VERSION_13 & PG_VERSION_14 from codepaths - Removes pg_version_compat entries and columnar_version_compat entries specific for PG13 - Removes alternative pg13 test outputs - Removes PG13 normalize lines and fix the test outputs based on that It is a continuation of `5bf163a27d`	2023-06-21 14:18:23 +03:00
Onur Tirtir	dbdf04e8ba	Rename pg_dist tenant_schema to pg_dist_schema (#7001 )	2023-06-14 12:12:15 +03:00
Halil Ozan Akgül	321fcfcdb5	Add Support for Single Shard Tables in update_distributed_table_colocation (#6924 ) Adds Support for Single Shard Tables in `update_distributed_table_colocation`. This PR changes checks that make sure tables should be hash distributed table to hash or single shard distributed tables.	2023-05-29 11:47:50 +03:00
Onur Tirtir	246b054a7d	Add support for schema-based-sharding via a GUC (#6866 ) DESCRIPTION: Adds citus.enable_schema_based_sharding GUC that allows sharding the database based on schemas when enabled. * Refactor the logic that automatically creates Citus managed tables * Refactor CreateSingleShardTable() to allow specifying colocation id instead * Add support for schema-based-sharding via a GUC ### What this PR is about: Add citus.enable_schema_based_sharding GUC to enable schema-based sharding. Each schema created while this GUC is ON will be considered as a tenant schema. Later on, regardless of whether the GUC is ON or OFF, any table created in a tenant schema will be converted to a single shard distributed table (without a shard key). All the tenant tables that belong to a particular schema will be co-located with each other and will have a shard count of 1. We introduce a new metadata table --pg_dist_tenant_schema-- to do the bookkeeping for tenant schemas: ```sql psql> \d pg_dist_tenant_schema Table "pg_catalog.pg_dist_tenant_schema" ┌───────────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├───────────────┼─────────┼───────────┼──────────┼─────────┤ │ schemaid │ oid │ │ not null │ │ │ colocationid │ integer │ │ not null │ │ └───────────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "pg_dist_tenant_schema_pkey" PRIMARY KEY, btree (schemaid) "pg_dist_tenant_schema_unique_colocationid_index" UNIQUE, btree (colocationid) psql> table pg_dist_tenant_schema; ┌───────────┬───────────────┐ │ schemaid │ colocationid │ ├───────────┼───────────────┤ │ 41963 │ 91 │ │ 41962 │ 90 │ └───────────┴───────────────┘ (2 rows) ``` Colocation id column of pg_dist_tenant_schema can never be NULL even for the tenant schemas that don't have a tenant table yet. This is because, we assign colocation ids to tenant schemas as soon as they are created. That way, we can keep associating tenant schemas with particular colocation groups even if all the tenant tables of a tenant schema are dropped and recreated later on. When a tenant schema is dropped, we delete the corresponding row from pg_dist_tenant_schema. In that case, we delete the corresponding colocation group from pg_dist_colocation as well. ### Future work for 12.0 release: We're building schema-based sharding on top of the infrastructure that adds support for creating distributed tables without a shard key (https://github.com/citusdata/citus/pull/6867). However, not all the operations that can be done on distributed tables without a shard key necessarily make sense (in the same way) in the context of schema-based sharding. For example, we need to think about what happens if user attempts altering schema of a tenant table. We will tackle such scenarios in a future PR. We will also add a new UDF --citus.schema_tenant_set() or such-- to allow users to use an existing schema as a tenant schema, and another one --citus.schema_tenant_unset() or such-- to stop using a schema as a tenant schema in future PRs.	2023-05-26 10:49:58 +03:00
Onur Tirtir	fa467e05e7	Add support for creating distributed tables with a null shard key (#6745 ) With this PR, we allow creating distributed tables with without specifying a shard key via create_distributed_table(). Here are the the important details about those tables: * Specifying `shard_count` is not allowed because it is assumed to be 1. * We mostly call such tables as "null shard-key" table in code / comments. * To avoid doing a breaking layout change in create_distributed_table(); instead of throwing an error, it will inform the user that `distribution_type` param is ignored unless it's explicitly set to NULL or 'h'. * `colocate_with` param allows colocating such null shard-key tables to each other. * We define this table type, i.e., NULL_SHARD_KEY_TABLE, as a subclass of DISTRIBUTED_TABLE because we mostly want to treat them as distributed tables in terms of SQL / DDL / operation support. * Metadata for such tables look like: - distribution method => DISTRIBUTE_BY_NONE - replication model => REPLICATION_MODEL_STREAMING - colocation id => != INVALID_COLOCATION_ID (distinguishes from Citus local tables) * We assign colocation groups for such tables to different nodes in a round-robin fashion based on the modulo of "colocation id". Note that this PR doesn't care about DDL (except CREATE TABLE) / SQL / operation (i.e., Citus UDFs) support for such tables but adds a preliminary API.	2023-05-03 16:18:27 +03:00
aykut-bozkurt	8cb69cfd13	break sequence dependency during table creation (#6889 ) We need to break sequence dependency for a table while creating the table during non-transactional metadata sync to ensure idempotency of the creation of the table. Problem: When we send `SELECT pg_catalog.worker_drop_sequence_dependency(logicalrelid::regclass::text) FROM pg_dist_partition` to workers during the non-transactional sync, table might not be in `pg_dist_partition` at worker, and sequence dependency is not broken at the worker. Solution: We break sequence dependency via `SELECT pg_catalog.worker_drop_sequence_dependency(logicalrelid::regclass::text)` for each table while creating it at the workers. It is safe to send since the udf is a no-op when there is no sequence dependency. DESCRIPTION: Fixes a bug related to sequence idempotency at non-transactional sync. Fixes https://github.com/citusdata/citus/issues/6888.	2023-04-28 15:09:09 +03:00
aykutbozkurt	f2f0ec9dda	PR #6728 / commit - 12 Force activated bare connections to close at transaction end.	2023-03-30 11:06:16 +03:00
aykutbozkurt	35dbdae5a4	PR #6728 / commit - 11 Let AddNodeMetadata to use metadatasync api during node addition.	2023-03-30 11:06:16 +03:00
aykutbozkurt	a74232bb39	PR #6728 / commit - 9 Do not enforce distributed transaction at `EnsureCoordinatorInitiatedOperation`.	2023-03-30 10:53:22 +03:00
aykutbozkurt	f8fb20cc95	PR #6728 / commit - 7 Remove unused old metadata sync methods.	2023-03-30 10:53:22 +03:00
aykutbozkurt	bc25ba51c3	PR #6728 / commit - 5 Let `ActivateNode` use new metadata sync api.	2023-03-30 10:53:22 +03:00
aykutbozkurt	29ef9117e6	PR #6728 / commit - 4 Add new metadata sync methods which uses MemorySyncContext api so that during the sync we can - free memory to prevent OOM, - use either transactional or nontransactional modes according to the GUC .	2023-03-30 10:53:22 +03:00
aykutbozkurt	8feb8c634a	PR #6728 / commit - 3 Let nontransactional sync mode create transaction per shell table during dropping the shell tables from worker.	2023-03-30 10:53:20 +03:00
aykutbozkurt	85d50203d1	PR #6728 / commit - 2 - Create MetadataSyncContext api to encapsulate both transactional and nontransactional modes, - Add a GUC to switch between metadata sync transaction modes.	2023-03-30 10:52:46 +03:00
Gokhan Gulbiz	e71bfd6074	Identity column implementation refactorings (#6738 ) This pull request proposes a change to the logic used for propagating identity columns to worker nodes in citus. Instead of creating a dependent sequence for each identity column and changing its default value to `nextval(seq)/worker_nextval(seq)`, this update will pass the identity columns as-is to the worker nodes. Please note that there are a few limitations to this change. 1. Only bigint identity columns will be allowed in distributed tables to ensure compatibility with the DDL from any node functionality. Our current distributed sequence implementation only allows insert statements from all nodes for bigint sequences. 2. `alter_distributed_table` and `undistribute_table` operations will not be allowed for tables with identity columns. This is because we do not have a proper way of keeping sequence states consistent across the cluster. DESCRIPTION: Prevents using identity columns on data types other than `bigint` on distributed tables DESCRIPTION: Prevents using `alter_distributed_table` and `undistribute_table` UDFs when a table has identity columns DESCRIPTION: Fixes a bug that prevents enforcing identity column restrictions on worker nodes Depends on #6740 Fixes #6694	2023-03-30 10:41:01 +03:00
Marco Slot	b09d239809	Propagate CREATE PUBLICATION statements	2023-03-29 00:59:12 +02:00
Onur Tirtir	20a5f3af2b	Replace CITUS_TABLE_WITH_NO_DIST_KEY checks with HasDistributionKey() (#6743 ) Now that we will soon add another table type having DISTRIBUTE_BY_NONE as distribution method and that we want the code to interpret such tables mostly as distributed tables, let's make the definition of those other two table types more strict by removing CITUS_TABLE_WITH_NO_DIST_KEY macro. And instead, use HasDistributionKey() check in the places where the logic applies to all table types that have / don't have a distribution key. In future PRs, we might want to convert some of those HasDistributionKey() checks if logic only applies to Citus local / reference tables, not the others. And adding HasDistributionKey() also allows us to consider having DISTRIBUTE_BY_NONE as the distribution method as a "table attribute" that can apply to distributed tables too, rather something that determines the table type.	2023-03-10 13:55:52 +03:00
Gokhan Gulbiz	2388fbea6e	Identity Column Support on Citus Managed Tables (#6591 ) DESCRIPTION: Identity Column Support on Citus Managed Tables	2023-01-19 15:45:41 +03:00
Ahmet Gedemenli	b3b135867e	Remove shardstate from placement insert functions (#6615 )	2023-01-18 09:52:38 +01:00
Marco Slot	ad3407b5ff	Revert "Make the metadata syncing less resource invasive [Phase-1]" (#6618 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-01-13 13:56:55 +01:00
Önder Kalacı	a1aa96b32c	Make the metadata syncing less resource invasive [Phase-1] (#6537 )	2023-01-04 11:36:45 +01:00
Ahmet Gedemenli	235047670d	Drop SHARD_STATE_TO_DELETE (#6494 ) DESCRIPTION: Drop `SHARD_STATE_TO_DELETE` and use the cleanup records instead Drops the shard state that is used to mark shards as orphaned. Now we insert cleanup records into `pg_dist_cleanup` so "orphaned" shards will be dropped either by maintenance daemon or internal cleanup calls. With this PR, we make the "cleanup orphaned shards" functions to be no-op, as they would not be needed anymore. This PR includes some naming changes about placement functions. We don't need functions that filter orphaned shards, as there will be no orphaned shards anymore. We will also be introducing a small script with this PR, for users with orphaned shards. We'll basically delete the orphaned shard entries from `pg_dist_placement` and insert cleanup records into `pg_dist_cleanup` for each one of them, during Citus upgrade. We also have a lot of flakiness fixes in this PR. Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2023-01-03 14:38:16 +03:00
Marco Slot	666696c01c	Deprecate citus.replicate_reference_tables_on_activate, make it always off (#6474 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-04 16:21:10 +01:00
Marco Slot	6bb31c5d75	Add non-blocking variant of create_distributed_table (#6087 ) Added create_distributed_table_concurrently which is nonblocking variant of create_distributed_table. It bases on the split API which takes advantage of logical replication to support nonblocking split operations. Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: aykutbozkurt <aykut.bozkurt1995@gmail.com>	2022-08-30 15:35:40 +03:00
Onder Kalaci	9ec8e627c1	Support Sequences owned by columns before distributing tables There are 3 different ways that a sequence can be interacting with tables. (1) and (2) are already supported. This commit adds support for (3). (1) column DEFAULT nextval('seq'): The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \|------------------ sequence <--------\| (2) serial columns: Bigserial/small serial etc: The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \| \| sequence <--------\| (3) Sequence OWNED BY table.column: Added support for this type of resolution in this commit. The dependency is almost like the following, and ExpandCitusSupportedTypes() is NOT responsible for finding the dependency. schema <--- table <--- column ^ \| sequence	2022-08-18 10:29:40 +02:00
aykut-bozkurt	4992533e33	support grant statement propagation for aggregates (#6132 )	2022-08-05 14:47:33 +03:00
Naisila Puka	7d6410c838	Drop postgres 12 support (#6040 ) * Remove if conditions with PG_VERSION_NUM < 13 * Remove server_above_twelve(&eleven) checks from tests * Fix tests * Remove pg12 and pg11 alternative test output files * Remove pg12 specific normalization rules * Some more if conditions in the code * Change RemoteCollationIdExpression and some pg12/pg13 comments * Remove some more normalization rules	2022-07-20 17:49:36 +03:00

1 2 3 4

196 Commits (a1ab60f5ad0108058f8032bdf34cedd8e87e4604)