citus

Commit Graph

Author	SHA1	Message	Date
Ahmet Gedemenli	5b0e60884e	Merge pull request #4373 from citusdata/propagate-alter-schema-rename Propagate alter schema rename	2020-12-02 15:29:26 +03:00
Ahmet Gedemenli	5242dcfe99	Add tests for propagating alter schema rename	2020-12-02 15:18:26 +03:00
Ahmet Gedemenli	514c6a76ac	Propagate alter schema rename	2020-12-02 15:18:26 +03:00
Nils Dijk	fde93072dd	Merge pull request #4335 from citusdata/fix/cstore-options-dist-tables columnar table options for distributed tables	2020-12-02 13:12:57 +01:00
Nils Dijk	6f9c040f76	DESCRIPTION: Propagate columnar table settings for distributed tables When distributing a columnar table, as well as changing options on a distributed columnar table, this patch will forward the settings from the coordinator to the workers. For propagating options changes on an already distributed table this change is pretty straight forward. Before applying the change in options locally we will create a `DDLJob` that contains a call to `alter_columnar_table_set(...)` for every shard placement with all settings of the current table. This goes both for setting an option as well as resetting. This will reset the values to the defaults configured on the coordinator. Having the effect that the coordinator is authoritative on the settings and makes sure the shards have the same settings set as the table on the coordinator. When a columnar table is distributed it is using the `TableDDLCommand` infra structure to create a new kind of `TableDDLCommand`. This new type, called a `TableDDLCommandFunction` contains a context and 2 function pointers to execute. One function returns the command as applied on the table, the second function will return the sql command to apply to a shard with a given shard id. The schema name is ignored as it will use the fully qualified name of the shard in the same schema as the base table.	2020-12-02 13:02:42 +01:00
Halil Ozan Akgül	ef0914a7f8	Adds ORDER BY to flaky test (#4305 ) Co-authored-by: Önder Kalacı <onder@citusdata.com>	2020-12-02 14:24:05 +03:00
Önder Kalacı	48d6266fd4	Merge pull request #4374 from citusdata/sequential_execution_use_lcao Multi-row INSERTs use local execution when placements are local	2020-12-01 22:45:35 +03:00
Onder Kalaci	f7e1aa3f22	Multi-row INSERTs use local execution when placements are local Multi-row execution already uses sequential execution. When shards are local, using local execution is profitable as it avoids an extra connection establishment to the local node.	2020-12-01 21:37:59 +03:00
Onur Tirtir	ea79ca0e5e	Merge pull request #4372 from citusdata/update-cl-951 Update CHANGELOG for 9.5.1	2020-12-01 16:51:36 +03:00
Onur Tirtir	dd3453ced5	Update CHANGELOG for 9.5.1	2020-12-01 14:02:36 +03:00
Marco Slot	df3539710a	Merge pull request #4370 from citusdata/marcocitus/fix-flappy	2020-12-01 11:30:53 +01:00
Ahmet Gedemenli	cc9ea31c60	Merge pull request #4356 from citusdata/add-test-for-citus-size-func Add test for citus table size func in transaction with modification	2020-12-01 11:08:20 +03:00
Ahmet Gedemenli	8e5f0487eb	Add order by for flaky test	2020-12-01 10:54:52 +03:00
Ahmet Gedemenli	67761897ab	Add test for citus table size func in transaction with modification Add test for citus_relation_size	2020-12-01 10:38:15 +03:00
Hadi Moshayedi	feecb7b423	Columnar: few fixes (#4371 ) * Columnar: fix a memory issue * Columnar: no need for deferred triggers * Columnar: relax memory growth constraints	2020-11-30 18:09:43 -08:00
Hadi Moshayedi	a94e8c9cda	Associate column store metadata with storage id (#4347 )	2020-11-30 18:01:43 -08:00
Marco Slot	de22b633cb	Merge pull request #4365 from citusdata/marcocitus/fix-flappy Fix flappy test: Run subquery_prepared_statements by itself	2020-11-30 22:35:51 +01:00
Marco Slot	4a05b2ad77	Merge pull request #4367 from citusdata/isolate_join_test Isolate join test	2020-11-30 22:09:21 +01:00
Sait Talha Nisanci	8b0aed521f	Isolate join test Join test gets too many clients error too frequently hence we should not run anything concurrently with that. Hopefully this will fix the flakiness of test.	2020-12-01 00:00:17 +03:00
Marco Slot	04cffdd925	Run master_copy_shard_placement separately	2020-11-30 20:34:03 +01:00
Marco Slot	48caca4084	Improve regression test settings	2020-11-30 20:34:03 +01:00
SaitTalhaNisanci	c31a8df380	Call 6 times not 7 in subquery_prepared_statements (#4357 )	2020-11-30 21:20:51 +03:00
Onur Tirtir	03bcccdee0	Fix hostname length check in StartNodeUserDatabaseConnection (#4363 ) Copying string before hostname length check makes the check useless	2020-11-30 20:00:35 +03:00
Onur Tirtir	7f3d1182ed	Handle invalid connection hash entries (#4362 ) If MemoryContextAlloc errors out -e.g. during an OOM-, ConnectionHashEntry->connections stays as NULL. With this commit, we add isValid flag to ConnectionHashEntry that should be set to true right after we allocate & initialize ConnectionHashEntry->connections list properly, and we check it before accesing to ConnectionHashEntry->connections.	2020-11-30 19:44:03 +03:00
SaitTalhaNisanci	8c3dd6338e	Run pg12 and pg13 separately (#4352 ) It seems that sometimes we get `too many clients errors` with this set of parallel tests, hence two of them are separated.	2020-11-30 19:32:49 +03:00
Marco Slot	ecbc1ab008	Run subquery_prepared_statements by itself	2020-11-30 08:53:06 +01:00
Hadi Moshayedi	7f43804dae	Normalize VACUUM VERBOSE output (#4353 ) This is to avoid flaky changes like the following in test outputs: -CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s. +CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.02 s.	2020-11-27 12:07:25 -08:00
Nils Dijk	383e334023	refactor options to their own table linked to the regclass (#4346 ) Columnar options were by accident linked to the relfilenode instead of the regclass/relation oid. This PR moves everything related to columnar options to their own catalog table.	2020-11-27 11:22:08 -08:00
SaitTalhaNisanci	af02ac6cf5	Refactor MultiRouterPlannableQuery (#4350 ) The name of the function is different than the implemantation. Because the function is designed to only consider SELECT queries. Also this changes the assert with an error.	2020-11-27 18:44:38 +03:00
Nils Dijk	326e6afa53	refactor table ddl events scoped for shards (#4342 ) Refactor internals on how Citus creates the SQL commands it sends to recreate shards. Before Citus collected solely ddl commands as `char `'s to recreate a table. If they were used to create a shard they were wrapped with `worker_apply_shard_ddl_command` and send to the workers. On the workers the UDF wrapping the ddl command would rewrite the parsetree to replace tables names with their shard name equivalent. This worked well, but poses an issue when adding columnar. Due to limitations in Postgres on creating custom options on table access methods we need to fall back on a UDF to set columnar specific options. Now, to recreate the table, we can not longer rely on having solely DDL statements to recreate a table. A prototype was made to run this UDF wrapped in `worker_apply_shard_ddl_command`. This became pretty messy, hard to understand and subsequently hard to maintain. This PR proposes a refactor of the internal representation of table ddl commands into a `TableDDLCommand` structure. The current implementation only supports a `char ` as its contents. Based on the use of the DDL statement (eg. creating the table -mx- or creating a shard) one of two different functions can be called to get the statement to send to the worker: - `GetTableDDLCommand(TableDDLCommand command)`: This function returns that ddl command to create the table. In this implementation it will just return the `char `. This has the same functionality as getting the old list and not wrapping it. - `GetShardedTableDDLCommand(TableDDLCommand command, uint64 shardId, char schemaName)`: This function returns the ddl command wrapped in `worker_apply_shard_ddl_command` with the `shardId` as an argument. Due to backwards compatibility it also accepts a. `schemaName`. The exact purpose is not directly clear. Ideally new implementations would work with fully qualified statements and ignore the `schemaName`. A future implementation could accept 2.function pointers and a `void *` for context to let the two pointers work on. This gives greater flexibility in controlling what commands get send in which situations. Also, in a future, we could implement the intermediate step of creating the `parsetree` datastructure of statements based on the contents in the catalog with a corresponding deparser. For sharded queries a mutator could be ran over the parsetree to rewrite the tablenames to the names with the shard identifier. This will completely omit the requirement for `worker_apply_shard_ddl_command`.	2020-11-26 13:31:59 +01:00
SaitTalhaNisanci	83020f444e	Initialize fast planner restriction context (#4349 ) We initialize fast planner restriction context so that code paths that rely on this being not NULL will operate without a problem.	2020-11-26 13:45:27 +03:00
Önder Kalacı	7539454ccb	Merge pull request #4312 from citusdata/single_node_conn_mngmt_backend_counter Add the infrastructure to count the number of client backends	2020-11-25 19:49:57 +01:00
Onder Kalaci	629ecc3dee	Add the infrastructure to count the number of client backends Considering the adaptive connection management improvements that we plan to roll soon, it makes it very helpful to know the number of active client backends. We are doing this addition to simplify yhe adaptive connection management for single node Citus. In single node Citus, both the client backends and Citus parallel queries would compete to get slots on Postgres' `max_connections` on the same Citus database. With adaptive connection management, we have the counters for Citus parallel queries. That helps us to adaptively decide on the remote executions pool size (e.g., throttle connections if necessary). However, we do not have any counters for the total number of client backends on the database. For single node Citus, we should consider all the client backends, not only the remote connections that Citus does. Of course Postgres internally knows how many client backends are active. However, to get that number Postgres iterates over all the backends. For examaple, see [pg_stat_get_db_numbackends](`8e90ec5580/src/backend/utils/adt/pgstatfuncs.c (L1240)`) where Postgres iterates over all the backends. For our purpuses, we need this information on every connection establishment. That's why we cannot affort to do this kind of iterattion.	2020-11-25 19:19:24 +01:00
SaitTalhaNisanci	180195b445	Remove unused parameter from VarConstOpExprClause (#4348 )	2020-11-25 21:00:22 +03:00
Ahmet Gedemenli	850b292886	Merge pull request #4326 from citusdata/constraint-key-name-fail Fix constraint name for local execution	2020-11-25 15:20:03 +03:00
Ahmet Gedemenli	a64dc8a72b	Fixes a bug preventing INSERT SELECT .. ON CONFLICT with a constraint name on local shards Separate search relation shard function Add tests	2020-11-25 15:10:46 +03:00
Onur Tirtir	46be63d76b	Refactor PreprocessIndexStmt (#4272 )	2020-11-25 12:19:37 +03:00
Önder Kalacı	ba300dcad8	Merge pull request #4344 from citusdata/improveCitusTableTypeIdList Do not cache all the distributed table metadata during CitusTableTypedList()	2020-11-24 17:51:53 +01:00
Onder Kalaci	7accbff3f6	Do not cache all the distributed table metadata during CitusTableTypeIdList() CitusTableTypeIdList() function iterates on all the entries of pg_dist_partition and loads all the metadata in to the cache. This can be quite memory intensive especially when there are lots of distributed tables. When partitioned tables are used, it is common to have many distributed tables given that each partition also becomes a distributed table. CitusTableTypeIdList() is used on every CREATE TABLE .. PARTITION OF.. command as well. It means that, anytime a partition is created, Citus loads all the metadata to the cache. Note that Citus typically only loads the accessed table's metadata to the cache.	2020-11-24 17:44:06 +01:00
Önder Kalacı	c760cd3470	Move local execution after remote execution (#4301 ) * Move local execution after the remote execution Before this commit, when both local and remote tasks exist, the executor was starting the execution with local execution. There is no strict requirements on this. Especially considering the adaptive connection management improvements that we plan to roll soon, moving the local execution after to the remote execution makes more sense. The adaptive connection management for single node Citus would look roughly as follows: - Try to connect back to the coordinator for running parallel queries. - If succeeds, go on and execute tasks in parallel - If fails, fallback to the local execution So, we'll use local execution as a fallback mechanism. And, moving it after to the remote execution allows us to implement such further scenarios.	2020-11-24 13:43:38 +01:00
Onur Tirtir	d15a4c15cf	Merge pull request #4341 from citusdata/update-cl-943 Update CHANGELOG for 9.4.3	2020-11-24 13:29:59 +03:00
Onur Tirtir	76a429f19b	Update CHANGELOG for 9.4.3	2020-11-24 12:52:16 +03:00
Hadi Moshayedi	fc0ef8abba	Merge pull request #4336 from citusdata/cstore_memory_leaks Fix memory leaks in column store	2020-11-23 11:40:39 -08:00
Hadi Moshayedi	40b52ab757	Fix memory leaks in column store	2020-11-23 11:26:12 -08:00
Önder Kalacı	532b457554	Solidify the slow-start algorithm (#4318 ) The adaptive executor emulates the TCP's slow start algorithm. Whenever the executor needs new connections, it doubles the number of connections established in the previous iteration. This approach is powerful. When the remote queries are very short (like index lookup with < 1ms), even a single connection is sufficent most of the time. When the remote queries are long, the executor can quickly establish necessary number of connections. One missing piece on our implementation seems that the executor keeps doubling the number of connections even if the previous connection attempts have been finalized. Instead, we should wait until all the attempts are finalized. This is how TCP's slow-start works. Plus, it decreases the unnecessary pressure on the remote nodes.	2020-11-23 19:20:13 +01:00
jeff-davis	2e70dbe40a	Merge pull request #4330 from citusdata/remove-fdw remove columnar FDW code	2020-11-20 10:19:20 -08:00
Jeff Davis	ba6ec610e2	address review comment	2020-11-20 10:03:12 -08:00
Jeff Davis	8cee2b092b	remove columnar FDW code	2020-11-20 10:03:12 -08:00
Jelte Fennema	b2def22ab1	Fix possible uninitialized variable warning (#4334 ) I got this warning when compiling citus: ``` ../columnar/write_state_management.c: In function ‘PendingWritesInUpperTransactions’: ../columnar/write_state_management.c:364:20: warning: ‘entry’ may be used uninitialized in this function [-Wmaybe-uninitialized] if (found && entry->writeStateStack != NULL) ~~~~~^~~~~~~~~~~~~~~~ ``` I fixed this by checking by always initializing entry, by using an early return if `WriteStateMap` didn't exist. Instead of using the `found` variable to check for existence of the key, I now simply check the `entry` variable itself. To quote the postgres comment on the hash_enter function: > If foundPtr isn't NULL, then *foundPtr is set true if we found an > existing entry in the table, false otherwise. This is needed in the > HASH_ENTER case, but is redundant with the return value otherwise.	2020-11-20 16:02:03 +01:00
Önder Kalacı	856e5c85cf	Merge pull request #4331 from citusdata/pre_executor_run Do not execute subplans multiple times with cursors	2020-11-20 13:36:07 +01:00

1 2 3 4 5 ...

4338 Commits (cceaf31e4c09c86485f12273a8b7036b12ec34cc) All Branches Search

4338 Commits (cceaf31e4c09c86485f12273a8b7036b12ec34cc)

All Branches