citus

Commit Graph

Author	SHA1	Message	Date
Hadi Moshayedi	a94e8c9cda	Associate column store metadata with storage id (#4347 )	2020-11-30 18:01:43 -08:00
Marco Slot	de22b633cb	Merge pull request #4365 from citusdata/marcocitus/fix-flappy Fix flappy test: Run subquery_prepared_statements by itself	2020-11-30 22:35:51 +01:00
Marco Slot	4a05b2ad77	Merge pull request #4367 from citusdata/isolate_join_test Isolate join test	2020-11-30 22:09:21 +01:00
Sait Talha Nisanci	8b0aed521f	Isolate join test Join test gets too many clients error too frequently hence we should not run anything concurrently with that. Hopefully this will fix the flakiness of test.	2020-12-01 00:00:17 +03:00
SaitTalhaNisanci	c31a8df380	Call 6 times not 7 in subquery_prepared_statements (#4357 )	2020-11-30 21:20:51 +03:00
Onur Tirtir	03bcccdee0	Fix hostname length check in StartNodeUserDatabaseConnection (#4363 ) Copying string before hostname length check makes the check useless	2020-11-30 20:00:35 +03:00
Onur Tirtir	7f3d1182ed	Handle invalid connection hash entries (#4362 ) If MemoryContextAlloc errors out, e.g. during an OOM, ConnectionHashEntry->connections stays NULL. With this commit, we add an isValid flag to ConnectionHashEntry that should be set to true right after we allocate and initialize the ConnectionHashEntry->connections list properly, and we check it before accessing ConnectionHashEntry->connections.	2020-11-30 19:44:03 +03:00
SaitTalhaNisanci	8c3dd6338e	Run pg12 and pg13 separately (#4352 ) It seems that sometimes we get `too many clients errors` with this set of parallel tests, hence two of them are separated.	2020-11-30 19:32:49 +03:00
Marco Slot	ecbc1ab008	Run subquery_prepared_statements by itself	2020-11-30 08:53:06 +01:00
Hadi Moshayedi	7f43804dae	Normalize VACUUM VERBOSE output (#4353 ) This is to avoid flaky changes like the following in test outputs: -CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s. +CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.02 s.	2020-11-27 12:07:25 -08:00
Nils Dijk	383e334023	refactor options to their own table linked to the regclass (#4346 ) Columnar options were by accident linked to the relfilenode instead of the regclass/relation oid. This PR moves everything related to columnar options to their own catalog table.	2020-11-27 11:22:08 -08:00
SaitTalhaNisanci	af02ac6cf5	Refactor MultiRouterPlannableQuery (#4350 ) The name of the function does not match its implementation, because the function is designed to consider only SELECT queries. This also replaces the assert with an error.	2020-11-27 18:44:38 +03:00
Nils Dijk	326e6afa53	refactor table ddl events scoped for shards (#4342 ) Refactor internals on how Citus creates the SQL commands it sends to recreate shards. Before, Citus collected solely ddl commands as `char *`'s to recreate a table. If they were used to create a shard they were wrapped with `worker_apply_shard_ddl_command` and sent to the workers. On the workers the UDF wrapping the ddl command would rewrite the parsetree to replace table names with their shard name equivalent. This worked well, but poses an issue when adding columnar. Due to limitations in Postgres on creating custom options on table access methods we need to fall back on a UDF to set columnar specific options. Now, to recreate the table, we can no longer rely on having solely DDL statements. A prototype was made to run this UDF wrapped in `worker_apply_shard_ddl_command`. This became pretty messy, hard to understand and subsequently hard to maintain. This PR proposes a refactor of the internal representation of table ddl commands into a `TableDDLCommand` structure. The current implementation only supports a `char *` as its contents. Based on the use of the DDL statement (e.g. creating the table -mx- or creating a shard) one of two different functions can be called to get the statement to send to the worker: - `GetTableDDLCommand(TableDDLCommand *command)`: This function returns the ddl command to create the table. In this implementation it will just return the `char *`. This has the same functionality as getting the old list and not wrapping it. - `GetShardedTableDDLCommand(TableDDLCommand *command, uint64 shardId, char *schemaName)`: This function returns the ddl command wrapped in `worker_apply_shard_ddl_command` with the `shardId` as an argument. For backwards compatibility it also accepts a `schemaName`. The exact purpose is not directly clear. Ideally new implementations would work with fully qualified statements and ignore the `schemaName`. A future implementation could accept 2 function pointers and a `void *` for context to let the two pointers work on. This gives greater flexibility in controlling what commands get sent in which situations. Also, in the future, we could implement the intermediate step of creating the `parsetree` datastructure of statements based on the contents in the catalog with a corresponding deparser. For sharded queries a mutator could be run over the parsetree to rewrite the table names to the names with the shard identifier. This will completely omit the requirement for `worker_apply_shard_ddl_command`.	2020-11-26 13:31:59 +01:00
SaitTalhaNisanci	83020f444e	Initialize fast planner restriction context (#4349 ) We initialize fast planner restriction context so that code paths that rely on this being not NULL will operate without a problem.	2020-11-26 13:45:27 +03:00
Önder Kalacı	7539454ccb	Merge pull request #4312 from citusdata/single_node_conn_mngmt_backend_counter Add the infrastructure to count the number of client backends	2020-11-25 19:49:57 +01:00
Onder Kalaci	629ecc3dee	Add the infrastructure to count the number of client backends Considering the adaptive connection management improvements that we plan to roll out soon, it is very helpful to know the number of active client backends. We are doing this addition to simplify the adaptive connection management for single node Citus. In single node Citus, both the client backends and Citus parallel queries would compete to get slots on Postgres' `max_connections` on the same Citus database. With adaptive connection management, we have the counters for Citus parallel queries. That helps us to adaptively decide on the remote executions pool size (e.g., throttle connections if necessary). However, we do not have any counters for the total number of client backends on the database. For single node Citus, we should consider all the client backends, not only the remote connections that Citus makes. Of course Postgres internally knows how many client backends are active. However, to get that number Postgres iterates over all the backends. For example, see [pg_stat_get_db_numbackends](`8e90ec5580/src/backend/utils/adt/pgstatfuncs.c (L1240)`) where Postgres iterates over all the backends. For our purposes, we need this information on every connection establishment. That's why we cannot afford to do this kind of iteration.	2020-11-25 19:19:24 +01:00
SaitTalhaNisanci	180195b445	Remove unused parameter from VarConstOpExprClause (#4348 )	2020-11-25 21:00:22 +03:00
Ahmet Gedemenli	850b292886	Merge pull request #4326 from citusdata/constraint-key-name-fail Fix constraint name for local execution	2020-11-25 15:20:03 +03:00
Ahmet Gedemenli	a64dc8a72b	Fixes a bug preventing INSERT SELECT .. ON CONFLICT with a constraint name on local shards Separate search relation shard function Add tests	2020-11-25 15:10:46 +03:00
Onur Tirtir	46be63d76b	Refactor PreprocessIndexStmt (#4272 )	2020-11-25 12:19:37 +03:00
Önder Kalacı	ba300dcad8	Merge pull request #4344 from citusdata/improveCitusTableTypeIdList Do not cache all the distributed table metadata during CitusTableTypeIdList()	2020-11-24 17:51:53 +01:00
Onder Kalaci	7accbff3f6	Do not cache all the distributed table metadata during CitusTableTypeIdList() The CitusTableTypeIdList() function iterates over all the entries of pg_dist_partition and loads all the metadata into the cache. This can be quite memory intensive, especially when there are lots of distributed tables. When partitioned tables are used, it is common to have many distributed tables given that each partition also becomes a distributed table. CitusTableTypeIdList() is used on every CREATE TABLE .. PARTITION OF.. command as well. It means that, anytime a partition is created, Citus loads all the metadata into the cache. Note that Citus typically only loads the accessed table's metadata into the cache.	2020-11-24 17:44:06 +01:00
Önder Kalacı	c760cd3470	Move local execution after remote execution (#4301 ) * Move local execution after the remote execution Before this commit, when both local and remote tasks existed, the executor started with local execution. There are no strict requirements on this order. Especially considering the adaptive connection management improvements that we plan to roll out soon, moving the local execution after the remote execution makes more sense. The adaptive connection management for single node Citus would look roughly as follows: - Try to connect back to the coordinator for running parallel queries. - If that succeeds, go on and execute tasks in parallel. - If it fails, fall back to local execution. So, we'll use local execution as a fallback mechanism. Moving it after the remote execution allows us to implement such further scenarios.	2020-11-24 13:43:38 +01:00
Onur Tirtir	d15a4c15cf	Merge pull request #4341 from citusdata/update-cl-943 Update CHANGELOG for 9.4.3	2020-11-24 13:29:59 +03:00
Onur Tirtir	76a429f19b	Update CHANGELOG for 9.4.3	2020-11-24 12:52:16 +03:00
Hadi Moshayedi	fc0ef8abba	Merge pull request #4336 from citusdata/cstore_memory_leaks Fix memory leaks in column store	2020-11-23 11:40:39 -08:00
Hadi Moshayedi	40b52ab757	Fix memory leaks in column store	2020-11-23 11:26:12 -08:00
Önder Kalacı	532b457554	Solidify the slow-start algorithm (#4318 ) The adaptive executor emulates TCP's slow start algorithm. Whenever the executor needs new connections, it doubles the number of connections established in the previous iteration. This approach is powerful. When the remote queries are very short (like an index lookup with < 1ms), even a single connection is sufficient most of the time. When the remote queries are long, the executor can quickly establish the necessary number of connections. One missing piece in our implementation is that the executor keeps doubling the number of connections even if the previous connection attempts have not been finalized. Instead, we should wait until all the attempts are finalized. This is how TCP's slow-start works. Plus, it decreases the unnecessary pressure on the remote nodes.	2020-11-23 19:20:13 +01:00
jeff-davis	2e70dbe40a	Merge pull request #4330 from citusdata/remove-fdw remove columnar FDW code	2020-11-20 10:19:20 -08:00
Jeff Davis	ba6ec610e2	address review comment	2020-11-20 10:03:12 -08:00
Jeff Davis	8cee2b092b	remove columnar FDW code	2020-11-20 10:03:12 -08:00
Jelte Fennema	b2def22ab1	Fix possible uninitialized variable warning (#4334 ) I got this warning when compiling citus: ``` ../columnar/write_state_management.c: In function ‘PendingWritesInUpperTransactions’: ../columnar/write_state_management.c:364:20: warning: ‘entry’ may be used uninitialized in this function [-Wmaybe-uninitialized] if (found && entry->writeStateStack != NULL) ~~~~~^~~~~~~~~~~~~~~~ ``` I fixed this by always initializing entry, using an early return if `WriteStateMap` didn't exist. Instead of using the `found` variable to check for existence of the key, I now simply check the `entry` variable itself. To quote the postgres comment on the hash_enter function: > If foundPtr isn't NULL, then *foundPtr is set true if we found an > existing entry in the table, false otherwise. This is needed in the > HASH_ENTER case, but is redundant with the return value otherwise.	2020-11-20 16:02:03 +01:00
Önder Kalacı	856e5c85cf	Merge pull request #4331 from citusdata/pre_executor_run Do not execute subplans multiple times with cursors	2020-11-20 13:36:07 +01:00
Onder Kalaci	c433c66f2b	Do not execute subplans multiple times with cursors Before this commit, we let AdaptiveExecutorPreExecutorRun() take effect on every FETCH on a cursor. That does not affect the correctness of the query results, but adds significant overhead.	2020-11-20 10:43:56 +01:00
Önder Kalacı	b0ddbbd33a	Enable parallel query on EXPLAIN ANALYZE (#4325 ) It seems that we forgot to pass the relevant flag to enable Postgres' parallel query capabilities on the shards when a user runs EXPLAIN ANALYZE on a distributed table.	2020-11-20 09:54:04 +01:00
Hadi Moshayedi	c35f38459b	Merge pull request #4320 from citusdata/cstore_alter_table Fix ALTER COLUMN ... TYPE for columnar	2020-11-19 15:58:15 -08:00
Hadi Moshayedi	b182a95389	Fix ALTER COLUMN ... SET TYPE for columnar	2020-11-19 15:36:45 -08:00
jeff-davis	4e035b6044	Merge pull request #4328 from citusdata/rename rename cstore_tableam -> columnar	2020-11-19 13:37:31 -08:00
Jeff Davis	cef1d0e915	fixup test output	2020-11-19 12:45:52 -08:00
Jeff Davis	91015deb9d	rename UDFs also	2020-11-19 12:27:40 -08:00
Jeff Davis	a2b698a766	rename cstore_tableam -> columnar	2020-11-19 12:15:51 -08:00
SaitTalhaNisanci	05390729f9	Merge pull request #4327 from citusdata/initializeVariable Initialize entry variable as NULL	2020-11-19 16:37:24 +03:00
Sait Talha Nisanci	ddc8e6c702	Initialize entry variable as NULL	2020-11-19 15:23:39 +03:00
SaitTalhaNisanci	09f737d942	Merge pull request #4283 from citusdata/component_governance_config Add component governance config	2020-11-19 13:27:46 +03:00
SaitTalhaNisanci	3dca29a4c3	Merge branch 'master' into component_governance_config	2020-11-19 13:16:01 +03:00
Sait Talha Nisanci	5f436e10d0	Add the NOTICE file	2020-11-18 17:49:01 +03:00
SaitTalhaNisanci	9c44911226	Improve error messages in shard pruning (#4324 )	2020-11-18 17:16:06 +03:00
Hadi Moshayedi	021ed07f12	Merge pull request #4322 from citusdata/cstore_tests Test more of SQL features with column store	2020-11-17 20:28:03 -08:00
Hadi Moshayedi	2747fd80ff	Add prepared materialized view tests for columnar	2020-11-17 20:13:20 -08:00
Hadi Moshayedi	6711340ea6	Add prepared xact & stmt tests for columnar	2020-11-17 20:00:57 -08:00
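
The slow-start behavior described in commit 532b457554 can be illustrated with a minimal sketch in C. This is not the Citus implementation: `SlowStartState`, `NextConnectionCount`, `StartConnections` and `FinalizeAttempt` are hypothetical names introduced only for this example, and the per-pool limit `maxConnections` is an assumption. The sketch only shows the two rules stated in the commit message: each round opens twice as many connections as the previous round, and no new round starts while earlier attempts are still pending.

```c
#include <assert.h>

/* Hypothetical bookkeeping for one worker pool (not a Citus struct). */
typedef struct SlowStartState
{
    int lastRoundCount;   /* connections started in the previous round */
    int pendingAttempts;  /* attempts that have not been finalized yet */
    int openedTotal;      /* connections started so far */
    int maxConnections;   /* assumed upper bound on the pool size */
} SlowStartState;

/*
 * Returns how many new connections may be started now.
 * Returns 0 while any earlier attempt is still pending, which is
 * the "wait until all the attempts are finalized" rule.
 */
int
NextConnectionCount(const SlowStartState *state, int neededConnections)
{
    assert(state->pendingAttempts >= 0);

    if (state->pendingAttempts > 0)
    {
        return 0;
    }

    /* the first round opens one connection, later rounds double it */
    int target = (state->lastRoundCount == 0) ? 1 : state->lastRoundCount * 2;
    int remaining = state->maxConnections - state->openedTotal;

    if (target > neededConnections)
    {
        target = neededConnections;
    }
    if (target > remaining)
    {
        target = remaining;
    }

    return (target < 0) ? 0 : target;
}

/* Records that `count` connection attempts were started in this round. */
void
StartConnections(SlowStartState *state, int count)
{
    if (count <= 0)
    {
        return;
    }

    state->lastRoundCount = count;
    state->pendingAttempts += count;
    state->openedTotal += count;
}

/* Records that one attempt finished, whether it succeeded or failed. */
void
FinalizeAttempt(SlowStartState *state)
{
    if (state->pendingAttempts > 0)
    {
        state->pendingAttempts--;
    }
}
```

A caller would ask `NextConnectionCount` whenever it has unassigned work, pass the result to `StartConnections`, and call `FinalizeAttempt` once per completed attempt. With short remote queries the first connection often drains the work before a second round is ever requested.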

4221 Commits (a94e8c9cda06db17889a72867c8fa06342dd9a54)