Soon I will be making some changes related to #692 in the router planner, and those changes require updating ~5-6 tests related to router planning. To make those test files runnable by run_test.py multiple times, we need to make some other tests (the ones they are run in parallel with / heavily depend on) ready for run_test.py too.
Also, this replaces a HasDistributionKey() check with IsCitusTableType(DISTRIBUTED_TABLE), since the callers in question are only interested in distributed tables. This doesn't make a difference on main, reads slightly more intuitively, and would also allow safely using this function in https://github.com/citusdata/citus/pull/6773.
This would be useful for testing #6773: given that #6773 only adds support for router / fast-path queries, theoretically almost all the tests we have in that test file should work for null-shard-key tables too (and they indeed do).
I deliberately did not replace multi_router_planner_fast_path.sql with
the one that I'm adding into arbitrary configs because we might still
want to see when we're able to go through fast-path planning for the
usual distributed tables (the ones that have a shard key).
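As a concrete illustration (using a hypothetical table, not one from the test file): fast-path router planning applies when a query filters a distributed table's distribution column with a single constant value, so the query is known to hit exactly one shard.

```sql
-- Hypothetical table, shown only to sketch the kind of query that
-- qualifies for fast-path router planning.
CREATE TABLE orders (order_id bigint, customer_id int, total numeric);
SELECT create_distributed_table('orders', 'customer_id');

-- The distribution column is compared to a constant, so the planner can
-- route the query to a single shard without invoking the full planner.
SELECT total FROM orders WHERE customer_id = 42;
```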
DESCRIPTION: Check whether logical replication can be used before the rebalancer starts, and error out if needed
When the shard transfer mode is `auto`, check whether we can use logical replication before executing the shard moves, and error out up front if we can't. Before this PR, we used to error out in the middle of the shard moves:
```sql
set citus.shard_count = 4; -- just to get the error sooner
select citus_remove_node('localhost',9702);
create table t1 (a int primary key);
select create_distributed_table('t1','a');
create table t2 (a bigint);
select create_distributed_table('t2','a');
select citus_add_node('localhost',9702);
select rebalance_table_shards();
NOTICE: Moving shard 102008 from localhost:9701 to localhost:9702 ...
NOTICE: Moving shard 102009 from localhost:9701 to localhost:9702 ...
NOTICE: Moving shard 102012 from localhost:9701 to localhost:9702 ...
ERROR: cannot use logical replication to transfer shards of the relation t2 since it doesn't have a REPLICA IDENTITY or PRIMARY KEY
```
Now we check and error out in the beginning, without moving the shards.
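For context, here is a hedged sketch (not the internal check Citus performs) of how the tables that block logical replication could be spotted up front; in the example above, t2 would show up while t1 would not:

```sql
-- Distributed tables with neither a primary key nor another replica identity
-- cannot have their shards transferred via logical replication.
SELECT c.oid::regclass AS blocked_table
FROM pg_dist_partition p
JOIN pg_class c ON c.oid = p.logicalrelid
WHERE p.partkey IS NOT NULL          -- only tables with a distribution key
  AND (c.relreplident = 'n'
       OR (c.relreplident = 'd'      -- default replica identity: the primary key
           AND NOT EXISTS (SELECT 1 FROM pg_index i
                           WHERE i.indrelid = c.oid AND i.indisprimary)));
```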
fixes: #6727
ci/fix_styles.sh was complaining that the `black` and `isort` packages could not be found, even after `pipenv install --dev`, due to a broken lock file. I regenerated the lock file and now it works fine. We also wanted to upgrade the required Python version in the Pipfile.
Fixes #6672
2) Move all MERGE related routines to a new file merge_planner.c
3) Make ConjunctionContainsColumnFilter() static again, and rearrange the code in MergeQuerySupported()
4) Restore the original format in the comments section.
5) Add big serial test. Implement latest set of comments
This implements phase II of MERGE SQL support.
Support a routable query where all the tables in the MERGE SQL are distributed and co-located, and the source and target relations are joined on the distribution column with a constant qual. This should be a Citus single-task query. Below is an example.
SELECT create_distributed_table('t1', 'id');
SELECT create_distributed_table('s1', 'id', colocate_with => 't1');
MERGE INTO t1
USING s1 ON t1.id = s1.id AND t1.id = 100
WHEN MATCHED THEN
UPDATE SET val = s1.val + 10
WHEN MATCHED THEN
DELETE
WHEN NOT MATCHED THEN
INSERT (id, val, src) VALUES (s1.id, s1.val, s1.src)
Basically, MERGE checks to see if:
- There are a minimum of two distributed tables (a source and a target).
- All the distributed tables are indeed colocated.
- The MERGE relations are joined on the distribution column:
  MERGE .. USING .. ON target.dist_key = source.dist_key
- The query touches only a single shard, i.e. the join is ANDed with a constant qual:
  MERGE .. USING .. ON target.dist_key = source.dist_key AND target.dist_key = <constant>
If any of these conditions is not met, it raises an exception.
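As a hedged counter-example (reusing the hypothetical t1/s1 tables above), the following MERGE would be rejected in this phase, since the relations are not joined on the distribution column and there is no constant qual on it:

```sql
MERGE INTO t1
USING s1 ON t1.val = s1.val   -- joined on a non-distribution column
WHEN MATCHED THEN
    UPDATE SET val = s1.val + 10;
```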
(cherry picked from commit 44c387b978)
This implements MERGE phase III.
Support a pushdown query where all the tables in the MERGE SQL are Citus-distributed and co-located, and the source and target relations are joined on the distribution column. This will generate multiple tasks which execute independently after pushdown.
SELECT create_distributed_table('t1', 'id');
SELECT create_distributed_table('s1', 'id', colocate_with => 't1');
MERGE INTO t1
USING s1
ON t1.id = s1.id
WHEN MATCHED THEN
UPDATE SET val = s1.val + 10
WHEN MATCHED THEN
DELETE
WHEN NOT MATCHED THEN
INSERT (id, val, src) VALUES (s1.id, s1.val, s1.src)
The only exception for both phases II and III is that UPDATEs and INSERTs must target the same shard group as the join key; for example, the scenarios below are NOT supported, since the key value to be inserted/updated is not guaranteed to be on the same node as the distribution column.
MERGE INTO target t
USING source s ON (t.customer_id = s.customer_id)
WHEN NOT MATCHED THEN
INSERT (customer_id, …) VALUES (<non-local-constant-key-value>, …);
Or this scenario, where we update the distribution column itself:
MERGE INTO target t
USING source s ON (t.customer_id = s.customer_id)
WHEN MATCHED THEN
UPDATE SET customer_id = 100;
(cherry picked from commit fa7b8949a8)
In #6720 I'm adding a `pytest`-based testing framework. This adds the dependencies for it. They have already been [merged into our docker files][the-process-merge] in the the-process repo in preparation for #6720. But without them on our citus main branch it is impossible to make changes to the Pipfile, because our CI Dockerfiles and master would be out of sync.
Since #6720 will need some more discussion and might take a few more weeks to be merged, this splits the Pipfile changes out of it. By merging this PR we can unblock new Pipfile changes.
Unblocks and partially addresses #6766
[the-process-merge]: https://github.com/citusdata/the-process/pull/117
DESCRIPTION: Fixes (pg_dump/pg_upgrade) dependency loop warnings caused
by pg_depend entries inserted by citus_columnar
Fixes #5510.
In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because pg_dump doesn't see the objects we use for columnar-am related bookkeeping as dependencies of the tables using columnar-am.
To fix that, in #5456 we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend.
This helped us ensure the existence of a class of metadata objects --such as columnar.storageid_seq-- and helped fix #5437.
However, the normal-dependency edges that we added for indexes on columnar metadata tables --such as columnar.stripe_pkey-- didn't help at all, because they were in fact causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into account.
For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they don't break pg upgrades, but they were triggering some warning messages. And given that backporting a sql change into older versions is quite hard, we skip backporting this.
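For reference, a hedged sketch (not the exact SQL shipped in the citus_columnar extension scripts) of how the problematic edges could be listed in pg_depend before being deleted; it assumes the columnar access method is registered as `columnar` in pg_am:

```sql
-- List the "normal" (deptype = 'n') dependency edges that go from indexes on
-- columnar metadata tables (e.g. columnar.stripe_pkey) to the columnar access
-- method; these are the edges pg_dump warned about as dependency loops.
SELECT pg_describe_object(d.classid, d.objid, d.objsubid)          AS dependent,
       pg_describe_object(d.refclassid, d.refobjid, d.refobjsubid) AS referenced
FROM pg_depend d
JOIN pg_class idx ON d.classid = 'pg_class'::regclass AND idx.oid = d.objid
WHERE d.deptype = 'n'
  AND idx.relkind = 'i'
  AND d.refclassid = 'pg_am'::regclass
  AND d.refobjid = (SELECT oid FROM pg_am WHERE amname = 'columnar');
```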
This commit hides port numbers in upgrade_columnar_after, because the port numbers assigned to nodes in the upgrade schedule differ from the ones that the flaky test detector assigns.
When run_test.py is run for an upgrade_.*_after.sql test, automatically run the corresponding upgrade_.*_before.sql file first. This is because all those upgrade_.*_after.sql files, by definition, depend on the objects created in the upgrade_.*_before.sql files.
Decide core distribution params in CreateCitusTable to reduce the chances of creating Citus tables based on incorrect combinations of distribution method and replication model params.
Also introduce DistributedTableParams struct to encapsulate the parameters that are specific to distributed tables.
This is so that we can run the tests that require fake_fdw using the minimal schedule too.
Also move multi_create_fdw.sql up in multi_1_schedule to make it available to more tests.
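For orientation, a hedged sketch of the kind of setup a test-only FDW needs (the real multi_create_fdw.sql wires up a dedicated test handler, so the exact DDL differs):

```sql
-- A handler-less foreign data wrapper is enough for tests that only need to
-- define foreign tables without ever scanning them.
CREATE FOREIGN DATA WRAPPER fake_fdw;
CREATE SERVER fake_fdw_server FOREIGN DATA WRAPPER fake_fdw;
CREATE FOREIGN TABLE fake_table (a int) SERVER fake_fdw_server;
```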
Now that we will soon add another table type that has DISTRIBUTE_BY_NONE as its distribution method, and that we want the code to interpret such tables mostly as distributed tables, let's make the definition of the other two table types stricter by removing the CITUS_TABLE_WITH_NO_DIST_KEY macro.
Instead, use a HasDistributionKey() check in the places where the logic applies to all table types that have / don't have a distribution key. In future PRs, we might want to convert some of those HasDistributionKey() checks if the logic only applies to Citus local / reference tables, not the others.
Adding HasDistributionKey() also allows us to consider having DISTRIBUTE_BY_NONE as the distribution method as a "table attribute" that can apply to distributed tables too, rather than something that determines the table type.
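To illustrate the "table attribute" view through the user-visible metadata (a hedged sketch; the checks themselves live in C, not SQL): in pg_dist_partition the distribution method and the distribution key are recorded separately, so having DISTRIBUTE_BY_NONE is orthogonal to whether a distribution key exists.

```sql
-- partmethod is the distribution method ('h' = hash, 'n' = none, ...),
-- while partkey records the distribution column; HasDistributionKey()
-- roughly corresponds to "partkey IS NOT NULL" rather than to a table type.
SELECT logicalrelid, partmethod, partkey IS NOT NULL AS has_distribution_key
FROM pg_dist_partition;
```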
Split the main logic that allows creating a Citus table into the internal function CreateCitusTable().
The old CreateDistributedTable() function assumed that it was creating a reference table whenever the distribution method was DISTRIBUTE_BY_NONE. However, this will soon no longer be the case once we add support for creating single-shard distributed tables, because their distribution method will also be DISTRIBUTE_BY_NONE.
Now the internal function CreateCitusTable() doesn't make any assumptions about the table's replication model and the like. Instead, it expects callers to properly set all such metadata bits.
Moreover, some of the parameters the old CreateDistributedTable() takes --such as the shard count-- were not meaningful for a reference table, and the same would be true for the new table type.