citus

Commit Graph

Author	SHA1	Message	Date
jeff-davis	4f213f293e	Columnar: use generate_series for test rather than load. (#5181 )	2021-08-16 16:12:06 -07:00
Onur Tirtir	68f46c5dc9	Use scan context for intermediate mem allocs too	2021-08-16 11:06:03 +03:00
Burak Velioglu	4355ba0a38	Add CREATE INDEX ... ON ONLY and ALTER INDEX ... ATTACH PARTITION (#4938 #4980 ) - Add support for CRETE INDEX ... ON ONLY: Before that commit we were not sending "ONLY" option to the worker nodes at all. With this commit, "ONLY" parameter will be sent to the worker nodes if it is necessary. (#4938) - Add support for ALTER INDEX ... ATTACH PARTITION: Attach child_index to parent_index by creating same inheritance on shard level in addition to table level. (#4980)	2021-08-13 13:12:45 +03:00
Ahmet Gedemenli	9e90894f21	Synchronize hasmetadata flag on mx workers (#5086 ) * Synchronize hasmetadata flag on mx workers * Switch to sequential execution * Add test * Use SetWorkerColumn * Add test for stop_sync * Remove usage of UpdateHasmetadataOnWorkersWithMetadata * Remove MarkNodeMetadataSynced * Fix test for metadatasynced * Remove MarkNodeMetadataSynced * Style * Remove MarkNodeHasMetadata * Remove UpdateDistNodeBoolAttr * Refactor SetWorkerColumn * Use SetWorkerColumnLocalOnly when setting up dependencies * Use SetWorkerColumnLocalOnly in TriggerSyncMetadataToPrimaryNodes * Style * Make update command generator functions static * Set metadatasynced before syncing * Call SetWorkerColumn only if the sync is successful * Try to sync all nodes * Fix indexno * Update metadatasynced locally first * Break if a node fails to sync metadata * Send worker commands optional * Style & Rebase * Add raiseOnError param to SetWorkerColumn * Style * Set metadatasynced for all metadata nodes * Style * Introduce SetWorkerColumnOptional * Polish * Style * Dont send set command to not synced metadata nodes * Style * Polish * Add test for stop_sync * Add test for shouldhaveshards * Add test for isactive flag * Sort by placementid in the function verify_metadata * Cover edge cases for failing nodes * Add comments * Add nodeport to isactive test * Add warning if metadata out of sync * Update warning message	2021-08-12 14:16:18 +03:00
Onder Kalaci	5f02d18ef8	transactional metadata sync for maintanince daemon As we use the current user to sync the metadata to the nodes with #5105 (and many other PRs), there is no reason that prevents us to use the coordinated transaction for metadata syncing. This commit also renames few functions to reflect their actual implementation.	2021-08-09 10:34:55 +02:00
Onder Kalaci	35964c6366	Dropped columns do not diverge distribution column for partitioned tables Before this commit, creating a partition after a DROP column on the parent (position before dist. key) was leading to partition to have the wrong distribution column.	2021-08-06 13:36:12 +02:00
naisila	798a7902bf	Fix master_update_table_statistics scripts for 9.5	2021-08-03 18:15:56 +03:00
naisila	f9fa5a3d69	Fix master_update_table_statistics scripts for 9.4	2021-08-03 18:15:56 +03:00
Onder Kalaci	482b8096e9	Introduce citus_internal_update_relation_colocation update_distributed_table_colocation can be called by the relation owner, and internally it updates pg_dist_partition. With this commit, update_distributed_table_colocation uses an internal UDF to access pg_dist_partition. As a result, this operation can now be done by regular users on MX.	2021-08-03 11:44:58 +02:00
Onur Tirtir	93ebbb0607	Re-cost SeqPath's as well for columnar tables	2021-08-02 11:32:25 +03:00
Onur Tirtir	297f59a70e	Re-cost columnar table index paths	2021-08-02 11:16:37 +03:00
Onur Tirtir	73058d35cc	Not free (stripe) chunk buffers after de-serializing Previously, we were only using chunk group reader for sequential scan. However, to support index scans on columnar tables, now we use very same low level functions for index scan too. Since those low-level functions were only used for sequential scan, it was guaranteed that we would never read the same chunk group more than once, so we were freeing chunk buffers after deserializing them into a separate buffer. Now that we use those low level functions for index scan, we cannot free chunk buffers since it's possible to read the same chunk group again, such that: - read chunk group 1 of stripe 5 - read chunk group 2 of stripe 5 - read chunk group 1 of stripe 5 again Here, when we decide to read chunk group 1 for a second time, chunk group 1 is not cached. Plus, before this commit, we were freeing the chunk buffers for chunk group 1 after the first read and then we were getting segfault or errors from low-level de-compression APIs.	2021-08-02 11:00:12 +03:00
Onur Tirtir	83f5d42365	Use long-lasting mem cxt & optimize correlated index scan	2021-08-02 11:00:12 +03:00
Onur Tirtir	90e856d6bc	Keep supported indexes when converting table to columnar	2021-07-30 16:41:01 +03:00
SaitTalhaNisanci	4559d02c41	Fix union pushdown issue (#5079 ) * Fix UNION not being pushdown Postgres optimizes column fields that are not needed in the output. We were relying on these fields to understand if it is safe to push down a union query. This fix looks at the parse query, which has the original column fields to detect if it is safe to push down a union query. * Add more tests * Simplify code and make it more robust * Process varlevelsup > 0 in FindReferencedTableColumn * Only look for outers vars in union path * Add more comments * Remove UNION ALL specific logic for pulling up childvars	2021-07-29 13:52:55 +03:00
Jelte Fennema	2aa67421a7	Fix showing target shard size in the rebalance progress monitor (#5136 ) The progress monitor wouldn't actually update the size of the shard on the target node when using "block_writes" as the `shard_transfer_mode`. The reason for this is that the CREATE TABLE part of the shard creation would only be committed once all data was moved as well. This caused our size calculation to always return 0, since the table did not exist yet in the session that the progress monitor used. This is fixed by first committing creation of the table, and only then starting the actual data copy. The test output changes slightly. Apparently splitting this up in two transactions instead of one, increases the table size after the copy by about 40kB. The additional size used doesn't increase when with the amount of data in the table is larger (it stays ~40kB per shard). So this small change in test output is not considered an actual problem.	2021-07-23 16:37:00 +02:00
Jelte Fennema	7d0b6dc9be	Include data_type and cache in sequence definition on workers These two options were not included when creating the sequences on the workers as part of metadata syncing. The missing `data_type` part of the definition made finding the cause of #5126 harder than necessary, because of confusing errors.	2021-07-22 11:49:06 +02:00
Onder Kalaci	c8368e7929	Introduce citus_internal_delete_shard_metadata With this function, the owner of the table is allowed to remove shard metadata. This is going to be useful for tenant-isolation.	2021-07-19 13:25:05 +02:00
Jelte Fennema	adf17a8cf1	Add upgrade and dowgrade tests for Citus 10.2 (#5120 ) It seems we forgot to add this when starting 10.2 development.	2021-07-16 14:39:04 +02:00
Onder Kalaci	2c349e6dfd	Use current user to sync metadata Before this commit, we always synced the metadata with superuser. However, that creates various edge cases such as visibility errors or self distributed deadlocks or complicates user access checks. Instead, with this commit, we use the current user to sync the metadata. Note that, `start_metadata_sync_to_node` still requires super user because accessing certain metadata (like pg_dist_node) always require superuser (e.g., the current user should be a superuser). However, metadata syncing operations regarding the distributed tables can now be done with regular users, as long as the user is the owner of the table. A table owner can still insert non-sense metadata, however it'd only affect its own table. So, we cannot do anything about that.	2021-07-16 13:25:27 +02:00
Onur Tirtir	f00c63c33d	Support columnar table index builds with CONCURRENTLY option (#5032 ) With this commit, we add (`CREATE INDEX` / `REINDEX`) `CONCURRENTLY` support for columnar tables. For that, we implement `columnar_index_validate_scan` callback. The reasoning behind the implementation is as follows: * Postgres function `validate_index` provides all the TIDs that are currently in the index to `columnar_index_validate_scan` callback via a `tupleSort` object.. * We start scanning the table by using `columnar_getnextslot` as usual. Before moving forward, note that `columnar_getnextslot` guarantees to return tuples in the order of their TIDs. * For us to use during table scan, postgres provides a snapshot guaranteeing that any tuples that are valid according to that snapshot but are not in the index must be added to the index. * Then for each tuple that we read from our table, we continue iterating given `tupleSort` to find the first TID that is greater than or equal to our tuple's TID. If both TID's are equal to each other, then we skip the tuple since it's already indexed. If the TID that we read from tupleSort is greater then our tuple's TID, then we decide to insert this tuple into index.	2021-07-09 13:44:58 +03:00
Hanefi Onaldi	8e9cc229ff	Remove public schema dependency for 10.0 upgrades This commit contains a subset of the changes that should be cherry picked to 10.0 releases.	2021-07-09 02:08:22 +03:00
Ahmet Gedemenli	ed3b98a80b	Add failure test for stop_metadata_sync_to_node (#5102 )	2021-07-08 18:23:19 +03:00
Marco Slot	b14955c2bd	Fix PG upgrade scripts for 10.0	2021-07-05 14:38:20 +02:00
Marco Slot	3c0dfc12c0	Fix PG upgrade scripts for 9.5	2021-07-05 13:39:35 +02:00
Marco Slot	bee202aa39	Fix PG upgrade scripts for 9.4	2021-07-05 13:39:28 +02:00
Onur Tirtir	b118d4188e	Fix lower boundary calculation when pruning range dist table shards (#5082 ) This happens only when we have a "<" or "<=" filter on distribution column of a range distributed table and that filter falls in between two shards. When the filter falls in between two shards: If the filter is ">" or ">=", then UpperShardBoundary was returning "upperBoundIndex - 1", where upperBoundIndex is exclusive shard index used during binary seach. This is expected since upperBoundIndex is an exclusive index. If the filter is "<" or "<=", then LowerShardBoundary was returning "lowerBoundIndex + 1", where lowerBoundIndex is inclusive shard index used during binary seach. On the other hand, since lowerBoundIndex is an inclusive index, we should just return lowerBoundIndex instead of doing "+ 1". Before this commit, we were missing leftmost shard in such queries. * Remove useless conditional branches The branch that we delete from UpperShardBoundary was obviously useless. The other one in LowerShardBoundary became useless after we remove "+ 1" from there. This indeed is another proof of what & how we are fixing with this pr. * Improve comments and add more * Add some tests for upper bound calculation too	2021-07-02 14:48:21 +03:00
Ahmet Gedemenli	8bae58fdb7	Add parameter to cleanup metadata (#5055 ) * Add parameter to cleanup metadata * Set clear metadata default to true * Add test for clearing metadata * Separate test file for start/stop metadata syncing * Fix stop_sync bug for secondary nodes * Use PreventInTransactionBlock * DRemovedebuggiing logs * Remove relation not found logs from mx test * Revert localGroupId when doing stop_sync * Move metadata sync test to mx schedule * Add test with name that needs to be quoted * Add test for views and matviews * Add test for distributed table with custom type * Add comments to test * Add test with stats, indexes and constraints * Fix matview test * Add test for dropped column * Add notice messages to stop_metadata_sync * Add coordinator check to stop metadat sync * Revert local_group_id only if clearMetadata is true * Add a final check to see the metadata is sane * Remove the drop verbosity in test * Remove table description tests from sync test * Add stop sync to coordinator test * Change the order in stop_sync * Add test for hybrid (columnar+heap) partitioned table * Change error to notice for stop sync to coordinator * Sync at the end of the test to prevent any failures * Add test case in a transaction block * Remove relation not found tests	2021-07-01 16:23:53 +03:00
Sait Talha Nisanci	e7ed16c296	Not include to-be-deleted shards while finding shard placements Ignore orphaned shards in more places Only use active shard placements in RouterInsertTaskList Use IncludingOrphanedPlacements in some more places Fix comment Add tests	2021-06-28 13:05:31 +03:00
Naisila Puka	fe5907ad2d	Adds propagation of ALTER SEQUENCE and other improvements (#5061 ) * Alter seq type when we first use the seq in a dist table * Don't allow type changes when seq is used in dist table * ALTER SEQUENCE propagation * Tests for ALTER SEQUENCE propagation * Relocate AlterSequenceType and ensure dependencies for sequence * Support for citus local tables, and other fixes * Final formatting	2021-06-24 21:23:25 +03:00
Jelte Fennema	d1d386a904	Only allow moves of shards of distributed tables (#5072 ) Moving shards of reference tables was possible in at least one case: ```sql select citus_disable_node('localhost', 9702); create table r(x int); select create_reference_table('r'); set citus.replicate_reference_tables_on_activate = off; select citus_activate_node('localhost', 9702); select citus_move_shard_placement(102008, 'localhost', 9701, 'localhost', 9702); ``` This would then remove the reference table shard on the source, causing all kinds of issues. This fixes that by disallowing all shard moves except for shards of distributed tables. Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-06-23 16:25:46 +02:00
Onder Kalaci	75847d10b5	Add regression tests for changing column type with fkey closes https://github.com/citusdata/citus/issues/2337 as it doesn't apply anymore.	2021-06-23 09:03:55 +03:00
Onder Kalaci	55ed93bf0d	fix regression tests to avoid any conflicts in enterprise	2021-06-22 08:45:17 +03:00
Onder Kalaci	76ae5dd0db	Improve regression tests for prepared statements With a recent commit, we made (`644b266dee`) the behaviour of prepared statements for local cached plans has slightly changed. Now, Citus caches the plans when they are re-used. This make triggering of local cached plans on the 7th execution, and 8th execution is the first time the plan is used from the cached. So, the tests are improved to cover 8th execution.	2021-06-21 13:34:44 +03:00
Onder Kalaci	69ca943e58	Deparse/parse the local cached queries With local query caching, we try to avoid deparse/parse stages as the operation is too costly. However, we can do deparse/parse operations once per cached queries, right before we put the plan into the cache. With that, we avoid edge cases like (4239) or (5038). In a sense, we are making the local plan caching behave similar for non-cached local/remote queries, by forcing to deparse the query once.	2021-06-21 12:24:29 +03:00
Onur Tirtir	681f700321	Fix first_row_number test for stripe_row_limit enforcement	2021-06-17 10:51:43 +03:00
Onur Tirtir	3d11c0f9ef	Merge remote-tracking branch 'origin/master' into columnar-index Conflicts: src/test/regress/expected/columnar_empty.out src/test/regress/expected/multi_extension.out	2021-06-16 20:23:50 +03:00
Onur Tirtir	b6b969971a	Error out for CLUSTER commands on columnar tables	2021-06-16 20:06:33 +03:00
Onur Tirtir	9b4dc2f804	Prevent using parallel scan for columnar index builds	2021-06-16 19:59:32 +03:00
Onur Tirtir	10a762aa88	Implement columnar index support functions	2021-06-16 19:59:32 +03:00
Halil Ozan Akgul	db03afe91e	Bump citus version to 10.2devel	2021-06-16 17:44:05 +03:00
SaitTalhaNisanci	1784c7ef85	Merge branch 'master' into split_multi	2021-06-16 15:26:09 +03:00
Sait Talha Nisanci	c7d04e7f40	swap multi_schedule and multi_schedule_1	2021-06-16 14:40:14 +03:00
Sait Talha Nisanci	c55e44a4af	Drop table if exists	2021-06-16 14:19:59 +03:00
Naisila Puka	e26b29d3bb	Fix nextval('seq_name'::text) bug, and schema for seq tests (#5046 )	2021-06-16 13:58:49 +03:00
Marco Slot	a7e4d6c94a	Fix a bug that causes worker_create_or_alter_role to crash with NULL input	2021-06-15 20:07:08 +02:00
Onur Tirtir	a209999618	Enforce table opt constraints when using alter_columnar_table_set (#5029 )	2021-06-08 17:39:16 +03:00
Ahmet Gedemenli	089ef35940	Disable dropping and truncating known shards Add test for disabling dropping and truncating known shards	2021-06-02 14:30:27 +02:00
Jelte Fennema	503c70b619	Cleanup orphaned shards before moving when necessary A shard move would fail if there was an orphaned version of the shard on the target node. With this change before actually fail, we try to clean up orphaned shards to see if that fixes the issue.	2021-06-04 11:23:07 +02:00
Jelte Fennema	280b9ae018	Cleanup orphaned shards at the start of a rebalance In case the background daemon hasn't cleaned up shards yet, we do this manually at the start of a rebalance.	2021-06-04 11:23:07 +02:00
Jelte Fennema	7015049ea5	Add citus_cleanup_orphaned_shards UDF Sometimes the background daemon doesn't cleanup orphaned shards quickly enough. It's useful to have a UDF to trigger this removal when needed. We already had a UDF like this but it was only used during testing. This exposes that UDF to users. As a safety measure it cannot be run in a transaction, because that would cause the background daemon to stop cleaning up shards while this transaction is running.	2021-06-04 11:23:07 +02:00
Naisila Puka	0f37ab5f85	Fixes column default coming from a sequence (#4914 ) * Add user-defined sequence support for MX * Remove default part when propagating to workers * Fix ALTER TABLE with sequences for mx tables * Clean up and add tests * Propagate DROP SEQUENCE * Removing function parts * Propagate ALTER SEQUENCE * Change sequence type before propagation & cleanup * Revert "Propagate ALTER SEQUENCE" This reverts commit 2bef64c5a29f4e7224a7f43b43b88e0133c65159. * Ensure sequence is not used in a different column with different type * Insert select tests * Propagate rename sequence stmt * Fix issue with group ID cache invalidation * Add ALTER TABLE ALTER COLUMN TYPE .. precaution * Fix attnum inconsistency and add various tests * Add ALTER SEQUENCE precaution * Remove Citus hook * More tests Co-authored-by: Marco Slot <marco.slot@gmail.com>	2021-06-03 23:02:09 +03:00
Marco Slot	c03729ad03	Only warn about reference tables when removing last node	2021-06-01 10:53:12 +02:00
Hanefi Onaldi	056005db4d	Improve tests for truncating local data (#5012 ) We have a slightly different behavior when using truncate_local_data_after_distributing_table UDF on metadata synced clusters. This PR aims to add tests to cover such cases. We allow distributing tables with data that have foreign keys to reference tables only on metadata synced clusters. This is the reason why some of my earlier tests failed when run on a single node Citus cluster.	2021-06-03 08:51:32 +03:00
Ahmet Gedemenli	0fbddc740d	Fix shard id difference for enterprise	2021-06-01 17:17:46 +03:00
Ahmet Gedemenli	69d39c0e8b	Fix relname null bug when parallel execution	2021-06-01 14:14:35 +03:00
Ahmet Gedemenli	9638933d9d	Remove function GenerateNewTargetEntriesForSortClauses	2021-06-01 12:35:36 +03:00
SaitTalhaNisanci	c72d2b479b	Add tests for union pushdown workaround (#5005 )	2021-05-31 20:02:20 +02:00
SaitTalhaNisanci	8c3f85692d	Not consider old placements when disabling or removing a node (#4960 ) * Not consider old placements when disabling or removing a node * update cluster test	2021-05-28 22:38:20 +02:00
Hanefi Onaldi	878513f325	Remove all occurences of replication_model GUC	2021-05-21 16:14:59 +03:00
SaitTalhaNisanci	87e3a5e24a	Use 2PC when using a node connection (#4997 )	2021-05-21 14:58:53 +03:00
SaitTalhaNisanci	82f34a8d88	Enable citus.defer_drop_after_shard_move by default (#4961 ) Enable citus.defer_drop_after_shard_move by default	2021-05-21 10:48:32 +03:00
Jelte Fennema	10f06ad753	Fetch shard size on the fly for the rebalance monitor Without this change the rebalancer progress monitor gets the shard sizes from the `shardlength` column in `pg_dist_placement`. This column needs to be updated manually by calling `citus_update_table_statistics`. However, `citus_update_table_statistics` could lead to distributed deadlocks while database traffic is on-going (see #4752). To work around this we don't use `shardlength` column anymore. Instead for every rebalance we now fetch all shard sizes on the fly. Two additional things this does are: 1. It adds tests for the rebalance progress function. 2. If a shard move cannot be done because a source or target node is unreachable, then we error in stop the rebalance, instead of showing a warning and continuing. When using the by_disk_size rebalance strategy it's not safe to continue with other moves if a specific move failed. It's possible that the failed move made space for the next move, and because the failed move never happened this space now does not exist. 3. Adds two new columns to the result of `get_rebalancer_progress` which shows the size of the shard on the source and target node. Fixes #4930	2021-05-20 16:38:17 +02:00
Nils Dijk	a6c2d2a4c4	Feature: alter database owner (#4986 ) DESCRIPTION: Add support for ALTER DATABASE OWNER This adds support for changing the database owner. It achieves this by marking the database as a distributed object. By marking the database as a distributed object it will look for its dependencies and order the user creation commands (enterprise only) before the alter of the database owner. This is mostly important when adding new nodes. By having the database marked as a distributed object it can easily understand for which `ALTER DATABASE ... OWNER TO ...` commands to propagate by resolving the object address of the database and verifying it is a distributed object, and hence should propagate changes of owner ship to all workers. Given the ownership of the database might have implications on subsequent commands in transactions we force sequential mode for transactions that have a `ALTER DATABASE ... OWNER TO ...` command in them. This will fail the transaction with meaningful help when the transaction already executed parallel statements. By default the feature is turned off since roles are not automatically propagated, having it turned on would cause hard to understand errors for the user. It can be turned on by the user via setting the `citus.enable_alter_database_owner`.	2021-05-20 13:27:44 +02:00
Onder Kalaci	d07db99ea4	Make sure that target node in shard moves is eligable for shard move	2021-05-20 10:51:01 +02:00
Jelte Fennema	924959fdb1	Include result type in upgrade diff test (#4987 ) We often change result types of functions slightly. Our downgrade tests wouldn't notice these changes. This change adds them to the description of these items. An example of an SQL change that isn't caught without this change and is caught with the get_rebalance_progress change in this PR: https://github.com/citusdata/citus/pull/4963	2021-05-18 16:25:39 +02:00
Nils Dijk	c91f8d8a15	Feature: localhost guc (#4836 ) DESCRIPTION: introduce `citus.local_hostname` GUC for connections to the current node Citus once in a while needs to connect to itself for some systems operations. This used to be hardcoded to `localhost`. The hardcoded hostname causes some issues, for example in environments where `sslmode=verify-full` is required. It is not always desirable or even feasible to get `localhost` as an alt name on the certificate. By introducing a GUC to use when connecting to the current instance the user has more control what network path is used and what hostname is required to be present in the server certificate.	2021-05-12 16:59:44 +02:00
Hanefi Onaldi	13808b60cf	Update gitignore files	2021-05-12 09:49:07 +03:00
Jelte Fennema	cbbd10b974	Implement an improvement threshold in the rebalancer (#4927 ) Every move in the rebalancer algorithm results in an improvement in the balance. However, even if the improvement in the balance was very small the move was still chosen. This is especially problematic if the shard itself is very big and the move will take a long time. This changes the rebalancer algorithm to take the relative size of the balance improvement into account when choosing moves. By default a move will not be chosen if it improves the balance by less than half of the size of the shard. An extra argument is added to the rebalancer functions so that the user can decide to lower the default threshold if the ignored move is wanted anyway.	2021-05-11 14:24:59 +02:00
Onur Tirtir	2e419ea177	Add first_row_number column to columnar.stripe for tid mapping	2021-05-10 20:16:50 +03:00
jeff-davis	7b9aecff21	Columnnar: metapage changes. (#4907 ) * Columnar: introduce columnar storage API. This new API is responsible for the low-level storage details of columnar; translating large reads and writes into individual block reads and writes that respect the page headers and emit WAL. It's also responsible for the columnar metapage, resource reservations (stripe IDs, row numbers, and data), and truncation. This new API is not used yet, but will be used in subsequent forthcoming commits. * Columnar: add columnar_storage_info() for debugging purposes. * Columnar: expose ColumnarMetadataNewStorageId(). * Columnar: always initialize metapage at creation time. This avoids the complexity of dealing with tables where the metapage has not yet been initialized. * Columnar: columnar storage upgrade/downgrade UDFs. Necessary upgrade/downgrade step so that new code doesn't see an old metapage. * Columnar: improve metadata.c comment. * Columnar: make ColumnarMetapage internal to the storage API. Callers should not have or need direct access to the metapage. * Columnar: perform resource reservation using storage API. * Columnar: implement truncate using storage API. * Columnar: implement read/write paths with storage API. * Columnar: add storage tests. * Revert "Columnar: don't include stripe reservation locks in lock graph." This reverts commit `c3dcd6b9f8`. No longer needed because the columnar storage API takes care of concurrency for resource reservation. * Columnar: remove unnecessary lock when reserving. No longer necessary because the columnar storage API takes care of concurrent resource reservation. * Add simple upgrade tests for storage/ branch * fix multi_extension.out Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-05-10 20:16:46 +03:00
Ahmet Gedemenli	8cb505d6e1	Fix matview access method change issue (#4959 ) * Fix matview access method change issue * Use pg function get_am_name * Split view generation command into pieces	2021-05-07 15:47:24 +03:00
SaitTalhaNisanci	6b1904d37a	When moving a shard to a new node ensure there is enough space (#4929 ) * When moving a shard to a new node ensure there is enough space * Add WairForMiliseconds time utility * Add more tests and increase readability * Remove the retry loop and use a single udf for disk stats * Address review * address review Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2021-05-06 17:28:02 +03:00
Ahmet Gedemenli	bc818e76e2	Add notice log message for skipping child tables for optimization	2021-05-06 16:49:37 +03:00
Ahmet Gedemenli	2e0bb5c0c8	Fix nested select query with union bug	2021-05-05 20:35:00 +03:00
Jelte Fennema	50357db957	Simplify code that tests the shard rebalancer algorithm (#4925 ) This modifies the test code to use sane defaults instead of requiring all values to be specified in the test.	2021-05-03 15:47:19 +02:00
Marco Slot	4b49cb112f	Fix FROM ONLY queries on partitioned tables	2021-04-27 16:10:07 +02:00
Onur Tirtir	889ad6fa8c	Run some upgrade tests only when old version=9.0	2021-04-26 14:53:53 +03:00
Ahmet Gedemenli	332c5ce4ad	Fix worker partitioned size functions (#4922 )	2021-04-26 10:29:46 +03:00
Onder Kalaci	918838e488	Allow constant VALUES clauses in pushdown queries As long as the VALUES clause contains constant values, we should not recursively plan the queries/CTEs. This is a follow-up work of #1805. So, we can easily apply OUTER join checks as if VALUES clause is a reference table/immutable function.	2021-04-21 14:28:08 +02:00
Ahmet Gedemenli	33c620f232	Optimize partitioned disk size calculation (#4905 ) * Optimize partitioned disk size calculation * Polish * Fix test for citus_shard_cost_by_disk_size Try optimizing if not CSTORE	2021-04-19 13:30:56 +03:00
Onur Tirtir	96278822d9	Move columnar test helpers to a separate file (#4908 ) * Move columnar test helpers to another file * Rename column_store_memory_stats to columnar_store_memory_stats	2021-04-16 18:56:21 +03:00
Onder Kalaci	5b78f6cd63	Keep more execution statistics When DEBUG4 enabled, Citus now prints per task execution times.	2021-04-16 14:45:00 +02:00
Hanefi Onaldi	9919fbe3f8	Switch to sequential mode on long partition names This commit adds support for long partition names for distributed tables: - ALTER TABLE dist_table ATTACH PARTITION .. - CREATE TABLE .. PARTITION OF dist_table .. Note: create_distributed_table UDF does not support long table and partition names, and is not covered in this commit	2021-04-14 15:27:50 +03:00
Ahmet Gedemenli	e445e3d39c	Introduce 3 partitioned size udfs (#4899 ) * Introduce 3 partitioned size udfs * Add tests for new partition size udfs * Fix type incompatibilities * Convert UDFs into pure sql functions * Fix function comment	2021-04-13 17:36:27 +03:00
Ahmet Gedemenli	52e467a9a0	Error out if inheriting a distributed table (#4871 ) * Error out if inheriting a distributed table * Add test inheriting a distirbuted table	2021-04-07 11:21:06 +03:00
Ahmet Gedemenli	48a6a5b128	Add test for public shard not found issue	2021-04-06 10:29:17 +03:00
Halil Ozan Akgul	a5038046f9	Adds shard_count parameter to create_distributed_table	2021-03-29 16:22:49 +03:00
SaitTalhaNisanci	03832f353c	Drop postgres 11 support	2021-03-25 09:20:28 +03:00
Nils Dijk	787ee97867	Tests: foreign key non colocated tests (#4841 ) Earlier versions of Citus (pre 9.0) had a bug where a user was able to get in a situation where a foreign key between two non-colocated tables was allowed. This was caused by the wrongful scoping together with only setting to on of a boolean variable in a loop, causing the `true` from an earlier iteration to leak into a new iteration. This was 'by accident' solved in a refactor that was executed in the preparation of the 9.0 release. Only recently we had a user running into this and it was tracked down to this behaviour. Given the dire situation a user could get them self into when running into this bug we have backported a fix to the latest 8.3 release branch. To make sure this regression does not happen anymore in the future I propose we add the tests from the backport to our mainline. For reference: https://github.com/citusdata/citus/pull/4840	2021-03-22 15:33:56 +01:00
Ahmet Gedemenli	5e5db9eefa	Add udf citus_get_active_worker_nodes	2021-03-17 13:15:59 +03:00
Marco Slot	fbc2147e11	Replace MAX_PUT_COPY_DATA_BUFFER_SIZE by citus.remote_copy_flush_threshold GUC	2021-03-16 06:00:38 +01:00
Marco Slot	1646fca445	Add GUC to set maximum connection lifetime	2021-03-16 01:57:57 +01:00
Onur Tirtir	1d3e075e62	Support temporary columnar tables (#4766 )	2021-03-12 12:01:36 +03:00
Onder Kalaci	e65e72130d	Rename use -> shouldUse Because setting the flag doesn't necessarily mean that we'll use 2PC. If connections are read-only, we will not use 2PC. In other words, we'll use 2PC only for connections that modified any placements.	2021-03-12 08:29:43 +00:00
Onder Kalaci	6a7ed7b309	Do not trigger 2PC for reads on local execution Before this commit, Citus used 2PC no matter what kind of local query execution happens. For example, if the coordinator has shards (and the workers as well), even a simple SELECT query could start 2PC: ```SQL WITH cte_1 AS (SELECT * FROM test LIMIT 10) SELECT count(*) FROM cte_1; ``` In this query, the local execution of the shards (and also intermediate result reads) triggers the 2PC. To prevent that, Citus now distinguishes local reads and local writes. And, Citus switches to 2PC only if a modification happens. This may still lead to unnecessary 2PCs when there is a local modification and remote SELECTs only. Though, we handle that separately via #4587.	2021-03-12 08:29:43 +00:00
Onur Tirtir	874d5fd962	Remove foreign keys between columnar metadata tables (#4791 ) Postgres keeps AFTER trigger state for each transaction, because we can have deferred AFTER triggers which will be fired at the end of a transaction. Postgres cleans up this state at the end of transaction. Postgres processes ON COMMIT triggers after cleaning-up the AFTER trigger states. So if we fire any triggers in ON COMMIT, the AFTER trigger state won't be cleaned-up properly and the transaction state will be left in an inconsistent state, which might result in assertion failure. So with this commit, we remove foreign keys between columnar metadata tables and enforce constraints between them manually when dropping columnar tables.	2021-03-12 11:28:17 +03:00
Naisila Puka	196064836c	Skip 2PC for readonly connections in a transaction (#4587 ) * Skip 2PC for readonly connections in a transaction * Use ConnectionModifiedPlacement() function * Remove the second check of ConnectionModifiedPlacement() * Add order by to prevent flaky output * Test using pg_dist_transaction	2021-03-10 20:01:37 +03:00
Marco Slot	9c0d7f5c26	Add tests for modifying CTE and SELECT without FROM	2021-03-09 10:39:33 +01:00
SaitTalhaNisanci	aef7fc3a51	Ignore columnar generated test files (#4796 )	2021-03-09 10:52:08 +03:00
Philip Dubé	4e22f02997	Fix various typos due to zealous repetition	2021-03-04 19:28:15 +00:00
Onur Tirtir	1bb7a0a268	Fix chunk_group_consistency regression test view (#4765 )	2021-03-04 12:20:25 +03:00
Onur Tirtir	9728ce1167	Add tests for concurrent index deadlock issue (#4775 )	2021-03-04 11:56:54 +03:00
Onder Kalaci	d1cd198655	Prevent infinite recursion for queries that involve UNION ALL and JOIN With this commit, we make sure to prevent infinite recursion for queries in the format: [subquery with a UNION ALL] JOIN [table or subquery] Also, fixes a bug where we pushdown UNION ALL below a JOIN even if the UNION ALL is not safe to pushdown.	2021-03-03 12:27:26 +01:00
Naisila Puka	2f30614fe3	Reimplement citus_update_table_statistics to detect dist. deadlocks (#4752 ) * Reimplement citus_update_table_statistics * Update stats for the given table not colocation group * Add tests for reimplemented citus_update_table_statistics * Use coordinated transaction, merge with citus_shard_sizes functions * Update the old master_update_table_statistics as well	2021-03-03 04:12:30 +03:00
jeff-davis	9da9bd3dfd	Columnar: rename files and tests. (#4751 ) * Columnar: rename files and tests. * Columnar: Rename TableState to ColumnarState.	2021-03-01 08:34:24 -08:00
SaitTalhaNisanci	feee25dfbd	Use translated vars in postgres 13 as well (#4746 ) * Use translated vars in postgres 13 as well Postgres 13 removed translated vars with pg 13 so we had a special logic for pg 13. However it had some bug, so now we copy the translated vars before postgres deletes it. This also simplifies the logic. * fix rtoffset with pg >= 13	2021-02-26 19:41:29 +03:00
Halil Ozan Akgul	5c5cb200f7	Adds GRANT for public to citus_tables	2021-02-26 16:24:33 +03:00
Önder Kalacı	0fe26a216c	Prevent cross join without any target list entries (#4750 ) /* * The physical planner assumes that all worker queries would have * target list entries based on the fact that at least the column * on the JOINs have to be on the target list. However, there is * an exception to that if there is a cartesian product join and * there is no additional target list entries belong to one side * of the JOIN. Once we support cartesian product join, we should * remove this error. */	2021-02-26 11:04:21 +01:00
Onur Tirtir	54ac924bef	Grant read access for columnar metadata tables to unprivileged user	2021-02-26 12:31:09 +03:00
Onur Tirtir	dcc0207605	Add 10.0-2 schema version	2021-02-26 12:31:09 +03:00
Onur Tirtir	5ed954844c	Ensure table owner when using alter_columnar_table_set/alter_columnar_table_reset (#4748 )	2021-02-26 12:27:51 +03:00
Naisila Puka	5ebd4eac7f	Preserve colocation with procedures in alter_distributed_table (#4743 )	2021-02-25 19:52:47 +03:00
Hanefi Onaldi	5aff18b573	Fix flaky test	2021-02-24 17:09:08 +03:00
Hanefi Onaldi	7bebeb872d	Failing long table name tests	2021-02-24 03:35:27 +03:00
Naisila Puka	dbb88f6f8b	Fix insert query with CTEs/sublinks/subqueries etc (#4700 ) * Fix insert query with CTE * Add more cases with deferred pruning but false fast path * Add more tests * Better readability with if statements	2021-02-23 18:00:47 +03:00
Naisila Puka	105bb580e1	Add columnar regression tests (#4727 ) * Add cursor tests for columnar tables * Add columnar tests for data types w/out comp. operators * Add more prepared statements with columnar tables * Add constraint tests for columnar tables * Add row level security, detach partition and rename columnar tests * Add some ORDER BYs	2021-02-23 14:16:38 +03:00
Ahmet Gedemenli	1f345f65b4	Support dropping local table indexes along with a distributed index	2021-02-18 13:30:12 +03:00
Onur Tirtir	676d9a9726	Bump Citus to 10.1devel	2021-02-17 11:54:33 +03:00
Onur Tirtir	d61fd6e478	Decide changing sequence dependencies on MX nodes according to resulting relation (#4713 ) When executing alter_table / undistribute_table udf's, we should not try to change sequence dependencies on MX workers if new table wouldn't require syncing metadata. Previously, we were checking that for input table. But in some cases, the fact that input table requires syncing metadata doesn't imply the same for resulting table (e.g when undistributing a Citus table). Even more, doing that was giving an unexpected error when undistributing a Citus table so this commit actually fixes that.	2021-02-15 19:20:26 +03:00
SaitTalhaNisanci	bcbd24f8de	Only consider pseudo constants for shortcuts (#4712 ) It seems that we need to consider only pseudo constants while doing some shortcuts in planning. For example there could be a false clause but it can contribute to the result in which case it will not be a pseudo constant.	2021-02-15 18:39:37 +03:00
SaitTalhaNisanci	0f1ce7a913	Not skip relation in conversion if it doesn't have RelationRestriction (#4685 ) We would exclude tables without relationRestriction from conversion candidates in local-distributed table joins. This could leave a leftover local table which should have been converted to a subquery. Ideally I would expect that in each call to CreateDistributedPlan we would pass a new plan id, but that seems like a bigger change.	2021-02-12 12:33:55 +03:00
Hadi Moshayedi	e690d8b79b	Move stripe.chunk_count to last position	2021-02-11 17:00:44 -08:00
Jeff Davis	1f1c3c362b	Columnar: rename chunk_num -> chunk_group_num.	2021-02-11 09:27:00 -08:00
Onder Kalaci	f297c96ec5	Add regression tests for COPY into colocated intermediate results To add the tests without too much data, make the copy switchover configurable.	2021-02-11 15:41:06 +01:00
Onder Kalaci	5d5a357487	Do not connection re-use for intermediate results /* * Colocated intermediate results are just files and not required to use * the same connections with their co-located shards. So, we are free to * use any connection we can get. * * Also, the current connection re-use logic does not know how to handle * intermediate results as the intermediate results always truncates the * existing files. That's why, we use one connection per intermediate * result. */	2021-02-11 15:41:06 +01:00
Ahmet Gedemenli	c8e83d1f26	Fix dropping fkey when distributing table	2021-02-11 15:48:35 +03:00
SaitTalhaNisanci	847b79078f	Not consider subplans in restriction list (#4679 ) * Not consider subplans in restriction list * Not consider sublink, alternative subplan in restrictions	2021-02-11 15:04:07 +03:00
Onur Tirtir	ec7ab68f3b	Test adding local table with long name to metadata	2021-02-10 18:05:04 +03:00
Onur Tirtir	9f619a85d6	Fix EXPLAIN ANALYZE exec when query returns no cols (#4672 ) We do not include dummy column if original task didn't return any columns. Otherwise, number of columns that original task returned wouldn't match number of columns returned by worker_save_query_explain_analyze.	2021-02-10 17:59:47 +03:00
Hadi Moshayedi	52297804ae	Fix zero column tables	2021-02-09 23:05:11 -08:00
Hadi Moshayedi	2d09c76b76	Rename storageid to storage_id	2021-02-09 19:57:04 -08:00
Hadi Moshayedi	8270b598b6	Rename stripeid, chunkid, and attnum	2021-02-09 19:50:50 -08:00
Hadi Moshayedi	be90c20457	Fix write path for zero column tables	2021-02-09 14:14:06 -08:00
Hadi Moshayedi	c8d61a31e2	Columnar: chunk_group metadata table	2021-02-09 14:11:58 -08:00
Onder Kalaci	c804c9aa21	Allow local execution for intermediate results in COPY When COPY is used for copying into co-located files, it was not allowed to use local execution. The primary reason was Citus treating co-located intermediate results as co-located shards, and COPY into the distributed table was done via "format result". And, local execution of such COPY commands was not implemented. With this change, we implement support for local execution with "format result". To do that, we use the buffer for every file on shardState->copyOutState, similar to how local copy on shards are implemented. In fact, the logic is similar to local copy on shards, but instead of writing to the shards, Citus writes the results to a file. The logic relies on LOCAL_COPY_FLUSH_THRESHOLD, and flushes only when the size exceeds the threshold. But, unlike local copy on shards, in this case we write the headers and footers just once.	2021-02-09 15:00:06 +01:00
Hadi Moshayedi	eff8cffaf3	Columnar: improve naming of limit config variables. (#4653 ) * Rename chunk_row_count to chunk_group_row_limit * Rename stripe_row_count to stripe_row_limit * Undo couple of renames	2021-02-06 09:04:04 -08:00
Jeff Davis	b1882d4400	Columnar: Call nextval_internal instead of DirectFunctionCall.	2021-02-06 01:45:30 -08:00
Ahmet Gedemenli	5dd2a3da03	Convert RelabelTypes into CollateExprs in get_rule_expr function	2021-02-05 12:06:46 +03:00
Onder Kalaci	fc9a23792c	COPY uses adaptive connection management on local node With #4338, the executor is smart enough to failover to local node if there is not enough space in max_connections for remote connections. For COPY, the logic is different. With #4034, we made COPY work with the adaptive connection management slightly differently. The cause of the difference is that COPY doesn't know which placements are going to be accessed hence requires to get connections up-front. Similarly, COPY decides to use local execution up-front. With this commit, we change the logic for COPY on local nodes: Try to reserve a connection to local host. This logic follows the same logic (e.g., citus.local_shared_pool_size) as the executor because COPY also relies on TryToIncrementSharedConnectionCounter(). If reservation to local node fails, switch to local execution Apart from this, if local execution is disabled, we follow the exact same logic for multi-node Citus. It means that if we are out of the connection, we'd give an error.	2021-02-04 09:45:07 +01:00
Hadi Moshayedi	5fde617229	Columnar: disallow CREATE INDEX CONCURRENTLY	2021-02-03 12:10:00 -08:00
Jeff Davis	4043731c41	Columnar: fix inheritance planning.	2021-02-03 10:41:21 -08:00
Sait Talha Nisanci	24e60b44a1	Consider coordinator in intermediate result optimization It seems that we were not considering the case where coordinator was added to the cluster as a worker in the optimization of intermediate results. This could lead to errors when coordinator was added as a worker.	2021-02-03 20:02:03 +03:00
Onur Tirtir	c0f2817b70	Disallow using alter_table udfs with tables having any identity cols (#4635 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating table from scratch. For this reason, we don't allow using alter_table udfs with tables having any identity cols.	2021-02-03 19:33:54 +03:00
Onur Tirtir	3a403090fd	Disallow adding local table with identity column to metadata (#4633 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating shell relation. For this reason, we don't allow adding local tables -having identity cols- to metadata.	2021-02-03 19:05:17 +03:00
Onur Tirtir	5efb742f8a	Skip copying GENERATED ALWAYS AS STORED cols in ReplaceTable (#4616 ) Postgres doesn't allow inserting into columns having GENERATED ALWAYS AS (...) STORED expressions. For this reason, when executing undistribute_table or an alter_* udf, we should skip copying such columns. This is not bad since Postgres would already generate such columns.	2021-02-03 17:55:16 +03:00
jeff-davis	e03246dd45	Colummnar: mark custom scan path paralle_safe. (#4619 ) Enables an overall plan to be parallel (e.g. over a partition hierarchy), even though an individual ColumnarScan is not parallel-aware. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-02 11:56:00 -08:00
jeff-davis	e195af7e72	Columnar: always disable parallel paths. (#4617 ) Previously, if columnar.enable_custom_scan was false, parallel paths could remain, leading to an unexpected error. Also, ensure that cheapest_parameterized_paths is cleared if a custom scan is used. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-02 11:37:42 -08:00
Onur Tirtir	912d829757	Skip GENERATED AS ALWAYS STORED cols when processing cols owning sequences When finding columns owning sequences, we shouldn't rely on atthasdef since it might be true when column has GENERATED ALWAYS AS (...) STORED expression.	2021-02-02 18:17:42 +03:00
Onur Tirtir	c8a48c6eee	Not try to sync metadata for local tables (#4625 )	2021-02-02 15:12:12 +03:00
Hadi Moshayedi	bcb162976f	Fix #4608	2021-02-01 16:23:16 -08:00
Hadi Moshayedi	f5b1e49b79	Columnar: Fix lateral joins	2021-02-01 11:59:36 -08:00
Hadi Moshayedi	ef927688fa	Columnar: Fix ALTER TABLE ... ADD COLUMN.	2021-02-01 11:40:17 -08:00
Brian Bergeron	1253eeb9ff	Don't propagate ALTER ROLE SET when scoped to a different database (#4471 ) Co-authored-by: brberger <brberger@microsoft.com>	2021-02-01 15:49:26 +03:00
Hanefi Önaldı	cab17afce9	Introduce UDFs for fixing partitioned table constraint names	2021-01-29 17:32:20 +03:00
Hanefi Önaldı	92cf49b7e9	Limit shardId in partitioned table constraint names to only CHECK	2021-01-29 17:29:53 +03:00
SaitTalhaNisanci	738825cc38	Fix partition column index issue (#4591 ) * Fix partition column index issue We send column names to worker_hash/range_partition_table methods, and in these methods we check the column name index from tuple descriptor. Then this index is used to decide the bucket that the current row will be sent for the repartition. This becomes a problem when there are the same column names in the tupleDescriptor. Then we can choose the wrong index. Hence the partitioned data will be put to wrong workers. Then the result could miss some data because workers might contain different range of data. An example: TupleDescriptor contains "trip_id", "car_id", "car_id" for one table. It contains only "car_id" for the other table. And assuming that the tables will be partitioned by car_id, it is not certain what should be used for deciding the bucket number for the first table. Assuming value 2 goes to bucket 2 and value 3 goes to bucket 3, it is not certain which bucket "1 2 3" (trip_id, car_id, car_id) row will go to. As a solution we send the index of partition column in targetList instead of the column name. The old API is kept so that if workers upgrade work, it still works (though it will have the same bug) * Use the same method so that backporting is easier	2021-01-29 14:40:40 +03:00
SaitTalhaNisanci	1ba399f5ca	Fix a flaky behaviour in shared_connection_stats (#4596 ) With the previous query, we were not pushing down the pg_sleep hence the number of connections to a worker could be different from run to run.	2021-01-28 18:42:49 +03:00
Onder Kalaci	c7ea46067f	Add regression tests	2021-01-28 12:45:57 +01:00
Onur Tirtir	bb5962ee79	Early error out when creating citus local from a temp table (#4592 )	2021-01-28 14:18:06 +03:00
Halil Ozan Akgul	913aa91449	Adds error message to AlterTableSetAccessMethod for below PG12	2021-01-28 11:32:02 +03:00
Nils Dijk	07d3b4fd04	fix NaN cost estimate on empty columnar tables (#4593 ) Fixing a division by zero in the cost calculations for scanning a columnar table. Due to how the columns in a columnar table are counted an empty table would result in a division by zero. Instead this patch keeps the column selection ratio on zero when this happens, resulting in an accurate cost of zero pages to scan a columnar table. fixes #4589	2021-01-27 17:32:17 +01:00
Onur Tirtir	8151c4b443	Merge remote-tracking branch 'origin/master' into rename-create_citus_local_table	2021-01-27 17:08:58 +03:00
Ahmet Gedemenli	b2c1bbddd4	Merge branch 'master' into fix-dropping-mat-views-when-alter-table	2021-01-27 16:33:10 +03:00
Ahmet Gedemenli	35043c56f1	Fix dropping materialized views while doing alter table	2021-01-27 16:32:09 +03:00
Onur Tirtir	dfcdccd0e7	Rename udf in regression tests (as per prev commit)	2021-01-27 15:52:37 +03:00
Naisila Puka	94bc2703bc	Make undistribute_table() and citus_create_local_table() work with columnar (#4563 ) * Make undistribute_table() and citus_create_local_table() work with columnar * Rename and use LocallyExecuteUtilityTask for UDF check * Remove 'local' references in ExecuteUtilityCommand	2021-01-27 01:17:20 +03:00
Ahmet Gedemenli	e99f052904	Fix index renaming when creating citus local tables	2021-01-26 15:52:48 +03:00
Jeff Davis	d62e54dc09	Columnar: optimize write path.	2021-01-25 11:47:21 -08:00
Onur Tirtir	215d6630c3	Update foreign_key_to_reference_table so that test output doesn't change	2021-01-25 11:03:39 +03:00
Onur Tirtir	b5ea033a0b	Convert postgres tables to citus local when creating reference table having fkeys	2021-01-25 11:02:50 +03:00
Onur Tirtir	941c8fbf32	Automatically undistribute citus local tables when no more fkeys with reference tables (#4538 )	2021-01-22 18:15:41 +03:00
Marco Slot	03328e9679	Rename citus_tables column names to be query-friendly	2021-01-21 18:58:30 +01:00
Ahmet Gedemenli	3ac30ef9d8	Merge branch 'master' into remove-deprecated-gucs-udfs	2021-01-22 13:06:13 +03:00
Ahmet Gedemenli	76354ff563	Merge branch 'master' into remove-deprecated-gucs-udfs	2021-01-22 12:47:06 +03:00
Ahmet Gedemenli	887b67953b	Merge branch 'master' into fix-bug-create-citus-local-table-with-stats	2021-01-22 12:46:47 +03:00
Hadi Moshayedi	ff38996645	More meaningful columnar metadata table names	2021-01-21 21:29:07 -08:00
Önder Kalacı	9b39b25390	Prevent citus local table creation via remote execution (#4540 ) /* * Creating Citus local tables relies on functions that accesses * shards locally (e.g., ExecuteAndLogDDLCommand()). As long as * we don't teach those functions to access shards remotely, we * cannot relax this check. */	2021-01-21 11:26:45 +03:00
Onur Tirtir	433062e5d2	Add fkeys between citus local and reference tables in some tests (#4546 )	2021-01-20 19:30:20 +03:00
Ahmet Gedemenli	89a6fe83f7	Replace to update_distributed_table_colocation for tests	2021-01-20 17:30:06 +03:00
Ahmet Gedemenli	ceb6b503c0	Remove unused UDF mark_tables_colocated	2021-01-20 17:29:23 +03:00
Ahmet Gedemenli	2fa060a32d	Fix bug creating citus local table with stats	2021-01-20 17:17:13 +03:00
Halil Ozan Akgul	434f5af030	Adds same access method check	2021-01-20 15:18:03 +03:00
Hadi Moshayedi	8a5b6a43fc	Normalize citus_local_tables	2021-01-19 15:56:42 -08:00
Hadi Moshayedi	bc01c795a2	Reland #4419	2021-01-19 07:48:47 -08:00
Halil Ozan Akgul	27c2bd1599	Moves creation of ALTER INDEX STATISTICS commands next to index commands	2021-01-18 16:55:53 +03:00
Naisila Puka	7124a7715d	Skip 'already exists' in CREATE TABLE IF NOT EXISTS PARTITION OF (#4507 ) * Just skip 'already exists' in CT IF NOT EXISTS PARTITION OF * Generalize to tables that are not already distributed partitions	2021-01-18 15:56:02 +03:00
Onur Tirtir	f1ecbc3a53	Fix segfault when adding/dropping fkey from ref to citus local via remote exec (#4528 )	2021-01-17 20:43:33 +03:00
Onur Tirtir	5a3e8a6e24	Skip postgres tables for UndistributeTable(cascadeViaFKeys) (#4530 ) The reason behind skipping postgres tables is that we support foreign keys between postgres tables and reference tables (without converting postgres tables to citus local tables) when enable_local_reference_table_foreign_keys is false or when coordinator is not added to metadata.	2021-01-17 20:32:30 +03:00
Ahmet Gedemenli	107097ee28	Fix assert failure when creating statistics	2021-01-15 19:36:58 +03:00
Onur Tirtir	7dddfa2d0b	Not invalidate fkey cache if citus not installed (#4521 )	2021-01-15 18:31:43 +03:00
Onder Kalaci	30d0a65f40	Adds citus.enable_local_reference_table_foreign_keys When enabled any foreign keys between local tables and reference tables supported by converting the local table to a citus local table. When the coordinator is not in the metadata, the logic is disabled as foreign keys are not allowed in this configuration.	2021-01-15 18:04:52 +03:00
Onur Tirtir	e718d24868	Add support for CREATE TABLE commands defining foreign keys	2021-01-15 17:46:06 +03:00
Ahmet Gedemenli	9a100bcdb9	Remove unused GUCs Remove deprecated variables Remove GUC citus.sslmode Remove GUC citus.expire_cached_shards Remove GUC citus.task_tracker_delay Remove GUC citus.max_assign_task_batch_size Remove GUC citus.max_tracked_tasks_per_node Remove GUC citus.max_running_tasks_per_node Remove GUC citus.large_table_shard_count Remove GUC citus.max_task_string_size Remove GUC citus.binary_master_copy_format	2021-01-15 13:30:45 +03:00
Onur Tirtir	787ed643dd	Undistribute table when cascade_via_foreign_keys=true even if rel has no fkeys (#4516 ) If relation is not involved in any foreign key relationships, foreign key graph would not return any relations for given relationId as expected. But even if it's the case, we should still undistribute the table itself.	2021-01-15 12:45:44 +03:00
Onur Tirtir	36b418982f	Add support for ALTER TABLE commands defining foreign keys	2021-01-14 17:12:00 +03:00
Marco Slot	b840e97cd6	Add a alter_old_partitions_set_access_method UDF	2021-01-14 10:44:14 +01:00
Ahmet Gedemenli	9b56ad48cb	Recreate invalidation functions for Citus10 Fix multi_create_table Add schema name to altered functions Recreate invalidation functions when downgrading	2021-01-13 23:18:07 +03:00
jeff-davis	ec319faa43	Only allow columnar tables with permanent storage (#4492 ). (#4495 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-13 10:37:34 -08:00
jeff-davis	b49beda4c3	Stronger check for triggers on columnar tables (#4493 ). (#4494 ) * Stronger check for triggers on columnar tables (#4493). Previously, we used a simple ProcessUtility_hook. Change to use an object_access_hook instead. * Replace alter_table_set_access_method test on partition with foreign key Co-authored-by: Jeff Davis <jefdavi@microsoft.com> Co-authored-by: Marco Slot <marco.slot@gmail.com>	2021-01-13 10:30:53 -08:00
Marco Slot	de6aaaa648	Expand support for subqueries in target list through recursive planning	2021-01-13 17:26:09 +01:00
Onur Tirtir	ccbc3de535	Enable reference/distributed table creation from citus local tables	2021-01-13 17:14:26 +03:00
Halil Ozan Akgul	2be14cce2e	Adds alter_distributed_table and alter_table_set_access_method UDFs	2021-01-13 16:02:39 +03:00
SaitTalhaNisanci	724d56f949	Add citus shard helper view (#4361 ) With citus shard helper view, we can easily see: - where each shard is, which node, which port - what kind of table it belongs to - its size With such a view, we can see shards that have a size bigger than some value, which could be useful. Also debugging can be easier in production as well with this view. Fetch shards in one go per node The previous implementation was slow because it would do a lot of round trips, one per shard to be exact. Hence it is improved so that we fetch all the shard_name, shard-size pairs per node in one go. Construct shards_names, sizes query on coordinator	2021-01-13 13:58:47 +03:00
Önder Kalacı	7e0826a06b	Make sure that materialized views that contains only (#4499 ) Make sure that materialized views that contains only intermediate results work fine.	2021-01-13 13:17:43 +03:00
Ahmet Gedemenli	436c9d9d79	Remove the word 'master' from Citus UDFs (#4472 ) * Replace master_add_node with citus_add_node * Replace master_activate_node with citus_activate_node * Replace master_add_inactive_node with citus_add_inactive_node * Use master udfs in old scripts * Replace master_add_secondary_node with citus_add_secondary_node * Replace master_disable_node with citus_disable_node * Replace master_drain_node with citus_drain_node * Replace master_remove_node with citus_remove_node * Replace master_set_node_property with citus_set_node_property * Replace master_unmark_object_distributed with citus_unmark_object_distributed * Replace master_update_node with citus_update_node * Replace master_update_shard_statistics with citus_update_shard_statistics * Replace master_update_table_statistics with citus_update_table_statistics * Rename master_conninfo_cache_invalidate to citus_conninfo_cache_invalidate Rename master_dist_local_group_cache_invalidate to citus_dist_local_group_cache_invalidate * Replace master_copy_shard_placement with citus_copy_shard_placement * Replace master_move_shard_placement with citus_move_shard_placement * Rename master_dist_node_cache_invalidate to citus_dist_node_cache_invalidate * Rename master_dist_object_cache_invalidate to citus_dist_object_cache_invalidate * Rename master_dist_partition_cache_invalidate to citus_dist_partition_cache_invalidate * Rename master_dist_placement_cache_invalidate to citus_dist_placement_cache_invalidate * Rename master_dist_shard_cache_invalidate to citus_dist_shard_cache_invalidate * Drop master_modify_multiple_shards * Rename master_drop_all_shards to citus_drop_all_shards * Drop master_create_distributed_table * Drop master_create_worker_shards * Revert old function definitions * Add missing revoke statement for citus_disable_node	2021-01-13 12:10:43 +03:00
Onur Tirtir	dd55ab394e	Disallow cascade_via_foreign_keys if any partition rel has non-inherited fkeys (#4487 )	2021-01-11 21:50:09 +03:00
Marco Slot	d900a7336e	Automatically add placeholder record for coordinator	2021-01-08 15:09:53 +01:00
Marco Slot	597533b1ff	Add citus_set_coordinator_host	2021-01-08 13:36:26 +01:00
Marco Slot	e7f13978b5	Add a view for simple (time) partitions and their access methods	2021-01-08 11:28:15 +01:00
Onur Tirtir	5289785da4	Add cascade_via_foreign_keys option to create_citus_local_table (#4462 )	2021-01-08 15:13:26 +03:00
Marco Slot	011283122b	Add the shard rebalancer implementation	2021-01-07 16:51:55 +01:00
Onur Tirtir	f3801143fb	Add cascade option to undistribute_table	2021-01-07 15:41:49 +03:00
Marco Slot	47c1b19174	Revert "Do metadata sync in a separate background worker." This reverts commit `4df723cf9b`.	2021-01-07 10:30:04 +01:00
Marco Slot	5de3337b2f	Support local execution for INSERT..SELECT with re-partitioning	2021-01-06 16:15:53 +01:00
Naisila Puka	bcfc0aa4e9	Rethrow original concurrent index creation failure message (#4469 ) * Rethrow original concurrent index creation failure message * Alter test outputs for concurrent index creation * Detect duplicate table failure in concurrent index creation * Add test for conc. index creation w/out duplicates	2021-01-06 15:27:13 +03:00
Ahmet Gedemenli	1f36ff7c17	Prevent deadlock for long named partitioned index creation on single node (#4461 ) * Prevent deadlock for long named partitioned index creation on single node * Create IsSingleNodeCluster function * Use both local and sequential execution	2021-01-05 13:39:13 +03:00
Ahmet Gedemenli	f27649754b	Add alter index set statistics support (#4455 ) * Add alter index set statistics support * Use attNum instead of attName	2021-01-05 13:23:11 +03:00
Onur Tirtir	87e5276bdd	Fix fkey graph test for self reference (#4450 )	2020-12-28 12:47:39 +03:00
Naisila Puka	04aeb6938b	Merge branch 'master' into issue4237	2020-12-25 12:36:40 +03:00
Hadi Moshayedi	4df723cf9b	Do metadata sync in a separate background worker.	2020-12-24 08:25:55 -08:00
Naisila Puka	0bb2c991f9	Merge branch 'master' into issue4237	2020-12-24 18:05:27 +03:00
Ahmet Gedemenli	5af585269a	Add separate pg13 test for stats targets	2020-12-24 18:01:25 +03:00
naisila	59a81491e8	Add test for master_create_empty_shard on coordinator	2020-12-24 17:59:40 +03:00
Ahmet Gedemenli	d4bc17f6f0	Propagate statistics with altered targets	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	f7c70f9a63	Propagate alter stats target	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	5a1607b6c0	Propagate alter stats schema	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	bdce4a7e67	Propagate rename statistics	2020-12-24 17:10:12 +03:00
Onur Tirtir	5ed9197041	Implement infra to get foreign key connected relations (#4439 ) On top of our foreign key graph, implement the infrastructure to get list of relations that are connected to input relation via a foreign key graph. We need this to support cascading create_citus_local_table & undistribute_table operations. Also add regression tests to see what our foreign key graph is able to capture currently.	2020-12-24 16:42:40 +03:00
Onur Tirtir	57e7defa3c	Support CREATE INDEX commands without index name on citus tables (#4273 )	2020-12-23 23:15:39 +03:00
Marco Slot	e3dcc278e0	Remove upgrade_to_reference_table UDF	2020-12-23 00:40:14 +01:00
jeff-davis	90d63cb792	Add columnar pg_dump test. (#4433 )	2020-12-22 15:57:35 -08:00
Ahmet Gedemenli	874fa1fc09	Propagate Drop Statistics	2020-12-22 18:34:46 +03:00
Marco Slot	321cc784c7	Collapse Citus 7.* scripts into Citus 8.0-1	2020-12-21 22:55:51 +01:00
Hadi Moshayedi	dde0323b57	Columnar: enable zstd & lz4 compilation by default (#4402 ) * Columnar: enable zstd & lz4 compilation by default * Make zstd & lz4 tests more consistent * Don't require lz4 & zstd for postgres 11 Co-authored-by: Nils Dijk <nils@citusdata.com>	2020-12-21 12:11:58 -08:00
Onur Tirtir	cceaf31e4c	Add some more tests with views to test recursive planning on views (#4427 ) (cherry picked from commit `51f422f3c6`)	2020-12-21 11:53:37 +03:00
jeff-davis	49281202af	Add simple follower test for columnar. (#4432 )	2020-12-18 13:59:20 -08:00
jeff-davis	3e0f1aaaab	Prevent inserting into logically-replicated columnar table. (#4429 )	2020-12-18 12:29:30 -08:00
Marco Slot	f2056e553f	Expose partition column of subqueries in optimizer (#4355 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-12-18 20:32:52 +01:00
SaitTalhaNisanci	145112f3a0	Fix attribute numbers in subquery conversions (#4426 ) Attribute number in a subquery RTE and relation RTE means different things. In a relation attribute number will point to the column number in the table definition including the dropped columns as well however in subquery, it means the index in the target list. When we convert a relation RTE to subquery RTE we should either correct all the relevant attribute numbers or we can just add a dummy column for the dropped columns. We choose the latter in this commit because it is practically too vulnerable to update all the vars in a query. Another thing this commit fixes is that in case a join restriction clause list contains a false clause, we should just returns a false clause instead of the whole list, because the whole list will contain restrictions from other RTEs as well and this breaks the query, which can be seen from the output changes, now it is much simpler. Also instead of adding single tests for dropped columns, we choose to run the whole mixed queries with tables with dropped columns, this revealed some bugs already, which are fixed in this commit.	2020-12-18 20:25:41 +03:00
Nils Dijk	a748729998	rework ci	2020-12-18 18:04:45 +01:00
Ahmet Gedemenli	770d3da1ca	Add dependencies for stat schemas	2020-12-18 17:04:13 +03:00
Ahmet Gedemenli	6c0465566a	Propagate create statistics	2020-12-17 20:38:36 +03:00
Marco Slot	1e2518f83c	Add tests for router queries with catalog tables (#4422 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-12-17 15:07:50 +01:00
Marco Slot	100e5d3196	Address review feedback	2020-12-15 15:23:38 +01:00
Marco Slot	23dccd8941	Add some new tests for complex correlated subqueries in WHERE	2020-12-15 14:17:16 +01:00
Marco Slot	707a6554b1	Support co-located/recurring correlated subqueries	2020-12-15 14:17:16 +01:00
Sait Talha Nisanci	181a7e1d36	Skip dropped columns	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	7951273f74	Refactor WrapRteRelationIntoSubquery	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	0e53aa5d3b	Add more tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	f5dd5379b2	Add more tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	3aed6c3ad0	Rename containsOnlyLocalTable as isLocalTableModification Update error message in Modify View	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	13c43d5744	Improve table conversion logic in dist-local joins	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	5618f3a3fc	Use BaseRestrictInfo for finding equality columns Baseinfo also has pushed down filters etc, so it makes more sense to use BaseRestrictInfo to determine what columns have constant equality filters. Also RteIdentity is used for removing conversion candidates instead of rteIndex.	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	69992d58f9	Add broken local-dist table modifications tests It seems that most of the updates were broken, we weren't aware of it because there wasn't any data in the tables. They are broken mostly because local tables do not have a shard id and some code paths should be updated with that information, currently when there is an invalid shard id, it is assumed to be pruned. Consider local tables in router planner In case there is a local table, the shard id will not be valid and there are some checks that rely on shard id, we should skip these in case of local tables, which is handled with a dummy placement. Add citus local table dist table join tests add local-dist table mixed joins tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	2a44029aaf	Simplify ContainsTableToBeConvertedToSubquery AllDataLocallyAccessible and ContainsLocalTableSubqueryJoin are removed. We can possibly remove ModifiesLocalTableWithRemoteCitusLocalTable as well. Though this removal has a side effect that now when all the data is locally available, we could still wrap a relation into a subquery, I guess that should be resolved in the router planner itself. Add more tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	26d9f0b457	Use auto mode in tests and fix debug message	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	3bd53a24a3	Support update on postgres table from citus local table	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	4b6611460a	Support foreign table joins as well	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	7e9204eba9	Update vars in quals while wrapping RTE to subquery When we wrap an RTE to subquery we are updating the variables varno's as 1, however we should also update the varno's of vars in quals. Also some other small code quality improvements are done.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	0689f2ac1a	Recursively plan distributed tables only if all have unique filters The previous algorithm was not consistent and it could convert different RTEs based on the table orders in the query. Now we convert local tables if there is a distributed table which doesn't have a unique index. So if there are 4 tables, local1, local2, dist1, dist2_with_pkey then we will convert local1 and local2 in `auto` mode. Converting a distributed table is not that logical because as there is a distributed table without a unique index, we will need to convert the local tables anyway. So converting the distributed table with pkey is redundant.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	a008fc611c	Support materialized view joins as well	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	5f46abffd9	Update check multi tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	eebcd995b3	Add some more tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	5693cabc41	Not convert an already routable plannable query We should not recursively plan an already routable plannable query. An example of this is (SELECT * FROM local JOIN (SELECT * FROM dist) d1 USING(a)); So we let the recursive planner do all of its work and at the end we convert the final query to to handle unsupported joins. While doing each conversion, we check if it is router plannable, if so we stop. Only consider range table entries that are in jointree If a range table is not in jointree then there is no point in considering that because we are trying to convert range table entries to subqueries for join use case.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	2ff65f3630	Enable partitioned distributed tables in local-dist table joins	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	44953579cf	Enable citus-local distributed table joins Check equality in quals We want to recursively plan distributed tables only if they have an equality filter on a unique column. So '>' and '<' operators will not trigger recursive planning of distributed tables in local-distributed table joins. Recursively plan distributed table only if the filter is constant If the filter is not a constant then the join might return multiple rows and there is a chance that the distributed table will return huge data. Hence if the filter is not constant we choose to recursively plan the local table.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	f3d55448b3	Choose distributed table if it has a unique index in filter When doing local-distributed table joins we convert one of them to subquery. The current policy is that we convert distributed tables to subquery if it has a unique index on a column that has unique index(primary key also has a unique index).	2020-12-15 18:17:10 +03:00
Onder Kalaci	945193555b	add basic regression tests	2020-12-15 18:17:10 +03:00
Onder Kalaci	594e001f3b	Add filter pushdown regression tests Also handle WHERE false	2020-12-15 18:17:10 +03:00
Onder Kalaci	82a4830c7d	Adjust the existing regression tests	2020-12-15 18:17:10 +03:00
Marco Slot	f2538a456f	Support co-located/recurring sublinks in the target list	2020-12-13 15:45:24 +01:00
Hadi Moshayedi	4dd22cc4e4	Columnar: Fix ANALYZE for large number of rows.	2020-12-10 09:52:33 -08:00
Hadi Moshayedi	b3dac5e9d1	Columnar: set default compression as zstd if available	2020-12-09 14:32:08 -08:00
Hadi Moshayedi	4668fe51a6	Columnar: Make compression level configurable	2020-12-09 08:48:50 -08:00
Hadi Moshayedi	f5a4a4bc74	Columnar: Support zstd compression	2020-12-09 08:30:55 -08:00
Hadi Moshayedi	3f81ee26fd	Columnar: Support LZ4 compression	2020-12-09 08:29:07 -08:00
jeff-davis	260a02180b	Add tests for unsupported columnar storage features (#4397 ) Add negative tests: * Deletes * Sample scan * Special columns * Tuple locks * Indexes	2020-12-09 00:08:45 -08:00
Jeff Davis	c91e5b052b	more test fixups	2020-12-07 13:43:27 -08:00
Jeff Davis	7169ba21c4	more test fixes	2020-12-07 13:36:46 -08:00
Jeff Davis	3758e83850	Rename cstore->columnar in SQL objects and errors.	2020-12-07 13:01:53 -08:00
Jeff Davis	ad919ff220	Tests for UPDATE and error message improvement. UPDATEs on partitioned tables that affect only row partitions should succeed, the rest should fail. Also rename CStoreScan to ColumnarScan to make the error message more relevant.	2020-12-07 11:25:30 -08:00
Ahmet Gedemenli	936775e8e3	Delete transactions when removing node With this commit, we delete entries in pg_dist_transaction for the primary nodes that are removed by `master_remove_node`.	2020-12-07 11:35:20 +03:00
Hadi Moshayedi	01da2a1c73	Columnar: track decompressed length in metadata	2020-12-04 09:09:39 -08:00
Onder Kalaci	bd9827aed9	Add regression tests with different data types We typically do not test Citus with these uncommon data types. Now, we already have the tests for ADF integration, add it to regression tests as well.	2020-12-04 10:25:00 +03:00
Hadi Moshayedi	4a9aebaa7b	Columnar: rename block to chunk	2020-12-03 08:50:19 -08:00
Hadi Moshayedi	24bfd368a9	Columnar: Fix VACUUM for empty tables	2020-12-03 08:46:09 -08:00
Marco Slot	c9b658daea	Add a public.citus_tables view	2020-12-03 17:31:40 +01:00
Marco Slot	4098d33acb	Allow citus size functions on replicated tables	2020-12-03 16:33:24 +01:00
Marco Slot	c69ea2512a	Fix flappy failure test	2020-12-03 13:54:02 +01:00
Onder Kalaci	c546ec5e78	Local node connection management When Citus needs to parallelize queries on the local node (e.g., the node executing the distributed query and the shards are the same), we need to be mindful about the connection management. The reason is that the client backends that are running distributed queries are competing with the client backends that Citus initiates to parallelize the queries in order to get a slot on the max_connections. In that regard, we implemented a "failover" mechanism where if the distributed queries cannot get a connection, the execution failovers the tasks to the local execution. The failover logic is follows: - As the connection manager if it is OK to get a connection - If yes, we are good. - If no, we fail the workerPool and the failure triggers the failover of the tasks to local execution queue The decision of getting a connection is follows: /* * For local nodes, solely relying on citus.max_shared_pool_size or * max_connections might not be sufficient. The former gives us * a preview of the future (e.g., we let the new connections to establish, * but they are not established yet). The latter gives us the close to * precise view of the past (e.g., the active number of client backends). * * Overall, we want to limit both of the metrics. The former limit typically * kics in under regular loads, where the load of the database increases in * a reasonable pace. The latter limit typically kicks in when the database * is issued lots of concurrent sessions at the same time, such as benchmarks. */	2020-12-03 14:16:13 +03:00
Hadi Moshayedi	c2f60b6422	Columnar: pg_upgrade support (#4354 )	2020-12-02 08:46:59 -08:00
Ahmet Gedemenli	5242dcfe99	Add tests for propagating alter schema rename	2020-12-02 15:18:26 +03:00
Nils Dijk	6f9c040f76	DESCRIPTION: Propagate columnar table settings for distributed tables When distributing a columnar table, as well as changing options on a distributed columnar table, this patch will forward the settings from the coordinator to the workers. For propagating options changes on an already distributed table this change is pretty straight forward. Before applying the change in options locally we will create a `DDLJob` that contains a call to `alter_columnar_table_set(...)` for every shard placement with all settings of the current table. This goes both for setting an option as well as resetting. This will reset the values to the defaults configured on the coordinator. Having the effect that the coordinator is authoritative on the settings and makes sure the shards have the same settings set as the table on the coordinator. When a columnar table is distributed it is using the `TableDDLCommand` infra structure to create a new kind of `TableDDLCommand`. This new type, called a `TableDDLCommandFunction` contains a context and 2 function pointers to execute. One function returns the command as applied on the table, the second function will return the sql command to apply to a shard with a given shard id. The schema name is ignored as it will use the fully qualified name of the shard in the same schema as the base table.	2020-12-02 13:02:42 +01:00
Halil Ozan Akgül	ef0914a7f8	Adds ORDER BY to flaky test (#4305 ) Co-authored-by: Önder Kalacı <onder@citusdata.com>	2020-12-02 14:24:05 +03:00
Onder Kalaci	f7e1aa3f22	Multi-row INSERTs use local execution when placements are local Multi-row execution already uses sequential execution. When shards are local, using local execution is profitable as it avoids an extra connection establishment to the local node.	2020-12-01 21:37:59 +03:00
Marco Slot	48caca4084	Improve regression test settings	2020-11-30 20:34:03 +01:00
Ahmet Gedemenli	8e5f0487eb	Add order by for flaky test	2020-12-01 10:54:52 +03:00
Ahmet Gedemenli	67761897ab	Add test for citus table size func in transaction with modification Add test for citus_relation_size	2020-12-01 10:38:15 +03:00
Hadi Moshayedi	feecb7b423	Columnar: few fixes (#4371 ) * Columnar: fix a memory issue * Columnar: no need for deferred triggers * Columnar: relax memory growth constraints	2020-11-30 18:09:43 -08:00
Hadi Moshayedi	a94e8c9cda	Associate column store metadata with storage id (#4347 )	2020-11-30 18:01:43 -08:00
SaitTalhaNisanci	c31a8df380	Call 6 times not 7 in subquery_prepared_statements (#4357 )	2020-11-30 21:20:51 +03:00
Nils Dijk	383e334023	refactor options to their own table linked to the regclass (#4346 ) Columnar options were by accident linked to the relfilenode instead of the regclass/relation oid. This PR moves everything related to columnar options to their own catalog table.	2020-11-27 11:22:08 -08:00
Onder Kalaci	629ecc3dee	Add the infrastructure to count the number of client backends Considering the adaptive connection management improvements that we plan to roll soon, it makes it very helpful to know the number of active client backends. We are doing this addition to simplify yhe adaptive connection management for single node Citus. In single node Citus, both the client backends and Citus parallel queries would compete to get slots on Postgres' `max_connections` on the same Citus database. With adaptive connection management, we have the counters for Citus parallel queries. That helps us to adaptively decide on the remote executions pool size (e.g., throttle connections if necessary). However, we do not have any counters for the total number of client backends on the database. For single node Citus, we should consider all the client backends, not only the remote connections that Citus does. Of course Postgres internally knows how many client backends are active. However, to get that number Postgres iterates over all the backends. For examaple, see [pg_stat_get_db_numbackends](`8e90ec5580/src/backend/utils/adt/pgstatfuncs.c (L1240)`) where Postgres iterates over all the backends. For our purpuses, we need this information on every connection establishment. That's why we cannot affort to do this kind of iterattion.	2020-11-25 19:19:24 +01:00
Ahmet Gedemenli	a64dc8a72b	Fixes a bug preventing INSERT SELECT .. ON CONFLICT with a constraint name on local shards Separate search relation shard function Add tests	2020-11-25 15:10:46 +03:00
Önder Kalacı	c760cd3470	Move local execution after remote execution (#4301 ) * Move local execution after the remote execution Before this commit, when both local and remote tasks exist, the executor was starting the execution with local execution. There is no strict requirements on this. Especially considering the adaptive connection management improvements that we plan to roll soon, moving the local execution after to the remote execution makes more sense. The adaptive connection management for single node Citus would look roughly as follows: - Try to connect back to the coordinator for running parallel queries. - If succeeds, go on and execute tasks in parallel - If fails, fallback to the local execution So, we'll use local execution as a fallback mechanism. And, moving it after to the remote execution allows us to implement such further scenarios.	2020-11-24 13:43:38 +01:00
Hadi Moshayedi	40b52ab757	Fix memory leaks in column store	2020-11-23 11:26:12 -08:00
Jeff Davis	8cee2b092b	remove columnar FDW code	2020-11-20 10:03:12 -08:00
Onder Kalaci	c433c66f2b	Do not execute subplans multiple times with cursors Before this commit, we let AdaptiveExecutorPreExecutorRun() to be effective multiple times on every FETCH on cursors. That does not affect the correctness of the query results, but adds significant overhead.	2020-11-20 10:43:56 +01:00
Hadi Moshayedi	b182a95389	Fix ALTER COLUMN ... SET TYPE for columnar	2020-11-19 15:36:45 -08:00
Jeff Davis	91015deb9d	rename UDFs also	2020-11-19 12:27:40 -08:00
Jeff Davis	a2b698a766	rename cstore_tableam -> columnar	2020-11-19 12:15:51 -08:00
Hadi Moshayedi	2747fd80ff	Add prepared materialized view tests for columnar	2020-11-17 20:13:20 -08:00
Hadi Moshayedi	6711340ea6	Add prepared xact & stmt tests for columnar	2020-11-17 20:00:57 -08:00
Hadi Moshayedi	97cba2d5b6	Implements write state management for tuple inserts. TableAM API doesn't allow us to pass around a state variable along all of the tuple inserts belonging to the same command. We require this in columnar store, since we batch them, and when we have enough rows we flush them as stripes. To do that, we keep a (relfilenode) -> stack of (subxact id, TableWriteState) global mapping. Inserts Whenever we want to insert a tuple, we look up for the relation's relfilenode in this mapping. If top of the stack matches current subtransaction, we us the existing TableWriteState. Otherwise, we allocate a new TableWriteState and push it on top of stack. (Sub)Transaction Commit/Aborts When the subtransaction or transaction is committed, we flush and pop all entries matching current SubTransactionId. When the subtransaction or transaction is committed, we pop all entries matching current SubTransactionId and discard them without flushing. Reads Since we might have unwritten rows which needs to be read by a table scan, we flush write states on SELECTs. Since flushing the write state of upper transactions in a subtransaction will cause metadata being written in wrong subtransaction, we ERROR out if any of the upper subtransactions have unflushed rows. Table Drops We record in which subtransaction the table was dropped. When committing a subtransaction in which table was dropped, we propagate the drop to upper transaction. When aborting a subtransaction in which table was dropped, we mark table as not deleted.	2020-11-17 12:07:16 -08:00
Nils Dijk	22df8027b0	add extra output for multi_extension targeting pg11	2020-11-17 19:01:54 +01:00
Nils Dijk	2987535172	add pg upgrade tests verifying table am is created	2020-11-17 18:55:36 +01:00
Nils Dijk	d065bb495d	Prepare downgrade script and bump development version to 10.0-1	2020-11-17 18:55:35 +01:00
Nils Dijk	b6d4a1bbe2	fix style	2020-11-17 18:55:35 +01:00
Nils Dijk	3bb6554976	make tests run	2020-11-17 18:55:35 +01:00
Nils Dijk	f89bd3eeb5	move columnar test files	2020-11-17 18:55:34 +01:00
Onur Tirtir	5e3dc9d707	Bump citus version to 10.0devel	2020-11-09 13:16:54 +03:00
Onur Tirtir	5d5966f700	Fix a flaky test in mixed_relkind_tests (#4300 )	2020-11-06 14:53:30 +03:00
Onder Kalaci	e0d2ac7620	Do not rely on set_rel_pathlist_hook for finding local relations When a relation is used on an OUTER JOIN with FALSE filters, set_rel_pathlist_hook may not be called for the table. There might be other cases as well, so do not rely on the hook for classification of the tables.	2020-11-06 11:14:30 +01:00
Onur Tirtir	cc8be422ce	Fix relkind checks in planner for relkinds other than RELKIND_RELATION (#4294 ) We were qualifying relations with relkind != RELKIND_RELATION as non-relations due to the strict checks around RangeTblEntry->relkind in planner.	2020-11-05 14:21:02 +03:00
Hanefi Önaldı	85a4b61a0e	Prevent undistribute_table calls for partitions	2020-11-03 18:10:20 +03:00
Hanefi Önaldı	5db380f33a	Prevent undistribute_table calls for foreign tables	2020-11-03 17:33:29 +03:00
Halil Ozan Akgul	77b3be8b6d	Turn RelOptInfos to only used field of them, relids, to be able to copy	2020-10-22 13:42:28 +03:00
Onur Tirtir	790beea59f	Add intermediate result tests with unsupported outer joins (#4262 )	2020-10-20 12:11:18 +03:00
SaitTalhaNisanci	0f209377c4	Fix incorrect join related fields (#4242 ) * Fix incorrect join related fields Ruleutils expect to give the original index of join columns hence we should consider the dropped columns while setting the fields in SetJoinRelatedFieldsCompat. * add some more tests for joins * Move tests to join.sql and create a utility function	2020-10-19 18:28:39 +03:00
Onur Tirtir	c49077d594	Disallow outer joins `ON TRUE` with ref & dist tables when ref table is outer relation (#4255 ) Disallow `ON TRUE` outer joins with reference & distributed tables when reference table is outer relation by fixing the logic bug made when calling `LeftListIsSubset` function. Also, be more defensive when removing duplicate join restrictions when join clause is empty for non-inner joins as they might still contain useful information for non-inner joins.	2020-10-19 16:58:11 +03:00
Onder Kalaci	bbedfca761	Improve the relation restriction counters It seems like Postgres could call set_rel_pathlist() for the same relation multiple times. This breaks the logic where we assume relationCount eqauls to the number of entries in relationRestrictionList. In summary, relationRestrictionList may contain duplicate entries.	2020-10-19 08:51:16 +02:00
Hadi Moshayedi	663549db33	Set explicit transfer_mode in tableam tests	2020-10-16 12:40:37 -07:00
Nils Dijk	caabbf4b84	Table access method support for distributed tables	2020-10-16 12:02:25 -07:00
Marco Slot	8976f245ab	Support reference table view in reference table modification	2020-10-16 11:31:24 +02:00
Onder Kalaci	596f7bf4a9	Add more regression test for single node Citus Tests on commands with SCHEMA.	2020-10-15 17:32:32 +02:00
Onder Kalaci	fe3caf3bc8	Local execution considers intermediate result size limit With this commit, we make sure that local execution adds the intermediate result size as the distributed execution adds. Plus, it enforces the citus.max_intermediate_result_size value.	2020-10-15 17:18:55 +02:00
Marco Slot	31858c8a29	Check table existence in EnsureRelationKindSupported	2020-10-15 17:05:06 +02:00
Onder Kalaci	15e724c073	Add regression tests for outer/cross JOINs	2020-10-14 15:17:30 +02:00
Onder Kalaci	de33079065	Improve outer join checks Before this commit, the logic was: - As long as the outer side of the JOIN is not a JOIN (e.g., relation or subquery etc.), we check for the existence of any recurring tuples. There were two implications of this decision. First, even if a subquery which is on the outer side contains distributed table JOIN reference table, Citus would unnecessarily throw an error. Note that, the JOIN inside the subquery would already be going to be tested recursively. But, as long as that check passes, there is no reason for the upper JOIN to fail. An example, which used to fail and now works: SELECT * FROM (SELECT * FROM dist JOIN ref) as foo LEFT JOIN dist; Second, certain JOINs, especially with ON (true) conditions were not represented as Citus expects the JOINs to be in the format DeferredErrorIfUnsupportedRecurringTuplesJoin().	2020-10-14 15:17:30 +02:00
Onur Tirtir	1a28858c47	Disallow field indirection in INSERT/UPDATE queries (#4241 )	2020-10-14 14:11:59 +03:00
Onur Tirtir	8efca3b60a	Fix a crash with inserting domain composite types in coord. evaluation (#4231 ) Use short lived per-tuple context in citus_evaluate_expr like (pg) evaluate_expr does. We should not use planState->ExprContext when evaluating expressions as it might lead to freeing the same executor twice (first one happens in citus_evaluate_expr itself and the other one happens when postgres doing clean-up for the top level executor state), which in turn might cause seg.faults. However, now as we don't have necessary planState info to evaluate prepared statements, we also add planState->es_param_list_info to per-tuple ExprContext.	2020-10-13 14:19:59 +03:00
Halil Ozan Akgul	e2736c25bd	Adds support for WITH TIES option	2020-10-12 19:34:18 +03:00
Sait Talha Nisanci	dc40758355	Return early if there is no citus table in VACUUM	2020-10-09 11:10:00 +03:00
Sait Talha Nisanci	99bb79745a	Commit transaction for VACUUM on shell table With postgres 13, there is a global lock that prevents multiple VACUUMs happening in the current database. This global lock is taken for a short time but this creates a problem because of the following: - We execute the VACUUM for the shell table through the standard process utility. In this step the global lock is taken for the current database. - If the current node has shard placements then it tries to execute VACUUM over a connection to localhost with ExecuteUtilityTaskList. - the VACUUM on shard placements cannot proceed because it is waiting for the global lock for the current database to be released. - The acquired lock from the VACUUM for shell table will not be released until the transaction is committed. - So there is a deadlock. As a solution, we commit the current transaction in case of VACUUM after the VACUUM is executed for the shell table. Executing the VACUUM on a shell table is not important because the data there will probably be truncated. PostprocessVacuumStmt takes the necessary locks on the shell table so we don't need to take any extra locks after we commit the current transaction.	2020-10-09 10:57:44 +03:00
Marco Slot	881e5df780	Fix a bug that could lead to multiple maintenance daemons	2020-10-08 16:18:14 +02:00
Marco Slot	18219843d0	Add maintenance daemon error tests	2020-10-08 16:17:33 +02:00
Marco Slot	dbc348b7e0	Create sequence dependency during metadata syncing	2020-10-06 10:57:39 +02:00
Marco Slot	9bba8bb4e8	Remove master_drop_sequences	2020-10-06 10:57:33 +02:00
Onur Tirtir	2cd0a69dfb	Fix multi-row & router INSERT crash with local exec. when def. cols not specified (#4197 ) Multi-row & router INSERT's were crashing with local execution if at least one of the DEFAULT columns were not specified in VALUES list. This was because, the changes we make on query->values_lists and query->targetList was sufficient for deparsing given INSERT for remote execution but not sufficient for local execution. With this commit, DEFAULT value normalization for multi-row & router INSERT's is fixed by adding dummy column references for unspecified DEFAULT columns.	2020-10-05 10:45:17 +03:00

... 5 6 7 8 9 ...

1778 Commits (ace800851a88d691f694c86244dcccd72ea90d1d)