citus

...

Author	SHA1	Message	Date
dependabot[bot]	5deaf9a616	Bump werkzeug from 2.3.7 to 3.0.6 in /src/test/regress (#8003 ) Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.3.7 to 3.0.6. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pallets/werkzeug/releases">werkzeug's releases</a>.</em></p> <blockquote> <h2>3.0.6</h2> <p>This is the Werkzeug 3.0.6 security fix release, which fixes security issues but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Werkzeug/3.0.6/">https://pypi.org/project/Werkzeug/3.0.6/</a> Changes: <a href="https://werkzeug.palletsprojects.com/en/stable/changes/#version-3-0-6">https://werkzeug.palletsprojects.com/en/stable/changes/#version-3-0-6</a></p> <ul> <li>Fix how <code>max_form_memory_size</code> is applied when parsing large non-file fields. <a href="https://github.com/advisories/GHSA-q34m-jh98-gwm2">GHSA-q34m-jh98-gwm2</a></li> <li><code>safe_join</code> catches certain paths on Windows that were not caught by <code>ntpath.isabs</code> on Python < 3.11. <a href="https://github.com/advisories/GHSA-f9vj-2wh5-fj8j">GHSA-f9vj-2wh5-fj8j</a></li> </ul> <h2>3.0.5</h2> <p>This is the Werkzeug 3.0.5 fix release, which fixes bugs but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Werkzeug/3.0.5/">https://pypi.org/project/Werkzeug/3.0.5/</a> Changes: <a href="https://werkzeug.palletsprojects.com/en/stable/changes/#version-3-0-5">https://werkzeug.palletsprojects.com/en/stable/changes/#version-3-0-5</a> Milestone: <a href="https://github.com/pallets/werkzeug/milestone/37?closed=1">https://github.com/pallets/werkzeug/milestone/37?closed=1</a></p> <ul> <li>The Watchdog reloader ignores file closed no write events. <a href="https://redirect.github.com/pallets/werkzeug/issues/2945">#2945</a></li> <li>Logging works with client addresses containing an IPv6 scope. 
<a href="https://redirect.github.com/pallets/werkzeug/issues/2952">#2952</a></li> <li>Ignore invalid authorization parameters. <a href="https://redirect.github.com/pallets/werkzeug/issues/2955">#2955</a></li> <li>Improve type annotation fore <code>SharedDataMiddleware</code>. <a href="https://redirect.github.com/pallets/werkzeug/issues/2958">#2958</a></li> <li>Compatibility with Python 3.13 when generating debugger pin and the current UID does not have an associated name. <a href="https://redirect.github.com/pallets/werkzeug/issues/2957">#2957</a></li> </ul> <h2>3.0.4</h2> <p>This is the Werkzeug 3.0.4 fix release, which fixes bugs but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Werkzeug/3.0.4/">https://pypi.org/project/Werkzeug/3.0.4/</a> Changes: <a href="https://werkzeug.palletsprojects.com/en/3.0.x/changes/#version-3-0-4">https://werkzeug.palletsprojects.com/en/3.0.x/changes/#version-3-0-4</a> Milestone: <a href="https://github.com/pallets/werkzeug/milestone/36?closed=1">https://github.com/pallets/werkzeug/milestone/36?closed=1</a></p> <ul> <li>Restore behavior where parsing <code>multipart/x-www-form-urlencoded</code> data with invalid UTF-8 bytes in the body results in no form data parsed rather than a 413 error. <a href="https://redirect.github.com/pallets/werkzeug/issues/2930">#2930</a></li> <li>Improve <code>parse_options_header</code> performance when parsing unterminated quoted string values. <a href="https://redirect.github.com/pallets/werkzeug/issues/2904">#2904</a></li> <li>Debugger pin auth is synchronized across threads/processes when tracking failed entries. <a href="https://redirect.github.com/pallets/werkzeug/issues/2916">#2916</a></li> <li>Dev server handles unexpected <code>SSLEOFError</code> due to issue in Python < 3.13. 
<a href="https://redirect.github.com/pallets/werkzeug/issues/2926">#2926</a></li> <li>Debugger pin auth works when the URL already contains a query string. <a href="https://redirect.github.com/pallets/werkzeug/issues/2918">#2918</a></li> </ul> <h2>3.0.3</h2> <p>This is the Werkzeug 3.0.3 security release, which fixes security issues and bugs but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Werkzeug/3.0.3/">https://pypi.org/project/Werkzeug/3.0.3/</a> Changes: <a href="https://werkzeug.palletsprojects.com/en/3.0.x/changes/#version-3-0-3">https://werkzeug.palletsprojects.com/en/3.0.x/changes/#version-3-0-3</a> Milestone: <a href="https://github.com/pallets/werkzeug/milestone/35?closed=1">https://github.com/pallets/werkzeug/milestone/35?closed=1</a></p> <ul> <li>Only allow <code>localhost</code>, <code>.localhost</code>, <code>127.0.0.1</code>, or the specified hostname when running the dev server, to make debugger requests. Additional hosts can be added by using the debugger middleware directly. The debugger UI makes requests using the full URL rather than only the path. GHSA-2g68-c3qc-8985</li> <li>Make reloader more robust when <code>""</code> is in <code>sys.path</code>. <a href="https://redirect.github.com/pallets/werkzeug/issues/2823">#2823</a></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pallets/werkzeug/blob/main/CHANGES.rst">werkzeug's changelog</a>.</em></p> <blockquote> <h2>Version 3.0.6</h2> <p>Released 2024-10-25</p> <ul> <li>Fix how <code>max_form_memory_size</code> is applied when parsing large non-file fields. :ghsa:<code>q34m-jh98-gwm2</code></li> <li><code>safe_join</code> catches certain paths on Windows that were not caught by <code>ntpath.isabs</code> on Python < 3.11. 
:ghsa:<code>f9vj-2wh5-fj8j</code></li> </ul> <h2>Version 3.0.5</h2> <p>Released 2024-10-24</p> <ul> <li>The Watchdog reloader ignores file closed no write events. :issue:<code>2945</code></li> <li>Logging works with client addresses containing an IPv6 scope :issue:<code>2952</code></li> <li>Ignore invalid authorization parameters. :issue:<code>2955</code></li> <li>Improve type annotation fore <code>SharedDataMiddleware</code>. :issue:<code>2958</code></li> <li>Compatibility with Python 3.13 when generating debugger pin and the current UID does not have an associated name. :issue:<code>2957</code></li> </ul> <h2>Version 3.0.4</h2> <p>Released 2024-08-21</p> <ul> <li>Restore behavior where parsing <code>multipart/x-www-form-urlencoded</code> data with invalid UTF-8 bytes in the body results in no form data parsed rather than a 413 error. :issue:<code>2930</code></li> <li>Improve <code>parse_options_header</code> performance when parsing unterminated quoted string values. :issue:<code>2904</code></li> <li>Debugger pin auth is synchronized across threads/processes when tracking failed entries. :issue:<code>2916</code></li> <li>Dev server handles unexpected <code>SSLEOFError</code> due to issue in Python < 3.13. :issue:<code>2926</code></li> <li>Debugger pin auth works when the URL already contains a query string. :issue:<code>2918</code></li> </ul> <h2>Version 3.0.3</h2> <p>Released 2024-05-05</p> <ul> <li>Only allow <code>localhost</code>, <code>.localhost</code>, <code>127.0.0.1</code>, or the specified hostname when running the dev server, to make debugger requests. Additional hosts can be added by using the debugger middleware directly. The debugger</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`5eaefc3996`"><code>5eaefc3</code></a> release version 3.0.6</li> <li><a href="`2767bcb10a`"><code>2767bcb</code></a> Merge commit from fork</li> <li><a href="`87cc78a25f`"><code>87cc78a</code></a> catch special absolute path on Windows Python < 3.11</li> <li><a href="`50cfeebcb0`"><code>50cfeeb</code></a> Merge commit from fork</li> <li><a href="`8760275afb`"><code>8760275</code></a> apply max_form_memory_size another level up in the parser</li> <li><a href="`8d6a12e2af`"><code>8d6a12e</code></a> start version 3.0.6</li> <li><a href="`a7b121abc7`"><code>a7b121a</code></a> release version 3.0.5 (<a href="https://redirect.github.com/pallets/werkzeug/issues/2961">#2961</a>)</li> <li><a href="`9caf72ac06`"><code>9caf72a</code></a> release version 3.0.5</li> <li><a href="`e28a2451e9`"><code>e28a245</code></a> catch OSError from getpass.getuser (<a href="https://redirect.github.com/pallets/werkzeug/issues/2960">#2960</a>)</li> <li><a href="`e6b4cce97e`"><code>e6b4cce</code></a> catch OSError from getpass.getuser</li> <li>Additional commits viewable in <a href="https://github.com/pallets/werkzeug/compare/2.3.7...3.0.6">compare view</a></li> </ul> </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-06-26 18:30:16 +03:00
dependabot[bot]	c36072064a	Bump cryptography from 42.0.3 to 44.0.1 in /.devcontainer/src/test/regress (#8038 ) Bumps [cryptography](https://github.com/pyca/cryptography) from 42.0.3 to 44.0.1. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst">cryptography's changelog</a>.</em></p> <blockquote> <p>44.0.1 - 2025-02-11</p> <pre><code> * Updated Windows, macOS, and Linux wheels to be compiled with OpenSSL 3.4.1. * We now build ``armv7l`` ``manylinux`` wheels and publish them to PyPI. * We now build ``manylinux_2_34`` wheels and publish them to PyPI. <p>.. _v44-0-0:</p> <p>44.0.0 - 2024-11-27 </code></pre></p> <ul> <li><strong>BACKWARDS INCOMPATIBLE:</strong> Dropped support for LibreSSL < 3.9.</li> <li>Deprecated Python 3.7 support. Python 3.7 is no longer supported by the Python core team. Support for Python 3.7 will be removed in a future <code>cryptography</code> release.</li> <li>Updated Windows, macOS, and Linux wheels to be compiled with OpenSSL 3.4.0.</li> <li>macOS wheels are now built against the macOS 10.13 SDK. 
Users on older versions of macOS should upgrade, or they will need to build <code>cryptography</code> themselves.</li> <li>Enforce the :rfc:<code>5280</code> requirement that extended key usage extensions must not be empty.</li> <li>Added support for timestamp extraction to the :class:<code>~cryptography.fernet.MultiFernet</code> class.</li> <li>Relax the Authority Key Identifier requirements on root CA certificates during X.509 verification to allow fields permitted by :rfc:<code>5280</code> but forbidden by the CA/Browser BRs.</li> <li>Added support for :class:<code>~cryptography.hazmat.primitives.kdf.argon2.Argon2id</code> when using OpenSSL 3.2.0+.</li> <li>Added support for the :class:<code>~cryptography.x509.Admissions</code> certificate extension.</li> <li>Added basic support for PKCS7 decryption (including S/MIME 3.2) via :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_der</code>, :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_pem</code>, and :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_smime</code>.</li> </ul> <p>.. _v43-0-3:</p> <p>43.0.3 - 2024-10-18</p> <pre><code> * Fixed release metadata for ``cryptography-vectors`` <p>.. _v43-0-2:</p> <p>43.0.2 - 2024-10-18 </code></pre></p> <ul> <li>Fixed compilation when using LibreSSL 4.0.0.</li> </ul> <p>.. _v43-0-1:</p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`adaaaed77d`"><code>adaaaed</code></a> Bump for 44.0.1 release (<a href="https://redirect.github.com/pyca/cryptography/issues/12441">#12441</a>)</li> <li><a href="`ccc61dabe3`"><code>ccc61da</code></a> [backport] test and build on armv7l (<a href="https://redirect.github.com/pyca/cryptography/issues/12420">#12420</a>) (<a href="https://redirect.github.com/pyca/cryptography/issues/12431">#12431</a>)</li> <li><a href="`f299a48153`"><code>f299a48</code></a> remove deprecated call (<a href="https://redirect.github.com/pyca/cryptography/issues/12052">#12052</a>)</li> <li><a href="`439eb0594a`"><code>439eb05</code></a> Bump version for 44.0.0 (<a href="https://redirect.github.com/pyca/cryptography/issues/12051">#12051</a>)</li> <li><a href="`2c5ad4d8dc`"><code>2c5ad4d</code></a> chore(deps): bump maturin from 1.7.4 to 1.7.5 in /.github/requirements (<a href="https://redirect.github.com/pyca/cryptography/issues/12050">#12050</a>)</li> <li><a href="`d23968addd`"><code>d23968a</code></a> chore(deps): bump libc from 0.2.165 to 0.2.166 (<a href="https://redirect.github.com/pyca/cryptography/issues/12049">#12049</a>)</li> <li><a href="`133c0e02ed`"><code>133c0e0</code></a> Bump x509-limbo and/or wycheproof in CI (<a href="https://redirect.github.com/pyca/cryptography/issues/12047">#12047</a>)</li> <li><a href="`f2259d7aa0`"><code>f2259d7</code></a> Bump BoringSSL and/or OpenSSL in CI (<a href="https://redirect.github.com/pyca/cryptography/issues/12046">#12046</a>)</li> <li><a href="`e201c870b8`"><code>e201c87</code></a> fixed metadata in changelog (<a href="https://redirect.github.com/pyca/cryptography/issues/12044">#12044</a>)</li> <li><a href="`c6104cc366`"><code>c6104cc</code></a> Prohibit Python 3.9.0, 3.9.1 -- they have a bug that causes errors (<a href="https://redirect.github.com/pyca/cryptography/issues/12045">#12045</a>)</li> <li>Additional commits viewable in <a 
href="https://github.com/pyca/cryptography/compare/42.0.3...44.0.1">compare view</a></li> </ul> </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-06-26 17:28:51 +03:00
dependabot[bot]	c350c7be46	Bump tornado from 6.4 to 6.5 in /.devcontainer/src/test/regress (#8037 ) Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4 to 6.5. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst">tornado's changelog</a>.</em></p> <blockquote> <h1>Release notes</h1> <p>.. toctree:: :maxdepth: 2</p> <p>releases/v6.5.1 releases/v6.5.0 releases/v6.4.2 releases/v6.4.1 releases/v6.4.0 releases/v6.3.3 releases/v6.3.2 releases/v6.3.1 releases/v6.3.0 releases/v6.2.0 releases/v6.1.0 releases/v6.0.4 releases/v6.0.3 releases/v6.0.2 releases/v6.0.1 releases/v6.0.0 releases/v5.1.1 releases/v5.1.0 releases/v5.0.2 releases/v5.0.1 releases/v5.0.0 releases/v4.5.3 releases/v4.5.2 releases/v4.5.1 releases/v4.5.0 releases/v4.4.3 releases/v4.4.2 releases/v4.4.1 releases/v4.4.0 releases/v4.3.0 releases/v4.2.1 releases/v4.2.0 releases/v4.1.0 releases/v4.0.2 releases/v4.0.1 releases/v4.0.0 releases/v3.2.2 releases/v3.2.1 releases/v3.2.0 releases/v3.1.1 releases/v3.1.0 releases/v3.0.2 releases/v3.0.1 releases/v3.0.0</p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`ab5f354312`"><code>ab5f354</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3498">#3498</a> from bdarnell/final-6.5</li> <li><a href="`3623024dfc`"><code>3623024</code></a> Final release notes for 6.5.0</li> <li><a href="`b39b892bf7`"><code>b39b892</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3497">#3497</a> from bdarnell/multipart-log-spam</li> <li><a href="`cc61050e8f`"><code>cc61050</code></a> httputil: Raise errors instead of logging in multipart/form-data parsing</li> <li><a href="`ae4a4e4fea`"><code>ae4a4e4</code></a> asyncio: Preserve contextvars across SelectorThread on Windows (<a href="https://redirect.github.com/tornadoweb/tornado/issues/3479">#3479</a>)</li> <li><a href="`197ff13f76`"><code>197ff13</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3496">#3496</a> from bdarnell/undeprecate-set-event-loop</li> <li><a href="`c3d906c4ad`"><code>c3d906c</code></a> requirements: Upgrade tox to 4.26.0</li> <li><a href="`a83897732e`"><code>a838977</code></a> testing: Remove deprecation warning filter for set_event_loop</li> <li><a href="`d8e0d36eba`"><code>d8e0d36</code></a> build: Fix free-threaded build, mark speedups module as no-GIL</li> <li><a href="`bfe7489485`"><code>bfe7489</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3492">#3492</a> from bdarnell/relnotes-6.5</li> <li>Additional commits viewable in <a href="https://github.com/tornadoweb/tornado/compare/v6.4.0...v6.5.0">compare view</a></li> </ul> </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-06-26 17:00:18 +03:00
ibrahim halatci	8587de850b	Filter out upload coverage action for PRs from forks (#8033) Add a condition that skips the coverage upload action for PRs from forks, because the required secret is not available to them and the missing secret fails the whole pipeline.	2025-06-26 14:12:39 +03:00
naisila	4cd8bb1b67	Bump Citus version to 13.2devel	2025-06-24 16:21:48 +02:00
naisila	4456913801	Add Changelog entries for 13.1.0, 13.0.4, 12.1.8 13.1.0 https://github.com/citusdata/citus/pull/8006 13.0.4 https://github.com/citusdata/citus/pull/8005 12.1.8 https://github.com/citusdata/citus/pull/8004	2025-06-24 16:21:48 +02:00
Onur Tirtir	55a0d1f730	Add skip_qualify_public param to shard_name() to allow qualifying for "public" schema (#8014) DESCRIPTION: Adds skip_qualify_public param to `shard_name()` UDF to allow qualifying for "public" schema when needed.	2025-06-02 10:15:32 +03:00
dependabot[bot]	5e37fe0c46	Bump cryptography from 42.0.3 to 44.0.1 in /src/test/regress (#7996 ) Bumps [cryptography](https://github.com/pyca/cryptography) from 42.0.3 to 44.0.1. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst">cryptography's changelog</a>.</em></p> <blockquote> <p>44.0.1 - 2025-02-11</p> <pre><code> * Updated Windows, macOS, and Linux wheels to be compiled with OpenSSL 3.4.1. * We now build ``armv7l`` ``manylinux`` wheels and publish them to PyPI. * We now build ``manylinux_2_34`` wheels and publish them to PyPI. <p>.. _v44-0-0:</p> <p>44.0.0 - 2024-11-27 </code></pre></p> <ul> <li><strong>BACKWARDS INCOMPATIBLE:</strong> Dropped support for LibreSSL < 3.9.</li> <li>Deprecated Python 3.7 support. Python 3.7 is no longer supported by the Python core team. Support for Python 3.7 will be removed in a future <code>cryptography</code> release.</li> <li>Updated Windows, macOS, and Linux wheels to be compiled with OpenSSL 3.4.0.</li> <li>macOS wheels are now built against the macOS 10.13 SDK. 
Users on older versions of macOS should upgrade, or they will need to build <code>cryptography</code> themselves.</li> <li>Enforce the :rfc:<code>5280</code> requirement that extended key usage extensions must not be empty.</li> <li>Added support for timestamp extraction to the :class:<code>~cryptography.fernet.MultiFernet</code> class.</li> <li>Relax the Authority Key Identifier requirements on root CA certificates during X.509 verification to allow fields permitted by :rfc:<code>5280</code> but forbidden by the CA/Browser BRs.</li> <li>Added support for :class:<code>~cryptography.hazmat.primitives.kdf.argon2.Argon2id</code> when using OpenSSL 3.2.0+.</li> <li>Added support for the :class:<code>~cryptography.x509.Admissions</code> certificate extension.</li> <li>Added basic support for PKCS7 decryption (including S/MIME 3.2) via :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_der</code>, :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_pem</code>, and :func:<code>~cryptography.hazmat.primitives.serialization.pkcs7.pkcs7_decrypt_smime</code>.</li> </ul> <p>.. _v43-0-3:</p> <p>43.0.3 - 2024-10-18</p> <pre><code> * Fixed release metadata for ``cryptography-vectors`` <p>.. _v43-0-2:</p> <p>43.0.2 - 2024-10-18 </code></pre></p> <ul> <li>Fixed compilation when using LibreSSL 4.0.0.</li> </ul> <p>.. _v43-0-1:</p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`adaaaed77d`"><code>adaaaed</code></a> Bump for 44.0.1 release (<a href="https://redirect.github.com/pyca/cryptography/issues/12441">#12441</a>)</li> <li><a href="`ccc61dabe3`"><code>ccc61da</code></a> [backport] test and build on armv7l (<a href="https://redirect.github.com/pyca/cryptography/issues/12420">#12420</a>) (<a href="https://redirect.github.com/pyca/cryptography/issues/12431">#12431</a>)</li> <li><a href="`f299a48153`"><code>f299a48</code></a> remove deprecated call (<a href="https://redirect.github.com/pyca/cryptography/issues/12052">#12052</a>)</li> <li><a href="`439eb0594a`"><code>439eb05</code></a> Bump version for 44.0.0 (<a href="https://redirect.github.com/pyca/cryptography/issues/12051">#12051</a>)</li> <li><a href="`2c5ad4d8dc`"><code>2c5ad4d</code></a> chore(deps): bump maturin from 1.7.4 to 1.7.5 in /.github/requirements (<a href="https://redirect.github.com/pyca/cryptography/issues/12050">#12050</a>)</li> <li><a href="`d23968addd`"><code>d23968a</code></a> chore(deps): bump libc from 0.2.165 to 0.2.166 (<a href="https://redirect.github.com/pyca/cryptography/issues/12049">#12049</a>)</li> <li><a href="`133c0e02ed`"><code>133c0e0</code></a> Bump x509-limbo and/or wycheproof in CI (<a href="https://redirect.github.com/pyca/cryptography/issues/12047">#12047</a>)</li> <li><a href="`f2259d7aa0`"><code>f2259d7</code></a> Bump BoringSSL and/or OpenSSL in CI (<a href="https://redirect.github.com/pyca/cryptography/issues/12046">#12046</a>)</li> <li><a href="`e201c870b8`"><code>e201c87</code></a> fixed metadata in changelog (<a href="https://redirect.github.com/pyca/cryptography/issues/12044">#12044</a>)</li> <li><a href="`c6104cc366`"><code>c6104cc</code></a> Prohibit Python 3.9.0, 3.9.1 -- they have a bug that causes errors (<a href="https://redirect.github.com/pyca/cryptography/issues/12045">#12045</a>)</li> <li>Additional commits viewable in <a 
href="https://github.com/pyca/cryptography/compare/42.0.3...44.0.1">compare view</a></li> </ul> </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-28 20:48:29 +03:00
dependabot[bot]	e8c3179b4d	Bump tornado from 6.4.2 to 6.5.1 in /src/test/regress (#8001 ) Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.2 to 6.5.1. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst">tornado's changelog</a>.</em></p> <blockquote> <h1>Release notes</h1> <p>.. toctree:: :maxdepth: 2</p> <p>releases/v6.5.1 releases/v6.5.0 releases/v6.4.2 releases/v6.4.1 releases/v6.4.0 releases/v6.3.3 releases/v6.3.2 releases/v6.3.1 releases/v6.3.0 releases/v6.2.0 releases/v6.1.0 releases/v6.0.4 releases/v6.0.3 releases/v6.0.2 releases/v6.0.1 releases/v6.0.0 releases/v5.1.1 releases/v5.1.0 releases/v5.0.2 releases/v5.0.1 releases/v5.0.0 releases/v4.5.3 releases/v4.5.2 releases/v4.5.1 releases/v4.5.0 releases/v4.4.3 releases/v4.4.2 releases/v4.4.1 releases/v4.4.0 releases/v4.3.0 releases/v4.2.1 releases/v4.2.0 releases/v4.1.0 releases/v4.0.2 releases/v4.0.1 releases/v4.0.0 releases/v3.2.2 releases/v3.2.1 releases/v3.2.0 releases/v3.1.1 releases/v3.1.0 releases/v3.0.2 releases/v3.0.1 releases/v3.0.0</p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`b5586f3f29`"><code>b5586f3</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3503">#3503</a> from bdarnell/multipart-utf8</li> <li><a href="`62c276434d`"><code>62c2764</code></a> Release notes for v6.5.1</li> <li><a href="`170a58af2c`"><code>170a58a</code></a> httputil: Fix support for non-latin1 filenames in multipart uploads</li> <li><a href="`ab5f354312`"><code>ab5f354</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3498">#3498</a> from bdarnell/final-6.5</li> <li><a href="`3623024dfc`"><code>3623024</code></a> Final release notes for 6.5.0</li> <li><a href="`b39b892bf7`"><code>b39b892</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3497">#3497</a> from bdarnell/multipart-log-spam</li> <li><a href="`cc61050e8f`"><code>cc61050</code></a> httputil: Raise errors instead of logging in multipart/form-data parsing</li> <li><a href="`ae4a4e4fea`"><code>ae4a4e4</code></a> asyncio: Preserve contextvars across SelectorThread on Windows (<a href="https://redirect.github.com/tornadoweb/tornado/issues/3479">#3479</a>)</li> <li><a href="`197ff13f76`"><code>197ff13</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3496">#3496</a> from bdarnell/undeprecate-set-event-loop</li> <li><a href="`c3d906c4ad`"><code>c3d906c</code></a> requirements: Upgrade tox to 4.26.0</li> <li>Additional commits viewable in <a href="https://github.com/tornadoweb/tornado/compare/v6.4.2...v6.5.1">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility 
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=tornado&package-manager=pip&previous-version=6.4.2&new-version=6.5.1)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-28 17:45:11 +03:00
dependabot[bot]	92dc7f36fc	Bump jinja2 from 3.1.3 to 3.1.6 in /src/test/regress (#8002 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.6. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pallets/jinja/releases">jinja2's releases</a>.</em></p> <blockquote> <h2>3.1.6</h2> <p>This is the Jinja 3.1.6 security release, which fixes security issues but does not otherwise change behavior and should not result in breaking changes compared to the latest feature release.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.6/">https://pypi.org/project/Jinja2/3.1.6/</a> Changes: <a href="https://jinja.palletsprojects.com/en/stable/changes/#version-3-1-6">https://jinja.palletsprojects.com/en/stable/changes/#version-3-1-6</a></p> <ul> <li>The <code>\|attr</code> filter does not bypass the environment's attribute lookup, allowing the sandbox to apply its checks. <a href="https://github.com/pallets/jinja/security/advisories/GHSA-cpwx-vrp4-4pq7">https://github.com/pallets/jinja/security/advisories/GHSA-cpwx-vrp4-4pq7</a></li> </ul> <h2>3.1.5</h2> <p>This is the Jinja 3.1.5 security fix release, which fixes security issues and bugs but does not otherwise change behavior and should not result in breaking changes compared to the latest feature release.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.5/">https://pypi.org/project/Jinja2/3.1.5/</a> Changes: <a href="https://jinja.palletsprojects.com/changes/#version-3-1-5">https://jinja.palletsprojects.com/changes/#version-3-1-5</a> Milestone: <a href="https://github.com/pallets/jinja/milestone/16?closed=1">https://github.com/pallets/jinja/milestone/16?closed=1</a></p> <ul> <li>The sandboxed environment handles indirect calls to <code>str.format</code>, such as by passing a stored reference to a filter that calls its argument. 
<a href="https://github.com/pallets/jinja/security/advisories/GHSA-q2x7-8rv6-6q7h">GHSA-q2x7-8rv6-6q7h</a></li> <li>Escape template name before formatting it into error messages, to avoid issues with names that contain f-string syntax. <a href="https://redirect.github.com/pallets/jinja/issues/1792">#1792</a>, <a href="https://github.com/pallets/jinja/security/advisories/GHSA-gmj6-6f8f-6699">GHSA-gmj6-6f8f-6699</a></li> <li>Sandbox does not allow <code>clear</code> and <code>pop</code> on known mutable sequence types. <a href="https://redirect.github.com/pallets/jinja/issues/2032">#2032</a></li> <li>Calling sync <code>render</code> for an async template uses <code>asyncio.run</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1952">#1952</a></li> <li>Avoid unclosed <code>auto_aiter</code> warnings. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Return an <code>aclose</code>-able <code>AsyncGenerator</code> from <code>Template.generate_async</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Avoid leaving <code>root_render_func()</code> unclosed in <code>Template.generate_async</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Avoid leaving async generators unclosed in blocks, includes and extends. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>The runtime uses the correct <code>concat</code> function for the current environment when calling block references. <a href="https://redirect.github.com/pallets/jinja/issues/1701">#1701</a></li> <li>Make <code>\|unique</code> async-aware, allowing it to be used after another async-aware filter. <a href="https://redirect.github.com/pallets/jinja/issues/1781">#1781</a></li> <li><code>\|int</code> filter handles <code>OverflowError</code> from scientific notation. 
<a href="https://redirect.github.com/pallets/jinja/issues/1921">#1921</a></li> <li>Make compiling deterministic for tuple unpacking in a <code>{% set ... %}</code> call. <a href="https://redirect.github.com/pallets/jinja/issues/2021">#2021</a></li> <li>Fix dunder protocol (<code>copy</code>/<code>pickle</code>/etc) interaction with <code>Undefined</code> objects. <a href="https://redirect.github.com/pallets/jinja/issues/2025">#2025</a></li> <li>Fix <code>copy</code>/<code>pickle</code> support for the internal <code>missing</code> object. <a href="https://redirect.github.com/pallets/jinja/issues/2027">#2027</a></li> <li><code>Environment.overlay(enable_async)</code> is applied correctly. <a href="https://redirect.github.com/pallets/jinja/issues/2061">#2061</a></li> <li>The error message from <code>FileSystemLoader</code> includes the paths that were searched. <a href="https://redirect.github.com/pallets/jinja/issues/1661">#1661</a></li> <li><code>PackageLoader</code> shows a clearer error message when the package does not contain the templates directory. <a href="https://redirect.github.com/pallets/jinja/issues/1705">#1705</a></li> <li>Improve annotations for methods returning copies. <a href="https://redirect.github.com/pallets/jinja/issues/1880">#1880</a></li> <li><code>urlize</code> does not add <code>mailto:</code> to values like <code>@a@b</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1870">#1870</a></li> <li>Tests decorated with <code>@pass_context</code> can be used with the <code>\|select</code> filter. <a href="https://redirect.github.com/pallets/jinja/issues/1624">#1624</a></li> <li>Using <code>set</code> for multiple assignment (<code>a, b = 1, 2</code>) does not fail when the target is a namespace attribute. 
<a href="https://redirect.github.com/pallets/jinja/issues/1413">#1413</a></li> <li>Using <code>set</code> in all branches of <code>{% if %}{% elif %}{% else %}</code> blocks does not cause the variable to be considered initially undefined. <a href="https://redirect.github.com/pallets/jinja/issues/1253">#1253</a></li> </ul> <h2>3.1.4</h2> <p>This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.4/">https://pypi.org/project/Jinja2/3.1.4/</a> Changes: <a href="https://jinja.palletsprojects.com/en/3.1.x/changes/#version-3-1-4">https://jinja.palletsprojects.com/en/3.1.x/changes/#version-3-1-4</a></p> <ul> <li>The <code>xmlattr</code> filter does not allow keys with <code>/</code> solidus, <code>></code> greater-than sign, or <code>=</code> equals sign, in addition to disallowing spaces. Regardless of any validation done by Jinja, user input should never be used as keys to this filter, or must be separately validated first. GHSA-h75v-3vvj-5mfj</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pallets/jinja/blob/main/CHANGES.rst">jinja2's changelog</a>.</em></p> <blockquote> <h2>Version 3.1.6</h2> <p>Released 2025-03-05</p> <ul> <li>The <code>\|attr</code> filter does not bypass the environment's attribute lookup, allowing the sandbox to apply its checks. :ghsa:<code>cpwx-vrp4-4pq7</code></li> </ul> <h2>Version 3.1.5</h2> <p>Released 2024-12-21</p> <ul> <li>The sandboxed environment handles indirect calls to <code>str.format</code>, such as by passing a stored reference to a filter that calls its argument. :ghsa:<code>q2x7-8rv6-6q7h</code></li> <li>Escape template name before formatting it into error messages, to avoid issues with names that contain f-string syntax. 
:issue:<code>1792</code>, :ghsa:<code>gmj6-6f8f-6699</code></li> <li>Sandbox does not allow <code>clear</code> and <code>pop</code> on known mutable sequence types. :issue:<code>2032</code></li> <li>Calling sync <code>render</code> for an async template uses <code>asyncio.run</code>. :pr:<code>1952</code></li> <li>Avoid unclosed <code>auto_aiter</code> warnings. :pr:<code>1960</code></li> <li>Return an <code>aclose</code>-able <code>AsyncGenerator</code> from <code>Template.generate_async</code>. :pr:<code>1960</code></li> <li>Avoid leaving <code>root_render_func()</code> unclosed in <code>Template.generate_async</code>. :pr:<code>1960</code></li> <li>Avoid leaving async generators unclosed in blocks, includes and extends. :pr:<code>1960</code></li> <li>The runtime uses the correct <code>concat</code> function for the current environment when calling block references. :issue:<code>1701</code></li> <li>Make <code>\|unique</code> async-aware, allowing it to be used after another async-aware filter. :issue:<code>1781</code></li> <li><code>\|int</code> filter handles <code>OverflowError</code> from scientific notation. :issue:<code>1921</code></li> <li>Make compiling deterministic for tuple unpacking in a <code>{% set ... %}</code> call. :issue:<code>2021</code></li> <li>Fix dunder protocol (<code>copy</code>/<code>pickle</code>/etc) interaction with <code>Undefined</code> objects. :issue:<code>2025</code></li> <li>Fix <code>copy</code>/<code>pickle</code> support for the internal <code>missing</code> object. :issue:<code>2027</code></li> <li><code>Environment.overlay(enable_async)</code> is applied correctly. :pr:<code>2061</code></li> <li>The error message from <code>FileSystemLoader</code> includes the paths that were searched. :issue:<code>1661</code></li> <li><code>PackageLoader</code> shows a clearer error message when the package does not contain the templates directory. :issue:<code>1705</code></li> <li>Improve annotations for methods returning copies. 
:pr:<code>1880</code></li> <li><code>urlize</code> does not add <code>mailto:</code> to values like <code>@a@b</code>. :pr:<code>1870</code></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`15206881c0`"><code>1520688</code></a> release version 3.1.6</li> <li><a href="`90457bbf33`"><code>90457bb</code></a> Merge commit from fork</li> <li><a href="`065334d1ee`"><code>065334d</code></a> attr filter uses env.getattr</li> <li><a href="`033c20015c`"><code>033c200</code></a> start version 3.1.6</li> <li><a href="`bc68d4efa9`"><code>bc68d4e</code></a> use global contributing guide (<a href="https://redirect.github.com/pallets/jinja/issues/2070">#2070</a>)</li> <li><a href="`247de5e0c5`"><code>247de5e</code></a> use global contributing guide</li> <li><a href="`ab8218c7a1`"><code>ab8218c</code></a> use project advisory link instead of global</li> <li><a href="`b4ffc8ff29`"><code>b4ffc8f</code></a> release version 3.1.5 (<a href="https://redirect.github.com/pallets/jinja/issues/2066">#2066</a>)</li> <li><a href="`877f6e51be`"><code>877f6e5</code></a> release version 3.1.5</li> <li><a href="`8d58859265`"><code>8d58859</code></a> remove test pypi</li> <li>Additional commits viewable in <a href="https://github.com/pallets/jinja/compare/3.1.3...3.1.6">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=jinja2&package-manager=pip&previous-version=3.1.3&new-version=3.1.6)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-28 17:01:42 +03:00
dependabot[bot]	98d95a9b9d	Bump jinja2 from 3.1.3 to 3.1.6 in /.devcontainer/src/test/regress (#7995 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.6. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/pallets/jinja/releases">jinja2's releases</a>.</em></p> <blockquote> <h2>3.1.6</h2> <p>This is the Jinja 3.1.6 security release, which fixes security issues but does not otherwise change behavior and should not result in breaking changes compared to the latest feature release.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.6/">https://pypi.org/project/Jinja2/3.1.6/</a> Changes: <a href="https://jinja.palletsprojects.com/en/stable/changes/#version-3-1-6">https://jinja.palletsprojects.com/en/stable/changes/#version-3-1-6</a></p> <ul> <li>The <code>\|attr</code> filter does not bypass the environment's attribute lookup, allowing the sandbox to apply its checks. <a href="https://github.com/pallets/jinja/security/advisories/GHSA-cpwx-vrp4-4pq7">https://github.com/pallets/jinja/security/advisories/GHSA-cpwx-vrp4-4pq7</a></li> </ul> <h2>3.1.5</h2> <p>This is the Jinja 3.1.5 security fix release, which fixes security issues and bugs but does not otherwise change behavior and should not result in breaking changes compared to the latest feature release.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.5/">https://pypi.org/project/Jinja2/3.1.5/</a> Changes: <a href="https://jinja.palletsprojects.com/changes/#version-3-1-5">https://jinja.palletsprojects.com/changes/#version-3-1-5</a> Milestone: <a href="https://github.com/pallets/jinja/milestone/16?closed=1">https://github.com/pallets/jinja/milestone/16?closed=1</a></p> <ul> <li>The sandboxed environment handles indirect calls to <code>str.format</code>, such as by passing a stored reference to a filter that calls its argument. 
<a href="https://github.com/pallets/jinja/security/advisories/GHSA-q2x7-8rv6-6q7h">GHSA-q2x7-8rv6-6q7h</a></li> <li>Escape template name before formatting it into error messages, to avoid issues with names that contain f-string syntax. <a href="https://redirect.github.com/pallets/jinja/issues/1792">#1792</a>, <a href="https://github.com/pallets/jinja/security/advisories/GHSA-gmj6-6f8f-6699">GHSA-gmj6-6f8f-6699</a></li> <li>Sandbox does not allow <code>clear</code> and <code>pop</code> on known mutable sequence types. <a href="https://redirect.github.com/pallets/jinja/issues/2032">#2032</a></li> <li>Calling sync <code>render</code> for an async template uses <code>asyncio.run</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1952">#1952</a></li> <li>Avoid unclosed <code>auto_aiter</code> warnings. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Return an <code>aclose</code>-able <code>AsyncGenerator</code> from <code>Template.generate_async</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Avoid leaving <code>root_render_func()</code> unclosed in <code>Template.generate_async</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>Avoid leaving async generators unclosed in blocks, includes and extends. <a href="https://redirect.github.com/pallets/jinja/issues/1960">#1960</a></li> <li>The runtime uses the correct <code>concat</code> function for the current environment when calling block references. <a href="https://redirect.github.com/pallets/jinja/issues/1701">#1701</a></li> <li>Make <code>\|unique</code> async-aware, allowing it to be used after another async-aware filter. <a href="https://redirect.github.com/pallets/jinja/issues/1781">#1781</a></li> <li><code>\|int</code> filter handles <code>OverflowError</code> from scientific notation. 
<a href="https://redirect.github.com/pallets/jinja/issues/1921">#1921</a></li> <li>Make compiling deterministic for tuple unpacking in a <code>{% set ... %}</code> call. <a href="https://redirect.github.com/pallets/jinja/issues/2021">#2021</a></li> <li>Fix dunder protocol (<code>copy</code>/<code>pickle</code>/etc) interaction with <code>Undefined</code> objects. <a href="https://redirect.github.com/pallets/jinja/issues/2025">#2025</a></li> <li>Fix <code>copy</code>/<code>pickle</code> support for the internal <code>missing</code> object. <a href="https://redirect.github.com/pallets/jinja/issues/2027">#2027</a></li> <li><code>Environment.overlay(enable_async)</code> is applied correctly. <a href="https://redirect.github.com/pallets/jinja/issues/2061">#2061</a></li> <li>The error message from <code>FileSystemLoader</code> includes the paths that were searched. <a href="https://redirect.github.com/pallets/jinja/issues/1661">#1661</a></li> <li><code>PackageLoader</code> shows a clearer error message when the package does not contain the templates directory. <a href="https://redirect.github.com/pallets/jinja/issues/1705">#1705</a></li> <li>Improve annotations for methods returning copies. <a href="https://redirect.github.com/pallets/jinja/issues/1880">#1880</a></li> <li><code>urlize</code> does not add <code>mailto:</code> to values like <code>@a@b</code>. <a href="https://redirect.github.com/pallets/jinja/issues/1870">#1870</a></li> <li>Tests decorated with <code>@pass_context</code> can be used with the <code>\|select</code> filter. <a href="https://redirect.github.com/pallets/jinja/issues/1624">#1624</a></li> <li>Using <code>set</code> for multiple assignment (<code>a, b = 1, 2</code>) does not fail when the target is a namespace attribute. 
<a href="https://redirect.github.com/pallets/jinja/issues/1413">#1413</a></li> <li>Using <code>set</code> in all branches of <code>{% if %}{% elif %}{% else %}</code> blocks does not cause the variable to be considered initially undefined. <a href="https://redirect.github.com/pallets/jinja/issues/1253">#1253</a></li> </ul> <h2>3.1.4</h2> <p>This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise change behavior and should not result in breaking changes.</p> <p>PyPI: <a href="https://pypi.org/project/Jinja2/3.1.4/">https://pypi.org/project/Jinja2/3.1.4/</a> Changes: <a href="https://jinja.palletsprojects.com/en/3.1.x/changes/#version-3-1-4">https://jinja.palletsprojects.com/en/3.1.x/changes/#version-3-1-4</a></p> <ul> <li>The <code>xmlattr</code> filter does not allow keys with <code>/</code> solidus, <code>></code> greater-than sign, or <code>=</code> equals sign, in addition to disallowing spaces. Regardless of any validation done by Jinja, user input should never be used as keys to this filter, or must be separately validated first. GHSA-h75v-3vvj-5mfj</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/pallets/jinja/blob/main/CHANGES.rst">jinja2's changelog</a>.</em></p> <blockquote> <h2>Version 3.1.6</h2> <p>Released 2025-03-05</p> <ul> <li>The <code>\|attr</code> filter does not bypass the environment's attribute lookup, allowing the sandbox to apply its checks. :ghsa:<code>cpwx-vrp4-4pq7</code></li> </ul> <h2>Version 3.1.5</h2> <p>Released 2024-12-21</p> <ul> <li>The sandboxed environment handles indirect calls to <code>str.format</code>, such as by passing a stored reference to a filter that calls its argument. :ghsa:<code>q2x7-8rv6-6q7h</code></li> <li>Escape template name before formatting it into error messages, to avoid issues with names that contain f-string syntax. 
:issue:<code>1792</code>, :ghsa:<code>gmj6-6f8f-6699</code></li> <li>Sandbox does not allow <code>clear</code> and <code>pop</code> on known mutable sequence types. :issue:<code>2032</code></li> <li>Calling sync <code>render</code> for an async template uses <code>asyncio.run</code>. :pr:<code>1952</code></li> <li>Avoid unclosed <code>auto_aiter</code> warnings. :pr:<code>1960</code></li> <li>Return an <code>aclose</code>-able <code>AsyncGenerator</code> from <code>Template.generate_async</code>. :pr:<code>1960</code></li> <li>Avoid leaving <code>root_render_func()</code> unclosed in <code>Template.generate_async</code>. :pr:<code>1960</code></li> <li>Avoid leaving async generators unclosed in blocks, includes and extends. :pr:<code>1960</code></li> <li>The runtime uses the correct <code>concat</code> function for the current environment when calling block references. :issue:<code>1701</code></li> <li>Make <code>\|unique</code> async-aware, allowing it to be used after another async-aware filter. :issue:<code>1781</code></li> <li><code>\|int</code> filter handles <code>OverflowError</code> from scientific notation. :issue:<code>1921</code></li> <li>Make compiling deterministic for tuple unpacking in a <code>{% set ... %}</code> call. :issue:<code>2021</code></li> <li>Fix dunder protocol (<code>copy</code>/<code>pickle</code>/etc) interaction with <code>Undefined</code> objects. :issue:<code>2025</code></li> <li>Fix <code>copy</code>/<code>pickle</code> support for the internal <code>missing</code> object. :issue:<code>2027</code></li> <li><code>Environment.overlay(enable_async)</code> is applied correctly. :pr:<code>2061</code></li> <li>The error message from <code>FileSystemLoader</code> includes the paths that were searched. :issue:<code>1661</code></li> <li><code>PackageLoader</code> shows a clearer error message when the package does not contain the templates directory. :issue:<code>1705</code></li> <li>Improve annotations for methods returning copies. 
:pr:<code>1880</code></li> <li><code>urlize</code> does not add <code>mailto:</code> to values like <code>@a@b</code>. :pr:<code>1870</code></li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`15206881c0`"><code>1520688</code></a> release version 3.1.6</li> <li><a href="`90457bbf33`"><code>90457bb</code></a> Merge commit from fork</li> <li><a href="`065334d1ee`"><code>065334d</code></a> attr filter uses env.getattr</li> <li><a href="`033c20015c`"><code>033c200</code></a> start version 3.1.6</li> <li><a href="`bc68d4efa9`"><code>bc68d4e</code></a> use global contributing guide (<a href="https://redirect.github.com/pallets/jinja/issues/2070">#2070</a>)</li> <li><a href="`247de5e0c5`"><code>247de5e</code></a> use global contributing guide</li> <li><a href="`ab8218c7a1`"><code>ab8218c</code></a> use project advisory link instead of global</li> <li><a href="`b4ffc8ff29`"><code>b4ffc8f</code></a> release version 3.1.5 (<a href="https://redirect.github.com/pallets/jinja/issues/2066">#2066</a>)</li> <li><a href="`877f6e51be`"><code>877f6e5</code></a> release version 3.1.5</li> <li><a href="`8d58859265`"><code>8d58859</code></a> remove test pypi</li> <li>Additional commits viewable in <a href="https://github.com/pallets/jinja/compare/3.1.3...3.1.6">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=jinja2&package-manager=pip&previous-version=3.1.3&new-version=3.1.6)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-28 15:36:56 +03:00
dependabot[bot]	c7f5e2b975	Bump tornado from 6.4 to 6.4.2 in /src/test/regress (#7984 ) Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4 to 6.4.2. <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst">tornado's changelog</a>.</em></p> <blockquote> <h1>Release notes</h1> <p>.. toctree:: :maxdepth: 2</p> <p>releases/v6.5.0 releases/v6.4.2 releases/v6.4.1 releases/v6.4.0 releases/v6.3.3 releases/v6.3.2 releases/v6.3.1 releases/v6.3.0 releases/v6.2.0 releases/v6.1.0 releases/v6.0.4 releases/v6.0.3 releases/v6.0.2 releases/v6.0.1 releases/v6.0.0 releases/v5.1.1 releases/v5.1.0 releases/v5.0.2 releases/v5.0.1 releases/v5.0.0 releases/v4.5.3 releases/v4.5.2 releases/v4.5.1 releases/v4.5.0 releases/v4.4.3 releases/v4.4.2 releases/v4.4.1 releases/v4.4.0 releases/v4.3.0 releases/v4.2.1 releases/v4.2.0 releases/v4.1.0 releases/v4.0.2 releases/v4.0.1 releases/v4.0.0 releases/v3.2.2 releases/v3.2.1 releases/v3.2.0 releases/v3.1.1 releases/v3.1.0 releases/v3.0.2 releases/v3.0.1 releases/v3.0.0 releases/v2.4.1</p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`a5ecfab15e`"><code>a5ecfab</code></a> Bump version to 6.4.2</li> <li><a href="`bc7df6bafd`"><code>bc7df6b</code></a> Fix tests with Twisted 24.7.0</li> <li><a href="`d5ba4a1695`"><code>d5ba4a1</code></a> httputil: Fix quadratic performance of cookie parsing</li> <li><a href="`2a0e1d13b5`"><code>2a0e1d1</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3388">#3388</a> from bdarnell/release-641</li> <li><a href="`b7af4e8f5e`"><code>b7af4e8</code></a> Release notes and version bump for version 6.4.1</li> <li><a href="`d65f6e71a7`"><code>d65f6e7</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3387">#3387</a> from bdarnell/chunked-parsing</li> <li><a href="`8d721a877d`"><code>8d721a8</code></a> httputil: Only strip tabs and spaces from header values</li> <li><a href="`7786f09f84`"><code>7786f09</code></a> Merge pull request <a href="https://redirect.github.com/tornadoweb/tornado/issues/3386">#3386</a> from bdarnell/curl-crlf</li> <li><a href="`fb119c767e`"><code>fb119c7</code></a> http1connection: Stricter handling of transfer-encoding</li> <li><a href="`b0ffc58e02`"><code>b0ffc58</code></a> curl_httpclient,http1connection: Prohibit CR and LF in headers</li> <li>Additional commits viewable in <a href="https://github.com/tornadoweb/tornado/compare/v6.4.0...v6.4.2">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=tornado&package-manager=pip&previous-version=6.4&new-version=6.4.2)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: ibrahim halatci <ihalatci@gmail.com>	2025-05-26 10:59:59 +03:00
ibrahim halatci	282523549e	bumped codeql version to v3 (#7999 ) DESCRIPTION: bumped codeql version to v3	2025-05-23 14:13:33 +03:00
Naisila Puka	c98341e4ed	Bump PG versions to 17.5, 16.9, 15.13 (#7986 ) Nontrivial bump because of the following PG15.3 commit 317aba70e https://github.com/postgres/postgres/commit/317aba70e Previously, when views were converted to RTE_SUBQUERY the relid would be cleared in PG15. In this patch of PG15, relid is retained. Therefore, we add a check with the "relkind and rtekind" to identify the converted views in 15.13 Sister PR https://github.com/citusdata/the-process/pull/164 Using dev image sha because I encountered the libpq symlink issue again with "-v219b87c"	2025-05-22 14:08:03 +02:00
Onur Tirtir	8d2fbca8ef	Fix unsafe memory access in citus_unmark_object_distributed() (#7985 ) _Since we've never released a Citus release that contains the commit that introduced this bug (see #7461), we don't need to have a DESCRIPTION line that shows up in release changelog._ From 8 valgrind test targets run for release-13.1 with PG 17.5, we got 1344 stack traces and except one of them, they were all about below unsafe memory access because this is a very hot code-path that we execute via our drop trigger. On main, even `make -C src/test/regress/ check-base-vg` dumps this stack trace with PG 16/17 to src/test/regress/citus_valgrind_test_log.txt when executing "multi_cluster_management", and this is not the case with this PR anymore. ```c ==27337== VALGRINDERROR-BEGIN ==27337== Conditional jump or move depends on uninitialised value(s) ==27337== at 0x7E26B68: citus_unmark_object_distributed (home/onurctirtir/citus/src/backend/distributed/metadata/distobject.c:113) ==27337== by 0x7E26CC7: master_unmark_object_distributed (home/onurctirtir/citus/src/backend/distributed/metadata/distobject.c:153) ==27337== by 0x4BD852: ExecInterpExpr (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execExprInterp.c:758) ==27337== by 0x4BFD00: ExecInterpExprStillValid (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execExprInterp.c:1870) ==27337== by 0x51D82C: ExecEvalExprSwitchContext (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/../../../src/include/executor/executor.h:355) ==27337== by 0x51D8A4: ExecProject (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/../../../src/include/executor/executor.h:389) ==27337== by 0x51DADB: ExecResult (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/nodeResult.c:136) ==27337== by 0x4D72ED: ExecProcNodeFirst (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execProcnode.c:464) ==27337== by 0x4CA394: ExecProcNode 
(home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/../../../src/include/executor/executor.h:273) ==27337== by 0x4CD34C: ExecutePlan (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execMain.c:1670) ==27337== by 0x4CAA7C: standard_ExecutorRun (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execMain.c:365) ==27337== by 0x7E1E475: CitusExecutorRun (home/onurctirtir/citus/src/backend/distributed/executor/multi_executor.c:238) ==27337== Uninitialised value was created by a heap allocation ==27337== at 0x4848899: malloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so) ==27337== by 0x9AB1F7: AllocSetContextCreateInternal (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/utils/mmgr/aset.c:438) ==27337== by 0x4E0D56: CreateExprContextInternal (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execUtils.c:261) ==27337== by 0x4E0E3E: CreateExprContext (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execUtils.c:311) ==27337== by 0x4E10D9: ExecAssignExprContext (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execUtils.c:490) ==27337== by 0x51EE09: ExecInitSeqScan (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/nodeSeqscan.c:147) ==27337== by 0x4D6CE1: ExecInitNode (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execProcnode.c:210) ==27337== by 0x5243C7: ExecInitSubqueryScan (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/nodeSubqueryscan.c:126) ==27337== by 0x4D6DD9: ExecInitNode (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execProcnode.c:250) ==27337== by 0x4F05B2: ExecInitAppend (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/nodeAppend.c:223) ==27337== by 0x4D6C46: ExecInitNode (home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/execProcnode.c:182) ==27337== by 0x52003D: ExecInitSetOp 
(home/onurctirtir/.pgenv/src/postgresql-16.2/src/backend/executor/nodeSetOp.c:530) ==27337== ==27337== VALGRINDERROR-END ```	2025-05-20 15:22:35 +03:00
Alper Kocatas	088ba75057	Add citus_nodes view (#7968 ) DESCRIPTION: Adds `citus_nodes` view that displays the node name, port, role, and "active" for nodes in the cluster. This PR adds `citus_nodes` view to the `pg_catalog` schema. The `citus_nodes` view is created in the `citus` schema and is used to display the node name, port, role, and active status of each node in the `pg_dist_node` table. The view is granted `SELECT` permission to the `PUBLIC` role and is set to the `pg_catalog` schema. Test cases were added to the `multi_cluster_management` tests. structs.py was modified to add whitespace as `citus_indent` required. --------- Co-authored-by: Alper Kocatas <alperkocatas@microsoft.com>	2025-05-14 15:05:12 +03:00
Naisila Puka	a18040869a	Error out for queries with outer joins and pseudoconstant quals in PG<17 (#7937 ) PG15 commit d1ef5631e620f9a5b6480a32bb70124c857af4f1 and PG16 commit 695f5deb7902865901eb2d50a70523af655c3a00 disallow replacing joins with scans in queries with pseudoconstant quals. This commit prevents the set_join_pathlist_hook from being called if any of the join restrictions is a pseudo-constant. So in these cases, citus has no info on the join, never sees that the query has an outer join, and ends up producing an incorrect plan. PG17 fixes this by commit 9e9931d2bf40e2fea447d779c2e133c2c1256ef3 Therefore, we take this extra measure here for PG versions less than 17. hasOuterJoin can never be true when set_join_pathlist_hook is absent.	2025-05-11 21:47:28 +00:00
Mehmet YILMAZ	a4040ba5da	Planner: lift volatile target‑list items in `WrapSubquery` to coordinator (prevents sequence‑leap in distributed `INSERT … SELECT`) (#7976 ) This PR fixes #7784 and refactors the `WrapSubquery(Query subquery)` function to improve clarity and correctness when handling volatile expressions in subqueries during Citus insert-select rewriting. ### Background The `WrapSubquery` function rewrites a query of the form: ```sql INSERT INTO target_table SELECT ... FROM ... ``` ...by wrapping the `SELECT` in a subquery: ```sql SELECT <outer-TL> FROM ( <subquery with volatile expressions replaced with NULL> ) citus_insert_select_subquery ``` This transformation allows: Volatile expressions (e.g., `nextval`, `now`) not used in `GROUP BY` or `ORDER BY` to be evaluated exactly once on the coordinator. * Stable/immutable or sort-relevant expressions to remain in the worker-executed subquery. * Placeholder `NULL`s to maintain column alignment in the inner subquery. ### Fix Details * Restructured the code into labeled logical sections: 1. Build wrapper query (`SELECT … FROM (subquery)`) 2. Rewrite target lists with volatility analysis 3. Assign and return updated query trees * Preserved existing behavior, focusing on clarity and maintainability. ### How the new code handles volatile items stage \| what we look for \| what we do \| why -- \| -- \| -- \| -- scan target list once \| 1. `expr_is_volatile(te->expr)` 2. `te->ressortgroupref != 0` (is the column used in GROUP BY / ORDER BY?) 
\| decide whether to hoist or keep \| we must not hoist an expression the inner query still needs for sorting/grouping, otherwise its `SortGroupClause` breaks volatile & not used in sort/group \| deep‑copy the expression into the outer target list \| executes once on the coordinator \| \| leave a typed `NULL `placeholder (visible, not `resjunk`) in the inner target list \| keeps column numbering stable for helpers that already ran (reorder, cast); the worker sends a cheap constant \| stable / immutable, or volatile but used in sort/group \| keep the original expression in the inner list; outer list references it via a `Var `\| workers can evaluate it safely and, if needed, the inner ORDER BY still works \| ### Example Given this query: ```sql INSERT INTO t SELECT nextval('s'), 42 FROM generate_series(1, 2); ``` The planner rewrites it as: ```sql SELECT nextval('s'), col2 FROM (SELECT NULL::bigint AS col1, 42 AS col2 FROM generate_series(1, 2)) citus_insert_select_subquery; ``` This ensures `nextval('s')` is evaluated only once per row on the coordinator, not on each worker node, preserving correct sequence semantics. #### Outer‑Var guard (`FindReferencedTableColumn`) Because `WrapSubquery` adds an extra query level, lots of Vars that the old code never expected become “outer” Vars; without teaching `FindReferencedTableColumn` to climb that extra level reliably, Citus would intermittently reject valid foreign keys and even hit asserts. * Re‑implemented the outer‑Var guard so that the function: * Walks deterministically up the query stack when `skipOuterVars = false` (default for FK / UNION checks). A new while‑loop copies — rather than truncates — `parentQueryList` on each hop, eliminating list‑aliasing that made issue 5248 fail intermittently in parallel regressions. * Handles multi‑level `varlevelsup` in a single loop; never mutates the caller’s list in place.	2025-05-06 17:45:49 +03:00
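The hoisting rule described in the `WrapSubquery` commit above can be modelled in a few lines. This is only a sketch under stated assumptions: `TargetItem`, `Placement`, `decide_placement` and `count_hoisted` are hypothetical names for illustration, not Citus or PostgreSQL identifiers, and the real code works on `TargetEntry` nodes rather than on flags.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, flattened view of one target-list entry. */
typedef struct TargetItem
{
    bool isVolatile;        /* expression is volatile, e.g. nextval() */
    unsigned sortGroupRef;  /* non-zero if GROUP BY / ORDER BY uses it */
} TargetItem;

typedef enum Placement
{
    KEEP_IN_INNER_QUERY,    /* workers evaluate it; outer list refers to it */
    HOIST_TO_OUTER_QUERY    /* coordinator evaluates it; inner list gets NULL */
} Placement;

/*
 * Hoist only volatile expressions that the inner query does not need for
 * sorting or grouping; everything else stays where it is.
 */
Placement
decide_placement(const TargetItem *item)
{
    if (item->isVolatile && item->sortGroupRef == 0)
    {
        return HOIST_TO_OUTER_QUERY;
    }
    return KEEP_IN_INNER_QUERY;
}

/* Count how many entries of a target list would be hoisted. */
size_t
count_hoisted(const TargetItem *items, size_t itemCount)
{
    size_t hoisted = 0;
    for (size_t i = 0; i < itemCount; i++)
    {
        if (decide_placement(&items[i]) == HOIST_TO_OUTER_QUERY)
        {
            hoisted++;
        }
    }
    return hoisted;
}
```

For the `INSERT INTO t SELECT nextval('s'), 42 ...` example in the commit message, the first entry would be hoisted and the constant `42` would stay in the inner query.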
Colm	d4dd44e715	Propagate SECURITY LABEL on tables and columns. (#7956 ) Issue #7709 asks for security labels on columns to be propagated, to support the `anon` extension. Before, Citus supported security labels on roles (#7735) and this PR adds support for propagating security labels on tables and columns. All scenarios that involve propagating metadata for a Citus table now include the security labels on the table and on the columns of the table. These scenarios are: - When a table becomes distributed using `create_distributed_table()` or `create_reference_table()`, its security labels (if any) are propageted. - When a security label is defined on a distributed table, or one of its columns, the label is propagated. - When a node is added to a Citus cluster, all distributed tables have their security labels propagated. - When a column of a distributed table is dropped, any security labels on the column are also dropped. - When a column is added to a distributed table, security labels can be defined on the column and are propagated. - Security labels on a distributed table or its columns are not propagated when `citus.enable_metadata_sync` is enabled. Regress test `seclabel` is extended with tests to cover these scenarios. The implementation is somewhat involved because it impacts DDL propagation of Citus tables, but can be broken down as follows: - distributed_object_ops has `Role_SecLabel`, `Table_SecLabel` and `Column_SecLabel` to take care of security labels on roles, tables and columns. `Any_SecLabel` is used for all other security labels and is essentially a nop. - Deparser support - `DeparseRoleSecLabelStmt()`, `DeparseTableSecLabelStmt()` and `DeparseColumnSecLabelStmt()` take care of deparsing security label statements on roles, tables and columns respectively. 
- When reconstructing the DDL for a citus table, security labels on the table or its columns are included by having `GetPreLoadTableCreationCommands()` call a new function `CreateSecurityLabelCommands()` to take care of any security labels on the table or its columns. - When changing a distributed table name to a shard name before running a command locally on a worker, function `RelayEventExtendNames()` checks for security labels on a table or its columns.	2025-04-30 18:03:52 +01:00
Onur Tirtir	ea7aa6712d	Move stat view implementations into a submodule (#7975 ) Also move serialize_distributed_ddls into commands submodule, seems like an oversight from last year (by me).	2025-04-29 14:22:29 +03:00
Onur Tirtir	d2e6cf1de0	Fix dev documentation for stat counters (#7974 ) Minor updates on the relevant portion of the tech readme and a code comment in stat_counters.c	2025-04-29 11:35:58 +05:00
Onur Tirtir	3d61c4dc71	Add citus_stat_counters view and citus_stat_counters_reset() function to reset it (#7917 ) DESCRIPTION: Adds citus_stat_counters view that can be used to query stat counters that Citus collects while the feature is enabled, which is controlled by citus.enable_stat_counters. citus_stat_counters() can be used to query the stat counters for the provided database oid and citus_stat_counters_reset() can be used to reset them for the provided database oid or for the current database if nothing or 0 is provided. Today we don't persist stat counters on server shutdown. In other words, stat counters are automatically reset in case of a server restart. Details on the underlying design can be found in header comment of stat_counters.c and in the technical readme. ------- Here are the details about what we track as of this PR: For connection management, we have three statistics about the inter-node connections initiated by the node itself: * connection_establishment_succeeded * connection_establishment_failed * connection_reused While the first two are relatively easier to understand, the third one covers the case where a connection is reused. This can happen when a connection was already established to the desired node, Citus decided to cache it for some time (see citus.max_cached_conns_per_worker & citus.max_cached_connection_lifetime), and then reused it for a new remote operation. Here are the other important details about these connection statistics: 1. connection_establishment_failed doesn't care about the connections that we could establish but are lost later in the transaction. Plus, we cannot guarantee that the connections that are counted in connection_establishment_succeeded were not lost later. 2. 
connection_establishment_failed doesn't care about the optional connections (see OPTIONAL_CONNECTION flag) that we gave up establishing because of the connection throttling rules we follow (see citus.max_shared_pool_size & citus.local_shared_pool_size). The reaason for this is that we didn't even try to establish these connections. 3. For the rest of the cases where a connection failed for some reason, we always increment connection_establishment_failed even if the caller was okay with the failure and know how to recover from it (e.g., the adaptive executor knows how to fall back local execution when the target node is the local node and if it cannot establish a connection to the local node). The reason is that even if it's likely that we can still serve the operation, we still failed to establish the connection and we want to track this. 4. Finally, the connection failures that we count in connection_establishment_failed might be caused by any of the following reasons and for now we prefer to _not_ further distinguish them for simplicity: a. remote node is down or cannot accept any more connections, or overloaded such that citus.node_connection_timeout is not enough to establish a connection b. any internal Citus error that might result in preparing a bad connection string so that libpq fails when parsing the connection string even before actually trying to establish a connection via connect() call c. broken citus.node_conninfo or such Citus configuration that was incorrectly set by the user can also result in similar outcomes as in b d. internal waitevent set / poll errors or OOM in local node We also track two more statistics for query execution: * query_execution_single_shard * query_execution_multi_shard And more importantly, both query_execution_single_shard and query_execution_multi_shard are not only tracked for the top-level queries but also for the subplans etc. 
The reason is that for some queries, e.g., the ones that go through recursive planning, after Citus performs the heavy work as part of subplans, the work that needs to be done for the top-level query becomes quite straightforward. And for such query types, it would be deceiving if we only incremented the query stat counters for the top-level query. Similarly, for non-pushable INSERT .. SELECT and MERGE queries, we perform separate counter increments for the SELECT / source part of the query besides the final INSERT / MERGE query.	2025-04-28 12:23:52 +00:00
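The counting rules for the three connection statistics in the commit above can be summarised as a small state update. This is a rough sketch, assuming a hypothetical `ConnOutcome` enum and `ConnCounters` struct; the actual bookkeeping in stat_counters.c is not shown in the commit message and is likely organised differently.

```c
/* Hypothetical outcomes of asking for a connection to a node. */
typedef enum ConnOutcome
{
    CONN_ESTABLISHED,       /* a new connection was established */
    CONN_FAILED,            /* an attempt was made and it failed */
    CONN_REUSED,            /* a cached connection was picked up again */
    CONN_SKIPPED_OPTIONAL   /* optional connection given up due to throttling */
} ConnOutcome;

/* Hypothetical counter set mirroring the three statistics named above. */
typedef struct ConnCounters
{
    unsigned long establishmentSucceeded;
    unsigned long establishmentFailed;
    unsigned long reused;
} ConnCounters;

void
record_outcome(ConnCounters *counters, ConnOutcome outcome)
{
    switch (outcome)
    {
        case CONN_ESTABLISHED:
            counters->establishmentSucceeded++;
            break;

        case CONN_FAILED:
            /* counted even if the caller can recover from the failure */
            counters->establishmentFailed++;
            break;

        case CONN_REUSED:
            counters->reused++;
            break;

        case CONN_SKIPPED_OPTIONAL:
            /* no attempt was made, so nothing is counted */
            break;
    }
}
```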
ThomasC02	37e23f44b4	Add Support for CASCADE/RESTRICT in REVOKE statements (#7958 ) Fixes #7105. DESCRIPTION: Fixes a bug that causes omitting CASCADE clause for the commands sent to workers for REVOKE commands on tables. --------- Co-authored-by: ThomasC02 <thomascantrell02@gmail.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Tiago Silva <tiagos3373@gmail.com>	2025-04-26 01:13:41 +03:00
Karina	48d89c9c1b	Adjust max_prepared_transactions only when it is default (#7712 ) DESCRIPTION: Adjusts max_prepared_transactions only when it's set to default on PG >= 16 Fixes #7711. Change AdjustMaxPreparedTransactions to really check if max_prepared_transactions is explicitly set by user, and only adjust max_prepared_transactions when it is default. This fixes 021_twophase test failure with loaded Citus library after postgres/postgres@b39c5272. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2025-04-24 11:11:49 +00:00
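The idea in the commit above is to look at where the setting came from rather than only at its value. A minimal sketch, assuming a hypothetical `IntSetting` struct and `SettingSource` enum; the factor of two times `maxConnections` is an assumption made for illustration and is not stated in the commit message.

```c
/* Hypothetical origin of a configuration value. */
typedef enum SettingSource
{
    SOURCE_DEFAULT,   /* nobody set it; the built-in default is in effect */
    SOURCE_USER       /* set explicitly, e.g. in a config file */
} SettingSource;

typedef struct IntSetting
{
    int value;
    SettingSource source;
} IntSetting;

/*
 * Return the value to use. An explicit user setting is left alone even when
 * it happens to equal the built-in default.
 */
int
adjusted_max_prepared_transactions(IntSetting current, int maxConnections)
{
    if (current.source != SOURCE_DEFAULT)
    {
        return current.value;
    }

    /* assumed adjustment rule, for illustration only */
    return maxConnections * 2;
}
```

With this shape, a user who deliberately sets the value to `0` keeps `0`, while an untouched default is still adjusted.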
Mehmet YILMAZ	bb9d90ecc3	Update "Build & Test" workflow to use ubuntu-latest (#7959 ) The retirement of the ubuntu-20.04 runner has been announced by GitHub, with its removal scheduled for April 15, 2025. To ensure uninterrupted execution of CI workflows, "Build & Test" workflow can use the ubuntu-latest runner. It currently points to Ubuntu 22.04 and will automatically track supported versions going forward.	2025-04-18 11:14:30 +03:00
manaldush	0e6127c4f6	AddressSanitizer: stack-use-after-scope on distributed_planner:HasUnresolvedExternParamsWalker (#7948 ) The variable externParamPlaceholder is created on the stack, and its address is used for paramFetch. Postgres code returns the address of externParamPlaceholder to externParam; control flow then leaves the variable's scope, and the pointer is dereferenced after the stack variable has gone out of scope. Fixes https://github.com/citusdata/citus/issues/7941. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-04-04 13:27:56 +00:00
manaldush	f084b79a4b	AddressSanitizer: stack-use-after-scope on address in CreateBackgroundJob (#7949 ) The variable jobTypeName is created on the stack and its value is used through a pointer in heap_form_tuple, so the stack memory is used after it has gone out of scope. The issue was detected with AddressSanitizer. Fixes #7943.	2025-04-04 13:03:41 +00:00
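The stack-use-after-scope pattern behind the commit above is easiest to see in a reduced form. The sketch below shows the safe shape only; `ParamSlot`, `fetch_param` and `read_param` are hypothetical stand-ins and not the PostgreSQL or Citus definitions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a parameter record. */
typedef struct ParamSlot
{
    int value;
    int isnull;
} ParamSlot;

/*
 * Hypothetical fetch callback: like a parameter fetch hook, it may hand back
 * the address of the caller-provided workspace instead of allocating memory.
 */
static ParamSlot *
fetch_param(int paramId, ParamSlot *workspace)
{
    workspace->value = paramId * 10;
    workspace->isnull = 0;
    return workspace;
}

/*
 * Safe shape: the workspace lives in the same scope as the pointer that may
 * alias it, so it is still alive when the pointer is dereferenced. The buggy
 * shape would declare the workspace inside the `if` block and read through
 * `param` after that block had ended.
 */
int
read_param(int paramId, int useFetchHook)
{
    ParamSlot workspace = { 0, 1 };
    ParamSlot *param = NULL;

    if (useFetchHook)
    {
        param = fetch_param(paramId, &workspace);
    }

    assert(param == NULL || param == &workspace);
    return (param != NULL && !param->isnull) ? param->value : -1;
}
```

Compiling the buggy variant with `-fsanitize=address` is what typically surfaces this class of error, since the read may appear to work in an ordinary build.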
Cédric Villemain	1dc60e38bb	Propagates GRANT/REVOKE rights on table columns (#7918 ) This commit adds support for GRANT/REVOKE on table columns. It extends propagated DDL according to this logic: https://github.com/citusdata/citus/tree/main/src/backend/distributed#ddl * Unchanged pre-existing behavior related to splitting ddl per relation during propagation. * Changed the way ACL are checked in some cases (see `EnsureTablePermissions()` and associated commits) * Rewrite `pg_get_table_grants` to include column grants as well * Add missing `pfree()` in `pg_get_table_grants()` Fixes https://github.com/citusdata/citus/issues/7287 Also check a box in https://github.com/citusdata/citus/issues/4812	2025-04-04 11:54:16 +03:00
Cédric Villemain	a7e686c106	Make sure to prevent INSERT INTO ... SELECT queries involving subfield or sublink (#7912 ) DESCRIPTION: Makes sure to prevent `INSERT INTO ... SELECT` queries involving subfield or sublink, to avoid crashes The following query was crashing the backend: ``` INSERT INTO field_indirection_test_1 ( int_col, ct1_col.int_1,ct1_col.int_2 ) SELECT 0, 1, 2; -- crash ``` En passant, added more tests with sublink in distributed_types and found another query with wrong behavior: ``` INSERT INTO domain_indirection_test (f1,f3.if1) SELECT 0, 1; ERROR: could not find a conversion path from type 23 to 17619 -- not the expected ERROR ``` Fixed them by using `strip_implicit_coercions()` on target entry expression before checking for the presence of a subscript or fieldstore, else we fail to find the existing ones and wrongly accept to execute unsafe query.	2025-03-27 09:39:43 +00:00
naisila	88904eda97	Update changelog for 13.0.3 (cherry picked from commit `bbe0539df2`)	2025-03-20 15:45:26 +03:00
eaydingol	9bddf57053	Add changelog for 12.1.7 (#7889 ) Add changelog entries for 12.1.7 --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> (cherry picked from commit `bae20578d4`)	2025-03-20 15:45:26 +03:00
Naisila Puka	4b4fa22b64	Fix mis-deparsing of shard query in "output-table column" name conflict (#7932 ) DESCRIPTION: Fixes a bug in deparsing of shard query in case of "output-table column" name conflict If an `ORDER BY` item in `SELECT` is a bare identifier, the parser _first seeks it as an output column name_ of the `SELECT` (for SQL92 compatibility). However, ruleutils.c is expecting the SQL99 interpretation _where such a name is an input column name_. So it's possible to produce an incorrect display of a view in the (admittedly pretty ill-advised) case where some other column is renamed in the `SELECT` output list to match an `ORDER BY` column. The `DISTINCT ON` expressions are interpreted using the same rules as for `ORDER BY`. We had an issue reported that actually uses `DISTINCT ON`: #7684 Since Citus uses ruleutils deparsing logic to create the shard queries, it would not table-qualify the column names as needed. PG17 fixed this https://github.com/postgres/postgres/commit/a7eb633563c by table-qualifying such names in the dumped view text. Therefore, Citus doesn't reproduce the issue in PG17, since PG17 table-qualifies the column names when needed, and the produced shard queries are correct. This PR applies the PG17 patch to `ruleutils_15.c` and `ruleutils_16.c`. Even though we generally try to avoid modifying the ruleutils files, in this case we are applying a Postgres patch that `ruleutils_17.c` already has: `897d996b8f` Thanks @c2main for your discussion and idea in the issue. Fixes #7684	2025-03-19 14:21:30 +03:00
German Eichberger	1c09469dd2	Adds a method to determine if current node is primary (#7720 ) DESCRIPTION: Adds citus_is_primary_node() UDF to determine if the current node is a primary node in the cluster. --------- Co-authored-by: German Eichberger <geeichbe@microsoft.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-03-18 15:12:42 +00:00
Onur Tirtir	680b870d45	Add STYLEGUIDE.md and update some other md files on best practices (#7347 )	2025-03-14 15:42:59 +00:00
Naisila Puka	ec13c24558	Bump PG versions to 17.4, 16.8, 15.12 (#7925 )	2025-03-14 15:06:07 +03:00
naisila	1d947f0734	Change commit sha	2025-03-14 14:43:01 +03:00
naisila	6b2f113947	Try locally built images	2025-03-14 14:28:34 +03:00
naisila	bdd3ff085d	Try to bump PG versions	2025-03-14 12:11:29 +03:00
Naisila Puka	6b00afac39	Merge release-13.0 commits to main (#7922 ) This is a Merge commit that includes all changes from release-13.0 branch into main branch. This Merge commit adds PG17 support and drops PG14 support from the main branch. Local steps to open this PR and include `release-13.0` commits to the `main` branch: ```bash git checkout release-13.0 git checkout -b naisila/merge_13_0 git rebase main ``` Understandably, the rebase step was a resolve-conflict pain. On top of resolving some conflicts, I had to add some more commits to this PR such that the main branch compiles and runs as we want it to. Mainly there were PG17 additions or PG14 subtractions. I chose this approach as it cleanly stacks _any new_ `release-13.0` changes on top of the current main branch. Only new ones, not stuff there is already on main (we had backported several commits from main to `release-13.0`, so we ignore those in this PR). The idea is to merge all these commits in the main branch, not squash and merge. Note 0: We should remove PG14 tests from required tests as this PR will drop PG14 support in the main branch as well. Note 1: `check-style` fails because it considers `src/backend/distributed/sql/citus--12.1-1--12.2-1.sql` as deleted, and `src/backend/distributed/sql/downgrades/citus--12.2-1--12.1-1.sql` as renamed. The reason is that the downgrade script actually stayed 98% the same therefore was considered a rename. I don't think we can fix this. Note 2: I tried the following approach as well: ```bash git checkout main git checkout -b naisila/merge_13_0 git merge release-13.0 ``` However, this approach was a mess as it included several irrelevant commits that differ between the main and `release-13.0` branch which just make this PR difficult to understand. For reference, I have pushed a different branch with that approach. 
https://github.com/citusdata/citus/tree/naisila/merge_13_0_first_try As you can see it's 156 commits ahead of main, with irrelevant commits such as `1b4d7a51f8`. The reason is that it's including commits from the very first point of divergence between `main` and `release-12.1` branch (because we had cloned `release-13.0` branch from `release-12.1` branch, not `main`).	2025-03-13 15:56:44 +03:00
naisila	10f1a50f1f	Fix dockerfile to remove pg14 and include pg17	2025-03-13 15:15:27 +03:00
naisila	52bf7a1d03	Fix ObjectClass declaration for PG17 since it was removed Relevant PG commit: `89e5ef7e21` 89e5ef7e21812916c9cf9fcf56e45f0f74034656 We had already provided a fix for this in the following commit `da2624cee8` However, this solution wasn't enough for the commits on main. Specifically, we had issues with the following commit: `1d55debb98` Problem: https://github.com/citusdata/citus/actions/runs/13806825532/attempts/1#summary-38619483894 This new solution is better anyway. We define exactly what was previously defined in PG<17.	2025-03-13 15:13:56 +03:00
naisila	1d0bdbd749	Bump Citus into 13.1devel	2025-03-13 15:13:56 +03:00
naisila	be75c0ec4c	Use datlocale in check_database_on_all_nodes function for PG17 This commit also has to do with renaming of daticulocale to datlocale Relevant PG commit: f696c0cd5f299f1b51e214efc55a22a782cc175d `f696c0cd5f` Keeping this commit separate from the previous one because these changes will be different once we drop PG15 support. For now I renamed pg_ge_15_options to pg_ge_15_17_options and together with it I changed the meaning of the variable. However when we drop PG14 support, we will use pg_ge_17_options and delete pg_ge_15_options altogether	2025-03-13 15:13:56 +03:00
naisila	caceb35eba	Some cleanup from dropping pg14	2025-03-13 15:13:56 +03:00
naisila	08913e27d7	PG17 renamed Anum_pg_database_daticulocale to Anum_pg_database_datlocale	2025-03-13 15:13:56 +03:00
naisila	17b4122e84	Rename some more foreach_ptr to foreach_declared_ptr	2025-03-13 15:13:56 +03:00
naisila	c02d899b6c	Change StaticAssertStmt for node-wide objects to pg17	2025-03-13 15:13:56 +03:00
ibrahim halatci	421bc462b2	updated change log for the 13.0.2 patch release (#7924 ) updated change log for the 13.0.2 patch release --------- Co-authored-by: Ibrahim Halatci <ihalatci@microsoft.com>	2025-03-13 15:13:56 +03:00
Cédric Villemain	ed40a0ad02	fix issue #7676 : wrong handler around MULTIEXPR (#7914 ) DESCRIPTION: Fixes a bug with `UPDATE SET (...) = (SELECT some_func(),... )` (#7676) Citus was checking for presence of sublink, but forgot to manage multiexpr while evaluating clauses during planning. At this stage (citus planner), it's not always possible to call PostgreSQL code because the tree is not yet ready for PostgreSQL pure executor. Fixes https://github.com/citusdata/citus/issues/7676. Fixed by adding a new function to check sublink or multiexpr in the tree. --------- Co-authored-by: Colm <colmmchugh@microsoft.com>	2025-03-12 16:03:30 +03:00
Mehmet YILMAZ	e50563fbd8	Issue 7887 Enhance AddInsertSelectCasts for Identity Columns (#7920 ) ## Enhance `AddInsertSelectCasts` for Identity Columns This PR fixes #7887 and improves the behavior of partial inserts into identity columns by modifying the `AddInsertSelectCasts` function. Specifically, we introduce special-case handling for `nextval(...)` calls (represented in the parse tree as `NextValueExpr`) to ensure that if the identity column’s declared type differs from `nextval`’s default return type (`int8`), we cast the expression properly. This prevents mismatches like `int8` → `int4` from causing “invalid string enlargement” errors or other type-related failures. When `INSERT ... SELECT` is processed, `AddInsertSelectCasts` reconciles each target column’s type with the corresponding SELECT expression’s type. Historically, for identity columns that rely on `nextval(...)`, we can end up with a mismatch: - `nextval` returns `int8`, - The identity column might be `int4`, `bigint`, or another integer type. Without a correct cast, Postgres or Citus can produce plan-time or runtime errors. By detecting `NextValueExpr` and applying a cast to the column’s type, the final plan ensures consistent insertion without errors. ## What Changed 1. Check for `NextValueExpr`: In `AddInsertSelectCasts`, we now have a code block: ```c if (IsA(selectEntry->expr, NextValueExpr)) { Oid nextvalType = GetNextvalReturnTypeCatalog(); ... // If (targetType != nextvalType), build a cast from int8 -> targetType } else { // fallback to generic mismatch logic } ``` This short-circuits any expression that’s a `nextval(...)` call, letting us explicitly cast to the correct type. 2. Fallback Generic Logic: If it isn’t a `NextValueExpr` (i.e. a normal column or expression mismatch), we still rely on the existing path that compares `sourceType` vs. `targetType` and calls `CastExpr(...)` if they differ. 3. 
`GetNextvalReturnTypeCatalog`: We added or refined a helper function to confirm that `nextval` returns `int8`, or do a `LookupFuncName("nextval", ...)` to discover the function’s return type from `pg_proc`—making it robust if future changes happen. ## Benefits - Partial inserts into identity columns no longer fail with type mismatches. - When `nextval` yields `int8` but the identity column is `int4` (or another type), we properly cast to the column’s type in the plan. - Preserves the existing approach for other columns—only identity calls get the specialized `NextValueExpr` logic. ## Testing - Extended `generatedidentity.sql` test scenario to cover partial inserts into both `GENERATED ALWAYS` and `GENERATED BY DEFAULT` identity columns, including tests for the `OVERRIDING SYSTEM VALUE` clause and partial inserts referencing foreign-key columns.	2025-03-12 12:43:01 +03:00
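The cast decision described in the commit above can be reduced to a comparison of a source type with the target column type. This is a simplified model under stated assumptions: `TypeTag`, `ExprKind`, `SelectExpr` and `needs_cast` are hypothetical names, and the sketch takes the commit message's statement that `nextval` returns `int8` at face value.

```c
#include <stdbool.h>

typedef enum TypeTag { TYPE_INT2, TYPE_INT4, TYPE_INT8 } TypeTag;
typedef enum ExprKind { EXPR_NEXT_VALUE, EXPR_OTHER } ExprKind;

/* Hypothetical, flattened view of one SELECT target expression. */
typedef struct SelectExpr
{
    ExprKind kind;
    TypeTag type;   /* type recorded on the expression node */
} SelectExpr;

/* Assumed to be int8, as the commit message states for nextval. */
static TypeTag
nextval_return_type(void)
{
    return TYPE_INT8;
}

/*
 * For a nextval-style expression, compare the target column type against the
 * return type of nextval; for anything else, use the type on the expression.
 */
bool
needs_cast(const SelectExpr *expr, TypeTag targetType)
{
    TypeTag sourceType = (expr->kind == EXPR_NEXT_VALUE)
                         ? nextval_return_type()
                         : expr->type;

    return sourceType != targetType;
}
```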
Mehmet YILMAZ	756e8f66e0	Remove citus-tools subproject and add gitignore (#7916 )	2025-03-12 12:43:01 +03:00
Muhammad Usama	95da74c47f	Fix Deadlock with transaction recovery is possible during Citus upgrades (#7910 ) DESCRIPTION: Fixes deadlock with transaction recovery that is possible during Citus upgrades. Fixes #7875. This commit addresses two interrelated deadlock issues uncovered during Citus upgrades: 1. Local Deadlock: - Problem: In `RecoverWorkerTransactions()`, a new connection is created for each worker node to perform transaction recovery by locking the `pg_dist_transaction` catalog table until the end of the transaction. When `RecoverTwoPhaseCommits()` calls this function for each worker node, the order of acquiring locks on `pg_dist_authinfo` and `pg_dist_transaction` can alternate. This reversal can lead to a deadlock if any concurrent process requires locks on these tables. - Fix: Pre-establish all worker node connections upfront so that `RecoverWorkerTransactions()` operates with a single, consistent connection. This ensures that locks on `pg_dist_authinfo` and `pg_dist_transaction` are always acquired in the correct order, thereby preventing the local deadlock. 2. Distributed Deadlock: - Problem: After resolving the local deadlock, a distributed deadlock issue emerges. The maintenance daemon calls `RecoverWorkerTransactions()` on each worker node— including the local node—which leads to a complex locking sequence: - A RowExclusiveLock is taken on the `pg_dist_transaction` table in `RecoverWorkerTransactions()`. - An update extension then tries to acquire an AccessExclusiveLock on the same table, getting blocked by the RowExclusiveLock. - A subsequent query (e.g., a SELECT on `pg_prepared_xacts`) issued using a separate connection on the local node gets blocked due to locks held during a call to `BuildCitusTableCacheEntry()`. - The maintenance daemon waits for this query, resulting in a circular wait and stalling the entire cluster. 
- Fix: Avoid cache lookups for internal PostgreSQL tables by implementing an early bailout for relation IDs below `FirstNormalObjectId` (system objects). This eliminates unnecessary calls to `BuildCitusTableCache`, reducing lock contention and mitigating the distributed deadlock. Furthermore, this optimization improves performance in fast connect→query_catalog→disconnect cycles by eliminating redundant cache creation and lookups. 3. Also reverts the commit that disabled the relevant test cases.	2025-03-12 12:43:01 +03:00
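The early bailout described in the fix above can be sketched in a few lines of C. This is a minimal illustration, not the Citus code: the `Oid` typedef is a stand-in for PostgreSQL's type, and `IsSystemRelationId` is a hypothetical helper name. The only fact taken from PostgreSQL is the value of `FirstNormalObjectId` (16384).

```c
#include <stdbool.h>
#include <stdint.h>

/* Stand-in for PostgreSQL's Oid type (an unsigned 32-bit integer). */
typedef uint32_t Oid;

/* Same value PostgreSQL assigns to FirstNormalObjectId. */
#define FIRST_NORMAL_OBJECT_ID ((Oid) 16384)

/*
 * Hypothetical helper: returns true when relationId belongs to a system
 * object. A caller can then return early instead of building a cache
 * entry, which avoids taking the locks that building one requires.
 */
bool
IsSystemRelationId(Oid relationId)
{
	return relationId < FIRST_NORMAL_OBJECT_ID;
}
```

With this check placed before the cache lookup, a catalog relation such as `pg_class` (OID 1259) never reaches the cache-building path, while user-created relations, whose OIDs start at 16384, still do.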
Colm	4139370a1d	#7782 - catch when Postgres planning removes all Citus tables (#7907 ) DESCRIPTION: fix a planning error caused by a redundant WHERE clause Fix a Citus planning glitch that occurs in a DML query when the WHERE clause of the query is of the form: ` WHERE true OR <expression with 1 or more citus tables> ` and this is the only place in the query referencing a citus table. Postgres' standard planner transforms the WHERE clause to: ` WHERE true ` So the query now has no citus tables, confusing the Citus planner as described in issues #7782 and #7783. The fix is to check, after Postgres standard planner, if the Query has been transformed as shown, and re-run the check of whether or not the query needs distributed planning.	2025-03-12 12:43:01 +03:00
Mehmet YILMAZ	87ec3def55	Fix 0-Task Plans in Single-Shard Router When Updating a Local Table with Reference Table in Subquery (#7897 ) This PR fixes issue #7891 in the Citus planner where an `UPDATE` on a local table with a subquery referencing a reference table could produce a 0-task plan. Historically, the planner sometimes failed to detect that both the target and referenced tables were effectively “local,” assigning `INVALID_SHARD_ID` and yielding a no-op plan. ### Root Cause - In the Citus router logic (`PlanRouterQuery`), we relied on `shardId` to determine whether a query should be routed to a single shard. - If `shardId == INVALID_SHARD_ID`, but we also had not marked the query as a “local table modification,” the code path would produce zero tasks. - Local + reference tables do not require multi-shard routing. Failing to detect this “purely local” scenario caused Citus to incorrectly route to zero tasks. ### Changes Enhanced Local Table Detection - Updated `IsLocalTableModification` and related checks to consider both local and reference tables as “local” for planning, preventing the 0-task scenario. - Expanded `ContainsOnlyLocalOrReferenceTables` to return true if there are no fully distributed tables in the query. Added Regress Test - Introduced a new regress test (`issue_7891.sql`) which reproduces the scenario. - Verifies we get a valid single- or local-task plan rather than a 0-task plan.	2025-03-12 12:43:01 +03:00
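The "no fully distributed tables" rule from the change above can be illustrated with a small C sketch. The enum and the function name `OnlyLocalOrReferenceTables` are hypothetical and only model the described behavior; they are not the Citus implementation of `ContainsOnlyLocalOrReferenceTables`.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical classification of the tables a query references. */
typedef enum SketchTableKind
{
	SKETCH_LOCAL_TABLE,
	SKETCH_REFERENCE_TABLE,
	SKETCH_DISTRIBUTED_TABLE
} SketchTableKind;

/*
 * Returns true when none of the referenced tables is a distributed table,
 * i.e. the query can be planned as a purely local modification.
 */
bool
OnlyLocalOrReferenceTables(const SketchTableKind *tables, size_t tableCount)
{
	for (size_t i = 0; i < tableCount; i++)
	{
		if (tables[i] == SKETCH_DISTRIBUTED_TABLE)
		{
			return false;
		}
	}
	return true;
}
```

Under this rule, an `UPDATE` on a local table whose subquery reads a reference table is treated as local and gets a task, instead of falling through to the zero-task path.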
Colm	ec141f696a	Enhance MERGE .. WHEN NOT MATCHED BY SOURCE for repartitioned source (#7900 ) DESCRIPTION: Ensure that a MERGE command on a distributed table with a `WHEN NOT MATCHED BY SOURCE` clause runs against all shards of the distributed table. The Postgres MERGE command updates a table using a table or a query as a data source. It provides three ways to match the target table with the source: `WHEN MATCHED` means that there is a row in both the target and source; `WHEN NOT MATCHED` means that there is a row in the source that has no match (is not present) in the target; and, as of PG17, `WHEN NOT MATCHED BY SOURCE` means that there is a row in the target that has no match in the source. In Citus, when a MERGE command updates a distributed table using a local/reference table or a distributed query as source, that source is repartitioned, and for each repartitioned shard that has data (i.e. 1 or more rows) the MERGE is run against the corresponding distributed table shard. Suppose the distributed table has 32 shards, and the source repartitions into 4 shards that have data, with the remaining 28 shards being empty; then the MERGE command is performed on the 4 corresponding shards of the distributed table. However, the semantics of `WHEN NOT MATCHED BY SOURCE` are that the specified action must be performed on the target for each row in the target that is not in the source; so if the source is empty, all target rows should be updated. To see this, consider the following MERGE command: ``` MERGE INTO target AS t USING source AS s ON t.id = s.id WHEN NOT MATCHED BY SOURCE THEN UPDATE SET col1 = 100 ``` If the source has zero rows then every row in the target is updated so that its col1 value is 100. Currently in Citus a MERGE on a distributed table with a local/reference table or a distributed query as source ignores shards of the distributed table when the corresponding shard of the repartitioned source has zero rows. 
However, if the MERGE command specifies a `WHEN NOT MATCHED BY SOURCE` clause, then the MERGE should be performed on all shards of the distributed table, to ensure that the specified action is performed on the target for each row in the target that is not in the source. This PR enhances Citus MERGE execution so that when a repartitioned source shard has zero rows, and the MERGE command specifies a `WHEN NOT MATCHED BY SOURCE` clause, the MERGE is performed against the corresponding shard of the distributed table using an empty (zero row) relation as source, by generating a query of the form: ``` MERGE INTO target_shard_0002 AS t USING (SELECT id FROM (VALUES (NULL) ) source_0002(id) WHERE FALSE) AS s ON t.id = s.id WHEN NOT MATCHED BY SOURCE THEN UPDATE SET col1 = 100 ``` This works because each row in the target shard will be updated, and `WHEN MATCHED` and `WHEN NOT MATCHED`, if specified, will be no-ops because the source has zero rows. Implementing this when the source is a local or reference table involves teaching the function `ExecuteSourceAtCoordAndRedistribution()` in `merge_executor.c` not to prune tasks when the query has `WHEN NOT MATCHED BY SOURCE`, but instead to replace the task's query with one that uses an empty relation as source. And when the source is a distributed query, the function `ExecuteMergeSourcePlanIntoColocatedIntermediateResults()` (also in `merge_executor.c`), instead of skipping empty tasks, now generates a query that uses an empty relation as source for the corresponding target shard of the distributed table, but again only when the query has `WHEN NOT MATCHED BY SOURCE`. A new function `BuildEmptyResultQuery()` is added to `recursive_planning.c` and it is used by both the aforementioned functions in `merge_executor.c` to build an empty relation to use as the source. It applies the appropriate type to each column of the empty relation so the join with the target makes sense to the query compiler.	2025-03-12 12:43:01 +03:00
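The 32-shard example above can be made concrete with a short C sketch of the task-selection rule. The function name `CountMergeTasks` and its parameters are hypothetical; the sketch only models which target shards receive a MERGE task, not how Citus builds or executes those tasks.

```c
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical helper: counts how many target shards a MERGE runs against,
 * given the row count of each repartitioned source shard.
 *
 * Without WHEN NOT MATCHED BY SOURCE, shards whose source is empty are
 * skipped. With it, every shard gets a task (empty ones would use a
 * zero-row relation as source).
 */
size_t
CountMergeTasks(const long *sourceRowCounts, size_t shardCount,
				bool hasNotMatchedBySource)
{
	size_t taskCount = 0;

	for (size_t i = 0; i < shardCount; i++)
	{
		if (sourceRowCounts[i] > 0 || hasNotMatchedBySource)
		{
			taskCount++;
		}
	}
	return taskCount;
}
```

For a 32-shard table whose source repartitions into 4 non-empty shards, this yields 4 tasks for a plain MERGE and 32 tasks once the command has a `WHEN NOT MATCHED BY SOURCE` clause.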
OlgaSergeyevaB	ccd7ddee36	Custom Scan (ColumnarScan): exclude outer_join_rels from CandidateRelids (#7703 ) DESCRIPTION: Fixes a crash in columnar custom scan that happens when a columnar table is used in a join. Fixes issue #7647. Co-authored-by: Ольга Сергеева <ob-sergeeva@it-serv.ru>	2025-03-12 12:43:01 +03:00
Colm	89674d9630	[Bug Fix] SEGV on query with Left Outer Join (#7787 ) (#7901 ) DESCRIPTION: Fixes a crash in left outer joins that can happen when there is an aggregate on a column from the inner side of the join. Fix the SEGV seen in #7787 and #7899; it occurs because a column in the targetlist of a worker subquery can contain a non-empty varnullingrels field if the column is from the inner side of a left outer join. The issue can also occur with the columns in the HAVING clause, and this is also tested in the fix. The issue was triggered by the introduction of the varnullingrels to Vars in Postgres 16 (2489d76c). There is a related issue, #7705, where a non-empty varnullingrels was incorrectly copied into the query tree for the combine query. Here, a non-empty varnullingrels field of a var is incorrectly copied into the query tree for a worker subquery. The regress file from #7705 is used (and renamed) to also test this (#7787). An alternative test output file is required for Postgres 15 because of an optimization to DISTINCT in Postgres 16 (1349d2790bf).	2025-03-12 12:43:01 +03:00
Naisila Puka	2b5dfbbd08	Bump Citus version to 13.0.1 (#7872 )	2025-03-12 12:43:01 +03:00
Onur Tirtir	7004295065	Revert "Release RowExclusiveLock on pg_dist_transaction as soon as remote xacts are recovered" This reverts commit `684b4c6b96`.	2025-03-12 12:43:01 +03:00
Naisila Puka	3b1c082791	Drops PG14 support (#7753 ) DESCRIPTION: Drops PG14 support 1. Remove "$version_num" != 'xx' from configure file 2. delete all PG_VERSION_NUM = PG_VERSION_XX references in the code 3. Look at pg_version_compat.h file, remove all _compat functions etc defined specifically for PGXX differences 4. delete all PG_VERSION_NUM >= PG_VERSION_(XX+1), PG_VERSION_NUM < PG_VERSION_(XX+1) ifs in the codebase 5. delete ruleutils_xx.c file 6. cleanup normalize.sed file from pg14 specific lines 7. delete all alternative output files for that particular PG version, server_version_ge variable helps here	2025-03-12 12:43:01 +03:00
Onur Tirtir	d5618b6b4c	Release RowExclusiveLock on pg_dist_transaction as soon as remote xacts are recovered As of this commit, after recovering the remote transactions, we now release the lock on pg_dist_transaction while closing it to avoid deadlocks that might occur because of trying to acquire a lock on pg_dist_authinfo while holding a lock on pg_dist_transaction. Such a scenario can only cause a deadlock if another transaction is trying to acquire a strong lock on pg_dist_transaction while holding a lock on pg_dist_authinfo. As of today, we (implicitly) acquire a strong lock on pg_dist_transaction only when upgrading Citus to 11.3-1 and this happens when creating a REPLICA IDENTITY on pg_dist_transaction. And regardless of the code-path we are in, it should be okay to release the lock there because all we do after that point is to abort the prepared transactions that are not part of an in-progress distributed transaction and releasing the lock before doing so should be just fine. This also changes the blocking behavior between citus_create_restore_point and the transaction recovery code-path in the sense that now citus_create_restore_point doesn't wait until transaction recovery completes aborting the prepared transactions that are not part of an in-progress distributed transaction. However, this should be fine because this was possible even before, e.g., if transaction recovery fails to open a remote connection to a node.	2025-03-12 12:43:01 +03:00
Naisila Puka	ef59b659c5	fix changelog date (#7859 )	2025-03-12 12:43:01 +03:00
Naisila Puka	85739b34bf	Fix pg17 test (#7857 ) error merged in `ab7c3b7804`	2025-03-12 12:43:01 +03:00
Mehmet YILMAZ	1bb6c7e95f	PG17 Compatibility - Fix crash when pg_class is used in MERGE (#7853 ) This pull request addresses Issue #7846, where specific MERGE queries on non-distributed and distributed tables can result in crashes in certain scenarios. The issue stems from the usage of `pg_class` catalog table, and the `FilterShardsFromPgclass` function in Citus. This function goes through the query's jointree to hide the shards. However, in PG17, MERGE's join quals are in a separate structure called `mergeJoinCondition`. Therefore FilterShardsFromPgclass was not filtering correctly in a `MERGE` command that involves `pg_class`. To fix the issue, we handle `mergeJoinCondition` separately in PG17. Relevant PG commit: `0294df2f1f` Non-Distributed Tables: A MERGE query involving a non-distributed table using `pg_catalog.pg_class` as the source may execute successfully but needs testing to ensure stability. Distributed Tables: Performing a MERGE on a distributed table using `pg_catalog.pg_class` as the source raises an error: `ERROR: MERGE INTO a distributed table from Postgres table is not yet supported` However, in some cases, this can lead to a server crash if the unsupported operation is not properly handled. This is the test output from the same test conducted prior to the code changes being implemented. ``` -- Issue #7846: Test crash scenarios with MERGE on non-distributed and distributed tables -- Step 1: Connect to a worker node to verify shard visibility \c postgresql://postgres@localhost::worker_1_port/regression?application_name=psql SET search_path TO pg17; -- Step 2: Create and test a non-distributed table CREATE TABLE non_dist_table_12345 (id INTEGER); -- Test MERGE on the non-distributed table MERGE INTO non_dist_table_12345 AS target_0 USING pg_catalog.pg_class AS ref_0 ON target_0.id = ref_0.relpages WHEN NOT MATCHED THEN DO NOTHING; SSL SYSCALL error: EOF detected connection to server was lost ```	2025-03-12 12:43:01 +03:00
Colm	a18f8990be	Update tdigest_aggregate_support output for PG15+ (#7849 ) Regress test tdigest_aggregate_support has been failing since at least Citus 12.0, when tdigest extension is installed in Postgres. This appears to be because of an omission by commit `03832f3` and a change in the implementation of Postgres random() function (pg commit [d4f109e4a](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=d4f109e4a)). To reproduce the test diff: - Checkout [tdigest ](https://github.com/tvondra/tdigest)and run `make; make install` - In citus regress directory run `make check-multi` or `./citus_tests/run_test.py tdigest_aggregate_support` There are two parts to this commit: 1. Revert `Output: xxxxx` in EXPLAIN VERBOSE. Citus commit `fe4ac51` normalized EXPLAIN VERBOSE output because of a change between pg12 and pg13. When pg12 support was no longer required, the rule was removed from normalize.sed and `Output: xxxx` was reverted in the impacted regress output files (`03832f3`), but `tdigest_aggregate_support` was omitted. 2. Adjust the query results; the tdigest_aggregate_support test file has a comment _verifying results - should be stable due to seed while inserting the data, if failure due to data these queries could be removed or check for certain ranges_ but the result values in this commit are consistent across citus 12.0 (pg 15), citus 12.1 (pg 16) and citus 13.0 (pg 17), or since the Postgres changed their [implementation of random](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=d4f109e4a), so proposing to go with these results.	2025-03-12 12:43:01 +03:00
Naisila Puka	7e1f22999b	Bump to latest PG minors 17.2, 16.6, 15.10, 14.15 (#7843 ) Similar to `5ef2cd67ed`, we use the commit sha of a local build of the images, pushed.	2025-03-12 12:43:00 +03:00
Naisila Puka	0642a4dc08	Propagate MERGE ... WHEN NOT MATCHED BY SOURCE (#7807 ) DESCRIPTION: Propagates MERGE ... WHEN NOT MATCHED BY SOURCE It seems like there is not much needed to be done here. `get_merge_query_def` from `ruleutils_17` is updated with "WHEN NOT MATCHED BY SOURCE" therefore `deparse_shard_query` parses the merge query for execution on the shard correctly. Relevant PG commit: https://github.com/postgres/postgres/commit/0294df2f1	2025-03-12 12:43:00 +03:00
Naisila Puka	74d945f5ae	PG17 - Propagate EXPLAIN options: MEMORY and SERIALIZE (#7802 ) DESCRIPTION: Propagates MEMORY and SERIALIZE options of EXPLAIN The options for `MEMORY` can be true or false. Default is false. The options for `SERIALIZE` can be none, text or binary. Default is none. I referred to how we added support for WAL option in this PR [Support EXPLAIN(ANALYZE, WAL)](https://github.com/citusdata/citus/pull/4196). For the tests however, I used the same tests as Postgres, not like the tests in the WAL PR. I used exactly the same tests as Postgres does, I simply distributed the table beforehand. See below the relevant Postgres commits from where you can see the tests added as well: - [Add EXPLAIN (MEMORY)](https://github.com/postgres/postgres/commit/5de890e36) - [Invent SERIALIZE option for EXPLAIN.](https://github.com/postgres/postgres/commit/06286709e) This PR required a lot of copying of Postgres static functions regarding how `EXPLAIN` works for `MEMORY` and `SERIALIZE` options. Specifically, these copy-pastes were required for updating `ExplainWorkerPlan()` function, which is in fact based on postgres' `ExplainOnePlan()`: ```C /* copied from explain.c to update ExplainWorkerPlan() in citus according to ExplainOnePlan() in postgres */ #define BYTES_TO_KILOBYTES(b) typedef struct SerializeMetrics static bool peek_buffer_usage(ExplainState *es, const BufferUsage *usage); static void show_buffer_usage(ExplainState *es, const BufferUsage *usage); static void show_memory_counters(ExplainState *es, const MemoryContextCounters *mem_counters); static void ExplainIndentText(ExplainState *es); static void ExplainPrintSerialize(ExplainState *es, SerializeMetrics *metrics); static SerializeMetrics GetSerializationMetrics(DestReceiver *dest); ``` _Note_: it looks like we were missing some `buffers` option details as well. I put them together with the memory option, like the code in Postgres explain.c, as I didn't want to change the copied code. 
However, I tested locally and there is no big deal in previous Citus versions, and you can also see that existing Citus tests with `buffers true` didn't change. Therefore, I prefer not to backport "buffers" changes to previous versions.	2025-03-12 12:43:00 +03:00
Mehmet YILMAZ	7682d135a4	PG17 - Add Regression Test for REINDEX support in event triggers (#7819 ) This PR adds regression tests to verify REINDEX support with event triggers. The tests validate trigger execution, shard placement consistency, and distributed index rebuilding without disruption.	2025-03-12 12:43:00 +03:00
Mehmet YILMAZ	08d94f9eb6	PG17 - Add Regression Test for Access Method Behavior on Partitioned Tables (#7818 ) This PR adds a regression test to verify the behavior of access methods for partitioned and distributed tables, including: - Creating partitioned tables with heap. - Distributing tables using create_distributed_table. - Switching access methods to columnar with ALTER TABLE. - Validating access method inheritance for new partitions. Relevant PG17 commit: https://github.com/postgres/postgres/commit/374c7a229	2025-03-12 12:43:00 +03:00
Naisila Puka	8f436e4a48	Add tests with xmltext() and random(min, max) (#7824 ) xmltext() converts text into xml text nodes. Test with columnar and citus tables. Relevant PG17 commit: https://github.com/postgres/postgres/commit/526fe0d79 random(min, max) generates random numbers in a specified range Add tests like the ones for random() in aggregate_support.sql References: https://github.com/citusdata/citus/blob/main/src/test/regress/sql/aggregate_support.sql#L493-L532 https://github.com/citusdata/citus/pull/7183 Relevant PG17 commit: https://github.com/postgres/postgres/commit/e6341323a	2025-03-12 12:43:00 +03:00
Naisila Puka	8940665d17	Allow configuring sslnegotiation using citus.node_conn_info (#7821 ) Relevant PG commit: https://github.com/postgres/postgres/commit/d39a49c1e PR similar to https://github.com/citusdata/citus/pull/5203	2025-03-12 12:26:06 +03:00
Naisila Puka	1d57a36ecc	Add pg17 jsonpath methods tests (#7820 ) various jsonpath methods were added in PG17 Relevant PG commit: https://github.com/postgres/postgres/commit/66ea94e8e Here we add the same test as in pg15_jsonpath.sql for the new additions	2025-03-12 12:26:06 +03:00
Naisila Puka	658632642a	Disallow infinite values for partition interval in create_time_partitions udf (#7822 ) PG17 added +/- infinity values for the interval data type Relevant PG commit: https://github.com/postgres/postgres/commit/519fc1bd9	2025-03-12 12:26:06 +03:00
Naisila Puka	3e96a19606	Adds JSON_TABLE() support, and SQL/JSON constructor/query functions tests (#7816 ) DESCRIPTION: Adds JSON_TABLE() support PG17 has added basic `JSON_TABLE()` functionality `JSON_TABLE()` allows `JSON` data to be converted into a relational view and thus used, for example, in a `FROM` clause, like other tabular data. We treat `JSON_TABLE` the same as correlated functions (e.g., recurring tuples). In the end, for multi-shard `JSON_TABLE` commands, we apply the same restrictions as reference tables (e.g., cannot perform a lateral outer join when a distributed subquery references a (reference table)/(json table) etc.) Relevant PG17 commits: [basic JSON table](https://github.com/postgres/postgres/commit/de3600452), [nested paths in json table](https://github.com/postgres/postgres/commit/bb766cde6) Onder had previously added json table support for PG15BETA1, but we reverted that commit because json table was reverted in PG15. `ce7f1a530f` Previous relevant PG15Beta1 commit: https://github.com/postgres/postgres/commit/4e34747c8 Therefore, I referred to Onder's commit for this commit as well, with a few changes due to some differences between PG15/PG17: 1) In PG15Beta1, we had also `PLAN` clauses for `JSON_TABLE` https://github.com/postgres/postgres/commit/fadb48b00, and Onder's commit includes tests for those as well. However, `PLAN` nodes are _not_ added in PG17. Therefore, I didn't include the `json_table_select_only` test, which had mostly queries involving `PLAN`. I only included the last query from that test. 2) In PG15 timeline (Citus 11.1), we didn't support outer joins where the outer rel is a recurring one and the inner one is a non-recurring one. However, [Onur added support for that one in Citus 11.2](https://github.com/citusdata/citus/pull/6512), therefore I updated the tests from Onder's commit accordingly. 
3) PG17 json table has nested paths and columns, therefore I added a test with a distributed table, which is exactly the same as the one in sqljson_jsontable in PG17. https://github.com/postgres/postgres/commit/bb766cde6 This pull request also adds some basic tests on validation of SQL/JSON constructor functions JSON(), JSON_SCALAR(), and JSON_SERIALIZE(), and also SQL/JSON query functions JSON_EXISTS(), JSON_QUERY(), and JSON_VALUE(). The relevant PG commits are the following: [JSON(), JSON_SCALAR(), JSON_SERIALIZE()](https://github.com/postgres/postgres/commit/03734a7fe) [JSON_EXISTS(), JSON_VALUE(), JSON_QUERY()](https://github.com/postgres/postgres/commit/6185c9737)	2025-03-12 12:26:05 +03:00
Naisila Puka	2112aa1860	Add tests for inserting with AT LOCAL operator (#7815 ) PG17 has added support for the AT LOCAL operator, which converts the given time type to a timestamp using the session's TimeZone value as the time zone. Here we add tests that validate that we can use AT LOCAL in INSERT commands. Relevant PG commit: https://github.com/postgres/postgres/commit/97957fdba With the tests, we verify that we evaluate AT LOCAL at the coordinator and then perform the insert remotely.	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	1cf5c190aa	Error out for ALTER TABLE ... ALTER COLUMN ... SET EXPRESSION (#7814 ) PG17 added support for ALTER TABLE ... ALTER COLUMN ... SET EXPRESSION. Relevant PG commit: https://github.com/postgres/postgres/commit/5d06e99a3 We currently don't support propagating this command for Citus tables. It is added to future work. This PR disallows `ALTER TABLE ... ALTER COLUMN ... SET EXPRESSION` on all Citus table types (local, distributed, and partitioned distributed) by adding an error check in `ErrorIfUnsupportedAlterTableStmt`. A new regression test verifies that each table type fails with a consistent error message when attempting to set an expression.	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	24585a8c04	Error out for ALTER TABLE ... SET ACCESS METHOD DEFAULT (#7803 ) PG17 introduced ALTER TABLE ... SET ACCESS METHOD DEFAULT This PR introduces and enforces an error check preventing ALTER TABLE ... SET ACCESS METHOD DEFAULT on both Citus local tables (added via citus_add_local_table_to_metadata) and distributed/partitioned distributed tables. The regression tests now demonstrate that each table type raises an error advising users to explicitly specify an access method, rather than relying on DEFAULT. This ensures consistent behavior across local and distributed environments in Citus. The reason why we currently don't support this is that we can't simply propagate the command as it is, because the default table access method may be different across Citus cluster nodes. Relevant PG commit: https://github.com/postgres/postgres/commit/d61a6cad6	2025-03-12 12:25:49 +03:00
Naisila Puka	b7d04038cb	Add tests for FORCE_NULL * and FORCE_NOT_NULL * options for COPY FROM (#7812 ) These options already existed in PG17, and we support them and have tests for them in `multi_copy.sql`. In PG17, their capability was extended to specify ALL columns at once using `*`. Citus performs the COPY correctly, as is validated by the added tests in this PR. Relevant PG commit: https://github.com/postgres/postgres/commit/f6d4c9cf1 Copy-pasting from Postgres documentation what these options do, such that the reviewer may better understand the tests added: `FORCE_NOT_NULL`: Do not match the specified columns' values against the null string. In the default case where the null string is empty, this means that empty values will be read as zero-length strings rather than nulls, even when they are not quoted. If `*` is specified, the option will be applied to all columns. This option is allowed only in `COPY FROM`, and only when using `CSV` format. `FORCE_NULL`: Match the specified columns' values against the null string, even if it has been quoted, and if a match is found set the value to `NULL`. In the default case where the null string is empty, this converts a quoted empty string into `NULL`. If `*` is specified, the option will be applied to all columns. This option is allowed only in `COPY FROM`, and only when using `CSV` format. `FORCE_NULL` and `FORCE_NOT_NULL` can be used simultaneously on the same column. This results in converting quoted null strings to null values and unquoted null strings to empty strings. Explain it to me like I'm a 5-year-old, for a text column: `FORCE_NULL` looks for empty strings and registers them as `NULL` `FORCE_NOT_NULL` looks for null values and registers them as empty strings.	2025-03-12 12:25:49 +03:00
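The interaction between the two options quoted from the Postgres documentation above can be summarized as a small decision function. This is a sketch of the documented CSV semantics only; `CsvFieldIsNull` and its parameters are hypothetical names, not part of the Postgres or Citus COPY code.

```c
#include <stdbool.h>
#include <string.h>

/*
 * Hypothetical model of how a CSV field is classified in COPY FROM.
 * Returns true when the field should be stored as SQL NULL.
 *
 *  - A field that does not equal the null string is never NULL.
 *  - An unquoted match is NULL unless FORCE_NOT_NULL applies to the column.
 *  - A quoted match is NULL only when FORCE_NULL applies to the column.
 */
bool
CsvFieldIsNull(const char *field, bool quoted, const char *nullString,
			   bool forceNull, bool forceNotNull)
{
	if (strcmp(field, nullString) != 0)
	{
		return false;
	}
	if (quoted)
	{
		return forceNull;
	}
	return !forceNotNull;
}
```

With both options set on the same column and the default empty null string, a quoted empty string becomes `NULL` while an unquoted empty value stays a zero-length string, which matches the combined behavior described above.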
Naisila Puka	5e9f8d838c	Error for COPY FROM ... on_error, log_verbosity with Citus tables (#7811 ) PG17 added the new ON_ERROR option for COPY FROM. When this option is specified, COPY skips soft errors and continues copying. Relevant PG commits: -- https://github.com/postgres/postgres/commit/9e2d87011 -- https://github.com/postgres/postgres/commit/b725b7eec I tried it locally with Citus tables. Without further implementation, it doesn't work correctly. Therefore, we error out for now, and add it to future work. PG17 also added log_verbosity option, which controls the amount of messages emitted during processing. This is currently used in COPY FROM when ON_ERROR option is set to ignore. Therefore, we error out for this option as well. Relevant PG17 commit: https://github.com/postgres/postgres/commit/f5a227895	2025-03-12 12:25:49 +03:00
Naisila Puka	202ad077bd	PG17: ALTER INDEX ALTER COLUMN SET STATISTICS DEFAULT (#7808 ) DESCRIPTION: Propagates ALTER INDEX ALTER COLUMN SET STATISTICS DEFAULT We automatically support this. Adding tests only. We currently don't support ALTER TABLE ALTER COLUMN SET STATISTICS Relevant PG commit: https://github.com/postgres/postgres/commit/4f622503d	2025-03-12 12:25:49 +03:00
Naisila Puka	a383ef6831	Adds PG17.1 support - Regression tests sanity (#7661 ) This is the final commit that adds PG17 compatibility with Citus's current capabilities. You can use Citus community, release-13.0 branch, with PG17.1. --------- Specifically, this commit: - Enables PG17 in the configure script. - Adds PG17 tests to CI using test images that have 17.1 - Fixes an upgrade test: see below for details In `citus_prepare_upgrade()`, don't drop any_value when upgrading from PG16+, because PG16+ has its own any_value function. Attempting to do so results in the error seen in [pg16-pg17 upgrade](https://github.com/citusdata/citus/actions/runs/11768444117/job/32778340003?pr=7661): ``` ERROR: cannot drop function any_value(anyelement) because it is required by the database system CONTEXT: SQL statement "DROP AGGREGATE IF EXISTS pg_catalog.any_value(anyelement)" ``` When 16 becomes the minimum supported Postgres version, the drop statements can be removed. --------- Several PG17 Compatibility commits have been merged before this final one. 
All these subtasks are done https://github.com/citusdata/citus/issues/7653 See the list below: Compilation PR: https://github.com/citusdata/citus/pull/7699 Ruleutils PR: https://github.com/citusdata/citus/pull/7725 Sister PR for tests: https://github.com/citusdata/the-process/pull/159 Helpful smaller PRs: - https://github.com/citusdata/citus/pull/7714 - https://github.com/citusdata/citus/pull/7726 - https://github.com/citusdata/citus/pull/7731 - https://github.com/citusdata/citus/pull/7732 - https://github.com/citusdata/citus/pull/7733 - https://github.com/citusdata/citus/pull/7738 - https://github.com/citusdata/citus/pull/7745 - https://github.com/citusdata/citus/pull/7747 - https://github.com/citusdata/citus/pull/7748 - https://github.com/citusdata/citus/pull/7749 - https://github.com/citusdata/citus/pull/7752 - https://github.com/citusdata/citus/pull/7755 - https://github.com/citusdata/citus/pull/7757 - https://github.com/citusdata/citus/pull/7759 - https://github.com/citusdata/citus/pull/7760 - https://github.com/citusdata/citus/pull/7761 - https://github.com/citusdata/citus/pull/7762 - https://github.com/citusdata/citus/pull/7765 - https://github.com/citusdata/citus/pull/7766 - https://github.com/citusdata/citus/pull/7768 - https://github.com/citusdata/citus/pull/7769 - https://github.com/citusdata/citus/pull/7771 - https://github.com/citusdata/citus/pull/7774 - https://github.com/citusdata/citus/pull/7776 - https://github.com/citusdata/citus/pull/7780 - https://github.com/citusdata/citus/pull/7781 - https://github.com/citusdata/citus/pull/7785 - https://github.com/citusdata/citus/pull/7788 - https://github.com/citusdata/citus/pull/7793 - https://github.com/citusdata/citus/pull/7796 --------- Co-authored-by: Colm <colmmchugh@microsoft.com>	2025-03-12 12:25:49 +03:00
Naisila Puka	28b0b0e7a8	Bump Citus version into 13.0.0 (#7792 ) We are using `release-13.0` branch for both development and release, to deliver PG17 support in Citus. Afterwards, we will (probably) merge this branch into main. Some potential changes for main branch, after we are done working on release-13.0: - Merge changes from `release-13.0` to `main` - Figure out what changes were there on 12.2, move them to 13.1 version. In a nutshell: rename `12.1--12.2` to `13.0--13.1` and fix issues. - Set version to 13.1devel	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	80c6479408	PG17 compatibility: Fix Test Failure in multi_alter_table_add_const (#7733 ) In earlier versions of PostgreSQL, exclusion constraints were not allowed on partitioned tables. This is why the error in your regression test (ERROR: exclusion constraints are not supported on partitioned tables) was raised in PostgreSQL 16. In PostgreSQL 17, exclusion constraints are now allowed on partitioned tables, which is why the error no longer appears when you attempt to add an exclusion constraint. The constraint exclusion mechanism, described in the documentation, relies on CHECK constraints to decide which partitions or child tables need to be queried. [CHECK constraints](https://www.postgresql.org/docs/current/ddl-partitioning.html#DDL-PARTITIONING-CONSTRAINT-EXCLUSION) ```diff -- Check "ADD EXCLUDE" errors out for partitioned table since the postgres does not allow it ALTER TABLE AT_AddConstNoName.citus_local_partitioned_table ADD EXCLUDE(partition_col WITH =); -ERROR: exclusion constraints are not supported on partitioned tables -- Check "ADD CHECK" SET client_min_messages TO DEBUG1; ALTER TABLE AT_AddConstNoName.citus_local_partitioned_table ADD CHECK (dist_col > 0); DEBUG: the constraint name on the shards of the partition is too long, switching to sequential and local execution mode to prevent self deadlocks: longlonglonglonglonglonglonglonglonglonglonglo_537570f5_5_check DEBUG: verifying table "longlonglonglonglonglonglonglonglonglonglonglonglonglonglongabc" DEBUG: verifying table "p1" RESET client_min_messages; SELECT con.conname FROM pg_catalog.pg_constraint con INNER JOIN pg_catalog.pg_class rel ON rel.oid = con.conrelid INNER JOIN pg_catalog.pg_namespace nsp ON nsp.oid = connamespace WHERE rel.relname = 'citus_local_partitioned_table'; conname -------------------------------------------------- + citus_local_partitioned_table_partition_col_excl citus_local_partitioned_table_check -(1 row) +(2 rows) ```	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	29bd3dc41c	PG17 compatibility: Fix Isolation Test Failure in isolation_multiuser_locking (#7714 ) This PR enhances `isolation_multiuser_locking.spec` test compatibility across multiple PostgreSQL versions by handling differences in error messages and behavior. Key updates include: - Error Message Handling: Adjustments to manage version-specific error messages, ensuring consistent test results. - Modified to address variations in locking behavior across PostgreSQL versions, ensuring test stability in multiuser scenarios. - REINDEX Behavior Adjustment: This PR accounts for a behavioral change introduced in PostgreSQL by commit ecb0fd337, which alters how REINDEX interacts with system catalogs. https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=ecb0fd337 --------- Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2025-03-12 12:25:49 +03:00
Naisila Puka	09e96831b3	Fix pg17 test (#7797 ) Broken from this commit `e3db375149` https://github.com/citusdata/citus/actions/runs/12429202397/attempts/1#summary-34702334056	2025-03-12 12:25:49 +03:00
Naisila Puka	b22c95933c	PG17 Compatibility - Fix HideCitusDependentObjects function (#7796 ) There is a crash when running vanilla tests because of the `citus.hide_citus_dependent_objects` GUC. We turn on this GUC only for the pg vanilla tests. This GUC runs the following function `HideCitusDependentObjectsOnQueriesOfPgMetaTables`. This function doesn't take into account the new `mergeJoinCondition`. I rewrote the function such that it checks for merge join conditions as well. Relevant PG commit: https://github.com/postgres/postgres/commit/0294df2f1 The crash could be reproduced locally like the following: ```SQL SET citus.hide_citus_dependent_objects TO on; CREATE OR REPLACE FUNCTION pg_catalog.is_citus_depended_object(oid,oid) RETURNS bool LANGUAGE C AS 'citus', $$is_citus_depended_object$$; -- try a system catalog MERGE INTO pg_class c USING (SELECT 'pg_depend'::regclass AS oid) AS j ON j.oid = c.oid WHEN MATCHED THEN UPDATE SET reltuples = reltuples + 1 RETURNING j.oid; CREATE VIEW classv AS SELECT * FROM pg_class; MERGE INTO classv c USING pg_namespace n ON n.oid = c.relnamespace WHEN MATCHED AND c.oid = 'pg_depend'::regclass THEN UPDATE SET reltuples = reltuples - 1 RETURNING c.oid; -- crash happens here ```	2025-03-12 12:25:49 +03:00
Naisila Puka	c662e68e44	Remove redundant normalize (#7794 ) Redundant from this commit `acd7b1e690`	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	915276ee7f	PG17 compatibility: Fix Test Failure in local_table_join (#7732 ) PostgreSQL 17 seems to have introduced improvements in how correlated subqueries are handled during plan generation. Instead of generating a trivial subplan with WHERE true, it now applies more specific filtering (WHERE (key = 5)), which makes the execution plan more efficient. https://github.com/postgres/postgres/commit/b262ad44 ``` diff -dU10 -w /__w/citus/citus/src/test/regress/expected/local_table_join.out /__w/citus/citus/src/test/regress/results/local_table_join.out --- /__w/citus/citus/src/test/regress/expected/local_table_join.out.modified 2024-11-05 09:53:50.423970699 +0000 +++ /__w/citus/citus/src/test/regress/results/local_table_join.out.modified 2024-11-05 09:53:50.463971296 +0000 @@ -1420,32 +1420,32 @@ ) as subq_1 ) as subq_2; DEBUG: Wrapping relation "custom_pg_type" to a subquery DEBUG: generating subplan 204_1 for subquery SELECT typdefault FROM local_table_join.custom_pg_type WHERE true ERROR: direct joins between distributed and local tables are not supported HINT: Use CTE's or subqueries to select from local tables and use them in joins -- correlated sublinks are not yet supported because of #4470, unless we convert not-correlated table SELECT COUNT(*) FROM distributed_table d1 JOIN postgres_table using(key) WHERE d1.key IN (SELECT key FROM distributed_table WHERE d1.key = key and key = 5); DEBUG: Wrapping relation "postgres_table" to a subquery -DEBUG: generating subplan XXX_1 for subquery SELECT key FROM local_table_join.postgres_table WHERE true +DEBUG: generating subplan 206_1 for subquery SELECT key FROM local_table_join.postgres_table WHERE (key OPERATOR(pg_catalog.=) 5) ``` Co-authored-by: Naisila Puka <37271756+naisila@users.noreply.github.com>	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	3935710c17	PG17 compatibility: Fix Test Failure in local_dist_join_mixed (#7731 ) PostgreSQL 16 adds an extra condition (id IS NOT NULL) to the subquery. This condition is likely used to ensure that no null values are processed in the subquery. Instead of using the condition id IS NOT NULL, PostgreSQL 17 generates the subplan with a trivial condition (WHERE true), indicating that it does not need to explicitly check for non-null values. PostgreSQL 17 likely includes optimizations to handle null checks more efficiently. The WHERE (id IS NOT NULL) condition that was present in PostgreSQL 16 may now be considered redundant by the planner, as it is implicitly handled by the query execution engine. https://github.com/postgres/postgres/commit/b262ad44 ```diff SELECT foo1.id FROM (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo9, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo8, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo7, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo6, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo5, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo4, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo3, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo2, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo10, (SELECT local.id, local.title FROM local, distributed WHERE local.id = distributed.id ) as foo1 WHERE foo1.id = foo9.id AND foo1.id = foo8.id AND foo1.id = foo7.id AND foo1.id = foo6.id AND foo1.id = foo5.id AND foo1.id = foo4.id AND foo1.id = foo3.id AND foo1.id = foo2.id AND foo1.id = foo10.id AND foo1.id = foo1.id ORDER BY 
1; ... -DEBUG: generating subplan XXX_10 for subquery SELECT id FROM local_dist_join_mixed.local WHERE (id IS NOT NULL) +DEBUG: generating subplan XXX_10 for subquery SELECT id FROM local_dist_join_mixed.local WHERE true ... ```	2025-03-12 12:25:49 +03:00
Colm	11f76cb4bb	PG17 compatibility: ensure get_progress() output is consistent (#7793 ) in regress test isolation_progress_monitoring, with an ORDER BY. The implementation of get_progress() uses a tuplestore to hold the step and progress values, and tuplestore does not provide any guarantee on the ordering of the tuples so ORDER BY ensures stable test output. Also make the output more user friendly by including the column names. Fixing occasional failures seen in isolation_progress_monitoring. ![Screenshot (86)](https://github.com/user-attachments/assets/a019639f-559f-408d-b8a8-8b7a44d8095d)	2025-03-12 12:25:49 +03:00
Teja Mupparti	35d1160ace	PG17 Compatibility: Support MERGE features in Citus with clean exceptions (#7781 ) - Adapted `pgmerge.sql` tests from PostgreSQL community's `merge.sql` to Citus by converting tables into Citus local tables. - Identified two new PostgreSQL 17 MERGE features (`RETURNING` support and MERGE on updatable views) not yet supported by Citus. - Implemented changes to detect unsupported features and raise clean exceptions, ensuring pgmerge tests pass without diffs. - Addressed breaking changes caused by `MERGE ... WHEN NOT MATCHED BY SOURCE` restructuring, reducing diffs in pgmerge tests. - Segregated unsupported test cases into `merge_unsupported.sql` to maintain clarity and avoid large diffs in test files. - Prepared the Citus MERGE planner to handle new PostgreSQL changes, reducing remaining test discrepancies. All merge tests now pass cleanly, with unsupported cases clearly isolated. Relevant PG commits: c649fa24a https://github.com/postgres/postgres/commit/c649fa24a 0294df2f1 https://github.com/postgres/postgres/commit/0294df2f1 --------- Co-authored-by: naisila <nicypp@gmail.com>	2025-03-12 12:25:49 +03:00
Colm	088731e9db	PG17 compatibility: account for identity columns in partitioned tables. (#7785 ) PG17 added support for identity columns in partitioned tables: https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=699586315 A consequence is that a table with an identity column cannot be attached as a partition. But Citus on Postgres 17 will generate identity column for the partitions if the parent table has one (or more) identity columns when propagating distributed table DDL to worker nodes, as happens in the `generated_identity` regress test in #7768: ``` CREATE TABLE partitioned_table ( a bigint CONSTRAINT myconname GENERATED BY DEFAULT AS IDENTITY (START WITH 10 INCREMENT BY 10), b bigint GENERATED ALWAYS AS IDENTITY (START WITH 10 INCREMENT BY 10), c int ) PARTITION BY RANGE (c); CREATE TABLE partitioned_table_1_50 PARTITION OF partitioned_table FOR VALUES FROM (1) TO (50); CREATE TABLE partitioned_table_50_500 PARTITION OF partitioned_table FOR VALUES FROM (50) TO (1000); SELECT create_distributed_table('partitioned_table', 'a'); - create_distributed_table ---------------------------------------------------------------------- - -(1 row) - +ERROR: table "partitioned_table_1_50" being attached contains an identity column "a" +DETAIL: The new partition may not contain an identity column. ``` It is the Citus-generated ATTACH PARTITION statement that errors out, because the Citus-generated CREATE TABLE for the partitions included identity column definitions. The fix is straightforward - when propagating the CREATE TABLE ddl for a partition of a table with an identity column, don't include the identity column(s), they will be inherited on attaching the partition. In Citus on Postgres 16 (or less) partitions do not inherit identity; the partitions in the example would not have any identity columns so it was not an issue previously.	2025-03-12 12:25:49 +03:00
Colm	c3d21b807a	PG17 compatibility: fix plan diffs in multi_explain (#7780 ) Regress test `multi_explain` has two queries that have a different query plan with PG17. Here is part of the plan diff for the query labelled _Union and left join subquery pushdown_ in `multi_explain.sql` (for the complete diff, search for `multi_explain` [here](https://github.com/citusdata/citus/actions/runs/12158205599/attempts/1)): ``` -> Sort Sort Key: ((users.composite_id).tenant_id), ((users.composite_id).user_id), subquery_2.hasdone, events.event_time - -> Hash Left Join - Hash Cond: (users.composite_id = subquery_2.composite_id) - -> HashAggregate - Group Key: ((users.composite_id).tenant_id), ((users.composite_id).user_id), users.composite_id, ('action=>1'::text), events.event_time + -> Nested Loop Left Join + Join Filter: (users.composite_id = subquery_2.composite_id) + -> Unique + -> Sort + Sort Key: ((users.composite_id).tenant_id), ((users.composite_id).user_id), users.composite_id, ('action=>1'::text), events.event_time -> Append ``` The change is the same in both queries; a hash left join with subquery_1 on the outer and subquery_2 on the inner side of the join is now a nested loop left join with subquery_1 on the outer and subquery_2 on the inner; additionally, the chosen method of uniquifying the UNION in subquery_1 has changed from hashed grouping to sort followed by unique, as shown in the diff above. The PG17 commit that caused this plan change is likely _[Fix MergeAppend to more accurately compute the number of rows that need to be sorted](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=9d1a5354f)_ because it impacts the estimated rows counts of UNION paths. Comparing a costed plan of the query between PG16 and PG17 I noticed that with PG16 the rows estimate for the UNION in subquery_1 is 4, whereas with PG17 the rows estimate is 2. 
A lower rows estimate in the outer side of the join may result in nested loop looking cheaper than hash join for the left outer join, hence the plan change in the two queries where there is a UNION on the outer side of a left outer join. The proposed fix achieves a consistent plan across all supported postgres versions by temporarily disabling nested loop join and sort for the two impacted queries; the postgres optimizer selects hash join for the outer left join and hashed aggregation for the UNION operation. I investigated tweaking the queries, but was not able to arrive at a consistent plan, and I believe the SQL operator (e.g. join, group by, union) implementations are orthogonal to the intent of the test, so this should be a satisfactory solution, particularly as it avoids introducing a second alternative output file for `multi_explain`.	2025-03-12 12:25:49 +03:00
Colm	592416250c	PG17 compatibility: account for MAINTAIN privilege in regress tests (#7774 ) This PR addresses regress tests impacted by the introduction of [the MAINTAIN privilege in PG17](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=ecb0fd337). The impacted tests include `generated_identity`, `create_single_shard_table`, `grant_on_sequence_propagation`, `grant_on_foreign_server_propagation`, `single_node_enterprise`, `multi_multiuser_master_protocol`, `multi_alter_table_row_level_security`, `shard_move_constraints` which show the following error: ``` SELECT start_metadata_sync_to_node('localhost', :worker_2_port); - start_metadata_sync_to_node ---------------------------------------------------------------------- - -(1 row) - +ERROR: unrecognized aclright: 16384 ``` and `multi_multiuser_master_protocol`, where the `pg_class.relacl` column has 'm' for MAINTAIN if applicable: ``` relname \| rolname \| relacl ---------------------+-------------+------------------------------------------------------------ trivial_full_access \| full_access \| - trivial_postgres \| postgres \| {postgres=arwdDxt/postgres,full_access=arwdDxt/postgres} + trivial_postgres \| postgres \| {postgres=arwdDxtm/postgres,full_access=arwdDxtm/postgres} ``` The PR updates function `convert_aclright_to_string()` in citus_ruleutils.c to include a case for `ACL_MAINTAIN`. Per the comment on `convert_aclright_to_string()` in citus_ruleutils.c, it is a copy of `convert_aclright_to_string()` in Postgres (where it is in `src/backend/utils/adt/acl.c`), so requires updating to be consistent with Postgres. With this change Citus can recognize the MAINTAIN privilege, and will not emit the `unrecognized aclright` error. The PR also adds an alternative goldfile for `multi_multiuser_master_protocol`. 
Note that `convert_aclright_to_string()` in Postgres includes access types SET and ALTER SYSTEM on system parameters (aka GUCs), added by [this PG16 commit](https://github.com/postgres/postgres/commit/a0ffa885e). If Citus were to have a requirement to support granting SET and ALTER SYSTEM we would need to update `convert_aclright_to_string()` in citus_ruleutils.c with SET and ALTER SYSTEM.	2025-03-12 12:25:49 +03:00
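The `unrecognized aclright: 16384` error described above is what a lookup over privilege bits reports when it meets a bit it has no case for. The following is a minimal sketch of that failure and of the fix, written in Python for illustration; it is not the C code in citus_ruleutils.c. The names `ACL_RIGHT_NAMES`, `NAMES_WITH_MAINTAIN` and `aclright_to_string` are hypothetical, the listed bit values are an assumed subset, and only the value 16384 for MAINTAIN is taken from the error message in the commit.

```python
# Hypothetical illustration; the real function is C code in citus_ruleutils.c.
ACL_MAINTAIN = 1 << 14  # 16384, the value reported in the error message above

# Assumed subset of privilege bits, for illustration only.
ACL_RIGHT_NAMES = {
    1 << 0: "INSERT",
    1 << 1: "SELECT",
    1 << 2: "UPDATE",
    1 << 3: "DELETE",
}


def aclright_to_string(aclright, names=ACL_RIGHT_NAMES):
    """Return the privilege name for a single privilege bit."""
    try:
        return names[aclright]
    except KeyError:
        # Mirrors the shape of the error seen in the regress tests.
        raise ValueError(f"unrecognized aclright: {aclright}") from None


# In this sketch the whole fix is adding the missing case.
NAMES_WITH_MAINTAIN = {**ACL_RIGHT_NAMES, ACL_MAINTAIN: "MAINTAIN"}
```

With the extra entry, `aclright_to_string(16384, NAMES_WITH_MAINTAIN)` returns a name instead of raising, which is the behavior change the commit describes for Citus on PG17.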
Colm	beb222ea8d	PG17 compatibility: fix multi-1 diffs caused by PG17 optimizer enhancements (#7769 ) This fix ensures that the expected DEBUG error messages from the router planner in `multi_router_planner`, `multi_router_planner_fast_path` and `query_single_shard_table` are present with PG17. In `query_single_shard_table` the diff: ``` SELECT COUNT(*) FROM citus_local_table t1 WHERE t1.b IN ( SELECT b+1 FROM nullkey_c1_t1 t2 WHERE t2.b = t1.a ); -DEBUG: router planner does not support queries that reference non-colocated distributed tables +DEBUG: Local tables cannot be used in distributed queries. ``` occurred because of [this PG17 commit](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=9f1337639) which enables the optimizer to pull up a correlated ANY subquery to a join. The fix inhibits subquery pull up by including a volatile function in the predicate involving the ANY subquery, preserving the pre-PG17 optimizer treatment of the query. In the case of `multi_router_planner` and `multi_router_planner_fast_path` the diffs: ``` -- partition_column is null clause does not prune out any shards, -- all shards remain after shard pruning, not router plannable SELECT * FROM articles_hash a WHERE a.author_id is null; -DEBUG: Router planner cannot handle multi-shard select queries +DEBUG: Creating router plan ``` are because of [this PG17 commit](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=b262ad440), which enables the optimizer to detect and remove redundant IS (NOT) NULL expressions. The fix is to adjust the table definition so the column used for distribution is not marked NOT NULL, thus preserving the pre-PG17 query planning behavior. 
Finally, a rule is added to `normalize.sed` to ignore DEBUG logging in CREATE MATERIALIZED VIEW AS statements introduced by [this PG17 commit](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=b4da732fd64); _when creating materialized views, use REFRESH logic to load data_, a consequence of which is that with `client_min_messages` at `DEBUG2` Postgres emits extra detail for CREATE MATERIALIZED VIEW AS statements. ``` CREATE MATERIALIZED VIEW mv_articles_hash_empty AS SELECT * FROM articles_hash WHERE author_id = 1; DEBUG: Creating router plan DEBUG: query has a single distribution column value: 1 +DEBUG: drop auto-cascades to type multi_router_planner.pg_temp_61391 +DEBUG: drop auto-cascades to type multi_router_planner.pg_temp_61391[] ``` The rule can be changed to a normalization, or possibly dropped, when 17 becomes the minimum supported version.	2025-03-12 12:25:49 +03:00
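The effect of a rule that ignores the extra DEBUG lines can be sketched as a line filter. This is a Python illustration under stated assumptions, not the actual rule in `normalize.sed`: the pattern and the function name `drop_matview_debug_lines` are hypothetical and are derived only from the two `drop auto-cascades` lines shown in the diff above.

```python
import re

# Hypothetical pattern, modelled on the two extra DEBUG lines in the diff above.
# It is an assumption for illustration, not the rule in normalize.sed.
MATVIEW_DEBUG = re.compile(
    r"^DEBUG:\s+drop auto-cascades to type \S*pg_temp_\d+(\[\])?$"
)


def drop_matview_debug_lines(lines):
    """Return the lines with the PG17-only temp-type DEBUG messages removed."""
    return [line for line in lines if not MATVIEW_DEBUG.match(line)]
```

Filtering the test output this way lets one expected file serve both PG17 and earlier versions, at the cost of hiding those lines everywhere; this is why the commit notes the rule can be revisited once 17 is the minimum supported version.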
Colm	f8335c1484	PG17 compatibility: fix diffs in create_index, privileges vanilla tests (#7766 ) PG17 regress sanity (#7653) fix; address diffs in vanilla tests `create_index` and `privileges`. There is a change from `permission denied` to `must be owner of`, seen in create_index: ``` @@ -2970,21 +2970,21 @@ REINDEX TABLE pg_toast.pg_toast_1260; ERROR: permission denied for table pg_toast_1260 REINDEX INDEX pg_toast.pg_toast_1260_index; -ERROR: permission denied for index pg_toast_1260_index +ERROR: must be owner of index pg_toast_1260_index ``` and privileges: ``` @@ -2945,41 +2945,43 @@ ERROR: permission denied for table maintain_test REINDEX INDEX maintain_test_a_idx; -ERROR: permission denied for index maintain_test_a_idx +ERROR: must be owner of index maintain_test_a_idx REINDEX SCHEMA reindex_test; REINDEX INDEX maintain_test_a_idx; +ERROR: must be owner of index maintain_test_a_idx REINDEX SCHEMA reindex_test; ``` The fix updates function `RangeVarCallbackForReindexIndex()` in `index.c` with changes made by the introduction of the [MAINTAIN privilege in PG17](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=ecb0fd337) to the function `RangeVarCallbackForReindexIndex()` in `indexcmds.c`. The code is under a Postgres 17 version directive, which can be removed when 17 becomes the oldest supported Postgres version.	2025-03-12 12:25:49 +03:00
Colm	1797ab8a4f	PG17 compatibility: Fix check-style, broken by PG17 columnar test fix… (#7776 ) … (`698699d89e`) --------- Co-authored-by: naisila <nicypp@gmail.com>	2025-03-12 12:25:49 +03:00
Colm	808626ea78	PG17 compatibility (#7653 ): Fix test diffs in columnar schedule (#7768 ) This PR fixes diffs in `columnar_chunk_filtering` and `columnar_paths` tests. In `columnar_chunk_filtering` an expression `(NOT (SubPlan 1))` changed to `(NOT (ANY (a = (SubPlan 1).col1)))`. This is due to [a PG17 commit](https://github.com/postgres/postgres/commit/fd0398fc) that improved how scalar subqueries (InitPlans) and ANY subqueries (SubPlans) are EXPLAINed in expressions. The fix uses a helper function which converts the PG17 format to the pre-PG17 format. It is done this way because pre-PG17 EXPLAIN does not provide enough context to convert to the PG17 format. The helper function can (and should) be retired when 17 becomes the minimum supported PG. In `columnar_paths`, a merge join changed to a hash join. This is due to [this PG17 commit](`f7816aec23`), which improved the PG optimizer's ability to estimate the size of a CTE scan. The impacted query involves a CTE scan with a point predicate `(a=123)` and before the change the CTE size was estimated to be 5000, but with the change it is correctly (given the data in the table) estimated to be 1, making hash join a more attractive join method. The fix is to have an alternative goldfile for pre-PG17. I tried to force a specific join method using the GUCs (`enable_nestloop`, `enable_hashjoin`, `enable_mergejoin`), but it was not possible to obtain a consistent plan across all supported PG versions (in some cases the join inputs switched sides).	2025-03-12 12:25:49 +03:00
Colm	6254ad81fc	PG17 compatibility: revert #7764 (#7775 ) Revert PG17 compatibility fix #7764	2025-03-12 12:25:49 +03:00
Naisila Puka	1074035446	PG17 compatibility: fix some tests outputs (#7765 ) There are two commits in this PR: 1) Remove domain_default column since it has been removed from PG17 Relevant PG commit: `78806a9509` 78806a95095c4fb9230a441925244690d9c07d23 2) pg_stat_statements reset output diff fix pg_stat_statements reset output changed in PG17, fix idea from Relevant PG commits: `6ab1dbd26b` 6ab1dbd26bbf307055d805feaaca16dc3e750d36	2025-03-12 12:25:49 +03:00
Colm	0de7b5a240	PG17 compatibility: fix diff in tableam (#7771 ) Test `tableam` expects that this CREATE TABLE statement: `CREATE TABLE test_partitioned(id int, p int, val int) PARTITION BY RANGE (p) USING fake_am;` will produce this error: `specifying a table access method is not supported on a partitioned table` but as of [this PG commit](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=374c7a229) it is possible to specify an access method on a partitioned table. This fix moves the CREATE TABLE statement to pg17, and adds an additional test to show parent access method is inherited.	2025-03-12 12:25:49 +03:00
Mehmet YILMAZ	9615b52863	PG17 compatibility: Fix Test Failure in multi_name_lengths multi_create_table_constraints (#7726 ) PG 17 removes outer parentheses from CHECK constraints; we add them back for pg15, pg16 compatibility, e.g. change CHECK other_col >= 100 to CHECK (other_col >= 100). Relevant PG commit: e59fcbd712c777eb2987d7c9ad542a7e817954ec `e59fcbd712` CI link https://github.com/citusdata/citus/actions/runs/11844794788 ```diff SELECT "Constraint", "Definition" FROM table_checks WHERE relid='public.check_example_365068'::regclass; Constraint \| Definition -------------------------------------+----------------------------------- - check_example_other_col_check \| CHECK (other_col >= 100) - check_example_other_other_col_check \| CHECK (abs(other_other_col) >= 100) + check_example_other_col_check \| CHECK other_col >= 100 + check_example_other_other_col_check \| CHECK abs(other_other_col) >= 100 ``` Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2025-03-12 12:25:49 +03:00
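Adding the outer parentheses back, as the commit above describes, can be sketched as a small string normalization. This is a Python illustration only; the commit does not say where or how Citus does this, and the function names `has_outer_parens` and `normalize_check_definition` are hypothetical. The sketch also ignores parentheses inside string literals, which a real implementation would have to handle.

```python
def has_outer_parens(expr):
    """True if the opening '(' at position 0 is closed only by the final ')'."""
    if not (expr.startswith("(") and expr.endswith(")")):
        return False
    depth = 0
    for i, ch in enumerate(expr):
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            # The first group closed before the end, e.g. "(a) AND (b)".
            if depth == 0 and i != len(expr) - 1:
                return False
    return depth == 0


def normalize_check_definition(definition):
    """Rewrite 'CHECK expr' as 'CHECK (expr)' unless already wrapped."""
    prefix = "CHECK "
    if not definition.startswith(prefix):
        return definition
    expr = definition[len(prefix):].strip()
    if has_outer_parens(expr):
        return prefix + expr
    return f"{prefix}({expr})"
```

The balance check matters because an expression such as `(a > 0) AND (b > 0)` starts and ends with parentheses without being wrapped as a whole.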
Colm	a74bb6280c	PG17 regress sanity: fix error unrecognized alter database option tablespace seen in database vanilla test (#7764 ) Disable DDL propagation for the vanilla test suite. This enables the vanilla `database ` test to pass, where previously it was correctly returning `ERROR: unrecognized ALTER DATABASE option: tablespace` because release-13.0 does not propagate this ALTER DATABASE variant. We (Citus team) discussed cherry picking [#7253](https://github.com/citusdata/citus/pull/7253) from main to release-13.0 because it does propagate ALTER DATABASE tablespace option (as well as a couple of others) but decided fixing the regress test was not the proper context for that. The fix disables `citus.enable_metadata_sync` when running vanilla, we discussed disabling `citus.enable_create_database_propagation` but this is not in release-13.0.	2025-03-12 12:25:49 +03:00
Colm	6043fcb263	PG17 regress test sanity: fix diffs in union_pushdown. (#7762 ) Preserve the test error message by adjusting the query so that PG17 cannot pull it up to a join. Another instance of a subquery that can be pulled up to a join with PG17 (#7745) This should have been fixed in, but slipped by, #7745	2025-03-12 12:25:49 +03:00
Naisila Puka	ed71e65333	PG17 compatibility: Adjust print_extension_changes function for extra type outputs in PG17 (#7761 ) In PG17, Auto-generated array types, multirange types, and relation rowtypes are treated as dependent objects, hence changing the output of the print_extension_changes function. Relevant PG commit: e5bc9454e527b1cba97553531d8d4992892fdeef `e5bc9454e5` Here we create a table with only the basic extension types in order to avoid printing extra ones for now. This can be removed when we drop PG16 support. https://github.com/citusdata/citus/actions/runs/11960253650/attempts/1#summary-33343972656 ```diff \| table pg_dist_rebalance_strategy + \| type citus.distribution_type[] + \| type citus.pg_dist_object + \| type pg_dist_shard + \| type pg_dist_shard[] + \| type pg_dist_shard_placement + \| type pg_dist_shard_placement[] + \| type pg_dist_transaction + \| type pg_dist_transaction[] \| view citus_dist_stat_activity \| view pg_dist_shard_placement ```	2025-03-12 12:25:49 +03:00
Naisila Puka	ae104f06a6	PG17 compatibility: fix backend type orders in test (#7760 ) This work was already done by @m3hm3t and approved as part of https://github.com/citusdata/citus/pull/7722 I separated it in this PR since the previous one contained other changes which we don't currently want to merge. Relevant PG commit: --------- Co-authored-by: Mehmet YILMAZ <mehmety87@gmail.com>	2025-03-12 12:25:49 +03:00
Colm	b46d311e30	PG17 compatibility: Normalize COPY error messages (#7759 ) A recent Postgres commit () that refactored error messages is the cause of the diffs in pg16 regress test when running Citus on Postgres 17. The fix changes the pg16 goldfile and includes a normalization rule for the error messages so pg16 will pass when running with version 16 of Postgres. () https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=498ee9ee2f	2025-03-12 12:25:49 +03:00
Colm	4c080c48cd	PG17 compatibility: add helper function for EXPLAIN diffs in scalar subquery output (#7757 ) PG17 changed how scalar subquery outputs appear in EXPLAIN output (). This commit changes impacted regress goldfiles to the PG17 format, and adds a helper function to convert pre-PG17 plans to the PG17 format. The conversion is required when testing Citus on PG versions prior to 17. The helper function can and should be removed when 17 becomes the minimum supported version. () https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=fd0398fcb	2025-03-12 12:25:49 +03:00
Colm	81bda6fb8e	PG17 compatibility: add/fix tests with correlated subqueries that can be pulled to a join (#7745 ) Fix Test Failure in subquery_in_where, set_operations, dml_recursive in PG17 #7741 The test failures are caused by [this commit in PG17](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=9f1337639), which enables correlated subqueries to be pulled up to a join. Prior to this, the correlated subquery was implemented as a subplan. In Citus, it is not possible to push down a correlated subplan, but with a different plan in PG17 the query can be executed, per the test diff from `subquery_in_where`: ``` 37,39c37,41 < DEBUG: generating subplan XXX_1 for CTE event_id: SELECT user_id AS events_user_id, "time" AS events_time, event_type FROM public.events_table < DEBUG: Plan XXX query after replacing subqueries and CTEs: SELECT count(*) AS count FROM ... < ERROR: correlated subqueries are not supported when the FROM clause contains a CTE or subquery --- > count > --------------------------------------------------------------------- > 0 > (1 row) > ``` This is because with pg17 `= ANY subquery` in the queries can be implemented as a join, instead of as a subplan filter on a table scan. 
For example, `SELECT * FROM test a WHERE x IN (SELECT x FROM test b UNION SELECT y FROM test c WHERE a.x = c.x) ORDER BY 1,2` (from set_operations) has this plan in pg17; note that the subquery is the inner side of a nested loop join: ``` ┌───────────────────────────────────────────────────┐ │ QUERY PLAN │ ├───────────────────────────────────────────────────┤ │ Sort │ │ Sort Key: a.x, a.y │ │ -> Nested Loop │ │ -> Seq Scan on test a │ │ -> Subquery Scan on "ANY_subquery" │ │ Filter: (a.x = "ANY_subquery".x) │ │ -> HashAggregate │ │ Group Key: b.x │ │ -> Append │ │ -> Seq Scan on test b │ │ -> Seq Scan on test c │ │ Filter: (a.x = x) │ └───────────────────────────────────────────────────┘ ``` and this plan in pg16 (and previous pg versions); the subquery is a correlated subplan filter on a table scan: ``` ┌───────────────────────────────────────────────┐ │ QUERY PLAN │ ├───────────────────────────────────────────────┤ │ Sort │ │ Sort Key: a.x, a.y │ │ -> Seq Scan on test a │ │ Filter: (SubPlan 1) │ │ SubPlan 1 │ │ -> HashAggregate │ │ Group Key: b.x │ │ -> Append │ │ -> Seq Scan on test b │ │ -> Seq Scan on test c │ │ Filter: (a.x = x) │ └───────────────────────────────────────────────┘ ``` The fix modifies the queries causing the test failures so that an ANY subquery is not folded to a join, preserving the expected output of the tests. A similar approach was taken for existing regress tests in the [postgres commit](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=9f1337639). See the `join` regress test, for example. We also add pg17 specific tests that leverage this improvement in Postgres with Citus distributed planning as well.	2025-03-12 12:25:47 +03:00
Colm	9dcd812a40	PG17 compatibility: Preserve DEBUG output in cte_inline (#7755 ) Regression test cte_inline has the following diff: ``` DEBUG: CTE cte_1 is going to be inlined via distributed planning DEBUG: CTE cte_1 is going to be inlined via distributed planning DEBUG: Creating router plan -DEBUG: query has a single distribution column value: 1 ``` DEBUG message `query has a single distribution column value` does not appear with PG17. This is because PG17 can recognize when a Result node does not need to have an input node, so the predicate on the distribution column is not present in the query plan. Comparing the query plan obtained before PG17: ``` │ Result │ │ One-Time Filter: false │ │ -> GroupAggregate │ │ -> Seq Scan on public.test_table │ │ Filter: (test_table.key = 1) │ ``` with the PG17 query plan: ``` ┌──────────────────────────────────┐ │ QUERY PLAN │ ├──────────────────────────────────┤ │ Result │ │ One-Time Filter: false │ └──────────────────────────────────┘ ``` we see that the Result node in the PG16 plan has an Aggregate node, but the Result node in the PG17 plan does not have any input node; PG17 recognizes it is not needed given a Filter that evaluates to False at compile-time. The Result node is present in both plans because PG in both versions can recognize when a combination of predicates equates to false at compile time; this is because the successive predicates in the test query (key=6, key=5, key=4, etc) become contradictory when the CTEs are inlined. Here is an example query showing the effect of the CTE inlining: ``` select count(*), key FROM test_table WHERE key = 1 AND key = 2 GROUP BY key; ``` In this case, the WHERE clause obviously evaluates to False. 
The PG16 query plan for this query is: ``` ┌────────────────────────────────────┐ │ QUERY PLAN │ ├────────────────────────────────────┤ │ GroupAggregate │ │ -> Result │ │ One-Time Filter: false │ │ -> Seq Scan on test_table │ │ Filter: (key = 1) │ └────────────────────────────────────┘ ``` The PG17 query plan is: ``` ┌────────────────────────────────┐ │ QUERY PLAN │ ├────────────────────────────────┤ │ GroupAggregate │ │ -> Result │ │ One-Time Filter: false │ └────────────────────────────────┘ ``` In both plans the PG optimizer is able to derive the predicate 1=2 from the equivalence class { key, 1, 2 } and then constant fold this to False. But, in the PG16 plan the Result node has an input node (a sequential scan on test_table), while in the PG17 plan the Result node does not have any input. This is because PG17 recognizes that when the Result filter resolves to False at compile time it is not necessary to set an input on the Result. I think this is a consequence of this PG17 commit: https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=b262ad440 which handles redundant IS [NOT] NULL predicates, but also refactored evaluating of predicates to true/false at compile-time, enabling optimizations such as those seen here. Given the reason for the diff, the fix preserves the test output by modifying the query so the predicates are not contradictory when the CTEs are inlined.	2025-03-12 11:01:49 +03:00
Naisila Puka	46f89ccf65	citus_indent fix (#7746 )	2025-03-12 11:01:49 +03:00
Naisila Puka	51c2e63c30	PG17 compatibility: add COLLPROVIDER_BUILTIN option and fix tests (#7752 ) PG17 adds a builtin C.UTF-8 locale option; we add it in the code to avoid "unknown collation provider" in vanilla tests. Relevant PG commit: `f69319f2f1` f69319f2f1fb16eda4b535bcccec90dff3a6795e Also in PG17, colliculocale and daticulocale were renamed to colllocale and datlocale. Here we fix the following tests to avoid alternative output: pg15, pg16, multi_mx_create_table, multi_schema_support. Relevant PG commit: `f696c0cd5f` f696c0cd5f299f1b51e214efc55a22a782cc175d	2025-03-12 11:01:49 +03:00
Naisila Puka	9a413e0c32	PG17 compatibility: Check whether table AM is default (#7747 ) PG 17 added support for DEFAULT in ALTER TABLE .. SET ACCESS METHOD Relevant PG commit: d61a6cad6418f643a5773352038d0dfe5d3535b8 `d61a6cad64` In that case, name in `AlterTableCmd->name` would be null. Add a null check here to avoid crash.	2025-03-12 11:01:49 +03:00
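A minimal sketch of the null check described above. `AlterTableCmdSketch` and `SetsAccessMethodTo` are hypothetical stand-ins, not the PostgreSQL or Citus definitions; the only assumption taken from the commit message is that the name is NULL when DEFAULT is used.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the one field of AlterTableCmd used here. */
typedef struct AlterTableCmdSketch
{
    const char *name;           /* NULL models SET ACCESS METHOD DEFAULT */
} AlterTableCmdSketch;

/*
 * Returns true only when the command names accessMethod explicitly.
 * Checking for NULL first avoids passing a null pointer to strcmp().
 */
static bool
SetsAccessMethodTo(const AlterTableCmdSketch *cmd, const char *accessMethod)
{
    assert(cmd != NULL && accessMethod != NULL);

    if (cmd->name == NULL)
    {
        return false;
    }

    return strcmp(cmd->name, accessMethod) == 0;
}
```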
Naisila Puka	5540096b9a	PG17 compatibility - Check if there are blocks left in columnar_scan_analyze_next_block (#7738 ) In PG17, the outer loop in `acquire_sample_rows()` changed from `while (BlockSampler_HasMore(&bs))` to `while (table_scan_analyze_next_block(scan, stream))` Relevant PG commit: 041b96802efa33d2bc9456f2ad946976b92b5ae1 `041b96802e` It is expected that the `scan_analyze_next_block` function will check if there are any blocks left. So we add that check in `columnar_scan_analyze_next_block`. Without this fix, we would have an infinite loop causing a timeout. Specifically, in our test schedules, the `multi` schedule got stuck at the `drop_column_partitioned_table` test, the `multi-mx` schedule got stuck at the `start_stop_metadata_sync` test, and the `columnar` schedule got stuck at the `columnar_create` test.	2025-03-12 11:01:49 +03:00
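The loop contract described above can be sketched as follows. This is not the columnar access method code: `BlockScanSketch`, `ScanAnalyzeNextBlockSketch` and `CountSampledBlocks` are hypothetical names, and the block counter is an assumed simplification of the real scan state.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical scan state: how many blocks exist and how many were handed out. */
typedef struct BlockScanSketch
{
    int nextBlock;
    int totalBlocks;
} BlockScanSketch;

/*
 * The callback itself must report exhaustion, because the caller's loop
 * condition is now the callback's return value.
 */
static bool
ScanAnalyzeNextBlockSketch(BlockScanSketch *scan)
{
    assert(scan != NULL);

    if (scan->nextBlock >= scan->totalBlocks)
    {
        return false;           /* no blocks left, the caller's loop ends */
    }

    scan->nextBlock++;
    return true;
}

/* Mirrors the shape of the PG17 outer loop: while (next_block(...)). */
static int
CountSampledBlocks(int totalBlocks)
{
    BlockScanSketch scan = { 0, totalBlocks };
    int visited = 0;

    while (ScanAnalyzeNextBlockSketch(&scan))
    {
        visited++;
    }

    return visited;
}
```

If the callback always returned true, `CountSampledBlocks` would never return, which matches the timeouts described in the commit message.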
Mehmet YILMAZ	c8d9a1bd10	PG17 compatibility: Fix -1/Null diff in attstattarget test output (#7749 ) Changed `attstattarget` in `pg_attribute` to use `NullableDatum`, allowing null representation for default statistics target in PostgreSQL 17. Relevant PG commit: 6a004f1be87d34cfe51acf2fe2552d2b08a79273 `6a004f1be8` ```diff -- verify statistics is set SELECT c.relname, a.attstattarget FROM pg_attribute a JOIN pg_class c ON a.attrelid = c.oid AND c.relname LIKE 'test\_idx%' ORDER BY c.relname, a.attnum; relname \| attstattarget -----------+--------------- test_idx \| 4646 - test_idx2 \| -1 + test_idx2 \| test_idx2 \| 10000 test_idx2 \| 3737 (4 rows) ```	2025-03-12 11:01:49 +03:00
Mehmet YILMAZ	7e8bff034f	PG17 compatibility: Fix -1/Null diff in stxstattarget test output (#7748 ) Changed stxstattarget in pg_statistic_ext to use nullable representation, removing explicit -1 for default statistics target in PostgreSQL 17. Relevant PG commit: 012460ee93c304fbc7220e5b55d9d0577fc766ab `012460ee93` ```diff SELECT stxstattarget, stxrelid::regclass FROM pg_statistic_ext WHERE stxnamespace IN ( SELECT oid FROM pg_namespace WHERE nspname IN ('statistics''TestTarget') ) AND stxname SIMILAR TO '%\_\d+' ORDER BY stxstattarget, stxrelid::regclass ASC; stxstattarget \| stxrelid ---------------+----------------------------------- - -1 \| "statistics'TestTarget".t1_980000 - -1 \| "statistics'TestTarget".t1_980002 ... + \| "statistics'TestTarget".t1_980000 + \| "statistics'TestTarget".t1_980002 ... ```	2025-03-12 11:01:49 +03:00
Naisila Puka	41ea21ee0c	PG17 compatibility: ruleutils (#7725 ) PG17 compatibility - Part 2 https://github.com/citusdata/citus/pull/7699 was the first PG17 compatibility PR merged to main branch, which provided ONLY successful Citus compilation with PG17.0. This PR, consider it as Part 2, provides ruleutils changes for PG17. Ruleutils changes is the first thing we should merge, after successful build. It's the core for deparsing logic in Citus. # Question: How do we add ruleutils changes? - We add a new ruleutils file specific to PG17. - We keep track of the changes in Postgres's ruleutils file from here https://github.com/postgres/postgres/commits/REL_17_0/src/backend/utils/adt/ruleutils.c - Per each commit in that history that belongs only to 17.0, we add the relevant changes to static functions to our ruleutils file for PG17. It's like a manual commit copying. # Check the PR's commits for detailed steps https://github.com/citusdata/citus/pull/7725/commits	2025-03-12 11:01:49 +03:00
Naisila Puka	dce54db494	PG17 compatibility: Resolve compilation issues (#7699 ) This PR provides successful compilation against PG17.0. - Remove ExecFreeExprContext call Relevant PG commit d060e921ea5aa47b6265174c32e1128cebdbc3df `d060e921ea` - PG17 uses streaming IO in analyze, fix scan_analyze_next_block function Relevant PG commit 041b96802efa33d2bc9456f2ad946976b92b5ae1 `041b96802e` - Define ObjectClass for PG17+ only since it's removed Relevant PG commit: 89e5ef7e21812916c9cf9fcf56e45f0f74034656 `89e5ef7e21` - Remove ReorderBufferTupleBuf structure. Relevant PG commit: 08e6344fd6423210b339e92c069bb979ba4e7cd6 `08e6344fd6` - Define colliculocale and daticulocale since they have been renamed Relevant PG commit: f696c0cd5f299f1b51e214efc55a22a782cc175d `f696c0cd5f` - makeStringConst defined in PG17 Relevant PG commit: de3600452b61d1bc3967e9e37e86db8956c8f577 `de3600452b` - RangeVarCallbackOwnsTable was replaced by RangeVarCallbackMaintainsTable Relevant PG commit: ecb0fd33720fab91df1207e85704f382f55e1eb7 `ecb0fd3372` - attstattarget is nullable, define pg compatible functions for it Relevant PG commit: 4f622503d6de975ac87448aea5cea7de4bc140d5 `4f622503d6` - stxstattarget is nullable in PG17, write compat functions for it Relevant PG commit: 012460ee93c304fbc7220e5b55d9d0577fc766ab `012460ee93` - Use ResourceOwner to track WaitEventSet in PG17 Relevant PG commit: 50c67c2019ab9ade8aa8768bfe604cd802fe8591 `50c67c2019` - getIdentitySequence now uses Relation instead of relation_id Relevant PG commit: 509199587df73f06eda898ae13284292f4ae573a `509199587d` - Remove no-op tuplestore_donestoring function Relevant PG commit: 75680c3d805e2323cd437ac567f0677fdfc7b680 `75680c3d80` - MergeAction can have 3 merge kinds (now enum) in PG17, write compat Relevant PG commit: 0294df2f1f842dfb0eed79007b21016f486a3c6c `0294df2f1f` - EXPLAIN (MEMORY) is added, make changes to ExplainOnePlan Relevant PG commit: 5de890e3610d5a12cdaea36413d967cf5c544e20 `5de890e361` - LIMIT_OPTION_DEFAULT 
has been removed as it's useless, use LIMIT_OPTION_COUNT Relevant PG commit: a6be0600ac3b71dda8277ab0fcbe59ee101ac1ce `a6be0600ac` - write compat for create_foreignscan_path because of more arguments in PG17 Relevant PG commit: 9e9931d2bf40e2fea447d779c2e133c2c1256ef3 `9e9931d2bf` - pgprocno and lxid have been combined into a struct in PGPROC Relevant PG commits: 28f3915b73f75bd1b50ba070f56b34241fe53fd1 `28f3915b73` ab355e3a88de745607f6dd4c21f0119b5c68f2ad `ab355e3a88` 024c521117579a6d356050ad3d78fdc95e44eefa `024c521117` - Simplify CitusNewNode (#7434) Postgres refactored newNode() in PG 17; the main reason for doing this is that the original tricks are no longer necessary for modern compilers[1]. This does the same for Citus. This should have no backward compatibility issues since it just replaces palloc0fast with palloc0. This is good for forward compatibility since palloc0fast no longer exists in PG 17. [1] https://www.postgresql.org/message-id/b51f1fa7-7e6a-4ecc-936d-90a8a1659e7c@iki.fi (cherry picked from commit `4b295cc`)	2025-03-12 11:01:49 +03:00
Naisila Puka	6bd3474804	Rename foreach_ macros to foreach_declared_ macros (#7700 ) This is prep work for successful compilation with PG17. PG17 added foreach_ptr, foreach_int and foreach_oid macros. Relevant PG commit 14dd0f27d7cd56ffae9ecdbe324965073d01a9ff `14dd0f27d7` We already have these macros, but they differ from the PG17 ones because our macros take a DECLARED variable, whereas the PG17 macros declare a locally-scoped loop variable themselves. Hence I am renaming our macros to foreach_declared_. I am separating this into its own PR since it touches many files. The main compilation PR is https://github.com/citusdata/citus/pull/7699	2025-03-12 11:01:49 +03:00
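The difference between the two macro styles can be sketched over a plain C array. These are hypothetical stand-ins with a `_sketch` suffix, not the Citus or PostgreSQL macros, which iterate over `List` rather than arrays.

```c
#include <assert.h>
#include <stddef.h>

/* Declared style: the caller must declare var before the loop. */
#define foreach_declared_int_sketch(var, arr, len) \
    for (size_t var##_index = 0; \
         var##_index < (len) && (((var) = (arr)[var##_index]), 1); \
         var##_index++)

/*
 * Scoped style: the macro declares var itself. The outer one-shot loop
 * only exists to hold the declaration; the inner loop iterates.
 */
#define foreach_int_sketch(var, arr, len) \
    for (int var = 0, var##_outer = 1; var##_outer; var##_outer = 0) \
        for (size_t var##_index = 0; \
             var##_index < (len) && ((var = (arr)[var##_index]), 1); \
             var##_index++)

static int
SumDeclared(const int *values, size_t count)
{
    int value = 0;              /* required by the declared style */
    int sum = 0;

    assert(values != NULL || count == 0);

    foreach_declared_int_sketch(value, values, count)
    {
        sum += value;
    }

    return sum;
}

static int
SumScoped(const int *values, size_t count)
{
    int sum = 0;

    assert(values != NULL || count == 0);

    foreach_int_sketch(value, values, count)
    {
        sum += value;           /* value is visible only inside the loop */
    }

    return sum;
}
```

Because the two styles expect different things from the caller, giving them the same name would make existing call sites ambiguous, which is presumably why the rename was done first.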
Maxim Korotkov	d885e1a016	background task execution: fixed dereference of NULL (#7694 ) In the function TaskConcurrentCancelCheck() the pointer "task" was utilized after checking against NULL, which can lead to dereference of the null pointer. To avoid the problem, added a separate handling of the case when the pointer is null with an interruption of execution. Fixes: #7693. Fixes: 1f8675da4382f6e("nonblocking concurrent task execution via background workers") Signed-off-by: Maksim Korotkov <m.korotkov@postgrespro.ru>	2025-03-05 15:07:58 +00:00
Karina	26ad52713c	Check for Citus table in worker_copy_table_to_node (#7662 ) Fixes #6795 The `worker_copy_table_to_node` is not supposed to be called for Citus tables. When this function was initially introduced in #6098 , it had the respective check. But the check was omitted, since `worker_copy_table_to_node` called for Citus table finishes with error anyway: ``` ERROR: cannot execute a distributed query from a query on a shard DETAIL: Executing a distributed query in a function call that may be pushed to a remote node can lead to incorrect results. ``` It turns out that in some cases this error does not occur. See #6795 I suggest restoring that check. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2025-03-05 14:33:52 +00:00
Maxim Korotkov	afcda3feff	casual blocks: fixed potential NULL dereference (#7704 ) The result of FindWorkerNode() is usually checked against NULL.	2025-03-05 13:05:21 +00:00
Onur Tirtir	30bf960c5c	Avoid artifact name collision for flaky test detection jobs	2025-02-24 14:02:13 +03:00
eaydingol	117bd1d04f	Disable nonmaindb interface (#7905 ) DESCRIPTION: The PR disables the non-main db related features. The non-main db related features were introduced in https://github.com/citusdata/citus/pull/7203.	2025-02-21 13:36:19 +03:00
Karina	711aec80fa	Fix system_queries test to actually test the problem (#7613 ) The test added in #7604 doesn't reach the `HasRangeTableRef` function and thus doesn't test what it should. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2025-02-07 14:29:13 +00:00
michailtoksovo	829665ebca	Fix typo: collcet -> collect (#7734 ) Just a tiny typo fix in comment	2025-02-07 14:03:34 +00:00
mulander	f7c57351a7	Update 13 blog URL	2025-02-06 17:59:22 +02:00
mulander	565c309a1e	Update README.md Replace packages for 13.0.1. Drop mention of Centos, we are no longer building packages for it. Change release blog title, URL change pending.	2025-02-06 17:59:22 +02:00
Onur Tirtir	cee0f31ddb	Port recent CI fixes and 13.0.1 changelog entry to main (#7882 ) Although we will re-create the main branch from release-13.0 soon, let's get the CI on main up and running fwiw.	2025-02-04 17:15:47 +03:00
Onur Tirtir	2d8be01853	Disable 2PC recovery while executing ALTER EXTENSION cmd during Citus upgrade tests (cherry picked from commit `b6b73e2f4c`)	2025-02-04 16:53:32 +03:00
Naisila Puka	9a0cc282b7	Changelog entries for v13.0.1 (#7873 ) (cherry picked from commit `d28a5eae6c`)	2025-02-04 16:51:33 +03:00
Gürkan İndibay	7073f06153	Updates github checkout actions to v4 (#7611 ) (cherry picked from commit 3fe22406e62fb40da12a0d91f3ecc0cba81cdb24)	2025-02-04 16:50:01 +03:00
Onur Tirtir	8783cae57f	Avoid publishing artifacts with conflicting names .. as documented in actions/upload-artifact#480. (cherry picked from commit `0d4c676b07`)	2025-02-04 16:49:20 +03:00
Onur Tirtir	b6e3f39583	Fix flaky citus upgrade test (cherry picked from commit `4cad81d643`)	2025-02-04 16:49:12 +03:00
Onur Tirtir	a28f75cc77	Upgrade download-artifacts action to 4.1.8 (cherry picked from commit `5317cc7310`)	2025-02-04 16:49:06 +03:00
Onur Tirtir	af5fced935	Upgrade upload-artifacts action to 4.6.0 (cherry picked from commit `398a2ea197`)	2025-02-04 16:47:04 +03:00
Naisila Puka	7b6a828c74	Changelog entries for 13.0.0 (#7850 )	2025-01-22 12:22:31 +03:00
Naisila Puka	f7bead22d4	Remove accidentally added citus-tools empty submodule (#7842 ) Accidentally added here `4775715691`	2025-01-13 16:49:50 +03:00
Naisila Puka	5ef2cd67ed	Bump pg versions 14.15, 15.10, 16.6 (#7829 ) Bump PG versions to the latest minors 14.15, 15.10, 16.6 There is a libpq symlink issue when the images are built remotely https://github.com/citusdata/citus/actions/runs/12583502447/job/35071296238 Hence, we use the commit sha of a local build of the images, pushed. This is temporary, until we find the underlying cause of the symlink issue. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2025-01-13 16:24:51 +03:00
Seda Gündoğdu	70f84e4aee	Remove Debian Buster support from packaging pipelines (#7828 ) Remove Debian Buster support from packaging-test-pipelines Co-authored-by: Gürkan İndibay <gindibay@microsoft.com>	2025-01-02 12:22:22 +03:00
Naisila Puka	0a6adf4ccc	EXPLAIN generic_plan NOT supported in Citus (#7825 ) We thought we provided support for this in `b8c493f2c4`. However, the use of parameters in SQL is not supported in Citus. Since generic plan queries use parameters, we can't support it for now. Relevant PG16 commit https://github.com/postgres/postgres/commit/3c05284 Fixes #7813 with proper error message	2025-01-02 01:00:40 +03:00
Teja Mupparti	ab7c13beb5	For scenarios, such as, Bug 3697586: Server crashes when assigning distributed transaction: Raise an ERROR instead of a crash	2024-12-26 10:45:59 -08:00
Onur Tirtir	73411915a4	Avoid re-assigning the global pid for client backends and bg workers when the application_name changes (#7791 ) DESCRIPTION: Fixes a crash that happens because of unsafe catalog access when re-assigning the global pid after application_name changes. When application_name changes, we don't actually need to try re-assigning the global pid for external client backends because application_name doesn't affect the global pid for such backends. Plus, trying to re-assign the global pid for external client backends would unnecessarily cause a catalog access when the cached local node id is invalidated. However, accessing the catalog tables is dangerous in certain situations, such as when we're not in a transaction block. And for the other types of backends, i.e., the Citus internal backends, we need to re-assign the global pid when the application_name changes because for such backends we simply extract the global pid inherited from the originating backend from the application_name -that's specified by the originating backend when opening that connection- and this doesn't require catalog access.	2024-12-23 14:01:53 +00:00
Naisila Puka	665d72a2f5	Bump postgres versions in CI and dev: 14.14, 15.9, 16.5 (#7779 ) Upgrade postgres versions to: - 14.14 - 15.9 - 16.5 Depends on https://github.com/citusdata/the-process/pull/163 We had some errors with the latest minors, so this is a 2-level bump for now.	2024-12-23 15:15:15 +03:00
Emel Şimşek	0355b12c7f	Add changelog entries for 12.1.6 (#7770 ) Add changelog entries for 12.1.6	2024-12-04 08:11:33 +00:00
Pavel Seleznev	fe6d198ab2	Remove warnings on some builds (#7680 ) Co-authored-by: Pavel Seleznev <PNSeleznev@sberbank.ru>	2024-12-03 17:10:36 +03:00
Colm	248ff5d52a	[Bug Fix] Query on distributed tables with window partition may cause segfault #7705 (#7718 ) This PR is a proposed fix for issue [7705](https://github.com/citusdata/citus/issues/7705). The following is the background and rationale for the fix (please refer to [7705](https://github.com/citusdata/citus/issues/7705) for context); The `varnullingrels` field was introduced to the Var node struct definition in Postgres 16. Its purpose is to associate a variable with the set of outer join relations that can cause the variable to be NULL. The `varnullingrels` for the variable `"gianluca_camp_test"."start_timestamp"` in the problem query is 3, because the variable "gianluca_camp_test"."start_timestamp" is coming from the inner (nullable) side of an outer join and 3 is the RT index (aka relid) of that outer join. The problem occurs when the Postgres planner attempts to plan the combine query. The format of a combine query is: ``` SELECT <targets> FROM pg_catalog.citus_extradata_container(); ``` There is only one relation in a combine query, so no outer joins are present, but the non-empty `varnullingrels` field causes the Postgres planner to access structures for a non-existent relation. The source of the problem is that, when creating the target list for the combine query, function MasterAggregateMutator() uses copyObject() to construct a Var node before setting the master table ID, and this copies over the non-empty varnullingrels field in the case of the `"gianluca_camp_test"."start_timestamp"` var. The proposed solution is to have MasterAggregateMutator() use makeVar() instead of copyObject(), and only set the fields that make sense for the combine query; var type, collation and type modifier. The `varnullingrels` field can be left empty because there is only one relation in the combine query. A new regress test issue_7705.sql is added to exercise the fix. 
The issue is not specific to window functions, any target expression that cannot be pushed down and contains at least one column from the inner side of a left outer join (so has a non-empty varnullingrels field) can cause the same issue. More about Citus combine queries [here](https://github.com/citusdata/citus/tree/main/src/backend/distributed#combine-query-planner). More about Postgres varnullingrels [here](https://github.com/postgres/postgres/blob/master/src/backend/optimizer/README).	2024-11-13 15:19:59 +00:00
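The copy-versus-construct distinction above can be sketched with a simplified struct. `VarSketch` and `MakeCombineQueryVar` are hypothetical, not the Postgres `Var` node or `makeVar()`, and a plain bitmask is assumed in place of the real `varnullingrels` set.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, reduced stand-in for the Postgres Var node. */
typedef struct VarSketch
{
    int varno;                  /* relation the variable refers to */
    int varattno;               /* column within that relation */
    uint32_t vartype;
    int32_t vartypmod;
    uint32_t varcollid;
    uint32_t varnullingrels;    /* bitmask standing in for the real set */
} VarSketch;

/*
 * Builds a fresh node for the combine query instead of copying the
 * original. Only type, type modifier and collation are carried over;
 * varnullingrels stays empty because the combine query has one relation.
 */
static VarSketch
MakeCombineQueryVar(const VarSketch *original, int masterTableId, int columnId)
{
    VarSketch result = { 0 };

    assert(original != NULL);

    result.varno = masterTableId;
    result.varattno = columnId;
    result.vartype = original->vartype;
    result.vartypmod = original->vartypmod;
    result.varcollid = original->varcollid;

    return result;
}
```

A struct copy followed by overwriting `varno` would instead keep the stale outer join reference, which is the failure mode the commit describes.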
Colm McHugh	c52f36019f	[Bug Fix] [SEGFAULT] Querying distributed tables with window partition may cause segfault #7705 In function MasterAggregateMutator(), when the original Node is a Var node, use makeVar() instead of copyObject() when constructing the Var node for the target list of the combine query. The varnullingrels field of the original Var node is ignored because it is not relevant for the combine query; copying this caused the problem in issue 7705, where a coordinator query had a Var with a reference to a non-existent join relation.	2024-11-06 19:26:29 +00:00
Erik Karsten	f6959715dc	fix: typo runnnig -> running (#7686 ) Very small PR, no changes to behaviour. Just a typo fix :-) Under `src/backend/distributed/sql/udfs/citus_finalize_upgrade_to_citus11/` the sql has a typo "runnnig", which will be displayed to the user if the `citus_check_cluster_node_health()` fails when calling `citus_finish_citus_upgrade();` Co-authored-by: eaydingol <60466783+eaydingol@users.noreply.github.com>	2024-09-17 09:28:46 +03:00
Parag Jain	5bad6c6a1d	[Bug Fix] : writing incorrect data to target Merge repartition Command (#7659 ) We were writing incorrect data to the target collection in some cases of the merge command. In the repartition case, when the source query is RELATION, we were referring to an incorrect attribute number, which resulted in this incorrect behavior. Example : ![image](https://github.com/user-attachments/assets/a101cb36-7976-459c-befb-96a55a5b3dc1) ![image](https://github.com/user-attachments/assets/e5c83b7b-5b8e-4d79-a927-95684dc9ba49) I have added fixed tests as part of this PR. Thanks.	2024-09-12 21:16:39 -07:00
Mehmet YILMAZ	4775715691	Fix race condition in citus_set_coordinator_host when adding multiple coordinator nodes concurrently (#7682 ) When multiple sessions concurrently attempt to add the same coordinator node using `citus_set_coordinator_host`, there is a potential race condition. Both sessions may pass the initial metadata check (`isCoordinatorInMetadata`), but only one will succeed in adding the node. The other session will fail with an assertion error (`Assert(!nodeAlreadyExists)`), causing the server to crash. Even though the `AddNodeMetadata` function takes an exclusive lock, it appears that the lock is not preventing the race condition before the initial metadata check. - Issue: The current logic allows concurrent sessions to pass the check for existing coordinators, leading to an attempt to insert duplicate nodes, which triggers the assertion failure. - Impact: This race condition leads to crashes during operations that involve concurrent coordinator additions, as seen in https://github.com/citusdata/citus/issues/7646. Test Plan: - Isolation Test Limitation: An isolation test was added to simulate concurrent additions of the same coordinator node, but due to the behavior of PostgreSQL locking mechanisms, the test does not trigger the edge case. The lock applied within the function serializes the operations, preventing the race condition from occurring in the isolation test environment. While the edge case is difficult to reproduce in an isolation test, the fix addresses the core issue by ensuring concurrency control through proper locking. - Existing Tests: All existing tests related to node metadata and coordinator management have been run to ensure that no regressions were introduced. After the Fix: - Concurrent attempts to add the same coordinator node will be serialized. One session will succeed in adding the node, while the others will skip the operation without crashing the server. 
Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2024-09-09 17:09:56 +03:00
Mehmet YILMAZ	68d28ecdc0	Add Debugging Instructions to Devcontainer Setup in CONTRIBUTING.md (#7673 ) Description: This PR adds a section to CONTRIBUTING.md that explains how to set up debugging in the devcontainer using VS Code. Changes: - New Debugging Section: Clear instructions on starting the debugger, selecting the appropriate PostgreSQL process, and setting breakpoints for easier troubleshooting. Purpose: - Improved Contributor Workflow: Enables contributors to debug the Citus extension within the devcontainer, enhancing productivity and making it easier to resolve issues. --------- Co-authored-by: Mehmet YILMAZ <mehmet.yilmaz@microsoft.com>	2024-08-23 12:16:18 +03:00
eaydingol	9e1852eac7	Check if the limit is null (#7665 ) DESCRIPTION: Add a check to see if the given limit is null. Fixes a bug by checking if the limit given in the query is null when the actual limit is computed with respect to the given offset. Prior to this change, null is interpreted as 0 during the limit calculation when both limit and offset are given. Fixes #7663	2024-07-31 14:53:38 +03:00
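A minimal sketch of the null handling described above. It is not the Citus planner code: `NullableCount` and `WorkerLimitSketch` are hypothetical, and the sketch assumes the limit computed "with respect to the given offset" is the sum of limit and offset.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical nullable row count. */
typedef struct NullableCount
{
    bool isNull;
    int64_t value;
} NullableCount;

/*
 * A NULL limit means "no limit", so the result must stay NULL instead
 * of being treated as 0 and added to the offset.
 */
static NullableCount
WorkerLimitSketch(NullableCount limit, NullableCount offset)
{
    NullableCount result = { true, 0 };

    if (limit.isNull)
    {
        return result;
    }

    assert(limit.value >= 0);

    result.isNull = false;
    result.value = limit.value + (offset.isNull ? 0 : offset.value);

    return result;
}
```

Treating the NULL limit as 0 would have produced a limit equal to the offset alone, which matches the wrong behavior the commit describes.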
Hanefi Onaldi	2a263fe69a	Add changelog entries for 12.1.5 (#7648 )	2024-07-17 12:21:51 +00:00
Parag Jain	3c467e6e02	Support MERGE command for single_shard_distributed Target (#7643 ) This PR has following changes : 1. Enable MERGE command for single_shard_distributed targets.	2024-07-16 08:08:44 -07:00
Nils Dijk	accb7d09f7	bump postgres versions in CI and dev (#7655 ) Upgrade postgres versions to: - 14.12 - 15.7 - 16.3 Depends on https://github.com/citusdata/the-process/pull/158	2024-07-12 15:26:23 +00:00
Gürkan İndibay	8ac9f0fcee	Adds changelog for 12.1.4 (#7632 )	2024-07-12 09:43:33 +00:00
Gürkan İndibay	c603c3ed74	Removes el/7 and ol/7 as runners (#7650 ) Removes el/7 and ol/7 as runners and update checkout action to v4 We use EL/7 and OL/7 runners to test packaging for these distributions. However, for the past two weeks, we've encountered errors during the checkout step in the pipelines. The error message is as follows: ``` /__e/node20/bin/node: /lib64/libm.so.6: version `GLIBC_2.27' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.20' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `CXXABI_1.3.9' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.21' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libc.so.6: version `GLIBC_2.28' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib64/libc.so.6: version `GLIBC_2.25' not found (required by /__e/node20/bin/node) ``` The glibc version within the EL/7 and OL/7 Docker images is 2.17, and we cannot upgrade it. Therefore, we need to remove these images from the packaging test pipelines. Consequently, we will no longer verify if the code builds for EL/7 and OL/7. However, we are not using these packaging images as runners within the packaging infrastructure, so we can continue to use these images for packaging. Additional Info: I learned that the Marlin team fully dropped el/7 support, so we will drop it in future releases as well	2024-07-12 12:25:12 +03:00
Nils Dijk	e776a7ebbb	CI: move to github container registry (#7652 ) We move the CI images to the github container registry. Given we mostly (if not solely) run these containers on github actions infra it makes sense to have them hosted closer to where they are needed. Image changes: https://github.com/citusdata/the-process/pull/157	2024-07-12 11:26:38 +03:00
Jelte Fennema-Nio	58fef24142	Update Citus Technical Documentation about the rebalancer (#7638 ) The sections about the rebalancer algorithm and the background tasks were empty. --------- Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: Steven Sheehy <17552371+steven-sheehy@users.noreply.github.com>	2024-06-27 16:07:38 +02:00
Jelte Fennema-Nio	aaaf637a6b	Redo #7620 : Fix merge command when insert value does not have source distributed column (#7627 ) Related to issue #7619, #7620 Merge command fails when source query is single sharded and source and target are co-located and insert is not using distribution key of source. Example ``` CREATE TABLE source (id integer); CREATE TABLE target (id integer ); -- let's distribute both table on id field SELECT create_distributed_table('source', 'id'); SELECT create_distributed_table('target', 'id'); MERGE INTO target t USING ( SELECT 1 AS somekey FROM source WHERE source.id = 1) s ON t.id = s.somekey WHEN NOT MATCHED THEN INSERT (id) VALUES (s.somekey) ERROR: MERGE INSERT must use the source table distribution column value HINT: MERGE INSERT must use the source table distribution column value ``` Author's Opinion: If join is not between source and target distributed column, we should not force user to use source distributed column while inserting value of target distributed column. Fix: If user is not using distributed key of source for insertion let's not push down query to workers and don't force user to use source distributed column if it is not part of join. This reverts commit `fa4fc0b372`. Co-authored-by: paragjain <paragjain@microsoft.com>	2024-06-17 14:07:25 +00:00
Jelte Fennema-Nio	fa4fc0b372	Revert rebase merge of #7620 (#7626 ) Because we want to track PR numbers and to make backporting easy we (pretty much always) use squash-merges when merging to master. We accidentally used a rebase merge for PR #7620. This reverts those changes so we can redo the merge using squash merge. This reverts all commits from `eedb607c` to `9e71750fc`.	2024-06-17 15:46:00 +02:00
paragjain	9e71750fcd	fixing flakiness in test	2024-06-15 14:55:36 -07:00
paragjain	e62ae64d00	some more	2024-06-15 14:55:36 -07:00
paragjain	76f68f47c4	removing flakiness from test	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	d5231c34ab	Revert "Try to fix failure" This reverts commit `89f7217660`.	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	f883cfdd77	Try to fix failure	2024-06-15 14:55:36 -07:00
paragjain	7c8a366ba2	some more	2024-06-15 14:55:36 -07:00
paragjain	06e9c29950	some more	2024-06-15 14:55:36 -07:00
paragjain	493140287a	fix some indent	2024-06-15 14:55:36 -07:00
paragjain	ec25b433d4	adding update and delete tests	2024-06-15 14:55:36 -07:00
paragjain	eedb607cd5	merge command fix	2024-06-15 14:55:36 -07:00
Jelte Fennema-Nio	8c9de08b76	Fix CI issues after Github Actions networking changes (#7624 ) For some reason using localhost in our hba file doesn't have the intended effect anymore in our Github Actions runners. Probably because of some networking change (IPv6 maybe) or some change in the `/etc/hosts` file. Replacing localhost with the equivalent loopback IPv4 and IPv6 addresses resolved this issue.	2024-06-14 16:20:23 +02:00
Gürkan İndibay	2874d7af46	Updates github checkout actions to v4 (#7611 ) Updates checkout plugin for github actions to v4. Can not update the version for check-sql-snapshots since new plugin causes below error in the docker image this step is using . Please refer to: https://github.com/citusdata/citus/actions/runs/9286197994/job/25552373953 Error: ``` /__e/node20/bin/node: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.27' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.28' not found (required by /__e/node20/bin/node) /__e/node20/bin/node: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.25' not found (required by /__e/node20/bin/node) ```	2024-05-31 20:52:17 +03:00
Gürkan İndibay	0ab42e7a80	Adds null check for node in HasRangeTableRef (#7609 ) DESCRIPTION: Adds null check for node in HasRangeTableRef to prevent errors	2024-05-28 11:03:38 +03:00
Evgeny Nechayev	fcc72d8a23	Use macro wrapper to access PGPROC data, which allow to improve compa… (#7607 ) DESCRIPTION: Use macro wrapper to access PGPROC data, to improve compatibility with PostgreSQL forks.	2024-05-28 00:39:13 +00:00
Gürkan İndibay	553d5ba15d	Adds changelog for 12.1.3 (#7587 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2024-04-22 15:38:51 +03:00
Jelte Fennema-Nio	a0151aa31d	Greatly speed up "\d tablename" on servers with many tables (#7577 ) DESCRIPTION: Fix performance issue when using "\d tablename" on a server with many tables We introduce a filter to every query on pg_class to automatically remove shards. This is useful to make sure \d and PgAdmin are not cluttered with shards. However, the way we were introducing this filter was using `securityQuals` which can have negative impact on query performance. On clusters with 100k+ tables this could cause a simple "\d tablename" command to take multiple seconds, because a skipped optimization by Postgres causes a full table scan. This changes the code to introduce this filter in the regular `quals` list instead of in `securityQuals`. Which causes Postgres to use the intended optimization again. For reference, this was initially reported as a Postgres issue by me: https://www.postgresql.org/message-id/flat/4189982.1712785863%40sss.pgh.pa.us#b87421293b362d581ea8677e3bfea920	2024-04-16 17:26:12 +02:00
Xing Guo	ada3ba2507	Add missing volatile qualifier. (#7570 ) Variables being modified in the PG_TRY block and read in the PG_CATCH block should be qualified with volatile. The variable waitEventSet is modified in the PG_TRY block (line 1085) and read in the PG_CATCH block (line 1095). The variable relation is modified in the PG_TRY block (line 500) and read in the PG_CATCH block (line 515). Besides, the variable objectAddress doesn't need the volatile qualifier. Ref: C99 7.13.2.1[^1], > All accessible objects have values, and all other components of the abstract machine have state, as of the time the longjmp function was called, except that the values of objects of automatic storage duration that are local to the function containing the invocation of the corresponding setjmp macro that do not have volatile-qualified type and have been changed between the setjmp invocation and longjmp call are indeterminate. [^1]: https://www.open-std.org/jtc1/sc22/wg14/www/docs/n1256.pdf DESCRIPTION: Correctly mark some variables as volatile --------- Co-authored-by: Hong Yi <zouzou0208@gmail.com>	2024-04-16 15:29:14 +02:00
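The rule quoted from C99 7.13.2.1 can be sketched with plain `setjmp`/`longjmp`. The function name `AcquiredFlagSeenByHandler` is hypothetical; the only assumption about PG_TRY and PG_CATCH is the one the commit message itself makes by citing this clause, namely that the same rule applies to them.

```c
#include <assert.h>
#include <setjmp.h>

static jmp_buf errorContext;

/*
 * The flag is modified after setjmp() and read after longjmp(). Without
 * the volatile qualifier its value at that point would be indeterminate
 * under C99 7.13.2.1.
 */
static int
AcquiredFlagSeenByHandler(void)
{
    volatile int acquired = 0;

    if (setjmp(errorContext) == 0)
    {
        /* corresponds to the "try" part: modify, then raise an error */
        acquired = 1;
        longjmp(errorContext, 1);
    }

    /* corresponds to the "catch" part: reached only through longjmp() */
    assert(acquired == 1);
    return acquired;
}
```

Dropping `volatile` here may still appear to work at low optimization levels, which is why this class of bug tends to go unnoticed.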
Karina	41e2af8ff5	Use expecteddir option in _run_pg_regress() (#7582 ) Fix check-arbitrary-configs tests failure with current REL_16_STABLE. This is the same problem as described in #7573. I missed pg_regress call in _run_pg_regress() in that PR. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-04-16 08:44:47 +00:00
Jelte Fennema-Nio	a263ac6f5f	Speed up GetForeignKeyOids (#7578 ) DESCRIPTION: Fix performance issue in GetForeignKeyOids on systems with many constraints GetForeignKeyOids was showing up in CPU profiles when distributing schemas on systems with 100k+ constraints. The reason was that this function was doing a sequential scan of pg_constraint to get the foreign keys that referenced the requested table. This fixes that by finding the constraints referencing the table through pg_depend instead of pg_constraint. We're doing this indirection because pg_constraint doesn't have an index that we can use, but pg_depend does.	2024-04-16 08:16:40 +00:00
Jelte Fennema-Nio	110b4192b2	Fix PG upgrades when invalid rebalance strategies exist (#7580 ) DESCRIPTION: Fix PG upgrades when invalid rebalance strategies exist Without this change an upgrade of a cluster with an invalid rebalance strategy would fail with an error like this: ``` cache lookup failed for shard_cost_function with oid 6077337 CONTEXT: SQL statement "SELECT citus_validate_rebalance_strategy_functions( NEW.shard_cost_function, NEW.node_capacity_function, NEW.shard_allowed_on_node_function)" PL/pgSQL function citus_internal.pg_dist_rebalance_strategy_trigger_func() line 5 at PERFORM SQL statement "INSERT INTO pg_catalog.pg_dist_rebalance_strategy SELECT name, default_strategy, shard_cost_function::regprocedure::regproc, node_capacity_function::regprocedure::regproc, shard_allowed_on_node_function::regprocedure::regproc, default_threshold, minimum_threshold, improvement_threshold FROM public.pg_dist_rebalance_strategy" PL/pgSQL function citus_finish_pg_upgrade() line 115 at SQL statement ``` This fixes that by disabling the trigger and simply re-inserting the invalid rebalance strategy without checking. We could also silently remove it, but this seems nicer.	2024-04-15 14:26:33 +00:00
Jelte Fennema-Nio	16604a6601	Use an index to get FDWs that depend on extensions (#7574 ) DESCRIPTION: Fix performance issue when distributing a table that depends on an extension When the database contains many objects this function would show up in profiles because it was doing a sequence scan on pg_depend. And with many objects pg_depend can get very large. This starts using an index scan to only look for rows containing FDWs, of which there are expected to be very few (often even zero).	2024-04-15 12:42:56 +00:00
Jelte Fennema-Nio	cdf51da458	Speed up SequenceUsedInDistributedTable (#7579 ) DESCRIPTION: Fix performance issue when creating distributed tables if many already exist This builds on the work to speed up EnsureSequenceTypeSupported, and now does something similar for SequenceUsedInDistributedTable. SequenceUsedInDistributedTable had a similar O(number of citus tables) operation. This fixes that and speeds up creation of distributed tables significantly when many distributed tables already exist. Fixes #7022	2024-04-15 12:01:55 +00:00
Jelte Fennema-Nio	381f31756e	Speed up EnsureSequenceTypeSupported (#7575 ) DESCRIPTION: Fix performance issue when creating distributed tables and many already exist EnsureSequenceTypeSupported was doing an O(number of distributed tables) operation. This can become very slow with lots of Citus tables, which now happens much more frequently in practice due to schema based sharding. Partially addresses #7022	2024-04-15 10:28:11 +00:00
Onur Tirtir	3586aab17a	Allow providing "host" parameter via citus.node_conninfo (#7541 ) And when that is the case, directly use it as "host" parameter for the connections between nodes and use the "hostname" provided in pg_dist_node / pg_dist_poolinfo as "hostaddr" to avoid host name lookup. This is to avoid allowing dns resolution (and / or setting up DNS names for each host in the cluster). This already works currently when using IPs in the hostname. The only use of setting host is that you can then use sslmode=verify-full and it will validate that the hostname matches the certificate provided by the node you're connecting to. It would be more flexible to make this a per-node setting, but that requires SQL changes. And we'd like to backport this change, and backporting such a sql change would be quite hard while backporting this change would be very easy. And in many setups, a different hostname for TLS validation is actually not needed. The reason for that is query-from-any node: With query-from-any-node all nodes usually have a certificate that is valid for the same "cluster hostname", either using a wildcard cert or a Subject Alternative Name (SAN). Because if you load balance across nodes you don't know which node you're connecting to, but you still want TLS validation to do its job. So with this change you can use this same "cluster hostname" for TLS validation within the cluster. Obviously this means you don't validate that you're connecting to a particular node, just that you're connecting to one of the nodes in the cluster, but that should be fine from a security perspective (in most cases). Note to self: This change requires updating https://docs.citusdata.com/en/latest/develop/api_guc.html#citus-node-conninfo-text. DESCRIPTION: Allows overwriting host name for all inter-node connections by supporting "host" parameter in citus.node_conninfo	2024-04-15 09:51:11 +00:00
Karina	41d99249d9	Use expecteddir option when running vanilla tests (#7573 ) In PostgreSQL 16 a new option expecteddir was introduced to pg_regress. Together with fix in [196eeb6b](https://github.com/postgres/postgres/commit/196eeb6b) it causes check-vanilla failure if expecteddir is not specified. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-04-10 16:08:54 +00:00
Onur Tirtir	3929a5b2a6	Fix incorrect "VALID UNTIL" assumption made for roles in node activation (#7534 ) Fixes https://github.com/citusdata/citus/issues/7533. DESCRIPTION: Fixes incorrect `VALID UNTIL` setting assumption made for roles when syncing them to new nodes	2024-03-20 11:38:33 +00:00
Emel Şimşek	fdd658acec	Fix crash caused by some form of ALTER TABLE ADD COLUMN statements. (#7522 ) DESCRIPTION: Fixes a crash caused by some form of ALTER TABLE ADD COLUMN statements. When adding multiple columns, if one of the ADD COLUMN statements contains a FOREIGN constraint omitting the referenced columns in the statement, a SEGFAULT occurs. For instance, the following statement results in a crash: ``` ALTER TABLE lt ADD COLUMN new_col1 bool, ADD COLUMN new_col2 int references rt; ``` Fixes #7520.	2024-03-20 11:06:05 +03:00
Onur Tirtir	0acb5f6e86	Fix assertion failure in maintenance daemon during Citus upgrades (#7537 ) Fixes https://github.com/citusdata/citus/issues/7536. Note to reviewer: Before this commit, the following results in an assertion failure when executed locally and this won't be the case anymore: ```console make -C src/test/regress/ check-citus-upgrade-local citus-old-version=v10.2.0 ``` Note that this doesn't happen on CI as we don't enable assertions there. --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-03-20 00:10:12 +00:00
Onur Tirtir	d129064280	Refactor the code that supports node-wide object mgmt commands from non-main dbs (#7544 ) RunPreprocessNonMainDBCommand and RunPostprocessNonMainDBCommand are the entrypoints for this module. These functions are called from utility_hook.c to support some of the node-wide object management commands from non-main databases. To add support for a new command type, one needs to define a new NonMainDbDistributeObjectOps object and add it to GetNonMainDbDistributeObjectOps.	2024-03-19 14:26:17 +01:00
Hanefi Onaldi	bf05bf51ec	Refactor one helper function (#7562 ) The code looks simpler and easier to read now.	2024-03-18 12:06:49 +00:00
eaydingol	8afa2d0386	Change the order in which the locks are acquired (#7542 ) This PR changes the order in which the locks are acquired (for the target and reference tables), when a modify request is initiated from a worker node that is not the "FirstWorkerNode". To prevent concurrent writes, locks are acquired on the first worker node for the replicated tables. When the update statement originates from the first worker node, it acquires the lock on the reference table(s) first, followed by the target table(s). However, if the update statement is initiated in another worker node, the lock requests are sent to the first worker in a different order. This PR unifies the modification order on the first worker node. With the third commit, independent of the node that received the request, the locks are acquired for the modified table and then the reference tables on the first node. The first commit shows a sample output for the test prior to the fix. Fixes #7477 --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-03-10 10:20:08 +03:00
copetol	12f56438fc	Fix segfault when using certain DO block in function (#7554 ) When using a CASE WHEN expression in the body of the function that is used in the DO block, a segmentation fault occurred. This fixes that. Fixes #7381 --------- Co-authored-by: Konstantin Morozov <vzbdryn@yahoo.com>	2024-03-08 14:21:42 +01:00
Karina	f0043b64a1	Fix server crash when trying to execute activate_node_snapshot() on a single-node cluster (#7552 ) This fixes #7551 reported by Egor Chindyaskin Function activate_node_snapshot() is not meant to be called on a cluster without worker nodes. This commit adds ERROR report for such case to prevent server crash.	2024-03-07 11:08:19 +01:00
eaydingol	edcdbe67b1	Fix: store the previous shard cost for order verification (#7550 ) Store the previous shard cost so that the invariant checking performs as expected.	2024-03-06 14:46:49 +03:00
sminux	d59c93bc50	fix bad copy-paste rightComparisonLimit (#7547 ) DESCRIPTION: change for #7543	2024-03-05 08:49:35 +01:00
Gürkan İndibay	51009d0191	Add support for alter/drop role propagation from non-main databases (#7461 ) DESCRIPTION: Adds support for distributed `ALTER/DROP ROLE` commands from the databases where Citus is not installed --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-02-28 08:58:28 +00:00
Onur Tirtir	f4242685e3	Add failure handling for CREATE DATABASE commands (#7483 ) In preprocess phase, we save the original database name, replace dbname field of CreatedbStmt with a temporary name (to let Postgres create the database with the temporary name locally) and then we insert a cleanup record for the temporary database name on all nodes (*). And in postprocess phase, we first rename the temporary database back to its original name for local node and then return a list of distributed DDL jobs i) to create the database with the temporary name and then ii) to rename it back to its original name on other nodes. That way, if CREATE DATABASE fails on any of the nodes, the temporary database will be cleaned up by the cleanup records that we inserted in preprocess phase and in case of a failure, we won't leak any databases with the name that the user intended to use for the database. Solves the problem documented in https://github.com/citusdata/citus/issues/7369 for CREATE DATABASE commands. (*): To ensure that we insert cleanup records on all nodes, with this PR we also start requiring having the coordinator in the metadata because otherwise we would skip inserting a cleanup record for the coordinator.	2024-02-23 17:02:32 +00:00
Nils Dijk	cbb90cc4ae	Devcontainer: enable coredumps (#7523 ) Add configuration for coredumps and document how to make sure they are enabled when developing in a devcontainer. --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-02-23 13:38:11 +00:00
Onur Tirtir	9ddee5d02a	Test that we check unsupported options for CREATE DATABASE from non-main dbs (#7532 ) When adding CREATE/DROP DATABASE propagation in #7240, luckily we've added EnsureSupportedCreateDatabaseCommand() check into deparser too just to be on the safe side. That way, today CREATE DATABASE commands from non-main dbs don't silently allow unsupported options. I wasn't aware of this when merging #7439 and hence wanted to add a test so that we don't mistakenly remove that check from deparser in future.	2024-02-23 10:37:11 +00:00
eaydingol	3509b7df5a	Add support for SECURITY LABEL on ROLE propagation from non-main databases (#7525 ) DESCRIPTION: Adds support for distributed "SECURITY LABEL on ROLE" commands from the databases where Citus is not installed.	2024-02-23 09:54:19 +03:00
Gürkan İndibay	211415dd4b	Removes granted by statement to fix flaky test errors (#7526 ) Fix for the #7519 In metadata sync phase, grant statements for roles are being fetched and propagated from catalog tables. However, in some cases grant .. with admin option clauses executes after the granted by statements which causes #7519 error. We will fix this issue with the grantor propagation task in the project	2024-02-21 18:37:25 +03:00
Karina	683e10ab69	Fix error in master_disable_node/citus_disable_node (#7492 ) This fixes #7454: master_disable_node() has only two arguments, but calls citus_disable_node() that tries to read three arguments Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-02-21 11:35:27 +00:00
Halil Ozan Akgül	852bcc5483	Add support for create / drop database propagation from non-main databases (#7439 ) DESCRIPTION: Adds support for distributed `CREATE/DROP DATABASE ` commands from the databases where Citus is not installed --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-02-21 10:44:01 +00:00
Gürkan İndibay	b3ef1b7e39	Add support for grant on database propagation from non-main databases (#7443 ) DESCRIPTION: Adds support for distributed `GRANT .. ON DATABASE TO USER` commands from the databases where Citus is not installed --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-02-21 13:14:58 +03:00
Onur Tirtir	56e014e64e	Clarify resource-cleaner apis (#7518 ) Rename InsertCleanupRecordInCurrentTransaction -> InsertCleanupOnSuccessRecordInCurrentTransaction and hardcode policy type as CLEANUP_DEFERRED_ON_SUCCESS. Rename InsertCleanupRecordInSubtransaction -> InsertCleanupRecordOutsideTransaction.	2024-02-20 08:57:08 +00:00
Gürkan İndibay	71ccbcf3e2	Adds changelog for v11.0.10 (#7513 )	2024-02-20 08:06:57 +00:00
Gürkan İndibay	2cbfdbfa46	Adds Grant Role support from non-main db (#7404 ) DESCRIPTION: Adds support for distributed role-membership management commands from the databases where Citus is not installed (`GRANT <role> TO <role>`) This PR also refactors the code-path that allows executing some of the node-wide commands so that we send the deparsed query string to other nodes instead of the `queryString` passed into utility hook. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-02-19 17:53:27 +03:00
Gürkan İndibay	9a0cdbf5af	Fixes granted by cascade/restrict statements for revoke (#7517 ) DESCRIPTION: Fixes incorrect propagating of `GRANTED BY` and `CASCADE/RESTRICT` clauses for `REVOKE` statements There are two issues fixed in this PR: 1. The granted by clause now appears for revoke statements as well. 2. The cascade/restrict clause now appears after granted by. Since granted by did not appear in the statements, this bug hasn't been visible until now. However, after activating granted by for revoke, an ordering problem arose, and this PR fixes that ordering problem for cascade/restrict as well. In summary, this PR makes granted by work properly, with the clauses in the correct order. Both fixes can be verified with a single statement: REVOKE dist_role_3 from non_dist_role_3 granted by test_admin_role cascade;	2024-02-19 15:44:21 +03:00
Onur Tirtir	74b55d0546	Enforce using werkzeug 2.3.7 for failure tests and update Postgres versions to latest minors (#7491 ) Let's use version 2.3.7 to fix the following error as we do in docker images created in https://github.com/citusdata/the-process/ repo. ``` ImportError: cannot import name 'url_quote' from 'werkzeug.urls' (/home/onurctirtir/.local/share/virtualenvs/regress-ffZKpSmO/lib/python3.9/site-packages/werkzeug/urls.py) ``` And changing werkzeug version required rebuilding Pipfile.lock file in src/test/regress. Before updating this Pipfile.lock file, we want to make sure that versions specified there don't break any tests. And to ensure that this is the case, https://github.com/citusdata/the-process/pull/155 synchronizes requirements.txt file based on new Pipfile.lock and hence this PR updates test image suffix accordingly. Also, while updating https://github.com/citusdata/the-process/pull/155, I also had to update Postgres versions to latest minors to make image builds passing again and updating Postgres versions in images requires updating Postgres versions in this repo too. While doing that, we also update Postgres version used in devcontainer too.	2024-02-16 14:38:32 +00:00
eaydingol	15a3adebe8	Support SECURITY LABEL ON ROLE from any node (#7508 ) DESCRIPTION: Propagates SECURITY LABEL ON ROLE statement from any node	2024-02-15 20:34:15 +03:00
Gürkan İndibay	59da0633bb	Fixes invalid grantor field parsing in grant role propagation (#7451 ) DESCRIPTION: Resolves an issue that disrupts distributed GRANT statements with the grantor option This PR solves 3 issues: 1. Correcting the erroneous appending of multiple granted by in the deparser. 2. Adding support for grantor (granted by) in grant role propagation. 3. Implementing grantor (granted by) support during the metadata sync grant role propagation phase. Limitations: Currently, the grantor must be created prior to the metadata sync phase. During metadata sync, both the creation of the grantor and the grants given by that role cannot be performed, as the grantor role is not detected during the dependency resolution phase. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-02-15 08:27:29 +00:00
Gürkan İndibay	c665cb8af3	Adds changelog for 11.0.9,11.1.7,11.2.2,11.3.1,12.0.1,12.1.2 (#7507 )	2024-02-14 08:40:28 +03:00
Ivan Vyazmitinov	2fae91c5df	Force LC_COLLATE=C for sort in check_gucs_are_alphabetically_sorted.sh (#7489 ) Fixed gucs check, as described [here](https://github.com/citusdata/citus/pull/7286#discussion_r1481049261)	2024-02-08 12:21:21 +01:00
Onur Tirtir	689c6897a4	Refactor CREATE / DROP database functions for better readability (#7486 )	2024-02-08 01:55:50 +03:00
eaydingol	f01c5f2593	Move remaining citus_internal functions (#7478 ) Moves the following functions to the Citus internal schema: citus_internal_local_blocked_processes citus_internal_global_blocked_processes citus_internal_mark_node_not_synced citus_internal_unregister_tenant_schema_globally citus_internal_update_none_dist_table_metadata citus_internal_update_placement_metadata citus_internal_update_relation_colocation citus_internal_start_replication_origin_tracking citus_internal_stop_replication_origin_tracking citus_internal_is_replication_origin_tracking_active #7405 --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-02-07 16:58:17 +03:00
Filip Sedlák	6869b3ad10	Fail early when shard can't be safely moved to a new node (#7467 ) DESCRIPTION: citus_move_shard_placement now fails early when shard cannot be safely moved The implementation is quite simplistic - `citus_move_shard_placement(...)` will fail with an error if there's any new node in the cluster that doesn't have reference tables yet. It could have been finer-grained, i.e. erroring only when trying to move a shard to an uninitialized node. Looking at the related functions - `replicate_reference_tables()` or `citus_rebalance_start()`, I think it's acceptable behaviour. These other functions also treat "any" uninitialized node as a temporary anomaly. Fixes #7426 --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-02-07 12:04:52 +00:00
Karina	9ff8436f14	Create directories and files with pg_file_create_mode and pg_dir_create_mode permissions (#7479 ) Since Postgres commit da9b580d files and directories are supposed to be created with pg_file_create_mode and pg_dir_create_mode permissions when default permissions are expected. This fixes a failure of one of the postgres tests: If we create file add.conf containing ``` shared_preload_libraries='citus' ``` and run postgres tests ``` TEMP_CONFIG=/path/to/add.conf make installcheck -C src/bin/pg_ctl/ ``` then 001_start_stop.pl fails with ``` .../data/base/pgsql_job_cache mode must be 0750 ``` in the log. In passing this also stops creating directories that we haven't used since Citus 7.4 This change explicitly doesn't change permissions of certificates/keys that we create. --------- Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-02-07 12:48:31 +01:00
eaydingol	594cb6f274	Move more citus internal functions (#7473 ) Moves the following functions: citus_internal_delete_colocation_metadata citus_internal_delete_partition_metadata citus_internal_delete_placement_metadata citus_internal_delete_shard_metadata citus_internal_delete_tenant_schema	2024-01-31 23:00:04 +03:00
eaydingol	d05174093b	Move citus internal functions (#7470 ) Move more functions to citus_internal schema, the list: citus_internal_add_placement_metadata citus_internal_add_shard_metadata citus_internal_add_tenant_schema citus_internal_adjust_local_clock_to_remote citus_internal_database_command #7405	2024-01-31 11:45:19 +00:00
Onur Tirtir	3ce731d497	Make multi_metadata_sync runnable via run_test.py (#7472 )	2024-01-31 09:50:16 +00:00
Onur Tirtir	6f43d5c02f	Enhance technical README for DDL propagation (#7471 )	2024-01-31 10:30:14 +01:00
Onur Tirtir	5aedec4242	Improve error message for recursive CTEs (#7407 ) Fixes #2870	2024-01-30 15:12:48 +00:00
eaydingol	f6ea619e27	Move citus internal functions (#7466 ) Move the following functions from pg_catalog to citus_internal: citus_internal_add_object_metadata citus_internal_add_partition_metadata #7405	2024-01-30 12:27:10 +03:00
Onur Tirtir	9c243d4477	Improve check_gucs_are_alphabetically_sorted.sh (#7460 ) Apparently https://github.com/citusdata/citus/pull/7452 was not enough, need to consider the GUC-like expressions only within RegisterCitusConfigVariables function.	2024-01-26 12:10:35 +00:00
eaydingol	5d673874f7	Move citus internal functions (#7456 ) Move citus_internal_acquire_citus_advisory_object_class_lock and citus_internal_add_colocation_metadata functions from pg_catalog to citus_internal. #7405	2024-01-26 11:46:05 +03:00
Onur Tirtir	24188959ed	Improve the script that sorts GUCs in alphabetical order (#7452 ) Soon we will have occurrences of "citus.X" in shared_library_init.c that are not part of GUC defs, so we need to use a more precise regular expression.	2024-01-25 11:22:39 +03:00
eaydingol	542212c3d8	Make citus_internal schema public (#7450 ) DESCRIPTION: Makes citus_internal schema public #7405	2024-01-24 17:11:10 +03:00
Onur Tirtir	3de5601bcc	Replace LOCAL_HOST_NAME with LocalHostName (#7449 ) The only usages of LOCAL_HOST_NAME were in functions that are only used during regression tests and in places where it was used incorrectly.	2024-01-24 13:50:39 +00:00
Onur Tirtir	1d096df7f4	Not use hardcoded LOCAL_HOST_NAME but citus.local_hostname to distinguish loopback connections (#7436 ) Fixes a bug that breaks queries from non-maindbs when citus.local_hostname is set to a value different than "localhost". This is a very old bug that doesn't cause a problem as long as Citus catalog is available to FindWorkerNode(). And the catalog is always available unless we're in non-main database, which might be the case on main but not on older releases, hence not adding a `DESCRIPTION`. For this reason, I don't see a reason to backport this. Maybe we should totally refrain from using LOCAL_HOST_NAME in all code-paths, but not doing that in this PR as the other paths don't seem to be breaking something that is user-facing. ```c char * GetAuthinfo(char *hostname, int32 port, char *user) { char *authinfo = NULL; bool isLoopback = (strncmp(LOCAL_HOST_NAME, hostname, MAX_NODE_LENGTH) == 0 && PostPortNumber == port); if (IsTransactionState()) { int64 nodeId = WILDCARD_NODE_ID; /* -1 is a special value for loopback connections (task tracker) */ if (isLoopback) { nodeId = LOCALHOST_NODE_ID; } else { WorkerNode *worker = FindWorkerNode(hostname, port); if (worker != NULL) { nodeId = worker->nodeId; } } authinfo = GetAuthinfoViaCatalog(user, nodeId); } return (authinfo != NULL) ? authinfo : ""; } ```	2024-01-24 12:58:55 +00:00
Filip Sedlák	8b48d6ab02	Log username in the failed connection message (#7432 ) This patch includes the username in the reported error message. This makes debugging easier when certain commands open connections as other users than the user that is executing the command. ``` monitora_snapshot=# SELECT citus_move_shard_placement(102030, 'monitora.db-dev-worker-a', 6005, 'monitora.db-dev-worker-a', 6017); ERROR: connection to the remote node monitora_user@monitora.db-dev-worker-a:6017 failed with the following error: fe_sendauth: no password supplied Time: 40,198 ms ```	2024-01-24 11:24:23 +00:00
Halil Ozan Akgül	1cb2e1e4e8	Fixes create user queries from Citus non-main databases with other users (#7442 ) This PR makes the connections to other nodes for `mark_object_distributed` use the same user as `execute_command_on_remote_nodes_as_user` so they'll use the same connection.	2024-01-24 12:57:54 +03:00
Gokhan Gulbiz	3ffb831beb	Update contributing docs (#7447 ) This is a minor change to use a generic name instead of our legacy CI provider name in the contributing documentation.	2024-01-24 09:50:49 +01:00
Gürkan İndibay	863713e9b7	Refactors ExtendedTaskList methods (#7372 ) ExecuteTaskListIntoTupleDestWithParam and ExecuteTaskListIntoTupleDest are nearly the same. I parameterized them and made a reusable structure here --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2024-01-24 06:00:19 +00:00
Teja Mupparti	11d7c27352	Fix assertions in other PG versions too, the original fix is in PR-7379	2024-01-23 15:10:06 -08:00
Jelte Fennema-Nio	9683bef2ec	Replace more spurious strdups with pstrdups (#7441 ) DESCRIPTION: Remove a few small memory leaks In #7440 one instance of a strdup was removed. But there were a few more. This removes the ones that are left over, or adds a comment why strdup is on purpose.	2024-01-23 13:28:26 +01:00
Marco Slot	72fbea20c4	Replace spurious strdup with pstrdup (#7440 ) Not sure why we never found this using valgrind, but using strdup will cause memory leaks because the pointer is not tracked in a memory context.	2024-01-23 11:55:03 +01:00
eaydingol	ee11492a0e	Generate qualified relation name (#7427 ) This change refactors the code by using generate_qualified_relation_name from id instead of using a sequence of functions to generate the relation name. Fixes #6602	2024-01-22 17:32:49 +03:00
zhjwpku	4b295cc857	Simplify CitusNewNode (#7434 ) Postgres refactored newNode() in PG 17; the main reason for doing this is that the original tricks are no longer necessary for modern compilers[1]. This does the same for Citus. This should have no backward compatibility issues since it just replaces palloc0fast with palloc0. This is good for forward compatibility since palloc0fast no longer exists in PG 17. [1] https://www.postgresql.org/message-id/b51f1fa7-7e6a-4ecc-936d-90a8a1659e7c@iki.fi	2024-01-22 14:55:14 +01:00
Jelte Fennema-Nio	14ecebe47c	Fix problems with make check (#7433 ) This fixes two problems: 1. Allow `make check -j20` to work, by disabling parallelism. This was reported by a user in #7432 2. Actually run all the tests by forwarding to `make check` instead of `check-full`, because confusingly `check-full` does not run all the tests.	2024-01-19 17:11:29 +01:00
Gürkan İndibay	188614512f	Adds comment on database and role propagation (#7388 ) DESCRIPTION: Adds comment on database and role propagation. Example commands are as below comment on database <db_name> is '<comment_text>' comment on database <db_name> is NULL comment on role <role_name> is '<comment_text>' comment on role <role_name> is NULL --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2024-01-18 20:58:44 +03:00
Jelte Fennema-Nio	5ec056a172	Add pytest test example about connecting to a worker (#7386 ) I noticed while reviewing #7203 that there was no example of executing sql on a worker for the pytest README. Since this is a pretty common thing that people want to do, this PR adds that.	2024-01-18 15:05:24 +03:00
Jelte Fennema-Nio	fcfedff8d1	Support running isolation_update_node in flaky test detection (#7425 ) I noticed in #7423 that `isolation_update_node` could not be run using flaky test detection. This fixes that.	2024-01-17 15:36:26 +00:00
Valery	6cf6cf37fd	Adds information to explain output when using citus.explain_distributed_queries=false (#7412 ) Fixes https://github.com/citusdata/citus/issues/6490	2024-01-17 15:04:42 +00:00
zhjwpku	51e607878b	remove a duplicate forward declaration and polish some comments (#7371 ) remove a duplicate forward declaration and polish some comments Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2024-01-17 14:30:23 +00:00
Karina	21464adfec	Make isolation_update_node test system independent (#7423 ) Test isolation_update_node fails on some systems with the following error: ``` -s2: WARNING: connection to the remote node non-existent:57637 failed with the following error: could not translate host name "non-existent" to address: Name or service not known +s2: WARNING: connection to the remote node non-existent:57637 failed with the following error: could not translate host name "non-existent" to address: Temporary failure in name resolution ``` This slightly modifies an already existing [normalization rule](`739c6d26df/src/test/regress/bin/normalize.sed (L217-L218)`) to fix it. Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-01-17 13:39:07 +00:00
Onur Tirtir	04b374fc01	Fix upgrade tests (#7413 ) Adding upgrade_basic_before_non_mixed.sql file because while upgrade_basic_after_non_mixed exists, its before variation didn't exist as we don't have any "before" steps. However, run_test.py assumes that all "after" files do have a "before" variation as well. So this PR adds an empty upgrade_basic_before_non_mixed.sql file. Also, given that we don't have a version called 12.1devel anymore, change it to 12.1.1. And finally, let CI skip testing flakiness for upgrade tests both because it's quite hard to get flaky-test-detection job working for upgrade tests and also because in the end it is not very useful to test upgrade tests against flakiness.	2024-01-16 12:37:18 +00:00
Halil Ozan Akgül	739c6d26df	Fix inserting to pg_dist_object for queries from other nodes (#7402 ) Running a query from a Citus non-main database that inserts to pg_dist_object requires a new connection to the main database itself. This PR adds that connection to the main database. --------- Co-authored-by: Jelte Fennema-Nio <github-tech@jeltef.nl>	2024-01-11 16:05:14 +03:00
Teja Mupparti	00068e07c5	Fix the incorrect column count after ALTER TABLE, this fixes the bug #7378 (please read the analysis in the bug for more information)	2024-01-10 12:49:44 -08:00
LightDB Enterprise Postgres	9a91136a3d	Fix timeout when underlying socket is changed in a MultiConnection (#7377 ) When there are multiple localhost entries in /etc/hosts like following /etc/hosts: ``` 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6 127.0.0.1 localhost ``` multi_cluster_management check will fail: ``` @@ -857,20 +857,21 @@ ERROR: group 14 already has a primary node -- check that you can add secondaries and unavailable nodes to a group SELECT groupid AS worker_2_group FROM pg_dist_node WHERE nodeport = :worker_2_port \gset SELECT 1 FROM master_add_node('localhost', 9998, groupid => :worker_1_group, noderole => 'secondary'); ?column? ---------- 1 (1 row) SELECT 1 FROM master_add_node('localhost', 9997, groupid => :worker_1_group, noderole => 'unavailable'); +WARNING: could not establish connection after 5000 ms ?column? ---------- 1 (1 row) ``` This actually isn't just a problem in test environments, but could occur as well during actual usage when a hostname in pg_dist_node resolves to multiple IPs and one of those IPs is unreachable. Postgres will then automatically continue with the next IP, but Citus should listen for events on the new socket. Not on the old one. Co-authored-by: chuhx43211 <chuhx43211@hundsun.com>	2024-01-10 10:49:53 +00:00
zhjwpku	8e979f7ac6	[performance improvement] remove duplicate LoadShardList call (#7380 ) LoadShardList is called twice, which is not necessary, and there is no need to sort the shard placement list since we only want to know the list length.	2024-01-10 11:15:19 +01:00
Onur Tirtir	1d55debb98	Support CREATE / DROP database commands from any node (#7359 ) DESCRIPTION: Adds support for issuing `CREATE`/`DROP` DATABASE commands from worker nodes With this commit, we allow issuing CREATE / DROP DATABASE commands from worker nodes too. As in #7278, this is not allowed when the coordinator is not added to metadata because we don't ever sync metadata changes to coordinator when adding coordinator to the metadata via `SELECT citus_set_coordinator_host('<hostname>')`, or equivalently, via `SELECT citus_add_node(<coordinator_node_name>, <coordinator_node_port>, 0)`. We serialize database management commands by acquiring a Citus specific advisory lock on the first primary worker node if there are any workers in the cluster. As opposed to what we've done in https://github.com/citusdata/citus/pull/7278 for role management commands, we try to avoid running into distributed deadlocks as much as possible. This is because, while distributed deadlocks that can happen around role management commands can be detected by Citus, this is not the case for database management commands because most of them cannot be run inside a transaction block. In that case, Citus cannot even detect the distributed deadlock because the command is not part of a distributed transaction at all, and the command execution might then not return control to the user for an indefinite amount of time.	2024-01-08 16:47:49 +00:00
Karina	20dc58cf5d	Fix getting heap tuple size (#7387 ) This fixes #7230. First of all, using HeapTupleHeaderGetDatumLength(heapTuple) is definitely wrong, it gives a number that's 4 times less than the correct tuple size (heapTuple.t_len). See https://github.com/postgres/postgres/blob/REL_16_0/src/include/access/htup_details.h#L455-L456 https://github.com/postgres/postgres/blob/REL_16_0/src/include/varatt.h#L279 https://github.com/postgres/postgres/blob/REL_16_0/src/include/varatt.h#L225-L226 When I fixed it, the limit_intermediate_size test failed, so I tried to understand what's going on there. In original commit `fd546cf` these queries were supposed to fail. Then in `b3af63c` three of the queries that were supposed to fail suddenly worked and tests were changed to pass without understanding why the output had changed or how to keep the test testing what it had to test. Even comments saying that these queries should fail were left untouched. Commit message gives no clue about why exactly test has changed: > It seems that when we use adaptive executor instead of task tracker, we > exceed the intermediate result size less in the test. Therefore updated > the tests accordingly. Then `3fda2c3` also blindly raised the limit for one of the queries to keep it working: `3fda2c3254 (diff-a9b7b617f9dfd345318cb8987d5897143ca1b723c87b81049bbadd94dcc86570R19)` When in `fe3caf3` that HeapTupleHeaderGetDatumLength(heapTuple) call was finally added, one of those test queries started failing again. The other two now also fail after the fix. I don't understand how exactly the calculation of "intermediate result size" that is limited by citus.max_intermediate_result_size had changed through `b3af63c` and `fe3caf3`, but these numbers are now closer to what they originally were when this limitation was added in `fd546cf`. So these queries should fail, like in the original version of the limit_intermediate_size test. 
Co-authored-by: Karina Litskevich <litskevichkarina@gmail.com>	2024-01-08 17:09:30 +01:00
Onur Tirtir	968ac74cde	Fix foreign_key_to_reference_shard_rebalance test (#7400 ) foreign_key_to_reference_shard_rebalance failed because the partition for the year 2024 does not exist; fixed by adding a default partition. Replaces https://github.com/citusdata/citus/pull/7396 by adding a rule that allows properly testing foreign_key_to_reference_shard_rebalance via run_test.py. Closes #7396 Co-authored-by: chuhx <148182736+cstarc1@users.noreply.github.com>	2024-01-04 13:16:45 +01:00
Onur Tirtir	d940cfa992	Do nothing if the database is not distributed (#7392 ) Fixes the remaining cases reported in https://github.com/citusdata/citus/issues/7370.	2024-01-03 17:03:06 +03:00
Gürkan İndibay	c3579eef06	Adds REASSIGN OWNED BY propagation (#7319 ) DESCRIPTION: Adds REASSIGN OWNED BY propagation This pull request introduces the propagation of the "Reassign owned by" statement. It accommodates both local and distributed roles for both the old and new assignments. However, when the old role is a local role, it undergoes filtering and is not propagated. On the other hand, if the new role is a local role, the process involves first creating the role on worker nodes before propagating the "Reassign owned" statement.	2023-12-28 15:15:58 +03:00
Gürkan İndibay	181b8ab6d5	Adds additional alter database propagation support (#7253 ) DESCRIPTION: Adds database connection limit, rename and set tablespace propagation In this PR, below statement propagations are added alter database <database_name> with allow_connections = <boolean_value>; alter database <database_name> rename to <database_name2>; alter database <database_name> set TABLESPACE <table_space_name> --------- Co-authored-by: Jelte Fennema-Nio <github-tech@jeltef.nl> Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-12-26 14:55:04 +03:00
Halil Ozan Akgül	b877d606c7	Adds 2PC distributed commands from other databases (#7203 ) DESCRIPTION: Adds support for 2PC from non-Citus main databases This PR only adds support for `CREATE USER` queries, other queries need to be added. But it should be simple because this PR creates the underlying structure. Citus main database is the database where the Citus extension is created. A non-main database is any other database on the same node as a Citus main database. When a `CREATE USER` query is run on a non-main database we: 1. Run `start_management_transaction` on the main database. This function saves the outer transaction's xid (the non-main database query's transaction id) and marks the current query as main db command. 2. Run `execute_command_on_remote_nodes_as_user("CREATE USER <username>", <username to run the command>)` on the main database. This function creates the users in the rest of the cluster by running the query on the other nodes. The user on the current node is created by the outer, non-main db query, to make sure subsequent commands in the same transaction can see this user. 3. Run `mark_object_distributed` on the main database. This function adds the user to `pg_dist_object` in all of the nodes, including the current one. This PR also implements transaction recovery for the queries from non-main databases.	2023-12-22 19:19:41 +03:00
Jodi-Ann Francis	6801a1ed1e	PG16 update GRANT... ADMIN \| INHERIT \| SET, and REVOKE Allowing GRANT ADMIN to now also be INHERIT or SET in support of psql16 GRANT role_name [, ...] TO role_specification [, ...] [ WITH { ADMIN \| INHERIT \| SET } { OPTION \| TRUE \| FALSE } ] [ GRANTED BY role_specification ] Fixes: #7148 Related: #7138 See review changes from https://github.com/citusdata/citus/pull/7164	2023-12-13 15:57:02 -05:00
Naisila Puka	dbdde111c1	Add missing order by clause in failure_split_cleanup test (#7363 ) https://github.com/citusdata/citus/actions/runs/6903353045/attempts/1#summary-18781959638 ```diff ARRAY['-100000'], ARRAY[:worker_1_node, :worker_2_node], 'force_logical'); ERROR: server closed the connection unexpectedly CONTEXT: while executing command on localhost:9060 SELECT operation_id, object_type, object_name, node_group_id, policy_type FROM pg_dist_cleanup where operation_id = 777 ORDER BY object_name; operation_id \| object_type \| object_name \| node_group_id \| policy_type --------------+-------------+-----------------------------------------------------------+---------------+------------- 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981000 \| 1 \| 0 - 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 1 \| 1 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 2 \| 0 + 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 1 \| 1 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981003 \| 2 \| 1 777 \| 4 \| citus_shard_split_publication_1_10_777 \| 2 \| 0 (5 rows) ``` Similar attempt to fix in `c9f2fc892d` There were some more missing ORDER BY stuff, so I added them	2023-11-24 18:26:06 +03:00
Nils Dijk	47bd9d8917	Devcontainer: add code formatting tools (#7355 ) The devcontainer missed two tools used by code formatting, as done by `ci/fix_style.sh` The missing tools were both python tools, used for formatting our python scripts. - black - isort This change adds both tools. The way it does this is by keeping a `requirements.txt` in `.devcontainer/` containing all python dependencies we need to install. When installing both tools in a clean environment we have exported all installed packages with `pip freeze` into the `requirements.txt` assuming this is all related to the two tools installed. Since python installs the binaries in `~/.local/bin/` we also move some scripts we manually install from `~/.bin/` to that same directory. At first it seemed like vscode's devcontainers were not having that on the path. However, when the container has that directory when it starts the directory does get added to `$PATH` by `~/.profile`. This makes the whole environment a bit more streamlined.	2023-11-24 13:03:01 +00:00
Naisila Puka	c019acc01b	Run wal2json cdc test for pg16 as well (#7361 ) pg16 wal2json package is now available, adding the tests back. Basically reverting `f253bb3210` Sister PR https://github.com/citusdata/the-process/pull/153	2023-11-24 14:40:23 +03:00
Nils Dijk	0620c8f9a6	Sort includes (#7326 ) This change adds a script to programmatically group all includes in a specific order. The script was used as a one time invocation to group and sort all includes throughout our formatted code. The grouping is as follows: - System includes (eg. `#include<...>`) - Postgres.h (eg. `#include "postgres.h"`) - Toplevel imports from postgres, not contained in a directory (eg. `#include "miscadmin.h"`) - General postgres includes (eg. `#include "nodes/..."`) - Toplevel citus includes, not contained in a directory (eg. `#include "citus_version.h"`) - Columnar includes (eg. `#include "columnar/..."`) - Distributed includes (eg. `#include "distributed/..."`) Because it is quite hard to understand the difference between toplevel citus includes and toplevel postgres includes it hardcodes the list of toplevel citus includes. In the same manner it assumes anything not prefixed with `columnar/` or `distributed/` is a postgres include. The sorting/grouping is enforced by CI. Since we do so with our own script there are no changes required in our uncrustify configuration.	2023-11-23 18:19:54 +01:00
Gürkan İndibay	3b556cb5ed	Adds create / drop database propagation support (#7240 ) DESCRIPTION: Adds support for propagating `CREATE`/`DROP` database In this PR, create and drop database support is added. For CREATE DATABASE: * "oid" option is not supported * specifying "strategy" to be different than "wal_log" is not supported * specifying "template" to be different than "template1" is not supported The last two are because those are not saved in `pg_database` and when activating a node, we cannot assume what parameters were provided when creating the database. And "oid" is not supported because whether the user specified an arbitrary oid when creating the database is not saved in pg_database and we want to avoid oid collisions that might arise from attempting to use an auto-assigned oid on workers. Finally, in case of node activation, GRANTs for the database are also propagated. --------- Co-authored-by: Jelte Fennema-Nio <github-tech@jeltef.nl> Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-21 16:43:51 +03:00
Naisila Puka	cedcc220bf	Fixes flaky VACUUM (freeze, process toast true) result (#7348 ) https://app.circleci.com/pipelines/github/citusdata/citus/34550/workflows/5b802f66-2666-4623-a209-6d7799f7ee5f/jobs/1229153 ```diff VACUUM (FREEZE, PROCESS_TOAST true) local_vacuum_table; SELECT relfrozenxid::text::integer > :frozenxid AS frozen_performed FROM pg_class WHERE oid=:reltoastrelid::regclass; frozen_performed ------------------ - t + f (1 row) ``` Process toast option in vacuum was introduced in PG14. The failing test was supposed to be a part of `multi_utilities.sql`, but it was included in `pg14.sql` to avoid alternative output for PG13. See `ba62c0a148 (diff-ed03478f693155e2fe092e9ad356bf884dc097f554e8d75eff562d52bbcf7a75L255-L272)` for reference. However, now that we don't support PG13 anymore, we can move this test to `multi_utilities.sql`. Moving the test, plus inserting data before running vacuum freeze such that the freeze is more meaningful and not flaky, fixes the flakiness problem of the test.	2023-11-17 18:58:06 +03:00
Naisila Puka	c88bf5ff1c	Cleanup leftover replication slots in publication test (#7354 )	2023-11-17 15:11:38 +03:00
Japin Li	e14e8667cc	Fix redundant variable declaration (#7353 ) `$workerCount` is declared twice in src/test/regress/pg_regress_multi.pl.	2023-11-17 13:01:23 +03:00
Gürkan İndibay	32b0fc23f5	Removes unnecessary package installations in packaging pipelines (#7341 ) With the recent changes in packaging images, the linux package installations needed to execute validate_output are unnecessary now. In this PR, I removed them to make the pipeline more efficient. - [x] Remove the test warning before merge	2023-11-17 08:51:56 +03:00
Naisila Puka	55d500de8d	Remove accidentally added gucs.out (#7349 )	2023-11-16 14:51:31 +03:00
Hanefi Onaldi	5efd3f181a	Fix wrong PR links in changelog (#7350 ) When preparing changelog for 12.1.1 release, I accidentally swapped the PR numbers for the two commits. This commit fixes the changelog to point to the correct PRs.	2023-11-16 14:12:17 +03:00
Naisila Puka	0d1f18862b	Propagates SECURITY LABEL ON ROLE stmt (#7304 ) We propagate `SECURITY LABEL [for provider] ON ROLE rolename IS labelname` to the worker nodes. We also make sure to run the relevant `SecLabelStmt` commands on a newly added node by looking at roles found in `pg_shseclabel`. See official docs for explanation on how this command works: https://www.postgresql.org/docs/current/sql-security-label.html This command stores the role label in the `pg_shseclabel` catalog table. This commit also fixes the regex string in `check_gucs_are_alphabetically_sorted.sh` script such that it escapes the dot. Previously it was looking for all strings starting with "citus" instead of "citus." as it should. To test this feature, I currently make use of a special GUC to control label provider registration in PG_init when creating the Citus extension.	2023-11-16 13:12:30 +03:00
Naisila Puka	c6fbb72c02	Fix flaky multi_prepare_plsql (#7346 ) Simple need of an `ORDER BY` clause Ran into this twice this week already! https://github.com/citusdata/citus/actions/runs/6849701315/attempts/1#summary-18622563506 https://github.com/citusdata/citus/actions/runs/6875051160/attempts/1#summary-18698009952 ```diff SELECT nspname, typname FROM pg_type JOIN pg_namespace ON pg_namespace.oid = pg_type.typnamespace WHERE typname = 'prepare_ddl_type_backup'; nspname \| typname -------------+------------------------- - public \| prepare_ddl_type_backup otherschema \| prepare_ddl_type_backup + public \| prepare_ddl_type_backup (2 rows) ```	2023-11-15 13:28:43 +03:00
Naisila Puka	a960799dfb	Clean up leftover replication slots in tests (#7338 ) This commit fixes the flakiness in `logical_replication` and `citus_non_blocking_split_shard_cleanup` tests. The flakiness was related to leftover replication slots. Below is a flaky example for each test: logical_replication https://github.com/citusdata/citus/actions/runs/6721324131/attempts/1#summary-18267030604 citus_non_blocking_split_shard_cleanup https://github.com/citusdata/citus/actions/runs/6721324131/attempts/1#summary-18267006967 ```diff -- Replication slots should be cleaned up SELECT slot_name FROM pg_replication_slots; slot_name --------------------------------- -(0 rows) + citus_shard_split_slot_19_10_17 +(1 row) ``` The tests by themselves are not flaky: 32 flaky test schedules each with 20 runs run successfully. https://github.com/citusdata/citus/actions/runs/6822020127?pr=7338 The conclusion is that: 1. `multi_tenant_isolation_nonblocking` is the problematic test running before `logical_replication` in the `enterprise_schedule`, so I added a cleanup at the end of `multi_tenant_isolation_nonblocking`. https://github.com/citusdata/citus/actions/runs/6824334614/attempts/1#summary-18560127461 2. `citus_split_shard_by_split_points_negative` is the problematic test running before `citus_non_blocking_split_shards_cleanup` in the split schedule. Also added cleanup line. For details on the investigation of leftover replication slots, please check the PR https://github.com/citusdata/citus/pull/7338	2023-11-14 18:50:54 +03:00
Naisila Puka	cdef2d5224	Random tests refactoring (#7342 ) While investigating replication slots leftovers in PR https://github.com/citusdata/citus/pull/7338, I ran into the following refactoring/cleanup that can be done in our test suite: - Add separate test to remove non default nodes - Remove coordinator removal from `add_coordinator` test Use `remove_coordinator_from_metadata` test where needed - Don't print nodeids in `multi_multiuser_auth` and `multi_poolinfo_usage` tests - Use `startswith` when checking for isolation or failure tests - Add some dependencies accordingly in `run_test.py` for running flaky test schedules	2023-11-14 12:49:15 +03:00
Naisila Puka	e4ac3e6d9a	Bump PG versions to latest minors 14.10, 15.5, 16.1 (#7336 ) Postgres got minor updates on Nov9, this starts using the images with the latest version for our tests, namely 14.10, 15.5 and 16.1. These minor updates were compatible with Citus. Sister PR: https://github.com/citusdata/the-process/pull/152	2023-11-13 15:05:38 +03:00
Onur Tirtir	240313e286	Support role commands from any node (#7278 ) DESCRIPTION: Adds support for issuing role management commands from worker nodes It's unlikely to get into a distributed deadlock with role commands, so we don't care much about them at the moment. There were several attempts to reduce the chances of a deadlock but we didn't get any of them merged into the main branch yet, see: #7325 #7016 #7009	2023-11-10 09:58:51 +00:00
Naisila Puka	57ff762c82	Fix VACUUM flakiness in multi_utilities (#7334 ) When I run this test locally, the size of the table after the DELETE command is around 58785792. Hence, I assume that the diffs suggest that the VACUUM had no effect. The current solution is to run the VACUUM command three times instead of once. Example diff: https://github.com/citusdata/citus/actions/runs/6722231142/attempts/1#summary-18269870674 ```diff insert into local_vacuum_table select i from generate_series(1,1000000) i; delete from local_vacuum_table; VACUUM local_vacuum_table; SELECT CASE WHEN s BETWEEN 20000000 AND 25000000 THEN 22500000 ELSE s END FROM pg_total_relation_size('local_vacuum_table') s ; s ---------- - 22500000 + 58785792 (1 row) ``` See more diff examples in the PR description https://github.com/citusdata/citus/pull/7334	2023-11-09 21:00:24 +03:00
dependabot[bot]	c028d929b5	Bump werkzeug from 2.3.7 to 3.0.1 in /.devcontainer/src/test/regress Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.3.7 to 3.0.1. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/werkzeug/compare/2.3.7...3.0.1) --- updated-dependencies: - dependency-name: werkzeug dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2023-11-09 17:14:14 +01:00
dependabot[bot]	d4663212f4	Bump werkzeug from 2.3.7 to 3.0.1 in /src/test/regress Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.3.7 to 3.0.1. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/werkzeug/compare/2.3.7...3.0.1) --- updated-dependencies: - dependency-name: werkzeug dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2023-11-09 17:14:14 +01:00
Nils Dijk	0dac63afc0	move pg_version_constants.h to toplevel include (#7335 ) In preparation of sorting and grouping all includes we wanted to move this file to the toplevel includes for good grouping/sorting.	2023-11-09 15:09:39 +00:00
Hanefi Onaldi	92228b279a	Add changelog entries for 12.1.1 (#7332 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-09 14:19:28 +00:00
Naisila Puka	0dc41ee5a0	Fix flaky multi_mx_insert_select_repartition test (#7331 ) https://github.com/citusdata/citus/actions/runs/6745019678/attempts/1#summary-18336188930 ```diff insert into target_table SELECT a*2 FROM source_table RETURNING a; -NOTICE: executing the command locally: SELECT bytes FROM fetch_intermediate_results(ARRAY['repartitioned_results_xxxxx_from_4213582_to_0','repartitioned_results_xxxxx_from_4213584_to_0']::text[],'localhost',57638) bytes +NOTICE: executing the command locally: SELECT bytes FROM fetch_intermediate_results(ARRAY['repartitioned_results_3940758121873413_from_4213584_to_0','repartitioned_results_3940758121873413_from_4213582_to_0']::text[],'localhost',57638) bytes ``` The elements in the array passed to `fetch_intermediate_results` are the same, but in the opposite order than expected. To fix this flakiness, we can omit the `"SELECT bytes FROM fetch_intermediate_results..."` line. From the following logs, it is understandable that the intermediate results have been fetched.	2023-11-08 15:15:33 +03:00
Onur Tirtir	444e6cb7d6	Remove useless variables (#7327 ) To fix warnings observed when using different compiler versions.	2023-11-07 16:39:08 +03:00
cvbhjkl	e535f53ce5	Fix typo in local_executor.c (#7324 ) Fix a typo 'remaning' -> 'remaining' in local_executor.c	2023-11-03 12:14:11 +00:00
Onur Tirtir	21646ca1e9	Fix flaky isolation_get_all_active_transactions.spec test (#7323 ) Fix the flaky test that results in following diff by waiting until the backend that we want to terminate really terminates, until 5secs. ```diff --- /__w/citus/citus/src/test/regress/expected/isolation_get_all_active_transactions.out.modified 2023-11-01 16:30:57.648749795 +0000 +++ /__w/citus/citus/src/test/regress/results/isolation_get_all_active_transactions.out.modified 2023-11-01 16:30:57.656749877 +0000 @@ -114,13 +114,13 @@ -------------------- t (1 row) step s3-show-activity: SET ROLE postgres; select count() from get_all_active_transactions() where process_id IN (SELECT FROM selected_pid); count ----- - 0 + 1 (1 row) ```	2023-11-03 09:00:32 +01:00
Onur Tirtir	5e2439a117	Make some more tests re-runable (#7322 ) * multi_mx_create_table * multi_mx_function_table_reference * multi_mx_add_coordinator * create_role_propagation * metadata_sync_helpers * text_search https://github.com/citusdata/citus/pull/7278 requires this.	2023-11-02 18:32:56 +03:00
Jelte Fennema-Nio	85b997a0fb	Fix flaky multi_alter_table_statements (#7321 ) Sometimes multi_alter_table_statements would fail in CI like this: ```diff -- Verify that DROP NOT NULL works ALTER TABLE lineitem_alter ALTER COLUMN int_column2 DROP NOT NULL; SELECT "Column", "Type", "Modifiers" FROM table_desc WHERE relid='lineitem_alter'::regclass; - Column \| Type \| Modifiers ---------------------------------------------------------------------- - l_orderkey \| bigint \| not null - l_partkey \| integer \| not null - l_suppkey \| integer \| not null - l_linenumber \| integer \| not null - l_quantity \| numeric(15,2) \| not null - l_extendedprice \| numeric(15,2) \| not null - l_discount \| numeric(15,2) \| not null - l_tax \| numeric(15,2) \| not null - l_returnflag \| character(1) \| not null - l_linestatus \| character(1) \| not null - l_shipdate \| date \| not null - l_commitdate \| date \| not null - l_receiptdate \| date \| not null - l_shipinstruct \| character(25) \| not null - l_shipmode \| character(10) \| not null - l_comment \| character varying(44) \| not null - float_column \| double precision \| default 1 - date_column \| date \| - int_column1 \| integer \| - int_column2 \| integer \| - null_column \| integer \| -(21 rows) - +ERROR: schema "alter_table_add_column" does not exist -- COPY should succeed now SELECT master_create_empty_shard('lineitem_alter') as shardid \gset ``` Reading from table_desc apparently has an issue: if the schema of one of the items gets deleted while it is being read, we get such an error. This change fixes that by not running multi_alter_table_statements in parallel with alter_table_add_column anymore. This is another instance of the same issue as in #7294	2023-11-02 16:42:45 +03:00
Jelte Fennema-Nio	f171ec98fc	Fix flaky failure_distributed_results (#7307 ) Sometimes in CI we run into this failure: ```diff SELECT resultId, nodeport, rowcount, targetShardId, targetShardIndex FROM partition_task_list_results('test', $$ SELECT * FROM source_table $$, 'target_table') NATURAL JOIN pg_dist_node; -WARNING: connection to the remote node localhost:xxxxx failed with the following error: connection not open +ERROR: connection to the remote node localhost:9060 failed with the following error: connection not open SELECT * FROM distributed_result_info ORDER BY resultId; - resultid \| nodeport \| rowcount \| targetshardid \| targetshardindex ---------------------------------------------------------------------- - test_from_100800_to_0 \| 9060 \| 22 \| 100805 \| 0 - test_from_100801_to_0 \| 57637 \| 2 \| 100805 \| 0 - test_from_100801_to_1 \| 57637 \| 15 \| 100806 \| 1 - test_from_100802_to_1 \| 57637 \| 10 \| 100806 \| 1 - test_from_100802_to_2 \| 57637 \| 5 \| 100807 \| 2 - test_from_100803_to_2 \| 57637 \| 18 \| 100807 \| 2 - test_from_100803_to_3 \| 57637 \| 4 \| 100808 \| 3 - test_from_100804_to_3 \| 9060 \| 24 \| 100808 \| 3 -(8 rows) - +ERROR: current transaction is aborted, commands ignored until end of transaction block -- fetch from worker 2 should fail SAVEPOINT s1; +ERROR: current transaction is aborted, commands ignored until end of transaction block SELECT fetch_intermediate_results('{test_from_100802_to_1,test_from_100802_to_2}'::text[], 'localhost', :worker_2_port) > 0 AS fetched; -ERROR: could not open file "base/pgsql_job_cache/xx_x_xxx/test_from_100802_to_1.data": No such file or directory -CONTEXT: while executing command on localhost:xxxxx +ERROR: current transaction is aborted, commands ignored until end of transaction block ROLLBACK TO SAVEPOINT s1; +ERROR: savepoint "s1" does not exist -- fetch from worker 1 should succeed SELECT fetch_intermediate_results('{test_from_100802_to_1,test_from_100802_to_2}'::text[], 'localhost', 
:worker_1_port) > 0 AS fetched; - fetched ---------------------------------------------------------------------- - t -(1 row) - +ERROR: current transaction is aborted, commands ignored until end of transaction block -- make sure the results read are same as the previous transaction block SELECT count(*), sum(x) FROM read_intermediate_results('{test_from_100802_to_1,test_from_100802_to_2}'::text[],'binary') AS res (x int); - count \| sum ---------------------------------------------------------------------- - 15 \| 863 -(1 row) - +ERROR: current transaction is aborted, commands ignored until end of transaction block ROLLBACk; ``` As outlined in the #7306 I created, the reason for this is related to only having a single connection open to the node. Finding and fixing the full cause is not trivial, so instead this PR starts working around this bug by forcing maximum parallelism. Preferably we'd want this workaround not to be necessary, but that requires spending time to fix this. For now having a less flaky CI is good enough.	2023-11-02 12:31:56 +00:00
Jelte Fennema-Nio	b47c8b3fb0	Fix flaky insert_select_connection_leak (#7302 ) Sometimes in CI insert_select_connection_leak would fail like this: ```diff END; SELECT worker_connection_count(:worker_1_port) - :pre_xact_worker_1_connections AS leaked_worker_1_connections, worker_connection_count(:worker_2_port) - :pre_xact_worker_2_connections AS leaked_worker_2_connections; leaked_worker_1_connections \| leaked_worker_2_connections -----------------------------+----------------------------- - 0 \| 0 + -1 \| 0 (1 row) -- ROLLBACK BEGIN; INSERT INTO target_table SELECT * FROM source_table; INSERT INTO target_table SELECT * FROM source_table; ROLLBACK; SELECT worker_connection_count(:worker_1_port) - :pre_xact_worker_1_connections AS leaked_worker_1_connections, worker_connection_count(:worker_2_port) - :pre_xact_worker_2_connections AS leaked_worker_2_connections; leaked_worker_1_connections \| leaked_worker_2_connections -----------------------------+----------------------------- - 0 \| 0 + -1 \| 0 (1 row) \set VERBOSITY TERSE -- Error on constraint failure BEGIN; INSERT INTO target_table SELECT * FROM source_table; SELECT worker_connection_count(:worker_1_port) AS worker_1_connections, worker_connection_count(:worker_2_port) AS worker_2_connections \gset SAVEPOINT s1; INSERT INTO target_table SELECT a, CASE WHEN a < 50 THEN b ELSE null END FROM source_table; @@ -89,15 +89,15 @@ leaked_worker_1_connections \| leaked_worker_2_connections -----------------------------+----------------------------- 0 \| 0 (1 row) END; SELECT worker_connection_count(:worker_1_port) - :pre_xact_worker_1_connections AS leaked_worker_1_connections, worker_connection_count(:worker_2_port) - :pre_xact_worker_2_connections AS leaked_worker_2_connections; leaked_worker_1_connections \| leaked_worker_2_connections -----------------------------+----------------------------- - 0 \| 0 + -1 \| 0 (1 row) ``` Source: 
https://github.com/citusdata/citus/actions/runs/6718401194/attempts/1#summary-18258258387 A negative number of leaked connections is obviously not possible. For some reason there was a connection open when we checked the initial number of connections that was closed afterwards. This could be from the maintenance daemon or maybe from the previous test that had not fully closed its connections just yet. The change in this PR doesn't actually fix the cause of the negative connection count, but it simply considers it good as well, by changing the result to zero for negative values. With this fix we might sometimes miss a leak, because the negative number can cancel out the leak and still result in a 0. But since the negative number only occurs sometimes, we'll still find the leak often enough.	2023-11-02 13:15:43 +01:00
Cédric Villemain	0678a2fd89	Fix #7242 , CALL(@0) crash backend (#7288 ) When executing a prepared CALL, which is not pure SQL but available with some drivers like npgsql and pgjdbc, Citus entered a code path where a plan is not defined, while trying to increase its cost. Thus SIG11 when plan is a NULL pointer. Fix by only increasing plan cost when plan is not null. However, it is a bit suspicious to get here with a NULL plan and maybe a better change would be to not call ShardPlacementForFunctionColocatedWithDistTable() with a NULL plan at all (in call.c:134). The bug is hit with, for example: ``` CallableStatement proc = con.prepareCall("{CALL p(?)}"); proc.registerOutParameter(1, java.sql.Types.BIGINT); proc.setInt(1, -100); proc.execute(); ``` where `p(bigint)` is a distributed "function" and the param the distribution key (also in a distributed table), see #7242 for details Fixes #7242	2023-11-02 13:15:24 +01:00
Jelte Fennema-Nio	5a48a1602e	Debug flaky logical_replication test (#7309 ) Sometimes in CI our logical_replication test fails like this: ```diff +++ /__w/citus/citus/src/test/regress/results/logical_replication.out.modified 2023-11-01 14:15:08.562758546 +0000 @@ -40,21 +40,21 @@ SELECT count() from pg_publication; count ------- 0 (1 row) SELECT count() from pg_replication_slots; count ------- - 0 + 1 (1 row) SELECT count(*) FROM dist; count ------- ``` It's hard to understand what is going on here, just based on the wrong number. So this PR changes the test to show the name of the subscription, publication and replication slot to make finding the cause easier. In passing this also fixes another flaky test in the same file that our flaky test detection picked up. This is done by waiting for resource cleanup after the shard move.	2023-11-02 13:15:02 +01:00
Jelte Fennema-Nio	6fed82609c	Do not download all artifacts for flaky test detection (#7320 ) This is causing 404 failures due to a race condition: https://github.com/actions/toolkit/issues/1235 It also makes the tests take unnecessarily long. This was tested by changing a test file and seeing that the flaky test detection was still working.	2023-11-02 12:13:29 +00:00
Onur Tirtir	9867c5b949	Fix flaky multi_mx_node_metadata.sql test (#7317 ) Fixes the flaky test that results in following diff: ```diff --- /__w/citus/citus/src/test/regress/expected/multi_mx_node_metadata.out.modified 2023-11-01 14:22:12.890476575 +0000 +++ /__w/citus/citus/src/test/regress/results/multi_mx_node_metadata.out.modified 2023-11-01 14:22:12.914476657 +0000 @@ -840,24 +840,26 @@ (1 row) \c :datname - - :master_port SELECT datname FROM pg_stat_activity WHERE application_name LIKE 'Citus Met%'; datname ------------ db_to_drop (1 row) DROP DATABASE db_to_drop; +ERROR: database "db_to_drop" is being accessed by other users SELECT datname FROM pg_stat_activity WHERE application_name LIKE 'Citus Met%'; datname ------------ -(0 rows) + db_to_drop +(1 row) -- cleanup DROP SEQUENCE sequence CASCADE; NOTICE: drop cascades to default value for column a of table reference_table ```	2023-11-02 11:02:34 +00:00
Gürkan İndibay	184c8fc1ee	Enriches statement propagation document (#7267 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com> Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2023-11-02 09:59:34 +00:00
Jelte Fennema-Nio	a6e86884f6	Fix flaky isolation_metadata_sync_deadlock (#7312 ) Sometimes isolation_metadata_sync_deadlock fails in CI like this: ```diff diff -dU10 -w /__w/citus/citus/src/test/regress/expected/isolation_metadata_sync_deadlock.out /__w/citus/citus/src/test/regress/results/isolation_metadata_sync_deadlock.out --- /__w/citus/citus/src/test/regress/expected/isolation_metadata_sync_deadlock.out.modified 2023-11-01 16:03:15.090199229 +0000 +++ /__w/citus/citus/src/test/regress/results/isolation_metadata_sync_deadlock.out.modified 2023-11-01 16:03:15.098199312 +0000 @@ -110,10 +110,14 @@ t (1 row) step s2-stop-connection: SELECT stop_session_level_connection_to_node(); stop_session_level_connection_to_node ------------------------------------- (1 row) + +teardown failed: ERROR: localhost:57638 is a metadata node, but is out of sync +HINT: If the node is up, wait until metadata gets synced to it and try again. +CONTEXT: SQL statement "SELECT master_remove_distributed_table_metadata_from_workers(v_obj.objid, v_obj.schema_name, v_obj.object_name)" ``` Source: https://github.com/citusdata/citus/actions/runs/6721938040/attempts/1#summary-18268946448 To fix this we now wait for the metadata to be fully synced to all nodes at the start of the teardown steps.	2023-11-02 10:39:05 +01:00
Jelte Fennema-Nio	ea5551689e	Prepare github actions pipelines for merge queue (#7315 ) Github has a built in merge queue. I think it would be good to try this out, to speed up merging PRs when multiple people want to merge at the same time. This PR does not enable it yet, but it starts triggering Github actions also for the `merge_queue` event. This is a requirement for trying them out. Announcement: https://github.blog/2023-07-12-github-merge-queue-is-generally-available/ Docs: https://docs.github.com/en/repositories/configuring-branches-and-merges-in-your-repository/configuring-pull-request-merges/managing-a-merge-queue	2023-11-02 08:23:34 +00:00
Onur Tirtir	2cf4c04023	Fix flaky global_cancel.sql test (#7316 )	2023-11-01 23:59:41 +01:00
Jelte Fennema-Nio	e3c93c303d	Fix flaky citus_non_blocking_split_shard_cleanup (#7311 ) Sometimes in CI citus_non_blocking_split_shard_cleanup failed like this: ```diff --- /__w/citus/citus/src/test/regress/expected/citus_non_blocking_split_shard_cleanup.out.modified 2023-11-01 15:07:14.280551207 +0000 +++ /__w/citus/citus/src/test/regress/results/citus_non_blocking_split_shard_cleanup.out.modified 2023-11-01 15:07:14.292551358 +0000 @@ -106,21 +106,22 @@ ----------------------------------- (1 row) \c - - - :worker_2_port SET search_path TO "citus_split_test_schema"; -- Replication slots should be cleaned up SELECT slot_name FROM pg_replication_slots; slot_name --------------------------------- -(0 rows) + citus_shard_split_slot_19_10_17 +(1 row) -- Publications should be cleanedup SELECT count(*) FROM pg_publication; count ``` It's expected that the replication slot is sometimes not cleaned up if we don't wait until resource cleanup completes. This PR starts doing that here.	2023-11-01 16:21:12 +00:00
Gürkan İndibay	5903196020	Removes use-base-schedule flag from CI (#7301 ) Normally, tests that are written to be independent of other tests can, and should, use minimal-tests. However, our test settings use base-schedule, which may cause unnecessary dependencies and thus unrelated errors that developers don't see in their local environment. With this change, the default setting will be minimal, so that tests will be free of unnecessary dependencies.	2023-11-01 15:52:22 +00:00
Jelte Fennema-Nio	c9f2fc892d	Fix flaky failure_split_cleanup (#7299 ) Sometimes failure_split_cleanup failed in CI like this: ```diff ERROR: server closed the connection unexpectedly CONTEXT: while executing command on localhost:9060 SELECT operation_id, object_type, object_name, node_group_id, policy_type FROM pg_dist_cleanup where operation_id = 777 ORDER BY object_name; operation_id \| object_type \| object_name \| node_group_id \| policy_type --------------+-------------+-----------------------------------------------------------+---------------+------------- 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981000 \| 1 \| 0 - 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 1 \| 1 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 2 \| 0 + 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981002 \| 1 \| 1 777 \| 1 \| citus_failure_split_cleanup_schema.table_to_split_8981003 \| 2 \| 1 777 \| 4 \| citus_shard_split_publication_1_10_777 \| 2 \| 0 (5 rows) -- we need to allow connection so that we can connect to proxy ``` Source: https://github.com/citusdata/citus/actions/runs/6717642291/attempts/1#summary-18256014949 It's the common problem where we're missing a column in the ORDER BY clause. This fixes that by adding node_group_id to the ORDER BY clause of the query in question.	2023-11-01 14:08:51 +00:00
Jelte Fennema-Nio	c83c556702	Fix flaky isolation_master_update_node (#7303 ) Sometimes in CI isolation_master_update_node fails like this: ```diff ------------------ (1 row) step s2-abort: ABORT; step s1-abort: ABORT; FATAL: terminating connection due to administrator command FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly +server closed the connection unexpectedly master_remove_node ------------------ ``` This just seems like a random error line. The only way to reasonably fix this is by adding an extra output file. So that's what this PR does.	2023-11-01 16:44:45 +03:00
Jelte Fennema-Nio	2bccb58157	Run github actions on main (#7292 ) We want the nice looking green checkmark on our main branch too. This PR includes running on pushes to release branches too, but that won't come into effect until we have release branches with this workflow file.	2023-11-01 13:12:20 +01:00
Jelte Fennema-Nio	0d83ab57de	Fix flaky multi_cluster_management (#7295 ) One of our most flaky and most annoying tests is multi_cluster_management. It usually fails like this: ```diff SELECT citus_disable_node('localhost', :worker_2_port); citus_disable_node -------------------- (1 row) SELECT public.wait_until_metadata_sync(60000); +WARNING: waiting for metadata sync timed out wait_until_metadata_sync -------------------------- (1 row) ``` This tries to address that by hardening wait_until_metadata_sync. I believe the reason for this warning is that there is a race condition in wait_until_metadata_sync. It's possible for the pre-check to fail, then have the maintenance daemon send a notification. And only then have the backend start to listen. I tried to fix it in two ways: 1. First run LISTEN, and only then do the pre-check. 2. If we time out, check again just to make sure that we did not miss the notification somehow. And don't show a warning if all metadata is synced after the timeout. It's hard to know for sure that this fixes it because the test is not repeatable and I could not reproduce it locally. Let's just hope for the best. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-01 10:46:01 +00:00
Jelte Fennema-Nio	20ae42e7fa	Fix flaky multi_reference_table test (#7294 ) Sometimes multi_reference_table failed in CI like this: ```diff \c - - - :master_port DROP INDEX reference_schema.reference_index_2; \c - - - :worker_1_port SELECT "Column", "Type", "Modifiers" FROM table_desc WHERE relid='reference_schema.reference_table_ddl_1250019'::regclass; - Column \| Type \| Modifiers ---------------------------------------------------------------------- - value_2 \| double precision \| default 25.0 - value_3 \| text \| not null - value_4 \| timestamp without time zone \| - value_5 \| double precision \| -(4 rows) - +ERROR: schema "citus_local_table_queries" does not exist \di reference_schema.reference_index_2* List of relations Schema \| Name \| Type \| Owner \| Table ``` Source: https://github.com/citusdata/citus/actions/runs/6707535961/attempts/2#summary-18226879513 Reading from table_desc apparently has an issue: if the schema of one of the items gets deleted while it is being read, we get such an error. This change fixes that by not running multi_reference_table in parallel with citus_local_tables_queries anymore.	2023-11-01 10:12:06 +00:00
Cédric Villemain	37415ef8f5	Allow citus__size on index related to a distributed table (#7271 ) I just enhanced the existing code to check if the relation is an index belonging to a distributed table. If so the shardId is appended to relation (index) name and the _size function are executed as before. There is a change in an extern function: `extern StringInfo GenerateSizeQueryOnMultiplePlacements(...)` It's possible to create a new function and deprecate this one later if compatibility is an issue. Fixes https://github.com/citusdata/citus/issues/6496. DESCRIPTION: Allows using Citus size functions on distributed tables indexes. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-11-01 09:05:51 +00:00
Jelte Fennema-Nio	a76a832553	Fix flaky validate_constraint test (#7293 ) Sometimes validate constraint would fail like this: ```diff validatable_constraint_8000016 \| t (10 rows) DROP TABLE constrained_table; +ERROR: deadlock detected +DETAIL: Process 16602 waits for ShareRowExclusiveLock on relation 56258 of database 16384; blocked by process 16601. +Process 16601 waits for AccessShareLock on relation 56120 of database 16384; blocked by process 16602. +HINT: See server log for query details. DROP TABLE referenced_table CASCADE; DROP TABLE referencing_table; DROP SCHEMA validate_constraint CASCADE; -NOTICE: drop cascades to 3 other objects +NOTICE: drop cascades to 4 other objects DETAIL: drop cascades to type constraint_validity drop cascades to view constraint_validations_in_workers drop cascades to view constraint_validations +drop cascades to table constrained_table SET search_path TO DEFAULT; ``` Source: https://github.com/citusdata/citus/actions/runs/6708383699?pr=7291 This change fixes that by not running together with the foreign_key_to_reference_table test anymore. In passing it also simplifies dropping of the test its resources.	2023-11-01 09:41:28 +01:00
Jelte Fennema-Nio	81aa660b31	Fix flaky test detection (#7291 ) PR #7289 broke flaky test detection. This fixes that.	2023-10-31 15:59:16 +00:00
Gokhan Gulbiz	ce58c04304	Disable CircleCI (#7276 ) We are switching to Github Actions. In the test period it has worked well enough, so now we can stop using CircleCI.	2023-10-31 16:00:10 +01:00
Jelte Fennema-Nio	83e3fb817d	Only put major Postgres version in CI task name (#7289 ) Making tasks in CI required before merging to master is important and useful. The way this works is by saving the exact names of the required tasks in the admin interface of the repo. It has a search box to add them so it's not completely horrible, but doing so is quite a hassle since we have so many jobs. So limiting the amount of churn in this list of required jobs is quite useful. This changes the names of tasks to only include the major versions of Postgres, not the minor ones. Otherwise the next time we bump the minor versions we would have to remove and re-add each of the jobs.	2023-10-31 14:05:09 +01:00
Emel Şimşek	ee8f4bb7e8	Start Maintenance Daemon for Main DB at the server start. (#7254 ) DESCRIPTION: This change starts a maintenance daemon at the time of server start if there is a designated main database. This is the code flow: 1. User designates a main database: `ALTER SYSTEM SET citus.main_db = "myadmindb";` 2. When postmaster starts, in _PG_Init, citus calls `InitializeMaintenanceDaemonForMainDb`. This function registers a background worker to run `CitusMaintenanceDaemonMain` with `databaseOid = 0` 3. `CitusMaintenanceDaemonMain` takes some special actions when databaseOid is 0: - Gets the citus.main_db value. - Connects to the citus.main_db - Now the `MyDatabaseId` is available, creates a hash entry for it. - Then follows the same control flow as for a regular db,	2023-10-30 09:44:13 +03:00
Nils Dijk	d0b093c975	automatically add a breakpoint that breaks on postgres errors (#7279 ) When debugging postgres it is quite hard to get to the source for `errfinish` in `elog.c`. Instead of relying on the developer to set a breakpoint in the `elog.c` file for `errfinish` for `elevel == ERROR`, this change adds the breakpoint to `.gdbinit`. This makes sure that whenever a debugger is attached to a postgres backend it will break on postgres errors. When attaching the debugger a small banner is printed that explains how to disable the breakpoint.	2023-10-27 16:57:51 +02:00
Benjamin O	f9218d9780	Support replacing IPv6 Loopback in `normalize.sed` (#7269 ) I had a test failure issue due to my machine using the IPv6 loopback address. This change to the `normalize.sed` solves that issue.	2023-10-27 16:42:55 +02:00
Gokhan Gulbiz	2bf1472c8e	Move GHA environment variables to workflow file (#7275 ) Since GHA does not interpolate env variables in a matrix context, This PR defines them in a separate job and uses them in other jobs.	2023-10-26 14:54:58 +03:00
Naisila Puka	10198b18e8	Technical readme small fixes (#7261 )	2023-10-23 13:43:43 +03:00
Naisila Puka	1fe16fa746	Remove unnecessary pre-fastpath code (#7262 ) This code was here because we first implemented `fast path planner` via [#2606](https://github.com/citusdata/citus/pull/2606) and then later `deferred pruning` [#3369](https://github.com/citusdata/citus/pull/3369) So, for some years, this code was useful.	2023-10-23 13:01:48 +03:00
zhjwpku	2d1444188c	Fix wrong comments around HasDistributionKey() (#7223 ) HasDistributionKey & HasDistributionKeyCacheEntry returns true when the corresponding table has a distribution key, the comments state the opposite, which should be fixed. Signed-off-by: Zhao Junwang <zhjwpku@gmail.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-10-18 10:53:00 +02:00
Onur Tirtir	db13afaa7b	Fix flaky columnar_create.sql test (#7266 )	2023-10-17 16:58:17 +03:00
Gürkan İndibay	71a4633dad	Fixes typo and renames multi_process_utility (#7259 )	2023-10-17 16:39:37 +03:00
Onur Tirtir	5eaf6c221e	Fix flaky test detection job (#7256 ) We were getting such errors in flaky-test detection job: ``` Unable to process file command 'output' successfully ``` Even though we don't seem to be writing multiple lines to $GITHUB_OUTPUT, this seems to be the right fix. https://docs.github.com/en/actions/using-workflows/workflow-commands-for-github-actions#multiline-strings	2023-10-16 14:20:55 +03:00
Jelte Fennema-Nio	788e09a39a	Add a test for citus_shards where table names have spaces (#7224 ) There was a bug reported for previous versions of Citus where shard\_size was returning NULL for tables with spaces in them. It works fine on the main branch though, but I'm still adding a test for this to the main branch because it seems a good test to have.	2023-10-16 11:38:24 +02:00
Nils Dijk	fb08f9b198	Remove software-properties-common from dev container after use (#7255 ) During the creation of the devcontainer we need to add a ppa repository, which is easiest done via software-properties-common. As it turns out, this installs pkexec into the container as a side effect. When vscode tries to attach a debugger it first checks if pkexec is installed, as this gives a nicer popup asking for elevation of rights to attach to the process. However, since dev containers don't have a windowing system running, pkexec isn't working as expected and thus prevents the debugger from attaching. Without pkexec in the container vscode 'falls back' to plain old sudo, which we can run passwordless in the container. For pkexec to be removed we need to first purge software-properties-common as well as autoremove all packages that were installed due to the installation of said package. By performing this all in one step we minimize the size of the layer we are creating.	2023-10-12 17:47:44 +02:00
Gokhan Gulbiz	e0b0cdbb87	CircleCI to GHA migration (#7154 ) Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2023-10-10 16:58:50 +03:00
Emel Şimşek	e9035f6d32	Send keepalive messages in split decoder periodically to avoid wal receiver timeouts during large shard splits. (#7229 ) DESCRIPTION: Send keepalive messages during the logical replication phase of large shard splits to avoid timeouts. During the logical replication part of the shard split process, split decoder filters out the wal records produced by the initial copy. If the number of wal records is big, then split decoder ends up processing for a long time before sending out any wal records through pgoutput. Hence the wal receiver may time out and restart repeatedly, causing the catch-up logic in our split driver code to fail. Notes: 1. If the wal_receiver_timeout is set to a very small number, e.g. 600ms, it may time out before receiving the keepalives. My tests show that this code works best when the `wal_receiver_timeout` is set to 1 minute, which is the default value. 2. Once a logical replication worker times out, a new one gets launched. The new logical replication worker sets the pg_stat_subscription columns to initial values. E.g. the latest_end_lsn is set to 0. Our driver logic in `WaitForGroupedLogicalRepTargetsToCatchUp` cannot handle the LSN value going backwards. This is the main reason for it to get stuck in the infinite loop.	2023-10-09 22:33:08 +03:00
Nils Dijk	76fdfa3c0f	Add devcontainer for development purposes (#7102 ) This change adds a devcontainer configuration to the Citus project. This devcontainer allows for quick generation of isolated development environments, either local on the machine of a developer or in a cloud, like GitHub Codespaces. The devcontainer is updated automatically by github actions when its configuration changes. For more detailed instructions on how to quickstart the development in a container see CONTRIBUTING.md	2023-10-09 15:37:21 +02:00
Nils Dijk	6d8725efb0	Fix leaking of memory and memory contexts in Foreign Constraint Graphs (#7236 ) DESCRIPTION: Fix leaking of memory and memory contexts in Foreign Constraint Graphs Previously, every time we (re)created the Foreign Constraint Relationship Graph, we created a new Memory Context while losing the reference to the previous context. This old context could still have left over memory in there, causing a memory leak. With this patch we statically have one memory context that we lazily initialize the first time we create our foreign constraint relationship graph. On every subsequent creation, besides destroying our previous hashmap we also reset our memory context to remove any left over references.	2023-10-09 13:05:51 +02:00
Onur Tirtir	858d99be33	Take improvement_threshold into the account in citus_add_rebalance_strategy() (#7247 ) DESCRIPTION: Makes sure to take improvement_threshold into the account in `citus_add_rebalance_strategy()`. Fixes https://github.com/citusdata/citus/issues/7188.	2023-10-09 13:13:08 +03:00
Önder Kalacı	7d6c401dd3	Update technical readme (#7248 ) Fix a wrong query, reported by @naisila	2023-10-06 13:37:37 +03:00
Önder Kalacı	0dca65c84d	Add missing image to Technical Readme (#7243 ) DESCRIPTION: PR description that will go into the change log, up to 78 characters	2023-09-29 22:24:10 +02:00
Önder Kalacı	185ac5e01e	Citus Technical Readme (#7207 ) This commit aims to add a comprehensive guide that covers all essential aspects of Citus, including planning, execution, locking mechanisms, shard moves, 2PC, and many other major components of Citus. Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-09-29 16:50:52 +03:00
dependabot[bot]	c323f49e83	Bump cryptography from 41.0.3 to 41.0.4 in /src/test/regress (#7231 ) Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.3 to 41.0.4. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Nils Dijk <nils@citusdata.com>	2023-09-27 15:36:58 +02:00
Onur Tirtir	27ac44eb2a	Fix mixed Citus upgrade tests (#7218 ) When testing rolling Citus upgrades, coordinator should not be upgraded until we upgrade all the workers. --------- Co-authored-by: Jelte Fennema-Nio <github-tech@jeltef.nl>	2023-09-26 17:52:52 +03:00
Nils Dijk	b87fbcbf79	Shard moves/isolate report LSN's in lsn format (#7227 ) DESCRIPTION: Shard moves/isolate report LSN's in lsn format While investigating an issue with our catchup mechanism on certain postgres versions we noticed we print LSN's in the format of the native long type. This is an uncommon representation for LSN's in postgres logs. This patch changes the output of our log message to go from the long type representation to the native LSN type representation. Making it easier for postgres users to recognize and compare LSN's with other related reports. example of new output: ``` 2023-09-25 17:28:47.544 CEST [11345] LOG: The LSN of the target subscriptions on node localhost:9701 have increased from 0/0 to 0/E1ED20F8 at 2023-09-25 17:28:47.544165+02 where the source LSN is 1/415DCAD0 ```	2023-09-26 13:47:50 +02:00
Gürkan İndibay	7fa109c977	Adds alter user missing features (#7204 ) DESCRIPTION: Adds alter user rename propagation and enriches alter user tests --------- Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2023-09-26 12:28:07 +03:00
Gürkan İndibay	a9d28ca96f	Adds make clean to installation steps (#7052 ) If you make a fresh install make clean is not required. However, if you install before, without a make install, one can get errors --------- Co-authored-by: aykut-bozkurt <51649454+aykut-bozkurt@users.noreply.github.com>	2023-09-25 12:42:23 +03:00
Onur Tirtir	111b4c19bc	Make sure to disallow creating a replicated distributed table concurrently (#7219 ) See explanation in https://github.com/citusdata/citus/issues/7216. Fixes https://github.com/citusdata/citus/issues/7216. DESCRIPTION: Makes sure to disallow creating a replicated distributed table concurrently	2023-09-25 11:14:35 +03:00
Hanefi Onaldi	f72cd7ffd2	Update README.md for Citus 12.1 release (#7214 ) Also remove old customers from the readme	2023-09-22 18:35:33 +03:00
Hanefi Onaldi	01e3c24793	Update url for release blog	2023-09-22 17:47:57 +03:00
Hanefi Onaldi	f17d31fd94	Update PG and Citus versions in readme	2023-09-22 17:47:57 +03:00
Hanefi Onaldi	5926ec8bbb	Fix broken blog link	2023-09-22 17:47:57 +03:00
Teresa Giacomini	ab8a3fab74	Update README.md Update README.md to remove old customers	2023-09-22 17:47:57 +03:00
Nils Dijk	0f28a69f12	Use the $(DLSUFFIX) instead of hard coded extensions for cdc (#7221 ) When cdc got added the makefiles hardcoded the `.so` extension instead of using the platform specific `$(DLSUFFIX)` variable used by `pgxs.mk`. Also don't remove installed cdc artifacts on `make clean`.	2023-09-22 16:24:18 +02:00
aykut-bozkurt	2c190d0689	Fix the changelog entry for citus_pause_node_within_txn() UDF (#7215 )	2023-09-20 16:45:04 +03:00
Jelte Fennema-Nio	71e556e090	Remove useless test output (#7209 ) This was sometimes failing when running locally due to some local shard still existing. This fixes that. We normally silence all `drop schema cascade` output like this anyway to avoid unnecessary diffs when modifying a test later on.	2023-09-19 14:12:46 +02:00
Gürkan İndibay	b0e982d0b5	Removes centos 7 for PG 16 in packaging pipelines (#7205 ) CentOS 7 and Oracle Linux 7 are not supported by Postgres for newer releases, so we get package download errors in packaging pipelines. This PR removes the el/7 and ol/7 Postgres 16 pipelines.	2023-09-19 14:37:35 +03:00
Naisila Puka	4e46708789	Adds PostgreSQL 16.0 Support (#7201 ) This commit concludes PG16.0 Support in Citus. The main PG16 support work has been done for 16beta3 https://github.com/citusdata/citus/pull/6952 There was some extra work needed for 16rc1 https://github.com/citusdata/citus/pull/7173 And this PR yet introduces some extra work needed to 16.0 :) `pgstat_fetch_stat_local_beentry` has been renamed to `pgstat_get_local_beentry_by_index` in PG16.0 Relevant PG commit: `8dfa37b797` 8dfa37b797843a83a5756ea3309055e8953e1a86 Sister PR https://github.com/citusdata/the-process/pull/150	2023-09-15 12:23:04 +03:00
Gürkan İndibay	7c0b289761	Adds alter database set option (#7181 ) DESCRIPTION: Adds support for ALTER DATABASE <db_name> SET .. statement propagation. SET statements in Postgres have a common structure, which is already being used in the ALTER FUNCTION statement. In this PR, I added a util file, citus_setutils, and made it usable for both ALTER DATABASE <db_name> SET .. and ALTER FUNCTION ... SET ... statements. With this PR, the statements below will be propagated: ```sql ALTER DATABASE name SET configuration_parameter { TO \| = } { value \| DEFAULT } ALTER DATABASE name SET configuration_parameter FROM CURRENT ALTER DATABASE name RESET configuration_parameter ALTER DATABASE name RESET ALL ``` Additionally, there was a bug in processing float values in the common code block. I fixed this one as well. Previous: ```C case T_Float: { appendStringInfo(buf, " %s", strVal(value)); break; } ``` Now: ```C case T_Float: { appendStringInfo(buf, " %s", nodeToString(value)); break; } ```	2023-09-14 16:29:16 +03:00
aykut-bozkurt	26dc407f4a	bump citus and columnar into 12.2devel (#7200 )	2023-09-14 12:03:09 +03:00
aykut-bozkurt	9eafd032da	Changelog entries for 12.1.0 (#7194 ) Co-authored-by: naisila <nicypp@gmail.com> Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-09-13 12:12:24 +03:00
Gürkan İndibay	e0683aab84	Removes ubuntu:kinetic pipelines since it's EOL (#7195 ) ubuntu:kinetic is EOL so removing its pipeline https://fridge.ubuntu.com/2023/06/14/ubuntu-22-10-kinetic-kudu-reaches-end-of-life-on-july-20-2023/	2023-09-12 14:56:29 +03:00
Gürkan İndibay	e5e64b7454	Adds alter database propagation - with and refresh collation (#7172 ) DESCRIPTION: Adds ALTER DATABASE WITH ... and REFRESH COLLATION VERSION support. This PR adds propagation support for basic ALTER DATABASE statements. The statements below are supported: ALTER DATABASE <database_name> with IS_TEMPLATE <true/false>; ALTER DATABASE <database_name> with CONNECTION LIMIT <integer_value>; ALTER DATABASE <database_name> REFRESH COLLATION VERSION; --------- Co-authored-by: Jelte Fennema-Nio <jelte.fennema@microsoft.com>	2023-09-12 14:09:15 +03:00
Naisila Puka	1da99f8423	PG16 - Don't propagate GRANT ROLE with INHERIT/SET option (#7190 ) We currently don't support propagating these options in Citus Relevant PG commits: https://github.com/postgres/postgres/commit/e3ce2de https://github.com/postgres/postgres/commit/3d14e17 Limitation: We also need to take care of generated GRANT statements by dependencies in attempt to distribute something else. Specifically, this part of the code in `GenerateGrantRoleStmtsOfRole`: ``` grantRoleStmt->admin_opt = membership->admin_option; ``` In PG16, membership also has `inherit_option` and `set_option` which need to properly be part of the `grantRoleStmt`. We can skip for now since #7164 will take care of this soon, and also this is not an expected use-case.	2023-09-12 12:47:37 +03:00
Naisila Puka	c1dc378504	Fix WITH ADMIN FALSE propagation (#7191 )	2023-09-11 15:58:24 +03:00
Onur Tirtir	d628a4c21a	Add citus_schema_move() function (#7180 ) Add citus_schema_move() that can be used to move tenant tables within a distributed schema to another node. The function has two variations as simple wrappers around citus_move_shard_placement() and citus_move_shard_placement_with_nodeid() respectively. They pick a shard that belongs to the given tenant schema and resolve the source node that contain the shards under given tenant schema. Hence their signatures are quite similar to underlying functions: ```sql -- citus_schema_move(), using target node name and node port CREATE OR REPLACE FUNCTION pg_catalog.citus_schema_move( schema_id regnamespace, target_node_name text, target_node_port integer, shard_transfer_mode citus.shard_transfer_mode default 'auto') RETURNS void LANGUAGE C STRICT AS 'MODULE_PATHNAME', $$citus_schema_move$$; -- citus_schema_move(), using target node id CREATE OR REPLACE FUNCTION pg_catalog.citus_schema_move( schema_id regnamespace, target_node_id integer, shard_transfer_mode citus.shard_transfer_mode default 'auto') RETURNS void LANGUAGE C STRICT AS 'MODULE_PATHNAME', $$citus_schema_move_with_nodeid$$; ```	2023-09-08 12:03:53 +03:00
Naisila Puka	8894c76ec0	PG16 - Add rules option to CREATE COLLATION (#7185 ) Relevant PG commit: https://github.com/postgres/postgres/commit/30a53b7 30a53b7	2023-09-07 13:50:47 +03:00
Naisila Puka	2df88042b3	Add tests with JSON_ARRAYAGG and JSON_OBJECTAGG aggregates (#7186 ) Relevant PG commit: `7081ac46ac` 7081ac46ace8c459966174400b53418683c9fe5c	2023-09-07 13:29:39 +03:00
Naisila Puka	7e5136f2de	Add tests with publications with schema and table of the same schema (#7184 ) Relevant PG commit: https://github.com/postgres/postgres/commit/13a185f 13a185f It was backpatched through PG15 so I added this test in publication.sql instead of pg16.sql	2023-09-06 16:40:36 +03:00
Naisila Puka	b2fc763bc3	PG16 - Add tests with random_normal (#7183 ) Relevant PG commit: https://github.com/postgres/postgres/commit/38d8176	2023-09-06 14:57:24 +03:00
Naisila Puka	5c658b4eb7	PG16 - Add citus_truncate_trigger for Citus foreign tables (#7170 ) Since in PG16, truncate triggers are supported on foreign tables, we add the citus_truncate_trigger to Citus foreign tables as well, such that the TRUNCATE command is propagated to the table's single local shard as well. Note that TRUNCATE command was working for foreign tables even before this commit: see https://github.com/citusdata/citus/pull/7170#issuecomment-1706240593 for details This commit also adds tests with user-enabled truncate triggers on Citus foreign tables: both trigger on the shell table and on its single foreign local shard. Relevant PG commit: https://github.com/postgres/postgres/commit/3b00a94	2023-09-05 19:42:39 +03:00
zhjwpku	205b159606	get rid of {Push/Pop}OverrideSearchPath (#7145 )	2023-09-05 17:40:22 +02:00
aykut-bozkurt	8eb3360017	Fixes visibility problems with dependency propagation (#7028 ) Problem: Previously we always used an outside superuser connection to overcome permission issues for the current user while propagating dependencies. That has mainly 2 problems: 1. Visibility issues during dependency propagation, (metadata connection propagates some objects like a schema, and outside transaction does not see it and tries to create it again) 2. Security issues (it is preferable to use current user's connection instead of extension superuser) Solution (high level): Now, we try to make a smarter decision on whether we should use an outside superuser connection or current user's metadata connection. We prefer using current user's connection if any of the objects, which is already propagated in the current transaction, is a dependency for a target object. We do that since we assume if current user has permissions to create the dependency, then it can most probably propagate the target as well. Our assumption is expected to hold most of the time but it can still be wrong. In those cases, transaction would fail and user should set the GUC `citus.create_object_propagation` to `deferred` to work around it. Solution: 1. We track all objects propagated in the current transaction (we can handle subtransactions), 2. We propagate dependencies via the current user's metadata connection if any dependency is created in the current transaction to address issues listed above. Otherwise, we still use an outside superuser connection. DESCRIPTION: Fixes some object propagation errors seen with transaction blocks. Fixes https://github.com/citusdata/citus/issues/6614 --------- Co-authored-by: Nils Dijk <nils@citusdata.com>	2023-09-05 18:04:16 +03:00
Naisila Puka	9f067731c0	Adds PostgreSQL 16 RC1 support (#7173 )	2023-09-05 14:32:41 +03:00
Emel Şimşek	a849570f3f	Improve the performance of CitusHasBeenLoaded function for a database that does not do CREATE EXTENSION citus but loads citus.so. (#7123 ) For a database that does not create the citus extension by running `CREATE EXTENSION citus;`, the `CitusHasBeenLoaded` function ends up querying the `pg_extension` table every time it is invoked. This is not an ideal situation for such a database. The idea in this PR is as follows: ### A new field in MetadataCache. Add a new variable `extensionCreatedState` of the following type: ``` typedef enum ExtensionCreatedState { UNKNOWN = 0, CREATED = 1, NOTCREATED = 2, } ExtensionCreatedState; ``` When the MetadataCache is invalidated, `extensionCreatedState` will be set to UNKNOWN. ### Invalidate MetadataCache when CREATE/DROP/ALTER EXTENSION citus commands are run. - Register a callback function, named `InvalidateDistRelationCacheCallback`, for relcache invalidation during the shared library initialization for `citus.so`. This callback function is invoked in all the backends whenever the relcache is invalidated in one of the backends. (This could be caused by many DDL operations). - In the cache invalidation callback, `InvalidateDistRelationCacheCallback`, invalidate `MetadataCache` by zeroing it out. - In `CitusHasBeenLoaded`, perform the costly "is citus loaded" check only if the `MetadataCache` is not valid. ### Downsides Any relcache invalidation (caused by various DDL operations) will cause the Citus MetadataCache to get invalidated. Most of the time it will be unnecessary. But we rely on DDL operations on relations not being too frequent.	2023-09-05 13:29:35 +03:00
Hanefi Onaldi	1d540b60fb	Create a new colocation properly after breaking one (#6929 ) When breaking a colocation, we need to create a new colocation group record in pg_dist_colocation for the relation. It is not sufficient to have a new colocationid value in pg_dist_partition only. This patch also fixes a bug when deleting a colocation group if no tables are left in it. Previously we passed a relation id as a parameter to DeleteColocationGroupIfNoTablesBelong function, where we should have passed a colocation id. Fixes: #6928	2023-09-05 11:21:47 +03:00
Hanefi Onaldi	c22547d221	Create a new colocation properly after breaking one When breaking a colocation, we need to create a new colocation group record in pg_dist_colocation for the relation. It is not sufficient to have a new colocationid value in pg_dist_partition only. This patch also fixes a bug when deleting a colocation group if no tables are left in it. Previously we passed a relation id as a parameter to DeleteColocationGroupIfNoTablesBelong function, where we should have passed a colocation id.	2023-09-05 10:58:46 +03:00
Jelte Fennema	bdf085eabb	Add some small improvements to python testing framework (#7159 ) 1. Adds an `sql_row` function, for when a query returns a single row with multiple columns. 2. Include a `notice_handler` for easier debugging 3. Retry dropping replication slots when they are "in use", this is often an ephemeral state and can cause flaky tests	2023-09-05 09:34:56 +02:00
Ivan Vyazmitinov	e94bf93152	#6548 2PC recovery is extremely ineffective on a cluster with multiple DATABASEs fix (#7174 )	2023-09-04 15:28:22 +02:00
Naisila Puka	de9af078b0	PG16 - Add reindex database/system tests (#7167 ) In PG16, REINDEX DATABASE/SYSTEM name is optional. We already don't propagate these commands automatically. Testing here with run_command_on_workers. Relevant PG commit: https://github.com/postgres/postgres/commit/2cbc3c1	2023-09-04 11:31:57 +03:00
Naisila Puka	cf71e80bfd	PG16 - Add tests for createdb with ICU_RULES option (#7161 ) When we create a database, it already needs to be manually created in the workers as well. This new icu_rules option should work as the other options as well. Added a test for that. Relevant PG commit: https://github.com/postgres/postgres/commit/30a53b7	2023-09-04 11:13:46 +03:00
zhjwpku	9fd4ef042f	avoid rebuilding MetadataCache for each placement insertion (#7163 )	2023-09-04 09:57:25 +02:00
zhjwpku	5034f8eba5	polish the codebase by fixing dozens of typos (#7166 )	2023-09-01 12:21:53 +02:00
Naisila Puka	05443a77ad	Adds test for COPY FROM failure in Citus foreign tables (#7160 )	2023-09-01 12:20:07 +03:00
Gürkan İndibay	b8bded6454	Adds citus_pause_node udf (#7089 ) DESCRIPTION: Presenting citus_pause_node UDF enabling pausing by node_id. citus_pause_node takes a node_id parameter and fetches all the shards in that node and puts AccessExclusiveLock on all the shards inside that node. With this lock, insert is disabled, until citus_pause_node transaction is closed. --------- Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2023-09-01 11:39:30 +03:00
Gürkan İndibay	4a1a5491ce	Refactors grant statements (#7153 ) DESCRIPTION: Refactors all grant statements to use common code blocks to deparse	2023-09-01 09:49:46 +03:00
zhjwpku	f03291a8c8	remove useless code block (#7158 )	2023-08-29 17:15:22 +02:00
Naisila Puka	a17fae36b9	Disable statistics collection (#7162 ) Enabled by mistake in `ba40eb363c`	2023-08-29 16:09:19 +03:00
Onur Tirtir	10e20d97db	Not undistribute Citus local table when converting it to a reference table / single-shard table (#7131 ) Replaces https://github.com/citusdata/citus/pull/7120. Closes https://github.com/citusdata/citus/issues/4692. #7120 added the same functionality by implementing a transactional --but scoped to Citus local tables-- version of TransferShards(). It was passing all the regression tests but didn't feel like an intuitive approach. This PR instead adds that functionality via the functions that we use when creating a distributed table, namely, CreateShardsOnWorkers() and CopyLocalDataIntoShards(). We insert entries into pg_dist_placement for the new shard placement(s) and then call CreateShardsOnWorkers() to create those placement(s) on workers. Then we use CopyFromLocalTableIntoDistTable() to copy the data from the local shard placement to the new shard placement(s). CopyFromLocalTableIntoDistTable() is a new function that re-uses the underlying logic of CopyLocalDataIntoShards() that allows copying data from a local table into a distributed table. We tell CopyLocalDataIntoShards() to read from local shard placement table and to write the tuples into shard placement/s of the reference / single-shard table. Before doing this, we temporarily delete metadata record for the local placement to avoid duplicating the data in the local shard placement. Finally, we drop the local shard placement if we were creating a single-shard placement table and that effectively means moving the local shard placement to the appropriate worker as we've already created the new shard placement on the worker. While the main motivation behind adding this functionality is to avoid the limitations when UndistributeTable() is called for a Citus local table (during table conversion), this also optimizes how we convert a Citus local table to a reference table / single-shard table. This is because the prior logic used more disk space due to the duplication of the data during UndistributeTable(). DESCRIPTION: Allow creating reference / distributed-schema tables from local tables added to metadata and that use identity columns - [x] Add tests. - [x] Test django-tenants.	2023-08-29 13:12:07 +03:00
Onur Tirtir	a830862717	Not undistribute Citus local table when converting it to a reference table / single-shard table	2023-08-29 12:57:28 +03:00
Onur Tirtir	34e3119b48	Intersect shard placements in a table type agnostic way If we're in the middle of a table type conversion (such as from Citus local table to a reference table), the table might not have all the placements that we expect from the table type. For this reason, we should intersect the placements of tables at hand when creating inter-shard ddl tasks.	2023-08-29 12:57:28 +03:00
Onur Tirtir	5bdf19f517	Use CopyShardForeignConstraintCommandList in WorkerCreateShardCommandList What we do to collect foreign key constraint commands in WorkerCreateShardCommandList is quite similar to what we do in CopyShardForeignConstraintCommandList. Plus, the code that we used in WorkerCreateShardCommandList before was not able to properly handle foreign key constraints between Citus local tables --when creating a reference table from the referencing one. With a few slight modifications made to CopyShardForeignConstraintCommandList, we can use the same logic in WorkerCreateShardCommandList too.	2023-08-29 12:57:28 +03:00
zhjwpku	d97f786296	PQputCopyData's return value 0 should be considered fail (#7152 )	2023-08-29 11:19:18 +02:00
Onur Tirtir	d5d1684c45	Use correct errorCode for the errors thrown during recovery (#7146 )	2023-08-28 11:03:38 +03:00
Naisila Puka	afab879de3	PG16 - Add COPY FROM default tests (#7143 ) Already supported in Citus, adding the same tests as in PG Relevant PG commit: https://github.com/postgres/postgres/commit/9f8377f	2023-08-24 15:52:09 +03:00
Naisila Puka	70c8aba967	PG16 - Add tests for CREATE/ALTER TABLE .. STORAGE (#7140 ) Relevant PG commits: https://github.com/postgres/postgres/commit/784cedd https://github.com/postgres/postgres/commit/b9424d0	2023-08-24 15:26:40 +03:00
Gürkan İndibay	8d3a06c1c7	Adds grant/revoke privileges on database propagation (#7109 ) DESCRIPTION: Adds grant/revoke propagation support for database privileges Following the implementation of support for granting and revoking database privileges, certain tests that issued grants for worker nodes experienced failures. These ones are fixed in this PR as well.	2023-08-24 14:43:19 +03:00
Gürkan İndibay	553780e3f1	Removes ubuntu/bionic from packaging pipelines (#7142 ) DESCRIPTION: Removes ubuntu/bionic from packaging pipelines Since pg16 beta is not available for ubuntu/bionic and ubuntu/bionic support is EOL, I need to remove this os from pipeline https://ubuntu.com/blog/ubuntu-18-04-eol-for-devices Additionally, added concurrency support for GH Actions Packaging pipeline	2023-08-24 10:30:33 +03:00
Naisila Puka	b8c493f2c4	PG16 - Add GENERIC_PLAN option to EXPLAIN (#7141 )	2023-08-23 20:15:54 +03:00
Naisila Puka	c73ef405f5	PG16 - IS JSON predicate and SYSTEM_USER tests (#7137 ) Support the IS JSON predicate Relevant PG commit: https://github.com/postgres/postgres/commit/6ee30209 SYSTEM_USER Relevant PG commit: https://github.com/postgres/postgres/commit/0823d061	2023-08-23 14:13:56 +03:00
Marco Slot	ba55fd67d7	Rename planner_readme.md to README.md (#7139 )	2023-08-23 13:47:18 +03:00
Naisila Puka	36b51d617c	PG16 - Throw meaningful error for stats without a name on Citus tables (#7136 ) Relevant PG commit: `624aa2a13b` 624aa2a13bd02dd584bb0995c883b5b93b2152df	2023-08-23 10:25:01 +03:00
Gürkan İndibay	371f094b68	Removes pg_send_cancellation (#7135 ) DESCRIPTION: Removes pg_send_cancellation and all references	2023-08-21 17:29:44 +03:00
zhjwpku	ba2a0aec16	fix some obvious typo and reduce usage of magic number (#7130 ) fix some obvious typo and reduce usage of magic number Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2023-08-18 14:50:20 +00:00
Naisila Puka	682dca1f12	Adds PG16Beta3 support (#6952 ) DESCRIPTION: Adds PG16Beta3 support This is the final commit that adds PG16 compatibility with Citus's current features. You can use Citus community with PG16Beta3. This commit: - Enables PG16 in the configure script. - Adds PG16 tests to CI using test images that have 16beta3 - Skips wal2json cdc test since wal2json package is not available for PG16 yet - Fixes an isolation test Several PG16 Compatibility commits have been merged before this final one. All these subtasks are done https://github.com/citusdata/citus/issues/7017 See the list below: 1 - `42d956888d` Resolve compilation issues 2 - `0d503dd5ac` Ruleutils and successful CREATE EXTENSION 3 - `907d72e60d` Some test outputs 4 - `7c6b4ce103` Outer join checks, subscription password, crash fixes 5 - `6056cb2c29` get_relation_info hook to avoid crash from adjusted partitioning 6 - `b36c431abb` Rework PlannedStmt and Query's Permission Info 7 - `ee3153fe50` More test output fixes 8 - `2c50b5f7ff` varnullingrels additions 9 - `b2291374b4` More test output fixes 10- `a2315fdc67` New options to vacuum and analyze 11- `9fa72545e2` Fix AM dependency and grant's admin option 12- `2d6cf8e79a` One more outer join check Stay tuned for PG16 new features in Citus :)	2023-08-17 21:02:59 +03:00
Naisila Puka	2d6cf8e79a	PG16 compatibility - one more outer join check (#7126 ) PG16 compatibility - part 12 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` part 10 `a2315fdc67` part 11 `9fa72545e2` This commit is in the series of PG16 compatibility commits. We already took care of the majority of necessary outer join checks in part 4 `7c6b4ce103` However, in RelationInfoContainsOnlyRecurringTuples, we need to add one more check of whether we are dealing with an outer join RTE using IsRelOptOuterJoin function. This prevents an outer join crash in sqlancer_failures.sql test. We expect one more commit for PG16 compatibility with Citus's current features: regression tests sanity.	2023-08-17 19:07:18 +03:00
zhjwpku	b10320be6f	fix wrong type conversion (#7116 ) partitionMethod and replicationModel are both of type char, so it seems meaningless to convert them to type Oid implicitly.	2023-08-17 13:53:43 +02:00
Naisila Puka	a5ce601c07	Bump PG14 and PG15 versions for CI tests (#7111 ) Postgres got minor updates on Aug10, this commit starts using the images with the latest version for our tests, namely 14.9 and 15.4. Depends on https://github.com/citusdata/the-process/pull/147 For CI images, we needed to regenerate Pipfile.lock, mainly because of an issue with pyyaml version: https://github.com/yaml/pyyaml/issues/601 We also needed to remove a failing test in subquery_local_tables.sql. Relevant PG commit: `b0e390e6d1` b0e390e6d1d68b92e9983840941f8f6d9e083fe0 Issue: https://github.com/citusdata/citus/issues/7119 For joins where consider_join_pushdown is false, we cannot get the information that we used to get, which prevents doing the distributed planning. Team already contacted PG committers for this. Until then, we remove the test from the schedule.	2023-08-17 11:53:19 +03:00
Naisila Puka	9fa72545e2	PG16 compatibility - fix AM dependency and grant's admin option (#7113 ) PG16 compatibility - part 11 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` part 10 `a2315fdc67` This commit is in the series of PG16 compatibility commits. It fixes AM dependency and grant's admin option: - Fix with admin option in grants grantstmt->admin_opt no longer exists in PG16 instead, grantstmt has a list of options, one of them is admin option. Relevant PG commit: `e3ce2de09d` e3ce2de09d814f8770b2e3b3c152b7671bcdb83f - Fix pg_depend entry to AMs after ALTER TABLE .. SET ACCESS METHOD Relevant PG commit: `97d8910104` 97d89101045fac8cb36f4ef6c08526ea0841a596 More PG16 compatibility commits are coming soon: We are very close to merging "PG16Beta3 Support - Regression tests sanity"	2023-08-17 11:22:34 +03:00
Naisila Puka	71c475af52	Fix GetUndistributableDependency (#7124 ) This is a leftover task from merging enterprise to community. Roles are distributed in community now, the comment is stale and the check is redundant.	2023-08-17 10:57:22 +03:00
Naisila Puka	a2315fdc67	PG16 compatibility - new options to vacuum and analyze (#7114 ) PG16 compatibility - part 10 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` part 9 `b2291374b4` This commit is in the series of PG16 compatibility commits. It: - Adds buffer_usage_limit to vacuum and analyze - Adds process_main, skip_database_stats, only_database_stats to vacuum Important Note: adding these options is actually required for check-vanilla tests to succeed. However, in concept, this PR belongs to "PG16 new features", rather than "PG16 regression tests sanity" Relevant PG commits: `1cbbee0338` 1cbbee03385763b066ae3961fc61f2cd01a0d0d7 `4211fbd841` 4211fbd8413b26e0abedbe4338aa7cda2cd469b4 `a46a7011b2` a46a7011b27188af526047a111969f257aaf4db8 More PG16 compatibility commits are coming soon ...	2023-08-16 16:18:28 +03:00
Naisila Puka	b982f2dee6	Changes PROCESS_TOAST default value to true (#7122 ) Process toast should be true by default, like in PG.	2023-08-16 14:40:24 +03:00
Naisila Puka	b2291374b4	PG16 compatibility - more test output fixes (#7112 ) PG16 compatibility - part 9 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` part 8 `2c50b5f7ff` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - Fix multi_subquery_in_where_reference_clause test somehow PG got rid of the outer join (e.g., explain doesn't show outer joins), hence we can pushdown the subquery. Changing to users_reference_table - Fix unqualified column names for views in PG16 Relevant PG commit: `47bb9db759` 47bb9db75996232ea71fc1e1888ffb0e70579b54 - Fix global_cancel test Error wording and detail changed Relevant PG commit: `2631ebab7b` 2631ebab7b18bdc079fd86107c47d6104a6b3c6e - Fix local_table_join_test with lateral subquery Possible relevant PG commit: `ae89129aa3` ae89129aa3555c263b8c3ccc4c0f1ef7e46201aa I removed the where clause and the limit count error was hit again. With the where clause the query unexpectedly works. - Fix test outputs Relevant PG commits: -- `1349d2790b` -- `f4c7c410ee` For multi_explain and multi_complex_count_distinct there were too many places touched so I just added an alternative test output. For the other tests I modified the problematic parts. More PG16 compatibility commits are coming soon ...	2023-08-15 13:49:25 +03:00
Naisila Puka	2c50b5f7ff	PG16 compatibility - varnullingrels additions (#7107 ) PG16 compatibility - part 8 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` part 7 `ee3153fe50` This commit is in the series of PG16 compatibility commits. PG16 introduced a new entry varnullingrels to Var, which represents our partkey in pg_dist_partition. This commit does the necessary changes in Citus to support this. Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-15 13:07:55 +03:00
Naisila Puka	ee3153fe50	PG16 compatibility - more test output fixes (#7108 ) PG16 compatibility - part 7 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` part 6 `b36c431abb` This commit is in the series of PG16 compatibility commits. It makes some changes to our tests in order to be compatible with the following in PG16: - PG16 removed logic for converting a table to a view Relevant PG commit: `b23cd185fd` b23cd185fd5410e5204683933f848d4583e34b35 - Fix changed error message in certificate verification Relevant PG commit: `8eda731465` 8eda7314652703a2ae30d6c4a69c378f6813a7f2 - Fix backend type order in tests Relevant PG commit: `0c679464a8` 0c679464a837079acc75ff1d45eaa83f79e05690 - Reduce log level to omit extra NOTICE in create collation in PG16 Relevant PG commit: `a14e75eb0b` a14e75eb0b6a73821e0d66c0d407372ec8376105 That commit made LOCALE parameter apply regardless of the provider used, and it printed the following notice: NOTICE: using standard form "und-u-ks-level2" for ICU locale "@colStrength=secondary" We omit this notice to omit output change between pg versions. - Fix columnar_memory test TopMemoryContext now has more children contexts Possible relevant PG commit: `9d3ebba729` 9d3ebba729ebaf5882a92f0f5f662a3312037605 memusage is now around 8.5 MB, whereas it was less than 8MB before. To avoid differences between PG versions, I changed the test to compare to less than 9 MB. It still reflects very well the improvement from 28MB. - Alternative test output for GRANTOR values in pg_auth_members grantor changed in PG16 Relevant PG commit: `ce6b672e44` ce6b672e4455820a0348214be0da1a024c3f619f - Remove redundant grouping columns from our tests Relevant PG commit: `8d83a5d0a2` 8d83a5d0a2673174dc478e707de1f502935391a5 - Fix tests with different order in Filters Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-09 18:04:32 +03:00
Naisila Puka	b36c431abb	PG16 compatibility - Rework PlannedStmt and Query's Permission Info (#7098 ) PG16 compatibility - Part 6 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` part 5 `6056cb2c29` This commit is in the series of PG16 compatibility commits. It handles the Permission Info changes in PG16. See below: The main issue lies in the following entries of PlannedStmt: { rtable permInfos } Each rtable has an int perminfoindex, and its actual permission info is obtained through the following: permInfos[perminfoindex] We had crashes because perminfoindexes were not updated in the finalized planned statement after distributed planner hook. So, basically, everywhere we set a query's or planned statement's rtable entry, we need to set the rteperminfos/permInfos accordingly. Relevant PG commits: `a61b1f7482` a61b1f74823c9c4f79c95226a461f1e7a367764b `b803b7d132` b803b7d132e3505ab77c29acf91f3d1caa298f95 More PG16 compatibility commits are coming soon ...	2023-08-09 15:23:00 +03:00
Naisila Puka	6056cb2c29	PG16 compatibility - get_relation_info hook to avoid crash from adjusted partitioning (#7099 ) PG16 compatibility - Part 5 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` part 4 `7c6b4ce103` This commit is in the series of PG16 compatibility commits. Find the explanation below: If we allow to adjust partitioning, we get a crash when accessing amcostestimate of partitioned indexes, because amcostestimate is NULL for them. The following PG commit is the culprit: `3c569049b7` 3c569049b7b502bb4952483d19ce622ff0af5fd6 Previously, partitioned indexes would just be ignored. Now, they are added in the list. However get_relation_info expects the tables which have partitioned indexes to have the inh flag set properly. AdjustPartitioningForDistributedPlanning plays with that flag, hence we don't get the desired behaviour. The hook is simply removing all partitioned indexes from the list. More PG16 compatibility commits are coming soon ...	2023-08-08 15:51:21 +03:00
Naisila Puka	7c6b4ce103	PG16 compatibility - outer join checks, subscription password, crash fixes (#7097 ) PG16 compatibility - Part 4 Check out part 1 `42d956888d` part 2 `0d503dd5ac` part 3 `907d72e60d` This commit is in the series of PG16 compatibility commits. It adds some outer join checks to the planner, the new password_required option to the subscription, and a crash fix related to PGIOAlignedBlock, see below for more details: - Fix PGIOAlignedBlock Assert crash in PG16 Relevant PG commit: `faeedbcefd` faeedbcefd40bfdf314e048c425b6d9208896d90 - Pass planner info as argument to make_simple_restrictinfo Pre PG16 passing plannerInfo to make_simple_restrictinfo was only needed for placeholder Vars, which is not the case in this part of the codebase because we are building the expression from shard intervals which don't have placeholder vars. However, PG16 is counting baserels appearing in clause_relids and is deleting the rels mentioned in plannerinfo->outer_join_rels Hence directly accessing plannerinfo. We will crash if we leave it as NULL. For reference `2489d76c49 (diff-e045c41eda9686451a7993e91518e40056b3739365e39eb1b70ae438dc1f7c76R207)` Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d - Add outer join checks, root->simple_rel_array - fix rebalancer to include password_required option Relevant PG commit: `c3afe8cf5a` c3afe8cf5a1e465bd71e48e4bc717f5bfdc7a7d6 More PG16 compatibility commits are coming soon ...	2023-08-04 14:51:28 +03:00
Naisila Puka	907d72e60d	PG16 compatibility - some test outputs (#7100 ) PG16 compatibility - Part 3 Check out part 1 `42d956888d` and part 2 `0d503dd5ac` This commit is in the series of PG compatibility. It makes some changes to our tests in order to be compatible with the following in PG16: Use debug_parallel_query in PG16+, force_parallel_mode otherwise Relevant PG commit `5352ca22e0` 5352ca22e0012d48055453ca9992a9515d811291 HINT changed to DETAIL in PG16 Relevant PG commit: `56d0ed3b75` 56d0ed3b756b2e3799a7bbc0ac89bc7657ca2c33 Fix removed read-only server setting lc_collate Relevant PG commit: `b0f6c43716` b0f6c437160db640d4ea3e49398ebc3ba39d1982 Fix unsupported join alias expression in sqlancer_failures Relevant PG commit: `2489d76c49` 2489d76c4906f4461a364ca8ad7e0751ead8aa0d More PG16 compatibility commits are coming soon ...	2023-08-04 13:03:15 +03:00
Önder Kalacı	4ae3982d14	Add single-shard router Merge command support (#7088 ) Similar to https://github.com/citusdata/citus/pull/7077. As PG 16+ has changed the join restriction information for certain outer joins, MERGE is also impacted given that it is also an outer join under the hood. See #7077 for the details.	2023-08-04 08:16:29 +03:00
Naisila Puka	0d503dd5ac	PG16 compatibility: ruleutils and successful CREATE EXTENSION (#7087 ) PG16 compatibility - Part 2 Part 1 provided successful compilation against pg16beta2. `42d956888d` This PR provides ruleutils changes with pg16beta2 and successful CREATE EXTENSION command. Note that more changes are needed in order to have successful regression tests. More commits are coming soon ... For any_value changes, I referred to this commit `8ef94dc1f5` where we did something similar for PG14 support.	2023-08-02 16:04:51 +03:00
Önder Kalacı	960a5f6104	Improve failure handling of distributed execution (#7090 ) Prior to this commit, the code would skip processing the errors that happened for local commands. Prior to https://github.com/citusdata/citus/pull/5379, it might have made sense to allow the execution to continue. But, as of today, if a modification fails on any placement, we can safely fail the execution. The first commit shows the problem in action. The second commit includes the fix and the test fixes.	2023-08-01 16:47:59 +03:00
Onur Tirtir	dd6ea1ebd5	Makes sure to handle NULL constraints for ADD COLUMN commands (#7093 ) DESCRIPTION: Fixes a bug that causes an unexpected error when adding a column with a NULL constraint Fixes https://github.com/citusdata/citus/issues/7092.	2023-08-01 11:07:47 +03:00
Önder Kalacı	cb5eb73048	Add support for router INSERT .. SELECT commands (#7077 ) Traditionally our planner works in the following order: router -> pushdown -> repartition -> pull to coordinator However, for INSERT .. SELECT commands, we did not support "router". In practice, that is not a big issue, because pushdown planning can handle router case as well. However, with PG 16, certain outer joins are converted to JOIN without any conditions (e.g., JOIN .. ON (true)) and the filters are pushed down to the tables. When the filters are pushed down to the tables, the router planner can detect them. However, the pushdown planner relies on JOIN conditions. An example query: ``` INSERT INTO agg_events (user_id) SELECT raw_events_first.user_id FROM raw_events_first LEFT JOIN raw_events_second ON raw_events_first.user_id = raw_events_second.user_id WHERE raw_events_first.user_id = 10; ``` As a side effect of this change, now we can also relax certain limitations that the "pushdown" planner imposes, but not "router". So, with this PR, we also allow those. Closes https://github.com/citusdata/citus/pull/6772 DESCRIPTION: Prevents unnecessarily pulling the data into coordinator for some INSERT .. SELECT queries that target a single-shard group	2023-07-28 15:07:20 +03:00
Teja Mupparti	846cbc3a39	In the MERGE join clause, there is a datatype mismatch between target's distribution column and the expression originating from the source. If the types are different, Citus uses different hash functions for the two column types, which might lead to incorrect repartitioning of the result data	2023-07-27 16:06:00 -07:00
Nils Dijk	186804c119	fix flappyness of shard_rebalancer operations test (#7083 ) Fixes flappyness where the order of shards was dependent on the physical layout in the heap. Failed here https://app.circleci.com/pipelines/github/citusdata/citus/33844/workflows/1651f8f5-6e6a-457e-9d35-34b8788ea6d1/jobs/1189836 ```diff --- /home/circleci/project/src/test/regress/expected/shard_rebalancer.out.modified 2023-07-24 12:51:27.126284675 +0000 +++ /home/circleci/project/src/test/regress/results/shard_rebalancer.out.modified 2023-07-24 12:51:27.170285079 +0000 @@ -2571,24 +2571,24 @@ CREATE TABLE test_with_all_shards_excluded(a int PRIMARY KEY); SELECT create_distributed_table('test_with_all_shards_excluded', 'a', colocate_with:='none', shard_count:=4); create_distributed_table -------------------------- (1 row) SELECT shardid FROM pg_dist_shard; shardid --------- - 433504 433505 433506 433507 + 433504 (4 rows) SELECT rebalance_table_shards('test_with_all_shards_excluded', excluded_shard_list:='{102073, 102074, 102075, 102076}'); rebalance_table_shards ------------------------ (1 row) DROP TABLE test_with_all_shards_excluded; SET citus.shard_count TO 2; ```	2023-07-27 16:24:35 +02:00
Carol Smith	df86a91393	Rename CODEOFCONDUCT.MD to CODE_OF_CONDUCT.md	2023-07-25 08:18:22 -07:00
Carol Smith	a42f58c7c4	Create CODEOFCONDUCT.MD Adding Code of Conduct file to /citus repo reflecting the Microsoft Open Source Code of Conduct.	2023-07-25 08:18:22 -07:00
zhjwpku	6a00517312	[typo] fix typo in comments (#7073 ) %s/pg_dist_local_node_group/pg_dist_local_group/g Signed-off-by: Zhao Junwang <zhjwpku@gmail.com>	2023-07-25 16:43:55 +03:00
Önder Kalacı	862dae823e	Expand EnableNonColocatedRouterQueryPushdown to cover shard colocation (e.g., shard index) (#7076 ) Previously, we only checked whether the relations are colocated, but we ignored the shard indexes. That caused certain queries to still be accidentally planned as router queries. We should enforce colocation checks for both shard index and table colocation id to make the check restrictive enough. For example, the following query should not be router, and after this patch, it won't: ```SQL SELECT user_id FROM ((SELECT user_id FROM raw_events_first WHERE user_id = 15) EXCEPT (SELECT user_id FROM raw_events_second where user_id = 17)) as foo; ``` DESCRIPTION: Enforce shard level colocation with citus.enable_non_colocated_router_query_pushdown	2023-07-25 16:20:13 +03:00
ahmet gedemenli	3f11139b5c	Do not move a shard to a node that it already exists on	2023-07-25 13:38:33 +03:00
ahmet gedemenli	c968dc9c27	Do not rebalance if replication factor is greater than the node count	2023-07-25 13:38:33 +03:00
Nils Dijk	c2f46f0f3f	Update README.md - slack badge (#7075 ) Use a badge for slack again, although no member count, still better compared to the text.	2023-07-24 14:48:49 +02:00
Gürkan İndibay	3f0e1efb5a	Fixes error suppressions in packaging pipelines (#7054 ) Four errors arose recently, and this PR fixes them. Problems and fixes are as below: 1. When executing the make step in the packaging pipeline, if it fails, we cannot detect it since there are additional operations after make in one line. With this fix, if an error occurs after make execution, we can detect it and see the step red and failed, 2. Recently we started to get the error ` fatal: detected dubious ownership in repository at '/__w/citus/citus' ` as below https://github.com/citusdata/citus/actions/runs/5542692968/jobs/10117706723#step:7:9 There is a fix for that one as well. 3. Fixed the requirements issue that arose related to the urllib3 library version 4. Getting errors with the centos-8 docker image with the new postgres-dev packages. Now, replaced the centos-8 image with almalinux-8 and it works	2023-07-24 14:44:27 +03:00
Carol Smith	da7dd1cc54	Update README.md Adding code of conduct language to README doc.	2023-07-21 17:10:45 -07:00
Naisila Puka	42d956888d	PG16 compatibility: Resolve compilation issues (#7005 ) This PR provides successful compilation against PG16Beta2. It does some necessary refactoring to prepare for full support of version 16, in https://github.com/citusdata/citus/pull/6952 . Change RelFileNode to RelFileNumber or RelFileLocator Relevant PG commit b0a55e43299c4ea2a9a8c757f9c26352407d0ccc new header for varatt.h Relevant PG commit: d952373a987bad331c0e499463159dd142ced1ef drop support for Abs, use fabs Relevant PG commit 357cfefb09115292cfb98d504199e6df8201c957 tuplesort PGcommit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Relevant PG commit: d37aa3d35832afde94e100c4d2a9618b3eb76472 Fix vacuum in columnar Relevant PG commit: 4ce3afb82ecfbf64d4f6247e725004e1da30f47c older one: b6074846cebc33d752f1d9a66e5a9932f21ad177 Add alloc_flags to pg_clean_ascii Relevant PG commit: 45b1a67a0fcb3f1588df596431871de4c93cb76f Merge GetNumConfigOptions() into get_guc_variables() Relevant PG commit: 3057465acfbea2f3dd7a914a1478064022c6eecd Minor PG refactor PG_FUNCNAME_MACRO __func__ Relevant PG commit 320f92b744b44f961e5d56f5f21de003e8027a7f Pass NULL context to stringToQualifiedNameList, typeStringToTypeName The pre-PG16 error behaviour for the following stringToQualifiedNameList & typeStringToTypeName was ereport(ERROR, ...) Now with PG16 we have this context input. We preserve the same behaviour by passing a NULL context, because of the following: (copy paste comment from PG16) If "context" isn't an ErrorSaveContext node, this behaves as errstart(ERROR, domain), and the errsave() macro ends up acting exactly like ereport(ERROR, ...). 
Relevant PG commit 858e776c84f48841e7e16fba7b690b76e54f3675 Use RangeVarCallbackMaintainsTable instead of RangeVarCallbackOwnsTable Relevant PG commit: 60684dd834a222fefedd49b19d1f0a6189c1632e FIX THIS: Not implemented grant-level control of role inheritance see PG commit e3ce2de09d814f8770b2e3b3c152b7671bcdb83f Make Scan node abstract PG commit: 8c73c11a0d39049de2c1f400d8765a0eb21f5228 Change in Var representations, get_relids_in_jointree PG commit 2489d76c4906f4461a364ca8ad7e0751ead8aa0d Deadlock detection changes because SHM_QUEUE is removed Relevant PG Commit: d137cb52cb7fd44a3f24f3c750fbf7924a4e9532 TU_UpdateIndexes Relevant PG commit 19d8e2308bc51ec4ab993ce90077342c915dd116 Use object_ownercheck and object_aclcheck functions Relevant PG commits: afbfc02983f86c4d71825efa6befd547fe81a926 c727f511bd7bf3c58063737bcf7a8f331346f253 Rework Permission Info for successful compilation Relevant PG commits: postgres/postgres@a61b1f7 postgres/postgres@b803b7d --------- Co-authored-by: onderkalaci <onderkalaci@gmail.com>	2023-07-21 14:32:37 +03:00
Naisila Puka	a282953274	Fix ScanKeyInit RegProcedure and Datum arguments (#7072 ) Index scans in PG16 return empty sets because of extra compatibility enforcement for `ScanKeyInit` arguments. Could be one of the relevant PG commits: `c8b2ef05f4` This PR fixes all incompatible `RegProcedure` and `Datum` arguments in all `ScanKeyInit` functions used throughout the codebase. Helpful for https://github.com/citusdata/citus/pull/6952	2023-07-21 14:11:10 +03:00
Teja Mupparti	87dc88f837	Isolate schema sharding/MERGE tests into a new file, and use the new GUC parameter	2023-07-19 12:23:45 -07:00
mulander	6498e1eb6c	Fix typo in distributed (#7069 ) DESCRIPTION: PR description that will go into the change log, up to 78 characters	2023-07-18 21:02:09 +02:00
aykut-bozkurt	832fc4a8f0	readme update for 12.0 (#7068 )	2023-07-18 20:09:27 +03:00
Nils Dijk	96a3d82e13	Update slack link in README.md for self-serve signup (#7058 ) The link in our readme directly goes to our channel, meaning people finding the link here for the first time are unable to join slack this way. Given that the target audience using this link is most likely not part of the slack channel yet it would be better to link to our self serve signup flow at slack.citusdata.com, which is the same we use on citusdata.com. From simple testing you should still get redirected to the channel if you are already joined and signed in.	2023-07-17 12:59:46 +02:00
Halil Ozan Akgül	c99a93ffa7	Move SQL file changes for citus_shard_sizes fixes into the new 11.3-2 version (#7050 ) This PR moves `citus_shard_sizes` changes from #7003 and #7018 into a new Citus version, 11.3-2	2023-07-14 17:19:54 +03:00
aykut-bozkurt	609a5465ea	Bump Citus version into 12.1devel (#7061 )	2023-07-14 13:12:30 +03:00
Gürkan İndibay	0f0b60c29c	Fix format attribute and IsLocalReplicationOriginSessionActive errors (#7055 ) This PR fixes the following: - in oraclelinux-7 `Make` step ``` /usr/bin/ld: utils/replication_origin_session_utils.o: relocation R_X86_64_PC32 against undefined symbol `IsLocalReplicationOriginSessionActive' can not be used when making a shared object; recompile with -fPIC /usr/bin/ld: final link failed: Bad value collect2: error: ld returned 1 exit status ``` `IsLocalReplicationOriginSessionActive` function has improper inline declaration, fixed that - in centos-7 `Make` step ``` utils/background_jobs.c: In function 'StartCitusBackgroundTaskExecutor': utils/background_jobs.c:1746:6: warning: function might be possible candidate for 'gnu_printf' format attribute [-Wsuggest-attribute=format] database, user, jobId, taskId); ^ ``` should use `pg_attribute_printf(3,4)` instead of `pg_attribute_printf(3,0)` since the number of arguments varies for `SafeSnprintf(char str, rsize_t count, const char fmt, ...)` --------- Co-authored-by: naisila <nicypp@gmail.com>	2023-07-13 17:41:57 +03:00
aykut-bozkurt	ee255cd46e	Changelog entries for 12.0.0 (#7049 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Gokhan Gulbiz <ggulbiz@gmail.com>	2023-07-13 14:46:58 +03:00
Onur Tirtir	2c11e4d7f9	Deparse ALTER TABLE commands if ADD COLUMN is the only subcommand (#7032 ) Some clients send ALTER TABLE .. ADD COLUMN .. commands together with some other DDLs and this makes it impossible to directly send the original DDL command to the workers. For this reason, this commit adds support for deparsing such ALTER TABLE commands so that we can avoid from directly sending the original one to the workers. Partially fixes https://github.com/citusdata/citus/issues/690. Fixes #3678	2023-07-12 18:28:45 +03:00
Onur Tirtir	f3cdb6d1bf	Deparse ALTER TABLE commands if ADD COLUMN is the only subcommand And stabilize multi_alter_table_statements.sql.	2023-07-12 18:17:47 +03:00
Onur Tirtir	6365f47b57	Properly handle index storage options for ADD CONSTRAINT / COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	ae142e1764	Properly handle IF NOT EXISTS for ADD COLUMN	2023-07-11 17:42:43 +03:00
Onur Tirtir	d4789a2c3a	Stabilize test helper sql files multi_test_helpers is run in parallel with others, so need to stabilize other test helpers too to make multi_test_helpers runnable multiple times.	2023-07-06 10:47:41 +03:00
Onur Tirtir	001437bdfe	Refactor AppendAlterTableCmdAddConstraint to reuse it for ADD COLUMN too	2023-07-06 10:47:41 +03:00
Onur Tirtir	56f1daa800	Refactor the code that extends constraint/index names on shards into a func	2023-07-06 10:47:41 +03:00
Onur Tirtir	ba1ea9b5bd	Refactor the code that prepares constraint objects in an alter table stmt into a func	2023-07-06 10:47:41 +03:00
Halil Ozan Akgül	613cced1ae	Use citus_shard_sizes in citus_tables (#7018 ) Fixes #7019 This PR updates citus_tables view to use citus_shard_sizes function, instead of citus_total_relation_size to improve performance.	2023-07-05 11:40:34 +03:00
aykut-bozkurt	719d92c8b9	mat view should not be converted to tenant table (#7043 ) We allow materialized views to exist in a distributed schema, but we should not try to convert them to tenant tables since they cannot be distributed. Fixes https://github.com/citusdata/citus/issues/7041	2023-07-04 17:28:03 +03:00
Ahmet Gedemenli	5051be86ff	Skip distributed schema insertion into pg_dist_schema, if already exists (#7044 ) Inserting into `pg_dist_schema` causes unexpected duplicate key errors, for distributed schemas that already exist. With this commit we skip the insertion if the schema already exists in `pg_dist_schema`. The error: ```sql SET citus.enable_schema_based_sharding TO ON; CREATE SCHEMA sc2; CREATE SCHEMA IF NOT EXISTS sc2; NOTICE: schema "sc2" already exists, skipping ERROR: duplicate key value violates unique constraint "pg_dist_schema_pkey" DETAIL: Key (schemaid)=(17294) already exists. ``` fixes: #7042	2023-07-04 15:19:07 +03:00
Gokhan Gulbiz	e0d3476526	Add locking mechanism for tenant monitoring probabilistic approach (#7026 ) This PR * Addresses a concurrency issue in the probabilistic approach of tenant monitoring by acquiring a shared lock for tenant existence checks. * Changes `citus.stat_tenants_sample_rate_for_new_tenants` type to double * Renames `citus.stat_tenants_sample_rate_for_new_tenants` to `citus.stat_tenants_untracked_sample_rate`	2023-07-03 13:08:03 +03:00
Jelte Fennema	ac24e11986	Change default rebalance strategy to by_disk_size (#7033 ) DESCRIPTION: Change default rebalance strategy to by_disk_size When introducing rebalancing by disk size we didn't make it the default initially. The main reason was, because we expected some problems with it. We have indeed had some problems/bugs with it over the years, and have fixed all of them. By now we're quite confident in its stability, and that it pretty much always gives better results than by_shard_count. So this PR makes by_disk_size the new default. We don't change the default when some other strategy than by_shard_count is the current default. This is in case someone defined their own rebalance strategy and marked this as the default themselves. Note: It explicitly does nothing during a downgrade, because there's no way of knowing if the rebalance strategy before the upgrade was by_disk_size or by_shard_count. And even in previous versions by_disk_size is considered superior for quite some time.	2023-07-03 11:08:24 +02:00
Jelte Fennema	fd1427de2c	Change by_disk_size rebalance strategy to have a base size (#7035 ) One problem with rebalancing by disk size is that shards in newly created colocation groups are considered extremely small. This can easily result in bad balances if there are some other colocation groups that do have some data. One extremely bad example of this is: 1. You have 2 workers 2. Both contain about 100GB of data, but there's a 70MB difference. 3. You create 100 new distributed schemas with a few empty tables in them 4. You run the rebalancer 5. Now all new distributed schemas are placed on the node that had 70MB less. 6. You start loading some data in these shards and quickly the balance is completely off To address this edge case, this PR changes the by_disk_size rebalance strategy to add a base size of 100MB to the actual size of each shard group. This can still result in a bad balance when shard groups are empty, but it solves some of the worst cases.	2023-06-27 16:37:09 +02:00
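The worst-case example in the commit above can be reproduced with a small simulation. This is only a sketch: the greedy "place on the least-loaded node" loop and the names `place_new_groups` and `BASE_SIZE_MB` are hypothetical stand-ins, not the Citus rebalancer. Only the 100MB base size, the two workers, the 70MB difference and the 100 empty shard groups come from the commit message.

```python
BASE_SIZE_MB = 100  # base size per shard group, as stated in the commit message


def place_new_groups(node_sizes_mb, group_sizes_mb, base_size_mb):
    """Greedily place each shard group on the currently least-loaded node.

    Hypothetical placement rule used for illustration only; it is not the
    algorithm Citus uses. Returns how many groups landed on each node.
    """
    loads = list(node_sizes_mb)
    counts = [0] * len(loads)
    for size in group_sizes_mb:
        target = loads.index(min(loads))    # least-loaded node so far
        loads[target] += size + base_size_mb  # cost = actual size + base size
        counts[target] += 1
    return counts


nodes = [100_000, 100_070]  # two workers with ~100GB each, 70MB apart
new_groups = [0] * 100      # 100 new, still empty shard groups

without_base = place_new_groups(nodes, new_groups, 0)
with_base = place_new_groups(nodes, new_groups, BASE_SIZE_MB)

print("without base size:", without_base)
print("with base size:   ", with_base)
```

With a zero base size every empty group looks free, so the smaller node keeps winning; with the base size each placement raises the node's cost and the groups alternate between the two nodes.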
Halil Ozan Akgül	03a4769c3a	Fix Reference Table Check for CDC (#7025 ) Previously reference table check only looked at `partition method = 'n'`. This PR adds `replication model = 't'` to that.	2023-06-23 16:37:35 +03:00
Teja Mupparti	387b5f80f9	Fixes the bug#6785	2023-06-22 10:44:45 -07:00
Ahmet Gedemenli	99edb2675f	Improve error/hint messages related to schema-based sharding (#7027 ) Improve error/hint messages related to schema-based sharding	2023-06-22 18:10:12 +03:00
Ahmet Gedemenli	44e3c3b9c6	Improve error message for CREATE SCHEMA .. CREATE TABLE (#7024 ) Improve error message for CREATE SCHEMA .. CREATE TABLE when enable_schema_based_sharding is enabled.	2023-06-21 15:24:09 +03:00
aykut-bozkurt	565c5260fd	Properly handle error at owner check (#6984 ) We did not properly handle the error at ownership check method, which causes `max stack depth for errors` as in https://github.com/citusdata/citus/issues/6980. Fix: In case of an error, we should rollback subtransaction and throw the message with log level to `LOG_SERVER_ONLY`. Note: We prevent logs from the client to prevent pg vanilla test failures due to Citus logs which differs from the actual Postgres logs. (For context: https://github.com/citusdata/citus/pull/6130) I also needed to fix a flaky test: `multi_schema_support` DESCRIPTION: Fixes a bug related to non-existent objects in DDL commands. Fixes https://github.com/citusdata/citus/issues/6980	2023-06-21 14:50:01 +03:00
Naisila Puka	69af3e8509	Drop PG13 Support Phase 2 - Remove PG13 specific paths/tests (#7007 ) This commit is the second and last phase of dropping PG13 support. It consists of the following: - Removes all PG_VERSION_13 & PG_VERSION_14 from codepaths - Removes pg_version_compat entries and columnar_version_compat entries specific for PG13 - Removes alternative pg13 test outputs - Removes PG13 normalize lines and fix the test outputs based on that It is a continuation of `5bf163a27d`	2023-06-21 14:18:23 +03:00
aykut-bozkurt	1bb667ce6e	Fix create schema authorization bug (#7015 ) Fixes a bug related to `CREATE SCHEMA AUTHORIZATION <rolename>` for single shard tables. We should properly fetch schema name from role specification if schema name is not given.	2023-06-20 22:05:17 +03:00
aykut-bozkurt	f667f14029	Rewind tuple store to fix scrollable with hold cursor fetches (#7014 ) We need to rewind the tuplestorestate's tuple index to get correct results on fetching scrollable with hold cursors. `PersistHoldablePortal` is responsible for persisting the tuplestorestate inside a with hold cursor before committing a transaction. It rewinds the cursor like below (`ExecutorRewind` calls `rescan`): ```c if (portal->cursorOptions & CURSOR_OPT_SCROLL) { ExecutorRewind(queryDesc); } ``` At the end, it adjusts tuple index for holdStore in the portal properly. ```c if (portal->cursorOptions & CURSOR_OPT_SCROLL) { if (!tuplestore_skiptuples(portal->holdStore, portal->portalPos, true)) elog(ERROR, "unexpected end of tuple stream"); } ``` DESCRIPTION: Fixes incorrect results on fetching scrollable with hold cursors. Fixes https://github.com/citusdata/citus/issues/7010	2023-06-19 23:00:18 +03:00
Teja Mupparti	58da8771aa	This pull request introduces support for nonroutable merge commands in the following scenarios: 1) For distributed tables that are not colocated. 2) When joining on a non-distribution column for colocated tables. 3) When merging into a distributed table using reference or citus-local tables as the data source. This is accomplished primarily through the implementation of the following two strategies. Repartition: Plan the source query independently, execute the results into intermediate files, and repartition the files to co-locate them with the merge-target table. Subsequently, compile a final merge query on the target table using the intermediate results as the data source. Pull-to-coordinator: Execute the plan that requires evaluation at the coordinator, run the query on the coordinator, and redistribute the resulting rows to ensure colocation with the target shards. Direct the MERGE SQL operation to the worker nodes' target shards, using the intermediate files colocated with the data as the data source.	2023-06-19 12:23:40 -07:00
Xin Li	c10cb50aa9	Support custom cast from / to timestamptz in time partition management UDFs (#6923 ) This is to implement custom cast of table partition column type from / to `timestamptz` in time partition management UDFs, as proposed in ticket #6454 The general idea is for a time partition column with type other than `date`, `timestamp`, or `timestamptz`, users can provide custom bidirectional cast between the column type and `timestamptz`, the UDFs then will be able to create and drop time partitions for such tables. Fixes #6454 --------- Signed-off-by: Xin Li <xin@swirldslabs.com> Co-authored-by: Marco Slot <marco.slot@microsoft.com> Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2023-06-19 17:49:05 +03:00
Halil Ozan Akgül	d71ad4b65a	Add Publication Tests for Tenant Schema Tables (#7011 ) This PR adds schema based sharding tests to publication.sql file	2023-06-19 12:39:41 +03:00
aykut-bozkurt	fba5c8dd30	ALTER TABLE <tblname> SET SCHEMA <schemaname> for single shard tables (#7004 ) Adds support for altering schema of single shard tables. We do that in 2 steps. 1. Undistribute the tenant table at `preprocess` step, 2. Distribute new schema if it is a distributed schema after DDLs are propagated. DESCRIPTION: Adds support for altering a table's schema to/from distributed schemas.	2023-06-19 10:21:13 +03:00
Nils Dijk	ce2ba1d07e	Optimize QueryPushdownSqlTaskList on memory and cpu (#6945 ) While going over this piece of code (a long time ago), it bothered me that we keep a bool array with the size of shardcount to iterate only over shards present in the list of non-pruned shards. Especially since we keep min/max of the set shards to optimize iteration. Postgres has the bitmapset datastructure which a) takes significantly less space, b) has iterator functions to only iterate over set bits, c) can efficiently skip long sequences of unset bits and d) stops quickly once the last set bit has been reached. I have been contemplating whether it is worth keeping the minShardOffset because of readability and the efficient skipping of unset bits; however, I have decided to keep it, although less readable, as there are known usecases where 100k+ shards are pruned to single digit shards. If these would end up at the end of `shardcount`, a hotloop of zero checks on the first iteration _could_ cause a theoretical performance regression. All in all, this code is using less memory in all cases where it matters, and less cpu in most cases, while using more idiomatic datastructures for the task at hand.	2023-06-16 16:06:22 +02:00
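The idea of visiting only the set bits, rather than scanning a `shardcount`-sized bool array, can be sketched as follows. This is an illustration under stated assumptions, not the Citus code or the Postgres Bitmapset API: a Python integer stands in for the bitmap, and `iter_set_bits` and the sample shard indexes are hypothetical.

```python
def iter_set_bits(bitmap: int):
    """Yield the indexes of set bits in ascending order.

    Each step jumps straight to the lowest remaining set bit, so runs of
    unset bits are never visited one by one, and the loop ends as soon as
    the last set bit has been consumed.
    """
    while bitmap:
        lowest = bitmap & -bitmap      # isolate the lowest set bit
        yield lowest.bit_length() - 1  # its zero-based index
        bitmap ^= lowest               # clear it and continue


shard_count = 100_000
surviving = [3, 42_000, 99_998]  # hypothetical non-pruned shard indexes

bitmap = 0
for idx in surviving:
    bitmap |= 1 << idx           # mark the shard as present

visited = list(iter_set_bits(bitmap))
print(visited)
```

The loop body runs once per surviving shard instead of once per shard, which is the property the commit message attributes to iterating a bitmapset.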
Marco Slot	3adc1575d9	Fix DROP CONSTRAINT in command string with other commands (#7012 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-06-16 15:54:37 +02:00
Pino de Candia	f4a90da8c8	Replace Slack heroku app with plain link in the Readme banner. (#6985 )	2023-06-16 15:34:11 +02:00
Onur Tirtir	12a093b456	Allow using generated identity column based on int/smallint when creating a distributed table (#7008 ) Allow using generated identity column based on int/smallint when creating a distributed table so that applications that rely on those data types don't break. Inserting into / modifying such columns from workers is not allowed but it's better than not allowing such columns altogether.	2023-06-16 14:34:23 +03:00
Halil Ozan Akgül	04f6868ed2	Add citus_schemas view (#6979 ) DESCRIPTION: Adds citus_schemas view The citus_schemas view will be created in public schema if it exists, if not the view will be created in pg_catalog. Need to: - [x] Add tests - [x] Fix tests	2023-06-16 14:21:58 +03:00
Naisila Puka	5bf163a27d	Remove PG13 from CI and Configure (#7002 ) DESCRIPTION: Drops PG13 Support This commit is the first phase of dropping PG13 support. It consists of the following: - Removes pg13 from CI tests Among other things, Citus upgrade tests should now use PG14. Earliest Citus version supporting PG14 is 10.2. We also pick 11.3 version for upgrade_pg_dist_cleanup tests. Therefore, we run the citus upgrade tests with versions 10.2 and 11.3. - Removes pg13 from configure script - Remove upgrade_columnar_metapage upgrade tests We populate first_row_number column of columnar.stripe table during citus 10.1-10.2 upgrade. Given that we start from citus 10.2.0, which is the oldest version supporting PG14, we don't have that upgrade path anymore. Hence we remove these tests. - Removes upgrade_pg_dist_object_test and upgrade_partition_constraints tests These upgrade tests require the citus old version to be less than 10.0. Given that we drop support for PG13, we run upgrade tests with PG14, which starts with 10.2. So we remove these upgrade tests. - Documents that upgrade_post_11 should upgrade from version less than 11 In this way we make sure we run citus_finalize_upgrade_to_citus11 script - Adds needed alternative output for upgrade_citus_finish_citus_upgrade Given that we use 11.3 as the citus old version as well, we add this alternative output because pg_catalog.citus_finish_citus_upgrade() makes sense if last_upgrade_major_version < 11. See below for reference: pg_catalog.citus_finish_citus_upgrade(): ... IF last_upgrade_major_version < 11 THEN PERFORM citus_finalize_upgrade_to_citus11(); performed_upgrade := true; END IF; IF NOT performed_upgrade THEN RAISE NOTICE 'already at the latest distributed schema version (%)', last_upgrade_version_string; RETURN; END IF; ... And that's it :) The second phase of dropping PG13 support will consist in removing all the PG13 specific compilation paths/tests in the Citus repo. Will be done soon.	2023-06-15 14:54:06 +03:00
Ahmet Gedemenli	002a88ae7f	Error for single shard table creation if replication factor > 1 (#7006 ) Error for single shard table creation if replication factor > 1	2023-06-15 13:13:45 +03:00
Emel Şimşek	4f793abc4a	Turn on GUC_REPORT flag for search_path to enable reporting back the parameter value upon change. (#6983 ) DESCRIPTION: Turns on the GUC_REPORT flag for search_path. This results in postgres to report the parameter status back in addition to Command Complete packet. In response to the following command, > SET search_path TO client1; postgres sends back the following packets (shown in pseudo form): C (Command Complete) SET + S (Parameter Status) search_path = client1	2023-06-14 17:35:52 +03:00
Naisila Puka	3cc7a4aa42	Fix pg14-pg15 upgrade_distributed_triggers test (#6981 ) This test is only relevant for pg14-15 upgrade. However, the check on `upgrade_distributed_triggers_after` didn't take into consideration the case when we are doing pg15-16 upgrade. Hence, I added one more condition to the test: existence of `upgrade_distributed_triggers` schema which can only be created in pg14.	2023-06-14 15:32:38 +03:00
Onur Tirtir	dbdf04e8ba	Rename pg_dist tenant_schema to pg_dist_schema (#7001 )	2023-06-14 12:12:15 +03:00
Naisila Puka	ba40eb363c	Fix some gucs' initial and boot values, and flag combinations (#6957 ) PG16beta1 added some sanity checks for GUCS, find the Relevant PG commits below: 1- Add check on initial and boot values when loading GUCs `a73952b795` 2- Extend check_GUC_init() with checks on flag combinations when loading GUCs `009f8d1714` I fixed our currently problematic GUCS, we can merge this directly into main as these make sense for any PG version. There was a particular NodeConninfo issue: Previously we would rely on the fact that NodeConninfo initial value is an empty string. However, with PG16 enforcing same initial and boot values, we can't use an empty initial value for NodeConninfo anymore. Therefore we add a new flag to indicate whether we are at boot check.	2023-06-14 11:55:52 +03:00
Ahmet Gedemenli	7b0bc62173	Support CREATE TABLE .. AS SELECT .. commands for tenant tables (#6998 ) Support CREATE TABLE .. AS SELECT .. commands for tenant tables	2023-06-13 17:54:09 +03:00
Halil Ozan Akgül	772d194357	Changes citus_shard_sizes view's Shard Name Column to Shard Id (#7003 ) citus_shard_sizes view had a shard name column we use to extract shard id. This PR changes the column to shard id so we don't do unnecessary string operation.	2023-06-13 16:36:35 +03:00
Gokhan Gulbiz	e0ccd155ab	Make citus_stat_tenants work with schema-based tenants. (#6936 ) DESCRIPTION: Enabling citus_stat_tenants to support schema-based tenants. This pull request modifies the existing logic to enable tenant monitoring with schema-based tenants. The changes made are as follows: - If a query has a partitionKeyValue (which serves as a tenant key/identifier for distributed tables), Citus annotates the query with both the partitionKeyValue and colocationId. This allows for accurate tracking of the query. - If a query does not have a partitionKeyValue, but its colocationId belongs to a distributed schema, Citus annotates the query with only the colocationId. The tenant monitor can then easily look up the schema to determine if it's a distributed schema and make a decision on whether to track the query. --------- Co-authored-by: Jelte Fennema <jelte.fennema@microsoft.com>	2023-06-13 14:11:45 +03:00
aykut-bozkurt	5acbd735ca	Move 2 functions to correct files (#7000 ) Followup item from https://github.com/citusdata/citus/pull/6933#discussion_r1217896933	2023-06-13 11:43:48 +03:00
Jelte Fennema	b96d3171a2	Small fix to cherry-pick instructions (#6997 ) It wasn't creating the branch	2023-06-12 18:21:33 +02:00
aykut-bozkurt	213d363bc3	Add citus_schema_distribute/undistribute udfs to convert a schema into a tenant schema / back to a regular schema (#6933 ) * Currently we do not allow any Citus tables other than Citus local tables inside a regular schema before executing `citus_schema_distribute`. * `citus_schema_undistribute` expects only single shard distributed tables inside a tenant schema. DESCRIPTION: Adds the udf `citus_schema_distribute` to convert a regular schema into a tenant schema. DESCRIPTION: Adds the udf `citus_schema_undistribute` to convert a tenant schema back to a regular schema. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-06-12 18:41:31 +03:00
Jelte Fennema	e37ee16d59	Add a section on backporting to CONTRIBUTING.md (#6995 ) Backporting changes is pretty easy, but the steps are not obvious if you're new to the project.	2023-06-12 10:42:26 +02:00
Gokhan Gulbiz	2c509b712a	Tenant monitoring performance improvements (#6868 ) - [x] Use spinlock instead of lwlock per tenant [`b437aa9`](`b437aa9e52`) - [x] Use hashtable to store tenant stats [`ccd464b`](`ccd464ba04`) - [x] Introduce a new GUC for specifying the sampling rate of new tenant entries in the tenant monitor. [`a8d3805`](`a8d3805bd6`) Below are the pgbench metrics with select-only workloads from my local machine. Here is the [script](https://gist.github.com/gokhangulbiz/7a2308470597dc06734ff7c08f87c656) I used for benchmarking. \| \| Connection Count \| Initial Implementation (TPS) \| On/Off Diff \| Final Implementation -Run#1 (TPS) \| On/Off Diff \| Final Implementation -Run#2 (TPS) \| On/Off Diff \| Final Implementation -Run#3 (TPS) \| On/Off Diff \| Avg On/Off Diff \| \| --- \| ---------------- \| ---------------------------- \| ----------- \| ---------------------------------- \| ----------- \| ---------------------------------- \| ----------- \| ---------------------------------- \| ----------- \| --------------- \| \| On \| 32 \| 37488.69839 \| \-17% \| 42859.94402 \| \-5% \| 43379.63121 \| \-2% \| 42636.2264 \| \-7% \| \-5% \| \| Off \| 32 \| 43909.83121 \| \| 45139.63151 \| \| 44188.77425 \| \| 45451.9548 \| \| \| \| On \| 300 \| 30463.03538 \| \-15% \| 33265.19957 \| \-7% \| 34685.87233 \| \-2% \| 34682.5214 \| \-1% \| \-3% \| \| Off \| 300 \| 35105.73594 \| \| 35637.45423 \| \| 35331.33447 \| \| 35113.3214 \| \| \|	2023-06-11 12:17:31 +03:00
Ahmet Gedemenli	2f13b37ce4	Fix flaky multi_schema_support (#6991 ) Drop a leftover table, delete some unnecessary commands, and add some ORDER BY clauses to avoid flakiness in `multi_schema_support`	2023-06-09 17:03:58 +03:00
Naisila Puka	50e6c50534	Remove flaky rebalance plan from test (#6990 ) Looks like sometimes shards are a slightly different size than we expect, 16k vs 8k, resulting in a different rebalance plan.	2023-06-09 15:59:30 +03:00
Ahmet Gedemenli	e6ac9f2a68	Propagate ALTER SCHEMA .. OWNER TO .. (#6987 ) Propagate `ALTER SCHEMA .. OWNER TO ..` commands to workers	2023-06-09 15:32:18 +03:00
Halil Ozan Akgül	3acadd7321	Citus Clock tests with Single Shard Tables (#6938 ) This PR tests Citus clock with single shard tables.	2023-06-09 15:06:46 +03:00
Naisila Puka	2ba3bffe1e	Random warning fixes (#6974 ) Citus build with PG16 fails because of the following warnings: - using char* instead of Datum - using pointer instead of oid - candidate function for format attribute - remove old definition from PG11 compatibility `62bf571ced` This commit fixes the above.	2023-06-09 14:36:43 +03:00
Emel Şimşek	8b2024b730	When Creating a FOREIGN KEY without a name, schema qualify referenced table name in deparser. (#6986 ) DESCRIPTION: Fixes a bug which causes an error when creating a FOREIGN KEY constraint without a name if the referenced table is schema qualified. In deparsing the `ALTER TABLE s1.t1 ADD FOREIGN KEY (key) REFERENCES s2.t2; `, command back from its cooked form, we should schema qualify the REFERENCED table. Fixes #6982.	2023-06-09 14:13:13 +03:00
Onur Tirtir	fa8870217d	Enable logical planner for single-shard tables (#6950 ) * Enable using logical planner for single-shard tables * Improve non-colocated table error in physical planner * Favor distributed tables over reference tables when choosing anchor shard	2023-06-08 10:57:23 +03:00
Halil Ozan Akgül	b569d53a0c	Single shard misc udfs (#6956 ) This PR tests: - shards_colocated - citus_shard_cost_by_disk_size - citus_update_shard_statistics - citus_update_table_statistics	2023-06-07 13:30:50 +03:00
Emel Şimşek	6369645db4	Restore Test Coverage for Pushing Down Subqueries. (#6976 ) When we add the coordinator in metadata, reference tables get replicated to the coordinator. As a result we lose some test coverage since some queries start to run locally instead of getting pushed down. This PR adds new test cases involving distributed tables instead of reference tables for covering distributed execution in related cases.	2023-06-07 12:14:34 +03:00
Ahmet Gedemenli	8d8968ae63	Disable ALTER TABLE .. SET SCHEMA for tenant tables (#6973 ) Disables `ALTER TABLE .. SET SCHEMA` for tenant tables. Disables `ALTER TABLE .. SET SCHEMA` for tenant schemas.	2023-06-07 11:02:53 +03:00
Halil Ozan Akgül	3f7bc0cbf5	Single Shard Partition Column UDFs (#6964 ) This PR fixes and tests: - debug_equality_expression - partition_column_id	2023-06-06 17:55:40 +03:00
Halil Ozan Akgül	7e486345f1	Fix citus_table_type column in citus_tables and citus_shards views for single shard tables (#6971 ) `citus_table_type` column of `citus_tables` and `citus_shards` will show "schema" for tenants schema tables and "distributed" for single shard tables that are not in a tenant schema.	2023-06-06 16:20:11 +03:00
Naisila Puka	c2f117c559	Citus Revise tree-walk APIs to include context (#6975 ) Without revising there are Warnings in PG16 build Relevant PG commit `1c27d16e6e` 1c27d16e6e5c1f463bbe1e9ece88dda811235165	2023-06-06 14:17:51 +03:00
Teja Mupparti	f6a516dab5	Refactor repartitioning code into generic format	2023-06-05 09:06:05 -07:00
Naisila Puka	1c9e3fabc2	Bump PGversions for CI tests (#6969 ) Postgres got minor updates in May, this starts using the images with the latest version for our tests. These new Postgres versions didn't cause any compilation issues or test failures. Depends on https://github.com/citusdata/the-process/pull/136	2023-06-05 14:03:39 +03:00
Naisila Puka	48f068d08e	Remove AssertArg and AssertState (#6970 ) PG16 removed them. They were already identical to Assert. We can merge this directly to main branch Relevant PG commit: `b1099eca8f` b1099eca8f38ff5cfaf0901bb91cb6a22f909bc6 Co-authored-by: onderkalaci <onderkalaci@gmail.com>	2023-06-05 13:25:21 +03:00
Emel Şimşek	3fda2c3254	Change test files in multi and multi-1 schedules to accommodate coordinator in the metadata. (#6939 ) Changes test files in multi and multi-1 schedules such that they accommodate coordinator in metadata. Changes fall into the following buckets: 1. When coordinator is in metadata, reference table shards are present in coordinator too. This changes test outputs checking the table size, shard numbers etc. for reference tables. 2. When coordinator is in metadata, postgres tables are converted to citus local tables whenever a foreign key relationship to them is created. This changes some test cases which test that it should not be possible to create foreign keys to postgres tables. 3. Remove lines that add/remove coordinator for testing purposes.	2023-06-05 10:37:48 +03:00
Ahmet Gedemenli	976ab5a9be	Disable some udfs for tenant tables (#6965 ) Disables following UDFs for tenant tables: * update_distributed_table_colocation // i) table_name cannot be a tenant table ii) colocate_with cannot be a tenant table * undistribute_table * alter_distributed_table // i) table_name cannot be a tenant table ii) colocate_with cannot be a tenant table Also, see: https://gist.github.com/onurctirtir/4c20217200f29b1b1fdaf187d1ecb4f3?permalink_comment_id=4587463#gistcomment-4587463	2023-06-02 15:49:13 +03:00
ahmet gedemenli	2bd6ff0e93	Use schema name in the error msg	2023-06-02 15:25:14 +03:00
ahmet gedemenli	fccfee08b6	Style	2023-06-02 14:48:07 +03:00
ahmet gedemenli	f68ea20009	Disable alter_distributed_table for tenant tables	2023-06-02 14:48:07 +03:00
ahmet gedemenli	4b67e398b1	Disable undistribute_table for tenant tables	2023-06-02 14:48:07 +03:00
ahmet gedemenli	f4b2494d0c	Disable update_distributed_table_colocation for tenant tables	2023-06-02 14:48:07 +03:00
Halil Ozan Akgül	3e183746b7	Single Shard Misc UDFs 2 (#6963 ) Creating a second PR to make reviewing easier. This PR tests: - replicate_reference_tables - fix_partition_shard_index_names - isolate_tenant_to_new_shard - replicate_table_shards	2023-06-02 13:46:14 +03:00
Halil Ozan Akgül	ac7f732be2	Add Single Shard Table Tests for Dependency UDFs (#6960 ) This PR tests: - citus_get_all_dependencies_for_object - citus_get_dependencies_for_object - is_citus_depended_object	2023-06-02 11:57:53 +03:00
Teja Mupparti	ff2062e8c3	Rename insert-select redistribute code base to generic purpose	2023-06-01 09:43:43 -07:00
Halil Ozan Akgül	9961d39d97	Adds Single Shard Table Tests for Foreign Key UDFs (#6959 ) This PR adds tests for: - get_referencing_relation_id_list - get_referenced_relation_id_list - get_foreign_key_connected_relations	2023-06-01 12:56:06 +03:00
Ahmet Gedemenli	3cd81a7107	Add test for rebalancer with single shard tables (#6949 ) Adds test for shard moves / rebalancer with single shard tables	2023-05-31 14:58:23 +03:00
ahmet gedemenli	8ace5a7af5	Use citus_drain_node with single shard tables	2023-05-31 14:01:52 +03:00
ahmet gedemenli	ee42af7ad2	Add test for rebalancer with single shard tables	2023-05-31 11:48:49 +03:00
Teja Mupparti	f9dbe7784b	This commit adds a safety-net to the issue seen in #6785 . The fix for the underlying issue will be in the PR#6943	2023-05-30 10:53:05 -07:00
Halil Ozan Akgül	d99a5e2f62	Single Shard Table Tests for Shard Lock UDFs (#6944 ) This PR adds single shard table tests for shard lock UDFs, `shard_lock_metadata`, `shard_lock_resources`	2023-05-30 12:23:41 +03:00
Halil Ozan Akgül	5b54700b93	Single Shard Table Tests for Time Partitions (#6941 ) This PR adds tests for time partitions UDFs and view with single shard tables.	2023-05-29 14:18:56 +03:00
Halil Ozan Akgül	9d9b3817c1	Single Shard Table Columnar UDFs Tests (#6937 ) Adds columnar UDF tests for single shard tables.	2023-05-29 13:53:00 +03:00
Halil Ozan Akgül	321fcfcdb5	Add Support for Single Shard Tables in update_distributed_table_colocation (#6924 ) Adds Support for Single Shard Tables in `update_distributed_table_colocation`. This PR changes checks that make sure tables should be hash distributed table to hash or single shard distributed tables.	2023-05-29 11:47:50 +03:00
Ahmet Gedemenli	1ca80813f6	Citus UDFs support for single shard tables (#6916 ) Verify Citus UDFs work well with single shard tables SUPPORTED * citus_table_size * citus_total_relation_size * citus_relation_size * citus_shard_sizes * truncate_local_data_after_distributing_table * create_distributed_function // test function colocated with a single shard table * undistribute_table * alter_table_set_access_method UNSUPPORTED - error out for single shard tables * master_create_empty_shard * create_distributed_table_concurrently * create_distributed_table * create_reference_table * citus_add_local_table_to_metadata * citus_split_shard_by_split_points * alter_distributed_table	2023-05-26 17:30:05 +03:00
Onur Tirtir	246b054a7d	Add support for schema-based-sharding via a GUC (#6866 ) DESCRIPTION: Adds citus.enable_schema_based_sharding GUC that allows sharding the database based on schemas when enabled. * Refactor the logic that automatically creates Citus managed tables * Refactor CreateSingleShardTable() to allow specifying colocation id instead * Add support for schema-based-sharding via a GUC ### What this PR is about: Add citus.enable_schema_based_sharding GUC to enable schema-based sharding. Each schema created while this GUC is ON will be considered as a tenant schema. Later on, regardless of whether the GUC is ON or OFF, any table created in a tenant schema will be converted to a single shard distributed table (without a shard key). All the tenant tables that belong to a particular schema will be co-located with each other and will have a shard count of 1. We introduce a new metadata table --pg_dist_tenant_schema-- to do the bookkeeping for tenant schemas: ```sql psql> \d pg_dist_tenant_schema Table "pg_catalog.pg_dist_tenant_schema" ┌───────────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├───────────────┼─────────┼───────────┼──────────┼─────────┤ │ schemaid │ oid │ │ not null │ │ │ colocationid │ integer │ │ not null │ │ └───────────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "pg_dist_tenant_schema_pkey" PRIMARY KEY, btree (schemaid) "pg_dist_tenant_schema_unique_colocationid_index" UNIQUE, btree (colocationid) psql> table pg_dist_tenant_schema; ┌───────────┬───────────────┐ │ schemaid │ colocationid │ ├───────────┼───────────────┤ │ 41963 │ 91 │ │ 41962 │ 90 │ └───────────┴───────────────┘ (2 rows) ``` Colocation id column of pg_dist_tenant_schema can never be NULL even for the tenant schemas that don't have a tenant table yet. This is because, we assign colocation ids to tenant schemas as soon as they are created. 
That way, we can keep associating tenant schemas with particular colocation groups even if all the tenant tables of a tenant schema are dropped and recreated later on. When a tenant schema is dropped, we delete the corresponding row from pg_dist_tenant_schema. In that case, we delete the corresponding colocation group from pg_dist_colocation as well. ### Future work for 12.0 release: We're building schema-based sharding on top of the infrastructure that adds support for creating distributed tables without a shard key (https://github.com/citusdata/citus/pull/6867). However, not all the operations that can be done on distributed tables without a shard key necessarily make sense (in the same way) in the context of schema-based sharding. For example, we need to think about what happens if user attempts altering schema of a tenant table. We will tackle such scenarios in a future PR. We will also add a new UDF --citus.schema_tenant_set() or such-- to allow users to use an existing schema as a tenant schema, and another one --citus.schema_tenant_unset() or such-- to stop using a schema as a tenant schema in future PRs.	2023-05-26 10:49:58 +03:00
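The bookkeeping described in #6866 can be pictured with a small in-memory model. This is only a sketch under assumptions: `TenantSchemaRegistry` and its method names are hypothetical and are not Citus code; the real logic is implemented in C against the `pg_dist_tenant_schema` and `pg_dist_colocation` catalogs.

```python
class TenantSchemaRegistry:
    """Toy model of pg_dist_tenant_schema (hypothetical, not Citus code)."""

    def __init__(self, first_colocation_id=90):
        self._next_colocation_id = first_colocation_id
        self.tenant_schemas = {}        # schemaid -> colocationid
        self.colocation_groups = set()  # stand-in for pg_dist_colocation

    def create_schema(self, schemaid):
        # A colocation id is assigned as soon as the schema is created,
        # so the colocationid column is never NULL, even with no tables yet.
        colocation_id = self._next_colocation_id
        self._next_colocation_id += 1
        self.tenant_schemas[schemaid] = colocation_id
        self.colocation_groups.add(colocation_id)
        return colocation_id

    def drop_schema(self, schemaid):
        # Dropping the schema removes both the tenant row and its
        # colocation group.
        colocation_id = self.tenant_schemas.pop(schemaid)
        self.colocation_groups.discard(colocation_id)
```

Dropping and recreating tables inside a schema does not touch the registry in this model, which mirrors the statement that a tenant schema stays associated with its colocation group until the schema itself is dropped.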
Halil Ozan Akgül	2c7beee562	Fix citus.tenant_stats_limit test by setting it to 2 (#6899 ) citus.tenant_stats_limit was set to 2 when we were adding tests for it. Then we changed it to 10, making the tests incorrect. This PR fixes that without breaking other tests.	2023-05-23 17:44:07 +03:00
Jelte Fennema	350a0f6417	Support running Citus upgrade tests with run_test.py (#6832 ) Citus upgrade tests require some additional logic to run, because we have a before and after schedule and we need to swap the Citus version in-between. This adds that logic to `run_test.py`. In passing this makes running upgrade tests locally multiple times faster by caching tarballs.	2023-05-23 14:38:54 +02:00
Emel Şimşek	02f815ce1f	Disable local execution when Explain Analyze is requested for a query. (#6892 ) DESCRIPTION: Fixes a crash when explain analyze is requested for a query that is normally locally executed. When explain analyze is requested for a query, a task with two queries is created. Those two queries are 1. Wrapped Query --> `SELECT ... FROM worker_save_query_explain_analyze(<query>, <explain analyze options>)` 2. Fetch Query -->` SELECT explain_analyze_output, execution_duration FROM worker_last_saved_explain_analyze();` When the query is locally executed a task with multiple queries causes a crash in production. See the Assert at `57455dc64d/src/backend/distributed/executor/tuple_destination.c`#:~:text=Assert(task%2D%3EqueryCount%20%3D%3D%201)%3B This becomes a critical issue when auto_explain extension is used. When auto_explain extension is enabled, explain analyze is automatically requested for every query. One possible solution could be not to create two queries for a locally executed query. The fetch part may not have to be a query since the values are available in local variables. Until we enable local execution for explain analyze, it is best to disable local execution. Fixes #6777.	2023-05-23 14:33:22 +03:00
Emel Şimşek	f9a5be59b9	Run replicate_reference_tables background task as superuser. (#6930 ) DESCRIPTION: Fixes a bug in background shard rebalancer where the replicate reference tables task fails if the current user is not a superuser. This change is to be backported to earlier releases. We should fix the permissions for replicate_reference_tables on main branch such that it can be run by non-superuser roles. Fixes #6925. Fixes #6926.	2023-05-18 23:46:32 +03:00
Hanefi Onaldi	6a83290d91	Add ORDER BY clauses to some flaky tests (#6931 ) I observed a flaky test output [here](https://app.circleci.com/pipelines/github/citusdata/citus/32692/workflows/32464a22-7fd6-440a-9ff7-cfa62f9ff58a/jobs/1126144) and added `ORDER BY` clauses to similar queries in the failing test file. ```diff SELECT pg_identify_object_as_address(classid, objid, objsubid) from pg_catalog.pg_dist_object where objid IN('viewsc.prop_view3'::regclass::oid, 'viewsc.prop_view4'::regclass::oid); pg_identify_object_as_address --------------------------------- - (view,"{viewsc,prop_view3}",{}) (view,"{viewsc,prop_view4}",{}) + (view,"{viewsc,prop_view3}",{}) (2 rows) ```	2023-05-18 12:45:39 +03:00
Onur Tirtir	8ff9dde4b3	Prevent pushing down INSERT .. SELECT queries that we shouldn't (and allow some more) (#6752 ) Previously, the INSERT .. SELECT planner pushed down some queries that should not be pushed down because of wrong colocation checks. It only checked whether one of the tables in the SELECT part was colocated with the target table. Now we check colocation for all tables in the SELECT part and the target table. Another problem with the INSERT .. SELECT planner was that some queries that are valid to push down were not pushed down because of unnecessary checks for constructs that are currently supported, e.g. the UNION check. As a solution, we reuse the pushdown planner checks for the INSERT .. SELECT planner. DESCRIPTION: Fixes a bug that causes incorrectly pushing down some INSERT .. SELECT queries that we shouldn't DESCRIPTION: Prevents unnecessarily pulling the data into coordinator for some INSERT .. SELECT queries DESCRIPTION: Drops support for pushing down INSERT .. SELECT with append table as target Fixes #6749. Fixes #1428. Fixes #6920. --------- Co-authored-by: aykutbozkurt <aykut.bozkurt1995@gmail.com>	2023-05-17 15:05:08 +03:00
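The difference between the old and new colocation checks in #6752 can be illustrated as follows. This is a simplified sketch, not the planner code: the function names are hypothetical, tables are reduced to bare colocation ids, and the other pushdown checks that the PR reuses are not modelled.

```python
def old_colocation_check(target_colocation_id, select_colocation_ids):
    # Behaviour described as buggy: one colocated table in the SELECT
    # part was enough to allow pushdown.
    return any(c == target_colocation_id for c in select_colocation_ids)


def new_colocation_check(target_colocation_id, select_colocation_ids):
    # Behaviour after the fix: every table in the SELECT part must be
    # colocated with the target table.
    return all(c == target_colocation_id for c in select_colocation_ids)
```

With a target in colocation group 1 and SELECT tables in groups 1 and 2, the old check accepts the query while the new one rejects it.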
Onur Tirtir	56d217b108	Mark objects as distributed even when pg_dist_node is empty (#6900 ) We mark objects as distributed objects in Citus metadata only if we need to propagate the command that creates them to worker nodes. For this reason, we were not doing this for the objects that are created while pg_dist_node is empty. One implication of doing so is that we defer the schema propagation to the time when user creates the first distributed table in the schema. However, this doesn't help for schema-based sharding (#6866) because we want to sync pg_dist_tenant_schema to the worker nodes even for empty schemas. * Support test dependencies for isolation tests without a schedule * Comment out a test due to a known issue (#6901) * Also, reduce the verbosity for some log messages and make some tests compatible with run_test.py.	2023-05-16 11:45:42 +03:00
Onur Tirtir	e7abde7e81	Prevent downgrades when there is a single-shard table in the cluster (#6908 ) Also add a few tests for Citus/PG upgrade/downgrade scenarios.	2023-05-16 09:44:28 +02:00
Onur Tirtir	893ed416f1	Disable citus.enable_non_colocated_router_query_pushdown by default (#6909 ) Fixes #6779. DESCRIPTION: Disables citus.enable_non_colocated_router_query_pushdown GUC by default to ensure generating a consistent distributed plan for the queries that reference non-colocated distributed tables We already have tests for the cases where this GUC is disabled, so I'm not adding any more tests in this PR. Also make multi_insert_select_window idempotent. Related to: #6793	2023-05-15 12:07:50 +03:00
Jelte Fennema	07b8cd2634	Forward to existing emit_log_hook in our log hook (#6877 ) DESCRIPTION: Forward to existing emit_log_hook in our log hook This makes us work better with other extensions installed in Postgres. Without this change we would overwrite their emit_log_hook, causing it to never be called. Fixes #6874	2023-05-09 16:55:56 +02:00
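The hook-forwarding pattern from #6877 can be modelled in a few lines. This is a language-neutral sketch written in Python, not the actual C change: `Server`, `load_other_extension` and `load_citus_like_extension` are hypothetical names, and calling our own logic before the previous hook is an assumption about ordering.

```python
class Server:
    """Stand-in for the process-wide emit_log_hook pointer."""
    emit_log_hook = None


def load_other_extension(server, calls):
    def other_hook(message):
        calls.append(("other", message))
    server.emit_log_hook = other_hook


def load_citus_like_extension(server, calls):
    # Remember whatever hook was installed before ours.
    previous_hook = server.emit_log_hook

    def our_hook(message):
        calls.append(("ours", message))
        # Forward to the earlier hook instead of silently replacing it.
        if previous_hook is not None:
            previous_hook(message)

    server.emit_log_hook = our_hook
```

Without the saved `previous_hook`, the second assignment would overwrite the first extension's hook and it would never be called, which is the problem the commit message describes.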
Ivan Kush	e3c6b8a10e	Fix flaky columnar_permissions test (#6913 ) Because the output isn't ordered by attr_num, the row order may vary and the regression test may fail. This MR adds attr_num to ORDER BY ``` --- /build/contrib/citus/src/test/regress/expected/columnar_permissions.out.modified 2023-05-05 11:13:44.926085432 +0000 +++ /build/contrib/citus/src/test/regress/results/columnar_permissions.out.modified 2023-05-05 11:13:44.934085414 +0000 @@ -124,24 +124,24 @@ from columnar.chunk where relation in ('no_access'::regclass, 'columnar_permissions'::regclass) order by relation, stripe_num; relation \| stripe_num \| attr_num \| chunk_group_num \| value_count ----------------------+------------+----------+-----------------+------------- no_access \| 1 \| 1 \| 0 \| 1 no_access \| 2 \| 1 \| 0 \| 1 no_access \| 3 \| 1 \| 0 \| 1 columnar_permissions \| 1 \| 1 \| 0 \| 1 columnar_permissions \| 1 \| 2 \| 0 \| 1 - columnar_permissions \| 2 \| 1 \| 0 \| 1 columnar_permissions \| 2 \| 2 \| 0 \| 1 - columnar_permissions \| 3 \| 1 \| 0 \| 1 + columnar_permissions \| 2 \| 1 \| 0 \| 1 columnar_permissions \| 3 \| 2 \| 0 \| 1 + columnar_permissions \| 3 \| 1 \| 0 \| 1 columnar_permissions \| 4 \| 1 \| 0 \| 1 columnar_permissions \| 4 \| 2 \| 0 \| 1 (11 rows) ``` Co-authored-by: Ivan Kush <ivan.kush@tantorlabs.ru>	2023-05-09 12:42:37 +02:00
Hanefi Onaldi	06e6f8e428	Normalize columnar version in tests (#6917 ) When we bump columnar version, some tests fail because of the output change. Instead of changing those lines every time, I think it is better to normalize it in tests.	2023-05-08 16:10:55 +03:00
aykut-bozkurt	73c771d6ed	Update readme for 11.3 (#6903 ) Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com> Co-authored-by: Jelte Fennema <jelte.fennema@microsoft.com>	2023-05-05 19:08:35 +03:00
Naisila Puka	905fd46410	Fixes flakiness in background_rebalance_parallel test (#6910 ) Fixes the following flaky outputs by decreasing citus_task_wait loop interval, and changing the order of wait commands. https://app.circleci.com/pipelines/github/citusdata/citus/32102/workflows/19958297-6c7e-49ef-9bc2-8efe8aacb96f/jobs/1089589 ``` diff SELECT job_id, task_id, status, nodes_involved FROM pg_dist_background_task WHERE job_id in (:job_id) ORDER BY task_id; job_id \| task_id \| status \| nodes_involved --------+---------+----------+---------------- 17779 \| 1013 \| done \| {50,56} 17779 \| 1014 \| running \| {50,57} - 17779 \| 1015 \| running \| {50,56} - 17779 \| 1016 \| blocked \| {50,57} + 17779 \| 1015 \| done \| {50,56} + 17779 \| 1016 \| running \| {50,57} 17779 \| 1017 \| runnable \| {50,56} 17779 \| 1018 \| blocked \| {50,57} 17779 \| 1019 \| runnable \| {50,56} 17779 \| 1020 \| blocked \| {50,57} (8 rows) ``` https://github.com/citusdata/citus/pull/6893#issuecomment-1525661408 ```diff SELECT job_id, task_id, status, nodes_involved FROM pg_dist_background_task WHERE job_id in (:job_id) ORDER BY task_id; job_id \| task_id \| status \| nodes_involved --------+---------+----------+---------------- 17779 \| 1013 \| done \| {50,56} - 17779 \| 1014 \| running \| {50,57} + 17779 \| 1014 \| runnable \| {50,57} 17779 \| 1015 \| running \| {50,56} 17779 \| 1016 \| blocked \| {50,57} 17779 \| 1017 \| runnable \| {50,56} 17779 \| 1018 \| blocked \| {50,57} 17779 \| 1019 \| runnable \| {50,56} 17779 \| 1020 \| blocked \| {50,57} (8 rows) ```	2023-05-05 16:47:01 +03:00
Hanefi Onaldi	3217e3f181	Fix flaky background rebalance parallel test (#6893 ) A test in background_rebalance_parallel.sql was failing intermittently where the order of tasks in the output was not deterministic. This commit fixes the test by removing id columns for the background tasks in the output. A sample failing diff before this patch is below: ```diff SELECT D.task_id, (SELECT T.command FROM pg_dist_background_task T WHERE T.task_id = D.task_id), D.depends_on, (SELECT T.command FROM pg_dist_background_task T WHERE T.task_id = D.depends_on) FROM pg_dist_background_task_depend D WHERE job_id in (:job_id) ORDER BY D.task_id, D.depends_on ASC; task_id \| command \| depends_on \| command ---------+---------------------------------------------------------------------+------------+--------------------------------------------------------------------- - 1014 \| SELECT pg_catalog.citus_move_shard_placement(85674026,50,57,'auto') \| 1013 \| SELECT pg_catalog.citus_move_shard_placement(85674025,50,56,'auto') - 1016 \| SELECT pg_catalog.citus_move_shard_placement(85674032,50,57,'auto') \| 1015 \| SELECT pg_catalog.citus_move_shard_placement(85674031,50,56,'auto') - 1018 \| SELECT pg_catalog.citus_move_shard_placement(85674038,50,57,'auto') \| 1017 \| SELECT pg_catalog.citus_move_shard_placement(85674037,50,56,'auto') - 1020 \| SELECT pg_catalog.citus_move_shard_placement(85674044,50,57,'auto') \| 1019 \| SELECT pg_catalog.citus_move_shard_placement(85674043,50,56,'auto') + 1014 \| SELECT pg_catalog.citus_move_shard_placement(85674038,50,57,'auto') \| 1013 \| SELECT pg_catalog.citus_move_shard_placement(85674037,50,56,'auto') + 1016 \| SELECT pg_catalog.citus_move_shard_placement(85674044,50,57,'auto') \| 1015 \| SELECT pg_catalog.citus_move_shard_placement(85674043,50,56,'auto') + 1018 \| SELECT pg_catalog.citus_move_shard_placement(85674026,50,57,'auto') \| 1017 \| SELECT pg_catalog.citus_move_shard_placement(85674025,50,56,'auto') + 1020 \| SELECT 
pg_catalog.citus_move_shard_placement(85674032,50,57,'auto') \| 1019 \| SELECT pg_catalog.citus_move_shard_placement(85674031,50,56,'auto') (4 rows) ``` Notice that the dependent and dependee tasks have the same commands, but they have different task ids.	2023-05-05 12:07:46 +03:00
Teja Mupparti	b58665773b	Move all pre-15-defined routines to the bottom of the file	2023-05-04 10:07:08 -07:00
Naisila Puka	072ae44742	Adjusts query's CoerceViaIO & RelabelType nodes that are improper for deparsing (#6391 ) Adjusts query's CoerceViaIO & RelabelType nodes that are improper for deparsing The standard planner converts some `::text` casts to `::cstring` and here we convert back because `cstring` is a pseudotype and it cannot be casted to most types. This problem occurs in CoerceViaIO nodes. There was another problem with RelabelType nodes fixed in the following PR: https://github.com/citusdata/citus/pull/4580 We undo the changes in that PR, and fix both CoerceViaIO and RelabelType nodes in the planning phase (not in the deparsing phase in ruleutils) Fixes https://github.com/citusdata/citus/issues/5646 Fixes https://github.com/citusdata/citus/issues/5033 Fixes https://github.com/citusdata/citus/issues/6061	2023-05-04 16:46:02 +03:00
Önder Kalacı	1662694471	Update CHANGELOG.md (#6907 ) Change `citus_stats_tenants` to `citus_stat_tenants` Thanks @clairegiordano for noticing	2023-05-04 11:45:02 +03:00
Onur Tirtir	aeaa48c197	Add support for creating distributed tables without shard key [merging the main devel branch] (#6867 ) DESCRIPTION: Adds support for creating distributed tables without shard key Commits proposed in this PR have already been reviewed in other PRs noted for each commit. With this PR, we allow creating distributed tables without specifying a shard key via create_distributed_table(). Here are the important details about those tables: * Specifying `shard_count` is not allowed because it is assumed to be 1. * We mostly call such tables "single-shard" distributed tables in code / comments. * `colocate_with` param allows colocating such single-shard tables to each other. * We define this table type, i.e., SINGLE_SHARD_DISTRIBUTED, as a subclass of DISTRIBUTED_TABLE because we mostly want to treat them as distributed tables in terms of SQL / DDL / operation support. * Metadata for such tables looks like: - distribution method => DISTRIBUTE_BY_NONE - replication model => REPLICATION_MODEL_STREAMING - colocation id => != INVALID_COLOCATION_ID (distinguishes from Citus local tables) * We assign colocation groups for such tables to different nodes in a round-robin fashion based on the modulo of "colocation id". There is still more work that needs to be done, such as improving SQL support, making sure that Citus operations work well with such distributed tables and making sure that the latest features merged in at 11.3 / 12.0 (such as CDC) work fine. We will take care of them in subsequent PRs. In this release, we will build schema-based-sharding on top of this infrastructure. And it's likely that we will use this infra for some other nice features in future too.	2023-05-03 17:15:22 +03:00
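The round-robin placement mentioned in #6867 ("based on the modulo of colocation id") can be sketched like this. It is an illustration only: `pick_node` is a hypothetical helper, and how Citus orders the candidate nodes is not stated in the commit message, so a plain list order is assumed here.

```python
def pick_node(colocation_id, nodes):
    """Pick a node for a single-shard colocation group (hypothetical helper)."""
    if not nodes:
        raise ValueError("no nodes available to place the shard")
    # Consecutive colocation ids land on consecutive nodes, wrapping around.
    return nodes[colocation_id % len(nodes)]


workers = ["worker-1", "worker-2", "worker-3"]
placements = {cid: pick_node(cid, workers) for cid in range(90, 94)}
```

With three workers, colocation ids 90 through 93 map to the first, second, third and then first worker again.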
Ahmet Gedemenli	4321286005	Disable master_create_empty_shard udf for single shard tables (#6902 )	2023-05-03 17:02:43 +03:00
Onur Tirtir	db2514ef78	Call null-shard-key tables as single-shard distributed tables in code	2023-05-03 17:02:43 +03:00
Onur Tirtir	39b7711527	Add support for more pushable / non-pushable insert .. select queries with null-shard-key tables (#6823 ) * Add support for dist insert select by selecting from a reference table. This was the only pushable insert .. select case that #6773 didn't cover. * For the cases where we insert into a Citus table but the INSERT .. SELECT query cannot be pushed down, allow pull-to-coordinator when possible. Remove the checks that we had at the very beginning of CreateInsertSelectPlanInternal so that we can try insert .. select via pull-to-coordinator for the cases where we cannot push-down the insert .. select query. What we support via pull-to-coordinator is still limited due to the lack of logical planner support for SELECT queries, but this commit at least allows using pull-to-coordinator for the cases where the select query can be planned via router planner, without limiting ourselves to restrictive top-level checks. Also introduce some additional restrictions into CreateDistributedInsertSelectPlan for the cases that it was not checking for null-shard-key tables. Indeed, it would make more sense to have those checks for distributed tables in general, via separate PRs against main branch. See https://github.com/citusdata/citus/pull/6817. * Add support for inserting into a Postgres table.	2023-05-03 16:24:20 +03:00
Onur Tirtir	85745b46d5	Add initial sql support for distributed tables that don't have a shard key (#6773/#6822) Enable router planner and a limited version of INSERT .. SELECT planner for the queries that reference colocated null shard key tables. * SELECT / UPDATE / DELETE / MERGE is supported as long as it's a router query. * INSERT .. SELECT is supported as long as it only references colocated null shard key tables. Note that this is not only limited to distributed INSERT .. SELECT but also covers a limited set of query types that require pull-to-coordinator, e.g., due to LIMIT clause, generate_series() etc. ... (Ideally distributed INSERT .. SELECT could handle such queries too, e.g., when we're only referencing tables that don't have a shard key, but today this is not the case. See https://github.com/citusdata/citus/pull/6773#discussion_r1140130562.)	2023-05-03 16:24:20 +03:00
Onur Tirtir	ac0ffc9839	Add a config for arbitrary config tests where all the tables are null-shard-key tables (#6783/#6788)	2023-05-03 16:18:27 +03:00
Ahmet Gedemenli	cdf54ff4b1	Add DDL support for null-shard-key tables (#6778/#6784/#6787/#6859) Add tests for ddl coverage: * indexes * partitioned tables + indexes with long names * triggers * foreign keys * statistics * grant & revoke statements * truncate & vacuum * create/test/drop view that depends on a dist table with no shard key * policy & rls test * alter table add/drop/alter_type column (using sequences/different data types/identity columns) * alter table add constraint (not null, check, exclusion constraint) * alter table add column with a default value / set default / drop default * alter table set option (autovacuum) * indexes / constraints without names * multiple subcommands Adds support for * Creating new partitions after distributing (with null key) the parent table * Attaching partitions to a distributed table with null distribution key (and automatically distribute the new partition with null key as well) * Detaching partitions from it	2023-05-03 16:18:27 +03:00
Onur Tirtir	fa467e05e7	Add support for creating distributed tables with a null shard key (#6745 ) With this PR, we allow creating distributed tables without specifying a shard key via create_distributed_table(). Here are the important details about those tables: * Specifying `shard_count` is not allowed because it is assumed to be 1. * We mostly call such tables "null shard-key" tables in code / comments. * To avoid doing a breaking layout change in create_distributed_table(); instead of throwing an error, it will inform the user that `distribution_type` param is ignored unless it's explicitly set to NULL or 'h'. * `colocate_with` param allows colocating such null shard-key tables to each other. * We define this table type, i.e., NULL_SHARD_KEY_TABLE, as a subclass of DISTRIBUTED_TABLE because we mostly want to treat them as distributed tables in terms of SQL / DDL / operation support. * Metadata for such tables looks like: - distribution method => DISTRIBUTE_BY_NONE - replication model => REPLICATION_MODEL_STREAMING - colocation id => != INVALID_COLOCATION_ID (distinguishes from Citus local tables) * We assign colocation groups for such tables to different nodes in a round-robin fashion based on the modulo of "colocation id". Note that this PR doesn't care about DDL (except CREATE TABLE) / SQL / operation (i.e., Citus UDFs) support for such tables but adds a preliminary API.	2023-05-03 16:18:27 +03:00
aykut-bozkurt	2d005ac777	Query Generator Seed (#6883 ) - Give seed number as argument to query generator to reproduce a previous run. - Expose the difference between results, if any, as artifact on CI.	2023-05-03 15:54:11 +03:00
Teja Mupparti	e444dd4f3f	MERGE: Support reference table as source with local table as target	2023-05-02 11:37:29 -07:00
Hanefi Onaldi	efd41e8ea5	Bump columnar to 11.3 (#6898 ) When working on changelog, Marco suggested in https://github.com/citusdata/citus/pull/6856#pullrequestreview-1386601215 that we should bump columnar version to 11.3 as well. This PR aims to contain all the necessary changes to allow upgrades to and downgrades from 11.3.0 for columnar. Note that updating citus extension version does not affect columnar as the two extension versions are not really coupled. The same changes will also be applied to the release branch in https://github.com/citusdata/citus/pull/6897	2023-05-02 11:58:32 +03:00
Hanefi Onaldi	934430003e	Changelog entries for 11.3.0 (#6856 ) In this release, I tried something different. I experimented with adding the PR number and title to the changelog right before each changelog entry. This way, it is easier to track where a particular changelog entry comes from. After reviews are over, I plan to remove those lines with PR numbers and titles. I went through all the PRs that are merged after 11.2.0 release and came up with a list of PRs that may need help with changelog entries. You can see details on PRs grouped in several sections below. ## PRs with missing entries The following PRs below do not have a changelog entry. If you think that this is a mistake, please share it in this PR along with a suggestion on what the changelog item should be. PR #6846 : fix 3 flaky tests in failure schedule PR #6844 : Add CPU usage to citus_stat_tenants PR #6833 : Fix citus_stat_tenants period updating bug PR #6787 : Add more tests for ddl coverage PR #6842 : Add build-cdc-* temporary directories to .gitignore PR #6841 : Add build-cdc-* temporary directories to .gitignore PR #6840 : Bump Citus to 12.0devel PR #6824 : Fixes flakiness in multi_metadata_sync test PR #6811 : Backport identity column improvements to v11.2 PR #6830 : In run_test.py actually return worker_count PR #6825 : Fixes flakiness in multi_cluster_management test PR #6816 : Refactor run_test.py PR #6817 : Explicitly disallow local rels when inserting into dist table PR #6821 : Rename citus stats tenants PR #6822 : Add some more tests for initial sql support PR #6819 : Fix flakyness in citus_split_shard_by_split_points_deferred_drop PR #6814 : Make python-regress based tests runnable with run_test.py PR #6813 : Fix flaky multi_mx_schema_support test PR #6720 : Convert columnar tap tests to pytest PR #6812 : Revoke statistics permissions from public and grant them to pg_monitor PR #6769 : Citus stats tenants guc PR #6807 : Fix the incorrect (constant) value passed to pointer-to-bool 
parameter, pass a NULL as the value is not used PR #6797 : Attribute local queries and cached plans on local execution PR #6796 : Parse the annotation string correctly PR #6762 : Add logs to citus_stats_tenants PR #6773 : Add initial sql support for distributed tables that don't have a shard key PR #6792 : Disentangle MERGE planning code from the modify-planning code path PR #6761 : Citus stats tenants collector view PR #6791 : Make 8 more tests runnable multiple times via run_test.py PR #6786 : Refactor some of the planning code to accommodate a new planning path for MERGE SQL PR #6789 : Rename AllRelations.. functions to AllDistributedRelations.. PR #6788 : Actually skip arbitrary_configs_router & nested_execution for AllNullDistKeyDefaultConfig PR #6783 : Add a config for arbitrary config tests where all the tables are null-shard-key tables PR #6784 : Fix attach partition: citus local to null distributed PR #6782 : Add an arbitrary config test heavily based on multi_router_planner_fast_path.sql PR #6781 : Decide what to do with router planner error at one place PR #6778 : Support partitioning for dist tables with null dist keys PR #6766 : fix pip lock file PR #6764 : Make workerCount configurable for regression tests PR #6745 : Add support for creating distributed tables with a null shard key PR #6696 : This implements MERGE phase-III PR #6767 : Add pytest depedencies to Pipfile PR #6760 : Decide core distribution params in CreateCitusTable PR #6759 : Add multi_create_fdw into minimal_schedule PR #6743 : Replace CITUS_TABLE_WITH_NO_DIST_KEY checks with HasDistributionKey() PR #6751 : Stabilize single_node.sql and others that report illegal node removal PR #6742 : Refactor CreateDistributedTable() PR #6747 : Remove unused lock functions PR #6744 : Fix multiple output version arbitrary config tests PR #6741 : Stabilize single node tests PR #6740 : Fix string eval bug in migration files check PR #6736 : Make run_test.py and create_test.py importable without errors 
PR #6734 : Don't blanket ignore flake8 E402 error PR #6737 : Fixes bookworm packaging pipeline problem PR #6735 : Fix run_test.py on python 3.9 PR #6733 : MERGE: In deparser, add missing check for RETURNING clause. PR #6714 : Remove auto_explain workaround in citus explain hook for ALTER TABLE PR #6719 : Fix flaky test PR #6718 : Add more powerfull dependency tracking to run_test.py PR #6710 : Install non-vulnerable cryptography package PR #6711 : Support compilation and run tests on latest PG versions PR #6700 : Add auto-formatting and linting to our python code PR #6707 : Allow multi_insert_select to run repeatably PR #6708 : Fix flakyness in failure_create_distributed_table_non_empty PR #6698 : Miscellaneous cleanup PR #6704 : Update README for 11.2 PR #6703 : Fix dubious ownership error from git PR #6690 : Bump Citus to 11.3devel ## Too long changelog entries The following PRs have changelog entries that are too long to fit in a single line. I'd expect authors to supply at changelog entries in `DESCRIPTION:` lines that are at most 78 characters. If you want to supply multi-line changelog items, you can have multiple lines that start with `DESCRIPTION:` instead. PR #6837 : fixes update propagation bug when `citus_set_coordinator_host` is called more than once PR #6738 : Identity column implementation refactorings PR #6756 : Schedule parallel shard moves in background rebalancer by removing task dependencies between shard moves across colocation groups. PR #6793 : Add a GUC to disallow planning the queries that reference non-colocated tables via router planner PR #6726 : fix memory leak during altering distributed table with a lot of partition and shards PR #6722 : fix memory leak during distribution of a table with a lot of partitions PR #6693 : prevent memory leak during ConvertTable with a lot of partitions ## Empty changelog entries. The following PR had an empty `DESCRIPTION:` line. This generates an empty changelog line that needs to be removed manually. 
Please either provide a short entry, or remove `DESCRIPTION:` line completely. PR #6810 : Make CDC decoder an independent extension PR #6827 : Makefile changes to build CDC in builddir for pgoutput and wal2json. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-05-02 11:29:24 +03:00
Ahmet Gedemenli	59ccf364df	Ignore nodes not allowed for shards, when planning rebalance steps (#6887 ) We are handling colocation groups with shard group count less than the worker node count, using a method different than the usual rebalancer. See #6739 While making the decision of using this method or not, we should've ignored the nodes that are marked `shouldhaveshards = false`. This PR excludes those nodes when making the decision. Adds a test such that: coordinator: [] worker 1: [1_1, 1_2] worker 2: [2_1, 2_2] (rebalance) coordinator: [] worker 1: [1_1, 2_1] worker 2: [1_2, 2_2] If we take the coordinator into account, the rebalancer considers the first state as balanced and does nothing (because shard_count < worker_count) But with this pr, we ignore the coordinator because it's shouldhaveshards = false So the rebalancer distributes each colocation group to both workers Also, fixes an unrelated flaky test in the same file	2023-05-01 12:21:08 +02:00
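The decision described in this commit can be sketched in a few lines. This is only an illustration under stated assumptions, not the rebalancer's actual code: the `Node` class and the function name `uses_small_group_method` are hypothetical, and the only rule taken from the message is that nodes with `shouldhaveshards = false` are left out of the node count compared against the shard group count.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    # Hypothetical stand-in for a pg_dist_node row.
    name: str
    should_have_shards: bool


def uses_small_group_method(shard_group_count, nodes):
    """Return True when the colocation group has fewer shard groups than
    nodes that are allowed to hold shards (assumed decision rule)."""
    eligible = [n for n in nodes if n.should_have_shards]
    return shard_group_count < len(eligible)


nodes = [
    Node("coordinator", should_have_shards=False),
    Node("worker 1", should_have_shards=True),
    Node("worker 2", should_have_shards=True),
]

# Counting the coordinator would give 2 < 3; ignoring it gives 2 < 2.
counted_naively = 2 < len(nodes)
counted_with_fix = uses_small_group_method(2, nodes)
```

With the coordinator excluded, a colocation group with two shard groups no longer takes the special path, so the usual rebalancer spreads it across both workers as in the test scenario above.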
aykut-bozkurt	8cb69cfd13	break sequence dependency during table creation (#6889 ) We need to break sequence dependency for a table while creating the table during non-transactional metadata sync to ensure idempotency of the creation of the table. Problem: When we send `SELECT pg_catalog.worker_drop_sequence_dependency(logicalrelid::regclass::text) FROM pg_dist_partition` to workers during the non-transactional sync, table might not be in `pg_dist_partition` at worker, and sequence dependency is not broken at the worker. Solution: We break sequence dependency via `SELECT pg_catalog.worker_drop_sequence_dependency(logicalrelid::regclass::text)` for each table while creating it at the workers. It is safe to send since the udf is a no-op when there is no sequence dependency. DESCRIPTION: Fixes a bug related to sequence idempotency at non-transactional sync. Fixes https://github.com/citusdata/citus/issues/6888.	2023-04-28 15:09:09 +03:00
Hanefi Onaldi	135aaf45ca	Add missing entry for 10.0.8 (#6891 ) When creating tags for backport releases, I realized that I missed one changelog item. Adding it on the default branch in a commit. See #6885 for the relevant PR for the release branch.	2023-04-27 16:01:04 +03:00
aykut-bozkurt	a7fa1db696	fix flaky test regex (#6890 ) There was a bug related to regex. We sometimes caught the wrong line when the test name is also included in comments. Example: We caught the wrong line as multi_metadata_sync is included in the comment before the test line. ``` # ---------- # multi_metadata_sync tests the propagation of mx-related metadata changes to metadata workers # multi_unsupported_worker_operations tests that unsupported operations error out on metadata workers # ---------- test: multi_metadata_sync ``` Solution: Restrict regex rule better.	2023-04-27 13:14:40 +03:00
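A minimal sketch of the kind of restriction described here, assuming the lookup only needs to find the `test:` line that names a test. The pattern and the helper `find_test_line` are illustrative assumptions, not the regex actually used in the repository.

```python
import re

SCHEDULE = """\
# ----------
# multi_metadata_sync tests the propagation of mx-related metadata changes to metadata workers
# ----------
test: multi_metadata_sync
"""


def find_test_line(schedule_text, test_name):
    """Return the first schedule line that starts with 'test:' and names
    test_name as a whole word, or None if there is no such line."""
    pattern = re.compile(
        r"^test:.*\b" + re.escape(test_name) + r"\b", re.MULTILINE
    )
    match = pattern.search(schedule_text)
    return match.group(0) if match else None


# A loose pattern hits the comment first; the anchored one does not.
loose_hit = re.search(r".*multi_metadata_sync.*", SCHEDULE).group(0)
strict_hit = find_test_line(SCHEDULE, "multi_metadata_sync")
```

Anchoring on `^test:` is what keeps comment lines out; the word boundaries additionally keep a name from matching as a prefix of a longer test name.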
Hanefi Onaldi	5fc5931506	Skip some versions on changelog (#6882 ) We had 10.1.5, 10.0.7, and 9.5.11 in the changelog, but those versions are already used in enterprise repository. This commit skips those versions and uses 10.1.6, 10.0.8, and 9.5.12 instead to prevent clashes.	2023-04-26 12:05:27 +03:00
Hanefi Onaldi	15152eac94	Add changelog entries for backport releases (#6869 ) We plan to have a series of backport releases. This PR contains separate commits for each patch version for 11.2 to 9.5 major versions. We plan to cherry pick each commit to relevant release branches and hence the need to have separate commits for each version.	2023-04-25 13:21:08 +03:00
Hanefi Onaldi	f7fd0dbae7	Add changelog entries for 11.2.1	2023-04-25 13:06:59 +03:00
Hanefi Onaldi	c36adc8426	Add changelog entries for 11.1.6	2023-04-25 13:06:01 +03:00
Hanefi Onaldi	214bc39a5a	Add changelog entries for 11.0.8	2023-04-25 13:05:44 +03:00
Hanefi Onaldi	65f957d345	Add changelog entries for 10.2.9	2023-04-25 13:05:20 +03:00
Hanefi Onaldi	db77cb084b	Add changelog entries for 10.1.5	2023-04-25 13:04:58 +03:00
Hanefi Onaldi	61c7cc0a96	Add changelog entries for 10.0.7	2023-04-25 13:04:27 +03:00
Hanefi Onaldi	da71b74f1d	Add changelog entries for 9.5.11	2023-04-25 13:03:23 +03:00
Jelte Fennema	a5f4fece13	Fix running PG upgrade tests with run_test.py (#6829 ) In #6814 we started using the Python test runner for upgrade tests in run_test.py, instead of the Perl based one. This had a problem though, not all tests in minimal_schedule can be run with the Python runner. This adds a separate minimal schedule for the pg_upgrade tests which doesn't include the tests that break with the Python runner. This PR also fixes various other issues that came up while testing the upgrade tests.	2023-04-24 15:54:32 +02:00
aykut-bozkurt	a6a7271e63	Query generator test tool (#6686 ) - Query generator is used to create queries, allowed by the grammar which is documented at `query_generator/query_gen.py` (currently contains only joins). - This PR adds a CI test which utilizes the query generator to compare the results of generated queries that are executed on Citus tables and local (undistributed) tables. It fails if there is an unexpected error at results. The error can be related to Citus, the query generator, or even Postgres. - The tool is configured by the file `query_generator/config/config.yaml`, which limits table counts at generated queries and sets many table related parameters (e.g. row count). - Run time of the CI task can be configured from the config file. By default, we run 250 queries with maximum table count of 40 inside each query.	2023-04-23 20:28:26 +03:00
aykut-bozkurt	08e2820c67	skip restriction clause if it contains placeholdervar (#6857 ) `PlaceHolderVar` is not relevant to be processed inside a restriction clause. Otherwise, `pull_var_clause_default` would throw error. PG would create the restriction to physical `Var` that `PlaceHolderVar` points to anyway, so it is safe to skip this restriction. DESCRIPTION: Fixes a bug related to WHERE clause list which contains placeholder. Fixes https://github.com/citusdata/citus/issues/6758	2023-04-17 18:14:01 +03:00
Emel Şimşek	2675a68218	Make coordinator always in metadata by default in regression tests. (#6847 ) DESCRIPTION: Changes the regression test setups adding the coordinator to metadata by default. When creating a Citus cluster, coordinator can be added in metadata explicitly by running `citus_set_coordinator_host ` function. Adding the coordinator to metadata allows to create citus managed local tables. Other Citus functionality is expected to be unaffected. This change adds the coordinator to metadata by default when creating test clusters in regression tests. There are 3 ways to run commands in a sql file (or a schedule which is a sequence of sql files) with Citus regression tests. Below is how this PR adds the coordinator to metadata for each. 1. `make <schedule_name>` Changed the sql files (sql/multi_cluster_management.sql and sql/minimal_cluster_management.sql) which sets up the test clusters such that they call `citus_set_coordinator_host`. This ensures any following tests will have the coordinator in metadata by default. 2. `citus_tests/run_test.py <sql_file_name>` Changed the python code that sets up the cluster to always call ` citus_set_coordinator_host`. For the upgrade tests, a version check is included to make sure `citus_set_coordinator_host` function is available for a given version. 3. ` make check-arbitrary-configs ` Changed the python code that sets up the cluster to always call `citus_set_coordinator_host `. #6864 will be used to track the remaining work which is to change the tests where coordinator is added/removed as a node.	2023-04-17 14:14:37 +03:00
Gokhan Gulbiz	8782ea1582	Ensure partitionKeyValue and colocationId are set for proper tenant stats gathering (#6834 ) This PR updates the tenant stats implementation to set partitionKeyValue and colocationId in ExecuteLocalTaskListExtended, in addition to LocallyExecuteTaskPlan. This ensures that tenant stats can be properly gathered regardless of the code path taken. The changes were initially made while testing stored procedure calls for tenant stats.	2023-04-17 09:35:26 +03:00
Onur Tirtir	f87a2d02b0	Move the common logic related to creating a Citus table down to CreateCitusTable (#6836 ) .. rather than having it in user facing functions. That way, we can use the same logic for creating Citus tables from other places too. This would be useful for creating tenant tables via a simple function call in the utility hook, for schema-based sharding purposes.	2023-04-14 16:13:39 +03:00
aykut-bozkurt	3286ec59e9	fix 3 flaky tests in failure schedule (#6846 ) Fixed 3 flaky tests in failure tests which caused flakiness in other tests due to changed node and group sequence ids during node addition-removal.	2023-04-13 13:13:28 +03:00
Halil Ozan Akgül	9ba70696f7	Add CPU usage to citus_stat_tenants (#6844 ) This PR adds CPU usage to `citus_stat_tenants` monitor. CPU usage is tracked in periods, similar to query counts.	2023-04-12 16:23:00 +03:00
Emel Şimşek	e7a25d82c9	When creating a HTAB we need to use HASH_COMPARE flag in order to set a user defined comparison function. (#6845 ) DESCRIPTION: Fixes memory errors, caught by valgrind, of type "conditional jump or move depends on uninitialized value" When running Citus tests under Postgres with valgrind, the test cases calling into `NonBlockingShardSplit` function produce valgrind errors of type "conditional jump or move depends on uninitialized value". The issue is caused by creating a HTAB in a wrong way. HASH_COMPARE flag should have been used when creating a HTAB with user defined comparison function. In the absence of HASH_COMPARE flag, HTAB falls back into built-in string comparison function. However, valgrind somehow discovers that the match function is not assigned to the user defined function as intended. Fixes #6835	2023-04-11 21:24:33 +03:00
Halil Ozan Akgül	8b50e95dc8	Fix citus_stat_tenants period updating bug (#6833 ) Fixes the bug that causes updating the citus_stat_tenants periods incorrectly. `TimestampDifferenceExceeds` expects the difference in milliseconds but it was microseconds, this is fixed. `tenantStats->lastQueryTime` was updated during monitoring too, now it's updated only when there are tenant queries.	2023-04-11 17:40:07 +03:00
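The unit mismatch is easy to reproduce in a small model. The sketch below assumes timestamps are stored as microsecond counts and mirrors the comparison `TimestampDifferenceExceeds` performs on a millisecond threshold; the Python function and the 60-second period are illustrative assumptions, not Citus code.

```python
USECS_PER_MSEC = 1000


def timestamp_difference_exceeds(start_usec, stop_usec, msec):
    """Model of the check: timestamps are in microseconds, the
    threshold is in milliseconds."""
    return (stop_usec - start_usec) >= msec * USECS_PER_MSEC


period_usec = 60 * 1_000_000      # an assumed 60 second period
start = 0
stop = 61 * 1_000_000             # 61 seconds later

# Buggy call: the period is passed in microseconds, so the effective
# threshold becomes 1000 times too long and the period never rolls over.
buggy = timestamp_difference_exceeds(start, stop, period_usec)

# Fixed call: convert the period to milliseconds first.
fixed = timestamp_difference_exceeds(start, stop, period_usec // USECS_PER_MSEC)
```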
aykut-bozkurt	a20f7e1a55	fixes update propagation bug when `citus_set_coordinator_host` is called more than once (#6837 ) DESCRIPTION: Fixes update propagation bug when `citus_set_coordinator_host` is called more than once. Fixes https://github.com/citusdata/citus/issues/6731.	2023-04-11 11:27:16 +03:00
rajeshkt78	1713246e1b	Add build-cdc-* temporary directories to .gitignore (#6841 ) The CDC decoder builds different versions of CDC base decoders during the build. Since the source files are copied to the temporary directories, they show up in git status as files to be added. So these directories and a temporary CDC TAP test directory (tmpcheck) are added to the .gitignore file.	2023-04-10 15:40:20 +05:30
Onur Tirtir	0194657c5d	Bump Citus to 12.0devel (#6840 )	2023-04-10 12:05:18 +03:00
rajeshkt78	29c8d9633a	Makefile changes to build CDC in builddir for pgoutput and wal2json. (#6827 ) DESCRIPTION: Makefile changes to build different versions of CDC decoder for different base decoders like pgoutput and wal2json with the same name and copy it to $packagelib/cdc_decoders dir. This helps the user to use logical replication slots normally with pgoutput without being aware of CDC decoder. 1) Changed src/backend/distributed/cdc/Makefile to set up a build directory for CDC in build-cdc-$(DECODER) dir and copy the source files (.c, .h and Makefile.decoder) to the build dir and build it for each base decoder. 2) Copy the pgoutput.so and wal2json.so into the above build dir and install them in PG packagelibdir/citus_decoders directory. 3) Added a testcase 016_cdc_wal2json.pl for testing the wal2json decoder using pg_recv_logical_changes function.	2023-04-06 17:03:12 +05:30
Naisila Puka	84f2d8685a	Adds control for background task executors involving a node (#6771 ) DESCRIPTION: Adds control for background task executors involving a node ### Background and motivation Nonblocking concurrent task execution via background workers was introduced in [#6459](https://github.com/citusdata/citus/pull/6459), and concurrent shard moves in the background rebalancer were introduced in [#6756](https://github.com/citusdata/citus/pull/6756) - with a hard dependency that limits to 1 shard move per node. As we know, a shard move consists of a shard moving from a source node to a target node. The hard dependency was used because the background task runner didn't have an option to limit the parallel shard moves per node. With the motivation of controlling the number of concurrent shard moves that involve a particular node, either as source or target, this PR introduces a general new GUC citus.max_background_task_executors_per_node to be used in the background task runner infrastructure. So, why do we even want to control and limit the concurrency? Well, it's all about resource availability: because the moves involve the same nodes, extra parallelism won’t make the rebalance complete faster if some resource is already maxed out (usually cpu or disk). Or, if the cluster is being used in a production setting, the moves might compete for resources with production queries much more than if they had been executed sequentially. ### How does it work? A new column named nodes_involved is added to the catalog table that keeps track of the scheduled background tasks, pg_dist_background_task. It is of type integer[] - to store a list of node ids. It is NULL by default - the column will be filled by the rebalancer, but we may not care about the nodes involved in other uses of the background task runner. 
Table "pg_catalog.pg_dist_background_task" Column \| Type ============================================ job_id \| bigint task_id \| bigint owner \| regrole pid \| integer status \| citus_task_status command \| text retry_count \| integer not_before \| timestamp with time zone message \| text +nodes_involved \| integer[] A hashtable named ParallelTasksPerNode keeps track of the number of parallel running background tasks per node. An entry in the hashtable is as follows: ParallelTasksPerNodeEntry { node_id // The node is used as the hash table key counter // Number of concurrent background tasks that involve node node_id // The counter limit is citus.max_background_task_executors_per_node } When the background task runner assigns a runnable task to a new executor, it increments the counter for each of the nodes involved with that runnable task. The limit of each counter is citus.max_background_task_executors_per_node. If the limit is reached for any of the nodes involved, this runnable task is skipped. And then, later, when the running task finishes, the background task runner decrements the counter for each of the nodes involved with the finished task. The following functions take care of these increment-decrement steps: IncrementParallelTaskCountForNodesInvolved(task) DecrementParallelTaskCountForNodesInvolved(task) citus.max_background_task_executors_per_node can be changed on the fly. In the background rebalancer, we simply give {source_node, target_node} as the nodesInvolved input to the ScheduleBackgroundTask function. The rest is taken care of by the general background task runner infrastructure explained above. Check background_task_queue_monitor.sql and background_rebalance_parallel.sql tests for detailed examples. #### Note This PR also adds a hard node dependency if a node is first being used as a source for a move, and then later as a target. The reason this should be a hard dependency is that the first move might make space for the second move. 
So, we could run out of disk space (or at least overload the node) if we move the second shard to it before the first one is moved away. Fixes https://github.com/citusdata/citus/issues/6716	2023-04-06 14:12:39 +03:00
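The counter bookkeeping described in this commit can be modelled in a few lines. This is a sketch under assumptions, not the C implementation: the class and its `try_start` and `finish` methods are hypothetical names standing in for the increment and decrement steps, and a plain dictionary stands in for the `ParallelTasksPerNode` hash table.

```python
from collections import defaultdict


class ParallelTasksPerNode:
    """Toy model of the per-node counters kept by the task runner."""

    def __init__(self, max_executors_per_node):
        # Stands in for citus.max_background_task_executors_per_node.
        self.limit = max_executors_per_node
        self.counters = defaultdict(int)

    def try_start(self, nodes_involved):
        """Increment the counter of every involved node, or skip the
        task when any of those nodes is already at the limit."""
        nodes = nodes_involved or ()   # NULL nodes_involved: no limit applies
        if any(self.counters[node] >= self.limit for node in nodes):
            return False
        for node in nodes:
            self.counters[node] += 1
        return True

    def finish(self, nodes_involved):
        """Decrement the counters once the running task is done."""
        for node in nodes_involved or ():
            self.counters[node] -= 1


tracker = ParallelTasksPerNode(max_executors_per_node=1)
first = tracker.try_start([1, 2])     # move from node 1 to node 2
second = tracker.try_start([2, 3])    # node 2 is busy, so this is skipped
third = tracker.try_start([3, 4])     # disjoint nodes, runs in parallel
tracker.finish([1, 2])
fourth = tracker.try_start([2, 5])    # node 2 is free again
```

A skipped task is not failed; in this model the caller would simply try it again on a later pass, which matches the "this runnable task is skipped" wording above.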
Gokhan Gulbiz	fa00fc6e3e	Add upgrade/downgrade paths between v11.2.2 and v11.3.1 (#6820 ) DESCRIPTION: PR description that will go into the change log, up to 78 characters --------- Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2023-04-06 12:46:09 +03:00
Ahmet Gedemenli	83a2cfbfcf	Move cleanup record test to upgrade schedule (#6794 ) DESCRIPTION: Move cleanup record test to upgrade schedule	2023-04-06 11:42:49 +03:00
Naisila Puka	fc479bfa49	Fixes flakiness in multi_metadata_sync test (#6824 ) Fixes flakiness in multi_metadata_sync test https://app.circleci.com/pipelines/github/citusdata/citus/31863/workflows/ea937480-a4cc-4646-815c-bb2634361d98/jobs/1074457 ```diff SELECT logicalrelid, repmodel FROM pg_dist_partition WHERE logicalrelid = 'mx_test_schema_1.mx_table_1'::regclass OR logicalrelid = 'mx_test_schema_2.mx_table_2'::regclass; logicalrelid \| repmodel -----------------------------+---------- - mx_test_schema_1.mx_table_1 \| s mx_test_schema_2.mx_table_2 \| s + mx_test_schema_1.mx_table_1 \| s (2 rows) ``` This is a simple issue of missing `ORDER BY` clauses. I went ahead and added some other missing ones in the same file as well. Also, I replaced existing `ORDER BY logicalrelid` with `ORDER BY logicalrelid::text`, in order to compare names, not OIDs.	2023-04-06 11:19:32 +03:00
Halil Ozan Akgül	52ad2d08c7	Multi tenant monitoring (#6725 ) DESCRIPTION: Adds views that monitor statistics on tenant usages This PR adds `citus_stats_tenants` view that monitors the tenants on the cluster. `citus_stats_tenants` shows the node id, colocation id, tenant attribute, read count in this period and last period, and query count in this period and last period of the tenant. Tenant attribute currently is the tenant's distribution column value; later, when schema based sharding is introduced, this meaning might change. A period is a time bucket the queries are counted by. Read and query counts for this period can increase until the current period ends. After that those counts are moved to last period's counts, which cannot change. The period length can be set using 'citus.stats_tenants_period'. `SELECT` queries are counted as _read_ queries, `INSERT`, `UPDATE` and `DELETE` queries are counted as _write_ queries. So in the view read counts are `SELECT` counts and query counts are `SELECT`, `INSERT`, `UPDATE` and `DELETE` counts. The data is stored in shared memory, in a struct named `MultiTenantMonitor`. `citus_stats_tenants` shows the data from local tenants. `citus_stats_tenants` shows up to `citus.stats_tenant_limit` number of tenants. The tenants are scored based on the number of queries they run and the recency of those queries. Every query run increases the score of the tenant by `ONE_QUERY_SCORE`, and after every period ends the scores are halved. Halving is done lazily. To retain information longer, the monitor keeps up to 3 times `citus.stats_tenant_limit` tenants. When the tenant count hits `3 * citus.stats_tenant_limit`, the last `citus.stats_tenant_limit` tenants are removed. To see all stored tenants you can use `citus_stats_tenants(return_all_tenants := true)` - [x] Create collector view that gets data from all nodes. 
#6761 - [x] Add monitoring log #6762 - [x] Create enable/disable GUC #6769 - [x] Parse the annotation string correctly #6796 - [x] Add local queries and prepared statements #6797 - [x] Rename to citus_stat_statements #6821 - [x] Run pgbench - [x] Fix role permissions #6812 --------- Co-authored-by: Gokhan Gulbiz <ggulbiz@gmail.com> Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2023-04-05 17:44:17 +03:00
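The scoring rule (add `ONE_QUERY_SCORE` per query, halve at every period boundary, do the halving lazily) can be illustrated as below. This is a sketch under assumptions: the value given to `ONE_QUERY_SCORE`, the integer period index and the `TenantScore` class are made up for the example and are not taken from the Citus source.

```python
ONE_QUERY_SCORE = 1_000_000_000   # assumed value, for illustration only


class TenantScore:
    """Keeps a score that is halved once per elapsed period, applying
    the halvings only when the tenant is next touched."""

    def __init__(self):
        self.score = 0
        self.last_period = 0      # index of the period last accounted for

    def _catch_up(self, current_period):
        elapsed = current_period - self.last_period
        if elapsed > 0:
            # One halving per elapsed period, done in a single shift.
            self.score >>= elapsed
            self.last_period = current_period

    def record_query(self, current_period):
        self._catch_up(current_period)
        self.score += ONE_QUERY_SCORE

    def current(self, current_period):
        self._catch_up(current_period)
        return self.score


tenant = TenantScore()
tenant.record_query(current_period=0)
tenant.record_query(current_period=0)
after_one_period = tenant.current(current_period=1)
after_three_periods = tenant.current(current_period=3)
```

Doing the halving lazily means idle tenants cost nothing at period boundaries; their scores are only brought up to date when they are read or updated.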
Jelte Fennema	d04d32b314	In run_test.py actually return worker_count (#6830 ) Fixes a small mistake that was missed in the refactor of run_test.py that was done in #6816.	2023-04-05 16:38:57 +03:00
Naisila Puka	eda3cc418a	Fixes flakiness in multi_cluster_management test (#6825 ) Fixes flakiness in multi_cluster_management test https://app.circleci.com/pipelines/github/citusdata/citus/31816/workflows/2f455a30-1c0b-4b21-9831-f7cf2169df5a/jobs/1071444 ```diff SELECT public.wait_until_metadata_sync(); +WARNING: waiting for metadata sync timed out wait_until_metadata_sync -------------------------- (1 row) ``` Default timeout value is 15000. I increased it to 60000.	2023-04-05 15:50:22 +03:00
Jelte Fennema	e5e5eb35c7	Refactor run_test.py (#6816 ) Over the last few months run_test.py got more and more complex. This refactors the code in `run_test.py` to be better understandable. Mostly this splits up separate pieces of logic into separate functions.	2023-04-05 11:11:30 +02:00
Onur Tirtir	d4f9de7875	Explicitly disallow local rels when inserting into dist table (#6817 )	2023-04-04 17:46:43 +02:00
Jelte Fennema	dcee370270	Fix flakyness in citus_split_shard_by_split_points_deferred_drop (#6819 ) In CI we would sometimes get this failure: ```diff -- The original shard is marked for deferred drop with policy_type = 2. -- The previous shard should be dropped at the beginning of the second split call SELECT * from pg_dist_cleanup; record_id \| operation_id \| object_type \| object_name \| node_group_id \| policy_type -----------+--------------+-------------+--------------------------------------------------------------------------+---------------+------------- + 60 \| 778 \| 3 \| citus_shard_split_slot_18_21216_778 \| 16 \| 0 512 \| 778 \| 1 \| citus_split_shard_by_split_points_deferred_schema.table_to_split_8981001 \| 16 \| 2 -(1 row) +(2 rows) ``` Replication slots sometimes cannot be deleted right away, which is hard to resolve, but luckily we can filter these cleanup records out easily by filtering by policy_type. While debugging this issue I learnt that we did not use `GetNextCleanupRecordId` in all places where we created cleanup records. This caused test failures when running tests multiple times, when they set `citus.next_cleanup_record_id`. I tried fixing that by calling GetNextCleanupRecordId in all places but that caused many other tests to fail due to deadlocks. So, instead this addresses that issue by using `ALTER SEQUENCE ... RESTART` instead of `citus.next_cleanup_record_id`. In a follow up PR we should probably get rid of `citus.next_cleanup_record_id`, since it's only used in one other file.	2023-04-04 09:45:48 +02:00
Marco Slot	7c0589abb8	Do not override combinefunc of custom aggregates with common names (#6805 ) DESCRIPTION: Fix an issue that caused some queries with custom aggregates to fail While playing around with https://github.com/pgvector/pgvector I noticed that the AVG query was broken. That's because we treat it as any other AVG by breaking it down in SUM and COUNT, but there are no SUM/COUNT functions in this case, but there is a perfectly usable combinefunc. This PR changes our aggregate logic to prefer custom aggregates with a combinefunc even if they have a common name. Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-04-03 19:43:09 +02:00
rajeshkt78	d5df892394	Make CDC decoder an independent extension (#6810 ) DESCRIPTION: - The CDC decoder is refactored into a separate extension that can be loaded dynamically without having to reload citus. - CDC decoder code can be compiled using the DECODER flag to work with different decoders like pgoutput and wal2json. By default the base decoder is "pgoutput". - The dynamic_library_path config is adjusted dynamically to prefer the decoders in cdc_decoders directory in citus init so that the users can use the replication subscription commands without having to make any config changes.	2023-04-03 21:32:15 +05:30
Ahmet Gedemenli	697bb55fc5	Refactor shard transfers (#6631 ) DESCRIPTION: Refactor and unify shard move and copy functions Shard move and copy functions share a lot of code in common. This PR unifies these functions into one, along with some helper functions. To preserve the current behavior, we'll introduce and use an enum parameter, and hardcoded strings for producing error/warning messages.	2023-04-03 10:43:54 +03:00
Jelte Fennema	92b358fe0a	Make python-regress based tests runnable with run_test.py (#6814 ) For some tests such as upgrade tests and arbitrary config tests we set up the citus cluster using Python. This setup is slightly different from the perl based setup script (`multi_regress.pl`). Most importantly it uses replication factor 1 by default. This changes our run_test.py script to be able to run a schedule using python instead of `multi_regress.pl`, for the tests that require it. For now arbitrary config tests are still not runnable with `run_test.py`, but this brings us one step closer to being able to do that. Fixes #6804	2023-03-31 17:07:12 +02:00
Marco Slot	343d1c5072	Refactor executor utility functions into multiple files (#6593 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-03-31 13:07:48 +02:00
Jelte Fennema	085b59f586	Fix flaky multi_mx_schema_support test (#6813 ) This happened sometimes: ```diff SELECT objid::oid::regnamespace as "Distributed Schemas" FROM pg_catalog.pg_dist_object WHERE objid::oid::regnamespace IN ('mx_old_schema', 'mx_new_schema'); Distributed Schemas --------------------- - mx_old_schema mx_new_schema + mx_old_schema (2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/31706/workflows/edc84a6a-dfef-42b3-ab5c-54daa64c2154/jobs/1065463 In passing make multi_mx_schema_support runnable with run_test.py	2023-03-31 12:36:53 +02:00
Jelte Fennema	7b60cdd13b	Convert columnar tap tests to pytest (#6720 ) Having as little Perl as possible in our repo seems a worthy goal. Sadly Postgres its Perl based TAP infrastructure was the only way in which we could run tests that were hard to do using only SQL commands. This change adds infrastructure to run such "application style tests" using python and converts all our existing Perl TAP tests to this new infrastructure. Some of the helper functions that are added in this PR are currently unused. Most of these will be used by the CDC PR that depends on this. Some others are there because they were needed by the PgBouncer test framework that this is based on, and the functions seemed useful enough to citus testing to keep. The main features of the test suite are: 1. Application style tests using a programming language that our developers know how to write. 2. Caching of Citus clusters in-between tests using the ["fixture" pattern][fixture] from `pytest` to achieve speedy tests. To make this work in practice any changes made during a test are automatically undone. Schemas, replication slots, subscriptions, publications are dropped at the end of each test. And any changes made by `ALTER SYSTEM` or manually editing of `pg_hba.conf` are undone too. 3. Automatic parallel execution of tests using the `-n auto` flag that's added by `pytest-xdist`. This improved the speed of tests greatly with the similar test framework I created for PgBouncer. Right now it doesn't help much yet though, since this PR only adds two tests (one of which takes ~10 times longer than the other). Possible future improvements are: 1. Clean up even more things at the end of each test (e.g. users that were created). These are fairly easy to add, but I have not done so yet since they were not needed yet for this PR or the CDC PR. So I would not be able to test the cleanup easily. 2. Support for query block detection similar to what we can now do using isolation tests. 
[fixture]: https://docs.pytest.org/en/6.2.x/fixture.html	2023-03-31 12:25:19 +02:00
Teja Mupparti	01ea5f58a9	Fix the incorrect value passed to pointer-to-bool parameter, pass a NULL as the value is not used for this invocation.	2023-03-30 10:45:32 -07:00
aykut-bozkurt	104e85e18f	stabilize metadata syncing (#6728 ) Motivation Some customers experienced out of memory or max allocation block size errors during metadata sync when they had a lot of shards, partitions, indexes, or columns. This PR has motivation to prevent those 2 types of memory failures to boost the scalability of Citus and unlock some customers with huge clusters by letting them add new nodes and upgrade their Citus version above 11.0 which introduced important features e.g. query from any node. Problems Memory errors are caused by the fact that we finish all the metadata sync operations within a single coordinated transaction, which causes mainly 3 problems: 1. Collecting metadata sync commands without freeing until the end of the transaction, 2. Each modification causes PG invalidations related to cache memory. PG stores those invalidations until the end of transaction (for visibility guarantees) to notify other backends about the invalidations. As we do a lot of modifications during the metadata syncing within single coordinated transaction, PG can sometimes exceed max allocation block size at worker nodes due to huge invalidation messages, 3. Citus has MetadataCacheMemory for fast access to metadata objects. To see the effects of the modifications inside the same transaction, we locally process PG invalidations and rebuild many objects without freeing invalidated ones until the end of transaction for simplicity. Solution We decided to add nontransactional mode for metadata sync, where we send each command in separate transaction and reset memory context after each transaction. User can switch to nontransactional mode via a GUC if they hit memory problems during the sync. (Default mode is transactional) We created a common api for both transactional (old mode) and nontransactional modes to have a uniform code and to not disturb test coverage by introducing new code paths. 
Below items are addressed for the solution: - [x] Commit-1 Add a method to send multiple commands to worker list reusing bare connections. Change will be useful for metadata sync api, - [x] Commit-2 Create MetadataSyncContext api to encapsulate both transactional and nontransactional modes, - [x] Commit-3 Let nontransactional sync mode create transaction per shell table during dropping the shell tables from worker, - [x] Commit-4 Add new metadata sync methods which uses MetadataSyncContext api so that during the sync we can 1. free memory to prevent OOM, 2. use either transactional or nontransactional modes according to the GUC `citus.metadata_sync_transaction_mode`. - [x] Commit-5 Let `ActivateNode` use new metadata sync api, - [x] Commit-6 Let `activate_node_snapshot` use new metadata sync api, - [x] Commit-7 Remove unused old metadata sync methods, - [x] Commit-8 Drop table, if exists, during table dependency creation, - [x] Commit-9 Do not enforce distributed transaction at `EnsureCoordinatorInitiatedOperation`, - [x] Commit-10 Do not acquire strict lock on separate transaction to localhost as we already take the lock before, - [x] Commit-11 Let `AddNodeMetadata` to use metadatasync api during `citus_add_node`, - [x] Commit-12 Force activated bare connections to close at transaction end, - [x] Commit-13 Add failure tests for nontransactional metadata sync mode, - [x] Verify OOM and max allowed allocation block errors do not happen with nontransactional sync mode. DESCRIPTION: Fixes memory leak and max allocation block errors during metadata syncing. DESCRIPTION: Introduces nontransactional mode for metadata sync. DESCRIPTION: Introduces the GUC `citus.metadata_sync_mode` to switch sync modes.	2023-03-30 11:21:13 +03:00
aykutbozkurt	dc57e4b2d8	PR #6728 / commit - 13 Add failure tests for nontransactional metadata sync mode.	2023-03-30 11:06:16 +03:00
aykutbozkurt	f2f0ec9dda	PR #6728 / commit - 12 Force activated bare connections to close at transaction end.	2023-03-30 11:06:16 +03:00
aykutbozkurt	35dbdae5a4	PR #6728 / commit - 11 Let AddNodeMetadata use metadatasync api during node addition.	2023-03-30 11:06:16 +03:00
aykutbozkurt	fe00b3263a	PR #6728 / commit - 10 Do not acquire strict lock on separate transaction to localhost as we already take the lock before. But make sure that caller has the ExclusiveLock.	2023-03-30 11:06:16 +03:00
aykutbozkurt	a74232bb39	PR #6728 / commit - 9 Do not enforce distributed transaction at `EnsureCoordinatorInitiatedOperation`.	2023-03-30 10:53:22 +03:00
aykutbozkurt	cf4e93a332	PR #6728 / commit - 8 Drop table, if exists, during table dependency creation.	2023-03-30 10:53:22 +03:00
aykutbozkurt	f8fb20cc95	PR #6728 / commit - 7 Remove unused old metadata sync methods.	2023-03-30 10:53:22 +03:00
aykutbozkurt	1fb3de14df	PR #6728 / commit - 6 Let `activate_node_snapshot` use new metadata sync api.	2023-03-30 10:53:22 +03:00
aykutbozkurt	bc25ba51c3	PR #6728 / commit - 5 Let `ActivateNode` use new metadata sync api.	2023-03-30 10:53:22 +03:00
aykutbozkurt	29ef9117e6	PR #6728 / commit - 4 Add new metadata sync methods which use the MetadataSyncContext api so that during the sync we can - free memory to prevent OOM, - use either transactional or nontransactional modes according to the GUC.	2023-03-30 10:53:22 +03:00
aykutbozkurt	8feb8c634a	PR #6728 / commit - 3 Let nontransactional sync mode create transaction per shell table during dropping the shell tables from worker.	2023-03-30 10:53:20 +03:00
aykutbozkurt	85d50203d1	PR #6728 / commit - 2 - Create MetadataSyncContext api to encapsulate both transactional and nontransactional modes, - Add a GUC to switch between metadata sync transaction modes.	2023-03-30 10:52:46 +03:00
aykutbozkurt	98abd68178	PR #6728 / commit - 1 Add a method to send multiple commands to worker list reusing the same bare connections. Change will be useful for metadata sync api.	2023-03-30 10:52:46 +03:00
Gokhan Gulbiz	e71bfd6074	Identity column implementation refactorings (#6738 ) This pull request proposes a change to the logic used for propagating identity columns to worker nodes in citus. Instead of creating a dependent sequence for each identity column and changing its default value to `nextval(seq)/worker_nextval(seq)`, this update will pass the identity columns as-is to the worker nodes. Please note that there are a few limitations to this change. 1. Only bigint identity columns will be allowed in distributed tables to ensure compatibility with the DDL from any node functionality. Our current distributed sequence implementation only allows insert statements from all nodes for bigint sequences. 2. `alter_distributed_table` and `undistribute_table` operations will not be allowed for tables with identity columns. This is because we do not have a proper way of keeping sequence states consistent across the cluster. DESCRIPTION: Prevents using identity columns on data types other than `bigint` on distributed tables DESCRIPTION: Prevents using `alter_distributed_table` and `undistribute_table` UDFs when a table has identity columns DESCRIPTION: Fixes a bug that prevents enforcing identity column restrictions on worker nodes Depends on #6740 Fixes #6694	2023-03-30 10:41:01 +03:00
Emel Şimşek	d3fb9288ab	Schedule parallel shard moves in background rebalancer by removing task dependencies between shard moves across colocation groups. (#6756 ) DESCRIPTION: This PR removes the task dependencies between shard moves for which the shards belong to different colocation groups. This change results in scheduling multiple tasks in the RUNNABLE state. Therefore it is possible that the background task monitor can run them concurrently. Previously, all the shard moves planned in a rebalance operation took dependency on each other sequentially. For instance, given the following table and shards colocation group 1 colocation group 2 table1 table2 table3 table4 table 5 shard11 shard21 shard31 shard41 shard51 shard12 shard22 shard32 shard42 shard52 if the rebalancer planner returned the below set of moves ` {move(shard11), move(shard12), move(shard41), move(shard42)}` background rebalancer scheduled them such that they depend on each other sequentially. ``` {move(reftables) if there is any, none} \| move( shard11) \| move(shard12) \| {move(shard41)<--- move(shard12)} This is an artificial dependency move(shard41) \| move(shard42) ``` This results in artificial dependencies between otherwise independent moves. Considering that the shards in different colocation groups can be moved concurrently, this PR changes the dependency relationship between the moves as follows: ``` {move(reftables) if there is any, none} {move(reftables) if there is any, none} \| \| move(shard11) move(shard41) \| \| move(shard12) move(shard42) ``` --------- Co-authored-by: Jelte Fennema <jelte.fennema@microsoft.com>	2023-03-29 22:03:37 +03:00
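A minimal Python sketch of the dependency rule described in d3fb9288ab, assuming each planned move is tagged with its colocation group: a move depends only on the previous move in the same group. The function name `plan_dependencies` and the tuple layout are hypothetical and are not taken from the Citus source; the optional reference-table move is left out for brevity.

```python
def plan_dependencies(moves):
    """Map each move to the move it must wait for, or None.

    `moves` is a list of (shard_name, colocation_group) pairs in the order
    returned by the (hypothetical) rebalance planner.
    """
    last_in_group = {}  # colocation group -> most recently scheduled move
    depends_on = {}
    for shard, group in moves:
        # Chain only within the same colocation group.
        depends_on[shard] = last_in_group.get(group)
        last_in_group[group] = shard
    return depends_on


moves = [("shard11", 1), ("shard12", 1), ("shard41", 2), ("shard42", 2)]
deps = plan_dependencies(moves)
```

With this rule, `shard41` no longer waits for `shard12`, so the first move of each colocation group can be runnable at the same time.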
Marco Slot	ce4bcf6de0	Propagate CREATE/ALTER/DROP PUBLICATION statements (#6776 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-03-29 15:25:35 +02:00
Gokhan Gulbiz	e618345703	Handle identity columns properly in the router planner (#6802 ) DESCRIPTION: Fixes a bug with insert..select queries with identity columns Fixes #6798	2023-03-29 15:50:12 +03:00
Marco Slot	e5fd1c3a87	Fix TAP tests after CREATE PUBLICATION changes	2023-03-29 00:59:12 +02:00
Marco Slot	8ad444f8ef	Hide shards from CDC subscriptions	2023-03-29 00:59:12 +02:00
Marco Slot	b09d239809	Propagate CREATE PUBLICATION statements	2023-03-29 00:59:12 +02:00
Teja Mupparti	37500806d6	Add appropriate locks for MERGE to run in parallel	2023-03-28 09:45:40 -07:00
rajeshkt78	85b8a2c7a1	CDC implementation for Citus using Logical Replication (#6623 ) Description: Implementing CDC changes using Logical Replication to avoid re-publishing events multiple times by setting up replication origin session, which will add "DoNotReplicateId" to every WAL entry. - shard splits - shard moves - create distributed table - undistribute table - alter distributed tables (for some cases) - reference table operations The citus decoder which will be decoding WAL events for CDC clients, ignores any WAL entry with replication origin that is not zero. It also maps the shard names to distributed table names.	2023-03-28 16:00:21 +05:30
Onur Tirtir	616b5018a0	Add a GUC to disallow planning the queries that reference non-colocated tables via router planner (#6793 ) Today we allow planning the queries that reference non-colocated tables if the shards that query targets are placed on the same node. However, this may not be the case, e.g., after rebalancing shards because it's not guaranteed to have those shards on the same node anymore. This commit adds citus.enable_non_colocated_router_query_pushdown GUC that can be used to disallow planning such queries via router planner, when it's set to false. Note that the default value for this GUC will be "true" for 11.3, but we will alter it to "false" on 12.0 to not introduce a breaking change in a minor release. Closes #692. Even more, allowing such queries to go through router planner also causes generating an incorrect plan for the DML queries that reference distributed tables that are sharded based on different replication factor settings. For this reason, #6779 can be closed after altering the default value for this GUC to "false", hence not now. DESCRIPTION: Adds `citus.enable_non_colocated_router_query_pushdown` GUC to ensure generating a consistent distributed plan for the queries that reference non-colocated distributed tables (when set to "false", the default is "true").	2023-03-28 13:10:29 +03:00
Teja Mupparti	9bab819f26	Disentangle MERGE planning code from the modify-planning code path	2023-03-27 10:41:46 -07:00
Onur Tirtir	372a93b529	Make 8 more tests runnable multiple times via run_test.py (#6791 ) Soon I will be doing some changes related to #692 in router planner and those changes require updating ~5/6 tests related to router planning. And to make those test files runnable by run_test.py multiple times, we need to make some other tests (that they're run in parallel / they badly depend on) ready for run_test.py too.	2023-03-27 12:19:06 +03:00
Teja Mupparti	da7db53c87	Refactor some of the planning code to accommodate a new planning path for MERGE SQL	2023-03-22 11:29:24 -07:00
Onur Tirtir	e1f1d63050	Rename AllRelations.. functions to AllDistributedRelations.. (#6789 ) Because they're only interested in distributed tables. Even more, this replaces HasDistributionKey() check with IsCitusTableType(DISTRIBUTED_TABLE) because this doesn't make a difference on main and sounds slightly more intuitive. Plus, this would also allow safely using this function in https://github.com/citusdata/citus/pull/6773.	2023-03-22 15:15:23 +03:00
Onur Tirtir	4960ced175	Add an arbitrary config test heavily based on multi_router_planner_fast_path.sql (#6782 ) This would be useful for testing #6773. This is because, given that #6773 only adds support for router / fast-path queries, theoretically almost all the tests that we have in that test file should work for null-shard-key tables too (and they indeed do). I deliberately did not replace multi_router_planner_fast_path.sql with the one that I'm adding into arbitrary configs because we might still want to see when we're able to go through fast-path planning for the usual distributed tables (the ones that have a shard key).	2023-03-22 10:49:08 +03:00
Ahmet Gedemenli	2713e015d6	Check before logicalrep for rebalancer, error if needed (#6754 ) DESCRIPTION: Check before logicalrep for rebalancer, error if needed Check if we can use logical replication or not, in case of shard transfer mode = auto, before executing the shard moves. If we can't, error out. Before this PR, we used to error out in the middle of shard moves: ```sql set citus.shard_count = 4; -- just to get the error sooner select citus_remove_node('localhost',9702); create table t1 (a int primary key); select create_distributed_table('t1','a'); create table t2 (a bigint); select create_distributed_table('t2','a'); select citus_add_node('localhost',9702); select rebalance_table_shards(); NOTICE: Moving shard 102008 from localhost:9701 to localhost:9702 ... NOTICE: Moving shard 102009 from localhost:9701 to localhost:9702 ... NOTICE: Moving shard 102012 from localhost:9701 to localhost:9702 ... ERROR: cannot use logical replication to transfer shards of the relation t2 since it doesn't have a REPLICA IDENTITY or PRIMARY KEY ``` Now we check and error out in the beginning, without moving the shards. fixes: #6727	2023-03-21 16:34:52 +03:00
Onur Tirtir	aa465b6de1	Decide what to do with router planner error at one place (#6781 )	2023-03-21 14:04:07 +03:00
aykut-bozkurt	aa33988c6e	fix pip lock file (#6766 ) ci/fix_styles.sh was complaining that the `black` and `isort` packages were not found, even after running `pipenv install --dev`, due to a broken lock file. I regenerated the lock file and now it works fine. We also wanted to upgrade the required python version for the Pipfile.	2023-03-21 00:58:12 +03:00
aykut-bozkurt	ea3093bdb6	Make workerCount configurable for regression tests (#6764 ) Make worker count flexible in our regression tests instead of hardcoding it to 2 workers.	2023-03-20 12:06:31 +03:00
Teja Mupparti	cf55136281	1) Restrict MERGE command INSERT to the source's distribution column Fixes #6672 2) Move all MERGE related routines to a new file merge_planner.c 3) Make ConjunctionContainsColumnFilter() static again, and rearrange the code in MergeQuerySupported() 4) Restore the original format in the comments section. 5) Add big serial test. Implement latest set of comments	2023-03-16 13:43:08 -07:00
Teja Mupparti	1e42cd3da0	Support MERGE on distributed tables with restrictions This implements the phase - II of MERGE sql support Support routable query where all the tables in the merge-sql are distributed, co-located, and both the source and target relations are joined on the distribution column with a constant qual. This should be a Citus single-task query. Below is an example. SELECT create_distributed_table('t1', 'id'); SELECT create_distributed_table('s1', 'id', colocate_with => 't1'); MERGE INTO t1 USING s1 ON t1.id = s1.id AND t1.id = 100 WHEN MATCHED THEN UPDATE SET val = s1.val + 10 WHEN MATCHED THEN DELETE WHEN NOT MATCHED THEN INSERT (id, val, src) VALUES (s1.id, s1.val, s1.src) Basically, MERGE checks to see if There are a minimum of two distributed tables (source and a target). All the distributed tables are indeed colocated. MERGE relations are joined on the distribution column MERGE .. USING .. ON target.dist_key = source.dist_key The query should touch only a single shard i.e. JOIN AND with a constant qual MERGE .. USING .. ON target.dist_key = source.dist_key AND target.dist_key = <> If any of the conditions are not met, it raises an exception. (cherry picked from commit `44c387b978`) This implements MERGE phase3 Support pushdown query where all the tables in the merge-sql are Citus-distributed, co-located, and both the source and target relations are joined on the distribution column. This will generate multiple tasks which execute independently after pushdown. 
SELECT create_distributed_table('t1', 'id'); SELECT create_distributed_table('s1', 'id', colocate_with => 't1'); MERGE INTO t1 USING s1 ON t1.id = s1.id WHEN MATCHED THEN UPDATE SET val = s1.val + 10 WHEN MATCHED THEN DELETE WHEN NOT MATCHED THEN INSERT (id, val, src) VALUES (s1.id, s1.val, s1.src) *The only exception for both the phases II and III is, UPDATEs and INSERTs must be done on the same shard-group as the joined key; for example, below scenarios are NOT supported as the key-value to be inserted/updated is not guaranteed to be on the same node as the id distribution-column. MERGE INTO target t USING source s ON (t.customer_id = s.customer_id) WHEN NOT MATCHED THEN - - INSERT(customer_id, …) VALUES (<non-local-constant-key-value>, ……); OR this scenario where we update the distribution column itself MERGE INTO target t USING source s ON (t.customer_id = s.customer_id) WHEN MATCHED THEN UPDATE SET customer_id = 100; (cherry picked from commit `fa7b8949a8`)	2023-03-16 13:43:08 -07:00
Jelte Fennema	b8b85072d6	Add pytest dependencies to Pipfile (#6767 ) In #6720 I'm adding a `pytest` based testing framework. This adds the dependencies for those. They have already been [merged into our docker files][the-process-merge] in the the-process repo in preparation for #6720. But by not having them on our citus main branch it is impossible to make changes to the Pipfile, because our CI Dockerfiles and master are out of date. Since #6720 will need some more discussion and might take a few more weeks to be merged, this takes out the Pipfile changes. By merging this PR we can unblock new Pipfile changes. Unblocks and partially addresses #6766 [the-process-merge]: https://github.com/citusdata/the-process/pull/117	2023-03-15 14:53:14 +01:00
Onur Tirtir	a0a41943d7	Remove pg_depend entries from columnar metadata indexes to columnar-am (inserted in #5456 ) (#6628 ) DESCRIPTION: Fixes (pg_dump/pg_upgrade) dependency loop warnings caused by pg_depend entries inserted by citus_columnar Fixes #5510. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because pg_dump doesn't see the objects that we use for columnar-am related bookkeeping as dependencies of the tables using columnar-am. To fix that, in #5456 we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensure the existence of a class of metadata objects --such as columnar.storageid_seq-- and helped fix #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such as columnar.stripe_pkey-- didn't help at all because they were causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is quite hard, we skip backporting this.	2023-03-15 01:24:57 +03:00
Onur Tirtir	9550ebd118	Remove pg_depend entries from columnar metadata indexes to columnar-am In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because pg_dump doesn't see the objects that we use for columnar-am related bookkeeping as dependencies of the tables using columnar-am. To fix that, in #5456 we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensure the existence of a class of metadata objects --such as columnar.storageid_seq-- and helped fix #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such as columnar.stripe_pkey-- didn't help at all because they were causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is quite hard, we skip backporting this.	2023-03-14 17:13:52 +03:00
Onur Tirtir	be0735a329	Use "cpp" to expand "#include" directives in columnar sql files	2023-03-14 17:13:52 +03:00
Onur Tirtir	2b4be535de	Do clean-up before upgrade_columnar_before to make it runnable multiple times So that flaky test detector can run upgrade_columnar_before.sql multiple times.	2023-03-14 17:13:52 +03:00
Onur Tirtir	994f67185f	Make upgrade_columnar_after runnable multiple times This commit hides port numbers in upgrade_columnar_after because the port numbers assigned to nodes in upgrade schedule differ from the ones that flaky test detector assigns.	2023-03-14 17:13:52 +03:00
Onur Tirtir	821f26cc74	Fix flaky test detection for upgrade tests When run_test.py is run for an upgrade_._after.sql file, automatically run the corresponding upgrade_._before.sql file first. This is because all those upgrade_._after.sql files depend on the objects created in upgrade_._before.sql files by definition.	2023-03-14 17:13:52 +03:00
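The before/after pairing described in 821f26cc74 could be sketched as follows. This is only an illustration under the assumption that test names follow an `upgrade_<something>_after` pattern; the helper name `before_test_for` is hypothetical and is not the actual run_test.py code.

```python
import re


def before_test_for(test_name):
    """Return the matching "before" test for an upgrade "after" test, else None."""
    match = re.fullmatch(r"(upgrade_.+)_after", test_name)
    if match is None:
        return None  # not an upgrade "after" test, nothing to run first
    return f"{match.group(1)}_before"


paired = before_test_for("upgrade_columnar_after")
unpaired = before_test_for("single_node")
```

A runner built this way would schedule `paired` ahead of the requested test whenever it is not `None`.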
Onur Tirtir	f68fc9e69c	Decide core distribution params in CreateCitusTable (#6760 ) Decide core distribution params in CreateCitusTable to reduce the chances of creating Citus tables based on incorrect combinations of distribution method and replication model params. Also introduce DistributedTableParams struct to encapsulate the parameters that are specific to distributed tables.	2023-03-14 14:24:52 +03:00
Onur Tirtir	cc945fa331	Add multi_create_fdw into minimal_schedule (#6759 ) So that we can run the tests that require fake_fdw by using minimal schedule too. Also move multi_create_fdw.sql up in multi_1_schedule to make it available to more tests.	2023-03-14 10:22:34 +03:00
Onur Tirtir	20a5f3af2b	Replace CITUS_TABLE_WITH_NO_DIST_KEY checks with HasDistributionKey() (#6743 ) Now that we will soon add another table type having DISTRIBUTE_BY_NONE as distribution method and that we want the code to interpret such tables mostly as distributed tables, let's make the definition of those other two table types more strict by removing CITUS_TABLE_WITH_NO_DIST_KEY macro. And instead, use HasDistributionKey() check in the places where the logic applies to all table types that have / don't have a distribution key. In future PRs, we might want to convert some of those HasDistributionKey() checks if logic only applies to Citus local / reference tables, not the others. And adding HasDistributionKey() also allows us to consider having DISTRIBUTE_BY_NONE as the distribution method as a "table attribute" that can apply to distributed tables too, rather something that determines the table type.	2023-03-10 13:55:52 +03:00
Onur Tirtir	e3cf7ace7c	Stabilize single_node.sql and others that report illegal node removal (#6751 ) See https://app.circleci.com/pipelines/github/citusdata/citus/30859/workflows/223d61db-8c1d-4909-9aea-d8e470f0368b/jobs/1009243.	2023-03-08 15:25:36 +03:00
Onur Tirtir	d82c11f793	Refactor CreateDistributedTable() (#6742 ) Split the main logic that allows creating a Citus table into the internal function CreateCitusTable(). Old CreateDistributedTable() function was assuming that it's creating a reference table when the distribution method is DISTRIBUTE_BY_NONE. However, soon this won't be the case when adding support for creating single-shard distributed tables because their distribution method would also be the same. Now the internal method CreateCitusTable() doesn't make any assumptions about table's replication model or such. Instead, it expects callers to properly set all such metadata bits. Even more, some of the parameters the old CreateDistributedTable() takes --such as the shard count-- were not meaningful for a reference table, and would be the same as for new table type.	2023-03-08 13:38:51 +03:00
Emel Şimşek	4043abd5aa	Exclude-Generated-Columns-In-Copy (#6721 ) DESCRIPTION: Fixes a bug in shard copy operations. For copying shards in both shard move and shard split operations, Citus uses the COPY statement. A COPY all statement in the following form ` COPY target_shard FROM STDIN;` throws an error when there is a GENERATED column in the shard table. In order to fix this issue, we need to exclude the GENERATED columns in the COPY and the matching SELECT statements. Hence this fix converts the COPY and SELECT all statements to the following form: ``` COPY target_shard (col1, col2, ..., coln) FROM STDIN; SELECT (col1, col2, ..., coln) FROM source_shard; ``` where (col1, col2, ..., coln) does not include a GENERATED column. GENERATED column values are created in the target_shard as the values are inserted. Fixes #6705. --------- Co-authored-by: Teja Mupparti <temuppar@microsoft.com> Co-authored-by: aykut-bozkurt <51649454+aykut-bozkurt@users.noreply.github.com> Co-authored-by: Jelte Fennema <jelte.fennema@microsoft.com> Co-authored-by: Gürkan İndibay <gindibay@microsoft.com>	2023-03-07 18:15:50 +03:00
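As a rough illustration of the fix in 4043abd5aa, the sketch below builds the COPY and SELECT statements from an explicit column list that skips GENERATED columns. It is a Python sketch under stated assumptions, not the Citus C implementation: the `(name, is_generated)` tuple layout and the function name are hypothetical, identifier quoting is omitted, and the SELECT lists the columns without parentheses.

```python
def shard_copy_statements(target_shard, source_shard, columns):
    """Build COPY/SELECT statements that leave out GENERATED columns.

    `columns` is a list of (column_name, is_generated) pairs (assumed layout).
    """
    names = ", ".join(name for name, is_generated in columns if not is_generated)
    copy_sql = f"COPY {target_shard} ({names}) FROM STDIN;"
    select_sql = f"SELECT {names} FROM {source_shard};"
    return copy_sql, select_sql


columns = [("id", False), ("val", False), ("val_doubled", True)]
copy_sql, select_sql = shard_copy_statements("target_shard", "source_shard", columns)
```

The generated column `val_doubled` is left out of both statements, so its value is computed on the target shard as rows are inserted.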
Ahmet Gedemenli	03f1bb70b7	Rebalance shard groups with placement count less than worker count (#6739 ) DESCRIPTION: Adds logic to distribute unbalanced shards If the number of shard placements (for a colocation group) is less than the number of workers, it means that some of the workers will remain empty. With this PR, we consider these shard groups as a colocation group, in order to make them be distributed evenly as much as possible across the cluster. Example: ```sql create table t1 (a int primary key); create table t2 (a int primary key); create table t3 (a int primary key); set citus.shard_count =1; select create_distributed_table('t1','a'); select create_distributed_table('t2','a',colocate_with=>'t1'); select create_distributed_table('t3','a',colocate_with=>'t2'); create table tb1 (a bigint); create table tb2 (a bigint); select create_distributed_table('tb1','a'); select create_distributed_table('tb2','a',colocate_with=>'tb1'); select citus_add_node('localhost',9702); select rebalance_table_shards(); ``` Here we have two colocation groups, each with one shard group. Both shard groups are placed on the first worker node. When we add a new worker node and try to rebalance table shards, the rebalance planner considers it well balanced and does nothing. With this PR, the rebalancer tries to distribute these shard groups evenly across the cluster as much as possible. For this example, with this PR, the rebalancer moves one of the shard groups to the second worker node. fixes: #6715	2023-03-06 14:14:27 +03:00
Emel Şimşek	ed7cc8f460	Remove unused lock functions (#6747 ) Code cleanup. This change removes two unused functions seemingly left over after a previous refactoring of shard move code.	2023-03-06 13:59:45 +03:00
Jelte Fennema	b489d763e1	Use pg_total_relation_size in citus_shards (#6748 ) DESCRIPTION: Correctly report shard size in citus_shards view When looking at citus_shards, people are interested in the actual size that all the data related to the shard takes up on disk. `pg_total_relation_size` is the function to use for that purpose. The previously used `pg_relation_size` does not include indexes or TOAST. Especially the missing toast can have enormous impact on the size of the shown data.	2023-03-06 10:53:12 +01:00
Gledis Zeneli	dc7fa0d5af	Fix multiple output version arbitrary config tests (#6744 ) With this small change, arbitrary config tests can have multiple acceptable correct outputs. For an arbitrary config tests named `t`, now you can define `expected/t.out`, `expected/t_0.out`, `expected/t_1.out` etc and the test will succeed if the output of `sql/t.sql` is equal to any of the `t.out` or `t_{0, 1, ...}.out` files.	2023-03-03 21:06:59 +03:00
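The matching rule from dc7fa0d5af can be sketched like this: a test `t` passes if its output equals `t.out` or any `t_<n>.out`. The helper names below are hypothetical and the real test runner may compare files differently (for example via diff); this only illustrates the "any acceptable output" idea.

```python
import re
import tempfile
from pathlib import Path


def candidate_outputs(expected_dir, test_name):
    """List expected/<t>.out and expected/<t>_<n>.out files for a test."""
    pattern = re.compile(rf"^{re.escape(test_name)}(_\d+)?\.out$")
    return sorted(p for p in Path(expected_dir).iterdir() if pattern.match(p.name))


def output_is_acceptable(actual, expected_dir, test_name):
    """True if the actual output equals any of the candidate files."""
    return any(p.read_text() == actual for p in candidate_outputs(expected_dir, test_name))


# Small self-contained demonstration using a temporary "expected" directory.
with tempfile.TemporaryDirectory() as expected_dir:
    Path(expected_dir, "t.out").write_text("count\n 1\n")
    Path(expected_dir, "t_0.out").write_text("count\n 2\n")
    Path(expected_dir, "t_notes.out").write_text("count\n 3\n")  # not a numbered variant
    matched = output_is_acceptable("count\n 2\n", expected_dir, "t")
    unmatched = output_is_acceptable("count\n 3\n", expected_dir, "t")
```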
Onur Tirtir	0d401344c2	Stabilize single node tests (#6741 ) First of all, we set next_shard_id for single_node_truncate.sql because shard ids in the test output were changing whenever we modify a prior test file, such as single_node.sql. Then the flaky test detector started complaining about single_node_truncate.sql. We fix that by specifying the correct test dependency for it in run_test.py. We also do the same for single_node.sql.	2023-03-03 17:17:08 +03:00
Onur Tirtir	a9820e96a3	Make single_node_truncate.sql re-runnable First of all, this commit sets next_shard_id for single_node_truncate.sql because shard ids in the test output were changing whenever we modify a prior test file. Then the flaky test detector started complaining about single_node_truncate.sql. We fix that by specifying the correct test dependency for it in run_test.py.	2023-03-02 16:33:18 +03:00
Onur Tirtir	40105bf1fc	Make single_node.sql re-runnable	2023-03-02 16:33:17 +03:00
Gokhan Gulbiz	f027a47ca8	Fix string eval bug in migration files check (#6740 )	2023-03-02 08:44:57 +03:00
aykut-bozkurt	e2654deeae	fix memory leak during altering distributed table with a lot of partitions and shards (#6726 ) 2 improvements to prevent memory leaks during altering or undistributing distributed tables with a lot of partitions and shards: 1. Free memory for each call to ConvertTable so that colocated and partition tables at `AlterDistributedTable`, `UndistributeTable`, or `AlterTableSetAccessMethod` will not cause an increase in memory usage, 2. Free memory while executing attach partition commands for each partition table at `AlterDistributedTable` to prevent an increase in memory usage. DESCRIPTION: Fixes memory leak issue during altering distributed table with a lot of partitions and shards. Fixes https://github.com/citusdata/citus/issues/6503.	2023-02-28 21:23:41 +03:00
Jelte Fennema	17ad61678f	Make run_test.py and create_test.py importable without errors (#6736 ) Allowing scripts to be importable is good practice in general and it's required for the pytest testing framework that I'm adding in a follow up PR.	2023-02-28 00:34:42 +03:00
Jelte Fennema	c018e29bec	Don't blanket ignore flake8 E402 error (#6734 ) Instead this starts ignoring it in specific places only, because most files don't actually need it ignored.	2023-02-27 18:17:15 +03:00
Gürkan İndibay	7b8e614039	Fixes bookworm packaging pipeline problem (#6737 ) Recently, I changed the Python execution structure to use a virtual environment. Therefore, there is no longer any need to change the built-in python for the images. Since Github is provisioning images with specific permissions, this caused an error. With this PR, I removed the unnecessary installation of pip and setuptools in the container docker image. Additionally, I removed some unnecessary sudos and used apt-get instead of apt in one place.	2023-02-27 15:28:36 +03:00
Jelte Fennema	24ad8574b5	Fix run_test.py on python 3.9 (#6735 ) In #6718 I accidentally added Python type hint syntax that was only supported on Python 3.10. Our CI uses 3.9, so this PR changes that to a syntax that's supported on 3.9 too.	2023-02-27 10:12:18 +01:00
Teja Mupparti	9cbfdc86dd	MERGE: In deparser, add missing check for RETURNING clause.	2023-02-26 22:38:14 -08:00
Teja Mupparti	d7b499929c	Rearrange the common code into a new function to facilitate the multiple checks of the same conditions in a multi-modify MERGE statement	2023-02-24 12:55:11 -08:00
aykut-bozkurt	a7689c3f8d	fix memory leak during distribution of a table with a lot of partitions (#6722 ) We have a memory leak during distribution of a table with a lot of partitions as we do not release memory at ExprContext until all partitions are distributed. We improved 2 things to resolve the issue: 1. We create and delete MemoryContext for each call to `CreateDistributedTable` by partitions, 2. We rebuild the cache after we insert all the placements instead of each placement for a shard. DESCRIPTION: Fixes memory leak during distribution of a table with a lot of partitions and shards. Fixes https://github.com/citusdata/citus/issues/6572.	2023-02-17 18:12:49 +03:00
Emel Şimşek	756c1d3f5d	Remove auto_explain workaround in citus explain hook for ALTER TABLE (#6714 ) When auto_explain module is loaded and configured, EXPLAIN will be implicitly run for all the supported commands. Postgres does not support `EXPLAIN` for `ALTER` command. However, auto_explain will try to `EXPLAIN` other supported commands internally triggered by `ALTER`. For instance, `ALTER TABLE target_table ADD CONSTRAINT fkey_167 FOREIGN KEY (col_1) REFERENCES ref_table(key) ... ` command may trigger a SELECT command in the following form for foreign key validation purpose: `SELECT fk.col_1 FROM ONLY target_table fk LEFT OUTER JOIN ONLY ref_table pk ON ( pk.key OPERATOR(pg_catalog.=) fk.col_1) WHERE pk.key IS NULL AND (fk.col_1 IS NOT NULL) ` For Citus tables, the Citus utility hook should ensure that constraint validation is skipped for shell tables but they are done for shard tables. The reason behind this design choice can be summed up as: - An ALTER TABLE command via coordinator node is run in a distributed transaction. - Citus does not support nested distributed transactions. - A SELECT query on a distributed table (aka shell table) is also run in a distributed transaction. - Therefore, Citus does not support running a SELECT query on a shell table while an ALTER TABLE command is running. With `eadc88a800` a bug is introduced breaking the skip constraint validation behaviour of Citus. With this change, we see that validation queries on distributed tables are triggered within `ALTER` command adding constraints with validation check. This regression did not cause an issue for regular use cases since the citus executor hook blocks those queries heuristically when there is an ALTER TABLE command in progress. The issue is surfaced as a crash (#6424 Workers, when configured to use auto_explain, crash during distributed transactions.) when auto_explain is enabled. 
This is due to auto_explain trying to execute the SELECT queries in a nested distributed transaction. Now since the regression with constraint validation is fixed in https://github.com/citusdata/citus/issues/6543, we should be able to remove the workaround.	2023-02-17 17:47:03 +03:00
aykut-bozkurt	9e69dd0e7f	fix single tuple result memory leak (#6724 ) We should not omit to free PGResult when we receive single tuple result from an internal backend. Single tuple results are normally freed by our ReceiveResults for `tupleDescriptor != NULL` flow but not for those with `tupleDescriptor == NULL`. See PR #6722 for details. DESCRIPTION: Fixes memory leak issue with query results that returns single row.	2023-02-17 14:15:09 +03:00
Teja Mupparti	ca65d2ba0b	Fix flaky tests local_shards_execution and local_shards_execution_replication. - A simple fix is to add ORDER BY to get deterministic results. - Set search_path explicitly after reconnecting; this avoids creating objects in the public schema, which prevents us from running the tests repeatedly. - multi_mx_modification is not designed to be run repeatedly, so isolate it.	2023-02-15 09:18:10 -08:00
Hanefi Onaldi	902d4262f9	CI checks to check for missing downgrade updates (#6661 ) A branch that touches a set of upgrade scripts is also expected to touch corresponding downgrade scripts as well. To ensure that I introduce a new CI script. If this script fails, read the output and make sure you update the downgrade scripts in the printed list.	2023-02-15 18:20:14 +03:00
Jelte Fennema	b02a5b5b78	Add more powerful dependency tracking to run_test.py (#6718 ) Some of our tests depend on previous tests. Normally all these tests should be part of a base schedule, but that's not always the case. The flaky test detection script should ensure that we don't introduce other dependencies by accident in new tests. But we have many old tests that are not worth the effort of changing. This adds a way to define such test dependencies in `run_test.py`, so that it can make sure to run any dependencies before the actual test.	2023-02-15 17:20:05 +03:00
Jelte Fennema	3ba639f162	Install non-vulnerable cryptography package (#6710 ) Our repo was complaining about the cryptography package being vulnerable. This updates it, including our mitmproxy fork, because that was pinning an outdated version. Relevant commit on our mitmproxy fork: `2fd18ef051` Relevant PR on the-process: https://github.com/citusdata/the-process/pull/112	2023-02-14 18:03:10 +01:00
aykut-bozkurt	273911ac7f	prevent memory leak during ConvertTable with a lot of partitions (#6693 ) Prevents memory leak during ConvertTable call for a table with a lot of partitions. DESCRIPTION: Fixes memory leak during undistribution and alteration of a table with a lot of partitions.	2023-02-13 15:22:13 +03:00
Jelte Fennema	3200187757	Support compilation and run tests on latest PG versions (#6711 ) Postgres got minor updates; this starts using the images with the latest versions for our tests. These new Postgres versions caused a compilation issue in PG14 and PG13 due to some function being backported that we had already backported ourselves. Because this backport is a static inline function, it doesn't matter who provides it, and there will be no linkage errors when either running old Citus packages on new PG versions or the other way around.	2023-02-10 16:02:03 +01:00
Jelte Fennema	dd51938f20	Add auto-formatting and linting to our python code (#6700 ) We're getting more and more python code in the repo. This adds some tools to make sure that styling is consistent and we're not doing easy to miss mistakes. - Format python files with black - Run python files through isort - Fix issues reported by flake8 - Add .venv to gitignore	2023-02-10 13:25:44 +01:00
Jelte Fennema	b01d67c943	Add python code checks to CI	2023-02-10 13:14:28 +01:00
Jelte Fennema	7bf3084b28	Add python tools to check-style and reindent make targets	2023-02-10 13:14:28 +01:00
Jelte Fennema	92dab9b441	Add .venv to gitignore	2023-02-10 13:05:37 +01:00
Jelte Fennema	9f41ea2157	Fix issues reported by flake8	2023-02-10 13:05:37 +01:00
Jelte Fennema	188cc7d2ae	Run python files through isort	2023-02-10 13:05:37 +01:00
Jelte Fennema	530b24a887	Format python files with black	2023-02-10 13:05:37 +01:00
Jelte Fennema	42970665fc	Add linting and formatting tools for python	2023-02-10 13:05:37 +01:00
Jelte Fennema	09be4bb5fd	Allow multi_insert_select to run repeatably (#6707 ) It was not cleaning up all the tables it created. This changes it to create a dedicated schema for this test, like we have for many others.	2023-02-10 10:06:42 +01:00
Jelte Fennema	590df5360c	Fix flakyness in failure_create_distributed_table_non_empty (#6708 ) The failure_create_distributed_table_non_empty test would sometimes fail like this: ```diff -- in the first test, cancel the first connection we sent from the coordinator SELECT citus.mitmproxy('conn.cancel(' \|\| pg_backend_pid() \|\| ')'); - mitmproxy ---------------------------------------------------------------------- - -(1 row) - +ERROR: canceling statement due to user request +CONTEXT: COPY mitmproxy_result, line 0 +SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'" +PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE SELECT create_distributed_table('test_table', 'id'); ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/30474/workflows/be1c9f9d-22c9-465c-964a-dcdd1cb8c99c/jobs/985441 Because the cancel command had no filter it would actually sometimes cancel the mitmproxy cancel command itself. This PR addresses that by simply removing this test. This is basically the exact same issue as #6217, only in a different place in the file. It's fixed here by removing the test since there's already many different similar tests.	2023-02-10 09:55:12 +01:00
Teja Mupparti	0824d9c1fb	Miscellaneous cleanup	2023-02-09 13:05:59 -08:00
Adam Wolk	b58de7f52c	Update README for 11.2	2023-02-08 18:54:55 +01:00
Jelte Fennema	69b7f23932	Fix dubious ownership error from git (#6703 ) We started getting this error in CI: ``` Summary coverage rate: lines......: 43.4% (28347 of 65321 lines) functions..: 53.2% (2544 of 4786 functions) branches...: no data found fatal: detected dubious ownership in repository at '/home/circleci/project' To add an exception for this directory, call: git config --global --add safe.directory /home/circleci/project Error: exit status 128 ``` This fixes that by running the proposed command in CI. This error is related to a CVE that does not apply to this case, since this is not a multiuser system. Commit on git itself that fixed the CVE: `8959555cee`	2023-02-08 17:44:42 +01:00
Hanefi Onaldi	414a95e259	Create CodeQL workflow for static analysis (#5868 ) Introducing a new GitHub Actions workflow to run our static analysis tool, CodeQL. Relevant GitHub docs page: https://docs.github.com/en/code-security/code-scanning/automatically-scanning-your-code-for-vulnerabilities-and-errors/customizing-code-scanning The GitHub action that we use for security scanning: https://github.com/github/codeql-action	2023-02-07 15:25:17 +03:00
Onur Tirtir	483b51392f	Bump Citus to 11.3devel (#6690 )	2023-02-06 10:23:25 +00:00
Gokhan Gulbiz	b6a4652849	Stop background daemon before dropping the database (#6688 ) DESCRIPTION: Stop maintenance daemon when dropping a database even without Citus extension Fixes #6670	2023-02-03 15:15:44 +03:00
Onur Tirtir	c7f8c5de99	Add changelog entries for 11.2.0 (#6671 ) Co-authored-by: Naisila Puka <37271756+naisila@users.noreply.github.com>	2023-02-03 10:52:08 +03:00
Jelte Fennema	f061dbb253	Also reset transactions at connection shutdown (#6685 ) In #6314 I refactored the connection cleanup to be simpler to understand and use. However, by doing so I introduced a use-after-free possibility (that valgrind luckily picked up): In the `ShouldShutdownConnection` path of `AfterXactHostConnectionHandling` we free connections without removing the `transactionNode` from the dlist that it might be part of. Before the refactoring this wasn't a problem, because the dlist would be completely reset shortly afterwards in `ResetGlobalVariables` (without reading or writing the dlist entries). The refactoring changed this by moving the `dlist_delete` call to `ResetRemoteTransaction`, which in turn was called in the `!ShouldShutdownConnection` path of `AfterXactHostConnectionHandling`. Thus this `!ShouldShutdownConnection` path would now delete from the `dlist`, but the `ShouldShutdownConnection` path would not. As a result, when removing its own node, the deleting path would sometimes update nodes in the list that had been freed right before. There are two ways of fixing this: 1. Call `dlist_delete` from both paths. 2. Call `dlist_delete` from neither of the paths. This commit implements the second approach, and #6684 implements the first. We need to choose which approach we prefer. To make calling `dlist_delete` from both paths actually work, we also need to use a slightly different check to determine if we need to call `dlist_delete`. Various regression tests showed that there can be cases where the `transactionState` is something other than `REMOTE_TRANS_NOT_STARTED` but the connection was not added to the `InProgressTransactions` list. One example of such a case is when running `TransactionStateMachine` without calling `StartRemoteTransactionBegin` beforehand. In those cases the connection won't be added to `InProgressTransactions`, but the `transactionState` is changed to `REMOTE_TRANS_SENT_COMMAND`.
Sidenote: This bug already existed in 11.1, but valgrind didn't catch it back then. My guess is that this happened because #6314 was merged after the initial release branch was cut. Fixes #6638	2023-02-02 16:05:34 +01:00
Hanefi Onaldi	47ff03123b	Improve rebalance reporting for retried tasks (#6683 ) If there is a problem with an ongoing rebalance, we did not show details on background tasks that are stuck in runnable state. Similar to how we show details for errored tasks, we now show details on tasks that are being retried. Earlier we showed the following output when a task was stuck: ``` ┌────────────────────────────┐ │ { ↵│ │ "tasks": [ ↵│ │ ], ↵│ │ "task_state_counts": {↵│ │ "done": 13, ↵│ │ "blocked": 2, ↵│ │ "runnable": 1 ↵│ │ } ↵│ │ } │ └────────────────────────────┘ ``` Now we show details like the following: ``` +----------------------------------------------------------------------- \| { \| "tasks": [ \| { \| "state": "runnable", \| "command": "SELECT pg_catalog.citus_move_shard_placement(1 \| "message": "ERROR: Moving shards to a node that shouldn't \| "retried": 2, \| "task_id": 3 \| } \| ], \| "task_state_counts": { \| "blocked": 1, \| "runnable": 1 \| } \| } +----------------------------------------------------------------------- ```	2023-01-31 15:26:52 +03:00
Jelte Fennema	14c31fbb07	Fix background rebalance when reference table has no PK (#6682 ) DESCRIPTION: Fix background rebalance when reference table has no PK For the background rebalance we would always fail if a reference table that was not replicated to all nodes would not have a PK (or replica identity). Even when we used force_logical or block_writes as the shard transfer mode. This fixes that and adds some regression tests. Fixes #6680	2023-01-31 12:18:29 +01:00
Gürkan İndibay	d919506076	Fixes validate Output phase of packaging pipeline (#6678 ) Pyenv is installed in our container images, but I found out that it is not being activated: it is activated from the ~/bashrc script, and in GitHub Actions (GHA) this script is not executed. Since pyenv is not activated, the default Python version that comes with the Docker images is used, and in this case we get errors for Python version 3.11. Additionally, the $HOME directory is /github/home for containers executed under GHA, while our pyenv installation is under the /root directory, which is normally the home directory for our packaging containers. This PR activates pyenv and additionally uses the pyenv virtualenv feature to execute the validate_output function in isolation. --------- Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2023-01-31 13:59:09 +03:00
aykut-bozkurt	8a9bb272e4	fix dropping table_name option from foreign table (#6669 ) We should disallow dropping table_name option if foreign table is in metadata. Otherwise, we get table not found error which contains shardid. DESCRIPTION: Fixes an unexpected foreign table error by disallowing to drop the table_name option. Fixes #6663	2023-01-30 17:24:30 +03:00
Marco Slot	a482b36760	Revert "Support MERGE on distributed tables with restrictions" (#6675 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-01-30 15:01:59 +01:00
Hanefi Onaldi	0962cf7517	Allow empty lines in arbitrary config schedules (#6654 ) This change is a precursor to attempts to add more editorconfig rules in our codebase. It is a good idea to comply with POSIX standards and have an empty newline at the end of text files. However, once we have such a rule, arbitrary configs scripts used to fail before this change. Related: #5981	2023-01-30 16:30:12 +03:00
Onur Tirtir	a9e1b98973	Fall-back to seq-scan when accessing columnar metadata if the index doesn't exist (#6624 )	2023-01-30 16:07:43 +03:00
Onur Tirtir	594684bb33	Do clean-up before columnar_create to make it runnable multiple times So that flaky test detector can run columnar_create.sql multiple times.	2023-01-30 15:58:34 +03:00
Onur Tirtir	1c51ddae49	Fall-back to seq-scan when accessing columnar metadata if the index doesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that, in #5456 we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensure the existence of a class of metadata objects --such as columnar.storageid_seq-- and helped fix #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such as columnar.stripe_pkey-- didn't help at all because they were in fact causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall back to sequential scan during pg upgrades.	2023-01-30 15:58:34 +03:00
Jelte Fennema	1109b70e58	Fix flaky isolation_non_blocking_shard_split test (#6666 ) Sometimes isolation_non_blocking_shard_split would fail like this: ```diff step s2-show-pg_dist_cleanup: SELECT object_name, object_type, policy_type FROM pg_dist_cleanup; object_name \|object_type\|policy_type ------------------------------+-----------+----------- +citus_shard_split_slot_2_10_39\| 3\| 0 public.to_split_table_1500001 \| 1\| 2 -(1 row) +(2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/30237/workflows/edcf34b7-d7d3-4d10-8293-b6f59b00cdf2/jobs/970960 The reason is that replication slots have now become part of pg_dist_cleanup too, and sometimes they cannot be cleaned up right away. This is harmless as they will be cleaned up eventually. So this simply filters out the replication slots for those tests.	2023-01-30 13:44:23 +01:00
Jelte Fennema	10603ed5d4	Fix flaky multi_reference_table test (#6664 ) Sometimes in CI our multi_reference_table test fails like this: ```diff WHERE colocated_table_test.value_2 = reference_table_test.value_2; LOG: join order: [ "colocated_table_test" ][ reference join "reference_table_test" ] value_2 --------- - 1 2 + 1 (2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/30223/workflows/ce3ab5db-310f-4e30-ba0b-c3b31927d9b6/jobs/970041 We forgot an ORDER BY in this test.	2023-01-30 13:32:38 +01:00
aykut-bozkurt	ab71cd01ee	fix multi level recursive plan (#6650 ) The recursive planner should handle the whole tree from bottom to top in a single pass, i.e. it should have already recursively planned all required parts in its first pass. Otherwise, we have a bug in the recursive planner, which needs to be handled. We add a check here and return an error. DESCRIPTION: Fixes wrong results by throwing an error in case the recursive planner makes multiple passes over the query. We found 3 different cases which cause the recursive planner to pass over the query multiple times. 1. A sublink in the WHERE clause is planned at the second pass after we recursively planned a distributed table at the first pass. Fixed by PR #6657. 2. Local-distributed joins are recursively planned at both the first and the second pass. Issue #6659. 3. Some parts of the query are considered to be noncolocated at the second pass as we do not generate attribute equivalences between nondistributed and distributed tables. Issue #6653	2023-01-27 21:25:04 +03:00
Jelte Fennema	0903091343	Add valgrind support to run_test.py (#6667 ) Running tests with valgrind was not possible with our run_test.py script yet. This adds that support.	2023-01-27 16:01:59 +01:00
Gokhan Gulbiz	4e26464969	Allow plain pg foreign tables without a table_name option (#6652 )	2023-01-27 16:34:11 +03:00
Jelte Fennema	81dcddd1ef	Actually skip constraint validation on shards after shard move (#6640 ) DESCRIPTION: Fix foreign key validation skip at the end of shard move In `eadc88a` we started completely skipping foreign key constraint validation at the end of a non-blocking shard move, instead of only for foreign keys to reference tables. However, it turns out that this didn't work at all because of a hard-to-notice bug: by resetting the SkipConstraintValidation flag at the end of our utility hook, we actually make the SET command that sets it a no-op. This fixes that bug by removing the code that resets it. This is fine because #6543 removed the only place where we set the flag in C code. So the resetting of the flag has no purpose anymore. This PR also adds a regression test, because it turned out we didn't have any; otherwise we would have caught that the feature was completely broken. It also moves the constraint validation skipping to the utility hook. The reason is that #6550 showed us that this is the better place to skip it, because it will also skip the planning phase and not just the execution.	2023-01-27 13:08:05 +01:00
aykut-bozkurt	8870f0f90b	fix order of recursive sublink planning (#6657 ) We should do the sublink conversions at the end of the recursive planning because earlier steps might have transformed the query into a shape that needs recursive planning of the sublinks. DESCRIPTION: Fixes early sublink check at recursive planner. Related to PR https://github.com/citusdata/citus/pull/6650	2023-01-27 14:35:16 +03:00
Onur Tirtir	97dba0ac00	Fix uninit mem access in UpdateFunctionDistributionInfo (#6658 ) Fixes #6655. heap_modify_tuple() fetches values[i] if replace[i] is set to true, regardless of whether isnull[i] is true or false. So similar to replace[], let's init values[] & isnull[] too. DESCRIPTION: Fixes an uninitialized memory access in create_distributed_function()	2023-01-27 11:00:41 +03:00
Onur Tirtir	d2d507eb85	Fix columnar README.md (#6633 ) Reported in #6626.	2023-01-26 10:39:39 +03:00
Emel Şimşek	24f6136f72	Fixes ADD {PRIMARY KEY/UNIQUE} USING INDEX cmd (#6647 ) This change allows creating a constraint without a name using an index. The index name will be used as the constraint name the same way postgres handles it. Fixes issue #6644 This commit also cleans up some leftovers from nameless constraint checks. With this commit, we now fully support adding all nameless constraints directly to a table. Co-authored-by: naisila <nicypp@gmail.com>	2023-01-25 21:28:07 +03:00
Emel Şimşek	2169e0222b	Propagates NOT VALID option for FK&CHECK constraints w/out a name (#6649 ) Adds NOT VALID option to deparser. When we need to deparse: "ALTER TABLE ADD FOREIGN KEY ... NOT VALID" "ALTER TABLE ADD CHECK ... NOT VALID" NOT VALID option should be propagated to workers. Fixes issue #6646 This commit also uses AppendColumnNameList function instead of repeated code blocks in two appropriate places in the "ALTER TABLE" deparser.	2023-01-25 20:41:04 +03:00
Hanefi Onaldi	94b63f35a5	Prevent crashes on update with returning clauses (#6643 ) If an update query on a reference table has a RETURNING clause with a subquery that accesses some other local table, we end up with a crash. This commit prevents the crash, but does not prevent other error messages from happening due to Citus not being able to push down the results of that subquery in a valid SQL command. Related: #6634	2023-01-24 20:07:43 +03:00
Jelte Fennema	aa9cd16d15	Use correct guc value to disable statistics collection (#6641 ) The `citus.enable_statistics_collection` is a boolean GUC not an integer one. Setting it to `-1` showed errors in the logs.	2023-01-24 15:32:50 +01:00
Naisila Puka	3c96b2a0cd	Remove unused function RelationUsesIdentityColumns (#6645 ) Cleanup from #6591	2023-01-24 17:10:05 +03:00
Jelte Fennema	d21ff0f883	Fix regression in allowed foreign keys on distributed tables (#6550 ) DESCRIPTION: Fix regression in allowed foreign keys on distributed tables In commit `eadc88a` we changed how we skip foreign key validation. The goal was to skip it in more cases. However, one change had the unintended regression of introducing failures when trying to create certain foreign keys. This reverts that part of the change. The way of skipping validation of foreign keys that was introduced in `eadc88a` was skipping validation during execution. The reason that this caused this regression was because some foreign key validation queries already fail during planning. In those cases it never gets to the execution step where it would later be skipped. Fixes #6543	2023-01-24 14:26:17 +01:00
Jelte Fennema	7a7880aec9	Fix regression in allowed foreign keys on distributed tables (#6550 ) DESCRIPTION: Fix regression in allowed foreign keys on distributed tables In commit `eadc88a` we changed how we skip foreign key validation. The goal was to skip it in more cases. However, one change had the unintended regression of introducing failures when trying to create certain foreign keys. This reverts that part of the change. The way of skipping validation of foreign keys that was introduced in `eadc88a` was skipping validation during execution. The reason that this caused this regression was because some foreign key validation queries already fail during planning. In those cases it never gets to the execution step where it would later be skipped. Fixes #6543	2023-01-24 16:09:21 +03:00
Jelte Fennema	93fcc5c5d8	Move tablespace directory creation to pg_regress_multi.pl (#6629 ) Multiple `check-xxx` targets create tablespaces. If you run two of these at the same time you would get an error like: ```diff CREATE TABLESPACE test_tablespace LOCATION :'test_tablespace'; +ERROR: directory "/home/rajesh/citus/citus/src/test/regress/tmp_check/ts0/PG_14_202107181" already in use as a tablespace ``` This fixes that by moving creation of table space directory creation and removal to pg_regress_multi.pl instead of being in the Makefile.	2023-01-20 12:34:33 +00:00
Emel Şimşek	58368b7783	Enable adding FOREIGN KEY constraints on Citus tables without a name. (#6616 ) DESCRIPTION: Enable adding FOREIGN KEY constraints on Citus tables without a name This PR enables adding a foreign key to a distributed/reference/Citus local table without specifying the name of the constraint, e.g. `ALTER TABLE items ADD FOREIGN KEY (user_id) REFERENCES users (id);`	2023-01-20 01:43:52 +03:00
Gokhan Gulbiz	2388fbea6e	Identity Column Support on Citus Managed Tables (#6591 ) DESCRIPTION: Identity Column Support on Citus Managed Tables	2023-01-19 15:45:41 +03:00
Marco Slot	64e3fee89b	Remove shardstate leftovers (#6627 ) Remove ShardState enum and associated logic. Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2023-01-19 11:43:58 +03:00
Teja Mupparti	44c387b978	Support MERGE on distributed tables with restrictions This implements the phase - II of MERGE sql support Support routable query where all the tables in the merge-sql are distributed, co-located, and both the source and target relations are joined on the distribution column with a constant qual. This should be a Citus single-task query. Below is an example. SELECT create_distributed_table('t1', 'id'); SELECT create_distributed_table('s1', 'id', colocate_with => 't1'); MERGE INTO t1 USING s1 ON t1.id = s1.id AND t1.id = 100 WHEN MATCHED THEN UPDATE SET val = s1.val + 10 WHEN MATCHED THEN DELETE WHEN NOT MATCHED THEN INSERT (id, val, src) VALUES (s1.id, s1.val, s1.src) Basically, MERGE checks to see if There are a minimum of two distributed tables (source and a target). All the distributed tables are indeed colocated. MERGE relations are joined on the distribution column MERGE .. USING .. ON target.dist_key = source.dist_key The query should touch only a single shard i.e. JOIN AND with a constant qual MERGE .. USING .. ON target.dist_key = source.dist_key AND target.dist_key = <> If any of the conditions are not met, it raises an exception.	2023-01-18 11:05:27 -08:00
Onur Tirtir	e5245a39d1	Save logs & results in upgrade tests / arbitrary config tests too (#6625 )	2023-01-18 16:27:59 +03:00
Ahmet Gedemenli	b3b135867e	Remove shardstate from placement insert functions (#6615 )	2023-01-18 09:52:38 +01:00
Hanefi Onaldi	f21dfd5fae	Rebalance Progress Reporting API (#6576 ) citus_job_list() lists all background jobs by simply showing the records in pg_dist_background_job. citus_job_status(job_id bigint, raw boolean default false) shows the status of a single background job by appending a jsonb details column to the associated row from pg_dist_background_job. If the raw argument is set, machine readable sizes are used instead of human readable alternatives. citus_rebalance_status(raw boolean default false) shows the status of the last rebalance operation. If the raw argument is set, machine readable sizes are used instead of human readable alternatives.	2023-01-16 16:17:31 +03:00
Jelte Fennema	92689a8362	Make GPIDs work with pg_dist_poolinfo (#6588 ) The original implementation of GPIDs didn't work correctly when using `pg_dist_poolinfo` together with PgBouncer. The reason is that it assumed that once a connection was made to a worker, the originating GPID should stay the same for ever. But when pg_dist_poolinfo is used this isn't the case, because the same connection on the worker might be used by different backends of the coordinator. This fixes that issue by updating the GPID whenever a new application name is set on a connection. This is the only thing that's needed, because PgBouncer already sets the application name correctly on the server connection whenever a client is updated.	2023-01-13 14:39:19 +00:00
Marco Slot	ad3407b5ff	Revert "Make the metadata syncing less resource invasive [Phase-1]" (#6618 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2023-01-13 13:56:55 +01:00
Ahmet Gedemenli	ed6dd8b086	Mark 0/0 lsn results as null (#6617 ) Marks `source_lsn` and `target_lsn` fields as null if the result is 0/0	2023-01-13 15:33:43 +03:00
Emel Şimşek	28ed013a91	Enable ALTER TABLE ... ADD CHECK (#6606 ) DESCRIPTION: Enable adding CHECK constraints on distributed tables without the client having to provide a constraint name. This PR enables the following command syntax for adding check constraints to distributed tables. ALTER TABLE ... ADD CHECK ... by creating a default constraint name and transforming the command into the below syntax before sending it to workers. ALTER TABLE ... ADD CONSTRAINT \<conname> CHECK ...	2023-01-12 23:31:06 +03:00
Ahmet Gedemenli	a6ad4574f6	Transfer shard with nodeid (#6612 ) DESCRIPTION: Introduce citus_copy_shard_placement UDF with node id DESCRIPTION: Introduce citus_move_shard_placement UDF with node id DESCRIPTION: Use new shard transfer functions with node id for rebalancing New shard transfer functions to be used with nodeid instead of hostname and port. Use these functions in shard rebalancer.	2023-01-12 18:33:46 +03:00
Ahmet Gedemenli	9b9d8e7abd	Use shard transfer UDFs with node ids for rebalancing	2023-01-12 16:57:51 +03:00
Ahmet Gedemenli	e5fef40c06	Introduce citus_move_shard_placement UDF with nodeid	2023-01-12 16:57:51 +03:00
Ahmet Gedemenli	e19c545fbf	Introduce citus_copy_shard_placement UDF with nodeid	2023-01-12 16:57:51 +03:00
Emel Şimşek	d322f9e382	Handle DEFERRABLE option for the relevant constraints at deparser. (#6613 ) Table constraints UNIQUE, PRIMARY KEY and EXCLUDE may have the option DEFERRABLE in their command syntax. This PR handles the option when deparsing the relevant constraint statements. NOT DEFERRABLE and INITIALLY IMMEDIATE (if DEFERRABLE) are the default values for the option, so we only append the non-default values to the alter table statement.	2023-01-12 12:32:38 +03:00
Jelte Fennema	34df853bda	Fix bug introduced by #6412 (#6590 ) In #6412 I made a change to not re-assign the global PID if it was already set. This inadvertently introduced a regression where `userId` and `databaseId` would not be set on the backend data when the global PID was assigned in the authentication hook. This fixes it by doing two things: 1. Removing `userId` from `BackendData`, since it's not used anywhere anyway. 2. Move assignment of `databaseId` to dedicated `SetBackendDataDatabaseId` function, that isn't a no-op when global pid is already set. Since #6412 is not released yet this does not need a description.	2023-01-10 16:21:57 +01:00
Jelte Fennema	17775dad5d	Only run package builds on pull requests (#6605 ) Recently a package test build pipeline was introduced, to build citus on all OS that we build packages for. However, every pull request would run each build twice. This fixes that by only running it for the pull request event, not for the push event. Example of duplicate run: ![image](https://user-images.githubusercontent.com/1162278/211028723-8c0e8aa0-e267-4665-811c-6cecd4286621.png)	2023-01-06 16:02:49 +00:00
Jelte Fennema	c2b4087ff0	Quote all identifiers that we use for logical replication (#6604 ) In #6598 it was noticed that Citus could generate syntactically invalid statements during logical replication. With #6603 we resolved the direct issue, by only generating valid subscription names. But there was also the underlying problem that we did not escape certain identifier strings. While in theory this should be okay since we should only generate names that are valid, this issue reiterated that we should not take this for granted. As an extra line of defense this quotes all identifiers we use during logical replication setup.	2023-01-06 14:12:03 +00:00
Jelte Fennema	44e09128f0	Fix failures in mx_base_schedule (#6601 ) Apparently no-one actually ran the mx_base_schedule, because the tests in the schedule itself were already failing. This updates it to be in line with multi_mx_schedule again to make the tests pass again. Notably it doesn't contain multi_mx_node_metadata and multi_extension, because those tests take long to run and they were not necessary to make multi_mx_create_table pass again.	2023-01-06 14:48:18 +01:00
Ahmet Gedemenli	26b170e1a8	Use %u instead of %i for naming subscriptions & roles (#6603 ) DESCRIPTION: Fix the modifier for subscription and role creation fixes: #6598 Reported by @ivyazmitinov	2023-01-06 14:38:32 +01:00
Ahmet Gedemenli	bc3383170e	Fix crash when trying to replicate a ref table that is actually dropped (#6595 ) DESCRIPTION: Fix crash when trying to replicate a ref table that is actually dropped see #6592 We should have a real solution for it.	2023-01-06 14:52:08 +03:00
Emel Şimşek	db7a70ef3e	Enable ALTER TABLE ... ADD UNIQUE and ADD EXCLUDE. (#6582 ) DESCRIPTION: Adds support for creating table constraints UNIQUE and EXCLUDE via ALTER TABLE command without client having to specify a name. ALTER TABLE ... ADD CONSTRAINT <conname> UNIQUE ... ALTER TABLE ... ADD CONSTRAINT <conname> EXCLUDE ... commands require the client to provide an explicit constraint name. However, in postgres it is possible for clients not to provide a name and let the postgres generate it using the following commands ALTER TABLE ... ADD UNIQUE ... ALTER TABLE ... ADD EXCLUDE ... This PR enables the same functionality for citus tables.	2023-01-05 18:12:32 +03:00
Emel Şimşek	135c519c62	Fix flakiness for multi_name_lengths test (#6599 ) Fix flakiness for multi_name_lengths test.	2023-01-05 17:27:16 +03:00
aykut-bozkurt	f79ee13eef	we can reuse some steps in jobs in circle-ci config. (#6164 ) - Reuse steps in jobs via common commands, - Use yaml alias and anchors to share common params for workflow jobs.	2023-01-05 11:23:39 +03:00
Önder Kalacı	eb75decbeb	Undo planner extended statistics override (#6492 )	2023-01-04 13:25:57 +01:00
Önder Kalacı	a1aa96b32c	Make the metadata syncing less resource invasive [Phase-1] (#6537 )	2023-01-04 11:36:45 +01:00
Ahmet Gedemenli	235047670d	Drop SHARD_STATE_TO_DELETE (#6494 ) DESCRIPTION: Drop `SHARD_STATE_TO_DELETE` and use the cleanup records instead Drops the shard state that is used to mark shards as orphaned. Now we insert cleanup records into `pg_dist_cleanup` so "orphaned" shards will be dropped either by maintenance daemon or internal cleanup calls. With this PR, we make the "cleanup orphaned shards" functions to be no-op, as they would not be needed anymore. This PR includes some naming changes about placement functions. We don't need functions that filter orphaned shards, as there will be no orphaned shards anymore. We will also be introducing a small script with this PR, for users with orphaned shards. We'll basically delete the orphaned shard entries from `pg_dist_placement` and insert cleanup records into `pg_dist_cleanup` for each one of them, during Citus upgrade. We also have a lot of flakiness fixes in this PR. Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2023-01-03 14:38:16 +03:00
Jelte Fennema	f56904fe04	Fix flakiness in isolation_insert_vs_vacuum (#6589 ) Sometimes our `isolation_insert_vs_vacuum` test would fail like this. ```diff step s2-vacuum-analyze: VACUUM ANALYZE test_insert_vacuum; - + <waiting ...> step s1-commit: COMMIT; +step s2-vacuum-analyze: <... completed> ``` The reason seems to be that VACUUM ANALYZE tries to take some locks that conflict with the other transaction, but these locks somehow get released or VACUUM ANALYZE stops waiting for them. This is somewhat expected since VACUUM has some special locking logic. To solve the flakiness we now trigger VACUUM ANALYZE to always report as blocking and after that we explicitly wait for it to complete. This is done as suggested by the flaky test tips from postgres: `c68a183990/src/test/isolation/README (L152)` I've confirmed that this fixes the issue using our flaky-test-debugging CI workflow.	2023-01-02 16:51:58 +01:00
Ahmet Gedemenli	0c74e4cc0f	Fix some flaky tests (#6587 ) Fixes some simple flakiness, all related to `DROP USER` and cleanup function calls.	2022-12-29 10:19:09 +03:00
Ahmet Gedemenli	f824c996b3	Remove duplicate split entry in run_test.py (#6586 ) Nothing important. Removing the duplicate `"split" in test_schedule` check	2022-12-28 18:18:24 +03:00
Ahmet Gedemenli	1b1e737e51	Drop cleanup on failure (#6584 ) DESCRIPTION: Defers cleanup after a failure in shard move or split We don't need to do a cleanup in case of failure on a shard transfer or split anymore. Because, * Maintenance daemon will clean them up anyway. * We trigger a cleanup at the beginning of shard transfers/splits. * The cleanup on failure logic also can fail sometimes and instead of the original error, we throw the error that is raised by the cleanup procedure, and it causes confusion.	2022-12-28 15:48:44 +03:00
Ahmet Gedemenli	cfc17385e9	Some minor improvements on flakiness detection (#6585 ) * Skip some exceptional test files in the flaky workflow, like multi_extension * Run some tests without a schedule, like single_node_enterprise * Use minimal schedule for the tests in split and operations schedules	2022-12-28 15:08:39 +03:00
Ahmet Gedemenli	eba9abeee2	Fix leftover shard copy on the target node when tx with move is aborted (#6583 ) DESCRIPTION: Cleanup the shard on the target node in case of a failed/aborted shard move Inserts a cleanup record for the moved shard placement on the target node. If the move operation succeeds, the record will be deleted. If not, it will remain there to be cleaned up later. fixes: #6580	2022-12-27 22:42:46 +03:00
Naisila Puka	e937935935	Clean up normalize file (#6578 )	2022-12-26 12:08:27 +03:00
Naisila Puka	bc49616426	Fix link of path to flaky tests docs (#6579 )	2022-12-23 12:09:22 +03:00
Naisila Puka	9c78f693f2	Fix typo in fix_style.sh (#6577 ) For reference, this is the print stack trace PR https://github.com/citusdata/citus/pull/6539	2022-12-22 16:24:26 +03:00
Ahmet Gedemenli	cf93eb46c6	Fix split schedule (#6571 ) * Drop enterprise_split_schedule as it's not even called in our CI pipeline. It's actually a subset of split_schedule, except for `citus_split_shard_by_split_points_deferred_drop`. Added that one into split_schedule and dropped the enterprise one. * Delete `citus_non_blocking_shard_split_cleanup.out`, as there is no sql file for it. It seems it's renamed to some other test and the sql file is deleted, but we forgot to delete the output file. * 6 test files are chained to each other with dependent objects. Unified them into one test file so that the flaky check will not fail for them anymore. * Some cleanup lines to prevent the flakiness check from failing.	2022-12-22 13:20:06 +03:00
Ahmet Gedemenli	96bf648d1a	Unify dependent test files into one	2022-12-22 13:06:26 +03:00
Ahmet Gedemenli	acf3539a90	Fix split schedule	2022-12-22 13:06:26 +03:00
Ahmet Gedemenli	497a7589d0	Use failtester for flakiness detection (#6569 ) We need the failtester image to run failure tests in the flakiness detection workflow.	2022-12-21 18:56:51 +03:00
Hanefi Onaldi	303db172f8	Use Citus version comparison in upgrade tests, not equality (#6568 ) We have several version checks in our Citus upgrade tests. However, as we drop support for PG versions, we need to update the Citus versions used in our CI images. Therefore we must compare Citus versions in our tests instead of using equality checks so that the queries are run in all the associated Citus versions. For example, we have many conditionals where we exit early if the Citus version is not equal to 9.0. However, as of today we never use version 9.0 and thus we always exit early in those tests.	2022-12-21 14:01:57 +03:00
Hanefi Onaldi	d6833c877b	Add changelog entries for 11.1.5 (#6567 )	2022-12-20 14:56:26 +03:00
Teja Mupparti	9a9989fc15	Support MERGE Phase – I All the tables (target, source or any CTE present) in the SQL statement are local i.e. a merge-sql with a combination of Citus local and Non-Citus tables (regular Postgres tables) should work and give the same result as Postgres MERGE on regular tables. Catch and throw an exception (not-yet-supported) for all other scenarios during Citus-planning phase.	2022-12-18 20:32:15 -08:00
Emel Şimşek	5268d0a6cb	Enable PRIMARY KEY generation via ALTER TABLE even if the constraint name is not provided (#6520 ) DESCRIPTION: Support ALTER TABLE .. ADD PRIMARY KEY ... command Before processing > ALTER TABLE ... ADD PRIMARY KEY ... command 1. Create a primary key name to use as the constraint name. 2. Change the ALTER TABLE ... ADD PRIMARY KEY ... command into ALTER TABLE ... ADD CONSTRAINT \<constraint name> PRIMARY KEY ... form. This is the only form in which we can specify a name for a primary key. If we run ALTER TABLE .. ADD PRIMARY KEY, postgres would create a constraint name internally in its own scheme. But the problem is that we need to create constraint names for shards in our own scheme which is \<constraint name>_\<shardid>. Hence we need to create a name and send it to workers so that the workers can append the shardid. 3. Run the changed command on the coordinator to make sure we are using the same constraint name across the board. 4. Send the changed command to workers such that it is executed for the main table as well as for the shards. Fixes #6515.	2022-12-16 20:34:00 +03:00
aykut-bozkurt	9c0073ba57	remove unused boundary type (#6563 ) Removes unused job boundary tag `SUBQUERY_MAP_MERGE_JOB`. Only usage is at `BuildMapMergeJob`, which is only called when the boundary = `JOIN_MAP_MERGE_JOB`. Hence, it should be safe to remove.	2022-12-16 18:19:22 +03:00
Önder Kalacı	f7e881a4c4	Do not create additional WaitEventSet for RemoteSocketClosed checks (#6505 ) Fixes #6501 Before this commit, we created an additional WaitEventSet for checking whether the remote socket is closed per connection - only once at the start of the execution. However, for certain workloads, such as pgbench select-only workloads, the creation/deletion of the additional WaitEventSet adds ~7% CPU overhead, which is also reflected on the benchmark results. With this commit, we use the same WaitEventSet for the purposes of checking the remote socket at the start of the execution. We use "rebuildWaitEventSet" flag so that the executor can re-use the existing WaitEventSet. As a result, we see the following improvements on PG 15: main : 120051 tps, 0.532 ms latency avg. avoid_wes_rebuild: 127119 tps, 0.503 ms latency avg. And, on PG 14, as expected, there is no difference main : 129191 tps, 0.495 ms latency avg. avoid_wes_rebuild: 129480 tps, 0.494 ms latency avg. But, note that PG 15 is slightly (~1.5%) slower than PG 14. That is probably the overhead of checking the remote socket.	2022-12-14 22:53:38 +01:00
Onder Kalaci	feb5534c65	Do not create additional WaitEventSet for RemoteSocketClosed checks Before this commit, we created an additional WaitEventSet for checking whether the remote socket is closed per connection - only once at the start of the execution. However, for certain workloads, such as pgbench select-only workloads, the creation/deletion of the additional WaitEventSet adds ~7% CPU overhead, which is also reflected on the benchmark results. With this commit, we use the same WaitEventSet for the purposes of checking the remote socket at the start of the execution. We use "rebuildWaitEventSet" flag so that the executor can re-use the existing WaitEventSet. As a result, we see the following improvements on PG 15: main : 120051 tps, 0.532 ms latency avg. avoid_wes_rebuild: 127119 tps, 0.503 ms latency avg. And, on PG 14, as expected, there is no difference main : 129191 tps, 0.495 ms latency avg. avoid_wes_rebuild: 129480 tps, 0.494 ms latency avg. But, note that PG 15 is slightly (~1.5%) slower than PG 14. That is probably the overhead of checking the remote socket.	2022-12-14 22:42:55 +01:00
Onder Kalaci	d52da55ac0	Move WaitEvent to DistributedExecution Prep. for caching WaitEventsSet/WaitEvents	2022-12-14 21:59:19 +01:00
Nils Dijk	b5b73d78c3	add prepare and finish pg upgrade functions to 11.2-1 (#6560 ) Fixes a missed include in #6315. While adding the cluster clock we have added some extra steps to `citus_prepare_pg_upgrade` and `citus_finish_pg_upgrade`. These changes were not added to the citus upgrade and downgrade scripts, this allowed for a syntax error to slip in. This PR adds the new versions of both UDF's to the upgrade script while adding the old version to the downgrade script. This exposed the syntax error which is also solved.	2022-12-14 12:34:22 +01:00
Gokhan Gulbiz	556161be32	Fix make recipe mapping in test runner (#6561 )	2022-12-14 12:57:13 +03:00
aykut-bozkurt	8be4ce546e	fix vanilla test status on CI (#6555 ) - Because of the make command used for vanilla tests, test status is always shown as success on CI. As a fix, I added `&& false` at the end of the command that copies the diff file, so that the command fails when check-vanilla fails. ```make check-vanilla: all $(pg_regress_multi_check) --vanillatest \|\| (cp $(vanilla_diffs_file) $(citus_abs_srcdir)/regression.diffs && false) ``` - I also fixed some vanilla tests that fail because recently added clock-related operators show up in some queries.	2022-12-13 11:15:47 +03:00
Gürkan İndibay	3f091e3493	Give nicer error message when using alter_table_set_access_method on a view (#6553 ) DESCRIPTION: Fixes alter_table_set_access_method error for views. Fixes #6001	2022-12-12 23:56:22 +03:00
aykut-bozkurt	1ad1a0a336	add citus_task_wait udf to wait on desired task status (#6475 ) We already have citus_job_wait to wait until the job reaches the desired state. That PR adds waiting on task state to allow more granular waiting. It can be used for Citus operations. Moreover, it is also useful for testing purposes. (wait until a task reaches specified state) Related to #6459.	2022-12-12 22:41:03 +03:00
aykut-bozkurt	80686907a3	print stack trace from core files instead of uploading them (#6539 ) Prints stack trace from core files instead of uploading them.	2022-12-12 18:53:17 +03:00
Gokhan Gulbiz	d307e342a2	Use test name parameter in flakiness detection (#6559 ) This PR changes test-flakyness CI job to pass the test name instead of the file path to `run_test.py` script.	2022-12-12 17:53:25 +03:00
aykut-bozkurt	3da6e3e743	bgworkers with backend connection should handle SIGTERM properly (#6552 ) Fixes task executor SIGTERM handling. Problem: When task executors are sent SIGTERM, their default handler `bgworker_die`, which is set at worker startup, logs a FATAL error. But they do not release locks there before logging the error, which sometimes causes the monitor to hang. e.g. Monitor waits for the lock forever at pg_stat flush after calling proc_exit. Solution: Because executors have a connection to the backend, they should handle SIGTERM similarly to normal backends. Normal backends use the `die` handler, in which they set the ProcDiePending flag, and the next CHECK_FOR_INTERRUPTS call handles it gracefully by releasing any lock before termination.	2022-12-12 16:44:36 +03:00
dependabot[bot]	f6b8990fc7	Bump certifi from 2022.9.14 to 2022.12.7 in /src/test/regress (#6554 ) Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.9.14 to 2022.12.7. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-12-12 14:13:08 +01:00
Gokhan Gulbiz	e2a73ad8a8	Flaky Test Detection CI Workflow (#6495 ) This PR adds a new CI workflow named ```flaky-test``` to run flaky test detection on newly introduced regression tests. Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2022-12-12 14:36:23 +03:00
Ahmet Gedemenli	190307e8d8	Wait for cleanup function (#6549 ) Adding a testing function `wait_for_resource_cleanup` which waits until all records in `pg_dist_cleanup` are cleaned up. The motivation is to prevent flakiness in our tests, since the `NOTICE: cleaned up X orphaned resources` message is not consistent in many cases. This PR replaces `citus_cleanup_orphaned_resources` calls with `wait_for_resource_cleanup` calls.	2022-12-08 13:19:25 +03:00
Teja Mupparti	cbb33167f9	Fix the flaky test in clock.sql	2022-12-07 09:47:35 -08:00
Ahmet Gedemenli	989a3b54c9	Disable maintenance daemon for cleanup test (#6551 ) Disabling the cleanup in maintenance daemon, to prevent a flaky test.	2022-12-07 20:28:55 +03:00
Onur Tirtir	1e415bb1c3	Support outer joins where the outer rel is a recurring one and the inner one is a non-recurring one (#6512 ) DESCRIPTION: Adds support for outer joins having a recurring rel in the outer side of the join (e.g., \<reference table\> LEFT JOIN \<distributed table\>) Closes #6219. Closes #521 If the outer part of an outer join is a recurring rel (i.e., reference table or an intermediate_result injected into the query during the earlier stages of the recursive planning), Citus cannot run the join query if the other side of the join is not a recurring rel (i.e., distributed table). See DeferredErrorIfUnsupportedRecurringTuplesJoin for the reasoning. And to support such joins, now we start recursively planning distributed side of such joins so that non-recurring rel becomes an intermediate result (and hence a recurring rel) since Citus already knows how to compute an outer join between two recurring rels already. In the simplest scenario, this means to convert _"\<reference\> LEFT JOIN \<distributed\>"_ to _"\<reference\> LEFT JOIN \<intermediate_result\>"_ by wrapping the distributed table into a subquery. - [x] Add support for outer joins having a recurring rel in the outer side and a "distributed table" () in the inner side of the join - [x] Expand "distributed table" concept to "distributed rel" in first item. This means that; - [x] Currently RecursivelyPlanNonRecurringJoinNode doesn't know how to wrap a sub join tree that constitutes a recurring rel, e.g., rhs clause of the following join: `reference LEFT OUTER <distributed INNER JOIN distributed>`; fix this. - [x] Similar to previous item, currently RecursivelyPlanNonRecurringJoinNode doesn't know how to handle subqueries constituting a distributed rel, e.g., `SELECT FROM ref LEFT JOIN (SELECT * FROM dist_1) u1 ON (ref.a = u1.a);`; fix this. 
- [x] Add lateral join checks for now-supported outer joins into recursive planner - [x] Fix regressions tests - [x] Verified each test output file by first un-distributing Citus tables involved in related queries and re-running the test file. - [x] Some of the tests --that were not supposed to return any data before but this PR adds support for-- were likely to get flaky, so added some "ORDER BY"s to them. - [x] Continue doing manual testing and start writing a test file for the join clauses that this PR adds support for --not only rely on existing tests See https://github.com/citusdata/citus/issues/6546 for what we could do further.	2022-12-07 18:44:00 +03:00
Onur Tirtir	b177975371	Add new regression tests	2022-12-07 18:27:50 +03:00
Onur Tirtir	2803470b58	Add lateral join checks for outer joins and drop the useless ones for semi joins	2022-12-07 18:27:50 +03:00
Onur Tirtir	e7e4881289	Phase - III: recursively plan non-recurring sub join trees too	2022-12-07 18:27:50 +03:00
Onur Tirtir	f52381387e	Phase - II: recursively plan non-recurring subqueries too	2022-12-07 18:27:50 +03:00
Onur Tirtir	f339450a9d	Phase - I: recursively plan non-recurring relations	2022-12-07 18:27:50 +03:00
Ahmet Gedemenli	3cc5d9842a	Remove IF EXISTS from cleanup on failure test for subscription object (#6547 ) Nothing critical. Just improving a DROP SUBSCRIPTION test for a cleanup after failure scenario.	2022-12-07 17:51:36 +03:00
Jelte Fennema	7499c3073d	Push coverage to codeclimate (#6538 ) In addition to pushing coverage results to codecov, this now also pushes them to codeclimate. This is meant so we can evaluate codeclimate.	2022-12-06 16:01:20 +01:00
Ahmet Gedemenli	cb02d62369	Unique names for replication artifacts (#6529 ) DESCRIPTION: Create replication artifacts with unique names We're creating replication objects with generic names. This prevents us from enabling parallel shard moves, as two operations might use the same objects. With this PR, we'll create the objects below with operation-specific names, by appending OperationId to the names. * Subscriptions * Publications * Replication Slots * Users created for subscriptions	2022-12-06 15:48:16 +03:00
Teja Mupparti	e14dc5d45d	Address the issues/comments from the original PR# 6315 1) Regular users fail to use clock UDF with permission issue. 2) Clock functions were declared as STABLE, whereas by definition they are VOLATILE. By design, any clock/time functions will return different results for each call even within a single SQL statement. Note: UDF citus_get_transaction_clock() is a misnomer as it internally calls the clock tick which always returns different results for every invocation in the same transaction.	2022-12-05 11:06:21 -08:00
aykut-bozkurt	65f256eec4	* add SIGTERM handler to gracefully terminate task executors, \ (#6473 ) Adds signal handlers for graceful termination, cancellation of task executors and detecting config updates. Related to PR #6459. #### How to handle termination signal? The monitor needs to gracefully terminate all running task executors before terminating. Hence, we have a SIGTERM handler for the monitor. #### How to handle cancellation signal? The monitor needs to gracefully cancel all running task executors before terminating. Hence, we have a SIGINT handler for the monitor. #### How to detect configuration changes? The monitor has a SIGHUP handler to reflect configuration changes while executing tasks.	2022-12-02 18:15:31 +03:00
aykut-bozkurt	6781ace3a1	find core files from correct path on CI (#6535 ) Finds core files from correct path on CI. According to default core pattern on CI, core is generated at the location relative to binary is executed. It can be safe to set core pattern before running binary but to change a kernel param(in our case kernel.core_pattern), you need related privilege in docker container. Or you have to change it at image build. But, by default, on CI machines, kernel pattern contains a relative path to binary + pid + process name, so we do not need to set it explicitly for now. (Example core file name on CI machine: `core.2559.!usr!lib!postgresql!14!bin!postgres`)	2022-12-02 18:04:29 +03:00
songjinzhou	ad6450b793	fix the problem #5763 (#6519 ) Co-authored-by: TsinghuaLucky912 <tsinghualucky912@foxmail.com> Fixes https://github.com/citusdata/citus/issues/5763	2022-12-02 13:49:32 +01:00
Ahmet Gedemenli	3b24c47470	Fix flaky cleanup tests (#6530 ) We are having some flakiness in our test schedule because of the objects leftover from shard moves/splits. With this commit we prevent logging cleanup object counts. fixes: #6534	2022-12-02 12:39:36 +03:00
Hanefi Onaldi	d4394b2e2d	Fix spacing in multiline strings (#6533 ) When using multiline strings, we occasionally forget to add a single space at the end of the first line. When this line is concatenated with the next one, the resulting string has a missing space.	2022-12-01 23:42:47 +03:00
Fabrízio de Royes Mello	37f3dff1ca	Simplify columnar perf example (#6526 ) Rewrite the plpython function to generate random words in SQL to simplify the usage and run the example.	2022-12-01 20:05:40 +01:00
songjinzhou	29f0196fdf	Add support for SET ACCESS METHOD in altering a distributed table (#6525 ) Co-authored-by: TsinghuaLucky912 <postgres@localhost.localdomain>	2022-12-01 17:45:32 +01:00
Gürkan İndibay	c2193608c9	Add jobs to test builds on different distros (#6499 ) With this PR, citus code will be tested in all packaging environments. Sometimes there can be compile errors that block packaging, and in this case unplanned delays may occur. By testing the code in packaging environments, I'm aiming to detect any compilation errors before packaging. Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2022-12-01 19:11:41 +03:00
Hanefi Onaldi	1f29c16262	Fix misleading GUC description (#6532 ) citus.skip_advisory_lock_permission_checks skips checks when it is set to 'on', not 'off'	2022-12-01 15:43:02 +03:00
Ahmet Gedemenli	0e92244bfe	Cleanup for shard moves (#6472 ) DESCRIPTION: Extend cleanup process for replication artifacts This PR adds new cleanup record types for: * Subscriptions * Replication slots * Publications * Users created for subscriptions We add records for these object types, to `pg_dist_cleanup` during creation phase. Once the operation is done, in case of success or failure, we iterate those records and drop the objects. With this PR we will not be dropping any of these objects during the operation. In short, we will always be deferring the drop. One thing that's worth mentioning is that we sort cleanup records before processing (dropping) them, because of dependency relations among those objects, e.g a subscription might depend on a publication. Therefore, we always drop subscriptions before publications. We have some renames in this PR: * `TryDropOrphanedShards` -> `TryDropOrphanedResources` * `DropOrphanedShardsForCleanup` -> `DropOrphanedResourcesForCleanup` * `run_try_drop_marked_shards` -> `run_try_drop_marked_resources` as these functions now process replication artifacts as well. This PR drops function `DropAllLogicalReplicationLeftovers` and its all usages, since now we rely on the deferring drop mechanism.	2022-11-30 15:38:05 +03:00
aykut-bozkurt	1f8675da43	nonblocking concurrent task execution via background workers (#6459 ) Improvement on our background task monitoring API (PR #6296) to support concurrent and nonblocking task execution. Mainly we have a queue monitor background process which forks task executors for `Runnable` tasks and then monitors their status by fetching messages from shared memory queue in nonblocking way.	2022-11-30 14:29:46 +03:00
aykut-bozkurt	83ef600f27	fix false full join pushdown error check (#6523 ) Problem: Currently, we error out if we detect recurring tuples in one side without checking the other side of the join. Solution: When one side of the full join consists of recurring tuples and the other side consists of nonrecurring tuples, we should not push down, to prevent duplicate results. Otherwise, it is safe to push down.	2022-11-30 14:17:56 +03:00
Gokhan Gulbiz	bc118ee551	Change GUC propagation flag's default value to off (#6516 ) This PR changes ```citus.propagate_session_settings_for_loopback_connection``` default value to off not to expose this feature publicly at this point. See #6488 for details.	2022-11-29 13:25:53 +03:00
Jelte Fennema	e12d97def2	Fix flakiness in multi_metadata_access (#6524 ) Sometimes multi_metadata_access failed like this in CI: ```diff AND ext.extname = 'citus' AND nsp.nspname = 'pg_catalog' AND NOT has_table_privilege(pg_class.oid, 'select'); oid --------------------------- - pg_dist_authinfo pg_dist_clock_logical_seq + pg_dist_authinfo (2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/28784/workflows/e462f118-eb64-4a3f-941a-e04115334f9b/jobs/883443 This fixes that by ordering the column.	2022-11-29 10:00:06 +01:00
Philip Dubé	617cac4024	Grammar: it's to its (#6527 ) Includes an error message & one case of its to it's Also fix "to the to" typos	2022-11-29 05:56:44 +00:00
Philip Dubé	cf69fc3652	Grammar: it's to its Includes an error message & one case of its to it's Also fix "to the to" typos	2022-11-28 20:43:44 +00:00
Jelte Fennema	68de2ce601	Include gpid in all internal application names (#6431 ) When debugging issues it's quite useful to see the originating gpid in the application_name of a query on a worker. This already happens for most queries, but not for queries created by the rebalancer or by run_command_on_worker. This adds a gpid to those two application_names too. Note, that if the GPID of the new application_names is different than the current GPID of the backend the backend will continue to keep the old gpid as its actual GPID. This PR is just meant to make sure that the application_name is as useful as it can be for users to look at. Updating of gpids will be done in a follow-up PR, and adding gpids to all internal connections will make this easier.	2022-11-25 11:16:33 +01:00
Teja Mupparti	edaf88e0ff	Fix the dangling pointer bug in get_merged_argument_list()	2022-11-22 09:41:10 -08:00
Onur Tirtir	80faf47ab5	Fix dangling pointer warning in AnyTableReplicated (#6504 ) DESCRIPTION: Fixes a potential dangling pointer issue Need to backport to 11.0 & 11.1 since we might want to release packages for debian/bookworm based on those branches in future.	2022-11-21 16:42:00 +03:00
Jelte Fennema	a477ffdf4b	Correctly fix OpenSSL 3.0 warnings (#6502 ) In #6038 I tried to fix OpenSSL 3.0 warnings with PG13, but I had made a mistake when doing that. This actually fixes these warnings.	2022-11-18 14:35:41 +01:00
Emel Şimşek	8e5ba45b74	Fixes a bug that causes crash when using auto_explain extension with ALTER TABLE...ADD FOREIGN KEY... queries. (#6470 ) Fixes a bug that causes crash when using auto_explain extension with ALTER TABLE...ADD FOREIGN KEY... queries. Those queries trigger a SELECT query on the citus tables as part of the foreign key constraint validation check. At the explain hook, workers try to explain this SELECT query as a distributed query causing memory corruption in the connection data structures. Hence, we will not explain ALTER TABLE...ADD FOREIGN KEY... and the triggered queries on the workers. Fixes #6424.	2022-11-15 17:53:39 +03:00
Hanefi Onaldi	0ee973368b	CI: Bump PG versions to latest minors (#6493 ) Related: https://github.com/citusdata/the-process/pull/97	2022-11-15 16:31:13 +03:00
Hanefi Onaldi	2e0ee262d0	Fix changelog format (#6480 ) This error was due to a mistake in #6479	2022-11-14 11:00:54 +03:00
Teja Mupparti	7358b826ef	Remove the explicit-transaction requirement for the UDF citus_get_transaction_clock() as implicit transactions too use this UDF.	2022-11-10 10:54:36 -08:00
Marco Slot	77fbcfaf14	Propagate BEGIN properties to worker nodes (#6483 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-10 18:08:43 +01:00
rajeshkt78	d7d5f0df3e	Added a workaround for a bug in git ls-files command. (#6487 ) DESCRIPTION: Added a workaround for a bug in git ls-files command. https://community.garden.io/t/command-git-ls-files-ignored-failed-with-code-128/117 Option "--cached" is added to avoid this issue.	2022-11-10 16:22:39 +05:30
rajeshkt78	7d75bbf734	Update fix_gitignore.sh	2022-11-10 15:31:55 +05:30
Rajesh Kumar Thandapani	d5abcefc98	Added a workaround for a bug in git ls-files command.	2022-11-10 15:28:21 +05:30
Hanefi Onaldi	01ec971108	Add changelog entries for 11.0.7 (#6479 )	2022-11-08 12:01:09 +03:00
Onur Tirtir	ed2204cd1d	Improve test targets in Makefile (#5542 )	2022-11-08 10:07:20 +03:00
Onur Tirtir	e0363470bc	Add missing targets to check-full	2022-11-08 09:59:55 +03:00
Onur Tirtir	7b3e55f903	Add missing dependencies to test targets	2022-11-08 09:59:55 +03:00
Marco Slot	fcaabfdcf3	Remove remaining master_create_distributed_table usages (#6477 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-04 16:30:06 +01:00
Marco Slot	666696c01c	Deprecate citus.replicate_reference_tables_on_activate, make it always off (#6474 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-11-04 16:21:10 +01:00
Naisila Puka	b8c7a9844c	Add docs on handling alternative test outputs (#6469 ) I recently cleaned up our test suite from redundant test outputs: #6111 #6140 #6214 #6140 #6434 I had to check many files manually, as they didn't have any documentation on why the alternative test output existed in the first place. Adding a section in our test docs to remind developers to add alternative test outputs with enough information/keywords.	2022-11-03 10:55:50 +03:00
Onur Tirtir	1af28b3f27	Use CommitContext for subxact mgmt and reduce memory usage in CommitContext (#6099 ) (Hopefully) Fixes #5000. If memory allocation done for `SubXactContext state` in `PushSubXact()` fails, then `PopSubXact()` might segfault, for example, when grabbing the topmost `SubXactContext` from `activeSubXactContexts` if this is the first ever subxact within the current xact, with the following stack trace: ```c citus.so!list_nth_cell(const List list, int n) (\opt\pgenv\pgsql-14.3\include\server\nodes\pg_list.h:260) citus.so!PopSubXact(SubTransactionId subId) (\home\onurctirtir\citus\src\backend\distributed\transaction\transaction_management.c:761) citus.so!CoordinatedSubTransactionCallback(SubXactEvent event, SubTransactionId subId, SubTransactionId parentSubid, void * arg) (\home\onurctirtir\citus\src\backend\distributed\transaction\transaction_management.c:673) CallSubXactCallbacks(SubXactEvent event, SubTransactionId mySubid, SubTransactionId parentSubid) (\opt\pgenv\src\postgresql-14.3\src\backend\access\transam\xact.c:3644) AbortSubTransaction() (\opt\pgenv\src\postgresql-14.3\src\backend\access\transam\xact.c:5058) AbortCurrentTransaction() (\opt\pgenv\src\postgresql-14.3\src\backend\access\transam\xact.c:3366) PostgresMain(int argc, char ** argv, const char * dbname, const char * username) (\opt\pgenv\src\postgresql-14.3\src\backend\tcop\postgres.c:4250) BackendRun(Port * port) (\opt\pgenv\src\postgresql-14.3\src\backend\postmaster\postmaster.c:4530) BackendStartup(Port * port) (\opt\pgenv\src\postgresql-14.3\src\backend\postmaster\postmaster.c:4252) ServerLoop() (\opt\pgenv\src\postgresql-14.3\src\backend\postmaster\postmaster.c:1745) PostmasterMain(int argc, char argv) (\opt\pgenv\src\postgresql-14.3\src\backend\postmaster\postmaster.c:1417) main(int argc, char argv) (\opt\pgenv\src\postgresql-14.3\src\backend\main\main.c:209) ``` For this reason, to be more defensive against memory-allocation errors that could happen at `PushSubXact()`, now 
we use our pre-allocated memory context for the objects created in `PushSubXact()`. This commit also attempts reducing the memory allocations done under CommitContext to reduce the chances of consuming all the memory available to CommitContext. Note that it's problematic to encounter with such a memory-allocation error for other objects created in `PushSubXact()` as well, so above is an example scenario that might result in a segfault. DESCRIPTION: Fixes a bug that might cause segfaults when handling deeply nested subtransactions	2022-11-03 00:57:32 +03:00
Onur Tirtir	a5f7f001b0	Make sure to disallow triggers that depend on extensions (#6399 ) DESCRIPTION: Makes sure to disallow triggers that depend on extensions We were already doing so for `ALTER trigger DEPENDS ON EXTENSION` commands. However, we also need to disallow creating Citus tables having such triggers already, so this PR fixes that.	2022-11-02 16:27:31 +03:00
Alexander Kukushkin	deeacfee04	Improve a query that terminates competing backends from citus_update_node() (#6468 ) DESCRIPTION: Improve a query that terminates competing backends from citus_update_node() 1. Use pg_blocking_pids() function instead of self join on pg_locks. It has existed since 9.6 and is more accurate than pg_locks. 2. Prefix all function calls with pg_catalog schema to prevent privilege escalation by creating functions with similar names in a public schema. 3. Change logs and update comments to reflect the fact that the pg_terminate_backend() function only sends SIGTERM and does not wait for the actual backend termination.	2022-11-02 12:32:00 +01:00
Alexander Kukushkin	402a30a2b7	Allow citus_update_node() to work with nodes from different clusters (#6466 ) DESCRIPTION: Allow citus_update_node() to work with nodes from different clusters The citus_update_node(), citus_nodename_for_nodeid(), and citus_nodeport_for_nodeid() functions only checked for nodes in their own clusters. Hence the last two returned NULLs and the first one showed an error if the nodeId was from a different cluster. Fixes https://github.com/citusdata/citus/issues/6433	2022-11-02 10:07:01 +01:00
oohira	3f66f3d9dd	Add missing space to citus.shard_count description (#6464 ) DESCRIPTION: Add missing space to citus.shard_count description	2022-10-31 10:37:14 +01:00
Teja Mupparti	69f75af62d	Remove unused macros	2022-10-28 10:38:07 -07:00
Teja Mupparti	01103ce05d	This implements a new UDF citus_get_cluster_clock() that returns a monotonically increasing logical clock. Clock guarantees to never go back in value after restarts, and makes best attempt to keep the value close to unix epoch time in milliseconds. Also, introduces a new GUC "citus.enable_cluster_clock", when true, every distributed transaction is stamped with logical causal clock and persisted in a catalog pg_dist_commit_transaction.	2022-10-28 10:15:08 -07:00
Nils Dijk	9249fd5c5d	add security.md from template (#6462 ) Recently a question was posed in the community how to handle security related reports to Citus. Other Microsoft owned repositories include a `SECURITY.md` file explaining how security related incidents can be reported. Thanks @JelteF for finding these. Looking around in internal systems I found a checklist for opensourcing repositories where a SECURITY.md template was provided. For now we only add the `SECURITY.md` file as it was prompted in the community how to handle these.	2022-10-26 14:56:08 +02:00
Ahmet Gedemenli	c379ff8614	Drop defer drop gucs (#6447 ) DESCRIPTION: Drops GUC defer_drop_after_shard_split DESCRIPTION: Drops GUC defer_drop_after_shard_move Drop GUCs and related parts from the code. Delete tests that were specifically added for the GUCs. Keep tests that can be used without the GUCs. Update test output changes. The motivation for this PR is to have an "always deferring" mechanism. These two GUCs provide an option to drop objects immediately during a shard move/split instead of deferring the drop. With this PR, we will always defer dropping orphaned shards and other types of objects. We will have a separate PR to extend the deferred cleanup operation, so that we would create records for deferred drop, for Subscriptions, Publications, Replication Slots etc. This will let us keep track of created objects that need to be dropped during a shard move/split. We will have objects created specifically for the current operation; and those objects will be dropped at the end. We have an issue (a draft roadmap) for enabling parallel shard moves. For details please see: https://github.com/citusdata/citus/issues/6437	2022-10-25 16:48:34 +03:00
Hanefi Onaldi	915d1b3b38	Repartition tests for numeric types with neg scale (#6358 ) This PR adds some test cases where repartition join correctly prunes shards on two tables that have numeric columns with negative scale.	2022-10-24 20:59:05 +03:00
Jelte Fennema	20a4d742aa	Fix flakyness in failure_split_cleanup (#6450 ) Sometimes in CI our failure_split_cleanup test would fail like this: ```diff CALL pg_catalog.citus_cleanup_orphaned_resources(); -NOTICE: cleaned up 79 orphaned resources +NOTICE: cleaned up 82 orphaned resources SELECT operation_id, object_type, object_name, node_group_id, policy_type ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/28107/workflows/4ec712c9-98b5-4e90-9806-e02a37d71679/jobs/846107 The reason was that previous tests in the schedule would also create some orphaned resources. Sometimes some of those would already be cleaned up by the maintenance daemon, resulting in a different number of cleaned up resources than expected. This cleans up any previously created resources at the start of the test without logging how many exactly were cleaned up. As a bonus this now also allows running this test using check-failure-base.	2022-10-24 17:35:31 +02:00
Hanefi Onaldi	ee2a9a7fe7	Add changelog entries for 11.1.4 (#6456 )	2022-10-24 13:57:08 +03:00
Hanefi Onaldi	7c5b787b9c	Add changelog entries for 11.1.4	2022-10-24 12:46:41 +03:00
Onur Tirtir	2d14dd85e9	Not hardcode "false" in UpdateAutoConvertedForConnectedRelations (#6452 ) This didn't cause any bugs since today we're always calling UpdateAutoConvertedForConnectedRelations with autoconverted=false, so we don't need to backport this to anywhere.	2022-10-21 18:14:20 +03:00
Onur Tirtir	dbe2749bbf	Drop unreachable code from query_pushdown_planning.c (#6451 ) Given that we cannot continue after a `RaiseDeferredErrorInternal(.., ERROR)` call.	2022-10-21 18:04:31 +03:00
Jelte Fennema	7f05ad033a	Add a section on PR descriptions to flaky test docs (#6446 ) Good PR descriptions for flaky tests are quite helpful when reviewing. Although obviously no PR description is the same, there's a few common pieces of information that are useful for all PRs that fix flaky tests.	2022-10-21 16:52:31 +02:00
aykut-bozkurt	162c8a5160	Drop worker_fetch_foreign_file/worker_repartition_cleanup only if they exist when upgrading Citus (#6441 ) We should not introduce breaking SQL changes to upgrade files after they are released. We did that for worker_fetch_foreign_file in v9.0.0 and worker_repartition_cleanup in v9.2.0. Later, when we tried to drop those UDFs, they were unexpectedly missing for some clients due to the breaking change in an old upgrade script. For that case, the fix is to add DROP IF EXISTS for those 2 UDFs in 11.0-4--11.1-1.	2022-10-21 14:32:42 +03:00
Emel Şimşek	02fd1e6c03	Fix the crash that happens when using auto_explain extension with recursive queries (#6406 ) This crash happens with recursively planned queries. For such queries, subplans are explained via the ExplainOnePlan function of PostgreSQL. This function reconstructs the query description from the plan, so it expects the ActiveSnapshot for the query to be available. This fix makes sure that the snapshot is in the stack before calling ExplainOnePlan. Fixes #2920.	2022-10-19 18:04:45 +03:00
Jelte Fennema	737e2bb1bb	Don't leak search_path to workers on DDL (#6444 ) DESCRIPTION: Don't leak search_path to workers on DDL For DDL we have to set the `search_path` on workers to the same as on the coordinator for some DDL to work. Previously this search_path would leak outside of the transaction that was used for the DDL. This fixes that by using `SET LOCAL` instead of `SET`. The only place where we still use plain `SET` is for DDL commands that are not allowed within transactions, such as `CREATE INDEX CONCURRENTLY`. This fixes this flaky test: ```diff CONTEXT: SQL statement "SELECT change_id FROM distributed_triggers.data_changes WHERE shard_key_value = NEW.shard_key_value AND object_id = NEW.object_id ORDER BY change_id DESC LIMIT 1" -PL/pgSQL function record_change() line XX at SQL statement +PL/pgSQL function distributed_triggers.record_change() line 17 at SQL statement while executing command on localhost:57638 DELETE FROM data_ref_table where shard_key_value = 'hello'; ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27849/workflows/75ae5f1a-100b-4b7a-b991-7de069f39ee1/jobs/831429 I had tried to fix this flaky test in #5894 and then I tried implementing a better fix in #5896, where @marcocitus suggested this better fix. This change reverts the fix from #5894 and implements the fix suggested by Marco. Our multi_mx_alter_distributed_table test actually depended on the old buggy search_path leaking behavior. After fixing the bug that test would fail like this: ```diff CALL proc_0(1.0); DEBUG: pushing down the procedure -NOTICE: Res: 3 -DETAIL: from localhost:xxxxx +ERROR: relation "test_proc_colocation_0" does not exist +CONTEXT: PL/pgSQL function mx_alter_distributed_table.proc_0(double precision) line 5 at SQL statement +while executing command on localhost:57637 RESET client_min_messages; ``` I fixed this test by fully qualifying the table names used in the procedure. 
I think it's quite unlikely that actual users depend on this behavior though, since it would require first doing DDL before calling a procedure in a session where the search_path was changed after connecting.	2022-10-19 16:47:35 +02:00
Ahmet Gedemenli	cdbda9ea6b	Add failure test for shard move (#6325 ) DESCRIPTION: Adds failure test for shard move DESCRIPTION: Remove function `WaitForAllSubscriptionsToBecomeReady` and related tests Adding some failure tests for shard moves. Dropping the not-needed-anymore function `WaitForAllSubscriptionsToBecomeReady`, as the subscriptions now start as ready from the beginning because we don't use logical replication table sync workers anymore. fixes: #6260	2022-10-19 14:25:26 +02:00
Gokhan Gulbiz	56da3cf6aa	Increase node_connection_timeout to prevent flakiness in shard_rebalancer regression tests (#6445 ) In CI shard_rebalancer sometimes fails with this error: ```diff SET citus.node_connection_timeout to 60; BEGIN; SET LOCAL citus.shard_replication_factor TO 2; SET citus.log_remote_commands TO ON; SET SESSION citus.max_adaptive_executor_pool_size TO 5; SELECT replicate_table_shards('dist_table_test_2', max_shard_copies := 4, shard_transfer_mode:='block_writes'); +WARNING: could not establish connection after 60 ms ``` Source https://app.circleci.com/pipelines/github/citusdata/citus/28128/workflows/38eeacc4-4191-4366-87ed-9a628414965a/jobs/847458?invite=true#step-107-21 This PR avoids this issue by increasing ```citus.node_connection_timeout``` to 35s.	2022-10-19 13:03:14 +03:00
Onur Tirtir	5aec88d084	Not try locking relations referencing to views (#6430 ) Since there can't be such a foreign key already. This mainly fixes the error that Citus throws when trying to truncate a distributed view. Fixes #5990.	2022-10-19 11:24:22 +03:00
Önder Kalacı	93e162def6	Bump PG version to 15 on the README (#6442 )	2022-10-18 13:22:28 -05:00
Jelte Fennema	f756db39c4	Add docs on how to fix flaky tests (#6438 ) I fixed a lot of flaky tests recently and I found some patterns in the type of issues and type of fixes. This adds a document that lists these types of issues and explains how to fix them.	2022-10-18 15:52:01 +02:00
Gokhan Gulbiz	e87eda6496	Introduce a new GUC to propagate local settings to new connections in rebalancer (#6396 ) DESCRIPTION: Introduce ```citus.propagate_session_settings_for_loopback_connection``` GUC to propagate local settings to new connections. Fixes: #5289	2022-10-18 12:50:30 +03:00
Jelte Fennema	60eb67b908	Increase shard move test coverage by improving advisory locks (#6429 ) To be able to test non-blocking shard moves we take an advisory lock, so we can pause the shard move at an interesting moment. Originally this was during the logical replication catch up phase. But when I added tests for the rebalancer progress I moved this lock before the initial data copy. This allowed testing of the rebalance progress, but inadvertently made our non-blocking tests not actually test if we held unintended locks during logical replication catch up. This fixes that by creating two types of advisory locks, one before the copy and one after. This causes the tests to actually test their intended scenario again. Furthermore it starts using one of these locks for blocking shard moves too. Which allowed me to reduce the complexity of the rebalance progress test suite quite a bit. It also allowed enabling some flaky tests again, because this stopped them from being flaky. And finally it allowed testing of rebalance progress for blocking shard copy operations as well. In passing it fixes a flaky test during parallel blocking shard moves by ordering the output.	2022-10-17 17:32:28 +02:00
Ahmet Gedemenli	96912d9ba1	Add status column to get_rebalance_progress() (#6403 ) DESCRIPTION: Adds status column to get_rebalance_progress() Introduces a new column named `status` for the function `get_rebalance_progress()`. For each ongoing shard move, this column will reveal information about that shard move operation's current status. For now, candidate status messages could be one of the below. * Not Started * Setting Up * Copying Data * Catching Up * Creating Constraints * Final Catchup * Creating Foreign Keys * Completing * Completed	2022-10-17 16:55:31 +03:00
Naisila Puka	8323f4f12c	Cleans up test outputs (#6434 )	2022-10-17 15:13:07 +03:00
Hanefi Onaldi	82ea76bc0c	Bump PG15 CI images to 15.0 (#6439 ) Related: citusdata/the-process#95	2022-10-15 13:14:17 +03:00
Önder Kalacı	037eeb3918	Use Azure Cosmos DB for PostgreSQL instead of Azure Database for PostgreSQL in the README (#6432 ) For more details: https://devblogs.microsoft.com/cosmosdb/distributed-postgresql-comes-to-azure-cosmos-db/ Co-authored-by: Claire Giordano <claire@citusdata.com>	2022-10-14 18:17:30 +02:00
Onur Tirtir	4152a391c2	Properly set col names for shard rels that citus_extradata_container points to (#6428 ) Deparser function set_relation_column_names() knows that it needs to re-evaluate column names based on relation's tuple descriptor when the rte belongs to a relation (RTE_RELATION). However before this commit, it didn't know about the fact that citus might wrap such an rte with an rte that points to citus_extradata_container() placeholder. And because of this, it was simply taking the column aliases (e.g., "bar" in "foo AS bar") into the account and this might result in an incorrectly deparsed query as in below case: * Say, if we had view based on following query: ```sql SELECT a FROM table; ``` * And if we rename column "a" to "b", the view query normally becomes: ```sql SELECT b AS a FROM table; ``` * So before this commit, deparsing a query based on that view was resulting in such a query due to deparsing based on the column aliases, which is not correct: ```sql SELECT a FROM table; ``` Fixes #5932. DESCRIPTION: Fixes a bug that might cause failing to query the views based on tables that have renamed columns	2022-10-14 17:31:25 +03:00
Önder Kalacı	8b624b5c9d	Detect remotely closed sockets and add a single connection retry in the executor (#6404 ) PostgreSQL 15 exposes WL_SOCKET_CLOSED in WaitEventSet API, which is useful for detecting closed remote sockets. In this patch, we use this new event and try to detect closed remote sockets in the executor. When a closed socket is detected, the executor now has the ability to retry the connection establishment. Note that the executor can retry connection establishment only for connections that have not been used. Basically, this patch is mostly useful for preventing the executor from failing if a cached connection is closed because of a worker node restart (or worker failover). In other words, the executor cannot retry connection establishment if we are in a distributed transaction AND any command has been sent over the connection. That requires more sophisticated retry mechanisms. For now, fixing the above use case is enough. Fixes #5538 Earlier discussions: #5908, #6259 and #6283 ### Summary of the current approach with regard to earlier trials As noted, we explored some alternatives before getting into this. https://github.com/citusdata/citus/pull/6283 is simple, but lacks an important property. We should be checking for `WL_SOCKET_CLOSED` _before_ sending anything over the wire. Otherwise, it becomes very tricky to understand which connection is actually safe to retry. For example, in the current patch, we can safely check `transaction->transactionState == REMOTE_TRANS_NOT_STARTED` before restarting a connection. #6259 does what we intend here (e.g., check for sending any command). However, as @marcocitus noted, it is very tricky to handle `WaitEventSets` in multiple places. And, the executor is designed such that it reacts to the events. So, adding anything `pre-executor` seemed too ugly. In the end, I converged on this patch. This patch relies on the simplicity of #6283 and also does a very limited handling of `WaitEventSets`, just for our purpose. 
Just before we add any connection to the execution, we check if the remote session has already closed. With that, we do a brief interaction of multiple wait event processing, but with different purposes. The new wait event processing we added does not even consider cancellations. We let that be handled by the main event processing loop. Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-10-14 15:08:49 +02:00
Hanefi Onaldi	4d037f03fe	Add changelog entries for 11.1.3 (#6435 )	2022-10-14 13:04:35 +03:00
Jelte Fennema	0cee79a7ab	Actually enable improved blocked process detection (#6426 ) In #6405 I added improved blocked process detection for isolation tests. But when cleaning up unnecessary code I cleaned up a bit too much. This actually includes the new function definition in our migrations.	2022-10-13 09:50:37 +02:00
Jelte Fennema	ecc37b9028	Fix flakyness in multi_partitioning (#6427 ) In CI multi_partitioning sometimes fails with this error: ```diff SELECT citus_remove_node('localhost', :master_port); - citus_remove_node ---------------------------------------------------------------------- - -(1 row) - +ERROR: tuple concurrently deleted -- d) invalid tables for helper UDFs ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27993/workflows/685e5b20-c923-43e5-8a0d-b932ef4c4914/jobs/839466 This PR avoids this concurrency issue by not running the multi_partitioning test in parallel with other tests.	2022-10-13 10:33:37 +03:00
Onur Tirtir	20847515fa	Hint users to call "citus_set_coordinator_host" first (#6425 ) If an operation requires having the coordinator in pg_dist_node and that is not the case, then we automatically add the coordinator into pg_dist_node if the user hasn't added any worker nodes yet. However, if the user has already added some worker nodes, we throw an error. With this commit, we improve the error thrown in that case. Closes #6423 based on the discussion made there.	2022-10-12 18:18:51 +03:00
Jelte Fennema	6277ffd69e	Reduce isolation flakyness by improving blocked process detection (#6405 ) Sometimes our CI randomly fails on a test in a way similar to this: ```diff step s2-drop: DROP TABLE cancel_table; - + <waiting ...> +step s2-drop: <... completed> starting permutation: s1-timeout s1-begin s1-sleep10000 s1-rollback s1-reset s1-drop ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26524/workflows/5415b84f-13a3-482f-bef9-648314c79a67/jobs/756377 I tried to fix that already in #6252 by disabling the maintenance daemon during isolation tests. But it seems that hasn't fixed all cases of these errors. This is another attempt at fixing these issues that seems to have better results. What it does is that it starts using the pInterestingPids parameter that citus_isolation_test_session_is_blocked receives. With this change we start filtering out block-edges that are not caused by any of these pids. In passing this change also makes it possible to run `isolation_create_distributed_table_concurrently` with `check-isolation-base`	2022-10-12 16:35:09 +02:00
Hanefi Onaldi	ec3eebbaf6	Rename a function that collides with PG15 (#6422 ) PG15 introduced a function called ReplicationSlotName that causes conflicts with our function with the same name. I solved this issue by renaming our function to ReplicationSlotNameForNodeAndOwner Relevant PG commit: `c3b5992b91`	2022-10-12 13:24:04 +03:00
Jelte Fennema	cb34adf7ac	Don't reassign global PID when already assigned (#6412 ) DESCRIPTION: Fix bug in global PID assignment for rebalancer sub-connections In CI our isolation_shard_rebalancer_progress test would sometimes fail like this: ```diff +isolationtester: canceling step s1-rebalance-c1-block-writes after 60 seconds step s1-rebalance-c1-block-writes: SELECT rebalance_table_shards('colocated1', shard_transfer_mode:='block_writes'); - <waiting ...> + +ERROR: canceling statement due to user request step s7-get-progress: ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27855/workflows/2a7e335a-f3e8-46ed-b6bd-6920d42f7214/jobs/831710 It turned out this was an actual bug in the way our assigning of global PIDs interacts with the way we connect to ourselves as the shard rebalancer. The first command the shard rebalancer sends is a SET command to change the application_name to `citus_rebalancer`. If `StartupCitusBackend` is called after this command is processed, then it overwrites the global PID that was extracted from the previous application_name. This makes sure that we don't do that, and continue to use the original global PID. While it might seem that we only call `StartupCitusBackend` once for each query backend, this isn't actually the case. Whenever pg_dist_partition gets ANALYZEd by autovacuum we indirectly call `StartupCitusBackend` again, because we invalidate the cache then. In passing this fixes two other things as well: 1. It sets `distributedCommandOriginator` correctly in `AssignGlobalPID`, by using IsExternalClientBackend(). This doesn't matter much anymore, since AssignGlobalPID effectively becomes a no-op in this PR for any non-external client backends. 2. It passes the application_name to InitializeBackendData in StartupCitusBackend, instead of INVALID_CITUS_INTERNAL_BACKEND_GPID (which effectively was cast to NULL). 
In practice this doesn't change the behaviour of the call, since the call is a no-op for every backend except the maintenance daemon. And the behaviour of the call is the same for NULL as for the application_name of the maintenance daemon.	2022-10-11 16:41:01 +02:00
Naisila Puka	b5d70d2e11	Fix flakyness in alter_table_set_access_method (#6421 ) We decrease verbosity level here to avoid the flaky output https://app.circleci.com/pipelines/github/citusdata/citus/27936/workflows/dc63128a-1570-41a0-8722-08f3e3cfe301/jobs/836153 ```diff select alter_table_set_access_method('ref','heap'); NOTICE: creating a new table for alter_table_set_access_method.ref NOTICE: moving the data of alter_table_set_access_method.ref NOTICE: dropping the old alter_table_set_access_method.ref NOTICE: drop cascades to 2 other objects -DETAIL: drop cascades to materialized view m_ref -drop cascades to view v_ref +DETAIL: drop cascades to view v_ref +drop cascades to materialized view m_ref CONTEXT: SQL statement "DROP TABLE alter_table_set_access_method.ref CASCADE" NOTICE: renaming the new table to alter_table_set_access_method.ref alter_table_set_access_method ------------------------------- (1 row) ```	2022-10-11 16:31:24 +03:00
Naisila Puka	89aa9a015f	Fixes empty password issue (#6417 )	2022-10-11 15:56:44 +03:00
Onur Tirtir	0b81f68def	Use memcpy instead of memcpy_s to avoid pointless limits in columnar (#6419 ) DESCRIPTION: Raises memory limits in columnar from 256MB to 1GB for reads and writes This doesn't completely fix #5918 but at least increases the buffer limits that might cause throwing an error when reading from or writing into columnar storage. A much better approach to fix this is documented in #6420. Replacing memcpy_s with memcpy is quite safe in those places since we make sure to allocate enough memory before writing into the related buffers anyway.	2022-10-11 14:57:31 +03:00
aykut-bozkurt	442cdb2ea5	pg_regress needs the option dlpath for postgres tests to find regress.so (#6416 ) When you run vanilla tests in your local environment, some of the tests try to find the path for regress.so, which is not in the default lib path. That is why we need to specify the regress.so path as the dlpath option. Example failure: ``` LOAD :'regresslib'; +ERROR: could not access file "/home/aykutbozkurt/.pgenv/pgsql-15beta4/lib/regress.so": No such file or directory ``` It is actually in `~/.pgenv/src/postgresql-15beta4/src/test/regress/regress.so` which is found by `$regresslibdir`.	2022-10-11 14:43:06 +03:00
Hanefi Onaldi	4f8d6f6558	Bump PG15 CI images to rc2 (#6407 ) When bumping to RC2, we needed to update one test. The following is the commit message for the change: Remove references to optimization PG15 reverted PG15 introduced an optimization on GROUP BY keys that is now reverted on RC2. Relevant PG Commit: Revert "Optimize order of GROUP BY keys". 443df6e2db932a7cd6d85ddfb67e11a43345130d Depends on: https://github.com/citusdata/the-process/pull/94	2022-10-11 14:30:59 +03:00
Hanefi Onaldi	cbe4298c5b	Remove references to optimization PG15 reverted PG15 introduced an optimization on GROUP BY keys that is now reverted on RC2. Relevant PG commit: Revert "Optimize order of GROUP BY keys". 443df6e2db932a7cd6d85ddfb67e11a43345130d	2022-10-10 21:54:08 +03:00
Hanefi Onaldi	30af70926f	Bump PG15 CI images to rc2	2022-10-10 21:54:08 +03:00
Onur Tirtir	517b72a9d5	Fix use-after-free in GetAlterTriggerStateCommand() (#6413 ) Fix use-after-free in GetAlterTriggerStateCommand() introduced in #6398.	2022-10-10 16:38:21 +03:00
Gokhan Gulbiz	1776bdf654	Limit citus_drain_node to drain the specified node only (#6361 ) DESCRIPTION: Fixes citus_drain_node to drain the specified worker only. Fixes #6267	2022-10-09 13:33:08 +03:00
Onur Tirtir	86e186f671	Retain trigger settings when re-creating the triggers (on shards) (#6398 ) Fixes https://github.com/citusdata/citus/issues/6394. DESCRIPTION: Fixes a bug that causes creating disabled-triggers on shards as enabled Since CREATE TRIGGER doesn't have syntax support to specify whether the trigger should be enabled/disabled, the underlying PG function (`pg_get_triggerdef()`) that we use to generate the command to create the trigger is not enough. For this reason, we append a second command to enable/disable the trigger, right after creating it. We don't retain explicit extension dependencies set by using `ALTER trigger DEPENDS ON EXTENSION` commands either, but apparently the right fix for that is to throw an error as in `PreprocessAlterTriggerDependsStmt()`; so, opened a separate PR to fix that: #6399.	2022-10-06 14:51:07 +03:00
Naisila Puka	27e867afbc	Propagates column aliases (#6400 ) Propagates column aliases in the shard-level commands	2022-10-06 12:27:31 +03:00
Naisila Puka	b5cba3a3fe	Use original relation to retrieve column name because of syscache (#6387 ) During alter_distributed_table, we create a new table like the original table but with the altered options. To retrieve the name of the distribution column, we were using the attribute syscache of the new table, since we already created the new table as identical to the original table. However, the attribute syscaches of these two tables are not the same if the original table has dropped columns. The reason is that dropped columns are all still present in the cache. Hence, for example, the attnos would be different in the syscaches. So, let's use the attribute syscache of the original table.	2022-10-06 12:08:00 +03:00
Ying Xu	f21cbe68f8	[Columnar] Bugfix for Columnar: options ignored during ALTER TABLE rewrite (#6337 ) DESCRIPTION: Fixes a bug that prevents retaining columnar table options after a table-rewrite A fix for this issue: Columnar: options ignored during ALTER TABLE rewrite #5927 The OID for the temporary table created during ALTER TABLE was not the same as the original table's OID so the columnar options were not being applied during rewrite. The change is that I applied the original table's columnar options to the new table so that it has the correct options during write. I also added a test.	2022-10-05 11:42:09 -07:00
Ahmet Gedemenli	e36890ce55	Add source_lsn and target_lsn fields into get_rebalance_progress (#6364 ) DESCRIPTION: Adds source_lsn and target_lsn fields into get_rebalance_progress Adding two fields named `source_lsn` and `target_lsn` to the function `get_rebalance_progress`. Target lsn data is fetched in `GetShardStatistics`, by expanding the query sent to workers (joining with pg_subscription_rel and pg_stat_subscription). It is then put into the hashmap, for each shard. Source lsn data is fetched in `BuildWorkerShardStatististicsHash`, in the loop that iterates over each node, by sending a pg_current_wal_lsn query to each node. It is then put into the hashmap, for each node.	2022-10-05 11:12:24 +03:00
Hanefi Onaldi	e0f8666131	Fix downgrades from 10.2-4 to 10.2-2 (#6383 ) DESCRIPTION: Fixes a bug in `ALTER EXTENSION citus UPDATE` We had a series of changes on columnar that made it impossible for a Citus user to downgrade from 10.2-4 to 10.2-2. Since we only test downgrades to immediate previous versions, we did not capture this in our tests. Here are the series of changes. - `10.2-1` introduced a btree index named `columnar.stripe_first_row_number_idx` - `10.2-3` had a unique index with the same name. To accomplish that, we dropped the btree index, and created a unique index with the same name. - `10.2-4` introduced `columnar_ensure_am_depends_catalog()` that adds pg_depend entries so that Columnar access method depended on objects such as `stripe_first_row_number_idx` If a user upgrades to `>=10.2-4` we create a dependency record, and this prevents users from downgrading to an earlier version than `10.2-3` since the downgrade file `columnar--10.2-3--10.2-2.sql` wanted to drop the unique index and create a btree index instead. However this created an error because columnar am depended on that index. We do not usually like to update earlier migration versions, but there is no other solution that I could think of. ## Notes to reviewer: Consider reviewing the commits one by one. - Commit#1 aims to improve downgrade scripts overall. - Commit#2 documents the failure - Commit#3 fixes the problem by updating all the files that attempted to drop `stripe_first_row_number_idx` index. Related: #6041	2022-10-04 20:39:50 +03:00
Hanefi Onaldi	11a9a3771f	Ensure no dependencies to index before drop	2022-10-04 18:56:20 +03:00
Hanefi Onaldi	5ddd4754a2	Document failing downgrades from 10.2-4 to 10.2-2	2022-10-04 18:56:20 +03:00
Hanefi Onaldi	0efd6f7829	Fix tests for missing downgrades	2022-10-04 18:56:20 +03:00
Jelte Fennema	aea4964b39	Fix flakyness in isolation_shard_rebalancer_progress (#6397 ) On our CI our isolation_shard_rebalancer_progress would sometimes randomly fail like this: ```diff table_name\|shardid\|shard_size\|sourcename\|sourceport\|source_shard_size\|targetname\|targetport\|target_shard_size\|progress\|operation_type ----------+-------+----------+----------+----------+-----------------+----------+----------+-----------------+--------+-------------- -colocated1\|1500001\| 49152\|localhost \| 57637\| 49152\|localhost \| 57638\| 73728\| 1\|move -colocated2\|1500005\| 376832\|localhost \| 57637\| 376832\|localhost \| 57638\| 401408\| 1\|move +colocated1\|1500001\| 49152\|localhost \| 57637\| 49152\|localhost \| 57638\| 81920\| 1\|move +colocated2\|1500005\| 376832\|localhost \| 57637\| 376832\|localhost \| 57638\| 409600\| 1\|move (2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27688/workflows/8c5ca443-5f21-4f21-b74f-0ca7bde69648/jobs/823648/parallel-runs/1 The shard sizes would be slightly larger or smaller than expected. This fixes this by fixing the output to the nearest expected shard size. To do so I used a trick described in this stack overflow answer: https://stackoverflow.com/a/33147437/2570866 When investigating I ran into one more random failure: ```diff -step s1-shard-move-c1-block-writes: <... completed> +step s4-shard-move-sep-block-writes: <... completed> citus_move_shard_placement -------------------------- (1 row) -step s4-shard-move-sep-block-writes: <... completed> +step s1-shard-move-c1-block-writes: <... completed> citus_move_shard_placement -------------------------- ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27707/workflows/c3ff4fc7-5068-4096-ab9f-803c941ddac0/jobs/824622/parallel-runs/29?filterBy=FAILED This random failure happens, because the two parallel moves can complete at the same time. So, it's non-deterministic which one finishes first. 
To make this deterministic I used the "marker" feature from the isolation tester. And finally I ran into a third random failure: ```diff table_name\|shardid\|shard_size\|sourcename\|sourceport\|source_shard_size\|targetname\|targetport\|target_shard_size\|progress\|operation_type ----------+-------+----------+----------+----------+-----------------+----------+----------+-----------------+--------+-------------- -colocated1\|1500001\| 50000\|localhost \| 57637\| 50000\|localhost \| 57638\| 50000\| 1\|move -colocated2\|1500005\| 400000\|localhost \| 57637\| 400000\|localhost \| 57638\| 400000\| 1\|move +colocated1\|1500001\| 50000\|localhost \| 57637\| 50000\|localhost \| 57638\| 8000\| 1\|move +colocated2\|1500005\| 400000\|localhost \| 57637\| 400000\|localhost \| 57638\| 8000\| 1\|move colocated1\|1500002\| 200000\|localhost \| 57637\| 200000\|localhost \| 57638\| 0\| 0\|move colocated2\|1500006\| 8000\|localhost \| 57637\| 8000\|localhost \| 57638\| 0\| 0\|move ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/27707/workflows/c3ff4fc7-5068-4096-ab9f-803c941ddac0/jobs/824622/parallel-runs/30?filterBy=FAILED This happened in two of the tests only. For now I commented these tests out. I have some ideas on how to fix these, but these ideas require more impactful changes than I would like in this PR. One of these tests had a copy paste error too, in passing I fixed that in the commented out line.	2022-10-04 17:05:42 +02:00
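The size normalization described in the commit above can be sketched as follows. This is a minimal illustration in Python rather than the SQL used by the test itself; the helper name `snap_to_expected` is an assumption made for this example, and the list of expected sizes is simply read off the diffs quoted in the commit message.

```python
def snap_to_expected(size, expected_sizes):
    """Return the expected size closest to the observed size.

    Hypothetical helper: it mirrors the idea of replacing a slightly
    varying shard size with the nearest known value so that test
    output stays stable across runs.
    """
    if not expected_sizes:
        raise ValueError("expected_sizes must not be empty")
    # min() with a distance key picks the closest candidate; on a tie
    # the candidate that appears first in expected_sizes wins.
    return min(expected_sizes, key=lambda candidate: abs(candidate - size))


# Expected values as they appear in the normalized test output.
EXPECTED_SIZES = [0, 8000, 50000, 200000, 400000]

# Observed sizes taken from the failing diff.
for observed in (49152, 73728, 81920, 401408, 409600):
    print(observed, snap_to_expected(observed, EXPECTED_SIZES))
```

With this kind of snapping, both 73728 and 81920 map to the same expected value, so a small variation in the reported target size no longer changes the test output.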
Hanefi Onaldi	24f247b5a1	Cleanup multi_utility_warnings test This test used to contain some utility commands that Citus did not support. However we added support for most of the commands, and this test got outdated. We used to error out on community when user attempted to use pooler options. Now that we open sourced all enterprise features, the test can now be removed.	2022-10-04 15:27:42 +03:00
Jelte Fennema	5c64227223	Hopefully reduce flaky tests by disabling the maintenance daemon (#6252 ) Sometimes our CI randomly fails on a test in a way similar to this: ```diff step s2-drop: DROP TABLE cancel_table; - + <waiting ...> +step s2-drop: <... completed> starting permutation: s1-timeout s1-begin s1-sleep10000 s1-rollback s1-reset s1-drop ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26524/workflows/5415b84f-13a3-482f-bef9-648314c79a67/jobs/756377 Another example of a failure like this: ```diff stop_session_level_connection_to_node ------------------------------------- (1 row) step s3-display: SELECT * FROM ref_table ORDER BY id, value; SELECT * FROM dist_table ORDER BY id, value; - + <waiting ...> +step s3-display: <... completed> id\|value --+----- ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26551/workflows/91dca4b2-bb1c-4cae-b2ef-ce3f9c689ce5/jobs/757781 A step that shouldn't be blocked is temporarily detected as "waiting..." and then gets unblocked automatically right after. I'm not certain of the reason for this, but one explanation is that the maintenance daemon is doing something that blocks the query. In the shown case my hunch is that it could be the deferred shard deletion. This PR disables all the features of the maintenance daemon during isolation testing to try and prevent processes from randomly being detected as blocking. NOTE: I'm not certain that this will actually fix this issue. If the issue persists even after this change, at least we know that it's not the maintenance daemon that's blocking it.	2022-10-04 14:33:57 +03:00
Hanefi Onaldi	813542dfa1	Fix flaky isolation_citus_dist_activity test (#6395 ) For the sake of documentation, here is a failing diff: ```diff step s2-view-dist: SELECT query, citus_nodename_for_nodeid(citus_nodeid_for_gpid(global_pid)), citus_nodeport_for_nodeid(citus_nodeid_for_gpid(global_pid)), state, wait_event_type, wait_event, usename, datname FROM citus_dist_stat_activity WHERE query NOT ILIKE ALL(VALUES('%pg_prepared_xacts%'), ('%COMMIT%'), ('%BEGIN%'), ('%pg_catalog.pg_isolation_test_session_is_blocked%'), ('%citus_add_node%')) AND backend_type = 'client backend' ORDER BY query DESC; query \|citus_nodename_for_nodeid\|citus_nodeport_for_nodeid\|state \|wait_event_type\|wait_event\|usename \|datname ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------+-------------------------+-------------------+---------------+----------+--------+---------- ALTER TABLE test_table ADD COLUMN x INT; \|localhost \| 57636\|idle in transaction\|Client \|ClientRead\|postgres\|regression -(1 row) + + SELECT coalesce(to_jsonb(array_agg(csa_from_one_node.)), '[{}]'::JSONB) + FROM ( + SELECT global_pid, worker_query AS is_worker_query, pg_stat_activity. FROM + pg_stat_activity LEFT JOIN get_all_active_transactions() ON process_id = pid + ) AS csa_from_one_node; + \|localhost \| 57638\|active \| \| \|postgres\|regression +(2 rows) ``` This failure can be seen at [this CI run](https://app.circleci.com/pipelines/github/citusdata/citus/27653/workflows/d769701c-8f6e-4f97-a412-16f7b9b288a6/jobs/821416)	2022-10-04 13:09:09 +02:00
Hanefi Onaldi	580ab012bf	Note PG release candidate support in changelog (#6390 ) Co-authored-by: Joe Nelson <jonels@microsoft.com>	2022-09-30 22:25:24 +03:00
Hanefi Onaldi	d3a7a42a29	Bump PG15 CI images to rc1 (#6388 ) Update the test images from PG15beta4 to PG15rc1. There is a new commit in 15rc1 that improves message styles. We also update the messages accordingly. Relevant PG commit: [517484b5820e9e20057ff066b5df7d09cbb5f464](`517484b582`) Depends on: https://github.com/citusdata/the-process/pull/93	2022-09-30 18:01:26 +03:00
Hanefi Onaldi	a38428b665	Bump PG15 CI images to rc1	2022-09-30 17:15:48 +03:00
Hanefi Onaldi	8be8eb9d8c	Update hints on trigger rename of partitions There is a new commit in REL_15_STABLE that improves message styles. Relevant PG commit: 517484b5820e9e20057ff066b5df7d09cbb5f464	2022-09-30 16:37:56 +03:00
Ahmet Gedemenli	d0fa10a98c	Bump Citus to 11.2devel (#6385 )	2022-09-30 14:47:42 +03:00
Onur Tirtir	17cf137c4c	Add changelog entries for 11.1.2 (#6386 )	2022-09-30 12:42:05 +03:00
Hanefi Onaldi	7e0edee4ec	Add tests for CREATE DATABASE with OID option (#6376 ) PG15 now allows users to specify oids when creating databases. This feature is a side effect of a bigger feature in pg_upgrade. Relevant PG Commit: pg_upgrade: Preserve database OIDs. aa01051418f10afbdfa781b8dc109615ca785ff9	2022-09-27 19:54:51 +02:00
Nils Dijk	9cad6a5324	Fix/python protobuf (#6378 ) Depends on https://github.com/citusdata/the-process/pull/92 Closes: #6371 Updates test dependencies to not rely on a known vulnerable dependency Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-09-27 14:46:27 +02:00
Naisila Puka	63e4d23722	Tests moving a shard with RLS owned by nonbypassrls & nonsuperuser (#6369 )	2022-09-27 14:53:23 +03:00
Naisila Puka	1b26d57288	Adds tests for suppressed constants in postgres_fdw queries (#6370 ) PG15 has suppressed some casts on constants when querying foreign tables. For example, we can use text to represent a type that's an enum on the remote side. A comparison on such a column will get shipped as "var = 'foo'::text". But there's no enum = text operator on the remote side. If we leave off the explicit cast, the comparison will work. Test we behave in the same way with a Citus foreign table Reminder: foreign tables cannot be distributed/reference, can only be Citus local Relevant PG commit: `f8abb0f5e1`	2022-09-27 13:40:48 +03:00
Hanefi Onaldi	30ac6f0fe9	Add tests for jsonpath changes on PG15 PostgreSQL 15 had some changes to jsonpath to conform with ECMA-262 referenced by SQL standard. This commit adds tests to make sure Citus also supports the same standards. Relevant pg commit: e26114c817b610424010cfbe91a743f591246ff1	2022-09-26 22:55:54 +03:00
Jelte Fennema	24e06af6d2	Reuse connections for Splits and Logical Replication (#6314 ) In Split, Logical replication logic and ShardCleaner we call `SendCommandListToWorkerOutsideTransaction` and `SendOptionalCommandListToWorkerOutsideTransaction` frequently. This opens new connection for each of those calls, even though we already have a perfectly good connection lying around. This PR adds two new APIs `SendCommandListToWorkerOutsideTransactionWithConnection` and `SendOptionalCommandListToWorkerOutsideTransactionWithConnection` that allow sending a list of queries in a transaction over an existing connection. We also update the callers (Split, ShardCleaner, Logical Replication) to use these new APIs instead. Co-authored-by: Nitish Upreti <niupre@microsoft.com> Co-authored-by: Onder Kalaci <onderkalaci@gmail.com>	2022-09-26 13:37:40 +02:00
Naisila Puka	dc9723fa45	Comment about column list for fk ON DELETE SET in PG15 (#6372 ) As a part of `a868cc049a`	2022-09-26 11:45:05 +03:00
Jelte Fennema	d9a9a3263b	Revert replica identity creation order for shard moves (#6367 ) In Citus 11.1.0 we changed the order of doing the initial data copy and the replica identity creation when doing a non blocking shard move. This was done to try and increase the speed with which shard moves could be done. But after doing more extensive performance testing this change turned out to have a negative impact on the speed of moves on the setups that I tested. Looking at the resource usage metrics of the VMs the reason for this seems to be that these shard moves were bottlenecked by disk bandwidth. While creating replica identities in bulk after the initial copy will reduce CPU usage a bit, it does require an additional sequence scan of the just written data. So when a VM is bottlenecked on disk, it makes sense to spend a little bit more CPU to avoid an additional scan. Since PKs are usually simple indexes that don't require lots of CPU to update, as opposed to e.g. GiST indexes. This reverts the order change to avoid a regression on shard move speed in these cases. For future releases we might consider re-evaluating our index creation order for other indexes too, and create "simple" indexes before the copy.	2022-09-23 14:55:25 +02:00
Onur Tirtir	a868cc049a	Not allow ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences (#6340 ) Given that we drop DEFAULT nextval('sequence') expressions from shard relation columns, allowing `ON DELETE/UPDATE SET DEFAULT` on such columns might cause inserting NULL values as a result of a delete/update operation. For this reason, we disallow ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences. DESCRIPTION: Disallows having ON DELETE/UPDATE SET DEFAULT actions on columns that default to sequences Fixes #6339.	2022-09-23 03:34:02 -07:00
Onur Tirtir	de24a3eda5	Not drop default col exprs from shard when adding local table to metadata (#6323 ) As we did for GENERATED STORED columns in #4613, we should not drop column default expressions that are not based on sequences from shard relation since such expressions need to exist e.g. for foreign key actions. For the column default expressions that are based on sequences we cannot do much, so we need to disallow having ON DELETE SET DEFAULT actions on such columns in a separate PR, see #6339. Fixes #6318. DESCRIPTION: Fixes a bug that might cause inserting incorrect DEFAULT values when applying foreign key actions	2022-09-23 03:05:08 -07:00
Naisila Puka	1ede0b9db3	Add tests to verify we support security invoker views (#6362 ) PG15 added support for security invoker views. Relevant PG commit: `7faa5fc84b` These views check the permissions for the underlying tables of the view invoker user, not the view definer user. When the view has underlying distributed tables, the queries to the shards are sent by opening connections with the current user, which is the view invoker, no matter what the type of the view is. This means that distributed views were always behaving like security invoker views. Check the following issue for more details: https://github.com/citusdata/citus/issues/6161 So, Citus doesn't fully support security definer views. However, Citus does fully support security invoker views. We add tests to make sure we cover different cases.	2022-09-23 10:55:46 +03:00
Ahmet Gedemenli	bae4b47c2f	Fix dropping replication slot (#6359 ) DESCRIPTION: Fixes dropping replication slots As detected by a flaky test, Citus sometimes fails to drop replication slots, possibly due to a race condition, at the end of a shard split. With this PR, we retry to drop them in case of an `OBJECT_IN_USE` error, consistently for 20 seconds. fixes: #6326	2022-09-21 16:29:56 +03:00
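The retry behaviour described in the commit above (keep retrying the drop for up to 20 seconds while an `OBJECT_IN_USE` error is reported) follows a common deadline-based retry pattern. The sketch below illustrates that pattern in Python; it is not the Citus C implementation. `ObjectInUseError`, `drop_with_retry`, and the pause length are hypothetical names and values chosen for this example; only the 20 second budget comes from the commit message.

```python
import time


class ObjectInUseError(Exception):
    """Stand-in for the OBJECT_IN_USE error mentioned in the commit."""


def drop_with_retry(drop_slot, timeout_seconds=20.0, pause_seconds=0.5,
                    clock=time.monotonic, sleep=time.sleep):
    """Call drop_slot() until it succeeds or the time budget is used up.

    Only ObjectInUseError is retried; any other exception propagates
    immediately. Returns the number of attempts that were made.
    The clock and sleep parameters exist so the loop can be tested
    without real waiting.
    """
    deadline = clock() + timeout_seconds
    attempts = 0
    while True:
        attempts += 1
        try:
            drop_slot()
            return attempts
        except ObjectInUseError:
            # Once the deadline has passed, stop retrying and surface
            # the last error to the caller.
            if clock() >= deadline:
                raise
            sleep(pause_seconds)
```

Injecting `clock` and `sleep` is a design choice for testability: a caller in production would rely on the defaults, while a test can pass a fake clock to exercise the timeout path instantly.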
Onder Kalaci	03ac8b4f82	Add tests for PG15 new aggregate commands Both tests include pushdown and pull to coordinator type of aggregate execution. Relevant PG commits: Add min() and max() aggregates for xid8 400fc6b6487ddf16aa82c9d76e5cfbe64d94f660 Add range_agg with multirange inputs 7ae1619bc5b1794938c7387a766b8cae34e38d8a Co-authored-by: Onder Kalaci <onderkalaci@gmail.com>	2022-09-20 17:08:17 +03:00
Önder Kalacı	b4119ebbf4	Readme updates for Citus 11.1 (#6351 )	2022-09-19 19:36:26 +03:00
Nitish Upreti	e9508b2603	Shard Split : Add / Update logging (#6336 ) DESCRIPTION: Improve logging during shard split and resource cleanup ### DESCRIPTION This PR makes logging improvements to Shard Split : 1. Update confusing logging to fix #6312 2. Added new `ereport(LOG` to make debugging easier as part of telemetry review.	2022-09-16 09:39:08 -07:00
Onur Tirtir	8b5cdaf0e9	Add changelog entries for 11.1.1 (#6354 )	2022-09-16 11:08:21 +02:00
Marco Slot	8544346a78	Allow create_distributed_table_concurrently on an empty node (#6353 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-16 10:55:02 +02:00
Onur Tirtir	57e354ac91	Add changelog entries for 11.1.0 (#6349 ) Created by executing `prepare_changelog.pl citus 11.1.0 2022-03-29`.	2022-09-16 11:16:07 +03:00
Onder Kalaci	766f340ce0	Prevent failures on partitioned distributed tables with statistics objects on PG 15 Comment from the code is clear on this: /* * The statistics objects of the distributed table are not relevant * for the distributed planning, so we can override it. * * Normally, we should not need this. However, the combination of * Postgres commit 269b532aef55a579ae02a3e8e8df14101570dfd9 and * Citus function AdjustPartitioningForDistributedPlanning() * forces us to do this. The commit expects statistics objects * of partitions to have "inh" flag set properly. Whereas, the * function overrides "inh" flag. To avoid Postgres to throw error, * we override statlist such that Postgres does not try to process * any statistics objects during the standard_planner() on the * coordinator. In the end, we do not need the standard_planner() * on the coordinator to generate an optimized plan. We call * into standard_planner() for other purposes, such as generating the * relationRestrictionContext here. * * AdjustPartitioningForDistributedPlanning() is a hack that we use * to prevent Postgres' standard_planner() to expand all the partitions * for the distributed planning when a distributed partitioned table * is queried. It is required for both correctness and performance * reasons. Although we can eliminate the use of the function for * the correctness (e.g., make sure that rest of the planner can handle * partitions), it's performance implication is hard to avoid. Certain * planning logic of Citus (such as router or query pushdown) relies * heavily on the relationRestrictionList. If * AdjustPartitioningForDistributedPlanning() is removed, all the * partitions show up in the, causing high planning times for * such queries. */	2022-09-15 14:36:05 +03:00
aykut-bozkurt	739b91afa6	ensure we have more active nodes than replication factor. (#6341 ) DESCRIPTION: Fixes floating exception during create_distributed_table_concurrently. Fixes #6332. During create_distributed_table_concurrently, when there is no active primary node, it fails with a floating exception. We added a check similar to the one in create_distributed_table. It now fails with a proper error message if the number of active nodes is less than the replication factor.	2022-09-14 18:20:50 +03:00
Marco Slot	4ab415c43a	Fix escaping in sequence dependency queries (#6345 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-14 17:43:24 +03:00
Sameer Awasekar	4851b4e8f2	Introduce code changes to fix Issue:6303 (#6328 ) The PR introduces code changes to fix Issue [6303](https://github.com/citusdata/citus/issues/6303) `create_distributed_table_concurrently` following a drop column creates a buggy situation in the split decoder. * Consider the below scenario: * Session1 : Drop column followed by create_distributed_table_concurrently * Session2 : Concurrent insert workload The child shards created by `create_distributed_table_concurrently` will have fewer columns than the source shard because some columns were dropped. The incoming tuple from session2 will have more columns, as the writes happened on the source shard. But now the tuple needs to be applied on the child shard. So we need to format the existing tuple according to the child schema and skip dropped column values. The PR fixes this by reformatting the tuple according to the target child schema. Test: 1) isolation_create_distributed_concurrently_after_drop_column - Repros the issue and tests on the same.	2022-09-14 19:56:32 +05:30
Marco Slot	7a92d873b6	Fix bugs in CheckIfRelationWithSameNameExists (#6343 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-14 15:42:46 +02:00
Nils Dijk	da527951ca	Fix: rebalance stop non super user (#6334 ) No need for description, fixing issue introduced with new feature for 11.1 Fixes #6333 Due to Postgres' C API being 0-indexed and Postgres' attributes being 1-indexed, we were reading the wrong Datum as the Task owner when cancelling. Here we add a test to show the error and fix the off-by-one error.	2022-09-13 23:19:31 +02:00
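The off-by-one described in the commit above comes from mixing a 1-based attribute number with a 0-based array. A minimal Python sketch of the conversion is shown below. The row layout, the `OWNER_ATTNUM` constant, and the `attribute_value` helper are hypothetical and only serve the illustration; they are not taken from the Citus catalog definitions.

```python
def attribute_value(values, attnum):
    """Return the value for a 1-based attribute number from a 0-based list."""
    if attnum < 1 or attnum > len(values):
        raise IndexError(f"attribute number {attnum} out of range")
    # Attribute numbers start at 1, list indexes start at 0.
    return values[attnum - 1]


# Hypothetical task row: (job_id, task_id, owner).
row = [7, 42, "alice"]
OWNER_ATTNUM = 3  # assumed 1-based position of the owner column

# Correct lookup: subtract one before indexing.
print(attribute_value(row, OWNER_ATTNUM))

# The buggy variant indexes with the attribute number directly, which
# reads the neighbouring slot (or runs past the end of the array).
```

Keeping the subtraction inside a single helper means call sites never index the array with a raw attribute number.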
Hanefi Onaldi	f34467dcb3	Remove missing declaration warning (#6330 ) When I built Citus on PG15beta4 locally, I get a warning message. ``` utils/background_jobs.c:902:5: warning: declaration does not declare anything [-Wmissing-declarations] __attribute__((fallthrough)); ^ 1 warning generated. ``` This is a hint to the compiler that we are deliberately falling through in a switch-case block.	2022-09-13 13:48:51 +03:00
Jelte Fennema	f13b140621	Show citus_copy_shard_placement progress in get_rebalance_progress (#6322 ) DESCRIPTION: Show citus_copy_shard_placement progress in get_rebalance_progress When rebalancing to a new node that does not have reference tables yet the rebalancer will first copy the reference tables to the nodes. Depending on the size of the reference tables, this might take a long time. However, there's no indication of what's happening at this stage of the rebalance. This PR improves this situation by also showing the progress of any citus_copy_shard_placement calls when calling get_rebalance_progress.	2022-09-13 08:59:52 +00:00
Naisila Puka	76ff4ab188	Adds support for unlogged distributed sequences (#6292 ) We can now do the following: - Distribute sequence with logged/unlogged option - ALTER TABLE my_sequence SET LOGGED/UNLOGGED - ALTER SEQUENCE my_sequence SET LOGGED/UNLOGGED Relevant PG commit `344d62fb9a`	2022-09-13 10:53:39 +03:00
Hanefi Onaldi	5cfcc63308	Add warning messages for cluster commands on partitioned tables (#6306 ) PG15 introduces `CLUSTER` commands for partitioned tables. Similar to a `CLUSTER` command with no supplied table names, these commands also can not be run inside transaction blocks and therefore can not be propagated in a distributed transaction block with ease. Therefore we raise warnings. Relevant PG commit: cfdd03f45e6afc632fbe70519250ec19167d6765	2022-09-13 00:05:58 +03:00
Hanefi Onaldi	164f2fa0a6	PG15: Add support for NULLS NOT DISTINCT (#6308 ) Relevant PG commit: 94aa7cc5f707712f592885995a28e018c7c80488	2022-09-12 23:47:37 +03:00
Marco Slot	b79111527e	Avoid blocking writes in create_distributed_table_concurrently (#6324 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 12:09:37 -07:00
Nils Dijk	cda3686d86	Feature: run rebalancer in the background (#6215 ) DESCRIPTION: Add a rebalancer that uses background tasks for its execution Based on the background jobs and tasks introduced in #6296 we implement a new rebalancer on top of the primitives of background execution. This allows the user to initiate a rebalance and let Citus execute the long running steps in the background until completion. Users can invoke the new background rebalancer with `SELECT citus_rebalance_start();`. It will output information on its job id and how to track progress. Also it returns its job id for automation purposes. If you simply want to wait till the rebalance is done you can use `SELECT citus_rebalance_wait();` A running rebalance can be cancelled/stopped with `SELECT citus_rebalance_stop();`.	2022-09-12 20:46:53 +03:00
Marco Slot	48f7d6c279	Show local managed tables in citus_tables and hide tables owned by extensions (#6321 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 17:49:17 +03:00
Marco Slot	b036e44aa4	Fix bug preventing isolate_tenant_to_new_shard with text column (#6320 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-12 16:29:57 +02:00
naisila	b84251ac08	Bump test images to 15beta4	2022-09-12 15:20:17 +03:00
naisila	47bea76c6c	Revert "Support JSON_TABLE on PG 15 (#6241 )" This reverts commit `1f4fe35512`.	2022-09-12 15:20:17 +03:00
naisila	53ffbe440a	Revert SQL/JSON features in ruleutils_15.c Reverting the following commits: `977ddaae56` `4a5cf06def` `9ae19c181f` `30447117e5` `f9c43f4332` `21dba4ed08` `262932da3e` We have to manually make changes to this file. Follow the relevant PG commit in ruleutils.c & make the exact same changes in ruleutils_15.c Relevant PG commit: 96ef3237bf741c12390003e90a4d7115c0c854b7	2022-09-12 15:20:17 +03:00
Önder Kalacı	c6f108e626	Add tests for allowing SET NULL/DEFAULT for subset of columns (#6319 ) PG 15 added support for that (d6f96ed94e73052f99a2e545ed17a8b2fdc1fb8a). We also add support, but we already do not support ON DELETE SET NULL/DEFAULT for distribution column. So, in essence, we add support for reference tables and Citus local tables. Semi-related: We should really consider fixing: https://github.com/citusdata/citus/issues/6318	2022-09-12 14:10:17 +03:00
Onder Kalaci	36f8c48560	Add tests for allowing SET NULL/DEFAULT for subset of columns PG 15 added support for that (d6f96ed94e73052f99a2e545ed17a8b2fdc1fb8a). We also add support, but we already do not support ON DELETE SET NULL/DEFAULT for distribution column. So, in essence, we add support for reference tables and Citus local tables.	2022-09-12 13:56:09 +03:00
Marco Slot	2e943a64a0	Make shard moves more idempotent (#6313 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-09 18:21:36 +02:00
Jelte Fennema	a2d86214b2	Share more replication code between moves and splits (#6310 ) The logical replication catchup part for shard splits and shard moves is very similar. This abstracts most of that similarity away into a single function. This also improves the logic for non blocking shard splits a bit, by using faster foreign key creation. It also parallelizes index creation which shard moves were already doing, but shard splits did not.	2022-09-09 16:45:38 +02:00
Marco Slot	ba2fe3e3c4	Remove do_repair option from citus_copy_shard_placement (#6299 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-09 15:44:30 +02:00
Nils Dijk	00a94c7f13	Implement infrastructure to run sql jobs in the background (#6296 ) DESCRIPTION: Add infrastructure to run long running management operations in background This infrastructure introduces the primitives of jobs and tasks. A task consists of a sql statement and an owner. Tasks belong to a Job and can depend on other tasks from the same job. When there are either runnable or running tasks we would like to make sure a background task queue monitor process is running. A Task could be in running state while there is actually no monitor present due to a database restart or failover. Once the monitor starts it will reset any running task to its runnable state. To make sure only one background task queue monitor is ever running at once it will acquire an advisory lock that self conflicts. Once a task is done it will find all tasks depending on this task. After checking that the task doesn't have unmet dependencies it will transition the task from blocked to runnable state for the task to be picked up on a subsequent task start. Currently only one task can be running at a time. This can be improved upon in later releases without changes to the higher level API. The initial goal for these background tasks is to allow a rebalance to run in the background. This will be implemented in a subsequent PR.	2022-09-09 16:11:19 +03:00
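The task life cycle described in the commit above (tasks that depend on other tasks, a transition from blocked to runnable once dependencies are done, and a reset of running tasks when the monitor starts) can be sketched as a small state machine. The Python below is only an illustration under stated assumptions: the `Task` class, the helper names, and the in-memory dictionary are invented for this example, and the real implementation stores its state in catalog tables and runs inside a background worker.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    task_id: int
    sql: str
    owner: str
    depends_on: set = field(default_factory=set)
    status: str = "blocked"  # blocked -> runnable -> running -> done


def unblock_ready_tasks(tasks):
    """Move blocked tasks with no unmet dependencies to runnable."""
    done = {t.task_id for t in tasks.values() if t.status == "done"}
    for task in tasks.values():
        # A set is a subset of `done` when every dependency finished.
        if task.status == "blocked" and task.depends_on <= done:
            task.status = "runnable"


def finish_task(tasks, task_id):
    """Mark a task as done, then unblock the tasks that waited on it."""
    tasks[task_id].status = "done"
    unblock_ready_tasks(tasks)


def reset_running_tasks(tasks):
    """On monitor start: a task left in 'running' becomes runnable again."""
    for task in tasks.values():
        if task.status == "running":
            task.status = "runnable"


tasks = {
    1: Task(1, "SELECT 1", "alice"),
    2: Task(2, "SELECT 2", "alice", depends_on={1}),
    3: Task(3, "SELECT 3", "alice", depends_on={1, 2}),
}
unblock_ready_tasks(tasks)   # only task 1 has no dependencies
finish_task(tasks, 1)        # task 2 becomes runnable, task 3 stays blocked
print({task_id: t.status for task_id, t in tasks.items()})
```

For simplicity this sketch rescans every task after each completion, whereas the commit describes looking up only the tasks that depend on the finished one.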
Jelte Fennema	76137e967f	Create all foreign keys quickly at the end of a shard move (#6148 ) Previously we would create foreign keys to reference table in an extra fast way at the end of a shard move. This uses that same logic to also do it for foreign keys between distributed tables. Fixes #6141	2022-09-09 09:58:33 +02:00
Nils Dijk	cc0eeea4c5	remove redundant call to TerminateBackgroundWorker (#6307 ) Remove redundant call to TerminateBackgroundWorker Discussion: https://github.com/citusdata/citus/pull/6296#discussion_r965926695	2022-09-09 07:37:02 +02:00
Ahmet Gedemenli	eadc88a800	Introduce GUC citus.skip_constraint_validation (#6281 ) Introduces a new GUC named citus.skip_constraint_validation, which skips constraint validation when set to on. In several places where we previously used hacks to skip the foreign key validation phase, we now use this GUC.	2022-09-08 18:13:18 +03:00
Hanefi Onaldi	79ba490b1f	Merge pull request #6256 from citusdata/pg15-tests	2022-09-07 13:27:27 +03:00
Hanefi Onaldi	a557a196aa	Add tests for numeric with scale greater than precision	2022-09-07 13:12:04 +03:00
Hanefi Onaldi	4db113496f	Add tests for new COPY features in PG15	2022-09-07 13:12:04 +03:00
Hanefi Onaldi	3e4e42253f	Add tests for new regexp sql functions	2022-09-07 13:12:04 +03:00
Jelte Fennema	e29db74a19	Don't override postgres C symbols with our own (#6300 ) When introducing our overrides of pg_cancel_backend and pg_terminate_backend we accidentally did that in such a way that we cannot call the original pg_cancel_backend and pg_terminate_backend from C anymore. This happened because we defined the exact same symbols in our shared library as postgres does in its own binary. This fixes that by using different names for the C functions than for the SQL functions. Making this work in all upgrade and downgrade scenarios is not trivial though, because we actually need to remove the C function definition. Postgres errors at two different times when the symbol that a C function wants to call is not defined in the library it expects it in: 1. When creating the SQL function definition 2. When calling the SQL function Item 1 causes an issue when creating our extension for the first time. We then go execute all the migrations that we have. So if the 11.0 migration contains a SQL function definition that still references the pg_cancel_backend symbol, that migration will fail. This issue is solved by actually changing the SQL definition in the old migration. This is not enough to fix all issues though. Item 2 causes an issue after an upgrade to 11.1, because it won't have the new definition of the SQL function. This is solved by recreating the SQL functions in the migration to 11.1. That way it gets the new definition. Then finally there's the case of downgrades. To continue to make our pg_cancel_backend SQL function work after downgrading, we will need to make a patch release for 11.0 that includes the new citus_cancel_backend symbol. This is done in a separate commit.	2022-09-07 11:27:05 +02:00
Nitish Upreti	d7404a9446	'Deferred Drop' and robust 'Shard Cleanup' for Splits. (#6258 ) DESCRIPTION: This PR adds support for 'Deferred Drop' and robust 'Shard Cleanup' for Splits. Common Infrastructure This PR introduces new common infrastructure so that any operation that wants robust cleanup of resources can register with the cleaner and have the resources cleaned appropriately based on a specified policy. 'Shard Split' is the first consumer using this new infrastructure. Note : We only support adding 'shards' as resources to be cleaned-up right now but the framework will be extended to support other resources in future. Deferred Drop for Split Deferred Drop Support ensures that shards undergoing split are not dropped inline as part of operation but dropped later when no active read queries are running on shard. This helps with : Avoids any potential deadlock scenarios that can cause long running Split operation to rollback. Avoids Split operation blocking writes and then getting blocked (due to running queries on the shard) when trying to drop shards. Deferred drop is the new default behavior going forward. Shard Cleaner Extension Shard Cleaner is a background task responsible for deferred drops in case of 'Move' operations. The cleaner has been extended to ensure robust cleanup of shards (dummy shards and split children) in case of a failure based on the new infrastructure mentioned above. The cleaner also handles deferred drop for 'Splits'. TESTING: New test 'citus_split_shard_by_split_points_deferred_drop' to test deferred drop support. New test 'failure_split_cleanup' to test shard cleanup with failures in different stages. Update 'isolation_blocking_shard_split and isolation_non_blocking_shard_split' for deferred drop. Added non-deferred drop version of existing tests : 'citus_split_shard_no_deferred_drop' and 'citus_non_blocking_splits_no_deferred_drop'	2022-09-06 12:11:20 -07:00
Gokhan Gulbiz	ac96370ddf	Use IsMultiStatementTransaction for SELECT .. FOR UPDATE queries (#6288 ) * Use IsMultiStatementTransaction instead of IsTransaction for row-locking operations. * Add regression test for SELECT..FOR UPDATE statement	2022-09-06 16:38:41 +02:00
Emel Şimşek	6f06ff78cc	Throw an error if there is a RangeTblEntry that is not assigned an RTE identity. (#6295 ) * Fix issue : 6109 Segfault or (assertion failure) is possible when using a SQL function * DESCRIPTION: Ensures disallowing the usage of SQL functions referencing to a distributed table and prevents a segfault. Using a SQL function may result in segmentation fault in some cases. This change fixes the issue by throwing an error message when a SQL function cannot be handled. Fixes #6109. Co-authored-by: Emel Simsek <emel.simsek@microsoft.com>	2022-09-06 15:46:41 +02:00
aykut-bozkurt	69726648ab	verify shards if exists for insert, delete, update (#6280 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-06 15:29:14 +02:00
Hanefi Onaldi	b15cb146a3	Merge pull request #6297 from citusdata/disallow-numeric-negative-scale Relevant PG commit: 085f931f52494e1f304e35571924efa6fcdc2b44	2022-09-06 12:59:59 +03:00
Hanefi Onaldi	85b19c851a	Disallow distributing by numeric with negative scale PG15 allows numeric scale to be negative or greater than precision. This causes issues and we may end up routing queries to a wrong shard due to differing hash results after rounding. Formerly, when specifying NUMERIC(precision, scale), the scale had to be in the range [0, precision], which was per SQL spec. PG15 extends the range of allowed scales to [-1000, 1000]. A negative scale implies rounding before the decimal point. For example, a column might be declared with a scale of -3 to round values to the nearest thousand. Note that the display scale remains non-negative, so in this case the display scale will be zero, and all digits before the decimal point will be displayed. Relevant PG commit: 085f931f52494e1f304e35571924efa6fcdc2b44	2022-09-06 12:40:56 +03:00
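The effect of a negative scale described in the commit above can be illustrated with Python's `decimal` module. This is a sketch, not PostgreSQL code: the helper `round_to_scale` is a hypothetical name, and the choice of `ROUND_HALF_UP` (ties rounded away from zero) is an assumption about how the rounding behaves, made only for this example.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_to_scale(value, scale):
    """Round value to `scale` digits after the decimal point.

    A negative scale rounds to the left of the decimal point, so a
    scale of -3 rounds to the nearest thousand.
    """
    # Decimal(1).scaleb(3) is 1E+3, Decimal(1).scaleb(-2) is 0.01.
    quantum = Decimal(1).scaleb(-scale)
    return value.quantize(quantum, rounding=ROUND_HALF_UP)


a = Decimal("12345.678")
b = Decimal("11900")

# Both inputs become the same stored value with a scale of -3 ...
print(round_to_scale(a, -3) == round_to_scale(b, -3))

# ... although the unrounded inputs differ, so anything computed from
# the unrounded value (such as a hash) could differ as well.
print(a == b)
```

This shows why rounding matters for routing: if a shard were chosen from the value before rounding while the stored value is the rounded one, two lookups for the same stored value could be directed to different shards.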
Naisila Puka	d7f41cacbe	Prohibit renaming child trigger on distributed partition pre PG15 (#6290 ) Pre PG15, renaming child triggers on partitions is allowed. When creating a trigger on a distributed partitioned parent table, the triggers on the shards of the partitions have the same name as the triggers on the corresponding shards of the parent table. Therefore, they don't carry the shard id of the partition's shard as their suffix. Hence, when trying to rename a child trigger on a partition of a distributed table, we can't correctly find the triggers on the shards of the partition in order to rename them, since we append a different shard id to the name of the trigger. Since we can't find the trigger, we get a misleading error about a nonexistent trigger. In this commit we prohibit renaming child triggers on distributed partitions altogether.	2022-09-06 12:19:25 +03:00
Naisila Puka	fd9b3f4ae9	Add tests to make sure distributed clone trigger rename fails in PG15 (#6291 ) Relevant PG commit: 80ba4bb383538a2ee846fece6a7b8da9518b6866	2022-09-06 11:04:14 +03:00
Marco Slot	e6b1845931	Change split logic to avoid EnsureReferenceTablesExistOnAllNodesExtended (#6208 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-05 22:02:18 +02:00
Önder Kalacı	bd13836648	Add citus.skip_advisory_lock_permission_checks (#6293 )	2022-09-05 17:47:41 +02:00
Jelte Fennema	1c5b8588fe	Address race condition in InitializeBackendData (#6285 ) Sometimes in CI our isolation_citus_dist_activity test fails randomly like this: ```diff step s2-view-dist: SELECT query, citus_nodename_for_nodeid(citus_nodeid_for_gpid(global_pid)), citus_nodeport_for_nodeid(citus_nodeid_for_gpid(global_pid)), state, wait_event_type, wait_event, usename, datname FROM citus_dist_stat_activity WHERE query NOT ILIKE ALL(VALUES('%pg_prepared_xacts%'), ('%COMMIT%'), ('%BEGIN%'), ('%pg_catalog.pg_isolation_test_session_is_blocked%'), ('%citus_add_node%')) AND backend_type = 'client backend' ORDER BY query DESC; query \|citus_nodename_for_nodeid\|citus_nodeport_for_nodeid\|state \|wait_event_type\|wait_event\|usename \|datname ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------+-------------------------+-------------------+---------------+----------+--------+---------- INSERT INTO test_table VALUES (100, 100); \|localhost \| 57636\|idle in transaction\|Client \|ClientRead\|postgres\|regression -(1 row) + + SELECT coalesce(to_jsonb(array_agg(csa_from_one_node.)), '[{}]'::JSONB) + FROM ( + SELECT global_pid, worker_query AS is_worker_query, pg_stat_activity. FROM + pg_stat_activity LEFT JOIN get_all_active_transactions() ON process_id = pid + ) AS csa_from_one_node; + \|localhost \| 57636\|active \| \| \|postgres\|regression +(2 rows) step s3-view-worker: ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26692/workflows/3406e4b4-b686-4667-bec6-8253ee0809b1/jobs/765119 I intended to fix this with #6263, but the fix turned out to be insufficient. 
This PR tries to address the issue by setting distributedCommandOriginator correctly in more situations. However, even with this change it's still possible to reproduce the flaky test in CI. In any case this should fix at least some instances of this issue. In passing this changes the isolation_citus_dist_activity test to allow running it multiple times in a row.	2022-09-02 14:23:47 +02:00
Ahmet Gedemenli	7c8cc7fc61	Fix flakiness for view tests (#6284 )	2022-09-02 10:12:07 +03:00
Marco Slot	432f399a5d	Allow citus_internal application_name with additional suffix (#6282 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-09-01 14:26:43 +02:00
Naisila Puka	9e2b96caa5	Add pg14->pg15 upgrade test for dist. triggers on part. tables (#6265 ) Pre PG15, renaming the parent triggers on partitioned tables doesn't recurse to renaming the child triggers on the partitions. In PG15, renaming triggers on partitioned tables recurses to renaming the triggers on the partitions as well. Add an upgrade test to make sure we are not breaking anything with distributed triggers on distributed partitioned tables. Relevant PG commit: 80ba4bb383538a2ee846fece6a7b8da9518b6866	2022-09-01 12:32:44 +03:00
Naisila Puka	317dda6af1	Use RelationGetPrimaryKeyIndex for citus catalog tables (#6262 ) pg_dist_node and pg_dist_colocation have a primary key index, not a replica identity index. Citus catalog tables are created in public schema, which has replica identity index by default as primary key index. Later the citus catalog tables are moved to pg_catalog schema. During pg_upgrade, all tables are recreated, and given that pg_dist_colocation is found in pg_catalog schema, it is recreated in that schema, and when it is recreated it doesn't have a replica identity index, because catalog tables have no replica identity. Further action: Do we even need to acquire this lock on the primary key index? Postgres doesn't acquire such locks on indexes before deleting catalog tuples. Also, catalog tuples don't have replica identities by definition.	2022-09-01 11:56:31 +03:00
Jelte Fennema	c14bf3a660	Add a job to CI to check tests for flakyness (#6276 ) We have lots of flaky tests in CI and most of these random failures are very hard/impossible to reproduce locally. This adds a job definition to CI that allows adding a temporary job to rerun the same test in CI a lot of times. This will very often reproduce the random failures. If you then try to change the test or code to fix the random failure, you can confirm that it's indeed fixed by using this job. A future improvement to this job would be to run it (or a variant of it) automatically for every newly added test, and maybe even changed tests. This is not implemented in this PR. An example of this job running can be found here: https://app.circleci.com/pipelines/github/citusdata/citus/26682/workflows/a2638385-35bc-443c-badc-7713a8101313	2022-08-31 14:09:39 +02:00
Jelte Fennema	8bb082e77d	Fix reporting of progress on waiting and moved shards (#6274 ) In commit `31faa88a4e` I removed some features of the rebalance progress monitor. I did this because the plan was to remove the foreground shard rebalancer later in the PR that would add the background shard rebalancer. So, I didn't want to spend time fixing something that we would throw away anyway. As it turns out we're not removing the foreground shard rebalancer after all, so it made sense to fix the stuff that I broke. This PR does that. For the most part this commit reverts the changes in commit `31faa88a4e`. It's not a full revert though, because it keeps the improved tests and the changes to `citus_move_shard_placement`.	2022-08-31 14:55:47 +03:00
Naisila Puka	98dcbeb304	Specifies that our CustomScan providers support projections (#6244 ) Before, this was the default mode for CustomScan providers. Now, the default is to assume that they can't project. This causes performance penalties due to adding unnecessary Result nodes. Hence we use the newly added flag, CUSTOMPATH_SUPPORT_PROJECTION to get it back to how it was. In PG15 support branch we created explain functions to ignore the new Result nodes, so we undo that in this commit. Relevant PG commit: 955b3e0f9269639fb916cee3dea37aee50b82df0	2022-08-31 10:48:01 +03:00
Jelte Fennema	24e695ca27	Fix flakyness in multi_utilities (#6272 ) Sometimes in CI our multi_utilities test fails like this: ```diff VACUUM (INDEX_CLEANUP ON, PARALLEL 1) local_vacuum_table; SELECT CASE WHEN s BETWEEN 20000000 AND 25000000 THEN 22500000 ELSE s END size FROM pg_total_relation_size('local_vacuum_table') s ; size ---------- - 22500000 + 39518208 (1 row) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26641/workflows/5caea99c-9f58-4baa-839a-805aea714628/jobs/762870 Apparently VACUUM is not as reliable in cleaning up as we thought. This increases the range of allowed values. Important to note is that the range is still completely outside of the allowed range of the initial size. So we know for sure that some data was cleaned up.	2022-08-30 14:32:34 -07:00
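The `CASE WHEN s BETWEEN ... THEN ... ELSE s END` trick in the diff above collapses any size inside the accepted range to one fixed value, while letting out-of-range sizes show through unchanged in the test output. A minimal sketch of the same idea, with a hypothetical helper name:

```python
def normalize_size(size: int, low: int, high: int) -> int:
    """Map any size within [low, high] to the range midpoint.

    Sizes outside the range are returned unchanged, so a failing test
    prints the actual value. Hypothetical helper, for illustration.
    """
    midpoint = (low + high) // 2
    return midpoint if low <= size <= high else size


# Bounds taken from the diff in the commit message above.
in_range = normalize_size(21_000_000, 20_000_000, 25_000_000)
out_of_range = normalize_size(39_518_208, 20_000_000, 25_000_000)
```

This keeps the expected output stable across small variations while still exposing the real number when the check fails.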
Jelte Fennema	f22a47981a	Fix flakyness in adaptive_executor (#6275 ) Sometimes in CI our adaptive_executor test would fail randomly with the following error: ```diff SELECT sum(result::bigint) FROM run_command_on_workers($$ SELECT count(*) FROM pg_stat_activity WHERE pid <> pg_backend_pid() AND query LIKE '%8010090%' $$); sum ----- - 4 + 2 (1 row) END; ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26665/workflows/40665680-0044-4852-8fe4-5fd628f9fb47/jobs/764371 This means that the low slow start interval did not have any effect on the number of connections being opened. I could see two possibilities for this to happen: 1. CI was slow and actually doing the start of the second connection. I tried to solve this by doubling the time a query to the worker takes. 2. The second option is that the shards were queried in the opposite order from what we expect. This would mean that the first query to the worker completes quickly: there's no sleep, because the shard doesn't contain any rows. I tried to solve this option by adding a row to each shard. After trying to reproduce the random failure in CI it turned out that I needed both of these fixes to resolve the random failure.	2022-08-30 23:23:30 +02:00
Jelte Fennema	8354853dec	Fix flakyness in citus_split_shard_columnar_partitioned (#6273 ) On CI our citus_split_shard_columnar_partitioned test would sometimes randomly fail like this: ```diff 8970008 \| colocated_dist_table \| -2147483648 \| 2147483647 \| localhost \| 57637 8970009 \| colocated_partitioned_table \| -2147483648 \| 2147483647 \| localhost \| 57637 8970010 \| colocated_partitioned_table_2020_01_01 \| -2147483648 \| 2147483647 \| localhost \| 57637 - 8970011 \| reference_table \| \| \| localhost \| 57637 8970011 \| reference_table \| \| \| localhost \| 57638 + 8970011 \| reference_table \| \| \| localhost \| 57637 (13 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26651/workflows/f695b4fb-ad81-46ff-b97e-0100e5d167ea/jobs/763517 This is a harmless diff due to a missing column in the order by list. This fixes that by adding the nodeport as a tiebreaker.	2022-08-30 19:54:50 +03:00
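The effect of the missing tiebreaker can be reproduced outside SQL. In the sketch below the two rows are taken from the diff above; sorting on the shard id alone preserves whatever order the rows arrived in, while adding the node port as a second key makes the result independent of arrival order. The key function names are illustrative only.

```python
# (shardid, table name, nodeport) for the two reference table placements
arrival_order_a = [
    (8970011, "reference_table", 57637),
    (8970011, "reference_table", 57638),
]
arrival_order_b = list(reversed(arrival_order_a))


def by_shard(row):
    """Incomplete sort key: ties between placements are left unresolved."""
    return row[0]


def by_shard_and_port(row):
    """Complete sort key: nodeport breaks ties between placements."""
    return (row[0], row[2])


unstable_a = sorted(arrival_order_a, key=by_shard)
unstable_b = sorted(arrival_order_b, key=by_shard)
stable_a = sorted(arrival_order_a, key=by_shard_and_port)
stable_b = sorted(arrival_order_b, key=by_shard_and_port)
```

With the partial key the two outputs differ, which matches the harmless reordering seen in CI; with the full key they are identical.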
Marco Slot	6bb31c5d75	Add non-blocking variant of create_distributed_table (#6087 ) Added create_distributed_table_concurrently, which is a nonblocking variant of create_distributed_table. It is based on the split API, which takes advantage of logical replication to support nonblocking split operations. Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: aykutbozkurt <aykut.bozkurt1995@gmail.com>	2022-08-30 15:35:40 +03:00
Jelte Fennema	d68654680b	Fix flakyness in isolation_citus_dist_activity (#6263 ) Sometimes in CI our isolation_citus_dist_activity test fails randomly like this: ```diff step s2-view-dist: SELECT query, citus_nodename_for_nodeid(citus_nodeid_for_gpid(global_pid)), citus_nodeport_for_nodeid(citus_nodeid_for_gpid(global_pid)), state, wait_event_type, wait_event, usename, datname FROM citus_dist_stat_activity WHERE query NOT ILIKE ALL(VALUES('%pg_prepared_xacts%'), ('%COMMIT%'), ('%BEGIN%'), ('%pg_catalog.pg_isolation_test_session_is_blocked%'), ('%citus_add_node%')) AND backend_type = 'client backend' ORDER BY query DESC; query \|citus_nodename_for_nodeid\|citus_nodeport_for_nodeid\|state \|wait_event_type\|wait_event\|usename \|datname ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------+-------------------------+-------------------+---------------+----------+--------+---------- INSERT INTO test_table VALUES (100, 100); \|localhost \| 57636\|idle in transaction\|Client \|ClientRead\|postgres\|regression -(1 row) + + SELECT coalesce(to_jsonb(array_agg(csa_from_one_node.)), '[{}]'::JSONB) + FROM ( + SELECT global_pid, worker_query AS is_worker_query, pg_stat_activity. FROM + pg_stat_activity LEFT JOIN get_all_active_transactions() ON process_id = pid + ) AS csa_from_one_node; + \|localhost \| 57636\|active \| \| \|postgres\|regression +(2 rows) step s3-view-worker: ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26605/workflows/56d284d2-5bb3-4e64-a0ea-7b9b1626e7cd/jobs/760633 The reason for this is that citus_dist_stat_activity sometimes shows the query that it uses itself to get the data from pg_stat_activity. 
This is actually a bug, because it's a worker query and thus shouldn't show up there. To try and solve this bug, we remove two small opportunities for a race condition. These race conditions could happen when the backenddata was marked as active, but the distributedCommandOriginator was not set correctly yet/anymore. There was an opportunity for this to happen both during connection start and shutdown.	2022-08-30 12:57:37 +03:00
Önder Kalacı	33af407ac8	Add missing orderbys (#6271 )	2022-08-30 12:49:15 +03:00
Jelte Fennema	895a484b39	Hopefully fix flakiness in drop_partitioned_table (#6270 ) Sometimes in CI our drop_partitioned_table test would fail with the following error: ```diff NOTICE: issuing SELECT worker_drop_distributed_table('drop_partitioned_table.child1') NOTICE: issuing SELECT worker_drop_distributed_table('drop_partitioned_table.child1') NOTICE: issuing DROP TABLE IF EXISTS drop_partitioned_table.child1_727001 CASCADE -NOTICE: issuing SELECT pg_catalog.citus_internal_delete_colocation_metadata(100047) -NOTICE: issuing SELECT pg_catalog.citus_internal_delete_colocation_metadata(100047) +NOTICE: issuing SELECT pg_catalog.citus_internal_delete_colocation_metadata(100046) +NOTICE: issuing SELECT pg_catalog.citus_internal_delete_colocation_metadata(100046) ROLLBACK; NOTICE: issuing ROLLBACK NOTICE: issuing ROLLBACK ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26631/workflows/31536032-e1ba-493b-b12a-f40757f3a7d6/jobs/762170 For some reason the colocationid of the distributed partitioned table would be one less than we expected. Why this happens I'm not sure, but it seems fairly harmless that it does. In an attempt to work around this flakiness I now reset the colocation id sequence right before creating the table in question. This is good practice in general, because it allows us to run the test successfully using `check-minimal` and it also allows us to rerun it multiple times.	2022-08-30 12:21:16 +03:00
Jelte Fennema	5c95604154	Always copy normalized files after a regress run (#6254 ) Our python based tests didn't always copy the normalized files after the regress run. I had the problem where running the following command would result in non-normalized files in the expected directory after running our PG upgrade tests locally: ``` cp src/test/regress/{results,expected}/upgrade_list_citus_objects.out ``` This PR fixes that by always running `copy_modified` even if the tests fail. The same was already being done for our perl based tests at the end of the `pg_regress_multi.pl` file.	2022-08-30 07:15:29 +00:00
Naisila Puka	13fe89f018	Fixes flakyness in columnar_permissions test (#6266 ) `columnar_permissions.sql` test is flaky due to missing `ORDER BY` clauses. Added the other `ORDER BY` clauses for consistency in the test. ```diff where relation in ('no_access'::regclass, 'columnar_permissions'::regclass); relation \| chunk_group_row_limit \| stripe_row_limit \| compression \| compression_level ----------------------+-----------------------+------------------+-------------+------------------- - no_access \| 10000 \| 150000 \| zstd \| 3 columnar_permissions \| 10000 \| 2222 \| none \| 3 + no_access \| 10000 \| 150000 \| zstd \| 3 (2 rows) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26610/workflows/79f03ef9-7674-4567-a087-02536c9ddf04/jobs/760942	2022-08-29 14:33:26 +02:00
Önder Kalacı	1df943e0d5	Use Posix locale in the tests (#6261 ) Commit `9653a0065e` has changed it to C.UTF-8 , which fails on MacOS	2022-08-29 12:52:03 +02:00
Ahmet Gedemenli	0855a9d1d4	Use SUM for calculating non partitioned table sizes (#6222 ) We currently do a `pg_relation_total_size('t1') + pg_relation_total_size('t2') + ..` on shard lists, especially when rebalancing the shards. This in some cases goes huge. With this PR, we basically use a SUM for all table sizes, instead of using thousands of pluses.	2022-08-26 18:02:14 +03:00
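To make the difference in query shape concrete, here is a rough sketch that builds both forms as strings. The exact SQL that Citus generates is not shown in this log, so the `VALUES`-based form, the helper names, and the default `size_fn` are all assumptions; the quoting is deliberately naive and the table list is assumed to be non-empty.

```python
from typing import Sequence


def size_query_with_plus(tables: Sequence[str],
                         size_fn: str = "pg_total_relation_size") -> str:
    """Old shape: one size call per table, chained with '+'."""
    terms = " + ".join(f"{size_fn}('{t}')" for t in tables)
    return f"SELECT {terms};"


def size_query_with_sum(tables: Sequence[str],
                        size_fn: str = "pg_total_relation_size") -> str:
    """New shape (assumed): a single SUM over a list of table names."""
    values = ", ".join(f"('{t}')" for t in tables)
    return (f"SELECT SUM({size_fn}(t.name::regclass)) "
            f"FROM (VALUES {values}) AS t(name);")


shards = [f"t_{i}" for i in range(1, 4)]
plus_query = size_query_with_plus(shards)
sum_query = size_query_with_sum(shards)
```

The `+` form grows into an expression with one operator per shard, whereas the `SUM` form keeps a single aggregate call no matter how many shards are listed.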
Sameer Awasekar	4df8eca77f	Add worker_split_shard_release_dsm udf to release dynamic shared memory (#6248 ) The code introduces worker_split_shard_release_dsm udf to release the dynamic shared memory segment allocated during non-blocking split workflow.	2022-08-26 18:27:32 +05:30
Jelte Fennema	77dd49fcf8	Fix flakyness in failure_online_move_shard_placement (#6250 ) Sometimes in CI failure_online_move_shard_placement fails with the following error: ```diff SELECT citus.mitmproxy('conn.onQuery(query="^ALTER SUBSCRIPTION .* ENABLE").cancel(' \|\| :pid \|\| ')'); mitmproxy ----------- (1 row) SELECT master_move_shard_placement(101, 'localhost', :worker_1_port, 'localhost', :worker_2_proxy_port); -ERROR: canceling statement due to user request +ERROR: tuple concurrently updated +CONTEXT: while executing command on localhost:9060 -- failure on polling subscription state ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26441/workflows/dd6e3475-6121-47b3-aea3-4ac92be114f4/jobs/751476/steps This error is not completely harmless, because based on the logs it means that our cleanup logic failed, which in turn means that replication slots are left around: ``` 2022-08-24 16:01:29.247 UTC [1219] ERROR: XX000: tuple concurrently updated 2022-08-24 16:01:29.247 UTC [1219] LOCATION: simple_heap_update, heapam.c:4179 2022-08-24 16:01:29.247 UTC [1219] STATEMENT: ALTER SUBSCRIPTION citus_shard_move_subscription_10 DISABLE ``` However, we have other mechanisms to clean up any leftovers in case of a failed cleanup. So it's not that big of a problem. The reason we run into this error is arguably because of a Postgres bug, so I created a patch for Postgres that fixes this. While we wait for this (or a similar) patch to be merged, this PR disables the flaky test. There's still a test that tests in case of a connection "kill" instead of a "cancel", so I don't think we lose very important coverage by disabling this test. When trying to reproduce this I only hit this issue in the cancel case, so I don't think there's a need to disable the kill case for now.	2022-08-26 12:49:45 +02:00
Jelte Fennema	2a0c0b3ba6	Fix flakyness in failure_connection_establishment (#6251 ) In CI sometimes failure_connection_establishment would fail with the following error: ```diff -- cancel all connections to this node SELECT citus.mitmproxy('conn.onAuthenticationOk().cancel(' \|\| pg_backend_pid() \|\| ')'); - mitmproxy ---------------------------------------------------------------------- - -(1 row) - +ERROR: canceling statement due to user request +CONTEXT: COPY mitmproxy_result, line 1: "" +SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'" +PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE SELECT * FROM citus_check_cluster_node_health(); ``` The reason for this is that the mitm command that was used is very broad and doesn't actually do what the comment says. What happens is that if any connection is made, the current backend is cancelled, which is not always the same as the backend that made the connection. My assessment is that likely the maintenance daemon makes a connection to the node while we are executing the mitmproxy command. The mitmproxy command goes through, and then triggers a cancel of itself due to the connection made by the maintenance daemon. This PR simply removes this test, since it doesn't seem to test what it intended to test anyway. There's also still the "kill" version of this test, which does do the intended thing. So I don't think we lose important coverage by removing this test.	2022-08-26 10:01:36 +00:00
Jelte Fennema	18015ca501	Fix flakyness in multi_transaction_recovery (#6249 ) Sometimes in CI multi_transaction_recovery would fail with the following error: ```diff SET LOCAL citus.defer_drop_after_shard_move TO OFF; SELECT citus_move_shard_placement((SELECT * FROM selected_shard), 'localhost', :worker_1_port, 'localhost', :worker_2_port, shard_transfer_mode := 'block_writes'); - citus_move_shard_placement ---------------------------------------------------------------------- - -(1 row) - +ERROR: could not find placement matching "localhost:57637" +HINT: Confirm the placement still exists and try again. COMMIT; ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26510/workflows/8269ea93-d9b4-4376-ae0e-8332a5c15fc6/jobs/755548 The reason for this was that when choosing `selected_shard` we didn't ensure that it was actually located on the node that we were moving it from. Instead we simply picked the first shard for the table that was returned by the query. To fix this issue this PR adds a filter to only choose shards that are located on the intended node.	2022-08-26 11:48:55 +02:00
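The fix in 18015ca501 amounts to filtering placements by source node before picking a shard, instead of taking whichever shard comes back first. A minimal sketch of that selection logic follows; the `Placement` type and function name are hypothetical, and the sample data is made up for illustration.

```python
from typing import NamedTuple, Optional, Sequence


class Placement(NamedTuple):
    shard_id: int
    node_name: str
    node_port: int


def select_shard_on_node(placements: Sequence[Placement],
                         node_name: str, node_port: int) -> Optional[int]:
    """Return the lowest shard id that is placed on the given node.

    Returns None when the node holds no placement, rather than silently
    falling back to a shard that lives somewhere else.
    """
    candidates = sorted(
        p.shard_id for p in placements
        if p.node_name == node_name and p.node_port == node_port
    )
    return candidates[0] if candidates else None


# Made-up placements: the first shard in the list is NOT on port 57637.
placements = [
    Placement(101, "localhost", 57638),
    Placement(102, "localhost", 57637),
]
selected = select_shard_on_node(placements, "localhost", 57637)
```

Picking the first row unconditionally would have chosen shard 101 here, and a move from port 57637 would then fail with the "could not find placement" error shown in the diff.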
Jelte Fennema	9749622399	Fix flakyness in isolation_distributed_deadlock_detection (#6240 ) Our isolation_distributed_deadlock_detection test would fail randomly in CI in three different ways. The first type of failure looked like this: ```diff check_distributed_deadlocks --------------------------- t (1 row) -step s1-update-5: <... completed> step s5-update-1: <... completed> ERROR: canceling the transaction since it was involved in a distributed deadlock +step s1-update-5: <... completed> step s1-commit: ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26399/workflows/d213ee85-397a-467a-9ffb-39e4f44e6688/jobs/749533 This random change in output was harmless and happened because when the deadlock detector cancelled a query, two queries would continue: The one that was cancelled would throw an error (and thus complete), and the one that was unblocked would now complete. It was random which of the two the isolation tester would first detect as completed. To resolve this, this PR starts using the ["marker" feature][1], which allows us to make sure one of the steps won't be marked as completed until the other one has completed. The second random failure was very similar: ```diff check_distributed_deadlocks --------------------------- t (1 row) -step s2-update-2: <... completed> -step s3-update-3: <... completed> -ERROR: canceling the transaction since it was involved in a distributed deadlock step s6-commit: COMMIT; step s5-update-6: <... completed> +step s2-update-2: <... completed> +step s3-update-3: <... completed> +ERROR: canceling the transaction since it was involved in a distributed deadlock step s5-commit: ``` Again a harmless difference in test output. In this case it's possible that the deadlock detector would not detect the unblocked processes right away, and would thus continue to the next step. This step was a commit on a session that was not blocked, and which thus could complete without issues.
To solve this I changed the order of the commits at the end of the permutation, to always have the first session that would commit be the session that would be unblocked the last. This ensures that no commit will ever be executed before completing all the queries. The third issue was different and looked like this: ```diff step s4-update-5: <... completed> step s4-commit: COMMIT; +step s1-update-4: <... completed> +isolationtester: canceling step s3-update-4 after 5 seconds step s3-update-4: <... completed> +ERROR: canceling statement due to user request +step s2-update-2: <... completed> step s3-commit: COMMIT; -step s2-update-2: <... completed> -step s1-update-4: <... completed> step s1-commit: ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26411/workflows/9089beec-4f0f-4027-b4ce-0e84889afc06/jobs/750143 The reason for this failure is not entirely clear to me, but I was able to remove the flakyness without impacting the goal of the test. What was happening was that both `s1` and `s3` were waiting for `s4` to commit and release its lock on row 4. For some reason it wasn't deterministic which of the two sessions would be granted the lock after it was released by `s4`. The test expected `s3` to be granted the lock, but sometimes it would be granted to `s1` instead, which would in turn cause `s3` to still be blocked. To solve this I simply removed `s1` completely from this test. It wasn't actually part of the cycle that the deadlock detector should detect and was an unrelated appendage: ```mermaid graph TD; s2-->s3; s3-->s4; s1-->s4; s4-->s5; s5-->s6; s6-->s5; ``` By removing `s1` completely there was no contention for the lock and `s3` could always acquire it. [1]: `a73d6c87f2/src/test/isolation/README (L163-L188)`	2022-08-26 12:03:40 +03:00
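The claim that `s1` is not part of the deadlock cycle can be checked mechanically from the wait-for graph in the mermaid diagram above. The sketch below is a plain reachability check written for illustration; it is not how the Citus deadlock detector is implemented.

```python
from typing import Dict, Iterable, Set, Tuple


def nodes_on_cycles(edges: Iterable[Tuple[str, str]]) -> Set[str]:
    """Return every node that can reach itself by following edges."""
    graph: Dict[str, Set[str]] = {}
    for src, dst in edges:
        graph.setdefault(src, set()).add(dst)
        graph.setdefault(dst, set())

    def reachable_from(start: str) -> Set[str]:
        # Iterative depth-first search over the successors of `start`.
        seen: Set[str] = set()
        stack = list(graph[start])
        while stack:
            node = stack.pop()
            if node not in seen:
                seen.add(node)
                stack.extend(graph[node])
        return seen

    return {node for node in graph if node in reachable_from(node)}


# Edges copied from the mermaid graph: "a waits for b" is (a, b).
wait_for = [("s2", "s3"), ("s3", "s4"), ("s1", "s4"),
            ("s4", "s5"), ("s5", "s6"), ("s6", "s5")]
deadlocked = nodes_on_cycles(wait_for)
```

Only `s5` and `s6` wait on each other; `s1`, `s2`, `s3` and `s4` are merely queued behind the cycle, so dropping `s1` leaves the cycle under test intact.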
Jelte Fennema	b5cd1676f9	Fix flakyness in multi_utilities (#6245 ) In CI multi_utilities would sometimes fail randomly with this error: ```diff VACUUM (INDEX_CLEANUP ON, PARALLEL 1) local_vacuum_table; SELECT pg_size_pretty( pg_total_relation_size('local_vacuum_table') ); pg_size_pretty ---------------- - 21 MB + 22 MB (1 row) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26459/workflows/da47d9b6-f70b-49fe-806f-5ebf75bf0b11/jobs/752482 This is a harmless change in output where the relation size after vacuuming was slightly more than we expected. This changes the size checks for the local_vacuum_table to allow a wider range of values. It uses the same trick as #6216 to show the actual value when it's outside this valid range, which is useful if this test ever starts failing again.	2022-08-25 22:50:47 +02:00
Jelte Fennema	00485d45a6	Make multi_utilities not leak tables (#6246 ) When trying to fix #6245 I realized that multi_utilities was leaking some tables that it created during the test. This fixes that by creating all these tables in a schema that's dedicated for this test.	2022-08-25 19:33:03 +03:00
Jelte Fennema	1688bcda33	Fix errors in base_schedule (#6247 ) When running `make check-base` locally it would fail with two different errors. The first one was this: ```diff SELECT create_distributed_table('pg_class', 'relname'); -ERROR: cannot create a citus table from a catalog table +ERROR: deadlock detected +DETAIL: Process 28950 waits for ExclusiveLock on relation 16551 of database 16384; blocked by process 28951. +Process 28951 waits for RowExclusiveLock on relation 1259 of database 16384; blocked by process 28950. +HINT: See server log for query details. SELECT create_reference_table('pg_class'); ``` This happened because multi_behavioral_analytics_create_table and multi_create_table were being run in parallel. Running them separately resolved this issue. The second one was this: ```diff CREATE OR REPLACE FUNCTION wait_until_metadata_sync(timeout INTEGER DEFAULT 15000) RETURNS void LANGUAGE C STRICT AS 'citus'; +ERROR: duplicate key value violates unique constraint "pg_proc_proname_args_nsp_index" +DETAIL: Key (proname, proargtypes, pronamespace)=(wait_until_metadata_sync, 23, 2200) already exists. -- Add some helper functions for sending commands to mitmproxy ``` Which was because failure_test_helpers and multi_test_helpers were trying to create the same function at the exact same time. The easy fix here is to simply not create this function in the failure_test_helpers file. This is fine, because any test schedule that runs failure_test_helpers also runs multi_test_helpers.	2022-08-25 18:06:41 +02:00
Jelte Fennema	ee5af1ab90	Use C.UTF-8 locale in tests (#6242 ) I upgraded my OS to Ubuntu 22.04 a while back and since then some tests order output slightly differently. I think it might be because of the glibc upgrade that changed ordering for things like underscores and spaces. Changing the locale to C.UTF-8 solves this issue.	2022-08-25 13:10:49 +02:00
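As a rough illustration of why a C-style locale gives stable ordering: it compares strings by code point, so the relative order of characters such as space and underscore is fixed rather than depending on the collation tables shipped with a given glibc version. Python's default string comparison also works by code point, so it can stand in for that behaviour here; the sample names are made up.

```python
# Made-up identifiers that differ only in space / underscore / no separator.
names = ["a_b", "a b", "ab"]

# Default str comparison is by code point: ' ' (0x20) < '_' (0x5F) < 'b' (0x62).
code_point_order = sorted(names)

# Sorting the UTF-8 encoded bytes gives the same order, since UTF-8
# preserves code point order under bytewise comparison.
byte_order = [b.decode("utf-8") for b in sorted(n.encode("utf-8") for n in names)]
```

A linguistic locale may weigh punctuation differently, which is the kind of drift the commit message attributes to the glibc upgrade.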
Önder Kalacı	3ed6fea1cf	Prevent Merge command on distributed tables [PG 15] (#6238 )	2022-08-25 13:27:08 +03:00
Marco Slot	9bf3c3dd5c	Add an allow_unsafe_constraints flag for constraints without distribution column (#6237 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-25 11:37:50 +03:00
Gokhan Gulbiz	69d2fcf5c0	Use the same colocation group for child and parent rels when altering a distributed table (#6225 ) * Alter_distributed_table colocateWith:none bug fix for partitioned tables. * Regression tests added for alter_distributed_table colocateWith:none for partitioned tables * Update query comparison to be more accurate	2022-08-25 11:23:59 +03:00
Marco Slot	ac07d33a29	Remove unused reduceQuery from physical planning (#6221 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-24 17:24:27 +00:00
Naisila Puka	1f4fe35512	Support JSON_TABLE on PG 15 (#6241 ) Postgres supports JSON_TABLE feature on PG 15. We treat JSON_TABLE the same as correlated functions (e.g., recurring tuples). In the end, for multi-shard JSON_TABLE commands, we apply the same restrictions as reference tables (e.g., cannot be in the outer part of an outer join etc.) Co-authored-by: Onder Kalaci <onderkalaci@gmail.com>	2022-08-24 19:11:18 +03:00
Naisila Puka	35b4ddc355	Pg15 support (#6085 ) * Adjust configure script to allow PG15 * Adds copy of ruleutils_14.c as ruleutils_15.c * Uses get_namespace_name_or_temp in ruleutils_15.c Relevant PG commit: 48c5c9068211e0a04fd9553c8714b2821ed3ad17 * Clean up code using "(expr) ? true : false" in ruleutils_15.c Relevant PG commit: fd0625c7a9c679c0c1e896014b8f49a489c3a245 * Change varno from Index (unsigned int) to int in ruleutils_15.c Relevant PG commit: e3ec3c00d85bd2844ffddee83df2bd67c4f8297f * Adds find_recursive_union to ruleutils_15.c Relevant PG commit: 3f50b82639637c9908afa2087de7588450aa866b * Fix display of SQL-std func's args in INSERT/SELECT in ruleutils_15.c Relevant PG commit: a8d8445a7b2f80f6d0bfe97b19f90bd2cbef8759 * Fix ruleutils_15.c's dumping of whole-row Vars in more contexts Relevant PG commit: 43c2175121c829c8591fc5117b725f1f22bfb670 * Fix assorted missing logic for GroupingFunc nodes in ruleutils_15.c Relevant PG commit: 2591ee8ec44d8cbc8e1226550337a64c684746e4 * Adds grammar support for SQL/JSON clauses in ruleutils_15.c Relevant PG commit: f79b803dcc98d707450e158db3638dc67ff8380b * Adds SQL/JSON constructors to ruleutils_15.c Relevant PG commits: f4fb45d15c59d7add2e1b81a9d477d0119a9691a cc7401d5ca498a84d9b47fd2e01cebd8e830e558 * Adds support for MERGE in ruleutils_15.c Relevant PG commit: 7103ebb7aae8ab8076b7e85f335ceb8fe799097c * Add IS JSON predicate to ruleutils_15.c Relevant PG commit: 33a377608fc29cdd1f6b63be561eab0aee5c81f0 * Add SQL/JSON query functions to ruleutils_15.c Relevant PG commit: 1a36bc9dba8eae90963a586d37b6457b32b2fed4 * Adds three different SQL/JSON values to ruleutils_15.c Relevant PG commits: 606948b058dc16bce494270eea577011a602810e 49082c2cc3d8167cca70cfe697afb064710828ca * Adds JSON table functions in ruleutils_15.c Relevant PG commit: 4e34747c88a03ede6e9d731727815e37273d4bc9 * Add PLAN function for JSON table in ruleutils_15.c Relevant PG commit: fadb48b00e02ccfd152baa80942de30205ab3c4f * Remove extra blank lines 
before block-closing braces ruleutils_15.c Relevant PG commit: 24d2b2680a8d0e01b30ce8a41c4eb3b47aca5031 * set_deparse_plan: Reuse variable to appease Coverity ruleutils_15.c Relevant PG commit: e70813fbc4aaca35ec012d5a426706bd54e4acab * Mechanical code beautification ruleutils_15.c Relevant PG commit: 23e7b38bfe396f919fdb66057174d29e17086418 * Rename value_type to item_type in ruleutils_15.c Relevant PG commit: 3ab9a63cb638a1fd99475668e2da9c237495aeda * Show 'AS "?column?"' explicitly when it's important in ruleutils_15.c Relevant PG commit: c7461fc25558832dd347a9c8150b0f1ed85e36e8 * Fix ruleutils_15.c issues with dropped cols in funcs-returning-composite Relevant PG commit: c1d1e8469c77ce6b8e5310955580b4a3eee7fe96 * Change comment regarding functions returning composite in ruleutils_15.c Relevant PG commit: c2fa113ddb1117b1f03e91960f65d5d7d8a90270 * Replace int nodes with bool nodes where needed In PG15, Boolean nodes are added. Pre PG15, internal Boolean values in Create Role commands were represented by Integer nodes. This commit replaces int nodes logic with bool nodes logic where needed. Mostly there are CREATE ROLE logic changes. Relevant PG commit: 941460fcf731a32e6a90691508d5cfa3d1f8eeaf * Handle new option colliculocale in CREATE COLLATION logic In PG15, there is an added option to use ICU as global locale provider. pg_collation has three locale-related fields: collcollate and collctype, which are libc-related fields, and a new one colliculocale, which is the ICU-related field. Only the libc-related fields or the ICU-related field is set, never both. Relevant PG commits: f2553d43060edb210b36c63187d52a632448e1d2 54637508f87bd5f07fb9406bac6b08240283be3b * Add PG15 tests to CI using test images that have 15beta2 (#6093) * Change warning message in pg_signal_backend() Relevant PG commit: 7fa945b857cc1b2964799411f1633468826861ff * Revert "Add missing ifdef for PG 15" This reverts commit `c7b51025ab`. * Fixes tests for ALTER TRIGGER RENAME consistency for part. 
tables Relevant PG commit: 80ba4bb383538a2ee846fece6a7b8da9518b6866 * Prevent creating child triggers on partitions when adding new node Pre PG15, tgisinternal is true for a "child" trigger on a partition cloned from the trigger on the parent. In PG15, tgisinternal is false in that case. However, we don't want to create this trigger on the partition since it will create a conflict when we try to attach the partition to the parent table: ERROR: trigger "..." for relation "{partition_name}" already exists Relevant PG commit: f4566345cf40b068368cb5617e61318da60676ec * Fix tests for generated columns dependency changes In PG15, for GENERATED columns, all dependencies of the generation expression are recorded as NORMAL dependencies of the column itself. This requires CASCADE to drop generated cols with the original col. Pre PG15, dependencies were recorded as AUTO, with which generated columns are silently dropped with the original column. Relevant PG commit: cb02fcb4c95bae08adaca1202c2081cfc81a28b5 * Explicitly cast catalog "char" column to text before concatenation Relevant PG commit: 07eee5a0dc642d26f44d65c4e6263304208e8583 * Remove 'AS "?column?"' from test outputs There were some instances in the following test outputs in planning debug outputs where AS "?column?" is added. We add a normalization rule to remove it as it is not important. cte_inline.out recursive_relation_planning_restriction_pushdown.out Relevant PG commit: c7461fc25558832dd347a9c8150b0f1ed85e36e8 * Use pg_backup_stop(PG15) instead of pg_stop_backup(PG<15) Add an alternative test output because of the change in the backup modes of Postgres. Specifically here, there is a renaming issue: pg_stop_backup PRE PG15 vs pg_backup_stop PG15+ The alternative output can be deleted when we drop support for PG14 Relevant PG commit: 39969e2a1e4d7f5a37f3ef37d53bbfe171e7d77a * Adds citus.mitmfifo GUC Previously we were setting this configuration parameter on the fly for the failure tests schedule. 
However, PG15 doesn't allow that anymore: reserved prefixes like "citus" cannot be used to set non-existing GUCs. Relevant PG commit: 88103567cb8fa5be46dc9fac3e3b8774951a2be7 * Handles EXPLAIN output diffs in PG15 - Extra result lines To handle extra "Result" lines in explain outputs, we add an explain method to multi_test_helpers.sql file - plan_without_result_lines() is added for cases where we want the whole explain output with only "Result" lines removed * Handles EXPLAIN output diffs in PG15, Hash Agg/Join leverage To handle differences in usage of GroupAggregate vs HashAggregate or Merge Join vs Hash join in cases where this detail doesn't seem to matter, we use coordinator_plan(). - coordinator_plan() is updated to remove "Result" lines There are some cases where we have subplans so we add a new function that prints all Task Count lines as well - coordinator_plan_with_subplans() Still not sure of the relevant PG commit Could be db0d67db2401eb6238ccc04c6407a4fd4f985832 but disabling enable_group_by_reordering didn't help. * Handles EXPLAIN output diffs in PG15: enable_group_by_reordering Relevant PG commit db0d67db2401eb6238ccc04c6407a4fd4f985832 * Normalizes Memory Usage, Buckets, Batches for PG15 explain diffs We create a new function in multi_test_helpers, which is similar to explain_merge function in PG15. This explain helper function normalizes Memory Usage, Buckets and Batches, and we use it in the tests which give a different output for PG15. 
* Bump test images to 15beta3 (#6172) * Omit namespace in post-copy errmsg Relevant PG commit: 069d33d0c5a021601245e44df77a0423ddd69359 * Handles EXPLAIN output diffs in PG15: extra arrows&result lines To handle extra "->" arrows resulting from extra Result lines in explain outputs, we add the following explain method to multi_test_helpers.sql file - plan_without_arrows() is added for cases where we want the whole explain output without arrows and without Result lines * Alters public schema's owner to pg_database_owner in PG15 In PG15, public schema is owned by pg_database_owner role. In multi_extension, we drop and recreate the public schema, hence its owner becomes the default user in our tests, postgres. Change that to pg_database_owner for PG15 consistency. This results in alternative test output for public schema grants in the following test: grant_on_schema_propagation.sql Relevant PG commit: b073c3ccd06e4cb845e121387a43faa8c68a7b62 * Add alternative test outputs for change in Insert Select display citus_local_tables_queries.sql coordinator_shouldhaveshards.sql cte_inline.sql insert_select_repartition.sql intermediate_result_pruning.sql local_shard_execution.sql local_shard_execution_replicated.sql multi_deparse_shard_query.sql multi_insert_select.sql multi_insert_select_conflict.sql multi_mx_insert_select_repartition.sql mx_coordinator_shouldhaveshards.sql single_node.sql Relevant PG commit: a8d8445a7b2f80f6d0bfe97b19f90bd2cbef8759 * Fixes columnar tap tests for PG15 In PG15, Perl test modules have been moved to a new namespace. Also, postgres node new() and get_new_node() methods have been unified to one method: new() We create separate tap tests for PG13/14 and PG15+ and update the Makefiles accordingly. Relevant PG commits: 201a76183e2056c2217129e12d68c25ec9c559c8 b3b4d8e68ae83f432f43f035c7eb481ef93e1583 * Handles EXPLAIN output diffs in PG15: HashAgg Leverage,alt. 
output Still not sure of the relevant PG commit Could be db0d67db2401eb6238ccc04c6407a4fd4f985832 but disabling enable_group_by_reordering didn't help.	2022-08-24 17:59:17 +02:00
Naisila Puka	ddbd10d2e7	Rename server version checks in tests (#6239 )	2022-08-24 16:31:52 +03:00
Jelte Fennema	5c0205ce10	Fix flakyness in multi_replicate_reference_table (#6235 ) In CI multi_replicate_reference_table would sometimes fail like this: ```diff -- detects correctly that referecence table doesn't have replica identity SELECT replicate_reference_tables(); -ERROR: cannot use logical replication to transfer shards of the relation initially_not_replicated_reference_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY +ERROR: cannot use logical replication to transfer shards of the relation ref_table since it doesn't have a REPLICA IDENTITY or PRIMARY KEY DETAIL: UPDATE and DELETE commands on the shard will error out during logical replication unless there is a REPLICA IDENTITY or PRIMARY KEY. HINT: If you wish to continue without a replica identity set the shard_transfer_mode to 'force_logical' or 'block_writes'. ``` Because `CitusTableTypeIdList` returns tables in heap order so it's a bit random which one is first in the list. And the test contained multiple tables that didn't have a primary key or replica identity. So it made sense that the error could be for either one of these tables. This PR makes the test output consistent by changing one of the tables to have a primary key. Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26387/workflows/fc3196e7-ddf2-4000-a70b-5ac71c836321/jobs/748940	2022-08-24 13:34:10 +03:00
Jelte Fennema	e230c849fd	CI: Store postgres logs as artifacts (#6236 ) If a test fails sometimes the diff of the output isn't very helpful. In those cases looking at the postgres logs can help a lot. We were only storing these logs as artifacts for arbitrary config tests and tap tests, now we also store them for our regular test runs.	2022-08-24 12:24:47 +02:00
aykut-bozkurt	041f88d7bf	Revert "Revert "Creates new colocation for colocate_with:='none' too"" (#6227 ) This reverts commit `d171a736ab`.	2022-08-24 10:54:04 +03:00
Marco Slot	bad8196da3	Verify that we can replicate reference tables using rebalancer (#6232 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-24 00:34:21 +02:00
Jelte Fennema	1dd775fae8	Speed up logical replication tests to fix flakyness (#6229 ) The isolation_tenant_isolation_nonblocking test would sometimes randomly fail in CI, because we have a runtime limit of 2 minutes per test. ``` test isolation_tenant_isolation_nonblocking ... make: *** [Makefile:171: check-enterprise-isolation] Terminated Too long with no output (exceeded 2m0s): context deadline exceeded ``` One solution would obviously be to increase the timeout, but instead I spent some time to increase the speed of our tests by tweaking some timings. On my local machine the time it took to run the isolation_tenant_isolation_nonblocking test went from 75s to 15s. So now we should easily stay within the 2 minute per test limit. I also checked if the new settings improved other logical replication tests, but the impact differs wildly per test. One other example of a test that runs much quicker due to the change is isolation_non_blocking_shard_split_fkey. But the shard move tests I tried are impacted much less. Example of failed tests: https://app.circleci.com/pipelines/github/citusdata/citus/26373/workflows/4fa660e4-63c8-4844-bef8-70a7bea902b7/jobs/748199	2022-08-23 17:37:31 +02:00
Jelte Fennema	21780b4f65	Fix flakyness in ch_benchmarks_1 (#6228 ) One of our arbitrary config tests would sometimes fail like this in CI: ```diff su_nationkey, cust_nation, l_year; - supp_nation \| cust_nation \| l_year \| revenue ---------------------------------------------------------------------- - 9 \| C \| 2008 \| 3.00 -(1 row) - +ERROR: cannot connect to localhost:10212 to fetch intermediate results +CONTEXT: while executing command on localhost:10211 ``` When looking at the logs it seems like we were running out of connections: ``` 2022-08-23 14:03:52.856 UTC [28122] FATAL: sorry, too many clients already 2022-08-23 14:03:52.860 UTC [21027] ERROR: cannot connect to localhost:10212 to fetch intermediate results ``` This happened with `CitusThreeWorkersManyShards` config. This test on purpose tries to push the limits of Citus quite far. And the `ch_benchmarks_1` test is also run in parallel with a few more ones. So it's not too weird that it ran out of connections. This doubles the connection limit in the arbitrary config tests to hopefully not hit this error again. Example of failed test: https://app.circleci.com/pipelines/github/citusdata/citus/26365/workflows/7a1b5688-85cc-4bc3-ade5-9bd1d83cd0ed/jobs/747908/parallel-runs/1	2022-08-23 17:24:27 +02:00
Jelte Fennema	e0ada050aa	Enable binary logical replication for shard moves (#6017 ) Using binary encoding can save a lot of CPU cycles, both on the sender and on the receiver. Since the walsender and walreceiver processes are single threaded, this can matter a lot for the throughput if they are bottlenecked on CPU. This feature is only available in PG14, not PG13. It should be safe to always enable because it's only used for types that support binary encoding according to the PG docs: > Even when this option is enabled, only data types that have binary > send and receive functions will be transferred in binary. But in case it causes problems, it can still be disabled by setting `citus.enable_binary_protocol` to `false`.	2022-08-23 16:38:00 +02:00
aykut-bozkurt	07cfba461a	ensuring reference tables on nodes should not create colocation entry. (#6224 ) We create colocation entry in create_reference_table.	2022-08-23 16:17:59 +03:00
Jelte Fennema	cc7e93a56a	Fix flakyness in failure_connection_establishment (#6226 ) In CI our failure_connection_establishment sometimes failed randomly with the following error: ```diff -- verify a connection attempt was made to the intercepted node, this would have cause the -- connection to have been delayed and thus caused a timeout SELECT * FROM citus.dump_network_traffic() WHERE conn=0; conn \| source \| message ------+--------+--------- - 0 \| coordinator \| [initial message] -(1 row) +(0 rows) SELECT citus.mitmproxy('conn.allow()'); ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26318/workflows/d3354024-9a67-4b01-9416-5cf79aec6bd8/jobs/745558 The way I fixed this was by removing the dump_network_traffic call. This might sound simple, but doing this while continuing to let the test serve its intended purpose required quite some more changes. This dump_network_traffic call was there because we didn't want to show warnings in the queries above, because the exact warnings were not reliable. The main reason this error was not reliable was because we were using round-robin task assignment. We did the same query twice, so that it would hit the node with the intercepted connection in one of those connections. Instead of doing that I'm now using the "first-replica" policy and do the queries only once. This works, because the first placements by placementid for each of the used tables are on the second node, so first-replica will cause the first connection to go there. 
This solved most of the flakyness, but when confirming that the flakyness was fixed I found some additional errors: ```diff -- show that INSERT failed SELECT citus.mitmproxy('conn.allow()'); mitmproxy ----------- (1 row) SELECT count(*) FROM single_replicatated WHERE key = 100; - count ---------------------------------------------------------------------- - 0 -(1 row) - +ERROR: could not establish any connections to the node localhost:9060 after 400 ms RESET client_min_messages; ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26321/workflows/fd5f4622-400c-465e-8d82-83f5f55a87ec/jobs/745666 I addressed this with a combination of two things: 1. Only change citus.node_connection_timeout for the queries that we want to test timeout behaviour for. When those queries are done I reset the value to the default again. 2. Change our mitm framework to only delay the initial connection packet instead of all packets. I think sometimes a follow-on packet of a previous connection attempt was causing the next connection attempt to be delayed even if `conn.allow()` was already called. For our tests we only care about connection timeouts, so there's no reason to delay any other packets than the initial connection packet. Then there was some last flakyness in the exact error that was given: ```diff -- tests for connectivity checks SELECT name FROM r1 WHERE id = 2; WARNING: could not establish any connections to the node localhost:9060 after 900 ms +WARNING: connection to the remote node localhost:9060 failed with the following error: name ------ bar (1 row) ``` Source: https://app.circleci.com/pipelines/github/citusdata/citus/26338/workflows/9610941c-4d01-4f62-84dc-b91abc56c252/jobs/746467 I don't have a good explanation for this slight change in error message, but given that it is missing the actual error message I expected this to be related to some small difference in timing: e.g. 
the server responding to the connection attempt right after the coordinator determined that the connection timed out. To solve this last flakyness I increased the connection timeouts and made the difference between the timeout and the delay a bit bigger. With these tweaks I wasn't able to reproduce this error on CI anymore. Finally, I made most of the same changes to failure_failover_to_local_execution, since it was using the `conn.delay()` mitm method too. The only change that I left out was the timing increase, since it might not be strictly necessary and increases time it takes to run the test. If this test ever becomes flaky the first thing we should try is increase its timeout.	2022-08-23 15:04:20 +03:00
Jelte Fennema	506c16efdf	Fix flakyness in failure_single_select (#6223 ) The failure_single_select test would sometimes fail with an error that's similar to this: ```diff -- cancel after first SELECT; txn should fail and nothing should be marked as invalid SELECT citus.mitmproxy('conn.onQuery(query="^SELECT").cancel(' \|\| pg_backend_pid() \|\| ')'); - mitmproxy ---------------------------------------------------------------------- - -(1 row) - +ERROR: canceling statement due to user request +CONTEXT: COPY mitmproxy_result, line 1: "" +SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'" +PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE BEGIN; ``` This error looked very similar to the one from #6217 and indeed the cause turned out to be similar. Because we were canceling all SELECT queries, we would actually sometimes cancel our mitmproxy SELECT queries themselves. This puts some additional restrictions on the queries that we cancel, most importantly it should contain the name of the table that we're selecting from. I was able to reproduce the original issue locally pretty reliably. With the changes in this PR it didn't happen again. In passing this also changes one other failure test that was cancelling all selects and puts similar additional restrictions on those cancellations. Example of failed test in CI: https://app.circleci.com/pipelines/github/citusdata/citus/26305/workflows/4d942b91-f83c-453c-8d9a-ae22d608e756/jobs/745071	2022-08-22 20:06:33 +02:00
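The restriction described in the failure_single_select fix (#6223) comes down to narrowing a query pattern. The following is a minimal Python sketch of that idea only, not the mitm framework's actual matching code. The table name `select_test` and both query strings are hypothetical placeholders, and using `re.search` is an assumption about how such a pattern could be applied.

```python
import re

# Hypothetical query texts; neither string is taken from the real test.
unrelated_query = "SELECT 1"
target_query = "SELECT count(*) FROM select_test"

# Broad filter: matches every query that starts with SELECT.
broad = re.compile(r"^SELECT")
# Narrowed filter: additionally requires the (hypothetical) table name.
narrow = re.compile(r"^SELECT.*select_test")


def matches(pattern: re.Pattern, query: str) -> bool:
    """Return True when the pattern is found in the query text."""
    return pattern.search(query) is not None


if __name__ == "__main__":
    for name, pattern in (("broad", broad), ("narrow", narrow)):
        for query in (unrelated_query, target_query):
            print(name, repr(query), matches(pattern, query))
```

With the broad pattern, any SELECT can trigger the cancellation; the narrowed pattern only fires for queries that mention the table under test.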
Hanefi Onaldi	28b04dc9f4	Merge pull request #6078 from citusdata/bump-pg-versions	2022-08-22 17:52:56 +03:00
Hanefi Onaldi	a570dfead2	Remove outdated defaults for PG versions in CI configs	2022-08-22 17:16:52 +03:00
Hanefi Onaldi	616b1758c2	Add more normalization rules	2022-08-22 17:16:52 +03:00
Hanefi Onaldi	e33ba7da9e	Decrease min messages for normalization	2022-08-22 17:16:52 +03:00
Hanefi Onaldi	9ec9209fd9	Bump PG versions in CI configs	2022-08-22 17:16:52 +03:00
Marco Slot	639588bee0	Remove unused functions (#6220 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-22 11:53:25 +03:00
Jelte Fennema	e2a24b921e	Fix flakyness in failure_create_distributed_table_non_empty (#6217 ) The failure_create_distributed_table_non_empty test would sometimes fail like this: ```diff -- in the first test, cancel the first connection we sent from the coordinator SELECT citus.mitmproxy('conn.cancel(' \|\| pg_backend_pid() \|\| ')'); - mitmproxy ---------------------------------------------------------------------- - -(1 row) - +ERROR: canceling statement due to user request +CONTEXT: COPY mitmproxy_result, line 1: "" +SQL statement "COPY mitmproxy_result FROM '/home/circleci/project/src/test/regress/tmp_check/mitmproxy.fifo'" +PL/pgSQL function citus.mitmproxy(text) line 11 at EXECUTE SELECT create_distributed_table('test_table', 'id'); ``` Because the cancel command had no filter it would actually sometimes cancel the mitmproxy cancel command itself. This PR addresses that by filtering on CREATE TABLE, which is one of the commands that create_distributed_table will send to the workers. Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26252/workflows/1b7e5464-cca4-4ec1-99b3-48ddf25c29fa/jobs/742829	2022-08-20 01:23:25 +03:00
Jelte Fennema	4ce17f015b	Fix flakyness in columnar_memory test (#6216 ) Sometimes in CI the columnar_memory test was using slightly more memory than expected. ```diff SELECT CASE WHEN 1.0 * TopMemoryContext / :top_post BETWEEN 0.98 AND 1.02 THEN 1 ELSE 1.0 * TopMemoryContext / :top_post END AS top_growth FROM columnar_test_helpers.columnar_store_memory_stats(); --[ RECORD 1 ]- -top_growth \| 1 +-[ RECORD 1 ]------------------ +top_growth \| 1.0206132116232119 -- before this change, max mem usage while executing inserts was 28MB and ``` This PR changes the expectation to be slightly higher, such that this random increase in memory usage doesn't cause a flaky test. Failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26256/workflows/c0870f66-3346-4f8d-a1d3-36dfd7c98289/jobs/743028	2022-08-19 23:46:28 +02:00
Jelte Fennema	de475feb69	Actually connect to the right database in logical_replication test (#6211 ) In the logical_replication test we test that the cleanup logic at the start of a shard move works as expected. To do so we create a subscription and publication slot manually. This changes the test to make that subscription actually connect to the database that the publication is in. Useful for #5987 #6085	2022-08-20 00:09:50 +03:00
Jelte Fennema	dfa6c26d7d	Increase isolation timeout because of shards splits (#6213 ) Recently isolation tests involving shard splits have been randomly failing in CI with timeouts. It's possible that there's an actual bug here, but it's also quite likely that our timeout is just slightly too low for the combination of shard splits and the CI VM having a bad day. Increasing the timeout is fairly low cost and allows us to find out if there's an actual bug or if it's simply slowness. So that's what this PR does. If it turns out to be an actual bug, we can decrease the timeout again when we fix it. Examples of failed tests: 1. https://app.circleci.com/pipelines/github/citusdata/citus/26241/workflows/9e0bb721-d798-481b-907c-914236b63e38/jobs/742409 2. https://app.circleci.com/pipelines/github/citusdata/citus/26171/workflows/8f352e3b-e6e4-4f7f-b0d0-2543f62a0209/jobs/739470	2022-08-19 22:37:45 +03:00
Naisila Puka	9cfadd7965	Deletes unnecessary test outputs pt2 (#6214 )	2022-08-19 18:21:13 +03:00
Jelte Fennema	85305b2773	Don't run any isolation tests in parallel (#6212 ) By running isolation tests in parallel we're just asking for flaky tasks. The first test might temporarily block one of the commands in the second test, which we then detect as waiting like this: ```diff step s2-vacuum-analyze: VACUUM ANALYZE test_insert_vacuum; - + <waiting ...> step s1-commit: COMMIT; +step s2-vacuum-analyze: <... completed> ``` Debugging flaky tests is also much harder when they are run in parallel. This PR starts running all our isolation tests sequentially. The reason for opening this PR was me seeing this failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26194/workflows/ff57e2cf-8ac4-40fe-bc0c-74a7f8fecb53/jobs/740454 As well as having fixed a similar issue recently in #6122	2022-08-19 17:05:36 +02:00
Önder Kalacı	616ff2a3fe	Adjust some isolation test for the recent PG commits (#6210 ) * Adjust some isolation test for the recent PG commits In `3f32395612`, Postgres starts any isolation session with `set application_name`. However, one of our tests expected that its command is exactly the first command in the session. The test tries to show that even if a gpid has not been assigned, we can show it in the citus_lock_waits graph. Now it is literally not possible to have such a test, as the gpid would be assigned after the `set application_name` command. Still, it is good to have a test where a command is blocked on the parser	2022-08-19 17:06:34 +03:00
Jelte Fennema	e6a1a86db0	Improve debugability for columnar_memory flakyness (#6203 ) Sometimes the columnar_memory test fails in CI with the following error: ```diff SELECT 1.0 * TopMemoryContext / :top_post BETWEEN 0.98 AND 1.02 AS top_growth_ok FROM columnar_test_helpers.columnar_store_memory_stats(); -[ RECORD 1 ]-+-- -top_growth_ok \| t +top_growth_ok \| f -- before this change, max mem usage while executing inserts was 28MB and ``` This is almost certainly a harmless failure that simply requires bumping the margin a little bit. However, it's impossible to say with the current output. I was unable to reproduce this on-demand on my local machine or even in CI. So this changes the test to include the actual value difference in the size of TopMemoryContext when it's outside the expected range. Then next time it fails we at least have some information about why. Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/25966/workflows/d472a57b-419a-4f33-b8bc-2e174a98d4d6/jobs/730576	2022-08-19 15:41:16 +02:00
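The change described in #6203, reporting the measured growth instead of a bare true/false, can be sketched in Python as follows. This is an illustration of the pattern only: the function name `top_growth` and its parameters are hypothetical, while the 0.98 and 1.02 bounds are the ones shown in the test diff.

```python
def top_growth(baseline: float, current: float,
               low: float = 0.98, high: float = 1.02) -> float:
    """Return 1 when growth is within the margin, else the actual ratio.

    Collapsing in-range values to 1 keeps the expected test output stable,
    while out-of-range values show up in the failure diff and tell us by
    how much the margin was exceeded.
    """
    ratio = current / baseline
    return 1 if low <= ratio <= high else ratio


if __name__ == "__main__":
    print(top_growth(1000.0, 1010.0))  # within the margin
    print(top_growth(1000.0, 1100.0))  # outside the margin
```

The advantage over a boolean column is that a single failing CI run already carries the number needed to decide whether the margin should be widened.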
Jelte Fennema	3f4440ff69	Improve debugability of failures in isolation_ref2ref_foreign_keys (#6197 ) As shown in #6196 the output of s1-view-locks is sometimes not as expected. However, because its output is very minimal it's hard to understand the reason for that. This adds some more columns and aggregates less, so we can more easily see what locks are unexpectedly held or released. In passing this also fixes the following flaky part of this test by excluding locks taken by the maintenance daemon. After running it with this more detailed output for s1-view-locks it became obvious that that was the problem here. ```diff diff -dU10 -w /home/jelte/work/citus/src/test/regress/expected/isolation_ref2ref_foreign_keys.out /home/jelte/work/citus/src/test/regress/results/isolation_ref2ref_foreign_keys.out --- /home/jelte/work/citus/src/test/regress/expected/isolation_ref2ref_foreign_keys.out.modified 2022-08-18 15:42:08.689525233 +0200 +++ /home/jelte/work/citus/src/test/regress/results/isolation_ref2ref_foreign_keys.out.modified 2022-08-18 15:42:08.729525233 +0200 @@ -288,21 +288,22 @@ step s1-view-locks: SELECT mode, count(*) FROM pg_locks WHERE locktype='advisory' GROUP BY mode ORDER BY 1, 2; mode \|count ------------------------+----- -(0 rows) +ShareUpdateExclusiveLock\| 1 +(1 row) starting permutation: s2-begin s2-insert-table-3 s1-view-locks s2-rollback s1-view-locks step s2-begin: BEGIN; step s2-insert-table-3: INSERT INTO ref_table_3 VALUES (7, 5); step s1-view-locks: ```	2022-08-19 15:12:09 +02:00
Jelte Fennema	25e5cf2e50	Fix flakyness in failure_setup (#6205 ) In CI sometimes failure_setup will fail with the following error: ```diff SELECT master_add_node('localhost', :worker_2_proxy_port); -- an mitmproxy which forwards to the second worker - master_add_node ---------------------------------------------------------------------- - 2 -(1 row) - +ERROR: connection to the remote node localhost:9060 failed with the following error: could not connect to server: Connection refused + Is the server running on host "localhost" (127.0.0.1) and accepting + TCP/IP connections on port 9060? +could not connect to server: Connection refused + Is the server running on host "localhost" (127.0.0.1) and accepting + TCP/IP connections on port 9060? +could not connect to server: Cannot assign requested address + Is the server running on host "localhost" (::1) and accepting + TCP/IP connections on port 9060? diff -dU10 -w /home/circleci/project/src/test/regress/expected/failure_online_move_shard_placement.out /home/circleci/project/src/test/regress/results/failure_online_move_shard_placement.out ``` This then breaks all the tests run after it as well, because we're missing one worker node. Locally I was able to reproduce this error by sleeping for 10 seconds in the forked process before actually starting mitmproxy. So I'm expecting what's happening in CI is that due to limited resources, mitmproxy is not up yet when we try to add its port as a worker node. This PR fixes this by waiting until mitmproxy is listening on its socket before actually starting to run our tests. This fixed it locally for me when I made the forked process sleep for 10 seconds before starting mitmproxy. In passing it also improves the detection and errors that we already had for the case where something was already listening on the mitmproxy port. Because both @gledis69 and I were changing things in our CI images at the same time this also includes a bump of the style checker tools. 
Closes #6200	2022-08-19 13:03:08 +00:00
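The wait described in the failure_setup fix (#6205), blocking until mitmproxy listens on its socket before the tests start, can be sketched roughly as below. This is a minimal Python sketch under stated assumptions, not the code from the PR: the helper name `wait_until_listening`, the polling interval and the default timeout are all hypothetical.

```python
import socket
import time


def wait_until_listening(host: str, port: int,
                         timeout: float = 10.0,
                         interval: float = 0.1) -> bool:
    """Poll until something accepts TCP connections on (host, port).

    Returns True as soon as a connection attempt succeeds, or False when
    the deadline passes without a successful connection.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means the listener is up; close it again.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Not listening yet (or refused); retry after a short pause.
            time.sleep(interval)
    return False


if __name__ == "__main__":
    # Port 9060 is the proxy port that appears in the error output above.
    if not wait_until_listening("localhost", 9060, timeout=5.0):
        raise SystemExit("proxy did not start listening in time")
```

Polling with a deadline keeps the common case fast while still tolerating a slow CI machine, which a fixed sleep would not.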
Jelte Fennema	3fadb98380	Fix compilation warning on PG13 + OpenSSL 3.0 (#6038 ) This removes some warnings that are present when building on Ubuntu 22.04. It removes warnings on PG13 + OpenSSL 3.0. OpenSSL 3.0 has marked some functions that we use as deprecated, but we want to continue to support OpenSSL 1.0.1 for the time being too. This indicates that to OpenSSL 3.0, so it doesn't show warnings.	2022-08-19 05:51:47 -07:00
Onur Tirtir	d83fa7af34	Add a changelog entry for 10.2.8 (#6207 )	2022-08-19 15:40:30 +03:00
Jelte Fennema	fe1668e43f	Fix flakyness in multi_utilities (#6204 ) Sometimes this multi_utilities would fail with the following error: ```diff SET citus.log_remote_commands TO ON; -- should propagate to all workers because no table is specified ANALYZE; NOTICE: issuing BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED;SELECT assign_distributed_transaction_id(0, 3461, '2022-08-19 01:56:06.35816-07'); DETAIL: on server postgres@localhost:57637 connectionId: 1 NOTICE: issuing BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED;SELECT assign_distributed_transaction_id(0, 3461, '2022-08-19 01:56:06.35816-07'); DETAIL: on server postgres@localhost:57638 connectionId: 2 NOTICE: issuing SET citus.enable_ddl_propagation TO 'off' DETAIL: on server postgres@localhost:57637 connectionId: 1 -NOTICE: issuing SET citus.enable_ddl_propagation TO 'off' -DETAIL: on server postgres@localhost:xxxxx connectionId: xxxxxxx NOTICE: issuing ANALYZE DETAIL: on server postgres@localhost:57637 connectionId: 1 +NOTICE: issuing SET citus.enable_ddl_propagation TO 'off' +DETAIL: on server postgres@localhost:57638 connectionId: 2 NOTICE: issuing ANALYZE DETAIL: on server postgres@localhost:57638 connectionId: 2 ``` This is simply a harmless change in output due to some timing differences. This PR makes the test output consistent by only logging the remote ANALYZE commands, not the SET commands.	2022-08-19 12:38:55 +02:00
Marco Slot	5160cafa82	Do not propagate GRANT ON SCHEMA from CREATE EXTENSION (#6175 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-08-19 13:23:47 +03:00
Jelte Fennema	8ce12eb51f	Fix flakyness in failure_insert_select_repartition (#6202 ) This fixes our most commonly randomly failing failure test. The failing diff is as follows: ```diff SELECT citus.mitmproxy('conn.onQuery(query="fetch_intermediate_results").kill()'); mitmproxy ----------- (1 row) INSERT INTO target_table SELECT * FROM source_table; -ERROR: connection to the remote node localhost:xxxxx failed with the following error: connection not open +ERROR: could not open file "base/pgsql_job_cache/10_0_40/repartitioned_results_20770193413_from_4213590_to_1.data": No such file or directory +CONTEXT: while executing command on localhost:9060 +while executing command on localhost:57637 SELECT * FROM target_table ORDER BY a; ``` As far as I can tell this is caused by a race condition: After killing fetch_intermediate_results on worker 9060, the previously created data file gets cleaned up. The fetch_intermediate_results call that's sent to worker 57637 will be cancelled and rolled back soon because of the failure on the other connection. But if that fetch_intermediate_results call is able to connect to 9060 before it is cancelled, it won't find the file it's looking for there anymore. So while it's not the error we expect, it does indicate that we succeeded. To avoid this issue instead of killing the fetch_intermediate_results call directly, we kill the COPY command that it uses to do the fetch. This results in stable output as can be seen here, where 227 runs of failure_insert_select_repartition succeeded: https://app.circleci.com/pipelines/github/citusdata/citus/26168/workflows/9c64a3b6-f46c-4725-9fb4-8f6a2d00a023/jobs/739389 To be clear this changes the test to affect the opposite fetch_intermediate_results call. This kills the fetch_intermediate_results call of worker 57637, instead of killing the fetch_intermediate_results call on worker 9060. 
Example of failing test: https://app.circleci.com/pipelines/github/citusdata/citus/26147/workflows/780e95ea-264a-4c9f-ad2e-cf11449a795e/jobs/738467	2022-08-19 09:11:07 +00:00
Naisila Puka	5a9fdc221b	Add explicit alias to avoid debug output diff in pg15 (#6183 )	2022-08-19 11:39:18 +03:00
Onur Tirtir	1f1b02b64b	Merge pull request #6199 from citusdata/cl-10.2.6-11.0.6 Add changelog entries for 10.2.7 & 11.0.6	2022-08-19 10:57:42 +03:00
Onur Tirtir	b6b8f198d9	Add changelog entries for 11.0.6	2022-08-19 10:41:39 +03:00
Onur Tirtir	efb0d5f48e	Add changelog entries for 10.2.7	2022-08-19 10:41:26 +03:00
Gledis Zeneli	2b74735496	Update stylechecker version (#6194 ) Update stylechecker image to include versions similar to the other test images.	2022-08-19 01:29:45 +03:00
Jelte Fennema	31faa88a4e	Track rebalance progress at the shard move level (#6187 ) We're in the process of totally changing the shard rebalancer experience and infrastructure. Soon the shard rebalancer will include retries, crash recovery and support for running in the background. These improvements come at a cost though, the way the get_rebalance_progress UDF currently works is very hard to replicate with this new structure. This is mostly because the old behaviour doesn't really make sense anymore with this new infrastructure. A new and better way to track the progress will be included as part of the new infrastructure. This PR is in preparation of the new rebalancer experience. It changes the get_rebalance_progress UDF to only display the moves that are in progress at the moment, not the ones that happened in the past or that are planned in the future. Another option would have been to completely remove the current get_rebalance_progress functionality and point people to the new way of tracking progress. But old blogposts still reference the old UDF and users might have some automation on top of it. Showing the progress of the current moves is fairly simple to achieve, even with the new infrastructure. So this PR is a kind of compromise: It doesn't have complete feature parity with the old get_rebalance_progress, but the most common use cases will still work. There's also an advantage of the change: You can now see progress of shard moves that were triggered by calling citus_move_shard_placement manually, instead of only being able to see progress of moves that were initiated using rebalance_table_shards.	2022-08-18 18:57:04 +02:00
Önder Kalacı	961fcff5db	Properly add / remove coordinator for isolation tests (#6181 ) We used to rely on a separate session to add the coordinator. However, that might prevent the existing sessions from getting assigned proper gpids, which causes flaky tests.	2022-08-18 17:32:12 +03:00
Jelte Fennema	7dca028391	Fix flakiness in isolation_reference_table (#6193 ) The newly introduced isolation_reference_table test had some flakiness, because the assumption on how the arbitrary reference table gets chosen was incorrect. This introduces a VACUUM FULL at the start of the test to ensure the assumption actually holds. Example of failed test: https://app.circleci.com/pipelines/github/citusdata/citus/26108/workflows/0a5cd526-006b-423e-8b67-7411b9c6be36/jobs/736802	2022-08-18 15:47:28 +03:00
Jelte Fennema	0a045afd3a	Fix flakiness in columnar_first_row_number test (#6192 ) When running columnar_first_row_number in parallel with the columnar_query test, it would sometimes fail. This bug is tracked in #6191. For now, to make CI less flaky, we simply don't run these tests in parallel. Example of failed test: https://app.circleci.com/pipelines/github/citusdata/citus/26106/workflows/75d00ea9-23f8-4bff-a927-bced19e1f81b/jobs/736713 Fixes #6184	2022-08-18 15:32:57 +03:00
Jelte Fennema	d16b458e2a	Remove the flaky rollback_to_savepoint test (#6190 ) This removes a flaky test that I introduced in #3868 after I fixed the issue described in #3622. This test sometimes fails randomly in CI. The way it fails indicates that there might be some bug: A connection breaks after rolling back to a savepoint. I tried reproducing this issue locally, but I wasn't able to. I don't understand what causes the failure. Things that I tried were: 1. Running the test with: ```sql SET citus.force_max_query_parallelization = true; ``` 2. Running the test with: ```sql SET citus.max_adaptive_executor_pool_size = 1; ``` 3. Running the test in parallel with the same tests that it is run in parallel with in multi_schedule. None of these allowed me to reproduce the issue locally. So I think it's time to give up on fixing this test and simply remove the test. The regression that this test protects against seems very unlikely to reappear, since in #3868 I also added a big comment about the need for the newly added `UnclaimConnection` call. So, I think the need for the test is quite small, and removing it will make our CI less flaky. In case the cause of the bug ever gets found, I tracked the bug in #6189 Example of a failing CI run: https://app.circleci.com/pipelines/github/citusdata/citus/26098/workflows/f84741d9-13b1-4ae7-9155-c21ed3466951/jobs/736424 For reference the unexpected diff is this (so both warnings and an error): ```diff INSERT INTO t SELECT i FROM generate_series(1, 100) i; +WARNING: connection to the remote node localhost:57638 failed with the following error: +WARNING: +CONTEXT: while executing command on localhost:57638 +ERROR: connection to the remote node localhost:57638 failed with the following error: ROLLBACK; ``` This test is also mentioned as the most failing regression test in #5975	2022-08-18 15:14:16 +03:00
Önder Kalacı	418b4f96d6	Merge pull request #6166 from citusdata/fix_seq_ownership Support Sequences owned by columns that are added before distributing tables	2022-08-18 11:16:14 +02:00
Onder Kalaci	9ec8e627c1	Support Sequences owned by columns before distributing tables There are 3 different ways that a sequence can be interacting with tables. (1) and (2) are already supported. This commit adds support for (3). (1) column DEFAULT nextval('seq'): The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \|------------------ sequence <--------\| (2) serial columns: Bigserial/small serial etc: The dependency is roughly like below, and ExpandCitusSupportedTypes() is responsible for finding the depending sequences. schema <--- table <--- column <---- default value ^ \| \| \| sequence <--------\| (3) Sequence OWNED BY table.column: Added support for this type of resolution in this commit. The dependency is almost like the following, and ExpandCitusSupportedTypes() is NOT responsible for finding the dependency. schema <--- table <--- column ^ \| sequence	2022-08-18 10:29:40 +02:00
Naisila Puka	69ffdbf0e3	Uses object name in cannot distribute object error (#6186 ) Object type ids have changed in PG15 because of at least two added objects in the list: OBJECT_PARAMETER_ACL, OBJECT_PUBLICATION_NAMESPACE To avoid different output between pg versions, let's use the object name in the error, and put the object id in the error detail. Relevant PG commits: a0ffa885e478f5eeacc4e250e35ce25a4740c487 5a2832465fd8984d089e8c44c094e6900d987fcd	2022-08-18 11:05:17 +03:00
Ying Xu	91473635db	[Columnar] Check for existence of Citus before creating Citus_Columnar (#6178 ) * Added a check to see if Citus has already been loaded before creating citus_columnar * added tests	2022-08-17 15:12:42 -07:00
Nils Dijk	a9d47a96f6	Fix reference table lock contention (#6173 ) DESCRIPTION: Fix reference table lock contention Dropping and creating reference tables unintentionally blocked on each other due to the use of an ExclusiveLock for both the Drop and conditionally copying existing reference tables to (new) nodes. The patch does the following: - Lower lock level for dropping (reference) tables to `ShareLock` so they don't self conflict - Treat reference tables and distributed tables equally and acquire the colocation lock when dropping any table that is in a colocation group - Perform the precondition check for copying reference tables twice, first time with a lower lock that doesn't conflict with anything. Could have been a NoLock, however, in preparation for dropping a colocation group, it is an `AccessShareLock`. During normal operation the first check will always pass and we don't have to escalate that lock. This means we won't be blocked when adding and removing reference tables. Only after a node addition the first `create_reference_table` will still need to acquire an `ExclusiveLock` on the colocation group to perform the copy.	2022-08-17 18:19:28 +02:00
Ahmet Gedemenli	0631e1998b	Fix upgrade paths for #6100 (#6176 ) * Fix upgrade paths for #6100 Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2022-08-17 18:56:53 +03:00
Naisila Puka	20a0e0ed39	Grant create on public to some users where necessary (for PG15) (#6180 )	2022-08-17 17:35:10 +03:00
Jelte Fennema	3f6ce889eb	Use CreateSimpleHash (and variants) whenever possible (#6177 ) This is a refactoring PR that starts using our new hash table creation helper function. It adds a few more macros for ease of use, because C doesn't have default arguments. It also adds a macro to check if a struct contains automatic padding bytes. No struct that is hashed using tag_hash should have automatic padding bytes, because those bytes are undefined and thus using them to create a hash will result in undefined behaviour (usually a random hash).	2022-08-17 13:01:59 +03:00
aykut-bozkurt	52efe08642	default mode for shard splitting is set to auto. (#6179 )	2022-08-17 12:18:47 +03:00
aykut-bozkurt	be06d65721	Nonblocking tenant isolation is supported by using split api. (#6167 )	2022-08-17 11:13:07 +03:00
Jelte Fennema	78a5013e24	Support changing CPU priorities for backends and shard moves (#6126 ) Intro This adds support to Citus to change the CPU priority values of backends. This is created with two main usecases in mind: 1. Users might want to run the logical replication part of the shard moves or shard splits at a higher speed than they would do by themselves. This might cause some small loss of DB performance for their regular queries, but this is often worth it. During high load it's very possible that the logical replication WAL sender is not able to keep up with the WAL that is generated. This is especially a big problem when the machine is close to running out of disk when doing a rebalance. 2. Users might have certain long running queries that they don't want to impact their regular workload too much. Be very careful!!! Using CPU priorities to control scheduling can be helpful in some cases to control which processes are getting more CPU time than others. However, due to an issue called "[priority inversion][1]" it's possible that using CPU priorities together with the many locks that are used within Postgres causes the exact opposite behavior of what you intended. This is why this PR only allows the PG superuser to change the CPU priority of its own processes. Currently it's not recommended to set `citus.cpu_priority` directly. Currently the only recommended interface for users is the setting called `citus.cpu_priority_for_logical_replication_senders`. This setting controls CPU priority for a very limited set of processes (the logical replication senders). So, the dangers of priority inversion are also limited when using it for this usecase. Background Before reading the rest it's important to understand some basic background regarding process CPU priorities, because they are a bit counter intuitive. A lower priority value means that the process will be scheduled more and whatever it's doing will thus complete faster. The default priority for processes is 0.
Valid values are from -20 to 19 inclusive. On Linux a larger difference between values of two processes will result in a bigger difference in percentage of scheduling. Handling the usecases Usecase 1 can be achieved by setting `citus.cpu_priority_for_logical_replication_senders` to the priority value that you want it to have. It's necessary to set this both on the workers and the coordinator. Example: ``` citus.cpu_priority_for_logical_replication_senders = -10 ``` Usecase 2 can with this PR be achieved by running the following as superuser. Note that this is only possible as superuser currently due to the dangers mentioned in the "Be very careful!!!" section. And although this is possible it's NOT recommended: ```sql ALTER USER background_job_user SET citus.cpu_priority = 5; ``` OS configuration To actually make these settings work well it's important to run Postgres with a more permissive value for the 'nice' resource limit than Linux will do by default. By default Linux will not allow a process to set its priority lower than it currently is, even if it was lower when the process originally started. This capability is necessary to reset the CPU priority to its original value after a transaction finishes. Depending on how you run Postgres this needs to be done in one of two ways: If you use systemd to start Postgres all you have to do is add a line like this to the systemd service file: ```conf LimitNice=+0 # the + is important, otherwise it's interpreted incorrectly as 20 ``` If that's not the case you'll have to configure `/etc/security/limits.conf` like so, assuming that you are running Postgres as the `postgres` OS user: ``` postgres soft nice 0 postgres hard nice 0 ``` Finally you'd have to add the following line to `/etc/pam.d/common-session` ``` session required pam_limits.so ``` These settings would allow you to change the priority back after setting it to a higher value.
However, to actually allow you to set priorities even lower than the default priority value you would need to change the values in the config to something lower than 0. So for example: ```conf LimitNice=-10 ``` or ``` postgres soft nice -10 postgres hard nice -10 ``` If you use WSL2 you'll likely have to do another thing. You have to open a new shell, because PAM is only used during login, and WSL2 doesn't actually log you in. You can force a login like this: ``` sudo su $USER --shell /bin/bash ``` Source: https://stackoverflow.com/a/68322992/2570866 [1]: https://en.wikipedia.org/wiki/Priority_inversion	2022-08-16 13:07:17 +03:00
Jelte Fennema	1a01c896f0	Fix description of citus.distributed_deadlock_detection_factor (#5860 ) The long description of the `citus.distributed_deadlock_detection_factor` setting was incorrectly stating that 1000 would disable it. Instead -1 is the value that disables distributed deadlock detection.	2022-08-16 01:19:49 +03:00
Jelte Fennema	43c2a1e88b	Share more code between splits and moves (#6152 ) When introducing non-blocking shard split functionality it was based heavily on the non-blocking shard moves. However, the differences in usage were slightly too big to be able to reuse the existing functions easily. So, most logical replication code was simply copied to dedicated shard split functions and modified for that purpose. This PR tries to create a more generic logical replication infrastructure that can be used by both shard splits and shard moves. There's probably more code sharing possible in the future, but I believe this is at least a good start and addresses the lowest hanging fruit. This also adds a CreateSimpleHash function that makes creating the most common type of hashmap simpler.	2022-08-15 20:21:51 +03:00
Marco Slot	b491d87931	Merge pull request #6170 from citusdata/marcocitus/fix-htab-leaks	2022-08-15 17:50:48 +02:00
Marco Slot	6c73576606	Fix HTAB memory leaks	2022-08-15 16:10:24 +02:00
Önder Kalacı	c076fb72db	Merge pull request #6165 from citusdata/maryxu/chunk_filtering_test Updated columnar_chunk_filter test for PG15	2022-08-12 09:43:22 +02:00
yxu2162	e1322ec905	Change for PG15 test because hash_mem_multiplier was changed to 2 as a default instead of 1 which was what PG13/14 have	2022-08-11 09:49:56 -07:00
Teja Mupparti	e962113c63	Remove the GUC mention in the error message as this config is meant for advanced users	2022-08-11 09:43:14 -07:00
Önder Kalacı	31cdf27fd6	Merge pull request #6157 from citusdata/add_missing_schema Set missing search_path in the tests	2022-08-11 13:11:24 +02:00
Önder Kalacı	627feb6326	Merge branch 'main' into add_missing_schema	2022-08-11 13:02:50 +02:00
aykut-bozkurt	ccf1e0f584	Pg vanilla tests can be run with citus created. (#6018 )	2022-08-11 12:53:22 +03:00
Önder Kalacı	73fcbdf12c	Merge branch 'main' into add_missing_schema	2022-08-11 11:28:41 +02:00
Jelte Fennema	fd07cc9baf	Fix flakiness in create index concurrently isolation tests (#6158 ) This creates consistent test output for isolation tests that involve `CREATE INDEX CONCURRENTLY`. `CREATE INDEX CONCURRENTLY` is sometimes temporarily detected as blocking, even though it will complete without any other queries needing to be run. This change makes sure that we wait until that happens without running any other queries in the meantime. This way we always get consistent output. We do that by using an empty step in the same session as the `CREATE INDEX CONCURRENTLY` command. Doing so forces the isolation tester to wait until the command is finished and not continue with steps from other sessions. This is [the recommended approach by Postgres][1]. There are two separate cases which are addressed in slightly different ways: 1. If `CREATE INDEX CONCURRENTLY` is actually blocked on another session: Add an empty step right after the commit of the blocking session. e.g. `"s2-ddl-create-index-concurrently" "s1-commit" "s2-empty"` 2. If it's not actually blocked on another session: Add [an asterisk marker][2] to make it look like it's blocked (because sometimes this happens randomly) and right after that we add an empty step to trigger waiting. e.g. `"s2-ddl-create-index-concurrently"(*) "s2-empty" "s1-commit"` In passing this also enables isolation tests that were disabled due to a bug that has already been fixed for a while. Fixes #5993 Related to #5910 and #2966 [1]: `5f0adec253/src/test/isolation/README (L197-L204)` [2]: `5f0adec253/src/test/isolation/README (L174-L179)` Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2022-08-11 10:29:11 +02:00
aykut-bozkurt	898801504e	sysid should be parsed as int. (#6150 )	2022-08-11 10:44:46 +03:00
Önder Kalacı	e267914d84	Merge pull request #6156 from citusdata/fix_typo Typos of this type are not supported in PG15	2022-08-11 07:44:17 +02:00
Hanefi Onaldi	294400b2eb	Fix typos in tests that fail on PG15	2022-08-10 22:45:28 +03:00
Onder Kalaci	00ce7235cb	Set missing search_path in the tests On PG 15, public schema requires explicit GRANT, so let's avoid the conflict. Helpful for #6085	2022-08-10 18:04:10 +02:00
Onder Kalaci	44947d5634	This is not supported in PG15 so fix earlier	2022-08-10 17:44:03 +02:00
Önder Kalacı	6de8afec9f	Merge pull request #6143 from citusdata/naisila/fix_column_rename Renames remaining regclass to relation in columnar.options	2022-08-10 17:25:13 +02:00
naisila	ea209bd11d	Rename remaining regclass to relation in columnar.options	2022-08-10 15:38:53 +02:00
aykut-bozkurt	166272963a	log NOTICE createdb only if EnableUnsupportedFeatureMessages GUC is enabled. (#6151 )	2022-08-09 21:21:22 +03:00
aykut-bozkurt	cc694b6bcf	we consider stat object as invalid if it is not owned by current user (#6130 )	2022-08-09 20:59:30 +03:00
Hanefi Onaldi	2cdd49be5d	Merge pull request #6138 from citusdata/remove_source_files Remove .source files PostgreSQL 15 dropped usage of .source files that are used to generate .sql and .out files by replacing some placeholders with the actual values before test runs. Instead, the information is passed from pg_regress to the .sql and .out files directly via env variables. Those variables are read via \getenv psql command in relevant test files. PostgreSQL 15 introduced some changes to pg_regress binary that allowed this to happen. However this change is not backported to earlier versions, and thus we come up with a similar mechanism in pg_regress_multi that works in all supported PG versions. We also needed to make some changes to `copy` and `\copy` commands. `\copy` does not support variable interpolation, and we need to store `\copy` commands in a variable and use that variable in a consecutive line to let interpolation do its magic. Relevant PG commits: - postgres/postgres@33d3eeadb2 adds `\getenv` command to psql. - postgres/postgres@d1029bb5a2 updates all `.source` files to be supported `sql` or `out` files without actually renaming them. `pg_regress.c` is patched to set some environment variables that contain paths to relevant directories, and the `.source` files use the newly introduced `\getenv` to read those paths. - postgres/postgres@dc9c3b0ff2 renames all `.source` files into either `.sql` or `.out` files.	2022-08-09 15:26:54 +03:00
Hanefi Onaldi	6ef96ac560	Use client side \copy when accessing test files	2022-08-09 15:00:42 +03:00
Hanefi Onaldi	a58523f1d8	Remove all references to .source files	2022-08-09 14:15:52 +03:00
Hanefi Onaldi	9f52fa7610	Remove dynamic translation of regression test scripts, step 2. This commit is inspired by a commit dc9c3b0ff21465fa89d71eecf5e6cc956d647eca from PostgreSQL 15 that shares the same header. I also removed some gitignore rules so that I can add some files to git worktree. We used to ignore the generated files, that are no longer generated after this commit. -------------------- Below is the commit message from PostgreSQL 15 commit dc9c3b0ff21465fa89d71eecf5e6cc956d647eca : "git mv" all the input/.source and output/.source files into the corresponding sql/ and expected/ directories. Then remove the pg_regress and Makefile infrastructure associated with dynamic translation. Discussion: https://postgr.es/m/1655733.1639871614@sss.pgh.pa.us	2022-08-09 14:15:52 +03:00
Hanefi Onaldi	b6bd9ab87b	Remove dynamic translation of regression test scripts, step 1. This commit is inspired by a commit d1029bb5a26cb84b116b0dee4dde312291359f2a from PostgreSQL 15 that shares the same header. -------------------- Below is the commit message from PostgreSQL 15 commit d1029bb5a26cb84b116b0dee4dde312291359f2a : pg_regress has long had provisions for dynamically substituting path names into regression test scripts and result files, but use of that feature has always been a serious pain in the neck, mainly because updating the result files requires tedious manual editing. Let's get rid of that in favor of passing down the paths in environment variables. In addition to being easier to maintain, this way is capable of dealing with path names that require escaping at runtime, for example paths containing single-quote marks. (There are other stumbling blocks in the way of actually building in a path that looks like that, but removing this one seems like a good thing to do.) The key coding rule that makes that possible is to concatenate pieces of a dynamically-variable string using psql's \set command, and then use the :'variable' notation to quote and escape the string for the next level of interpretation. In hopes of making this change more transparent to "git blame", I've split it into two steps. This commit adds the necessary pg_regress.c support and changes all the *.source files in-place so that they no longer require any dynamic translation. The next commit will just "git mv" them into the regular sql/ and expected/ directories. Discussion: https://postgr.es/m/1655733.1639871614@sss.pgh.pa.us	2022-08-09 14:15:52 +03:00
Hanefi Onaldi	4185543910	Pass source directory in env to regression tests PostgreSQL 15 dropped usage of .source files that are used to generate .sql and .out files by replacing some placeholders with the actual values before test runs. Instead, the information is passed from pg_regress to the .sql and .out files directly via env variables. Those variables are read via \getenv psql command in relevant test files. PostgreSQL 15 commit d1029bb5a26cb84b116b0dee4dde312291359f2a introduced some changes to pg_regress binary that allowed this to happen. However this change is not backported to earlier versions of PG, and thus we come up with a similar mechanism in pg_regress_multi that works in all available PG versions.	2022-08-09 14:15:51 +03:00
Jelte Fennema	8017693b2f	Allow specifying the shard_transfer_mode when replicating reference tables (#6070 ) When using `citus.replicate_reference_tables_on_activate = off`, reference tables need to be replicated later. This can be done using the `replicate_reference_tables()` UDF. However, this function only allowed blocking replication. This changes the function to default to logical replication instead, and allows choosing any of our existing shard transfer modes.	2022-08-09 13:21:31 +03:00
Jelte Fennema	a645cb4b94	Better test failure debugging for arbitrary-configs (#5861 ) This improves debugging of arbitrary configs in two ways: 1. Enable logging of distributed deadlock detection 2. Show output of `psql` commands	2022-08-09 12:25:20 +03:00
Marco Slot	469c71524c	Merge pull request #6146 from citusdata/marcocitus/fix-copy-shard-placement	2022-08-09 09:44:06 +02:00
Marco Slot	3b57ff2867	Fix crash in citus_copy_shard_placement	2022-08-09 09:31:05 +02:00
Önder Kalacı	76a31f3234	Merge pull request #6147 from citusdata/naisila/explain_costs_false Explain w/out costs in ch_bench to avoid PG15 output diff	2022-08-09 09:30:01 +02:00
naisila	796d90d293	Explain w/out costs in ch_bench to avoid PG15 output diff	2022-08-09 07:53:27 +03:00
Naisila Puka	bcbba99c96	Clean up large_table_shard_count guc leftovers (#6144 )	2022-08-09 06:31:57 +03:00
Naisila Puka	3806f6f6a9	Add ORDER BY in pg_locks to avoid output order diffs (#6145 )	2022-08-09 06:02:07 +03:00
Naisila Puka	ce944c3c0f	Remove bogus guc citus.compression (#6142 )	2022-08-09 05:21:32 +03:00
Jelte Fennema	dd548ee3c7	Use faster custom copy logic for non-blocking shard moves (#6119 ) DESCRIPTION: Use faster custom copy logic for non-blocking shard moves Non-blocking shard moves consist of two main phases: 1. Initial data copy 2. Catchup phase This changes the first of these phases significantly. Previously we used the copy logic provided by postgres subscriptions. This meant we didn't have to implement it ourselves, but it came with the downside of little control. When implementing shard splits we needed more control to even make it work, so we implemented our own logic for copying data between nodes. This PR starts using that logic for non-blocking shard moves. Doing so has four main advantages: 1. It uses COPY in binary format when possible, which is cheaper to encode and decode. Furthermore it very often results in less data that needs to be sent over the network. 2. It allows us to create the primary key (or other replica identity) after doing the initial data copy. This should give some speed up over the total run, because creating an index in bulk is much faster than incrementally building it. 3. It doesn't require a replication slot per parallel copy. Increasing the maximum number of replication slots uses resources in postgres, even if they are not used. So reducing the number of replication slots that shard moves need is nice. 4. Logical replication table_sync workers are slow to start up, so if lots of shards need to be copied that can make it quite slow. This can happen easily when combining Postgres partitioning with Citus.	2022-08-08 17:09:43 +02:00
Marco Slot	cc2afb4b63	Merge pull request #6137 from citusdata/marcocitus/tenant-isolation	2022-08-08 13:56:02 +02:00
Marco Slot	6aee8f35a6	Fix tenant isolation failure tests	2022-08-08 13:33:23 +02:00
Marco Slot	ead9d28835	Avoid deadlocks on split failure by closing connections	2022-08-08 13:33:23 +02:00
Marco Slot	044dd26e40	Reimplement tenant isolation on top of block shard split	2022-08-08 13:33:23 +02:00
Naisila Puka	3401b31c13	Deletes unnecessary test outputs (#6140 )	2022-08-08 11:19:14 +03:00
Naisila Puka	9eedf6dcf8	Reduce log level to avoid alternative output for PG15 (#6139 )	2022-08-07 16:07:58 +03:00
Teja Mupparti	430c201d03	get_current_transaction_id() UDF is not printing the timestamp of the current transaction on the coordinator even when non-null	2022-08-05 10:12:07 -07:00
Naisila Puka	73f515f651	Add another expr to ORDER BY clause for consistency (#6136 )	2022-08-05 15:42:25 +03:00
aykut-bozkurt	4992533e33	support grant statement propagation for aggregates (#6132 )	2022-08-05 14:47:33 +03:00
Ahmet Gedemenli	8b68b0b5bb	Fix pg upgrade script for foreign tables (#6100 ) Fixes unexpected error for foreign tables when upgrading pg	2022-08-05 13:35:17 +03:00
Sameer Awasekar	e236711eea	Introduce Non-Blocking Shard Split Workflow	2022-08-04 16:32:38 +02:00
aykut-bozkurt	b67abdd28c	we should not log error in preprocess if attached partition is missing. (#6131 )	2022-08-04 15:49:14 +03:00
Naisila Puka	a1c630a16e	Reduce shard_count to reduce drain_node execution time (#6128 ) master_drain_node in distributed_triggers.sql test file takes too long to execute. It is directly dependent on the shard count. Hence I reduced shard count from 32 to 4 (default in tests), since this doesn't affect the validity of the tests.	2022-08-04 15:34:13 +03:00
aykut-bozkurt	3ddc089651	stop distributing views with no distributed dependency if GUC DistributeLocalViews is set false. (#6083 )	2022-08-04 12:34:40 +03:00
aykut-bozkurt	4ffe436bf9	we validate constraint as well if the statement is alter domain drop constraint (#6125 )	2022-08-03 23:06:33 +03:00
Jelte Fennema	dff71abc32	Fix flakiness in isolation_data_migration.spec (#6122 ) The isolation_concurrent_dml and isolation_data_migration tests were being run in parallel, but they were interfering with each other's output. Sometimes queries from isolation_concurrent_dml were blocking create_distributed_table in isolation_data_migration: 1. https://app.circleci.com/pipelines/github/citusdata/citus/25562/workflows/f9d0a6ff-bb7a-4b71-9fcf-1a3e46d54425/jobs/713270 2. https://app.circleci.com/pipelines/github/citusdata/citus/25562/workflows/1e22454c-1623-48a7-97fb-c6803c7959c7/jobs/713223 3. https://app.circleci.com/pipelines/github/citusdata/citus/25562/workflows/618c419e-eefb-4582-9482-322dbb9ac96d/jobs/713110 This fixes it by changing the schedule to not run these tests in parallel.	2022-08-03 17:56:49 +03:00
aykut-bozkurt	a662331668	qualify text dict and conf respect missingok (#6120 )	2022-08-03 13:13:53 +03:00
Jelte Fennema	8bbc1a45e1	Fix flakiness in isolation_replicate_reference_tables_to_coordinator.spec (#6123 ) When the deadlock detector kills s2-update-dist-table both sessions finish at the same time. The order in which they are displayed can be swapped. To counteract this we start using the ["marker" feature][1] of the isolationtester framework to create consistent output. In passing this also sets the next_shard_id to the value expected by this test so it can be run using `make check-isolation-base`. Failed CI test: https://app.circleci.com/pipelines/github/citusdata/citus/25562/workflows/dfe6f88a-c306-4d91-b771-d5d1deb1798d/jobs/713417 [1]: `ec62ce55a8/src/test/isolation/README (L152)`	2022-08-03 12:00:30 +02:00
aykut-bozkurt	f91f0f4b55	Merge pull request #6088 from citusdata/validate-address use address method to decide if we should run preprocess	2022-08-02 21:19:29 +03:00
aykutbozkurt	7387c7ed3d	address method should take parameter isPostprocess	2022-08-02 21:00:23 +03:00
aykutbozkurt	c98a68662a	introduces operation type for dist ops	2022-08-02 20:42:32 +03:00
aykutbozkurt	57ce4cf8c4	use address method to decide if we should run preprocess and postprocess steps for a distributed object	2022-08-02 20:42:32 +03:00
Jelte Fennema	8866d9ac32	Reduce setup time of check-minimal and check-minimal-mx (#6117 ) This change reduces the setup time of our minimal schedules in two ways: 1. Don't run `multi_cluster_management`, but instead run a much smaller sql file with almost the same results. `multi_cluster_management` adds and removes lots of nodes and tests all kinds of failure scenarios. This is not needed for the minimal schedules. The only reason we were using it there was to get a working cluster of the layout that the tests expected. The new `minimal_cluster_management` test achieves this with much less work, going from ~2s to ~0.5s. 2. Parallelize a bit more of the helper tests.	2022-08-02 17:58:59 +03:00
Naisila Puka	28e22c4abf	Reduce log level to avoid alternative output for PG15 (#6118 ) We are reducing the log level here to avoid alternative test output in PG15 because of the change in the display of SQL-standard function's arguments in INSERT/SELECT in PG15. The log level changes can be reverted when we drop support for PG14 Relevant PG commit: a8d8445a7b2f80f6d0bfe97b19f90bd2cbef8759	2022-08-02 11:56:28 +03:00
Önder Kalacı	5e3162fa05	Merge pull request #6116 from citusdata/main_pg_15 Add missing ifdef for PG 15 for testing & development purposes	2022-08-02 09:54:27 +02:00
Onder Kalaci	c7b51025ab	Add missing ifdef for PG 15	2022-08-02 09:46:53 +02:00
Jelte Fennema	abffa6c3b9	Use shard split copy code for blocking shard moves (#6098 ) The new shard copy code that was created for shard splits has some advantages over the old shard copy code. The old code was using worker_append_table_to_shard, which wrote to disk twice. And it also didn't use binary copy when that was possible. Both of these issues were fixed in the new copy code. This PR starts using this new copy logic also for shard moves, not just for shard splits. On my local machine I created a single shard table like this. ```sql set citus.shard_count = 1; create table t(id bigint, a bigint); select create_distributed_table('t', 'id'); INSERT into t(id, a) SELECT i, i from generate_series(1, 100000000) i; ``` I then turned `fsync` off to make sure I wasn't bottlenecked by disk. Finally I moved this shard between nodes with `citus_move_shard_placement` with `block_writes`. Before this PR a move took ~127s, after this PR it took only ~38s. So for this small test this resulted in spending ~70% less time. And I also tried the same test for a table that contained large strings: ```sql set citus.shard_count = 1; create table t(id bigint, a bigint, content text); select create_distributed_table('t', 'id'); INSERT into t(id, a, content) SELECT i, i, 'aunethautnehoautnheaotnuhetnohueoutnehotnuhetncouhaeohuaeochgrhgd.athbetndairgexdbuhaobulrhdbaetoausnetohuracehousncaoehuesousnaceohuenacouhancoexdaseohusnaetobuetnoduhasneouhaceohusnaoetcuhmsnaetohuacoeuhebtokteaoshetouhsanetouhaoug.lcuahesonuthaseauhcoerhuaoecuh.lg;rcydabsnetabuesabhenth' from generate_series(1, 20000000) i; ```	2022-08-01 20:10:36 +03:00
Naisila Puka	5060d0ab17	Remove leftover PG version_above_11 checks from tests (#6112 )	2022-08-01 15:38:19 +03:00
Naisila Puka	85324f3acc	Clean up multi_shard_commit_protocol guc leftovers (#6110 )	2022-08-01 15:22:02 +03:00
Naisila Puka	f9b02946b1	Delete PG version_above_ten alternative test outputs (#6111 )	2022-08-01 14:32:36 +03:00
Onur Tirtir	0a04b115aa	Add CHANGELOG entries for 11.0.5 (#6108 )	2022-08-01 12:39:56 +02:00
aykut-bozkurt	f372e93d22	we suppress notice log during looking up function oid to not break pg vanilla tests. (#6082 )	2022-08-01 10:14:35 +03:00
Önder Kalacı	5490c85f49	Merge pull request #6097 from citusdata/fix_relation_acess_2 Add missing relation access record for local utility command	2022-07-29 17:00:17 +02:00
Önder Kalacı	cbdc2b3019	Merge branch 'main' into fix_relation_acess_2	2022-07-29 16:45:02 +02:00
Marco Slot	ccc3b1bacf	Merge pull request #6105 from citusdata/marcocitus/fix-process-exit Fixes a crash that can happen due to catalog read in shmem_exit	2022-07-29 14:22:20 +02:00
Marco Slot	6d6e44166f	Avoid catalog read via superuser() call in DecrementSharedConnectionCounter	2022-07-29 14:05:41 +02:00
Onder Kalaci	bdaeb40b51	Add missing relation access record for local utility command While testing `5670dffd33`, I realized that we have a missing RecordNonDistTableAccessesForTask() for local utility commands. Although we don't have to record the relation access for local only cases, we really want to keep the behaviour for scale-out be the same with single node on all aspects. We wouldn't want any single node complex transaction to work on single machine, but not on multi node cluster. Hence, we apply the same restrictions. For example, on a distributed cluster, the following errors, and after this commit this errors locally as well ```SQL CREATE TABLE ref(a int primary key); INSERT INTO ref VALUES (1); CREATE TABLE dist(a int REFERENCES ref(a)); SELECT create_reference_table('ref'); SELECT create_distributed_table('dist', 'a'); BEGIN; SELECT * FROM dist; TRUNCATE ref CASCADE; ERROR: cannot execute DDL on table "ref" because there was a parallel SELECT access to distributed table "dist" in the same transaction HINT: Try re-running the transaction with "SET LOCAL citus.multi_shard_modify_mode TO 'sequential';" COMMIT; ``` We also add the comprehensive test suite and run the same locally.	2022-07-29 11:36:33 +02:00
Önder Kalacı	51a43dce4b	Merge pull request #6091 from citusdata/minor_fixes Remove useless PG version compats	2022-07-29 10:46:17 +02:00
Onder Kalaci	149771792b	Remove useless version compats most likely leftover from earlier versions	2022-07-29 10:31:55 +02:00
Onder Kalaci	24a9735e1c	Remove unused gitattributes	2022-07-29 10:30:14 +02:00
Ying Xu	7c1a93b26b	Removed USE_PGXS snippet in Makefile that was blocking citus build when flag is set (#6101 ) Code snippet in Makefile was blocking Citus build when USE_PGXS flag was set. This was included for port to FSPG but is not needed for Citus engine and can be safely removed.	2022-07-28 14:15:45 -07:00
aykut-bozkurt	a218198e8f	reindex object address should return invalid addresses for unsupported object types in reindex stmt (#6096 )	2022-07-28 15:31:49 +03:00
Marco Slot	e001ef76cf	Merge pull request #6059 from citusdata/marcocitus/fix-insert-select	2022-07-28 13:35:30 +02:00
Marco Slot	cff013a057	Fix issues with insert..select casts and column ordering	2022-07-28 13:23:57 +02:00
aykut-bozkurt	789d5b9ef9	null check for server in GetObjectAddressByServerName (#6095 )	2022-07-28 13:13:28 +03:00
Önder Kalacı	5670dffd33	Merge pull request #6092 from citusdata/fix_relation_acess Fix relation access tracking for local only transactions	2022-07-28 11:35:40 +02:00
Onder Kalaci	b41c3fd30d	Add tests	2022-07-28 11:27:59 +02:00
Onder Kalaci	0a5112964d	Call relation access hash clean-up irrespective of remote transaction state Mainly because local-only transactions should be cleaned up	2022-07-28 11:27:59 +02:00
Onder Kalaci	d67cf907a2	Detach relation access tracking from connection management	2022-07-28 11:27:59 +02:00
Ying Xu	fdf090758b	Bugfix for IN clause to be considered during planner phase in Columnar (#6030 ) Reported bug #5803 shows that we are currently not sending the IN clause to our planner for columnar. This PR fixes it by checking for ScalarArrayOpExpr in ExtractPushdownClause so that we do not skip it. Also added a test case for this new addition.	2022-07-27 11:06:49 -07:00
Jelte Fennema	0f50bef696	Avoid possible information leakage about existing users (#6090 )	2022-07-27 17:46:32 +02:00
Ahmet Gedemenli	2b2a529653	Error out for views with circular dependencies (#6051 ) Adds error check for views with circular dependencies	2022-07-27 17:57:45 +03:00
aykut-bozkurt	b08e5ec29d	added some missing object address callbacks (#6056 )	2022-07-27 17:36:04 +03:00
Naisila Puka	1259d83511	Smallfix in CreateCollationDDL logic (#6089 )	2022-07-27 14:33:31 +03:00
Önder Kalacı	14c56ce0dd	Merge pull request #6066 from citusdata/fix_colocation_lock Use colocation locks in lower level UDFs and create_distributed_table	2022-07-27 10:10:25 +02:00
Onder Kalaci	5bc8a81aa7	Add colocation checks for shard splits	2022-07-27 10:01:19 +02:00
Onder Kalaci	12fa3aaf6b	Concurrent shard move/copy and colocated table creation fix It turns out that create_distributed_table and citus_move/copy_shard_placement do not work well concurrently. To fix that, we need to acquire a lock, which sounds like a good use of colocation lock. However, the current usage of colocation lock is limited to higher level UDFs like rebalance_table_shards etc. That usage of the lock is still useful, but we cannot acquire the same lock on citus_move_shard_placement etc. because the coordinator connects to itself to acquire the lock. Hence, the high level UDF blocks itself. To fix that, we use one more colocation lock, with the placements as the main objects to consider.	2022-07-27 10:01:19 +02:00
Önder Kalacı	9332a53088	Merge pull request #6086 from citusdata/use_less_mem_for_index_fix Reduce memory consumption of index name fix for partitioned tables	2022-07-27 10:01:08 +02:00
Onder Kalaci	f076e81166	Do not cache all the metadata during fix_all_partition_shard_index_names	2022-07-27 09:49:08 +02:00
Onder Kalaci	26fdcb68f0	Optimize StringJoin() for when prefix-postfix is needed Before this commit, we required multiple copies of the same stringInfo if we needed to append/prepend data to the stringInfo. Now, we optionally get prefix/postfix. For large string operations, this can save up to 10% memory.	2022-07-27 09:49:08 +02:00
Onder Kalaci	b8008999dc	Reduce memory consumption while adjusting partition index names Previously, CreateFixPartitionShardIndexNames() created all the relevant query strings for all the shards, and executed the large query string. In terms of memory consumption, this huge command (and its ExprContext generated while running the command) is the main bottleneck. With this change, we are reducing the total amount of memory usage to almost 1/shard_count. On my local machine, for a distributed partitioned table with 120 partitions, each with 32 shards, the total memory consumption dropped from ~3GB to ~0.1GB, while the total execution time increased from ~28 seconds to ~30 seconds. This seems like a good trade-off.	2022-07-27 09:49:08 +02:00
aykut-bozkurt	5f27445b69	enable propagation warnings before postgres vanilla tests (#6081 )	2022-07-27 10:34:41 +03:00
Önder Kalacı	b3dcfddeb3	Merge pull request #6076 from citusdata/fix_backends Check the PGPROC's validity properly	2022-07-26 17:59:49 +02:00
Onder Kalaci	6c65d29924	Check the PGPROC's validity properly We used to only check whether the PID is valid or not. However, Postgres does not necessarily set the PID of the backend to 0 when it exits. Instead, we need to be able to check it from procArray. IsBackendPid() is what pg_stat_activity also relies on for a similar purpose.	2022-07-26 17:44:44 +02:00
Hanefi Onaldi	fba71f7c15	Merge pull request #6075 from citusdata/stable-libpq We have been testing with the 'latest' version of libpq when the CI images were built. This has the downside that rebuilding the images often breaks our tests due to different errors returned from libpq. With this change we will actually test with a stable version of libpq that is based on the postgres minor version that we test against. This will make it easier to maintain postgres images over time, as well as running all tests locally, where we change libpq in sync with the postgres server version.	2022-07-26 01:54:00 +03:00
Hanefi Onaldi	f944f97d01	Normalize messages from different libpq versions Historically we have been testing with the 'latest' version of libpq when the CI images were built. This has the downside that rebuilding the images often breaks our tests due to different errors returned from libpq. With this change we will actually test with a stable version of libpq that is based on the postgres minor version that we test against. This will make it easier to maintain postgres images over time, as well as running _all_ tests locally, where we change libpq in sync with the postgres server version.	2022-07-26 01:41:34 +03:00
Nils Dijk	dc30ee874a	use images that are built with the same libpq version as the minor postgres version	2022-07-26 01:41:33 +03:00
aykut-bozkurt	67ac3da2b0	added citus_depended_objects udf and HideCitusDependentObjects GUC to hide citus depended objects from pg meta queries (#6055 ) use RecurseObjectDependencies api to find if an object is citus depended make vanilla tests runnable to see if citus_depended function is working correctly	2022-07-25 16:43:34 +03:00
Marco Slot	e2a9495334	Merge pull request #6073 from citusdata/marcocitus/with-hold	2022-07-22 10:32:15 +02:00
Marco Slot	5fabf94e39	Allow WITH HOLD cursors with parameters	2022-07-21 12:00:59 +02:00
Hanefi Onaldi	eb3e5ee227	Introduce citus_locks view citus_locks combines the pg_locks views from all nodes and adds global_pid, nodeid, and relation_name. The columns of citus_locks don't change based on the Postgres version, however the pg_locks's columns do. Postgres 14 added one more column to pg_locks (waitstart timestamptz). citus_locks has the most expansive column set, including the newly added column. If citus_locks is queried in a Postgres version where pg_locks doesn't have some columns, the values for those columns in citus_locks will be NULL	2022-07-21 03:06:57 +03:00
Nitish Upreti	3d569cc49a	Shard Split support for Columnar and Partitioned Table (#6067 ) DESCRIPTION: This PR extends support for Partitioned and Columnar tables in blocking 'citus_split_shard_by_split_points' workflow. Columnar Support : No special handling required. Just removing checks that fail split for columnar table and adding test coverage. Partitioned Table Support : Skip copying of parent tables as they are empty. The partitions contain data and are treated as co-located shards that will be copied separately. Attach partitions to parent on destination after inserting new shard metadata and before creating foreign key constraints. MISC: Fix Bug #4949 where Blocking shard moves fail if there is a foreign key between partitioned distributed tables (from child to parent). TEST: Added new test 'citus_split_shards_columnar_partitioned' for splitting 'partitioned' and 'columnar + partitioned' table. Added new test 'shard_move_constraints_blocking' to add coverage for shard move bug fix. Updated test 'citus_split_shard_by_split_points_negative' to allow columnar and partitioned table.	2022-07-20 12:24:50 -07:00
Nils Dijk	bbb1da944f	allow ./configure to pass without checking the postgres version (#6072 ) For working on initial changes to postgres beta versions make the version check in `./configure` default, but optional. Normal users will still get the postgres version check error when building on other postgres versions, however, advanced users can use this flag to force configure to pass and find the compilation errors Citus would run into. Use of the flag is not advised for users not understanding what this does.	2022-07-20 19:56:17 +03:00
Naisila Puka	7d6410c838	Drop postgres 12 support (#6040 ) * Remove if conditions with PG_VERSION_NUM < 13 * Remove server_above_twelve(&eleven) checks from tests * Fix tests * Remove pg12 and pg11 alternative test output files * Remove pg12 specific normalization rules * Some more if conditions in the code * Change RemoteCollationIdExpression and some pg12/pg13 comments * Remove some more normalization rules	2022-07-20 17:49:36 +03:00
aykut-bozkurt	c085ac026a	Merge pull request #6069 from citusdata/assertion-fix fix assertion bugs related to list length	2022-07-20 12:00:11 +03:00
aykutbozkurt	108ca875ad	fix assertion bugs related to list length	2022-07-20 10:53:12 +03:00
Hanefi Onaldi	6a32061c08	Renames configure.in to fix warnings (#6034 ) When building packages on ubuntu jammy, we started to see some warnings. autoreconf: warning: autoconf input should be named 'configure.ac', not 'configure.in'	2022-07-19 18:24:15 +02:00
aykut-bozkurt	a5af78feb8	Merge pull request #6063 from citusdata/list-of-address-api change address method to return list of addresses	2022-07-19 18:22:23 +03:00
aykutbozkurt	ebb6d1c8c0	refactor code where GetObjectAddressFromParseTree is called because it returns list of addresses now	2022-07-19 18:13:12 +03:00
aykutbozkurt	9d232d7b00	change address method to return list of addresses	2022-07-19 18:13:11 +03:00
Önder Kalacı	9d5ca41e8c	Merge pull request #6022 from citusdata/baby_step_pg_15 PG 15 compatibility: Resolve compile issues + adjust shmem requests for	2022-07-18 17:48:50 +02:00
Önder Kalacı	90b1afe31e	Merge branch 'main' into baby_step_pg_15	2022-07-18 15:02:39 +02:00
Nitish Upreti	5b3537cdff	Shard Split for Citus (#6029 ) * Blocking split setup * Add missing type * Missing API from Metadata Sync * Shard Split e2e code * Worker Split Copy DestReceiver skeleton * Basic destreceiver code * worker_split_copy UDF * UDF calling * Split points are text * Isolate Tenant and Split Shard Unification * Fixing executor and misc * Reindent code * Fixing UDF definitions * Hello World Local Copy works * Remote copy hello world works * Local and Remote binary test * Fixing text local copy and adding tests * Hello World shard split works * Negative tests * Blocking Split workflow works * Refactor * Bug fix * Reindent * Cleaning up and adding comments * Basic test for shard split workflow * ReIndent * Circle CI integration * Removing include causing circle-ci build failure * Remove SplitCopyDestReceiver and use PartitionedResultDestReceiver * Add support for citus.enable_binary_protocol * Reindent * Fix build break * Update Test * Cleanup on catch * Addressing open comments * Update downgrade script and quote schema/table in COPY statement * Fix metadata sync issue. Update regression test * Isolation test and bug fix * Add Isolation test, fix foreign constraint deadlock issue * Misc code review comments * Test name needing to be quoted * Refactor code from review comments * Explaining shardGroupSplitIntervalListList * Fix upgrade & downgrade * Fix broken test * Test fix Round 2 * Fixing bug and modifying test appropriately * Fully qualify copy udf name. 
Run Reindent * Address PR comments * Fix null handling when creating AuxiliaryStructures * Ensure local copy is triggered in tests * Limit max shards that can be created with split * Test failure fix * Remove split_mode and use shard_transfer_mode instead' * Fix test failure * Fix test failure * Fixing permission issue when splitting non-superuser owned tables * Fix test expected output * Remove extra space * Fix test * attempt to fix test * Addressing Marco's PR comment * Only clean shards created by workflow * Remove from merge * Update test	2022-07-18 02:54:15 -07:00
Önder Kalacı	f745b3fae8	Merge pull request #6065 from citusdata/remove_unused_code Remove unused code	2022-07-17 10:18:09 +02:00
Onder Kalaci	3eaef027e2	Remove unused code Probably left over from removing old repartitioning code	2022-07-15 10:28:46 +02:00
Onder Kalaci	483a3a5875	PG 15 Compat: Resolve compile issues + shmem requests Similar to #5897, one more step for running Citus with PG 15. This PR at least makes Citus run with PG 15. I have not tried running the tests with PG 15. Shmem changes are based on `4f2400cb3f` Compile breaks are mostly due to #6008	2022-07-15 10:11:39 +02:00
Hanefi Onaldi	ae58ca5783	Replace isolation tester func only once on enterprise tests (#6064 ) This is a continuation of a refactor (with commit sha `2b7cf0c097`) that aimed to use Citus helper UDFs by default in iso tests. PostgreSQL isolation test infrastructure uses some UDFs to detect whether concurrent sessions block each other. Citus implements alternatives to that UDF so that we are able to detect and report distributed transactions that get blocked on the worker nodes as well. We needed to explicitly replace PG helper functions with Citus implementations in each isolation file. Now we replace them by default.	2022-07-14 19:16:53 +03:00
ywj	1675519f93	Support citus_columnar as separate extension (#5911 ) * Support upgrade and downgrade and separate columnar as citus_columnar extension Co-authored-by: Yanwen Jin <yanwjin@microsoft.com> Co-authored-by: Jeff Davis <jeff@j-davis.com>	2022-07-13 21:08:29 -07:00
Hanefi Onaldi	968bba1a7e	Add changelog entry for 11.0.4 (#6060 )	2022-07-13 08:12:01 -07:00
Önder Kalacı	beebbfc9ff	Merge pull request #6057 from citusdata/fix_read_rep_error Fix errors while promoting read-replicas to primary	2022-07-13 15:14:21 +02:00
Onder Kalaci	6cd7319f12	Add more generic read-replica tests	2022-07-13 14:58:30 +02:00
Onder Kalaci	3c343d4563	Add regression tests for LOCK command in citus.use_secondary_nodes=always mode	2022-07-13 14:27:11 +02:00
Onder Kalaci	b2e9a5baf1	Make sure citus_is_coordinator works on read replicas	2022-07-13 14:11:18 +02:00
Onder Kalaci	8ab696f7e2	LOCK COMMAND does not require primaries at the start	2022-07-13 14:08:49 +02:00
aykut-bozkurt	79fd5eca8a	Merge pull request #6048 from citusdata/relation-is-valid-check we should check if relation is valid after fetching a relation	2022-07-06 22:49:35 +03:00
aykutbozkurt	da089d72c5	we should check if relation is valid after fetching a relation	2022-07-06 16:35:01 +03:00
Halil Ozan Akgül	ac6ccab739	Merge pull request #6042 from citusdata/remove_wrong_parameter_from_get_all_active_transactions Removes incorrect parameter from get_all_active_transactions	2022-07-06 11:50:13 +03:00
Halil Ozan Akgul	1490acbbe9	Removes incorrect parameter from get_all_active_transactions	2022-07-06 11:35:46 +03:00
Hanefi Onaldi	2b7cf0c097	Replace iso tester func only once (#5964 ) Use Citus helper UDFs by default in iso tests PostgreSQL isolation test infrastructure uses some UDFs to detect whether concurrent sessions block each other. Citus implements alternatives to that UDF so that we are able to detect and report distributed transactions that get blocked on the worker nodes as well. We needed to explicitly replace PG helper functions with Citus implementations in each isolation file. Now we replace them by default.	2022-07-06 11:04:31 +03:00
Hanefi Onaldi	60b6119cc1	Merge pull request #6047 from citusdata/changelog-11.0.3 Add changelog entry for 11.0.3	2022-07-05 13:29:37 +03:00
Hanefi Onaldi	c33915c3e6	Add changelog entry for 11.0.3	2022-07-05 13:17:40 +03:00
aykut-bozkurt	b5723ffba7	Merge pull request #6044 from citusdata/alter-index-rename-non-index * alter index/table rename weird syntax supported,	2022-07-05 10:36:55 +03:00
aykutbozkurt	d53a7760b0	* alter index/table rename weird syntax supported, * correct the wrong level of lock if the weird syntax is used	2022-07-04 21:27:47 +03:00
aykut-bozkurt	4e9ea834b6	Merge pull request #6033 from citusdata/vacuum-index-cleanup-auto auto is a valid option for vacuum index_cleanup.	2022-07-04 19:43:02 +03:00
aykutbozkurt	ba62c0a148	auto is a valid option for vacuum index_cleanup.	2022-07-04 19:27:55 +03:00
Ahmet Gedemenli	c8e1e243b8	Fix matviews for citus_add_local_table_to_metadata (#6023 )	2022-07-04 17:00:07 +03:00
Hanefi Onaldi	f60809a6c1	Fix downgrade scripts from 11.0-2 to 11.0-1 (#6039 )	2022-06-29 22:43:50 +03:00
Önder Kalacı	9777a454db	Merge pull request #6036 from citusdata/fix_no_worker_node Fix upgrades to Citus 11 when there are no nodes in the metadata	2022-06-29 10:44:57 +02:00
Onder Kalaci	bab4c0a8c3	Fixes a bug that prevents upgrades when there are no worker nodes	2022-06-28 15:54:49 +02:00
Önder Kalacı	7a4253ace0	Merge pull request #6026 from citusdata/fix_col_names Fixes a bug that prevents using COMPRESSION and CONSTRAINT on a column	2022-06-28 13:58:50 +02:00
Onder Kalaci	bd3a070369	Fixes a bug that prevents upgrades when there are COMPRESSION and DEFAULT columns	2022-06-28 13:36:00 +02:00
aykut-bozkurt	ace800851a	Merge pull request #5946 from citusdata/propagate-vacuum propagate 'vacuum;' to all worker nodes	2022-06-23 15:48:02 +03:00
aykutbozkurt	8194dc4c62	* Added isolation tests for vacuum, * Added more regression tests for more vacuum options, * Fixed deadlock for unqualified vacuum when there is only 1 worker, * Supported lock_skipped for vacuum.	2022-06-23 15:33:14 +03:00
aykutbozkurt	1d6c81245c	fix bug, which is column mismatch of shard tasks when specifying column names for citus tables in vacuum and analyze commands	2022-06-23 15:33:14 +03:00
Aykut Bozkurt	6986f53835	propagate unqualified vacuum and analyze to all worker nodes	2022-06-23 15:33:14 +03:00
Gledis Zeneli	57d9cc1975	Update README.md for handling mitmproxy (#6019 ) Update docs for handling mitmproxy in failure testing.	2022-06-22 14:57:17 +03:00
Marco Slot	57455dc64d	Merge pull request #6012 from citusdata/marcocitus/readme-11 Update README.md for Citus 11	2022-06-17 17:36:01 +02:00
Marco Slot	6c2218f56e	Update README.md for Citus 11	2022-06-17 17:02:19 +02:00
Hanefi Onaldi	26172636c9	Add changelog entries for 11.0.2 (#6007 )	2022-06-16 16:50:49 +03:00
Ahmet Gedemenli	1ee3e8b7f4	Fix creating stats bug when CREATE TABLE LIKE (#6006 )	2022-06-16 12:43:47 +03:00
Jelte Fennema	184c7c0bce	Make enterprise features open source (#6008 ) This PR makes all of the features open source that were previously only available in Citus Enterprise. Features that this adds: 1. Non blocking shard moves/shard rebalancer (`citus.logical_replication_timeout`) 2. Propagation of CREATE/DROP/ALTER ROLE statements 3. Propagation of GRANT statements 4. Propagation of CLUSTER statements 5. Propagation of ALTER DATABASE ... OWNER TO ... 6. Optimization for COPY when loading JSON to avoid double parsing of the JSON object (`citus.skip_jsonb_validation_in_copy`) 7. Support for row level security 8. Support for `pg_dist_authinfo`, which allows storing different authentication options for different users, e.g. you can store passwords or certificates here. 9. Support for `pg_dist_poolinfo`, which allows using connection poolers in between coordinator and workers 10. Tracking distributed query execution times using citus_stat_statements (`citus.stat_statements_max`, `citus.stat_statements_purge_interval`, `citus.stat_statements_track`). This is disabled by default. 11. Blocking tenant_isolation 12. Support for `sslkey` and `sslcert` in `citus.node_conninfo`	2022-06-16 00:23:46 -07:00
Burak Velioglu	e244e9ffb6	Fix dropping temporary view without specifying the explicit schema name (#6003 )	2022-06-15 16:41:12 +02:00
Marco Slot	fb2dea3fc6	Merge pull request #6004 from citusdata/marcocitus/domain-fix Fix bug in unqualified, non-existing DROP DOMAIN IF EXISTS	2022-06-15 14:07:54 +02:00
Marco Slot	ee34e1ed9d	Fix bug in unqualified, non-existing DROP DOMAIN IF EXISTS	2022-06-15 13:59:08 +02:00
Ahmet Gedemenli	268d3fa3a6	Fix materialized view intermediate result filename (#5982 )	2022-06-14 15:07:08 +03:00
Marco Slot	d19d876b5f	Merge pull request #5995 from citusdata/marcocitus/citus-finish-upgrade Introduce a citus_finish_citus_upgrade() procedure	2022-06-13 13:25:33 +02:00
Onder Kalaci	af22a30b48	Use citus_finish_citus_upgrade() in the tests We already have tests relying on citus_finalize_upgrade_to_citus11(). Now, adjust those to rely on citus_finish_citus_upgrade() and always call citus_finish_citus_upgrade().	2022-06-13 13:15:15 +02:00
Marco Slot	36c4ec6d53	Introduce a citus_finish_citus_upgrade() function	2022-06-13 13:15:15 +02:00
Marco Slot	3a32c28714	Merge pull request #5994 from citusdata/clairegiordano-slackbadge	2022-06-03 17:50:46 +02:00
Claire Giordano	f0a19d48ad	Fix the broken link to the Slack badge	2022-06-02 17:30:56 -07:00
Halil Ozan Akgül	e8a09b135d	Merge pull request #5974 from citusdata/fix_undistribute_table_drops_citus_bug Fixes the bug where undistribute can drop Citus extension	2022-05-31 17:29:17 +03:00
Halil Ozan Akgul	b255706189	Fixes the bug where undistribute can drop Citus extension	2022-05-31 16:23:28 +03:00
Önder Kalacı	fc151a9b80	Merge pull request #5979 from citusdata/improve_metadata_sync_disabled fix assertion failure & honor enable_metadata_sync in node operations	2022-05-30 17:01:07 +02:00
Onder Kalaci	89c1ccb7a5	Show that no metadata is sent when disabled	2022-05-30 13:41:06 +02:00
Onder Kalaci	7157152f6c	Do not send metadata changes during add node if citus.enable_metadata_sync is set to false	2022-05-30 13:24:31 +02:00
Onder Kalaci	010a2a408e	Avoid assertion failure on citus_add_node	2022-05-30 12:22:09 +02:00
Gledis Zeneli	beef392f5a	Fix memory error with citus_add_node reported by valgrind test (#5967 ) The error comes due to the datum jsonb in pg_dist_metadata_node.metadata being 0 in some scenarios. This is likely due to not copying the data when receiving a datum from a tuple and pg deciding to deallocate that memory when the table that the tuple was from is closed. Also fix another place in the code that might have been susceptible to this issue. I tested on both multi-vg and multi-1-vg and the tests were successful.	2022-05-28 00:22:00 +03:00
Ahmet Gedemenli	26d927178c	Propagate dependent views upon distribution (#5950 )	2022-05-26 14:23:45 +03:00
jeff-davis	74ce210f8b	Columnar: fix wraparound bug. (#5962 ) columnar_vacuum_rel() now advances relfrozenxid. Fixes #5958.	2022-05-25 07:50:48 -07:00
Burak Velioglu	86781ec85a	Merge pull request #5966 from citusdata/velioglu/fix_depending_view_creation Fix schema and ownership of view while altering depending distributed table	2022-05-24 17:09:20 +03:00
Burak Velioglu	1d7dda991f	Create view and materialized views with right schema and owner while altering the distributed table. To be able to alter a view's owner without enforcing sequential mode, the alter view process functions have been updated to use the metadata connection.	2022-05-24 15:27:30 +03:00
Gledis Zeneli	27ddb4fc8e	Do not obtain AccessShareLock before actual lock (#5965 ) Do not obtain AccessShareLock before acquiring the distributed locks. Acquiring an AccessShareLock ensures that the relations which we are trying to get a distributed lock on will not be dropped in the time between when the LOCK command is issued and the LOCK commands are sent to the worker. However, this also leads to distributed deadlocks in such scenarios: ```sql -- for dist lock acquiring order coor, w1, w2 -- on w2 LOCK t1 IN ACCESS EXCLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- concurrently on w1 LOCK t1 IN ACCESS EXCLUSIVE MODE; -- acquire AccessShareLock locally on t1 to ensure it is not dropped while we get ready to distribute the lock -- acquire dist lock on coor, w1, gets blocked on local AccessShareLock on w2 -- on w2 continuation of the execution above -- starts to acquire dist locks and gets blocked on the coor by the lock acquired by w1 -- distributed deadlock ``` We opt for avoiding such deadlocks at the cost of possibly running into errors when the relations on which we are trying to acquire locks get dropped.	2022-05-23 13:06:38 +03:00
Önder Kalacı	2a768176c4	Merge pull request #5960 from citusdata/parallelize_node_activate Parallelize metadata syncing on node activate	2022-05-23 09:25:11 +02:00
Onder Kalaci	dd02e1755f	Parallelize metadata syncing on node activate It is often useful to be able to sync the metadata in parallel across nodes. Also citus_finalize_upgrade_to_citus11() uses start_metadata_sync_to_primary_nodes() after this commit. Note that this commit does not parallelize all pieces of node activation or metadata syncing. Instead, it tries to parallelize potentially large parts of metadata, which are the objects and distributed tables (in general Citus tables). In the future, it would be nice to sync the reference tables in parallel across nodes. Create ~720 distributed tables / ~23450 shards ```SQL -- declaratively partitioned table CREATE TABLE github_events_looooooooooooooong_name ( event_id bigint, event_type text, event_public boolean, repo_id bigint, payload jsonb, repo jsonb, actor jsonb, org jsonb, created_at timestamp ) PARTITION BY RANGE (created_at); SELECT create_time_partitions( table_name := 'github_events_looooooooooooooong_name', partition_interval := '1 day', end_at := now() + '24 months' ); CREATE INDEX ON github_events_looooooooooooooong_name USING btree (event_id, event_type, event_public, repo_id); SELECT create_distributed_table('github_events_looooooooooooooong_name', 'repo_id'); SET client_min_messages TO ERROR; ``` across 1 node: almost same as expected ```SQL SELECT start_metadata_sync_to_primary_nodes(); Time: 15664.418 ms (00:15.664) select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 14284.069 ms (00:14.284) ``` across 7 nodes: ~3.5x improvement ```SQL SELECT start_metadata_sync_to_primary_nodes(); ┌──────────────────────────────────────┐ │ start_metadata_sync_to_primary_nodes │ ├──────────────────────────────────────┤ │ t │ └──────────────────────────────────────┘ (1 row) Time: 25711.192 ms (00:25.711) -- across 7 nodes select start_metadata_sync_to_node(nodename,nodeport) from pg_dist_node; Time: 82126.075 ms (01:22.126) ```	2022-05-23 09:15:48 +02:00
jeff-davis	a2f5b068e6	Columnar: tighten security and improve visibility. (#5922 ) Move internal storage details to a separate schema with no public access to limit the possibility for information leakage. Create views with public access that show storage details for those columnar tables where the user has ownership privileges. Include mapping between relation ID and storage ID for easier interpretation.	2022-05-20 15:30:31 -07:00
Hanefi Onaldi	52541c5802	Add normalization rules for flaky isolation tests We remove `<waiting ...>` and `<... completed>` outputs for some CREATE INDEX CONCURRENTLY commands since they can cause flakiness in some scenarios. Postgres calls WaitForOlderSnapshots() and this can cause CREATE INDEX CONCURRENTLY commands for shards to get blocked by each other for brief periods of time. The extra waits can pop-up, or they can get completed at different lines in the output files. To remedy that, we rename those indexes to be captured by the new normalization rule.	2022-05-21 00:55:47 +03:00
Ying Xu	a1151c2395	Clear metadatacache during abort for create extension (#5907 ) * Bug fix for bug #5876. Memset MetadataCacheSystem every time there is an abort * Created an ObjectAccessHook that saves the transactionlevel of when citus was created and will clear metadatacache if that transaction level is rolled back. Added additional tests to make sure metadatacache is cleared	2022-05-20 13:47:58 -07:00
Marco Slot	c692bb10f7	Merge pull request #5961 from citusdata/marcocitus/cache-backend-type Add caching for functions that check the backend type	2022-05-20 19:33:43 +02:00
Marco Slot	7abcfac61f	Add caching for functions that check the backend type	2022-05-20 19:02:37 +02:00
Marco Slot	37dda19f31	Merge pull request #5924 from citusdata/marcocitus/nested-execution Improve nested execution checks and add GUC to disable	2022-05-20 19:02:18 +02:00
Marco Slot	09ec366ff5	Improve nested execution checks and add GUC to disable	2022-05-20 18:55:43 +02:00
Marco Slot	e683993449	Fix prepared statement bug when switching from local to remote execution	2022-05-20 18:55:43 +02:00
jeff-davis	a9f8a60007	Columnar: support relation options with ALTER TABLE. (#5935 ) Columnar: support relation options with ALTER TABLE. Use ALTER TABLE ... SET/RESET to specify relation options rather than alter_columnar_table_set() and alter_columnar_table_reset(). Not only is this more ergonomic, but it also allows better integration because it can be treated like DDL on a regular table. For instance, citus can use its own ProcessUtility_hook to distribute the new settings to the shards. DESCRIPTION: Columnar: support relation options with ALTER TABLE.	2022-05-20 08:35:00 -07:00
Marco Slot	16fa0dad85	Merge pull request #5959 from citusdata/marcocitus/run-command-on Allow distributed execution from run_command_on_* functions	2022-05-20 15:35:44 +02:00
Marco Slot	ad5214b50c	Allow distributed execution from run_command_on_* functions	2022-05-20 15:26:47 +02:00
Gledis Zeneli	b1d1df8214	Merge pull request #5938 from citusdata/propagate-lock DESCRIPTION: * Lock statements will propagate locks to all the nodes in the metadata when a citus table or a view is locked. Locks will also be propagated for all citus tables that appear in the definition of views that are locked (recursively applies to the views inside the locked view). * TRUNCATE-ing a citus table from a worker node is no longer allowed if the coordinator is not added to the metadata. When TRUNCATE is called on a citus table, that table needs to be locked in all the nodes before the TRUNCATE is executed. If the coordinator is not in the metadata, the coordinator will not be aware of the lock on the table and can access the table's shard while a TRUNCATE is executing. Despite not being recommended, this behavior can be allowed by setting `SET citus.allow_unsafe_locks_from_workers TO 'on';`. * `pg_dump`, which calls LOCK TABLE, is affected in the same way as TRUNCATE. An error is thrown when locking from a worker when the coordinator is not in the metadata.	2022-05-20 13:08:36 +03:00
gledis69	e5f645a204	Merge branch 'master' into propagate-lock	2022-05-20 12:29:51 +03:00
gledis69	4731630741	Add distributing lock command support	2022-05-20 12:28:07 +03:00
Önder Kalacı	431311732a	Adjust arbitrary configs metadata sync (#5956 ) As of Citus 11, we already sync metadata by default. It is useful to keep one schedule without metadata sync.	2022-05-20 11:02:53 +03:00
Marco Slot	757ecba968	Merge pull request #5941 from citusdata/marcocitus/run_command_on_coordinator Add a run_command_on_coordinator function	2022-05-19 10:32:35 +02:00
Marco Slot	79d7e860e6	Add a run_command_on_coordinator function	2022-05-19 10:26:09 +02:00
Marco Slot	fa9cee409c	Fix downgrade scripts and add new downgrade tests	2022-05-19 10:26:09 +02:00
Ahmet Gedemenli	4fe43c68bc	Merge pull request #5955 from citusdata/fix-rename-sequence-schema-qualifying Fix schemaname qualify for rename seq stmts	2022-05-18 19:43:25 +03:00
Ahmet Gedemenli	48d5c9a1b5	Fix schemaname qualify for rename seq stmts	2022-05-18 19:04:22 +03:00
Önder Kalacı	66378d00da	Merge pull request #5912 from citusdata/relax_disable_node Adds "synchronous" option to citus_disable_node() UDF	2022-05-18 17:34:41 +02:00
Onder Kalaci	0596062f96	Serialize reference table modifications with node changes & restore point With Citus MX enabled, when a reference table is modified, it does some operations on the first worker node(e.g., acquire locks). If node metadata is locked (via add node or create restore point), the changes to the reference tables should be blocked.	2022-05-18 17:23:38 +02:00
Onder Kalaci	127450466e	Do not warn unnecessarily when a node is removed In the past (pre-11), we allowed removing worker nodes that had active placements for replicated distributed tables, without even checking if there are any other replicas of the same placement. However, with #5469, we prevent disabling nodes via a hard error when the node holds the last active placement of a shard, as we do for reference tables. Note that otherwise, we'd allow users to lose data. As of today, the NOTICE is completely irrelevant.	2022-05-18 17:23:38 +02:00
Onder Kalaci	b4dbd84743	Prevent distributed queries while disabling first worker node First worker node has a special meaning for modifications on the replicated tables. It is used to acquire a remote lock, such that the modifications are serialized. With this commit, we make sure that we do not let any distributed query see a different 'first worker node' while first worker node is disabled. Note that, as implicitly mentioned above, when first worker node is disabled, the first worker node changes, that's why we have to handle the situation.	2022-05-18 17:21:12 +02:00
Onder Kalaci	db998b3d66	Adds "sync" option to citus_disable_node() UDF Before this commit, we had: ```SQL SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false) ``` Where we allow forcing the first worker node to be disabled with `force:=true`. However, it entails the risk of losing data / diverging placement data etc. With the `force` flag, we control disabling the first worker node, and with the `sync` flag we control whether the changes are done via bg worker or immediately. ```SQL SELECT citus_disable_node(nodename, nodeport, force boolean DEFAULT false, sync boolean DEFAULT false) ``` Where we can achieve all the following: \| Mode \| Data loss possibility \| Can run in 2PC \| Handle multiple node failures \| Immediately effective \| \| --- \|--- \|--- \|--- \|--- \| \| force:false, sync: false \| false \| true \| true \| false \| \| force:false, sync: true \| false \| false \| false \| true \| \| force:true, sync: false \| true \| true \| true \| false \| \| force:true, sync: true \| false \| false \| false \| true \|	2022-05-18 17:21:12 +02:00
Önder Kalacı	69d007deec	Merge pull request #5940 from citusdata/index_name Fixes a bug that prevents dropping/altering indexes	2022-05-18 17:20:00 +02:00
Onder Kalaci	2cc4053fc1	Fixes a bug that prevents dropping/altering indexes There are two problems in this area. First, when there are expressions on the index name, we should call `transformIndexExpression()` before generating the index name. That is what Postgres does. Second, because of `40c24bfef9` PG 13 and PG 14 generates different names for indexes with function calls even for local PG tables. Assume we have: ```SQL create table t(id int); select create_distributed_table('t', 'id'); create index ON t (my_very_boring_function(id)); ``` On PG 13, the name of the index is `t_expr_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_expr_idx" btree (my_very_boring_function(id::bigint)) ``` On PG 14, the name of the index is `t_my_very_boring_function_idx` ```SQL \d t Table "public.t" ┌────────┬─────────┬───────────┬──────────┬─────────┐ │ Column │ Type │ Collation │ Nullable │ Default │ ├────────┼─────────┼───────────┼──────────┼─────────┤ │ id │ integer │ │ │ │ └────────┴─────────┴───────────┴──────────┴─────────┘ Indexes: "t_my_very_boring_function_idx" btree (my_very_boring_function(id::bigint)) ``` The second issue is not very critical. The important part is that we adjust regression tests to drop all the indexes, which ensures the index names are sane on any version.	2022-05-18 16:35:17 +02:00
Nils Dijk	e25a5d7837	Merge pull request #5931 from citusdata/refactor/dedupe-object-propagation Refactor: reduce complexity and code duplication for Object Propagation	2022-05-18 16:30:31 +02:00
Nils Dijk	b71a08955a	Refactor: reduce complexity and code duplication for Object Propagation Over time we have significantly improved the support for objects to be propagated by Citus as to make scaling out the database more seamless. It became evident that there was a lot of code duplication that got into the codebase to implement the propagation. This PR tries to reduce the amount of repeated code that is at most only slightly different. To make things worse, most of the differences were actually oversights instead of correct. This patch introduces 3 reusable sets of pre/post processing steps for respectively - create - alter - drop With the use of the common functionality we should have more coherent behaviour between the different objects supported by Citus. Some steps either omit the Pre or Post processing step if they would not make sense to include. All tests pass, only 1 test needed changing, foreign servers, as the dropping of foreign servers didn't implement support for dropping multiple foreign servers at once. Given the common approach correctly supports dropping of multiple objects, either distributed or not, the test that assumed it wouldn't work was now obsolete.	2022-05-18 15:58:28 +02:00
Önder Kalacı	b04222155d	Merge pull request #5923 from citusdata/update_view Mark existing views as distributed when upgrade to 11.0+	2022-05-18 15:50:29 +02:00
Onder Kalaci	ee45e7bfbf	Mark existing views as distributed when upgrade to 11.0+ We have a mechanism which ensures that newly distributed objects are recorded during `alter extension citus update`. However, the logic was lacking "view"s. With this commit, we make sure that existing views are also marked as distributed during upgrade.	2022-05-18 15:43:17 +02:00
Nils Dijk	14c6c799f2	suppress notices when more dependencies are found (#5954 ) We are nearing 100 objects being propagated in `master_copy_shard_placement` and with the extra supported objects this gets pushed over 100 objects. When 100 objects are reached for propagation a notice will be shown to the user, informing them it might take a while to finish the operation. During testing this is not important to see. Since the message contains the exact number of objects to be propagated the tests become very unstable when merging community into enterprise. This change makes sure the test output stays stable.	2022-05-18 14:31:10 +03:00
Hanefi Onaldi	313104ab9b	Grep logs for deterministic global_cancel test results (#5948 )	2022-05-18 11:09:54 +03:00
Halil Ozan Akgül	f8450065f0	Merge pull request #5951 from citusdata/revert_colocation_fix Revert "Creates new colocation for colocate_with:='none' too"	2022-05-17 16:44:31 +03:00
Halil Ozan Akgul	d171a736ab	Revert "Creates new colocation for colocate_with:='none' too" This reverts commit `f74447b3b7`.	2022-05-17 15:32:22 +03:00
Ahmet Gedemenli	aa8f46ead0	Fix schema name bug for sequences (#5937 )	2022-05-16 18:11:57 +03:00
Halil Ozan Akgül	187d06c3b5	Merge pull request #4797 from citusdata/new-record-for-every-colocation Creates new colocation for colocate_with:='none' too	2022-05-16 14:35:08 +03:00
Halil Ozan Akgul	f74447b3b7	Creates new colocation for colocate_with:='none' too	2022-05-16 13:39:05 +03:00
Teja Mupparti	e56fc34404	Fixes: #5787 In prepared statements, map any unused parameters to a generic type.	2022-05-13 19:31:05 -07:00
Burak Velioglu	544e6c7428	Merge pull request #5914 from citusdata/velioglu/alter_view_propagation Introduce alter view propagation	2022-05-13 13:34:25 +03:00
Burak Velioglu	1875516ae9	Add ALTER VIEW support Adds support for propagation ALTER VIEW commands to - Change owner of view - SET/RESET option - Rename view and view's column name - Change schema of the view Since PG also supports targeting views with ALTER TABLE commands, related code also added to direct such ALTER TABLE commands to ALTER VIEW commands while sending them to workers.	2022-05-13 13:21:53 +03:00
Marco Slot	1f17fa8b63	Merge pull request #5888 from citusdata/marcocitus/is-coordinator	2022-05-13 10:18:23 +02:00
Marco Slot	6fad5dc207	Add a citus_is_coordinator function	2022-05-13 10:02:52 +02:00
Ahmet Gedemenli	613d9c0dca	Merge pull request #5934 from citusdata/fix-alter-statistics-nspname Fix alter statistics namespace name	2022-05-11 19:07:59 +03:00
Ahmet Gedemenli	00e0f4d8e6	Fix alter statistics namespace name	2022-05-11 18:44:37 +03:00
Gledis Zeneli	4c6f62efc6	Switch to using LOCK instead of lock_relation_if_exists in TRUNCATE (#5930 ) Breaking down #5899 into smaller PR-s This particular PR changes the way TRUNCATE acquires distributed locks on the relations it is truncating to use the LOCK command instead of lock_relation_if_exists. This has the benefit of using pg's recursive locking logic it implements for the LOCK command instead of us having to resolve relation dependencies and lock them explicitly. While this does not directly affect truncate, it will allow us to generalize this locking logic to then lock different relations where the pg recursive locking will become useful (e.g. locking views). This implementation is a bit more complex than it needs to be due to pg not supporting locking foreign tables. We can however, still lock foreign tables with lock_relation_if_exists. So for a command: TRUNCATE dist_table_1, dist_table_2, foreign_table_1, foreign_table_2, dist_table_3; We generate and send the following command to all the workers in metadata: ```sql SET citus.enable_ddl_propagation TO FALSE; LOCK dist_table_1, dist_table_2 IN ACCESS EXCLUSIVE MODE; SELECT lock_relation_if_exists('foreign_table_1', 'ACCESS EXCLUSIVE'); SELECT lock_relation_if_exists('foreign_table_2', 'ACCESS EXCLUSIVE'); LOCK dist_table_3 IN ACCESS EXCLUSIVE MODE; SET citus.enable_ddl_propagation TO TRUE; ``` Note that we need to alternate between the lock command and lock_relation_if_exists in order to preserve the TRUNCATE order of relations. When pg supports locking foreign tables, we will be able to massively simplify this logic and send a single LOCK command.	2022-05-11 18:38:48 +03:00
Burak Velioglu	f11d851ef7	Merge pull request #5889 from citusdata/velioglu/view_propagation Introduce CREATE/DROP VIEW	2022-05-10 14:25:06 +03:00
Burak Velioglu	1460452442	Introduce CREATE/DROP VIEW Adds support for propagating create/drop view commands and views to worker node while scaling out the cluster. Since views are dropped while converting the table type, metadata connection will be used while propagating view commands to not switch to sequential mode.	2022-05-10 13:07:14 +03:00
Burak Velioglu	a2158794bd	Merge pull request #5926 from citusdata/velioglu/syncMetadataViaObject Use object address instead of relation id on DDLJob to decide on syncing metadata	2022-05-06 15:43:49 +03:00
Burak Velioglu	06a94d167e	Use object address instead of relation id on DDLJob to decide on syncing metadata	2022-05-05 17:59:44 +03:00
Önder Kalacı	63f229928f	Merge pull request #5925 from citusdata/use_less_mem Refrain reading the metadata cache for all tables during upgrade	2022-05-05 09:07:47 +02:00
Onder Kalaci	f193e16a01	Refrain reading the metadata cache for all tables during upgrade First, it is not needed. Second, in the past we had issues regarding this: https://github.com/citusdata/citus/pull/4344 When I create 10k tables, ~120K shards, this saves 40 MB of memory during ALTER EXTENSION citus UPDATE. Before the change: MetadataCacheMemoryContext: 41943040 ~ 40MB After the change: MetadataCacheMemoryContext: 8192	2022-05-04 16:44:06 +02:00
Marco Slot	0e1e2275f0	Merge pull request #5920 from citusdata/marcocitus/show-shards-guc	2022-05-03 15:00:09 +02:00
Marco Slot	ceb593c9da	Convert citus.hide_shards_from_app_name_prefixes to citus.show_shards_for_app_name_prefixes	2022-05-03 14:22:13 +02:00
Jeff Davis	3e1180de78	PG15: handle extra argument to parse_analyze_varparams(). From PG commit 25751f54b8.	2022-05-02 10:12:03 -07:00
Jeff Davis	b6a5617ea8	PG15: handle pg_analyze_and_rewrite_* renaming. From PG commit 791b1b71da.	2022-05-02 10:12:03 -07:00
Jeff Davis	33ee4877d4	PG15: rename pgstat_initstats() -> pgstat_init_relation(). From PG commits bff258a273 and be902e2651.	2022-05-02 10:12:03 -07:00
Jeff Davis	033f9cfff7	PG15: update copied pg_get_object_address() code. Account for PG commits 5a2832465fd8 and a0ffa885e478.	2022-05-02 10:12:03 -07:00
Jeff Davis	bd455f42e3	PG15: handle change to SeqScan structure. Account for PG commit 2226b4189b. The one site dependent on it can do just as well with a Scan instead of a SeqScan.	2022-05-02 10:12:03 -07:00
Jeff Davis	3799f95742	PG15: Value -> String, Integer, Float. Handle PG commit 639a86e36a.	2022-05-02 10:12:03 -07:00
Jeff Davis	26f5e20580	PG15: update integer parsing APIs. Account for PG commits 3c6f8c011f and cfc7191dfe.	2022-05-02 10:12:03 -07:00
Jeff Davis	70c915a0f2	PG15: Handle data type changes in pg_collation. Account for PG commit 54637508f8.	2022-05-02 10:12:03 -07:00
Jeff Davis	9915fe8a1a	PG15: Handle different ways to get publication actions. Account for PG commit 52e4f0cd47.	2022-05-02 10:12:03 -07:00
Jeff Davis	1c1ef7ab8d	PG15: Handle extra argument to RelationCreateStorage. Account for PG commit 9c08aea6a309. Introduce RelationCreateStorage_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	ac952b2cc2	PG15: Handle extra argument to ExecARDeleteTriggers. Account for PG commit ba9a7e3921. Introduce ExecARDeleteTriggers_compat.	2022-05-02 10:12:03 -07:00
Jeff Davis	f944722c6a	PG15: Use RelationGetSmgr() instead of RelationOpenSmgr(). Handle PG commit f10f0ae420.	2022-05-02 10:12:03 -07:00
Hanefi Onaldi	518fb0873e	Introduce one new alternative text output to fix flakiness (#5913 ) Here is a flaky test output that is quite hard to fix: ```diff diff -dU10 -w /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out /home/circleci/project/src/test/regress/results/isolation_master_update_node.out --- /home/circleci/project/src/test/regress/expected/isolation_master_update_node_1.out.modified 2022-03-21 19:03:54.237042562 +0000 +++ /home/circleci/project/src/test/regress/results/isolation_master_update_node.out.modified 2022-03-21 19:03:54.257043084 +0000 @@ -49,18 +49,20 @@ <waiting ...> step s2-update-node-1-force: <... completed> master_update_node ------------------ (1 row) step s2-abort: ABORT; step s1-abort: ABORT; FATAL: terminating connection due to administrator command -SSL connection has been closed unexpectedly +server closed the connection unexpectedly + This probably means the server terminated abnormally + before or while processing the request. ``` I could not come up with a solution that would decrease the flakiness in the test outputs. We already have 3 output files for the same test and now I introduced a 4th one. I can also add complex regular expressions that span multiple lines, and normalize these error messages. Feel free to suggest a normalized error message in a comment here. ## Current alternative file contents `isolation_master_update_node.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` `isolation_master_update_node_0.out` ``` step s1-abort: ABORT; WARNING: this step had a leftover error message FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. 
``` `isolation_master_update_node_1.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command SSL connection has been closed unexpectedly ``` new file: `isolation_master_update_node_2.out` ``` step s1-abort: ABORT; FATAL: terminating connection due to administrator command server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. ```	2022-04-28 16:52:02 +03:00
Önder Kalacı	1f6783b526	Merge pull request #5906 from citusdata/relax_disable_node Do not set coordinator's metadatasynced column to false	2022-04-25 09:31:40 +02:00
Onder Kalaci	5fc7661169	Do not set coordinator's metadatasynced column to false After a disable_node	2022-04-25 09:25:59 +02:00
Önder Kalacı	f938f393e5	Merge pull request #5901 from citusdata/fix_local_procs_deadlocks Do not assign distributed transaction ids for local execution	2022-04-25 09:25:24 +02:00
Onder Kalaci	a2debe0f02	Do not assign distributed transaction ids for local execution In the past, for all modifications on the local execution, we enabled 2PC (with `6a7ed7b309`). This also required us to enable coordinated transactions via https://github.com/citusdata/citus/pull/4831 . However, it does have a very substantial impact on the distributed deadlock detection. The distributed deadlock detection is designed to avoid single-statement transactions because they cannot lead to any actual deadlocks. The implementation skips backends that have no distributed transaction assigned. Now that we include single-statement local executions in the lock graphs, we are conflicting with the design of distributed deadlock detection. In general, we should fix it. However, one might think that it is not a big deal: even if the processes show up in the lock graphs, the deadlock detection should not be causing any false positives. That is false, unless https://github.com/citusdata/citus/issues/1803 is fixed. Now that local processes are considered as a single distributed backend, the lock graphs might find: local execution 1 [tx id: 1] -> any local process [tx id: 0] any local process [tx id: 0] -> local execution 2 [tx id: 2] And decide that there is a distributed deadlock. This commit: (a) is the right thing to do, as local execution should not need any distributed tx id (b) eliminates performance issues that might come up when deadlock detection does a lot of unnecessary checks (c) after moving local execution after the remote execution via https://github.com/citusdata/citus/pull/4301, the vague requirement for assigning distributed tx ids is already gone.	2022-04-13 13:25:12 +02:00
Hanefi Onaldi	6254f30305	Add arbitrary config tests for function DDL statements (#5885 )	2022-04-12 16:03:10 +03:00
Önder Kalacı	dd78c81378	Fix flaky isolation - 1 (#5900 ) * Do not show any PG internal queries	2022-04-11 20:43:51 -07:00
Hanefi Onaldi	f34fc37478	Merge pull request #5895 from citusdata/changelog-updates	2022-04-11 16:03:15 +03:00
Hanefi Onaldi	3ec1fc48fc	Add changelog entries for 11.0.1_beta	2022-04-11 14:06:25 +03:00
Burak Velioglu	31df111ecb	Merge pull request #5893 from citusdata/velioglu/fix_function_in_tx Create function in transaction according to create object propagation guc	2022-04-08 17:51:41 +03:00
Burak Velioglu	5d9599f964	Create function in transaction according to create object propagation guc	2022-04-08 17:15:31 +03:00
Nils Dijk	31493288de	Merge pull request #5764 from citusdata/feature/domain-type Feature: propagate DOMAIN objects	2022-04-08 16:14:18 +02:00
Nils Dijk	8897361f95	Implement DOMAIN propagation for citus	2022-04-08 15:25:39 +02:00
Jelte Fennema	6d8c5931d6	Work around flaky test related to search_path (#5894 ) For some reason search_path is not always set correctly on the worker when calling a distributed function, this shows up when calling `insert_document` in our distributed_triggers test. The underlying reason is currently unknown and warrants deeper investigation. Currently this test is one of the main causes for random CI failures. So this change sets the search_path of each function explicitly, to reduce these failures. So other devs can be more efficient, while I continue investigating the root cause of this issue. Also changes explicit `SET citus.enable_unsafe_triggers = false` to `RESET citus.enable_unsafe_triggers` in passing.	2022-04-08 16:09:33 +03:00
Önder Kalacı	78d8561b59	Merge pull request #5880 from citusdata/rename_metadata_func Rename metadata sync to node metadata sync where applicable	2022-04-07 21:53:55 +02:00
Onder Kalaci	b0b91bab04	Rename metadata sync to node metadata sync where applicable	2022-04-07 17:51:31 +02:00
Marco Slot	54f97c7b2b	Merge pull request #5879 from citusdata/marcocitus/fix-unique-index	2022-04-07 16:28:04 +02:00
Marco Slot	2304815356	Allow adding a unique constraint with an index	2022-04-07 16:00:31 +02:00
Marco Slot	67fdecfcb0	Merge pull request #5882 from citusdata/marcocitus/fix-explain	2022-04-07 15:59:44 +02:00
Marco Slot	c0827703ec	Fix EXPLAIN ANALYZE JSON format for subplans	2022-04-07 11:38:20 +02:00
Marco Slot	f9bbcb8840	Merge pull request #5883 from citusdata/marcocitus/fix-explain-parameters	2022-04-07 11:36:47 +02:00
Marco Slot	544dce919a	Handle user-defined type parameters in EXPLAIN ANALYZE	2022-04-07 11:14:32 +02:00
Hanefi Onaldi	b092a1a496	Use consistent naming for tap test workflows in CI config (#5874 ) All the job names in our CI config are of the form `test-1[34]_.*` except for tap-recovery tests.	2022-04-05 05:53:55 +02:00
Marco Slot	b69713937d	Merge pull request #5878 from citusdata/marcocitus/remove-old-repartitioning	2022-04-04 20:18:38 +02:00
Marco Slot	9476f377b5	Remove old re-partitioning functions	2022-04-04 18:11:52 +02:00
Marco Slot	b511e28e80	Merge pull request #5875 from citusdata/marcocitus/tablesample	2022-04-01 16:47:03 +02:00
Marco Slot	8c8c3b665d	Add TABLESAMPLE support	2022-04-01 15:51:40 +02:00
Ahmet Gedemenli	5374c771a9	Merge pull request #5872 from citusdata/schema-tests-arbitrary Add schema tests to arbitrary configs	2022-04-01 16:17:13 +03:00
Ahmet Gedemenli	a62de6494d	Add schema tests to arbitrary configs	2022-04-01 13:57:17 +03:00
jeff-davis	c485a04139	Separate build of citus.so and citus_columnar.so. (#5805 ) * Separate build of citus.so and citus_columnar.so. Because columnar code is statically-linked to both modules, it doesn't make sense to load them both at once. A subsequent commit will make the modules entirely separate and allow loading them both simultaneously. Author: Yanwen Jin * Separate citus and citus_columnar modules. Now the modules are independent. Columnar can be loaded by itself, or along with citus. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2022-03-31 19:47:17 -07:00
Gledis Zeneli	c9aab7fb8b	Add TRUNCATE arbitrary config tests (#5848 ) Adds TRUNCATE arbitrary config tests. Also adds the ability to skip tests from particular configs.	2022-03-31 14:14:47 +03:00
Önder Kalacı	a0a2e80c78	Merge pull request #5869 from citusdata/do_not_hide_shards_from_pg Only hide shards from client backends or non-pg background workers	2022-03-30 17:43:22 +02:00
Onder Kalaci	9043a1ed3f	Only hide shards from client backends and pg bg workers The aim of hiding shards is to hide shards from client applications. Certain bg workers (such as pg_cron or Citus maintenance daemon) should be treated like client applications because users can run queries from such bg workers. And, these bg workers should follow the same application_name checks as client backends. Certain other bg workers, such as logical replication or postgres' parallel workers, should never hide shards. They are internal operations. Similarly the other backend types like the walsender or checkpointer or autovacuum should never hide shards.	2022-03-30 16:56:12 +02:00
Ahmet Gedemenli	b94422d5cf	Merge pull request #5867 from citusdata/arbitrary-configs-view-test Add view tests to arbitrary configs	2022-03-30 16:17:31 +03:00
Ahmet Gedemenli	f74d3eedc8	Add tests for materialized views	2022-03-30 16:01:11 +03:00
Ahmet Gedemenli	8ef2da8192	Add view tests to arbitrary configs	2022-03-30 12:28:31 +03:00
Önder Kalacı	670fae99f7	Add tests with function dependencies on tables (#5866 ) We are not sure if we have such tests, but lets add anyway	2022-03-29 18:04:07 +03:00
Ahmet Gedemenli	1e1e66eeed	Add index tests to arbitrary configs (#5862 )	2022-03-29 13:49:05 +03:00
Ahmet Gedemenli	b52823f8b4	Fix typo in error message for truncating foreign tables (#5864 )	2022-03-29 13:14:16 +03:00
Önder Kalacı	6cb04c0d84	Merge pull request #5865 from citusdata/add_tests_check_mx add missing check_mx for the schedules	2022-03-29 11:33:42 +02:00
Onder Kalaci	23ff095905	add missing check_mx	2022-03-29 10:35:12 +02:00
Hanefi Onaldi	7dc0a94293	Merge pull request #5852 from citusdata/citus-11.0.0-changelog-1647961698	2022-03-24 16:15:54 +03:00
Hanefi Onaldi	36ca2639f0	Add changelog entry for 11.0.0	2022-03-24 13:48:09 +03:00
Ahmet Gedemenli	6300b86f8a	Merge pull request #5842 from citusdata/drop-pg12-support Drop PG12 support	2022-03-24 02:31:28 +03:00
Ahmet Gedemenli	42c46a0824	Drop PG12 support	2022-03-23 18:16:04 +03:00
Halil Ozan Akgül	40f3fbbc62	Merge pull request #5819 from citusdata/turn_metadata_sync_on_in_arbitrary_tests Refactor arbitrary configs to make MX more explicit	2022-03-23 17:44:47 +03:00
Halil Ozan Akgul	c843ebe48e	Turn metadata sync on in arbitrary config tests	2022-03-23 15:19:52 +03:00
Jelte Fennema	3a44fa827a	Add versions of forboth that don't need ListCell (#5856 ) We've had custom versions of Postgres' `foreach` macro with a hidden ListCell for quite some time now. People like these custom macros, because they are easier to use and require less boilerplate. This adds similar custom versions of Postgres' `forboth` macro. Now you don't need ListCells anymore when looping over two lists at the same time.	2022-03-23 14:50:36 +03:00
Ahmet Gedemenli	b5448e43e3	Fix aggregate signature bug (#5854 )	2022-03-23 13:42:03 +03:00
Burak Velioglu	0c8aca7c5e	Merge pull request #5849 from citusdata/velioglu/support_func Add support for deparsing ALTER FUNCTION ... SUPPORT ... commands	2022-03-22 22:23:40 +03:00
Burak Velioglu	db9f0d926c	Add support for deparsing ALTER FUNCTION ... SUPPORT ... commands	2022-03-22 21:55:55 +03:00
Önder Kalacı	0ba334626b	Merge pull request #5851 from citusdata/remove_cte_inline Remove citus.enable_cte_inlining GUC	2022-03-22 19:09:24 +01:00
Onder Kalaci	af4ba3eb1f	Remove citus.enable_cte_inlining GUC In Postgres 12+, users can adjust whether to inline/not inline CTEs by [NOT] MATERIALIZED keywords. So, this GUC is already useless.	2022-03-22 17:14:44 +01:00
Halil Ozan Akgül	37aefec537	Merge pull request #5847 from citusdata/alter_collation_encoding_does_not_exist_bug Fixes ALTER COLLATION encoding does not exist bug	2022-03-22 18:30:49 +03:00
Halil Ozan Akgul	4690c42121	Fixes ALTER COLLATION encoding does not exist bug	2022-03-22 17:42:20 +03:00
Marco Slot	9c7fde92b6	Merge pull request #5840 from citusdata/marcocitus/fix-repartition	2022-03-22 13:53:52 +01:00
Marco Slot	32c23c2775	Disallow re-partition joins when no hash function defined	2022-03-22 13:42:53 +01:00
Onur Tirtir	11f246785e	Merge pull request #5836 from citusdata/fix/cannot-distribute-tmp-schemas	2022-03-22 15:30:31 +03:00
Onur Tirtir	11433ed357	Create DDL job for create enum command in postprocess as we do for composite types Since now we don't throw an error for enums that user attempts creating in temp schema, the preprocess / DDL job that contains the prepared statement (to idempotently create the enum type) gets executed. As a result, we were emitting the following warning because of the error the underlying worker connection throws: ```sql WARNING: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx WARNING: connection to the remote node localhost:xxxxx failed with the following error: another command is already in progress ERROR: cannot PREPARE a transaction that has operated on temporary objects CONTEXT: while executing command on localhost:xxxxx ```	2022-03-22 15:09:23 +03:00
Onur Tirtir	dc31102630	Locally create objects having a dependency that we cannot distribute We were already doing so for functions & types believing that this cannot be the case for other object types. However, as in #5830, we cannot distribute an object that user attempts creating in temp schema. Even more, this doesn't only apply to functions and types but also to many other object types. So with this commit, we teach preprocess/postprocess functions (that need to create dependencies on worker nodes) how to skip trying to distribute such objects. We also start identifying temp schemas as the objects that we don't know how to propagate to worker nodes so that we can simply create objects locally if user attempts creating them in a temp schema. There are 36 callers of `EnsureDependenciesExistOnAllNodes` in the codebase atm and for the most we still need to throw a hard error (i.e.: not use `DeferErrorIfHasUnsupportedDependency` beforehand), such as: i) user explicitly wants to create a distributed object * CreateCitusLocalTable * CreateDistributedTable * master_create_worker_shards * master_create_empty_shard * create_distributed_function * EnsureExtensionFunctionCanBeDistributed ii) we don't want to skip altering distributed table on worker nodes * PostprocessIndexStmt * PostprocessCreateTriggerStmt * PostprocessCreateStatisticsStmt iii) object is already distributed / handled by Citus before, so we aren't okay with not propagating the ALTER command * PostprocessAlterTableSchemaStmt * PostprocessAlterCollationOwnerStmt * PostprocessAlterCollationSchemaStmt * PostprocessAlterDatabaseOwnerStmt * PostprocessAlterExtensionSchemaStmt * PostprocessAlterFunctionOwnerStmt * PostprocessAlterFunctionSchemaStmt * PostprocessAlterSequenceOwnerStmt * PostprocessAlterSequenceSchemaStmt * PostprocessAlterStatisticsSchemaStmt * PostprocessAlterStatisticsOwnerStmt * PostprocessAlterTextSearchConfigurationSchemaStmt * PostprocessAlterTextSearchDictionarySchemaStmt * 
PostprocessAlterTextSearchConfigurationOwnerStmt * PostprocessAlterTextSearchDictionaryOwnerStmt * PostprocessAlterTypeSchemaStmt * PostprocessAlterForeignServerOwnerStmt iv) we already cannot create those objects in temp schemas, so skipping for now * PostprocessCreateExtensionStmt * PostprocessCreateForeignServerStmt Also note that there are 3 more callers of `EnsureDependenciesExistOnAllNodes` in enterprise in addition to those 36 but we don't need to do anything specific about them due to the same reasoning given in iii).	2022-03-22 15:09:23 +03:00
Halil Ozan Akgül	001551d732	Merge pull request #5837 from citusdata/underscore_type_name Fixes the type names that start with underscore bug	2022-03-22 14:35:55 +03:00
Halil Ozan Akgul	50bace9cfb	Fixes the type names that start with underscore bug	2022-03-22 14:24:30 +03:00
Halil Ozan Akgül	98d1018966	Merge pull request #5823 from citusdata/citus_coordinator_node_id Introduces citus_coordinator_nodeid	2022-03-22 14:23:46 +03:00
Halil Ozan Akgul	4dbc760603	Introduces citus_coordinator_node_id	2022-03-22 10:34:22 +03:00
Hanefi Onaldi	9f204600af	Allow all possible option types for text search objects (#5838 )	2022-03-21 20:01:53 +01:00
Halil Ozan Akgül	6c05e4b35c	Add check_mx to operations schedule (#5818 )	2022-03-21 19:09:26 +03:00
Burak Velioglu	83b0e98595	Merge pull request #5835 from citusdata/velioglu/poly_aggregate Add support for zero-argument polymorphic aggregates	2022-03-21 16:22:50 +03:00
Burak Velioglu	d4625ec6a1	Add support for zero-argument polymorphic aggregates	2022-03-21 16:10:40 +03:00
Ahmet Gedemenli	46c6630328	Qualify CREATE AGGREGATE stmts in Preprocess (#5834 )	2022-03-21 13:55:09 +03:00
Hanefi Onaldi	c18c63a930	Merge pull request #5810 from citusdata/changelog-updates	2022-03-19 01:19:53 +03:00
Hanefi Onaldi	44546e9234	Add changelog entries for 10.2.5	2022-03-18 19:48:38 +03:00
Burak Velioglu	2994b10024	Merge pull request #5827 from citusdata/velioglu/local_object_creation Create type locally if it has undistributable dependency	2022-03-18 18:36:51 +03:00
Burak Velioglu	2c2064bf36	Create type locally if it has undistributable dependency	2022-03-18 18:23:32 +03:00
Marco Slot	0944c631f1	Merge pull request #5824 from citusdata/marcocitus/multi-query-transaction	2022-03-18 16:20:27 +01:00
Marco Slot	055bbd6212	Use coordinated transaction when there are multiple queries per task	2022-03-18 15:04:27 +01:00
Marco Slot	a1108d2a20	Merge pull request #5828 from citusdata/marcocitus/fix-lock	2022-03-18 14:46:38 +01:00
Marco Slot	cab243218d	Avoid locks in relation_is_a_known_shard	2022-03-18 14:37:39 +01:00
Marco Slot	eeafa67bea	Merge pull request #5816 from citusdata/marcocitus/fix-type	2022-03-17 13:50:41 +01:00
Marco Slot	5bb5359da0	Fix worker node version check	2022-03-17 13:23:02 +01:00
Marco Slot	22a18fc1f2	Fix typo in upgrade function	2022-03-17 13:23:02 +01:00
Jelte Fennema	68bfc8d1c0	Use good initdb options in arbitrary configs tests (#5802 ) In `pg_regress_multi.pl` we're running `initdb` with some options that the `common.py` `initdb` is currently not using. All these flags seem reasonable, so this brings `common.py` in line with `pg_regress_multi.pl`. In passing change the `--nosync` flag to `--no-sync`, since that's what the PG documentation lists as the official option name (but both work).	2022-03-17 13:22:23 +01:00
Jelte Fennema	b0e406a478	Disable ddl propagation when creating users in arbitrary config tests (#5814 ) This should help with failing enterprise tests.	2022-03-16 15:12:20 +01:00
Ahmet Gedemenli	eddfea18c2	Fix role creation issue on schema tests (#5812 )	2022-03-16 13:49:28 +01:00
Burak Velioglu	e21980fb89	Merge pull request #5809 from citusdata/velioglu/unmark_deps_on_worker Drop distributed table on worker with ProcessUtilityParseTree	2022-03-15 17:59:22 +03:00
Burak Velioglu	333c73a53c	Drop distributed table on worker with ProcessUtilityParseTree	2022-03-15 17:42:01 +03:00
Gledis Zeneli	56ab64b747	Patches #5758 with some more error checks (#5804 ) Add error checks to detect failed connection and don't ping secondary nodes to detect self reference.	2022-03-15 15:02:47 +03:00
Marco Slot	5fd0ec7dab	Merge pull request #5801 from citusdata/marcocitus/row-share-lock	2022-03-15 11:00:04 +01:00
Hanefi Onaldi	c0cd8f3d56	Wait until metadata sync before testing distributed sequences	2022-03-15 10:28:51 +01:00
Marco Slot	e42a798707	Always use RowShareLock in pg_dist_node when syncing metadata	2022-03-15 10:28:51 +01:00
Ahmet Gedemenli	36b33e2491	Add sequence tests to arbitrary config (#5771)	2022-03-14 19:16:24 +03:00
Jelte Fennema	41c6393e82	Parallelize cluster setup in arbitrary config tests (#5738) Cluster setup time is significant in arbitrary configs. We can parallelize this a bit more. Runtime of the following command decreases from ~25 seconds to ~22 seconds on my machine with this change: ``` make -C src/test/regress/ check-arbitrary-base CONFIGS=CitusDefaultClusterConfig EXTRA_TESTS=prepared_statements_1 ``` Currently we can only run different configs in parallel. However, when working on a feature or trying to fix a bug this is not important. In those cases you simply want to run a single test file on a single config, and you want to run it every time you make a code change that you think fixes the issue. This PR allows running bash commands in parallel, so `initdb` and `pg_ctl start` are run in parallel for all nodes in the cluster instead of one waiting for the other. Before this PR, nothing ran in parallel when you ran the above command. After this PR, cluster setup runs in parallel.	2022-03-14 16:42:20 +01:00
Jelte Fennema	5063257252	Disable fsync in arbitrary config tests (#5800) We already have fsync disabled for regular tests in `pg_regress_multi.pl`. This does the same for the arbitrary config tests. On my machine this changes the runtime of the following command from ~37 to ~25 seconds: ```bash make -C src/test/regress/ check-arbitrary-configs CONFIGS=CitusDefaultClusterConfig ```	2022-03-14 18:12:38 +03:00
Marco Slot	5d07273ca6	Merge pull request #5466 from citusdata/improve_error_msgs	2022-03-14 15:23:41 +01:00
Onder Kalaci	338752d96e	Guard against hard wait event set errors Similar to https://github.com/citusdata/citus/pull/5158, but this time instead of the executor, use this in all the remaining places.	2022-03-14 14:35:56 +01:00
Onder Kalaci	953951007c	Move wait event error checks to connection manager	2022-03-14 14:35:56 +01:00
Onur Tirtir	216b9b5b7a	Fix an incorrect error message related with fkeys between replicated dist tables (#5796 ) This is not supported in enterprise too.	2022-03-14 14:34:09 +01:00
Hanefi Onaldi	b24e1dfccc	Propagate text search commands to all worker nodes (#5797 ) Here is a list of some functions, and the `TargetWorkerSet` parameters they supply to `NodeDDLTaskList`: PostprocessCreateTextSearchConfigurationStmt - NON_COORDINATOR_NODES PreprocessDropTextSearchConfigurationStmt - NON_COORDINATOR_METADATA_NODES PreprocessAlterTextSearchConfigurationSchemaStmt - NON_COORDINATOR_METADATA_NODES I guess this means that, if metadata syncing is disabled on the node, we may have some issues. Consider the following: Let's assume the user has metadata syncing disabled. 2 workers. `CREATE TEXT SEARCH CONFIGURATION ...` will get propagated to all workers. `ALTER ... CONFIGURATION ...` will not get propagated to workers. After adding a new non-metadata node, the new node will get the altered configuration as it reads from catalog. At this point CONFIGURATION definitions got diverged in the cluster. I suggest that we always use `NON_COORDINATOR_METADATA_NODES` in all the TEXT SEARCH operations here.	2022-03-14 14:44:34 +03:00
Marco Slot	c5031ac7b5	Merge pull request #5793 from citusdata/seq_expression	2022-03-14 09:01:55 +01:00
Onder Kalaci	db529facab	Only change the sequence types if the target column type is a supported sequence type Before this commit, we erroneously converted the sequence type to the type of the column it is used in. However, it is possible that the sequence is used in an expression which is then converted to a type that cannot be a sequence, such as text. With this commit, we only try this conversion if the column type is a supported sequence type (e.g., smallint, int and bigint). Note that we do this conversion because if the column type is a bigint and the sequence is NOT a bigint, users would be in trouble because sequences would generate values that are out of the range of the column. (The other direction is already unsupported: for example, a column that is int with a sequence that is bigint would fail on the worker.) In other words, with this commit, we scope this optimization to cases where the target column type is a supported sequence type. Otherwise, we let users use sequences more freely.	2022-03-11 16:06:00 +01:00
Halil Ozan Akgül	37fafd007c	Turn metadata sync on in isolation_update_node and isolation_update_node_lock_writes tests (#5779 )	2022-03-11 16:39:20 +03:00
Ahmet Gedemenli	d06146360d	Support GRANT ON SCHEMA commands in CREATE SCHEMA statements (#5789 ) * Support GRANT ON SCHEMA commands in CREATE SCHEMA statements * Add test * add comment * Rename to GetGrantCommandsFromCreateSchemaStmt	2022-03-11 14:47:45 +03:00
Jelte Fennema	e5d5c7be93	Start erroring out for unsupported lateral subqueries (#5753 ) With the introduction of #4385 we inadvertently started allowing and pushing down certain lateral subqueries that were unsafe to push down. To be precise the type of LATERAL subqueries that is unsafe to push down has all of the following properties: 1. The lateral subquery contains some non recurring tuples 2. The lateral subquery references a recurring tuple from outside of the subquery (recurringRelids) 3. The lateral subquery requires a merge step (e.g. a LIMIT) 4. The reference to the recurring tuple should be something else than an equality check on the distribution column, e.g. equality on a non distribution column. Property number four is considered both hard to detect and probably not used very often. Thus this PR ignores property number four and causes query planning to error out if the first three properties hold. Fixes #5327	2022-03-11 11:59:18 +01:00
Halil Ozan Akgül	c9913b135c	Turn metadata sync on in isolation_ref2ref_foreign_keys test (#5791 )	2022-03-11 13:30:11 +03:00
Halil Ozan Akgül	2edaf0971c	Turn metadata sync on in isolation reference copy vs all (#5790 ) * Turn metadata sync on in isolation_reference_copy_vs_all test * Update the output of isolation_reference_copy_vs_all test	2022-03-11 11:27:46 +03:00
Hanefi Onaldi	3f72cda27a	Merge pull request #5772 from citusdata/distributed-dictionaries Add support for TEXT SEARCH DICTIONARY objects TEXT SEARCH DICTIONARY objects depend on TEXT SEARCH TEMPLATE objects. Since we do not yet support distributed TS TEMPLATE objects, we skip dependency checks for text search templates, similar to what we do for roles. The user is expected to manually create the TEXT SEARCH TEMPLATE objects before a) adding new nodes, b) creating TEXT SEARCH DICTIONARY objects.	2022-03-11 03:54:29 +03:00
Hanefi Onaldi	b0eb685101	Add support for TEXT SEARCH DICTIONARY objects TEXT SEARCH DICTIONARY objects depend on TEXT SEARCH TEMPLATE objects. Since we do not yet support distributed TS TEMPLATE objects, we skip dependency checks for text search templates, similar to what we do for roles. The user is expected to manually create the TEXT SEARCH TEMPLATE objects before a) adding new nodes, b) creating TEXT SEARCH DICTIONARY objects.	2022-03-11 03:40:20 +03:00
Marco Slot	49467e27e6	Ensure worker_save_query_explain_analyze always fully qualifies types (#5776 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: Marco Slot <marco.slot@gmail.com>	2022-03-10 07:30:11 -08:00
Gledis Zeneli	2cb02bfb56	Fix node adding itself with citus_add_node leading to deadlock (Fix #5720 ) (#5758 ) If a worker node is being added, a command is sent to get the server_id of the worker from the pg_dist_node_metadata table. If the worker's id is the same as the node executing the code, we will know the node is trying to add itself. If the node tries to add itself without specifying `groupid:=0` the operation will result in an error.	2022-03-10 17:46:33 +03:00
Burak Velioglu	2a7a5da526	Merge pull request #5783 from citusdata/velioglu/ensure_for_all Ensure dependencies exist for all alter owner commands	2022-03-10 16:48:28 +03:00
Burak Velioglu	547f6b18ef	Ensure dependencies exist for all alter owner commands	2022-03-10 16:37:55 +03:00
Ahmet Gedemenli	4312486141	Remove unnecessary schema name from CREATE SCHEMA stmts (#5785 )	2022-03-10 15:19:14 +03:00
Hanefi Onaldi	d153c2de0d	Fix some typos in comments	2022-03-10 15:03:26 +03:00
Ahmet Gedemenli	551a7d1383	Support CREATE SCHEMA without name (#5782 )	2022-03-10 13:38:00 +03:00
Marco Slot	e2424756bb	Merge pull request #5774 from citusdata/marcocitus/object-propagation-default	2022-03-09 18:19:56 +01:00
Marco Slot	8e43c8094d	Fix CREATE EXTENSION propagation with custom version	2022-03-09 17:40:50 +01:00
Marco Slot	7559ad12ba	Change create_object_propagation default to immediate	2022-03-09 17:40:50 +01:00
Burak Velioglu	76f249a05a	Merge pull request #5761 from citusdata/velioglu/cyclic_dep Error out if object has unsupported or circular dependency	2022-03-09 16:55:10 +03:00
Burak Velioglu	bbe1b16125	Check whether the object has unsupported or circular dependency	2022-03-09 16:37:53 +03:00
Jelte Fennema	c8839de68b	Don't use cascading deletes in Citus 11 migration script (#5767 ) Using CASCADE in a DELETE can inadvertently delete things we don't intend to. It's safer to fail hard and make the user delete depending things manually.	2022-03-09 14:35:23 +01:00
Halil Ozan Akgül	333bcc7948	Global PID Helper Functions (#5768 ) * Introduces citus_nodename_for_nodeid and citus_nodeport_for_nodeid functions * Introduces citus_nodeid_for_gpid and citus_pid_for_gpid functions * Add tests	2022-03-09 13:15:59 +03:00
Ahmet Gedemenli	264cf78842	Disable use_citus_managed_tables for Postgres config (#5773 )	2022-03-08 17:13:49 +03:00
Önder Kalacı	ad346fedcf	Merge pull request #5766 from citusdata/implement_ddl_blocked Improve citus_lock_waits	2022-03-07 11:20:23 +01:00
Onder Kalaci	c32b2de1a7	Improve citus_lock_waits 1) Remove useless columns 2) Show backends that are blocked on a DDL even before gpid is assigned 3) One minor bugfix, where we clear distributedCommandOriginator properly.	2022-03-07 11:10:44 +01:00
Ahmet Gedemenli	2a3c0c1914	Revert upgrade script changes (#5757 )	2022-03-07 13:04:58 +03:00
Önder Kalacı	25c8de3657	Merge pull request #5762 from citusdata/fix_drop_partition Handle dropping the partitioned tables properly	2022-03-07 10:21:18 +01:00
Onder Kalaci	24fcd2a88c	Handle dropping the partitioned tables properly Before this commit, we might be leaving some metadata on the workers. Now, we handle DROP SCHEMA .. CASCADE properly to avoid any metadata leakage.	2022-03-07 10:02:54 +01:00
Nils Dijk	3801576dfb	Move pg_dist_object to pg_catalog (#5765) DESCRIPTION: Move pg_dist_object to pg_catalog Historically `pg_dist_object` had been created in the `citus` schema as an experiment to understand if we could move our catalog tables to a branded schema. We quickly realised that this interfered with the UX on our managed services and other environments, where users connected via a user with the name of `citus`. By default postgres puts the username on the search_path. To be able to read the catalog in the `citus` schema we would need to grant access permissions to the schema. This caused newly created objects like tables etc. to default to this schema for creation. This failed due to the write permissions to that schema. With this change we move the `pg_dist_object` catalog table to the `pg_catalog` schema, where our other catalog tables are also located. This makes the catalog table visible and readable by any user, like our other catalog tables, for debugging purposes. Note: due to the change of schema, we had to disable 1 test that was running into a discrepancy between the schema and binary. Secondly, we needed to make the lookup functions for the `pg_dist_object` relation and their indexes less strict on the fallback of the naming due to another test that, due to an unfortunate cache invalidation, needed to look up the relation again. This means that we won't default to _only_ resolving from `pg_catalog` outside of upgrades.	2022-03-04 17:40:38 +00:00
Halil Ozan Akgül	12d4486567	Merge pull request #5760 from citusdata/update_citus_dist_stat_activity Update citus_dist_stat_activity and citus_worker_stat_activity to use citus_stat_activity	2022-03-04 17:42:53 +03:00
Halil Ozan Akgul	0500a62515	Updates citus_dist_stat_activity to use citus_stat_activity	2022-03-04 17:28:17 +03:00
Ahmet Gedemenli	b8eedcd261	Notice when create_distributed_function called without params (#5752 ) * Notice when create_distributed_function called without params * Move variable comments to top * Add valid check for cache entry * add objtype to notice msg * update test outputs * Add more tests * Address feedback	2022-03-04 17:26:39 +03:00
Önder Kalacı	28443aee0c	Merge pull request #5755 from citusdata/calculate_gpid Introduce citus_calculate_gpid and citus_backend_gpid	2022-03-04 11:46:36 +01:00
Önder Kalacı	bd6a6563ff	Merge branch 'master' into calculate_gpid	2022-03-04 11:34:12 +01:00
Burak Velioglu	341c3ee90a	Merge pull request #5728 from citusdata/velioglu/citus_table_check_relation Prevent creation of Citus tables if any of the dependencies cannot be distributed	2022-03-03 20:37:26 +03:00
Burak Velioglu	cb6d67a9a9	Make sure that all dependencies of citus tables can be distributed	2022-03-03 20:08:09 +03:00
Onder Kalaci	c7b67ba0ea	Add citus_backend_gpid() And also citus_calculate_gpid(nodeId,pid). These UDFs are just wrappers for the existing functions. Useful for testing and simple manipulation of citus_stat_activity.	2022-03-03 15:29:40 +01:00
Halil Ozan Akgül	90974fdc8f	Merge pull request #5731 from citusdata/citus_stat_activity Introduces citus_stat_activity view	2022-03-03 17:05:35 +03:00
Halil Ozan Akgul	06a0509b1a	Introduces citus_stat_activity view	2022-03-03 16:19:20 +03:00
Marco Slot	ab614194fd	Merge pull request #5742 from citusdata/marcocitus/colocation	2022-03-03 13:11:16 +01:00
Marco Slot	ddf7cf29f3	Sync pg_dist_colocation as a batch	2022-03-03 12:48:48 +01:00
Marco Slot	3ba61244b8	Synchronize pg_dist_colocation metadata	2022-03-03 11:01:59 +01:00
Marco Slot	f7d3405148	Merge pull request #5698 from citusdata/marcocitus/internal-reserved-connections	2022-03-03 10:33:06 +01:00
Marco Slot	43e4dd3808	Add a citus.internal_reserved_connections setting	2022-03-02 19:13:53 +01:00
Önder Kalacı	2f0c346c8a	Merge pull request #5748 from citusdata/improve_visibility_of_citus_stats Improve visibility rules for non-privileged roles on citus_stat_activity	2022-03-02 18:11:35 +01:00
Onder Kalaci	e80a36c4b6	Improve visibility rules for non-privileged roles It seems like our approach is way too restrictive and wrong in some places. Now, we follow an approach very similar to pg_stat_activity. Some of the changes are prerequisites for implementing citus_dist_stat_activity via citus_stat_activity.	2022-03-02 18:04:01 +01:00
Önder Kalacı	b36c58f231	Merge pull request #5556 from citusdata/add_udf_for_upgrades Add a new API for enabling Citus MX for clusters upgrading from earli…	2022-03-02 17:16:21 +01:00
Onder Kalaci	35ec9721b4	Add a new API for enabling Citus MX for clusters upgrading from earlier versions Clusters created pre-Citus 11 mostly didn't have metadata sync enabled. For those clusters, we add a utility UDF which fixes some minor issues and syncs the necessary objects to the workers.	2022-03-02 17:02:55 +01:00
Burak Velioglu	95fdbe1370	Merge pull request #5750 from citusdata/fix_test_ent Unblock enterprise regression test failures	2022-03-02 14:45:37 +03:00
Onder Kalaci	98751058a9	Add Primary key to the table Otherwise enterprise tests fail	2022-03-02 12:03:59 +01:00
Marco Slot	43f798c476	Merge pull request #5751 from citusdata/marcocitus/revert-columnar-build	2022-03-02 11:50:27 +01:00
Marco Slot	dcfbb51b6b	Revert "Build Columnar.so and make Citus depends on it (#5661 )" This reverts commit `a4133c69e8`.	2022-03-02 11:33:15 +01:00
Ahmet Gedemenli	7ad0415fa3	Merge pull request #5735 from citusdata/aggregate-propagate Propagate CREATE AGGREGATE commands	2022-03-02 11:31:21 +03:00
Ahmet Gedemenli	e1809af376	Propagate CREATE AGGREGATE commands	2022-03-02 10:52:43 +03:00
Önder Kalacı	1e876abc56	Merge pull request #5749 from citusdata/improve_check_multi Drop function in the tests on a newer version	2022-03-02 08:51:26 +01:00
Onder Kalaci	b79a0052a4	Drop function in the tests on a newer version As dropping the function now relies on pg_dist_object, which exists with 9.0+	2022-03-02 08:45:35 +01:00
ywj	a4133c69e8	Build Columnar.so and make Citus depends on it (#5661 ) * [Columnar] Build columnar.so and let citus depends on it Co-authored-by: Yanwen Jin <yanwjin@microsoft.com> Co-authored-by: Ying Xu <32597660+yxu2162@users.noreply.github.com> Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-03-01 23:31:14 +03:00
Nils Dijk	65bd540943	Feature: configure object propagation behaviour in transactions (#5724 ) DESCRIPTION: Add GUC to control ddl creation behaviour in transactions Historically we would _not_ propagate objects when we are in a transaction block. Creation of distributed tables would not always work in sequential mode, hence objects created in the same transaction as distributing a table that would use the just created object wouldn't work. The benefit was that the user could still benefit from parallelism. Now that the creation of distributed tables is supported in sequential mode it would make sense for users to force transactional consistency of ddl commands for distributed tables. A transaction could switch more aggressively to sequential mode when creating new objects in a transaction. We don't change the default behaviour just yet. Also, many objects would not even propagate their creation when the transaction was already set to sequential, leaving the probability of a self deadlock. The new policy checks solve this discrepancy between objects as well.	2022-03-01 17:29:31 +03:00
Burak Velioglu	8a56e4cf8f	Merge pull request #5744 from citusdata/velioglu/function_expand Expand functions while resolving dependencies	2022-03-01 17:19:23 +03:00
Burak Velioglu	f17872aed4	Expand functions while resolving dependencies	2022-03-01 17:08:46 +03:00
Gledis Zeneli	b825232ecb	Handle rebalance / replication when a node is disabled (Fix #5664) (#5729) The issue in question is caused when rebalance / replication call `FullShardPlacementList`, which returns all shard placements (including those on nodes disabled with `citus_disable_node`). Eventually, `FindFillStateForPlacement` looks for the state across active workers and fails to find a state for the placements which are on the disabled workers, causing a seg fault shortly after. Approach: * `ActivePlacementHash` was not using the status of the shard placement's node to determine whether the placement is active. Initially, I just fixed that. * Additionally, I refactored the code which handles active shards in replication / rebalance to: * use a single function to determine if a shard placement is active. * do the active shard filtering before calling `RebalancePlacementUpdates` and `ReplicationPlacementUpdates`, so test methods like `shard_placement_rebalance_array` and `shard_placement_replication_array`, which have different requirements for active shard placements, can do their own filtering while using the same rebalance / replicate logic that `rebalance_table_shards` and `replicate_table_shards` use. Fix #5664	2022-02-25 19:54:30 +03:00
Hanefi Onaldi	6c25eea62f	Fix some typos in comments	2022-02-24 19:48:52 +03:00
Önder Kalacı	dda47dae7d	Merge pull request #5734 from citusdata/remove_citus_backend Drop support for CitusInitiatedBackend	2022-02-24 15:29:19 +01:00
Onder Kalaci	df95d59e33	Drop support for CitusInitiatedBackend CitusInitiatedBackend was a premature implementation of the whole GlobalPID infrastructure. We used it to track whether any individual query is triggered by Citus or not. Now that GlobalPID is in place, we don't need CitusInitiatedBackend; in fact, it could even be wrong.	2022-02-24 12:12:43 +01:00
Marco Slot	9b4db12651	Merge pull request #5743 from citusdata/marcocitus/drop-wpqr	2022-02-24 10:55:37 +01:00
Marco Slot	0c4e3cb69c	Drop worker_partition_query_result on downgrade	2022-02-24 10:18:56 +01:00
Hanefi Onaldi	1399853608	Merge pull request #5730 from citusdata/locks-on-ddltasklist	2022-02-24 03:47:30 +03:00
Hanefi Onaldi	7bd6c2c9ac	Isolation tests for various ddl operations and metadata sync	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	f4e8af2c22	Do not acquire locks on node metadata explicitly	2022-02-24 03:19:56 +03:00
Hanefi Onaldi	b70949ae8c	Lock nodes when building ddl task lists	2022-02-24 03:19:56 +03:00
Marco Slot	955eabfcd6	Merge pull request #5400 from citusdata/marcocitus/repartition-using-intermediate-results	2022-02-23 20:03:42 +01:00
Marco Slot	ef1ceb3953	Only use a single placement for map tasks	2022-02-23 19:40:21 +01:00
Marco Slot	8de802eec5	Enable local_shared_pool_size 5 in arbitrary configs test	2022-02-23 19:40:21 +01:00
Marco Slot	490765a754	Enable re-partition joins after local execution	2022-02-23 19:40:21 +01:00
Marco Slot	3cd9aa655a	Stop using citus.binary_worker_copy_format	2022-02-23 19:40:21 +01:00
Marco Slot	5ac0d31e8b	Fix re-partition hash range generation	2022-02-23 19:40:21 +01:00
Marco Slot	72d8fde28b	Use intermediate results for re-partition joins	2022-02-23 19:40:21 +01:00
Nils Dijk	1fb970224e	Fix: partitioned index dependencies (#5741 ) #5685 introduced the resolution of dependencies for indices. This missed support for indices on partitioned tables. This change adds support for partitioned indices to the dependency resolution code.	2022-02-23 17:53:26 +03:00
Jelte Fennema	e1afd30263	Speed up test runs on WSL2 a lot (#5736) It turns out `whereis` is incredibly slow on WSL2 (at least on my machine): ``` $ time whereis diff diff: /usr/bin/diff /usr/share/man/man1/diff.1.gz real 0m0.408s user 0m0.010s sys 0m0.101s ``` This command is run by our custom `diff` script, which is run for every test file that is run. So this adds a lot of unnecessary runtime to tests. This changes our custom `diff` script to only call `whereis` in the strange case that `/usr/bin/diff` does not exist. The impact of this small change on the total runtime of the tests on WSL is huge. As an example, the following command takes 18 seconds without this change and 7 seconds with it: ``` make -C src/test/regress/ check-arbitrary-configs CONFIGS=PostgresConfig ```	2022-02-23 13:03:29 +01:00
Ahmet Gedemenli	9a8f11a086	Merge pull request #5572 from citusdata/add-citus-managed-tables-to-arbitrary-configs Add use_citus_managed_tables to arbitrary configs	2022-02-22 11:54:45 +03:00
Ahmet Gedemenli	8b9402540f	Add use_citus_managed_tables to arbitrary configs (cherry picked from commit 4e93afd1f78854e1aaab63690c441b0b0598a82c) (cherry picked from commit `0295fe2f5b`) (cherry picked from commit 878510725fab9cb6870b4504e0b1f055d7bbc68d)	2022-02-22 11:39:30 +03:00
Teja Mupparti	a62901396b	Allow unsafe triggers via a GUC	2022-02-21 22:45:17 -08:00
Önder Kalacı	d0aa450d7b	Merge pull request #5716 from citusdata/improve_citus_lock_waits_and_use_worker_query Properly set worker_query and use instead of application name on `citus_[dist/worker]_stat_activity`	2022-02-21 18:28:20 +01:00
Onder Kalaci	95d5918967	Properly set worker_query and use	2022-02-21 18:22:33 +01:00
Önder Kalacı	0dd2ddaa70	Merge pull request #5702 from citusdata/improve_citus_lock_waits Improve citus lock waits to show non-tx blocked distributed processes as well	2022-02-21 18:00:51 +01:00
Onder Kalaci	dffcafc096	Use global pids in citus_lock_waits	2022-02-21 17:46:34 +01:00
Onder Kalaci	331af3dce8	Dumping wait edges can optionally scan all backends Before this commit, dumping wait edges could only be used for distributed deadlock detection purposes. With this commit, we open the possibility of using it for any backend.	2022-02-21 17:37:07 +01:00
Halil Ozan Akgül	c866f4b2d6	Merge pull request #5699 from citusdata/global_cancellation Overrides pg_cancel/terminate_backend functions to use gpid	2022-02-21 16:51:08 +03:00
Halil Ozan Akgul	f6cd4d0f07	Overrides pg_cancel_backend and pg_terminate_backend to accept global pid	2022-02-21 16:41:35 +03:00
Ahmet Gedemenli	70dc85239f	Merge pull request #5727 from citusdata/check-distributed-first-for-DropSchemaStmts Do distributed check first, for DropSchema stmts	2022-02-21 15:26:16 +03:00
Ahmet Gedemenli	c1d5ca9896	Do distributed check first, for DropSchema stmts	2022-02-21 14:43:04 +03:00
Ahmet Gedemenli	24bdb287ed	Merge pull request #5723 from citusdata/refactor-distcol-for-createdistributedtable Refactor CreateDistributedTable to take column name	2022-02-21 12:29:13 +03:00
Ahmet Gedemenli	28aa715ce2	Add test for citus local tables with dropped columns	2022-02-21 12:07:17 +03:00
Ahmet Gedemenli	2bc6a00408	Refactor CreateDistributedTable to take column name	2022-02-21 12:07:17 +03:00
Ying Xu	6d16e9ba56	Merge pull request #5688 from citusdata/maryxu/checkcitusversion_columnar Copied CheckCitusVersion into Columnar and added new call site in beginscan_extended	2022-02-18 10:01:13 -08:00
yxu2162	8974b2de66	Copied CheckCitusVersion over to Columnar to handle dependency issue. If we split columnar into two extensions, this will later be changed to CheckColumnarVersion.	2022-02-18 09:47:39 -08:00
Philip Dubé	854f6036a9	Merge pull request #5722 from citusdata/avoid-exceptional-control-flow-in-fluent-py fluent.py: prefer simpler return based control flow in _accept rather than relying on raising an exception	2022-02-18 16:27:08 +00:00
Philip Dubé	3d044dc543	Merge branch 'master' into avoid-exceptional-control-flow-in-fluent-py	2022-02-18 16:10:45 +00:00
Burak Velioglu	259707b630	Merge pull request #5701 from citusdata/velioglu/function_propagation Distribute functions with CREATE FUNCTION command	2022-02-18 17:57:53 +03:00
Burak Velioglu	fa6866ed36	Start to propagate functions to worker nodes with CREATE FUNCTION command together with its dependencies. If the function depends on any nondistributable object, the function will be created only locally. The parameterless version of create_distributed_function becomes obsolete with this change; it will be deprecated from the code with a subsequent PR.	2022-02-18 13:56:51 +03:00
Gledis Zeneli	0ca060a820	Merge pull request #5713 from citusdata/prevent-deadlock-on-collation-create * When a worker tried to create a collation which had a dependency in the same worker node, it would cause a deadlock, now it throws the correct "not a coordinator" error.	2022-02-18 13:29:46 +03:00
gledis69	a14fada153	Prevent Deadlocks When a Worker Tries to Create Collation (Fix #5583 ) * When a worker tried to create a collation which had a dependency in the same worker node, it would cause a deadlock, now it throws the correct "not a coordinator" error.	2022-02-18 12:28:02 +03:00
Teja Mupparti	46fa47beea	Force-delegated functions' distribution argument must be reset as soon as the routine completes execution, and not wait until the top level Executor ends. This fixes issue #5687	2022-02-17 10:48:30 -08:00
Philip Dubé	e4420a6252	fluent.py: prefer simpler return based control flow in _accept rather than relying on raising an exception	2022-02-17 13:30:17 +00:00
Nils Dijk	754d894375	Merge pull request #5721 from citusdata/fix/reuse-get-rolespec reuse GetRoleSpecObjectForUser	2022-02-17 13:59:04 +01:00
Nils Dijk	768b320470	reuse GetRoleSpecObjectForUser	2022-02-17 13:16:10 +01:00
Nils Dijk	ea86f9f94e	Add support for TEXT SEARCH CONFIGURATION objects (#5685 ) DESCRIPTION: Implement TEXT SEARCH CONFIGURATION propagation The change adds support to Citus for propagating TEXT SEARCH CONFIGURATION objects. TSConfig objects cannot always be created in one create statement, and instead require a create statement followed by many alter statements to get turned into the object they should represent. To support this we add functionality to the worker to create or replace objects based on a list of statements. When the lists of the local object and the remote object correspond 1:1 we skip the creation of the object and simply mark it distributed. This is especially important for TSConfig objects as initdb pre-populates databases with a dozen configurations (for many different languages). When the user creates a new TSConfig based on the copy of an existing configuration there is no direct link to the object copied from. Since there is no link we can't simply rely on propagating the dependencies to the worker and send a qualified	2022-02-17 13:12:46 +01:00
Hanefi Onaldi	886db667ee	Merge pull request #5717 from citusdata/fix-enterprise-merge-docs	2022-02-17 13:56:44 +03:00
Hanefi Onaldi	3ca2be85a7	Introduce new error message on enterprise merge checks	2022-02-17 13:48:36 +03:00
Hanefi Onaldi	78795251e1	Revert "Improve CI checks for enterprise merges on master (#4981 )" This reverts commit `b649dffabd`.	2022-02-17 13:48:36 +03:00
Hanefi Onaldi	8cfb93a662	Merge pull request #5718 from citusdata/fix-enterprise-merges	2022-02-17 13:44:46 +03:00
Hanefi Onaldi	ccc4cc6bf0	Move test in isolation schedule to prevent failure We check for metadata consistency across the cluster in the test isolation_metadata_sync_vs_all. However, some earlier tests in enterprise repo leave invalid pg_dist_node entries in the worker nodes that have Oid values for already dropped role objects. To remedy that, I suggest that we move the test to earlier in the schedule, thereby making the tests pass for the time being. We should later introduce metadata checking either in a new isolation test or by moving this test later in the schedule. However, we should do that after we fix the underlying issue.	2022-02-17 13:15:21 +03:00
Gledis Zeneli	57319b23d0	Update CONTRIBUTING.md (#5714 ) Removed extra comment added in #5695	2022-02-17 11:19:59 +03:00
Ahmet Gedemenli	f2f7497ab8	Merge pull request #5710 from citusdata/support-truncate-foreign-tables Support TRUNCATE for foreign tables	2022-02-17 10:07:09 +03:00
Ahmet Gedemenli	a1c3580c64	Support TRUNCATE for foreign tables	2022-02-17 09:59:53 +03:00
Önder Kalacı	bf5aa1e223	Merge pull request #5711 from citusdata/clean_up_gpid Prevent any monitoring view/udf to show already exited backends	2022-02-15 15:40:05 +01:00
Onder Kalaci	abd5b1c506	Prevent any monitoring view/udf to show already exited backends The low-level StoreAllActiveTransactions() function filters out backends that exited. Before this commit, if you run a pgbench, after that you'd still see the backends show up: ```SQL select count(*) from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 538 │ └───────┘ ``` After this patch, only active backends show up: ```SQL select count(*) from get_global_active_transactions(); ┌───────┐ │ count │ ├───────┤ │ 72 │ └───────┘ ```	2022-02-14 17:34:32 +01:00
Ahmet Gedemenli	0411a98c99	Refactor EnsureSequentialMode functions (#5704 )	2022-02-14 18:38:21 +03:00
Gledis Zeneli	badfd561b2	Prevent Citus table functions from being called on shards (Fix #5610 ) (#5694 ) DESCRIPTION: Prevent Citus table functions from being called on shards The operations that guard against using shards are: * Create Local Table * Create distributed table (which affects reference table creation as well). * I used `ErrorIfRaltionIsKnownShard` instead of `ErrorIfIllegallyChangingKnownShard`. `ErrorIfIllegallyChangingKnownShard` allows the operation if `citus.enable_manual_changes_to_shards` is set, but I am not sure if it ever makes sense to create a distributed, reference, or citus local table out of a shard. I tried to go over the code to identify other UDFs where shards could be illegally changed, but I could not find any others. My knowledge of the codebase is not solid enough for me to say for sure. Fixes #5610	2022-02-14 16:06:48 +03:00
Gledis Zeneli	8a3544b4d9	Merge pull request #5695 from citusdata/update-contributing-md * Adds installation of `mitmproxy`. I was getting this error from running regression tests: ``` Can't exec "mitmdump": No such file or directory at /home/glediszeneli/citus/src/test/regress/pg_regress_multi.pl line 215. ``` * Add a comment to alternatively use `install-all` in the setup. Without `install-all` the `mutli-extension` regression test fails.	2022-02-14 15:17:41 +03:00
Gledis Zeneli	b9dfeba050	Merge branch 'master' into update-contributing-md	2022-02-14 14:30:35 +03:00
Hanefi Onaldi	986d8cff49	Merge pull request #5682 from citusdata/metadata-iso-tests	2022-02-11 16:05:57 +03:00
gledis69	98f7c6bc49	Merge branch 'master' into update-contributing-md	2022-02-11 14:55:11 +03:00
gledis69	5478bd0105	Remove extra sudo in comment for Mac	2022-02-11 14:23:43 +03:00
gledis69	49c594a550	Adding install-all comment to all OS-es	2022-02-11 14:17:22 +03:00
Hanefi Onaldi	2e5ca8ba2b	Add isolation tests for metadata sync vs all This commit introduces several test cases for concurrent operations that change metadata, and a concurrent metadata sync operation. The overall structure is as follows: - Session#1 starts metadata syncing in a transaction block - Session#2 does an operation that changes metadata - Both sessions are committed - Another session checks whether the metadata are the same across all nodes in the cluster.	2022-02-11 01:55:04 +03:00
Önder Kalacı	dc6c194916	Show IDLE backends in citus_dist_stat_activity (#5700 ) * Break the dependency to CitusInitiatedBackend infrastructure With this change, we start to show non-distributed backends as well in citus_dist_stat_activity. I think that (a) it is essential for making citus_lock_waits to work for blocked on DDL commands. (b) it is more expected from the user's perspective. The name of the view is a little inconsistent now (e.g., citus_dist_stat_activity) but we are already planning to improve the names with followup PRs. Also, we have global pids assigned, the CitusInitiatedBackend becomes obsolete.	2022-02-10 08:59:28 -08:00
Ahmet Gedemenli	defc2d991f	Merge pull request #5663 from citusdata/schema-propagation Propagate schema operations	2022-02-10 17:30:35 +03:00
Ahmet Gedemenli	76b63a307b	Propagate create/drop schema commands	2022-02-10 14:58:09 +03:00
Marco Slot	04408cfded	Merge pull request #5693 from citusdata/marcocitus/from-pushdown	2022-02-09 21:14:45 +01:00
Marco Slot	d0711ea9b4	Delegate function calls in FROM outside of transaction block	2022-02-09 20:56:25 +01:00
gledis69	4c2a0f0aa0	Removing install-all, but adding a comment about it	2022-02-09 22:00:39 +03:00
Önder Kalacı	60ce24578f	Merge pull request #5697 from citusdata/improve_nodecon_info Prevent citus.node_conninfo to use "application_name"	2022-02-09 13:38:36 +01:00
Onder Kalaci	1c30f61a70	Prevent citus.node_conninfo to use "application_name" With https://github.com/citusdata/citus/pull/5657, Citus uses a fixed application_name while connecting to remote nodes for internal purposes. It means that we cannot allow users to override it via citus.node_conninfo.	2022-02-09 13:22:04 +01:00
Teja Mupparti	1e3c8e34c0	Allow create_distributed_function() on a function owned by an extension Implement #5649 Allow create_distributed_function() on functions owned by extensions 1) Only update pg_dist_object, and do not propagate CREATE FUNCTION. 2) Ensure corresponding extension is in pg_dist_object. 3) Verify if dependencies exist on the function they should resolve to the extension. 4) Impact on node-scaling: We build a list of ddl commands based on all objects in pg_dist_object. We need to omit the ddl's for the extension-function, as it will get propagated by the virtue of the extension creation. 5) Extra checks for functions coming from extensions, to not propagate changes via ddl commands, even though the function is marked as distributed in pg_dist_object	2022-02-08 11:52:56 -08:00
Halil Ozan Akgül	474e36a405	Merge pull request #5601 from citusdata/global_pid Introduce global PID	2022-02-08 18:43:38 +03:00
Halil Ozan Akgul	8ee02b29d0	Introduce global PID	2022-02-08 16:49:38 +03:00
Burak Velioglu	6376eaf0e0	Merge pull request #5684 from citusdata/velioglu/superuser_connection_for_dep Use super user connection while propagating dependent objects' pg_dist_object entries	2022-02-07 18:38:29 +03:00
gledis69	ed107835cb	Updates a few details in Contributing.md * Adds installation of `mitmproxy`. I was getting this error from running regression tests: ``` Can't exec "mitmdump": No such file or directory at /home/glediszeneli/citus/src/test/regress/pg_regress_multi.pl line 215. ``` * Calls `install-all` in the setup. Without `install-all` the `mutli-extension` regression test failed.	2022-02-07 18:34:39 +03:00
Burak Velioglu	0a70b78bf5	Add test for dist type	2022-02-07 17:50:49 +03:00
Burak Velioglu	c0aece64d0	Add test for checking distributed extension function	2022-02-07 17:50:48 +03:00
Burak Velioglu	ab248c1785	Check object ownership while creating pg_dist_object entries on remote	2022-02-07 17:50:48 +03:00
Burak Velioglu	8ae7577581	Use superuser connection while syncing dependent objects' pg_dist_object tuples	2022-02-07 17:50:45 +03:00
Marco Slot	d7858709b4	Merge pull request #5690 from citusdata/marcocitus/placement-policy-cleanup	2022-02-07 10:28:02 +01:00
Marco Slot	872f0a79db	Remove random shard placement policy	2022-02-06 21:55:58 +01:00
Marco Slot	0cae8e7d6b	Remove local-node-first shard placement	2022-02-06 21:36:34 +01:00
Teja Mupparti	c8e504dd69	Fix the issue #5673 If the expression is simple, such as SELECT function() or PERFORM function() in PL/PgSQL code, the PL engine does a simple expression evaluation which can't interpret the Citus CustomScan Node. The code checks for simple expressions when executing a UDF but missed the DO-Block scenario; this commit fixes it.	2022-02-04 15:44:53 -08:00
Ying Xu	b5c116449b	Removed dependency from EnsureTableOwner (#5676 ) Removed dependency for EnsureTableOwner. Also removed pg_fini() and columnar_tableam_finish() Still need to remove CheckCitusVersion dependency to make Columnar_tableam.h dependency free from Citus.	2022-02-04 12:45:07 -08:00
Onur Tirtir	79442df1b7	Fix coordinator/worker query targetlists for agg. that we cannot push-down (#5679 ) Previously, we were wrapping targetlist nodes with Vars that reference to the result of the worker query, if the node itself is not `Const` or not a `Param`. Indeed, we should not do that unless the node itself is a `Var` node or contains a `Var` within it (e.g.: `OpExpr(Var(column_a) > 2)`). Otherwise, when worker query returns empty result set, then combine query exec would crash since the `Var` would be pointing to an empty tuple slot, which is not desirable for the node-executor methods.	2022-02-04 05:37:25 -08:00
Önder Kalacı	b9b4833710	Merge pull request #5678 from citusdata/unify_gucs Unify old GUCs into a single one	2022-02-04 14:18:10 +01:00
Onder Kalaci	72d7d92611	Apply code review feedback	2022-02-04 10:52:57 +01:00
Onder Kalaci	923bb194a4	Move isolation_multiuser_locking to MX tests	2022-02-04 10:52:57 +01:00
Onder Kalaci	bcb00e3318	remove not used files	2022-02-04 10:52:57 +01:00
Onder Kalaci	ff234fbfd2	Unify old GUCs into a single one Replaces citus.enable_object_propagation with citus.enable_metadata_sync Also, within the Citus 11 release cycle, we added citus.enable_metadata_sync_by_default, which is also replaced with citus.enable_metadata_sync. In essence, when citus.enable_metadata_sync is set to true, all the objects and the metadata are sent to the remote node. We strongly advise that users never change the value of this GUC.	2022-02-04 10:52:56 +01:00
Teja Mupparti	f31bce5b48	Fixes the issue seen in https://github.com/citusdata/citus-enterprise/issues/745 With this commit, rebalancer backends are identified by application_name = citus_rebalancer and the regular internal backends are identified by application_name = citus_internal	2022-02-03 09:40:46 -08:00
jeff-davis	b072b9235e	Columnar: fix checksums, broken in `a4067913`. (#5669 ) Checksums must be set directly before writing the page. log_newpage() sets the page LSN, and therefore invalidates the checksum.	2022-02-02 13:22:11 -08:00
Önder Kalacı	4bb7283af1	Merge pull request #5674 from citusdata/minor_fixes_metadata_sync Minor fixes for metadata syncing	2022-02-02 11:40:51 +01:00
Onder Kalaci	650243927c	Relax some transactional limitations on activate node We already enforce EnsureSequentialModeMetadataOperations(), and given that all of activate node is transactional, we should be fine	2022-02-01 15:56:55 +01:00
Onder Kalaci	34d91009ed	Update outdated comment As of the current HEAD, we support sequences as first class objects	2022-02-01 15:37:10 +01:00
Marco Slot	593861f285	Merge pull request #5660 from citusdata/marcocitus/function-call-pushdown-from-workers	2022-02-01 14:37:20 +01:00
Marco Slot	63c6896716	Enable function call pushdown from workers	2022-02-01 14:13:25 +01:00
Önder Kalacı	f712dfc558	Add tests coverage (#5672 ) For extension owned tables with sequences	2022-02-01 15:39:52 +03:00
Hanefi Onaldi	82abf22375	Merge pull request #5671 from citusdata/changelog-updates	2022-02-01 14:29:39 +03:00
Hanefi Onaldi	beafde5ff5	Add changelog entries for 10.2.4	2022-02-01 13:53:11 +03:00
Hanefi Onaldi	768643644b	Add changelog entries for 10.1.4	2022-02-01 13:53:10 +03:00
Burak Velioglu	ed8e137467	Merge pull request #5619 from citusdata/velioglu/table_wo_seq_prototype Handle tables and sequences as objects	2022-01-31 18:07:32 +03:00
Burak Velioglu	f88cc230bf	Handle tables and objects as metadata. Update UDFs accordingly With this commit we've started to propagate sequences and shell tables within the object dependency resolution. So, ensuring any dependencies for any object will consider shell tables and sequences as well. Separate logics for both shell tables and sequences have been removed. Since both shell tables and sequences logic were implemented as a part of the metadata handling before that logic, we were propagating them while syncing table metadata. With this commit we've divided metadata (which means anything except shards thereafter) syncing logic into multiple parts and implemented it either as a part of ActivateNode. You can check the functions called in ActivateNode to check definition of different metadata. Definitions of start_metadata_sync_to_node and citus_activate_node have also been updated. citus_activate_node will basically create an active node with all metadata and reference table shards. start_metadata_sync_to_node will be same with citus_activate_node except replicating reference tables. stop_metadata_sync_to_node will remove all the metadata. All of those UDFs need to be called by superuser.	2022-01-31 16:20:15 +03:00
Önder Kalacı	f68ac4a7cf	Consider foreign keys between reference tables (#5659 ) On #5071, we avoid edge cases, but below there are foreign key constraints as well This commit makes sure we cover those as well	2022-01-28 13:38:14 +01:00
Heikki Linnakangas	a40679139b	Use smgrextend() when extending relation, and WAL-log first. (#5654 ) When creating a new table, we bypass the buffer cache and write the initial pages directly with smgrwrite(). However, you're supposed to use smgrextend() when extending a relation, rather than smgrwrite(). There isn't much difference between them, but smgrextend() updates the relation size cache, which seems important, although I haven't seen any real bugs caused by that. Also, write the block to disk only after WAL-logging it, so that we can include the LSN of the WAL record in the version that we write out. Currently, the page as written to disk has LSN 0. That doesn't cause any user-visible issues either, at worst it could make us WAL-log a full page image of the page earlier than necessary, but that doesn't matter currently because we WAL-log full page images of all changes anyway. I bumped into that issue with LSN 0 in the page header when testing Citus with Zenith (https://github.com/zenithdb/zenith/issues/1176). Zenith contains a check that PANICs if you write a block to disk without WAL-logging it, and it works by checking the LSN of the page that's written out. In this case, we are WAL-logging the page even though the LSN on the page is 0, so it was a false alarm, but I'd love to get this changed in Citus to keep the check in Zenith simple. A downside of WAL-logging the page first is that if you run out of disk space, you have already created the WAL record. So if you then crash and restart, WAL recovery will likely run out of disk space, too, which is bad. In practice, we have the same problem in other places, like rewriteheap.c. Also, if you are on the brink of running out of disk space, you will probably run out at WAL replay anyway, regardless of which order we write these few pages. But if we wanted to fix that, we could first extend the relation with zeros, and then WAL-log the pages. That's how heap extension works. 
It would be even nicer to use the buffer cache for this, and skip the smgrimmedsync() on the relation. However, that would require more work, because we don't have the Relation struct for the relation here. We could use ReadBufferWithoutRelcache(), but that doesn't work for unlogged tables. Unlogged tables are currently not supported (https://github.com/citusdata/citus/issues/4742), but that would become a problem if we want to support them in the future. CreateFakeRelcacheEntry() also doesn't work with unlogged tables. We could do things differently for logged and unlogged tables, but that complicates the code further. Co-authored-by: jeff-davis <Jeffrey.Davis@microsoft.com>	2022-01-27 12:04:08 -08:00
Önder Kalacı	dcb9c71f19	Merge pull request #5657 from citusdata/assign_explicit_citus_user_name Use a fixed application_name while connecting to remote nodes	2022-01-27 13:02:39 +01:00
Onder Kalaci	303540e494	Add PGAPPNAME env. variable to arbitrary configs	2022-01-27 11:00:15 +01:00
Onder Kalaci	b26eeaecd3	Use a fixed application_name while connecting to remote nodes Citus heavily relies on application_name, see `IsCitusInitiatedRemoteBackend()`. But if the user sets the application name, such as export PGAPPNAME=test_name, Citus uses that name while connecting to the remote node. With this commit, we ensure that Citus always connects with the "citus" application name to the remote nodes.	2022-01-27 10:46:25 +01:00
Önder Kalacı	9bc0fd9479	Merge pull request #5653 from citusdata/remove_error_create_dist_table Allow creating distributed tables in sequential mode	2022-01-26 13:11:33 +01:00
Onder Kalaci	b9b419ef16	Allow creating distributed tables in sequential mode With https://github.com/citusdata/citus/pull/2780, we allow COPY to use any number of connections that the executor used in a tx block. Meaning that, while COPYing data to the shards, create_distributed_table could allow sequential mode.	2022-01-26 12:58:18 +01:00
Onur Tirtir	8c8d696621	Not fail over to local execution when it's not supported (#5625 ) We fall back to local execution if we cannot establish any more connections to local node. However, we should not do that for the commands that we don't know how to execute locally (or we know we shouldn't execute locally). To fix that, we take localExecutionSupported into account in CanFailoverPlacementExecutionToLocalExecution too. Moreover, we also emit a more accurate hint message to inform the user whether the execution failed because local execution is disabled by them, or because local execution wasn't possible for the given command.	2022-01-25 16:43:21 +01:00
Onur Tirtir	ff3913ad99	Copy errmsg for distributed deadlock error into heap (#5641 ) multi_log_hook() hook is called by EmitErrorReport() when emitting the ereport either to frontend or to the server logs. And some callers of EmitErrorReport() (e.g.: errfinish()) seem to assume that string fields of the given ErrorData object need to be freed. For this reason, we copy the message into heap here. I don't think we have faced such a problem before but it seems worth fixing as it is theoretically possible due to the reasoning above.	2022-01-24 06:27:41 -08:00
Ahmet Gedemenli	152e512aa9	Merge pull request #5642 from citusdata/refactor-GenerateGrantOnSchemaStmtForRights Refactor GenerateGrantOnSchemaStmtForRights	2022-01-24 12:43:42 +03:00
Ahmet Gedemenli	c838fb428f	Refactor GenerateGrantOnSchemaStmtForRights	2022-01-24 11:31:59 +03:00
Ahmet Gedemenli	577224cf23	Merge pull request #5630 from citusdata/multi_colocation_utils-turn-mx-on Turn mx on for test: multi_colocation_utils	2022-01-21 19:48:00 +03:00
Ahmet Gedemenli	e6fc0c6f36	Turn mx on for test: multi_colocation_utils	2022-01-21 19:31:47 +03:00
Onur Tirtir	4dc38e9e3d	Use EnsureCompatibleLocalExecutionState instead (#5640 )	2022-01-21 15:37:59 +01:00
Ahmet Gedemenli	a2b05f0d28	Merge pull request #5639 from citusdata/fix-tagetcolocation-typo Fix typo: taget/target	2022-01-21 10:53:42 +03:00
Ahmet Gedemenli	8647682c11	Fix typo: taget/target	2022-01-21 10:35:56 +03:00
Onur Tirtir	0244d3f206	Merge pull request #5636 from citusdata/drop-tg-utils Drop ruleutils copied for triggers & statistics While reading trigger related parts of our code-base, realized that we actually don't need to copy & paste underlying worker functions from pg/ruleutils.c since higher level functions for those two are anyway exposed as SQL callables, so we can delete more than ~1k lines of code from our ruleutils_x.c files.	2022-01-20 17:35:26 +03:00
Onur Tirtir	181111b84f	Drop ruleutils copied for statistics	2022-01-20 17:28:19 +03:00
Onur Tirtir	7b59295af2	Drop ruleutils copied for triggers	2022-01-20 17:28:19 +03:00
Önder Kalacı	ddf3c9ed32	Merge pull request #5633 from citusdata/make_minimal_work_again Fix check-minimal	2022-01-20 11:54:18 +01:00
Önder Kalacı	e8ba9dd9d3	Merge branch 'master' into make_minimal_work_again	2022-01-20 11:48:53 +01:00
Teja Mupparti	54862f8c22	(1) Functions will be delegated even when present in the scope of an explicit BEGIN/COMMIT transaction block or in a UDF calling another UDF. (2) Prohibit/Limit the delegated function not to do a 2PC (or any work on a remote connection). (3) Have a safety net to ensure the (2) i.e. we should block the connections from the delegated procedure or make sure that no 2PC happens on the node. (4) Such delegated functions are restricted to use only the distributed argument value. Note: To limit the scope of the project we are considering only Functions(not procedures) for the initial work. DESCRIPTION: Introduce a new flag "force_delegation" in create_distributed_function(), which will allow a function to be delegated in an explicit transaction block. Fixes #3265 Once the function is delegated to the worker, on that node during the planning distributed_planner() TryToDelegateFunctionCall() CheckDelegatedFunctionExecution() EnableInForceDelegatedFuncExecution() Save the distribution argument (Constant) ExecutorStart() CitusBeginScan() IsShardKeyValueAllowed() Ensure to not use non-distribution argument. ExecutorRun() AdaptiveExecutor() StartDistributedExecution() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the remoteTaskList. NonPushableInsertSelectExecScan() InitializeCopyShardState() EnsureNoRemoteExecutionFromWorkers() Ensure all the shards are local to the node in the placementList. This also fixes a minor issue: Properly handle expressions+parameters in distribution arguments	2022-01-19 16:43:33 -08:00
Onder Kalaci	7f30222c90	Fix check-minimal It seems like we broke check-minimal with the refactor on #5486 This commit fixes the minor issue	2022-01-19 16:21:59 +01:00
Ahmet Gedemenli	65ab34810b	Merge pull request #5628 from citusdata/turn-mx-on-test-citus-local-tables Turn mx on for test file citus_local_tables in multi-1 schedule	2022-01-19 14:39:28 +03:00
Ahmet Gedemenli	9e6ebe4826	Turn mx on for test file citus_local_tables, on multi-1 schedule	2022-01-19 13:55:51 +03:00
Onur Tirtir	4a53967bdd	Remove an outdated comment from RelationIsAKnownShard (#5629 )	2022-01-19 11:24:10 +01:00
Ahmet Gedemenli	37b3f50447	Turn mx on for multi-1 schedule (#5627 ) For test files: multi_generate_ddl_commands, multi_repair_shards, multi_create_shards, mixed_relkind_tests	2022-01-19 12:05:54 +03:00
Marco Slot	6a43cfa9f2	Merge pull request #5567 from citusdata/marcocitus/hide-shards	2022-01-18 12:34:30 +01:00
Marco Slot	33bfa0b191	Hide shards from application_name's with a specific prefix	2022-01-18 15:20:55 +04:00
Onur Tirtir	d98500ac22	Fix a flaky test related with temp columnar table cleanup (#5599 ) Wait until old backend to expire to make sure that temp table cleanup is complete.	2022-01-17 09:26:30 -08:00
Ahmet Gedemenli	e564220dd5	Fix typo: GetRelationTriggerFunctionDependencyList (#5626 )	2022-01-17 18:17:07 +03:00
Ahmet Gedemenli	8936543b80	Create wrapper function CreateObjectAddressDependencyDefList (#5623 )	2022-01-17 15:35:40 +03:00
Ying Xu	4dca662e97	Making Columnar Dependency Free from Citus (#5622 ) * Removed distributed dependency in columnar_metadata.c * Changed columnar_debug.c so that it no longer needed distributed/tuplestore and made it return a record instead of a tuplestore * removed distributed/commands.h dependency * Made columnar_tableam.c dependency-free * Fixed spacing for columnar_store_memory_stats function * indentation fix * fixed test failures	2022-01-14 09:43:05 -08:00
Onur Tirtir	70d8e1fe97	Assert that we will create indexes on shards via local execution (#5620 )	2022-01-13 17:09:57 +01:00
Halil Ozan Akgül	deac77e053	Merge pull request #5616 from citusdata/add_missing_library_to_dependencies Add missing library to dependencies.c	2022-01-12 10:38:05 +03:00
Halil Ozan Akgul	63cd90e5dd	Add missing library to dependencies.c	2022-01-11 18:36:43 +03:00
Önder Kalacı	cb447d7bc9	Merge pull request #5611 from citusdata/onderkalaci-patch-1 Enable MX for rebalancer tests	2022-01-11 12:22:08 +01:00
Önder Kalacı	46ec7cd5cf	Enable MX for rebalancer tests	2022-01-11 12:07:39 +01:00
Önder Kalacı	885601c02c	Require superuser while activating a node (#5609 ) * Require superuser while activating a node With this change, ActiveNode() (and hence citus_add_node() and citus_activate_node()) explicitly requires a superuser. Before this commit, these functions were designed to work with non-superuser roles with the relevant GRANTs given. However, that is not a widely used way of calling the functions above. Due to the possibility of a non-superuser calling the UDFs, they were designed in a way that some commands were using additional short-lived superuser connections. That is: (a) breaking transactional behavior (e.g., ROLLBACK wouldn't fully rollback the whole transaction) (b) making it very complicated to reason about which parts of the node activation go over which connections, and becoming vulnerable to deadlocks / visibility issues.	2022-01-10 08:30:13 -08:00
Onur Tirtir	3cc44ed8b3	Tell other backends it's safe to ignore the backend that concurrently built the shell table index (#5520 ) In addition to starting a new transaction, we also need to tell other backends --including the ones spawned for connections opened to localhost to build indexes on shards of this relation-- that concurrent index builds can safely ignore us. Normally, DefineIndex() only does that if index doesn't have any predicates (i.e.: where clause) and no index expressions at all. However, now that we already called standard process utility, index build on the shell table is finished anyway. The reason behind doing so is that we cannot guarantee not grabbing any snapshots via adaptive executor, and the backends creating indexes on local shards (if any) might block on waiting for current xact of the current backend to finish, which would cause self deadlocks that are not detectable.	2022-01-10 10:23:09 +03:00
Marco Slot	73a76b876a	Merge pull request #5602 from citusdata/marcocitus/disallow-remote-execution	2022-01-07 18:02:13 +01:00
Marco Slot	ee3b50b026	Disallow remote execution from queries on shards	2022-01-07 17:46:21 +01:00
Önder Kalacı	8d1b188620	Enable MX for the remaining failure tests (#5606 )	2022-01-07 17:24:31 +01:00
Ahmet Gedemenli	3c834e6693	Disable foreign distributed tables (#5605 ) * Disable foreign distributed tables * Add warning for existing distributed foreign tables	2022-01-07 18:12:23 +03:00
Önder Kalacı	9d858cb1da	Merge pull request #5579 from citusdata/improve_metadata_conn Improve metadata connection selection logic	2022-01-07 10:42:23 +01:00
Onder Kalaci	7cb1d6ae06	Improve metadata connections With https://github.com/citusdata/citus/pull/5493 we introduced metadata specific connections. With this connection we guarantee that there is a single metadata connection. But note that this connection can be used for any other operation. In other words, this connection is not only reserved for metadata operations. However, https://github.com/citusdata/citus-enterprise/issues/715 showed us that the logic has a flaw. We allowed ineligible connections to be picked as metadata connections: such as exclusively claimed connections or not fully initialized connections. With this commit, we make sure that we only consider eligible connections for metadata operations.	2022-01-07 10:36:32 +01:00
Önder Kalacı	b6cf5a969b	Merge pull request #5597 from citusdata/move_placement_deletions Move placement deletion from disable node to activate node	2022-01-07 10:03:35 +01:00
Onder Kalaci	9f2d9e1487	Move placement deletion from disable node to activate node We prefer the background daemon to only sync node metadata. That's why we move placement metadata changes from disable node to activate node. With that, we can make sure that disable node only changes node metadata, whereas activate node syncs all the metadata changes. In essence, we already expect all nodes to be up when a node is activated. So, this does not change the behavior much.	2022-01-07 09:56:03 +01:00
Hanefi Onaldi	9edfbe7718	Fix the default value for DeferShardDeleteOnMove The default for GUC citus.defer_drop_after_shard_move is true. However we initialize the global variable with a false value.	2022-01-07 11:01:49 +03:00
Marco Slot	dd122f0f4c	Merge pull request #5592 from janpio/patch-1 docs(README): Fix "Why Citus?" list item indentation	2022-01-06 18:18:45 +01:00
Marco Slot	072e100072	Merge branch 'master' into patch-1	2022-01-06 18:08:20 +01:00
Ahmet Gedemenli	45e423136c	Support foreign tables in MX (#5461 )	2022-01-06 18:50:34 +03:00
Önder Kalacı	5305aa4246	Do not drop sequences when dropping metadata (#5584 ) Dropping sequences means we need to recreate them and hence lose the sequence. With this commit, we keep the existing sequences such that resyncing wouldn't drop the sequence. We do that by breaking the dependency of the sequence from the table.	2022-01-06 09:48:34 +01:00
Önder Kalacı	8007adda25	Convert the function to a distributed function (#5596 ) so that when metadata is synced, the table is on the worker	2022-01-06 11:32:40 +03:00
Önder Kalacı	6d9218540b	Enable single node tests with Citus MX (#5595 ) * Enable single node tests with Citus MX The test already has comment on the changes	2022-01-05 16:00:44 +03:00
jeff-davis	2e03efd91e	Columnar: move DDL hooks to citus to remove dependency. (#5547 ) Add a new hook ColumnarTableSetOptions_hook so that citus can get control when the columnar table options change.	2022-01-04 23:26:46 -08:00
jeff-davis	c9292cfad1	Make pg_version_compat.h and listutils.c dependency-free. (#5548 ) Split distributed/version_compat.h into dependency-free pg_version_compat.h, and the original which still has dependencies. The original doesn't have much purpose, but until other files have better discipline about including the correct header files, then it's still needed. Also make distributed/listutils.h dependency-free. Should be moved outside of 'distributed' subdirectory, but that will cause significant code churn, so leave for another cleanup patch. Now both files can be included in columnar without creating a dependency on citus.	2022-01-04 23:02:08 -08:00
jeff-davis	1546aa0d9f	Columnar: use proper generic WAL interface. (#5543 ) Previously, we cheated by using the RM_GENERIC_ID record type, but not actually using the generic WAL API. This worked because we always took a full page image, and saved the extra work of allocating and copying to a temporary page. But it introduced complexity, and perhaps fragility, so better to just use the API properly. The performance penalty for a serial data load seems to be less than 1%.	2022-01-04 22:42:21 -08:00
Jan Piotrowski	61930e81ad	docs(README): Fix "Why Citus?" indentation	2022-01-04 17:22:56 +01:00
Önder Kalacı	afd53e4c54	Merge pull request #5591 from citusdata/unify_tests Make sure that the community and enterprise tests produce the same output	2022-01-04 14:11:24 +01:00
Onder Kalaci	22b5175fd1	Make sure that the community and enterprise tests produce the same output	2022-01-04 13:30:31 +01:00
Önder Kalacı	0a8b0b06c6	Do not allow distributed functions on non-metadata synced nodes (#5586 ) Before this commit, Citus was triggering metadata syncing in the background when a function is distributed. However, with Citus 11, we expect all clusters to have metadata syncing enabled. So, we do not expect any nodes not to have the metadata. This change: (a) pro: simplifies the code and opens up possibilities to simplify further by reducing the scope of bg worker to only sync node metadata (b) pro: explicitly asks users to sync the metadata such that any unforeseen impact can be easily detected (c) con: For distributed functions without distribution argument, we do not necessarily require the metadata synced. However, for completeness and simplicity, we do so.	2022-01-04 13:12:57 +01:00
Gürkan İndibay	30eb24009c	Adds new badges into README (#5590 )	2022-01-04 13:38:41 +03:00
Gürkan İndibay	29dd7dfe05	Adds stackoverflow badge into README.md (#5589 )	2022-01-04 10:42:18 +03:00
Halil Ozan Akgül	9cccaa11d3	Merge pull request #5587 from citusdata/add_isolation_check_mx Add isolation_check_mx test	2021-12-30 15:31:58 +03:00
Halil Ozan Akgul	9547228e8d	Add isolation_check_mx test	2021-12-30 14:58:30 +03:00
Halil Ozan Akgül	41b4462f6e	Merge pull request #5580 from citusdata/fix_metadata_sync_fails_on_multi_transaction_recovery Fix metadata sync fails on multi_transaction_recovery	2021-12-29 11:35:05 +03:00
Halil Ozan Akgul	aef2d83c7d	Fix metadata sync fails on multi_transaction_recovery	2021-12-29 11:21:32 +03:00
Önder Kalacı	d33650d1c1	Record if any partitioned Citus tables during upgrade (#5555 ) With Citus 11, the default behavior is to sync the metadata. However, partitioned tables created pre-Citus 11 might have index names that are not compatible with metadata syncing. See https://github.com/citusdata/citus/issues/4962 for the details. With this commit, we record the existence of partitioned tables such that we can fix it later if any exists.	2021-12-27 03:33:34 -08:00
Halil Ozan Akgül	c43b6613d0	Merge pull request #5576 from citusdata/fix_metadata_sync_fails_on_multi_truncate Fix metadata sync fails on multi_truncate	2021-12-27 14:08:18 +03:00
Halil Ozan Akgul	0c292a74f5	Fix metadata sync fails on multi_truncate	2021-12-27 13:54:53 +03:00
Önder Kalacı	c9127f921f	Avoid round trips while fixing index names (#5549 ) With this commit, fix_partition_shard_index_names() works significantly faster. For example: 32 shards, 365 partitions, 5 indexes drop from ~120 seconds to ~44 seconds; 32 shards, 1095 partitions, 5 indexes drop from ~600 seconds to ~265 seconds. `queryStringList` can be really long, because it may contain #partitions * #indexes entries. Before this change, we were actually going through the executor where each command in the query string triggers 1 round trip per entry in queryStringList. The aim of this commit is to avoid the round-trips by creating a single query string. I first simply tried sending `q1;q2;..;qn`. However, the executor is designed to handle `q1;q2;..;qn` type of query executions via the infrastructure mentioned above (e.g., by tracking the query indexes in the list and doing 1 statement per round trip). Another option could have been to change the executor so that it only tracks the query index when `queryStringList` is provided, not when queryString includes multiple `;`s. That is (a) more work (b) could cause weird edge cases with failure handling (c) felt like coding a special case into the executor	2021-12-27 10:29:37 +01:00
Halil Ozan Akgül	e3d1a42f81	Merge pull request #5574 from citusdata/fix_metadata_sync_fails_on_multi_function_evaluation Fix metadata sync fails on multi_function_evaluation	2021-12-27 10:53:04 +03:00
Halil Ozan Akgul	bb636e6a29	Fix metadata sync fails on multi_function_evaluation	2021-12-24 19:32:58 +03:00
Halil Ozan Akgül	7e851d2b9b	Merge pull request #5553 from citusdata/fix_metadata_sync_fails_on_multi_sequence_default Fix metadata sync fails on multi_sequence_default and multi_name_lengths	2021-12-24 17:33:37 +03:00
Halil Ozan Akgul	70e68d5312	Fix metadata sync fails on multi_name_lengths	2021-12-24 14:33:32 +03:00
Halil Ozan Akgul	5c2fb06322	Fix metadata sync fails on multi_sequence_default	2021-12-24 14:33:32 +03:00
Hanefi Onaldi	caffaa8517	Merge pull request #5561 from citusdata/improve-circleci-config	2021-12-24 13:20:50 +03:00
Hanefi Onaldi	9ad29e5a9d	Improve CircleCI configs - Parameterize PG versions - Use single parens for consistency - Fix spacing issues - Remove unsupported attributes - Compact check-style steps - Parameterize PG versions for upgrade tests	2021-12-24 13:10:27 +03:00
Halil Ozan Akgül	7f82400352	Merge pull request #5558 from citusdata/turn_metadata_sync_on_in_multi_metadata_sync Turn metadata sync on in multi_metadata_sync	2021-12-24 11:05:25 +03:00
Halil Ozan Akgul	b9c06a6762	Turn metadata sync on in multi_metadata_sync	2021-12-24 10:58:13 +03:00
Hanefi Onaldi	479b2da740	Fix one flaky failure test	2021-12-23 20:11:45 +03:00
Ahmet Gedemenli	63626d4995	Merge pull request #5468 from citusdata/propagate-foreign-server-ops Propagate foreign server ops	2021-12-23 19:04:52 +03:00
Ahmet Gedemenli	042d45b263	Propagate foreign server ops	2021-12-23 17:54:04 +03:00
Onur Tirtir	61b5fb1cfc	Run failure_test_helpers in base schedule (#5559 )	2021-12-23 12:54:12 +01:00
Talha Nisanci	e196d23854	Refactor AttributeEquivalenceId (#5006 )	2021-12-23 13:19:02 +03:00
Hanefi Onaldi	76176caea7	Fix typo s/exlusive/exclusive/	2021-12-23 01:35:01 +03:00
Hanefi Onaldi	1af8ca8f7c	Fix static analysis findings (#5550 )	2021-12-22 18:16:11 +03:00
Ahmet Gedemenli	d5a969b055	Merge pull request #5540 from citusdata/fix-function-signature-generation Fix function signature generation	2021-12-21 19:25:41 +03:00
Ahmet Gedemenli	8e4ff34a2e	Do not include return table params in the function arg list (cherry picked from commit `90928cfd74`) Fix function signature generation Fix comment typo Add test for worker_create_or_replace_object Add test for recreating distributed functions with OUT/TABLE params Add test for recreating distributed function that returns setof int Fix test output Fix comment	2021-12-21 19:01:42 +03:00
Marco Slot	80f41e94c0	Merge pull request #4945 from citusdata/marcocitus/set-transaction	2021-12-18 11:38:37 +01:00
Marco Slot	2eef71ccab	Propagate SET TRANSACTION commands	2021-12-18 11:31:39 +01:00
Halil Ozan Akgül	2108410a40	Merge pull request #5535 from citusdata/turn_metadata_sync_on_in_add_corrdinator Turn metadata sync on in add_coordinator, foreign_key_to_reference_ta…	2021-12-17 18:05:43 +03:00
Halil Ozan Akgul	46f718c76d	Turn metadata sync on in add_coordinator, foreign_key_to_reference_table and replicate_reference_tables_to_coordinator	2021-12-17 16:33:25 +03:00
Halil Ozan Akgül	c3195f75a5	Merge pull request #5545 from citusdata/turn_ddl_propagation_off_on_multi_copy Turn ddl propagation off in worker on multi_copy	2021-12-17 16:29:13 +03:00
Halil Ozan Akgul	25755a7094	Turn ddl propagation off in worker on multi_copy	2021-12-17 15:54:20 +03:00
Önder Kalacı	695653911a	Merge pull request #4634 from citusdata/citus_grep_command Grep Remote/Local commands	2021-12-17 12:07:45 +01:00
Onder Kalaci	fc98f83af2	Add citus.grep_remote_commands Simply applies `SELECT textlike(command, citus.grep_remote_commands)` and, if it returns true, the command is logged. Otherwise, the log is ignored. When citus.grep_remote_commands is an empty string, all commands are logged.	2021-12-17 11:47:40 +01:00
Halil Ozan Akgül	5b1a25ae7c	Merge pull request #5544 from citusdata/turn_metadata_sync_on_in_multi_replicate_reference_table Turn metadata sync on in multi_replicate_reference_table and multi_citus_tools	2021-12-17 11:04:41 +03:00
Halil Ozan Akgul	df8d0f3db1	Turn metadata sync on in multi_replicate_reference_table and multi_citus_tools	2021-12-17 10:25:57 +03:00
Onur Tirtir	cc4c83b1e5	HAVE_LZ4 -> HAVE_CITUS_LZ4 (#5541 )	2021-12-16 16:21:52 +03:00
Talha Nisanci	c0945d88de	Normalize a debug failure to WARNING failure (#4996 )	2021-12-16 13:43:49 +03:00
Halil Ozan Akgül	7d0f4f11c3	Merge pull request #5537 from citusdata/turn_metadata_sync_on_in_mx_regular_user Turn metadata sync on in mx_regular_user and remove_coordinator	2021-12-16 11:35:08 +03:00
Halil Ozan Akgul	8943d7b52f	Turn metadata sync on in mx_regular_user and remove_coordinator	2021-12-16 11:26:24 +03:00
Halil Ozan Akgül	047ae2cad0	Merge pull request #5534 from citusdata/turn_metadata_sync_on_in_multi_unsupported_worker_operations Turn metadata sync on in multi_size_queries, multi_drop_extension and multi_unsupported_worker_operations	2021-12-16 11:25:19 +03:00
Halil Ozan Akgul	b82af4db3b	Turn metadata sync on in multi_size_queries, multi_drop_extension and multi_unsupported_worker_operations	2021-12-16 11:10:54 +03:00
Hanefi Onaldi	9d4d73898a	Move healthcheck logic into new file (#5531 ) and add a missing `CheckCitusVersion(ERROR)` call	2021-12-15 15:58:20 -08:00
Hanefi Onaldi	acdcd9422c	Fix one flaky failure test (#5528 ) Removes flaky test	2021-12-15 18:59:58 +03:00
Hanefi Onaldi	29e4516642	Introduce citus_check_cluster_node_health UDF This UDF coordinates connectivity checks across the whole cluster. This UDF gets the list of active readable nodes in the cluster, and coordinates all connectivity checks in sequential order. The algorithm is: for sourceNode in activeReadableWorkerList: c = connectToNode(sourceNode) for targetNode in activeReadableWorkerList: result = c.execute("SELECT citus_check_connection_to_node(targetNode.name, targetNode.port)") emit sourceNode.name, sourceNode.port, targetNode.name, targetNode.port, result - result -> true -> connection attempt from source to target succeeded - result -> false -> connection attempt from source to target failed - result -> NULL -> connection attempt from the current node to source node failed I suggest you use the following query to get an overview of the connectivity: SELECT bool_and(COALESCE(result, false)) FROM citus_check_cluster_node_health(); Whenever this query returns false, there is a connectivity issue; check in detail.	2021-12-15 01:41:51 +03:00
Hanefi Onaldi	13fff9c37a	Remove NOOP tuplestore_donestoring calls PostgreSQL has not needed calls to this function since the 7.4 release, and it is a NOOP. For more details, check the PostgreSQL commit below: commit dd04e958c8b03c0f0512497651678c7816af3198 Author: Tom Lane <tgl@sss.pgh.pa.us> Date: Sun Mar 9 03:34:10 2003 +0000 tuplestore_donestoring() isn't needed anymore, but provide a no-op macro definition so as not to create compatibility problems. diff --git a/src/include/utils/tuplestore.h b/src/include/utils/tuplestore.h index b46babacd1..76fe9fb428 100644 --- a/src/include/utils/tuplestore.h +++ b/src/include/utils/tuplestore.h @@ -17,7 +17,7 @@ * Portions Copyright (c) 1996-2002, PostgreSQL Global Development Group * Portions Copyright (c) 1994, Regents of the University of California * - * $Id: tuplestore.h,v 1.8 2003/03/09 02:19:13 tgl Exp $ + * $Id: tuplestore.h,v 1.9 2003/03/09 03:34:10 tgl Exp $ * ------------------------------------------------------------------------- */ @@ -41,6 +41,9 @@ extern Tuplestorestate *tuplestore_begin_heap(bool randomAccess, extern void tuplestore_puttuple(Tuplestorestate *state, void *tuple); +/* tuplestore_donestoring() used to be required, but is no longer used */ +#define tuplestore_donestoring(state) ((void) 0) + /* backwards scan is only allowed if randomAccess was specified 'true' */ extern void *tuplestore_gettuple(Tuplestorestate *state, bool forward, bool *should_free);	2021-12-14 18:55:02 +03:00
Halil Ozan Akgül	1c5430635d	Merge pull request #5525 from citusdata/only_drop_dist_indexes_on_metadata_synced_nodes Fix drop index trying to drop coordinator local indexes on metadata worker nodes	2021-12-14 15:37:45 +03:00
Halil Ozan Akgul	e060720370	Fix metadata sync fails in multi_index_statements	2021-12-14 11:28:08 +03:00
Halil Ozan Akgul	a951e52ce8	Fix drop index trying to drop coordinator local indexes on metadata worker nodes	2021-12-14 11:28:08 +03:00
Halil Ozan Akgül	811eda6d0f	Merge pull request #5527 from citusdata/turn_metadata_sync_on_in_multi_copy Fix metadata sync fails on multi_copy	2021-12-14 11:12:15 +03:00
Halil Ozan Akgul	1d7dde2c4c	Fix metadata sync fails on multi_copy	2021-12-14 10:59:59 +03:00
Halil Ozan Akgül	31ffb0981d	Merge pull request #5522 from citusdata/fix_metadata_sync_fails_on_failure_connection_establishment Fix metadata sync fails on failure_connection_establishment	2021-12-14 10:12:45 +03:00
Halil Ozan Akgul	98e38e2e4e	Fix metadata sync fails on failure_connection_establishment	2021-12-13 11:51:56 +03:00
Halil Ozan Akgül	fed1ebaaed	Merge pull request #5521 from citusdata/turn_metadata_sync_on_in_propagate_statistics Fix metadata sync fails on propagate_statistics and pg13_propagate_statistics tests	2021-12-13 10:22:14 +03:00
Halil Ozan Akgul	507df08422	Fix metadata sync fails on propagate_statistics and pg13_propagate_statistics tests	2021-12-09 12:28:11 +03:00
Halil Ozan Akgül	73ba38eac4	Merge pull request #5517 from citusdata/turn_metadata_sync_on_in_base_schedules Turn metadata sync on in base/minimal schedules	2021-12-09 10:14:06 +03:00
Halil Ozan Akgul	351314f8a1	Turn metadata sync on in base/minimal schedules	2021-12-08 13:34:41 +03:00
Halil Ozan Akgül	9471735764	Merge pull request #5511 from citusdata/fix_metadata_sync_fails_on_follower_schedule Fix metadata sync fails on multi_follower_schedule	2021-12-08 13:18:33 +03:00
Halil Ozan Akgul	ee894c9e73	Fix metadata sync fails on multi_follower_schedule	2021-12-08 13:07:37 +03:00
Halil Ozan Akgül	e443d9578f	Merge pull request #5502 from citusdata/turn_metadata_sync_on_in_failure_schedule Turn metadata sync on in failure schedule	2021-12-08 12:32:58 +03:00
Halil Ozan Akgul	4c8f79d7dd	Turn metadata sync on in failure schedule	2021-12-08 11:22:56 +03:00
Halil Ozan Akgül	a7fc79860f	Merge pull request #5514 from citusdata/turn_metadata_sync_on_in_mx_schedule Turn metadata sync on in mx schedule	2021-12-08 10:31:42 +03:00
Halil Ozan Akgul	4f272ea0e5	Fix metadata sync fails in multi_extension	2021-12-08 10:25:43 +03:00
Halil Ozan Akgul	a3834edeaa	Turn metadata sync on in multi_mx_schedule	2021-12-08 10:25:43 +03:00
Halil Ozan Akgül	3fcd8395e6	Merge pull request #5512 from citusdata/turn_metadata_sync_on_in_upgrade_schedules Turn metadata sync on in upgrade schedules	2021-12-08 10:24:50 +03:00
Halil Ozan Akgul	ea37f4fd29	Turn metadata sync on in upgrade schedules	2021-12-08 10:19:02 +03:00
Hanefi Onaldi	05a3dfa8a9	Remove redundant arbitrary config class We had 2 class definitions for CitusCacheManyConnectionsConfig, where one of them was a copy of CitusSmallCopyBuffersConfig. This commit leaves the intended class definition that configures caching many connections, and removes the one that is a copy of another class	2021-12-08 04:47:08 +03:00
Burak Velioglu	a67f518ef0	Merge pull request #5415 from citusdata/velioglu/propagate_pg_dist_object Propagate pg_dist_object to worker nodes	2021-12-06 20:19:07 +03:00
Burak Velioglu	e8534c1dd5	Drop sequence metadata from workers explicitly	2021-12-06 19:25:51 +03:00
Burak Velioglu	21194c3b9d	Mark sequence distributed explicitly while syncing metadata Since sequences are not marked as distributed while creating table if no metadata worker node exists, we are marking all sequences distributed while syncing metadata explicitly.	2021-12-06 19:25:51 +03:00
Burak Velioglu	6d849cf394	Allow delegating function from worker nodes We've both allowed delegating functions and procedures from worker nodes and also prevented delegation if a function/procedure has already been propagated from another node.	2021-12-06 19:25:51 +03:00
Burak Velioglu	a8b1ee87f7	Increment command counter after altering the sequence type	2021-12-06 19:25:51 +03:00
Burak Velioglu	ed8e32de5e	Sync pg_dist_object on an update and propagate while syncing to a new node Before this PR we were updating citus.pg_dist_object metadata, which keeps the metadata related to objects on Citus, only on the coordinator node. In order to allow using those objects from worker nodes (or erroring out with a proper error message) we've started to propagate that metadata to worker nodes as well.	2021-12-06 19:25:50 +03:00
Halil Ozan Akgül	d4ed94d2f2	Merge pull request #5504 from citusdata/fix_multi_table_ddl_with_metadata_sync Fix metadata sync fails of multi_table_ddl	2021-12-06 16:54:21 +03:00
Halil Ozan Akgul	ef09ba0d06	Fix metadata sync fails of multi_table_ddl	2021-12-06 13:44:30 +03:00
Halil Ozan Akgül	ae134c209f	Merge pull request #5503 from citusdata/fix_undist_table_test_metadata_sync_fails Fix fails with metadata syncing in undistribute_table	2021-12-03 14:11:06 +03:00
Halil Ozan Akgul	a6d0de060c	Fix fails with metadata syncing in undistribute_table	2021-12-03 13:58:53 +03:00
Hanefi Onaldi	56e9b1b968	Introduce UDF to check worker connectivity citus_check_connection_to_node runs a simple query on a remote node and reports whether this attempt was successful. This UDF will be used to make sure each worker node can connect to all the worker nodes in the cluster. Parameters: nodename (required), nodeport (optional, default: 5432). Return value: boolean success.	2021-12-03 02:30:28 +03:00
Talha Nisanci	e4ead8f408	Update broken link for upgrade tests (#5408 ) * Update broken link for upgrade tests * Update src/test/regress/README.md Co-authored-by: Nils Dijk <nils@citusdata.com>	2021-12-02 15:25:36 +01:00
Önder Kalacı	ab365a335d	Merge pull request #5486 from citusdata/disable_node_async Allow disabling node(s) when multiple failures happen	2021-12-01 10:48:49 +01:00
Onder Kalaci	549edcabb6	Allow disabling node(s) when multiple failures happen As of master branch, Citus does all the modifications to replicated tables (e.g., reference tables and distributed tables with replication factor > 1), via 2PC and avoids any shardstate=3. As a side-effect of those changes, handling node failures for replicated tables changes. With this PR, when one (or multiple) node failures happen, the users would see query errors on modifications. If the problem is intermittent, that's OK: once the node failure(s) recover by themselves, the modification queries would succeed. If the node failure(s) are permanent, the users should call `SELECT citus_disable_node(...)` to disable the node. As soon as the node is disabled, modifications would start to succeed. However, now the old node gets behind. It means that, when the node is up again, the placements should be re-created on the node. First, use `SELECT citus_activate_node()`. Then, use `SELECT replicate_table_shards(...)` to replicate the missing placements on the re-activated node.	2021-12-01 10:19:48 +01:00
Halil Ozan Akgül	6feb009834	Merge pull request #5499 from citusdata/fix_enterprise_fails Fix enterprise fails	2021-11-30 16:16:30 +03:00
Halil Ozan Akgul	316274b5f0	Add normalize.sed item for multi_fix_partition_shard_index_names test	2021-11-30 13:28:41 +03:00
Halil Ozan Akgul	11072b4cb8	Normalize create role command in drop_partitioned_table test	2021-11-30 12:46:22 +03:00
Onur Tirtir	1836361a51	Add changelog entries for 10.2.3 (#5498 )	2021-11-29 11:48:00 +03:00
Önder Kalacı	5ef0bae06f	Merge pull request #5493 from citusdata/metadata_connecttion Make sure to use a dedicated metadata connection	2021-11-26 14:47:49 +01:00
Onder Kalaci	d405993b57	Make sure to use a dedicated metadata connection With this commit, we make sure to use a dedicated connection per node for all the metadata operations within the same transaction. This is needed because the same metadata (e.g., metadata includes the distributed table on the workers) can be modified across multiple connections. With this connection we guarantee that there is a single metadata connection. But note that this connection can be used for any other operation. In other words, this connection is not only reserved for metadata operations.	2021-11-26 14:36:28 +01:00
Önder Kalacı	7b6588fec0	Merge pull request #5469 from citusdata/make_errors_generic Generalize the error checks while removing node	2021-11-26 14:31:26 +01:00
Onder Kalaci	38b08ebde9	Generalize the error checks while removing node The checks for preventing the removal of a node are very much reference table centric. We are soon going to add the same checks for replicated tables. So, make the checks generic such that: (a) replicated tables fit naturally (b) we can use the same checks in `citus_disable_node`.	2021-11-26 14:25:29 +01:00
Hanefi Onaldi	4c135de9e4	Introduce CI checks for hash comments in specs We do not use comments starting with # in spec files because it creates errors from C preprocessor that expects directives after this character. Instead use C style comments, i.e: // single line comment You can also use multiline comments as well /* * multi line comment */	2021-11-26 14:52:51 +03:00
Halil Ozan Akgül	27a161f443	Merge pull request #5496 from citusdata/multi-1-schedule-with-metadata-syncing Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 13:28:18 +03:00
Halil Ozan Akgul	87a1c760d9	Fix tests in multi-1-schedule that fail with metadata syncing	2021-11-26 12:09:53 +03:00
Önder Kalacı	0beb1aba62	Merge pull request #5470 from citusdata/redefine_active_placements Active placements can only be on active nodes	2021-11-26 09:23:28 +01:00
Onder Kalaci	121f5c4271	Active placements can only be on active nodes We re-define the meaning of active shard placement. It used to only be defined via shardstate == SHARD_STATE_ACTIVE. Now, we also add one more check. The worker node that the placement is on should be active as well. This is a preparation for supporting citus_disable_node() for MX with multiple failures at the same time. With this change, the maintenance daemon only needs to sync the "node metadata" (e.g., pg_dist_node), not the shard metadata.	2021-11-26 09:14:33 +01:00
Önder Kalacı	d6cbfd0886	Merge pull request #5467 from citusdata/remove_useless_locks Do not acquire locks on reference tables when a node is removed/disabled	2021-11-26 09:13:55 +01:00
Onder Kalaci	b4931f7345	Do not acquire locks on reference tables when a node is removed/disabled Before this commit, we acquire the metadata locks on the reference tables while removing/disabling a node on all the MX nodes. Although it has some marginal benefits, such as a concurrent modification during remove/disable node blocking instead of erroring out, the drawbacks seem worse. Both citus_remove_node and citus_disable_node are not tolerant to multiple node failures. With this commit, we relax the locks. The implication is that while a node is removed/disabled, users might see query errors. On the other hand, this change makes removing/disabling nodes more tolerant to multiple node failures.	2021-11-26 09:08:25 +01:00
Onur Tirtir	76b8006a9e	Allow overwriting columnar storage pages written by aborted xacts (#5484 ) When refactoring storage layer in #4907, we deleted the code that allows overwriting a disk page previously written but not known by metadata. Readers can see the change that introduced the code that allows doing so in commit `a8da9acc63`. The reasoning was that, as of 10.2, we started aligning page reservations (`AlignReservation`) for subsequent writes right after allocating pages from disk. That means, even if the writer transaction fails, subsequent writes are guaranteed to allocate a new page and write to there. For this reason, attempting to write to a page allocated before is not possible for a columnar table that the user created when using v10.2.x. However, since the older versions of columnar don't do that, the following example scenario can still result in writing to such a disk page, even if the user has now upgraded to v10.2.x. This is because, when upgrading storage to 2.0 (`ColumnarStorageUpdateIfNeeded`), we calculate `reservedOffset` of the metapage based on the highest used address known by stripe metadata (`GetHighestUsedAddressAndId`). However, stripe metadata doesn't have entries for aborted writes. As a result, the highest used address would be computed by ignoring pages that are allocated but not used. - User attempts writing to columnar table on Citus v10.0x/v10.1x. - Write operation fails for some reason. - User upgrades Citus to v10.2.x. - When attempting to write to the same columnar table, they hit the "attempt to write columnar data .." error since the write operation done in the older version of columnar already allocated that page, and now we are overwriting it. For this reason, with this commit, we re-do the change done in `a8da9acc63`. And for the reasons given above, it wasn't possible to add a test for this commit via usual code-paths. For this reason, added a UDF only for testing purposes so that we can reproduce the exact scenario in our regression test suite.	2021-11-26 07:51:13 +01:00
Onur Tirtir	80849f2444	Merge pull request #5456 from citusdata/col/pg-upgrade-dependency	2021-11-26 09:44:17 +03:00
Onur Tirtir	85da4fc2e0	Merge branch 'master' into col/pg-upgrade-dependency	2021-11-26 09:34:43 +03:00
Onur Tirtir	81af605e07	Fix typo: "no sharding pruning constraints" -> "no shard pruning constraints" (#5490 )	2021-11-25 21:00:44 +01:00
Onur Tirtir	73f06323d8	Introduce dependencies from columnarAM to columnar metadata objects During pg upgrades, we have seen that it is not guaranteed that a columnar table will be created after metadata objects got created. Prior to changes done in this commit, we had such a dependency relationship in `pg_depend`: `columnar_table` ----> `columnarAM` ----> citus extension, with `columnar.storage_id_seq` and `columnar.stripe` each depending directly on the citus extension. Since `pg_upgrade` just knows to follow topological sort of the objects when creating database dump, above dependency graph doesn't imply that `columnar_table` should be created before metadata objects such as `columnar.storage_id_seq` and `columnar.stripe` are created. For this reason, with this commit we add new records to `pg_depend` to make columnarAM depend on all rel objects living in `columnar` schema. That way, `pg_upgrade` will know it needs to create those before creating `columnarAM`, and similarly, before creating any tables using `columnarAM`. Note that in addition to inserting those records via installation script, we also do the same in `citus_finish_pg_upgrade()`. This is because `pg_upgrade` rebuilds catalog tables in the new cluster and that means we must insert them in the new cluster too.	2021-11-23 13:14:00 +03:00
Onur Tirtir	ef2ca03f24	Reproduce bug via test suite	2021-11-23 13:14:00 +03:00
Onur Tirtir	4a97664fd7	Store tmp_upgrade/newData/*.log as an artifact	2021-11-22 18:19:45 +03:00
Burak Velioglu	8d7c497d68	Merge pull request #5480 from citusdata/velioglu/make_object_lock_explicit Make object locking explicit while adding dependencies	2021-11-22 15:56:09 +03:00
Burak Velioglu	6590f12de4	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-22 13:55:36 +03:00
Burak Velioglu	12e05ad196	Sorted addresses before getting lock	2021-11-22 11:43:32 +03:00
Marco Slot	7694569976	Merge pull request #5481 from citusdata/marcocitus/remove-shard-range-update	2021-11-19 11:00:14 +01:00
Marco Slot	f49d26fbeb	Remove citus_update_table_statistics isolation test	2021-11-19 10:51:15 +01:00
Marco Slot	56eae48daf	Stop updating shard range in citus_update_shard_statistics	2021-11-19 10:51:15 +01:00
Burak Velioglu	3a68263cc7	Change lock type	2021-11-19 12:03:17 +03:00
Burak Velioglu	baeaca7bc5	Update comment	2021-11-19 10:51:56 +03:00
Hanefi Onaldi	6ff65db7ee	Merge pull request #5372 from citusdata/fix-broken-drop-schema	2021-11-18 23:58:53 +03:00
Hanefi Onaldi	c0d43d4905	Prevent cache usage on citus_drop_trigger codepaths	2021-11-18 20:24:51 +03:00
Burak Velioglu	77dd12c09d	Merge branch 'master' into velioglu/make_object_lock_explicit	2021-11-18 20:18:07 +03:00
Hanefi Onaldi	e6160ad131	Document failing tests for issue 5099	2021-11-18 20:01:34 +03:00
Hanefi Onaldi	a3cc9b4e53	Remove case block that is identical to its neighbor (#5472 )	2021-11-18 19:41:39 +03:00
Burak Velioglu	b484d9b234	Make object locking explicit while adding dependencies	2021-11-18 19:34:00 +03:00
Marco Slot	77d948a595	Merge pull request #5465 from citusdata/marcocitus/remove-cstore_fdw	2021-11-16 17:43:29 +01:00
Marco Slot	9e6ca23286	Remove cstore_fdw-related logic	2021-11-16 13:59:03 +01:00
Önder Kalacı	8c0bc94b51	Enable replication factor > 1 in metadata syncing (#5392 ) - [x] Add some more regression test coverage - [x] Make sure returning works fine in case of local execution + remote execution (task->partiallyLocalOrRemote works as expected, already added tests) - [x] Implement locking properly (and add isolation tests) - [x] We do #shardcount round-trips on `SerializeNonCommutativeWrites`. We made it a single round-trip. - [x] Acquire locks for subselects on the workers & add isolation tests - [x] Add a GUC to prevent modification from the workers, hence increase the coordinator-only throughput - The performance slightly drops (~15%), unless `citus.allow_modifications_from_workers_to_replicated_tables` is set to false	2021-11-15 15:10:18 +03:00
Hanefi Onaldi	bbcf287f7e	Merge pull request #5462 from citusdata/changelog-10.0.6 Add changelog entries for 10.0.6	2021-11-12 13:11:03 +03:00
Hanefi Onaldi	45549d20a6	Add changelog entries for 10.0.6	2021-11-12 12:38:14 +03:00
Onur Tirtir	25024b776e	Skip deleting options if columnar.options is already dropped (#5458 ) Drop extension might cascade to columnar.options before dropping a columnar table. In that case, we were getting the error below when opening columnar.options to delete records for the columnar table that we are about to drop: "ERROR: could not open relation with OID 0". I somehow reproduced this bug easily when upgrading pg, which is why I added the test to after_pg_upgrade_schedule.	2021-11-12 12:30:09 +03:00
Ahmet Gedemenli	1aa32d5dbc	Merge pull request #5440 from citusdata/default-add-to-metadata-experiment Introduce GUC citus.use_citus_managed_tables	2021-11-11 14:39:03 +03:00
Ahmet Gedemenli	14a33d4e8e	Introduce GUC citus.use_citus_managed_tables	2021-11-11 14:09:06 +03:00
Hanefi Onaldi	3d9cec70fd	Update migration paths from 10.2 to 11.0 (#5459 ) We recently introduced a set of patches to 10.2, and introduced 10.2-4 migration version. This migration version only resides on `release-10.2` branch, and is missing on our default branch. This creates a problem because we do not have a valid migration path from 10.2 to latest 11.0. To remedy this issue, I copied the relevant migration files from `release-10.2` branch, and renamed some of our migration files on default branch to make sure we have a linear upgrade path.	2021-11-11 13:55:28 +03:00
Önder Kalacı	6f5a343ff4	Make sure that enterprise tests pass (#5451 )	2021-11-08 18:11:19 +03:00
Önder Kalacı	98ca6ba6ca	Allow lock_shard_resources to be called by the users with privileges (#5441 ) Before this commit, we required the user to be owner of the shard/table in order to call lock_shard_resources. However, that is too restrictive. We can have users with GRANTS to the table who are not owners of the tables/shards. With this commit, we allow such patterns.	2021-11-08 15:36:51 +01:00
Hanefi Onaldi	db613b2f5c	Merge pull request #5448 from citusdata/changelog-9.5.10	2021-11-08 16:50:12 +03:00
Hanefi Onaldi	7b63edfc83	Add changelog entries for 9.5.10	2021-11-08 16:41:47 +03:00
Önder Kalacı	3bce4d76d3	Merge pull request #5405 from citusdata/simplify_executor_locks Simplify/Unify executor locks	2021-11-08 13:58:11 +01:00
Onder Kalaci	d5e89b1132	Unify distributed execution logic for single replicated tables Citus does not acquire any executor locks for shard replication == 1. With this commit, we unify this decision and exit early.	2021-11-08 13:52:20 +01:00
Hanefi Onaldi	20f3248b6e	Merge pull request #5445 from citusdata/changelog-9.5.9	2021-11-08 14:09:53 +03:00
Hanefi Onaldi	3d49cbf9ab	Add changelog entries for 9.5.9	2021-11-08 13:19:10 +03:00
Önder Kalacı	65911ce162	Merge pull request #5397 from citusdata/naisila/fix-partitioned-index Run fix_partition_shard_index_names after each wrong naming command	2021-11-08 11:09:08 +01:00
Önder Kalacı	d5b371b2e0	Merge branch 'master' into naisila/fix-partitioned-index	2021-11-08 10:53:16 +01:00
Marco Slot	7f162ba834	Merge pull request #5444 from citusdata/marcocitus/remove-master_append_table_to_shard	2021-11-08 10:49:17 +01:00
naisila	385ba94d15	Run fix_partition_shard_index_names after each wrong naming command	2021-11-08 10:43:34 +01:00
Marco Slot	78866df13c	Remove master_append_table_to_shard UDF	2021-11-08 10:43:24 +01:00
Marco Slot	ee0cd75648	Merge pull request #5399 from citusdata/marcocitus/remove-append-copy	2021-11-07 21:09:26 +01:00
Marco Slot	fba93df4b0	Remove copy into new append shard logic	2021-11-07 21:01:40 +01:00
Marco Slot	27ba19f7e1	Fix a flappy test in drop_column_partitioned_table	2021-11-07 18:25:44 +01:00
Nils Dijk	3fcb456381	Refactor/partitioned result destreceiver (#5432 ) This change creates a slightly higher abstraction of the `PartitionedResultDestReceiver` where it decouples the partitioning from writing it to a file. This allows for easier reuse for other `DestReceiver`'s that would like to route different tuples to different `DestReceiver`'s. Originally there was a lot of state kept in `PartitionedResultDestReceiver` to be able to lazily create `FileDestReceivers` when the first tuple arrived for that target. This convoluted the implementation of the processing of tuples with where they should go. This refactor changes that where it makes the `PartitionedResultDestReceiver` completely agnostic of what kind of Receivers it is writing to. When constructed you pass it a list of `DestReceiver` compatible pointers with the length of `partitionCount`. Internally the `PartitionedResultDestReceiver` keeps track of which `DestReceiver`'s have been started or not, and starts them when they first receive a tuple. Alternatively, if the instantiating code of the `PartitionedResultDestReceiver` wants, the startup can be turned from lazy to eager. When the startup is eager (not lazy) all `rStartup` functions on the list of `DestReceiver`'s are called during the startup of the `PartitionedResultDestReceiver` and marked as such. A downside of this approach is the following. On highly partitioned destinations we now need to allocate a `FileDestReceiver` for every target, _always_. When the data passed into the `PartitionedResultDestReceiver` is highly skewed to a small set of `FileDestReceiver`'s this will waste some memory. Given the small size of a `FileDestReceiver`, and the fact that actual file handles are only created during the processing of the startup of the `FileDestReceiver` I think this memory waste is not a problem. 
If this would become a problem we could refactor the source list into some kind of generator object which can generate the `DestReceiver`'s on the fly.	2021-11-05 13:31:18 +01:00
Nils Dijk	0e7cf9f0ca	reinstate optimization that got unintentionally broken in `366461ccdb` (#5418 ) DESCRIPTION: Reinstate optimisation for uniform shard interval ranges During a refactor introduced in #4132 the following change was made, which made the optimisation in `CalculateUniformHashRangeIndex` unreachable: `366461ccdb (diff-565a339ed3c78bc5a0d4ffeb4e91032150b1dffbeeff59cd3e65981d20b998c7L319-R319)` This PR reinstates the path to the optimisation!	2021-11-05 13:07:51 +01:00
Önder Kalacı	763176a4d9	Some minor improvements on top of 5314 (#5428 ) * Refactor some checks in citus local tables * all existing citus local tables are auto converted after upgrade * Update warning messages in CreateCitusLocalTable * Hide notice msg for auto converting local tables * Hide hint msg Co-authored-by: Ahmet Gedemenli <afgedemenli@gmail.com>	2021-11-05 13:59:13 +03:00
Sait Talha Nisanci	ab29c25658	Fix missing from entry	2021-11-04 18:54:52 +03:00
Halil Ozan Akgül	a23f1fb259	Merge pull request #5417 from citusdata/fix_isolation_schedule_with_mx Turns mx on in isolations tests	2021-11-04 17:18:50 +03:00
Halil Ozan Akgul	a8f3f712cc	Turns mx on in isolations tests	2021-11-04 17:12:30 +03:00
Ahmet Gedemenli	b30ed46068	Fixes ALTER STATISTICS IF EXISTS bug (#5435 ) * Fix ALTER STATISTICS IF EXISTS bug	2021-11-04 16:14:05 +03:00
Onur Tirtir	7597e5aee9	Merge pull request #5433 from citusdata/cl-928 Add changelog for 9.2.8	2021-11-04 15:08:01 +03:00
Onur Tirtir	4b598da672	Add changelog for 9.2.8	2021-11-04 14:56:29 +03:00
Halil Ozan Akgül	9bfff4ba8d	Merge pull request #5391 from citusdata/fix_multi_cluster_management_with_mx Fix multi cluster management with metadata syncing enabled	2021-11-04 12:01:24 +03:00
Halil Ozan Akgul	91b377490b	Fix multi_cluster_management fails for metadata syncing	2021-11-04 11:09:21 +03:00
Talha Nisanci	19f28eabae	Fix citus upgrade local run issues (#5414 ) This PR is fixing 2 separate issues related to the local run of citus upgrade tests. `d3e7c825ab` fixes the issue that, with our new testing infrastructure, we moved/renamed some of existing folders. This created a problem for local runs of citus upgrade tests since some paths were sensitive to such changes. This commit tries to make it more generic so that this issue is less likely to happen in the future, while also fixing the current issue. `93de6b60c3` we are fixing an issue that a new environment variable was added for citus upgrade tests, which is defined in the CI. `0cb51f8c37/.circleci/config.yml (L294)` This environment variable wasn't set in our local runs hence it would create problems. Instead of defining this environment variable in the local run, we change the citus_upgrade run command to use an existing env variable, which is now also set in the CI.	2021-11-03 16:17:36 +03:00
Jelte Fennema	9b784e58bf	Add tests for special hash values (#5431 ) We fixed some crashes a while back that would only occur in cases where the value of a distribution column would result in a very high or a very low hash value. This adds a regression test for those crashes.	2021-11-03 13:42:39 +01:00
Jelte Fennema	0cb51f8c37	Test a query that failed on 9.5.8 when coordinator is in metadata (#5412 ) This test starts passing because of PR #4508, to be precise commit: `24e60b44a1` When I undo that commit this newly added test starts failing. This adds this test to make sure we don't regress on this again.	2021-11-03 12:27:28 +01:00
Onur Tirtir	d691148e7e	Merge pull request #5430 from citusdata/cl-927 Add changelog for 9.2.7	2021-11-03 12:06:29 +03:00
Onur Tirtir	2535d15121	Add changelog for 9.2.7	2021-11-03 11:13:54 +03:00
Halil Ozan Akgül	187ec01dd6	Merge pull request #5402 from citusdata/remove_ensure_coordinator_from_metadata_sync Remove EnsureSuperUser from StartMetadataSyncToNode	2021-11-01 18:11:29 +03:00
Halil Ozan Akgul	c0785d570c	Remove EnsureSuperUser from start and stop metadata sync to node	2021-11-01 18:01:49 +03:00
Halil Ozan Akgül	a350feb13c	Merge pull request #5403 from citusdata/reuse_to_be_deleted_connection_in_same_transaction Don't skip connections with forceCloseAtTransactionEnd that Sent Begin in FindAvailableConnection	2021-11-01 17:59:40 +03:00
Halil Ozan Akgul	c0eb67b24f	Skip forceCloseAtTransactionEnd connections only if BEGIN was not sent on them	2021-11-01 17:43:04 +03:00
Jelte Fennema	57a0228c52	Fix string-concatenation warning on Clang 13 (#5425 ) Clang 13 complains about a suspicious string concatenation. It thinks we might have missed a comma. This adds parentheses to make it clear that concatenation is indeed what we meant.	2021-11-01 13:55:43 +03:00
Marco Slot	53882f4723	Merge pull request #5268 from citusdata/renaming	2021-10-30 10:03:27 +02:00
naisila	796d56a7b1	Rename ddlJob->commandString to ddlJob->metadataSyncCommand	2021-10-29 23:45:43 +03:00
Jelte Fennema	b19979fda5	Update install command to work on Ubuntu 20.04 (#5362 ) libxslt-dev was renamed to libxslt1-dev in Ubuntu 20.04. This is also an alias for this package on Ubuntu 18.04, so this new command works there too.	2021-10-28 04:13:09 -07:00
Ahmet Gedemenli	67dca4363d	Don't auto-undistribute user-added citus local tables (#5314 ) * Disable auto-undistribute for user-added citus local tables	2021-10-28 12:10:26 +03:00
Nils Dijk	f4297f774a	Bump mitmproxy version (#5334 ) There is a vulnerability in mitmproxy with the version we are using. It would be hard to exploit anything with regards to the artifacts we ship as it's only used in our test suite. Still, it's good hygiene to _not_ use software with known vulnerabilities. This PR updates the version of python, mitmproxy and the crypto libraries used. The latest version of mitmproxy for python 3.6 is not patched, hence the upgrade of python. For our CI images this cascades into upgrading debian as well :) For CI we bake these versions in our images so we need to update them as well. Changes to the CI images: https://github.com/citusdata/the-process/pull/65	2021-10-27 17:57:13 +02:00
Jelte Fennema	a8cbeb1047	Fix docs of arbitrary configs (#5413 ) The old command would run none of the tests. The new command runs all of the tests for the given configs.	2021-10-27 17:16:24 +02:00
Philip Dubé	44204ec9f1	Merge pull request #5401 from citusdata/fix-typos Fix typos. Spurred spotting "connectios" in logs	2021-10-25 15:33:16 +00:00
Philip Dubé	cc50682158	Fix typos. Spurred spotting "connectios" in logs	2021-10-25 13:54:09 +00:00
Jelte Fennema	3bdbfc3edf	Fix duplicate typedef which can cause compile failures (#5406 ) ColumnarScanDesc is already defined in columnar_tableam.h. Redefining it again causes a compiler error on some C compilers. Useful reference: https://bugzilla.redhat.com/show_bug.cgi?id=767538 Fixes #5404	2021-10-25 12:20:13 +00:00
Önder Kalacı	fc00ddee4e	Merge pull request #5386 from citusdata/simplify_2pc_decision Simplify 2PC decision in the executor	2021-10-23 09:35:22 +02:00
Onder Kalaci	ce4c4540c5	Simplify 2PC decision in the executor It seems like the decision for 2PC is more complicated than it should be. With this change, we make one behavioral change. In essence, before this commit, when a SELECT task with replication factor > 1 was executed, the executor was triggering 2PC. And, in fact, the transaction manager (`ConnectionModifiedPlacement()`) was able to understand not to trigger 2PC when no modification happens. However, for transaction blocks like: BEGIN; -- a command that triggers 2PC -- A SELECT command on replication > 1 .. COMMIT; The SELECT used to be qualified as requiring 2PC. And, as a side-effect, the executor was setting `xactProperties.errorOnAnyFailure = true;` So, the commands were failing at the time of execution. Now, they fail at the end of the transaction.	2021-10-23 09:06:28 +02:00
Önder Kalacı	8664e6873f	Merge pull request #5381 from citusdata/do_not_mark_placements_invalid Drop support Inactive Shard States	2021-10-23 09:04:50 +02:00
Onder Kalaci	575bb6dde9	Drop support for Inactive Shard placements Given that we do all operations via 2PC, there is no way for any placement to be marked as INACTIVE.	2021-10-22 18:03:35 +02:00
Önder Kalacı	b3299de81c	Drop support for citus.multi_shard_commit_protocol (#5380 ) In the past, we allowed users to manually switch to 1PC (e.g., one phase commit). However, with this commit, we don't. All multi-shard modifications are done via 2PC.	2021-10-21 14:01:28 +02:00
Marco Slot	e4760e348a	Merge pull request #5389 from citusdata/marcocitus/remove-master_get_table_metadata	2021-10-21 12:21:40 +02:00
Marco Slot	df43868369	Remove PG11 expected upgrade_list_citus_objects output	2021-10-21 12:08:05 +02:00
Marco Slot	dafba6c242	Deprecate master_get_table_metadata UDF	2021-10-21 12:08:05 +02:00
Marco Slot	3b7641b32d	Merge pull request #5396 from citusdata/marcocitus/opclass-parameters Support operator class parameters in indexes	2021-10-21 10:38:11 +02:00
Marco Slot	defb97b7f5	Support operator class parameters in indexes	2021-10-20 17:03:59 +02:00
Önder Kalacı	3f726c72e0	When replication factor > 1, all modifications are done via 2PC (#5379 ) With Citus 9.0, we introduced `citus.single_shard_commit_protocol` which defaults to 2PC. With this commit, we prevent any user from setting it to 1PC and drop support for `citus.single_shard_commit_protocol`. Although this might add some overhead for users, it is already the default behaviour (so less likely) and marking placements as INVALID is much worse.	2021-10-20 01:39:03 -07:00
Sait Talha Nisanci	a851211dbc	Run tests sequentially	2021-10-19 18:35:26 +03:00
Marco Slot	641ef9bd6f	Fix flappy subquery_append test	2021-10-19 15:29:01 +02:00
Sait Talha Nisanci	56abd3d501	Increase parallelism	2021-10-19 15:38:58 +03:00
Marco Slot	d9e36820f4	Merge pull request #5361 from citusdata/marcocitus/remove-append-4	2021-10-19 12:50:22 +02:00
Marco Slot	096660d61d	Remove master_apply_delete_command	2021-10-18 22:29:37 +02:00
Marco Slot	9571311c65	Merge pull request #5359 from citusdata/marcocitus/remove-append-3	2021-10-18 22:07:41 +02:00
Marco Slot	bece86b2f7	Add some subquery on append-distributed table tests	2021-10-18 21:11:16 +02:00
Marco Slot	93e79b9262	Never allow co-located joins of append-distributed tables	2021-10-18 21:11:16 +02:00
Marco Slot	b97e5081c7	Disable co-located joins for append-distributed tables	2021-10-18 21:11:16 +02:00
Marco Slot	dfad73d918	Disable implicit single re-partition joins for append tables	2021-10-18 21:11:16 +02:00
Marco Slot	2206e64e42	Disable single-repartition joins for append tables	2021-10-18 21:11:16 +02:00
Sait Talha Nisanci	6ff2083311	Remove base test as it is not useful anymore	2021-10-18 20:31:18 +03:00
Sait Talha Nisanci	7336c03c22	Add local-dist table joins to arbitrary configs	2021-10-18 20:31:18 +03:00
Önder Kalacı	31c8f279ac	Add helper UDFs to inspect object dependencies (#5293 ) - citus_get_all_dependencies_for_object: emulate what Citus would qualify as dependency when adding a new node - citus_get_dependencies_for_object: emulate what Citus would qualify as dependency when creating an object Example use: ```SQL -- find all the dependencies of table test SELECT pg_identify_object(t.classid, t.objid, t.objsubid) FROM (SELECT * FROM pg_get_object_address('table', '{test}', '{}')) as addr JOIN LATERAL citus_get_all_dependencies_for_object(addr.classid, addr.objid, addr.objsubid) as t(classid oid, objid oid, objsubid int) ON TRUE ORDER BY 1; ```	2021-10-18 14:46:49 +03:00
Halil Ozan Akgül	169084f1bb	Merge pull request #5382 from citusdata/fix_comma_bug_in_ShardListInserCommand Fix the extra comma bug in ShardListInsertCommand	2021-10-18 13:37:41 +03:00
Halil Ozan Akgul	e3446692f3	Fix the bug by adding comma before the values	2021-10-15 18:42:23 +03:00
Halil Ozan Akgül	b576e69260	Merge pull request #5378 from citusdata/turn_mx_on_columnar_schedule Fix the tests that fail with MX in columnar_schedule	2021-10-15 13:28:53 +03:00
Halil Ozan Akgul	3fb996f6de	Fix the tests that fail with MX in columnar_schedule	2021-10-15 13:09:01 +03:00
Halil Ozan Akgül	eca784d088	Merge pull request #5377 from citusdata/turn_mx_on_multi_schedule Fix tests that fail with MX in multi_schedule	2021-10-15 13:08:30 +03:00
Halil Ozan Akgul	b710e0064d	Fix tests that fail with MX in multi_schedule	2021-10-15 12:58:38 +03:00
Onur Tirtir	59caaf0a54	Merge pull request #5375 from citusdata/update-cl-10.2.2 Add changelog for 10.2.2	2021-10-14 14:29:55 +03:00
Onur Tirtir	c2ea886085	Add changelog for 10.2.2	2021-10-14 14:09:57 +03:00
Ahmet Gedemenli	35f6fe5f9f	Refactor/Improve PreprocessAlterTableStmtAttachPartition (#5366 ) * Refactor/Improve PreprocessAlterTableStmtAttachPartition	2021-10-14 11:39:39 +03:00
SaitTalhaNisanci	de61a89083	Fix sql_schedule_name problem (#5371 )	2021-10-13 13:10:00 +02:00
Hanefi Onaldi	3e64dc44c8	Fix some typos in comments (#5369 )	2021-10-13 13:00:39 +03:00
Önder Kalacı	af876bf452	Add value materialization test (#5368 )	2021-10-13 09:08:24 +02:00
SaitTalhaNisanci	a39859bc74	Remove unnecessary output (#5367 )	2021-10-13 09:28:01 +03:00
SaitTalhaNisanci	3f65751d43	Add an infrastructure to run same tests with arbitrary configs (#5316 ) To run tests in parallel use: ```bash make check-arbitrary-configs parallel=4 ``` To run tests sequentially use: ```bash make check-arbitrary-configs parallel=1 ``` To run only some configs: ```bash make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig,CitusSmallSharedPoolSizeConfig ``` To run only some test files with some config: ```bash make check-arbitrary-base CONFIGS=CitusSingleNodeClusterConfig EXTRA_TESTS=dropped_columns_1 ``` To get a deterministic run, you can give the random's seed: ```bash make check-arbitrary-configs parallel=4 seed=12312 ``` The `seed` will be in the output of the run. In our regular regression tests, we can see all the details about either planning or execution but this means we need to run the same query under different configs/cluster setups again and again, which is not really maintainable. When we don't care about the internals of how planning/execution is done but the correctness, especially with different configs this infrastructure can be used. With `check-arbitrary-configs` target, the following happens: - a bunch of configs are loaded, which are defined in `config.py`. These configs have different settings such as different shard count, different citus settings, postgres settings, worker amount, or different metadata. - For each config, a separate data directory is created for tests in `tmp_citus_test` with the config's name. - For each config, `create_schedule` is run on the coordinator to setup the necessary tables. - For each config, `sql_schedule` is run. `sql_schedule` is run on the coordinator if it is a non-mx cluster. And if it is mx, it is either run on the coordinator or a random worker. - Tests results are checked if they match with the expected. When tests results don't match, you can see the regression diffs in a config's datadir, such as `tmp_citus_tests/dataCitusSingleNodeClusterConfig`. 
We also have a PostgresConfig which runs all the test suite with Postgres. By default configs use regular user, but we have a config to run as a superuser as well. So the infrastructure tests: - Postgres vs Citus - Mx vs Non-Mx - Superuser vs regular user - Arbitrary Citus configs When you want to add a new test, you can add the create statements to `create_schedule` and add the sql queries to `sql_schedule`. If you are adding Citus UDFs that should be a NO-OP for Postgres, make sure to override the UDFs in `postgres.sql`. You can add your new config to `config.py`. Make sure to extend either `CitusDefaultClusterConfig` or `CitusMXBaseClusterConfig`. On the CI, upon a failure, all logfiles will be uploaded as artifacts, so you can check the artifacts tab. All the regressions will be shown as part of the job on CI. In your local, you can check the regression diffs in config's datadirs as in `tmp_citus_tests/dataCitusSingleNodeClusterConfig`.	2021-10-12 14:24:19 +03:00
Teja Mupparti	a8348047c5	Pushdown procedures with OUT parameters (#5348 )	2021-10-11 23:14:36 -07:00
Onur Tirtir	877d21d3f4	Merge pull request #5360 from citusdata/cleanup/compat Since we dropped support for pg versions not having oid attribute in catalog tables, removing those functions.	2021-10-11 12:00:45 +03:00
Onur Tirtir	f7f4a93073	Remove get_relation_trigger_oid_compat	2021-10-11 11:53:00 +03:00
Onur Tirtir	a1e0511583	Remove get_relation_constraint_oid_compat	2021-10-11 11:53:00 +03:00
Ahmet Gedemenli	47e28a4faf	Merge pull request #5296 from citusdata/partitioning-for-citus-local-tables Add partitioning support for citus local tables	2021-10-11 11:42:01 +03:00
Ahmet Gedemenli	d19793c174	Add partitioning support for citus local tables Add/fix tests Fix creating partitions Add test for mx - partition creating case Enable cascading to partitioned tables Fix mx partition adding test Fix cascading through fkeys Style Disable converting with non-inherited fkeys Fix detach bug Early return in case of cascade & Add tests Style Fix undistribute_table bug & Fix test outputs Remove RemovePartitionRelationIds Test with undistribute_table Add test for mx+convert+undistribute Remove redundant usage of CreatePartitionedCitusLocalTable Add some comments Introduce bulk functions for generating attach/detach partition commands Fix: Convert partitioned tables after adding fkey Change the error message for partitions Introduce function ErrorIfPartitionTableAddedToMetadata Polish attach/detach command generation functions Use time_partitions for testing Move mx tests to citus_local_tables_mx Add new partitioned table to cascade test Add test with time series management UDFs Fix test output Fix: Assertion fail on relation access tracking Style Refactor creating partitioned citus local tables Remove CreatePartitionedCitusLocalTable Style Error out if converting multi-level table Revert some old tests Error out adding partitioned partition Polish Polish/address Fix create table partition of case Use CascadeOperationForRelationIdList if no cascade needed Fix create partition bug Revert / Add new tests to mx Style Fix dropping fkey bug Add test with IF NOT EXISTS Convert to CLT when doing ATTACH PARTITION Add comments Add more tests with time series management Edit the error message for converting the child Use OR instead of AND in ErrorIfUnsupportedAlterTableStmt Edit/improve tests Disable ddl prop when dropping default column definitions Disable/enable ddl prop just before/after the command Add comment Add sequence test Add trigger test Remove NeedCascadeViaForeignKeys Add one more insert to sequence test Add comment Style Fix test output shard 
ids Update comments Disable creating fkey on partitions Move partition check to CreateCitusLocalTable Add comment Add check for attaching multi-level partition Add test for pg_constraint Check pg_dist_partition in tests Add test inserting on the worker	2021-10-11 10:45:07 +03:00
Marco Slot	09a070221a	Merge pull request #5356 from citusdata/marcocitus/remove-append-2 Reduce reliance on append tables in regression tests	2021-10-08 21:39:18 +02:00
Marco Slot	386d2567d4	Reduce reliance on append tables in regression tests	2021-10-08 21:27:14 +02:00
Halil Ozan Akgül	b288cc9d1f	Merge pull request #5344 from citusdata/turn_mx_on Turn MX on by default	2021-10-08 18:42:52 +03:00
Halil Ozan Akgul	9c9d4b5eeb	Turn MX on by default	2021-10-08 18:17:21 +03:00
Naisila Puka	99d3785b5c	Fix flaky test in multi_fix_partition_shard_index_names.sql (#5364 )	2021-10-08 18:03:34 +03:00
Naisila Puka	d0390af72d	Add fix_partition_shard_index_names udf to fix currently broken names (#5291 ) * Add udf to include shardId in broken partition shard index names * Address reviews: rename index such that operations can be done on it * More comprehensive index tests * Final touches and formatting	2021-10-07 19:34:52 +03:00
Marco Slot	fb34b518af	Merge pull request #5350 from citusdata/marcocitus/fix-index-deparsing	2021-10-06 13:30:06 +02:00
Marco Slot	91b647024a	Fixes CREATE INDEX deparsing issue	2021-10-06 13:08:16 +02:00
Onur Tirtir	5d8f74bd0b	(Share) Lock buffer page when reading from columnar storage (#5338 ) Under high write concurrency, we were sometimes reading columnar metapage as all zeros. In `WriteToBlock()`, if `clear == true`, then it will clear the page before writing the new one, rather than just adding data to the page. That means any concurrent connection that is holding only a pin will be able to see the all-zero state between the `InitPage()` and the `memcpy_s()`. Moreover, postgres/storage/buffer/README states that: > Buffer access rules: > > 1. To scan a page for tuples, one must hold a pin and either shared or > exclusive content lock. To examine the commit status (XIDs and status bits) > of a tuple in a shared buffer, one must likewise hold a pin and either shared > or exclusive lock. For those reasons, we have to make sure to never keep a pin on the page without (at least) the shared lock, to avoid having such problems.	2021-10-06 11:57:02 +03:00
Halil Ozan Akgül	52879fdc96	Merge pull request #5285 from citusdata/typos_in_comment_functions Fixes function names in comments	2021-10-06 10:52:45 +03:00
Halil Ozan Akgul	43d5853b6d	Fixes function names in comments	2021-10-06 09:24:43 +03:00
Hanefi Onaldi	c27b5aa7f8	Merge pull request #5335 from citusdata/bump-to-11.0devel Bump Citus to 11.0devel	2021-10-05 11:20:32 +03:00
Hanefi Onaldi	a74409f24c	Bump Citus to 11.0devel	2021-10-01 22:21:22 +03:00
Önder Kalacı	7bd746b91a	Merge pull request #5332 from citusdata/pg_14_updates Reflect PG14 changes in the readme	2021-10-01 19:25:45 +02:00
Önder Kalacı	47c2483ed7	Merge branch 'master' into pg_14_updates	2021-10-01 13:48:21 +02:00
Onur Tirtir	fe72e8bb48	Discard index deletion requests made to columnarAM (#5331 ) A write operation might trigger index deletion if index already had dead entries for the key we are about to insert. There are two ways of index deletion: a) simple deletion b) bottom-up deletion (>= pg14) Since columnar_index_fetch_tuple never sets all_dead to true, columnarAM doesn't ever expect to receive simple deletion requests (columnar_index_delete_tuples) as we don't mark any index entries as dead. However, since columnarAM doesn't delete any dead entries via simple deletion, postgres might ask for a more comprehensive deletion (i.e.: bottom-up) at some point when pg >= 14. So with this commit, we start gracefully ignoring bottom-up deletion requests made to columnar_index_delete_tuples. Given that users can anyway "VACUUM FULL" their columnar tables, we don't see any problem in ignoring deletion requests.	2021-10-01 14:32:47 +03:00
Onur Tirtir	0ce6650c88	Use pg14 in CONTRIBUTING.md	2021-10-01 12:38:55 +02:00
Onder Kalaci	0995cfef49	Reflect PG14 changes in the readme	2021-10-01 10:03:27 +02:00
SaitTalhaNisanci	d7fde7dd1a	upgrade to 14.0 (#5330 )	2021-09-30 17:27:37 +03:00
Önder Kalacı	c2311b4c0c	Make (columnar.stripe) first_row_number index a unique constraint (#5324 ) * Make (columnar.stripe) first_row_number index a unique constraint Since stripe_first_row_number_idx is required to scan a columnar table, we need to make sure that it is created before doing anything with columnar tables during pg upgrades. However, a plain btree index is not a dependency of a table, so pg_upgrade cannot guarantee that stripe_first_row_number_idx gets created when creating columnar.stripe, unless we make it a unique "constraint". To do that, drop stripe_first_row_number_idx and create a unique constraint with the same name to keep the code change at minimum. * Add more pg upgrade tests for columnar * Fix a logic error in upgrade_columnar_after test Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-30 10:51:56 +03:00
Jelte Fennema	97077c5c4a	Check more exit codes in upgrade tests (#5323 ) We were trying to find the cause for a strange update bug. We thought `pg_upgrade` succeeded and then were surprised that certain data was not in the database after the upgrade. Instead `pg_upgrade` had failed halfway through with an actionable error. It took us pretty long to realise this. This commit adds checking of exit codes to a lot more subprocess executions. That should make debugging in the future much easier.	2021-09-24 15:51:00 +02:00
Onur Tirtir	726b3b90a5	Merge pull request #5322 from citusdata/changelog-10.2.1 Add changelog for 10.2.1	2021-09-24 13:08:46 +03:00
Onur Tirtir	2f1bf9499f	Add changelog for 10.2.1	2021-09-24 12:48:13 +03:00
SaitTalhaNisanci	800ad5eca6	Update images to use rc (#5320 )	2021-09-24 11:15:37 +03:00
Onur Tirtir	67de6be913	Merge pull request #5319 from citusdata/fix-clog-lookup BuildStripeMetadata() calls HeapTupleHeaderGetXmin(), which must only be called on a proper heap tuple with MVCC information. Make sure the caller passes the heap tuple, and not a datum tuple.	2021-09-24 10:55:36 +03:00
Jeff Davis	d49d321eac	Columnar: only call BuildStripeMetadata() with heap tuple. BuildStripeMetadata() calls HeapTupleHeaderGetXmin(), which must only be called on a proper heap tuple with MVCC information. Make sure the caller passes the heap tuple, and not a datum tuple. Fixes #5318.	2021-09-23 15:51:01 -07:00
Teja Mupparti	9cc125166b	Merge pull request #5310 from citusdata/teja_colocate_partitions Partition shards to be colocated with parent shards	2021-09-22 16:18:30 -07:00
tejeswarm	a1604a87e6	Partition shards to be colocated with the parent shards	2021-09-22 14:47:04 -07:00
Onur Tirtir	77a2dd68da	Revoke read access to columnar.chunk from unprivileged user (#5313 ) Since this could expose chunk min/max values to unprivileged users.	2021-09-22 16:23:02 +03:00
Onur Tirtir	8a769ec916	Merge pull request #5278 from citusdata/col/pushdown-boolexpr Columnar CustomScan: Pushdown BoolExpr's as we do before	2021-09-22 11:13:47 +03:00
Onur Tirtir	68335285b4	Columnar CustomScan: Pushdown BoolExpr's as we do before	2021-09-22 10:51:34 +03:00
Onur Tirtir	e6ed764f63	Check if xact id is in progress before checking if aborted (#5312 )	2021-09-21 21:20:31 +03:00
Onur Tirtir	f8b1ff7214	Add CheckCitusVersion() calls to columnarAM (#5308 ) Considering all code-paths that we might interact with a columnar table, add `CheckCitusVersion` calls to tableAM callbacks: - initializing table scan (`columnar_beginscan` & `columnar_index_fetch_begin`) - setting a new filenode for a relation (storage initialization or a table rewrite) - truncating the storage - inserting tuple (single and multi) Also add `CheckCitusVersion` call to: - drop hook (`ColumnarTableDropHook`) - `alter_columnar_table_set` & `alter_columnar_table_reset` UDFs	2021-09-20 17:26:41 +03:00
Önder Kalacı	3b588359d2	Merge pull request #5307 from citusdata/add_missing_version_checks Add missing version checks for citus_internal_XXX functions	2021-09-20 10:00:33 +02:00
Onder Kalaci	cea937f52f	Add missing version checks for citus_internal_XXX functions	2021-09-20 09:54:35 +02:00
Hanefi Onaldi	b29dd95e19	Merge pull request #5302 from citusdata/missing_v_changelog	2021-09-17 20:19:06 +03:00
Gürkan İndibay	c495552255	Missing v in changelogs	2021-09-17 15:24:57 +03:00
Hanefi Onaldi	2e6b78133c	Merge pull request #5301 from citusdata/changelog-10.1.3	2021-09-17 15:01:29 +03:00
Hanefi Onaldi	0d67b7f479	Merge branch 'master' into changelog-10.1.3	2021-09-17 14:53:51 +03:00
SaitTalhaNisanci	35ff513dfe	Give proper error while distributing a temp table (#5269 )	2021-09-17 14:34:40 +03:00
Hanefi Onaldi	c995a55641	Add changelog entries for 10.1.3	2021-09-17 14:32:17 +03:00
Önder Kalacı	63fa24fedd	Merge pull request #5300 from citusdata/improve_read_me Reflect Citus 10.2 changes in the README	2021-09-17 10:18:39 +02:00
Onder Kalaci	9b24553b46	Reflect Citus 10.2 changes in the README I tried with Linux (ubuntu 20.04) and CentOS 8.3 - Gen1, and everything works as expected. Also updated the README to reflect index support on columnar tables.	2021-09-17 09:55:36 +02:00
Hanefi Onaldi	c82d82f921	Merge pull request #5292 from citusdata/changelog-9.5.8	2021-09-15 19:22:26 +03:00
Hanefi Onaldi	92af115f21	Add changelog entries for 9.5.8	2021-09-15 15:53:36 +03:00
Gurkan Indibay	082667a985	Add changelog entries for 10.2.0	2021-09-15 05:20:13 +03:00
jeff-davis	6e8b19984e	Columnar: separate plan and runtime quals. (#5261 ) * Columnar: separate plain and exec quals. Make a clear separation between plain quals, which contain constants or extern params; and exec quals, which contain exec params and can't be evaluated until a rescan. Fixes #5258. * more vanilla tests Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-09-13 10:54:53 -07:00
jeff-davis	d48ceee238	Columnar: add method ReparameterizeCustomPathByChild. (#5275 ) When performing a partition-wise join, the planner will adjust paths parameterized by the parent rel to instead parameterize by the child rel directly. When this reparameterization happens, we also need to adjust the join quals to reference the child rather than the parent. Fixes #5257.	2021-09-13 10:33:48 -07:00
Onur Tirtir	ea61efb63a	Not flush writes until need to read them when doing index-scan on columnar (#5247 ) Not flush pending writes if given tid belongs to a "flushed" or "aborted" stripe write, or to an "in-progress" stripe write of another backend. That way, we would reduce the cases where we flush single-tuple stripes during index scan. To do that, we follow below steps for index look-up's: - Do not flush any pending writes and do stripe metadata look-up for given tid. If tuple with tid is found, then no need to do another look-up since we already found the tuple without needing to flush pending writes. - If tuple is not found without flushing pending writes, then we have two scenarios: - If given tid belongs to a pending write of my backend, then do stripe metadata look-up for given tid. But this time first flush any pending writes. - Otherwise, just return false from `index_fetch_tuple` since flushing pending writes wouldn't help.	2021-09-13 18:41:20 +02:00
Onur Tirtir	4ee0fb2758	Make sure to skip aborted writes when reading the first tuple (#5274 ) With `5825c44d5f`, we made the changes to skip aborted writes when scanning a columnar table. However, looks like we forgot to handle such cases for the very first call made to columnar_getnextslot. That means, that commit only considered the intermediate stripe read operations. However, functions called by columnar_getnextslot to find first stripe to read (ColumnarBeginRead & ColumnarRescan) were not caring about those aborted writes. To fix that, we teach AdvanceStripeRead to find the very first stripe to read, and then start using it where we were blindly calling FindNextStripeByRowNumber.	2021-09-13 11:50:53 +03:00
Burak Velioglu	531ad83b8c	Merge pull request #5263 from citusdata/velioglu/handle_errors_on_abort Swallow errors while aborting remote transactions	2021-09-10 11:18:47 +03:00
Burak Velioglu	ceec5d72e3	Swallow errors while aborting remote transactions	2021-09-10 11:06:16 +03:00
Naisila Puka	a69abe3be0	Fixes bug about int and smallint sequences on MX (#5254 ) * Introduce worker_nextval udf for int&smallint column defaults * Fix current tests and add new ones for worker_nextval	2021-09-09 23:41:07 +03:00
Nils Dijk	80a44a7b93	prevent double inclusion of columnar_tableam.h (#5266 ) Recently there are some warnings during the compilation of Citus. Part of the warnings come due to the `columnar_tableam.h` header not being properly guarded with defines and ifndef's. This PR fixes these warnings.	2021-09-09 17:37:58 +02:00
Onur Tirtir	be74518965	Improve memset calls made to reset bool arrays (#5262 )	2021-09-09 17:56:03 +03:00
Halil Ozan Akgül	f4428412a0	Merge pull request #5234 from citusdata/cte_with_search_clause Errors for CTEs with search clause	2021-09-09 13:53:44 +03:00
Halil Ozan Akgul	19af1cef2f	Errors for CTEs with search clause Relevant PG commit: 3696a600e2292d43c00949ddf0352e4ebb487e5b	2021-09-09 13:48:24 +03:00
Marco Slot	b3f1a94688	Merge pull request #5256 from citusdata/marcocitus/worker_append_table_to_shard Perform copy command as regular user in worker_append_table_to_shard	2021-09-09 12:37:36 +02:00
Marco Slot	f84164a000	Avoid switch to superuser in worker_merge_files_into_table	2021-09-09 11:00:29 +02:00
Marco Slot	04388e13b0	Add worker_append_table_to_shard permissions tests	2021-09-09 11:00:29 +02:00
Marco Slot	4faa49775b	Perform copy command as regular user in worker_append_table_to_shard	2021-09-09 11:00:29 +02:00
Hanefi Onaldi	9ae912a8c8	Prevent C-style comments in all directories (#5250 )	2021-09-09 11:54:58 +03:00
SaitTalhaNisanci	e3e0a028c7	return early in case we want to skip outer vars (#5259 )	2021-09-09 10:53:36 +03:00
Onur Tirtir	32e3e51ed4	Fix a compiler warning that we get on debian (#5260 )	2021-09-08 20:03:59 +03:00
Onur Tirtir	74d9d2a718	Merge pull request #5246 from citusdata/col/no-index-only	2021-09-08 14:19:39 +03:00
Onur Tirtir	9935dfb958	Remove a flaky test from columnar_paths We already knew that it was flaky. Moreover, now it failed on my branch too. So removing it with this commit.	2021-09-08 14:15:22 +03:00
Onur Tirtir	be3914ae28	Prevent generating index-only "Path"s for columnar tables Previously, even when `EXPLAIN` output tells that we will do index-only scan, it was never the case since columnar tables don't have the visibility fork that postgres is looking for. For this reason, visibility check done in `IndexOnlyNext->VM_ALL_VISIBLE` code-path was always returning false and postgres was reading the tuple from the columnar relation itself.	2021-09-08 14:14:24 +03:00
Onur Tirtir	cc49e63222	Not read heaptuple after closing pg_rewrite (#5255 )	2021-09-08 13:03:17 +02:00
Onur Tirtir	3340f17c4e	Prevent planner from choosing parallel scan for columnar tables (#5245 ) Previously, for regular table scans, we were setting `RelOptInfo->partial_pathlist` to `NIL` via `set_rel_pathlist_hook` to discard scan `Path`s that need to use any parallel workers, this was working nicely. However, when building indexes, this hook doesn't get called so we were not able to prevent spawning parallel workers when building an index. For this reason, `9b4dc2f804` added basic implementation for `columnar_parallelscan_*` callbacks but also made some changes to skip using those workers when building the index. However, now that we are doing stripe reservation in two stages, we call `heap_inplace_update` at some point to complete stripe reservation. However, postgres throws an error if we call `heap_inplace_update` during a parallel operation, even if we don't actually make use of those workers. For this reason, with this pr, we make sure to not generate scan `Path`s that need to use any parallel workers by using `get_relation_info_hook`. This is indeed useful to prevent spawning parallel workers during index builds.	2021-09-08 13:53:43 +03:00
Onur Tirtir	5825c44d5f	Handle aborted writes properly when scanning a columnar table (#5244 ) If it is certain that we will not use any `parallel_worker`s for a columnar table, then stripe entries inserted by aborted transactions become visible to `SnapshotAny` and that causes `REINDEX` to fail by throwing a duplicate key error. To fix that: * consider three states for a stripe write operation: "flushed", "aborted", or "in-progress", * make sure to have a clear separation between them, and * act according to those three states when reading from a columnar table	2021-09-08 13:26:11 +03:00
Onur Tirtir	5dc619162d	Add valgrind test target for multi-1 (#5251 )	2021-09-07 16:27:34 +03:00
Jelte Fennema	5f46372416	Merge pull request #5242 from citusdata/enable-binary-protocol-on-pg14 Since PG14 we can now use binary encoding for arrays and composite types that contain user defined types. This was fixed in this commit in Postgres: `670c0a1d47` This change starts using that knowledge, by not necessarily falling back to text encoding anymore for those types. While doing this and testing a bit more I found various cases where binary encoding would fail that our checks didn't cover. This fixes those cases and adds tests for those. It also fixes EXPLAIN ANALYZE never using binary encoding, which was a leftover of workaround that was not necessary anymore. Finally, it changes the default for both `citus.enable_binary_protocol` and `citus.binary_worker_copy_format` to `true` for PG14 and up. In our cloud offering `binary_worker_copy_format` already was true by default. `enable_binary_protocol` had some bug with MX and user defined types, this bug was fixed by the above mentioned fixes.	2021-09-06 10:44:08 +02:00
Jelte Fennema	bb5c494104	Enable binary encoding by default on PG14 Since PG14 we can now use binary encoding for arrays and composite types that contain user defined types. This was fixed in this commit in Postgres: `670c0a1d47` This change starts using that knowledge, by not necessarily falling back to text encoding anymore for those types. While doing this and testing a bit more I found various cases where binary encoding would fail that our checks didn't cover. This fixes those cases and adds tests for those. It also fixes EXPLAIN ANALYZE never using binary encoding, which was a leftover of workaround that was not necessary anymore. Finally, it changes the default for both `citus.enable_binary_protocol` and `citus.binary_worker_copy_format` to `true` for PG14 and up. In our cloud offering `binary_worker_copy_format` already was true by default. `enable_binary_protocol` had some bug with MX and user defined types, this bug was fixed by the above mentioned fixes.	2021-09-06 10:27:29 +02:00
SaitTalhaNisanci	148f51cb98	Merge pull request #5243 from citusdata/update/pg14_images Update pg14 images	2021-09-06 10:56:19 +03:00
Sait Talha Nisanci	52fd2de76c	Update pg14 images	2021-09-06 10:07:52 +03:00
Burak Velioglu	eada7f0bbc	Merge pull request #5236 from citusdata/velioglu/partition_helper_udfs Add helper UDFs for easy time partition management	2021-09-03 23:17:54 +03:00
Burak Velioglu	c3895f35cd	Add helper UDFs for easy time partition management - get_missing_time_partition_ranges: Gets the ranges of missing partitions for the given table, interval and range unless any existing partition conflicts with calculated missing ranges. - create_time_partitions: Creates partitions by getting range values from get_missing_time_partition_ranges. - drop_old_time_partitions: Drops partitions of the table older than given threshold.	2021-09-03 23:03:13 +03:00
Onur Tirtir	2b71263e40	Align columnar path costing functions (#5239 ) * Rename RecostColumnarPaths to CostColumnarPaths * Rename RecostColumnarIndexPath to CostColumnarIndexPath * Reorder args of CostColumnarScan to align with other two costing functions * Not adjust index scan start-up cost * Rename ColumnarIndexScanAddTotalCost to ColumnarIndexScanAdditionalCost * Reflect that index scan will at least read one stripe in totalCost calculation * Organize declarations in columnar_customscan.c	2021-09-03 19:37:42 +03:00
jeff-davis	cc58b58f73	Columnar: reserve metapage flag for UNLOGGED support. (#5237 ) Reserve space in the metapage for a flag to support UNLOGGED tables in the future without a metapage upgrade.	2021-09-03 08:40:55 -07:00
Halil Ozan Akgül	f67574496c	Merge pull request #5238 from citusdata/error_reindex_dist_part_tables Adds error message for REINDEX TABLE queries on distributed partition…	2021-09-03 17:24:44 +03:00
Halil Ozan Akgul	7fadfb74bb	Adds error message for REINDEX TABLE queries on distributed partitioned tables	2021-09-03 16:46:42 +03:00
SaitTalhaNisanci	9f211eb874	Merge pull request #5209 from citusdata/pg14_support Add Pg14 support	2021-09-03 16:22:38 +03:00
Sait Talha Nisanci	3ad3bbba84	Apply latest version compat without conflicts	2021-09-03 16:09:59 +03:00
Sait Talha Nisanci	0b67fcf81d	Fix style	2021-09-03 16:09:59 +03:00
Halil Ozan Akgul	e1f5520e1a	Adds propagation of ALTER TABLE .. ALTER COLUMN .. SET COMPRESSION ..	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	902af39a04	Add join alias tests (#5233 ) PG COMMIT: 055fee7eb4dcc78e58672aef146334275e1cc40d	2021-09-03 15:44:28 +03:00
SaitTalhaNisanci	2a2ebab1fa	Add tests for jsonb subscripting (#5232 ) PG commit: 676887a3b0b8e3c0348ac3f82ab0d16e9a24bd43	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	2b263f9a2a	ALTER STATISTICS .. OWNER TO CURRENT_ROLE (#5225 ) (cherry picked from commit 42322caf90ca094777aa01376e02d1187afc1560)	2021-09-03 15:44:28 +03:00
Onder Kalaci	82a3b20fb3	Fix flaky test	2021-09-03 15:44:28 +03:00
Onder Kalaci	5844ab286c	Support OUT parameters in procedure pushdown delegation In PG 14, procedures can have OUT parameters. In Citus' procedure delegation framework, we need to adjust the function expression to get the outargs parameters. Relevant PG change: `e56bce5d43`	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	1ff7186d20	Extended statistics on expressions - PG14 a4d75c8 (#5224 ) (cherry picked from commit 1268415f123b5d99cfacfe207c8670240efc1c00)	2021-09-03 15:44:28 +03:00
Halil Ozan Akgul	113d5d6615	Adds support for column compression in table distribution	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	6fbdeb38a8	ALTER TABLE ... DETACH PARTITION ... CONCURRENTLY - PG14 #71f4c8c (#5223 )	2021-09-03 15:44:28 +03:00
Onder Kalaci	c431bb2e45	Add support for "COPY dist/ref tables FROM" progress report Simply call Postgres' function to report the progress on each row received. Note that we currently do not support "COPY dist/ref TO .." progress report nicely. Citus has some specialized logic to support "COPY dist/ref TO .." such that it either converts the underlying command into "COPY (SELECT * FROM dist/ref ) ..." or sends COPY command to shards directly. In the former case, "tuples_processed" is only updated when the executor returns all the tuples, so the progress is not accurate. In the latter case, Citus can actually implement the progress report. But, for the sake of consistency, we prefer to not implement at all. Added to PG 14 with https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=8a4f618e7ae3cb11b0b37d0f06f05c8ff905833f	2021-09-03 15:44:28 +03:00
Ahmet Gedemenli	66303785f3	Add option PROCESS_TOAST to VACUUM - PG14 #7cb3048 (#5219 ) (cherry picked from commit e63bdfc49f9203db14ef77313c1d5e3461a84a32)	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	35a3f7240d	CHANGELOG: Allow REINDEX to change the tablespace of the new index	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	4e85d9ffce	Add empty pg14 sql file	2021-09-03 15:44:28 +03:00
Sait Talha Nisanci	307eb81278	Fix failure for 1pc_copy_hash	2021-09-03 15:41:28 +03:00
Nils Dijk	c799d8cad8	add 14beta3 to CI	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	a6c40ebd14	Fix multi_follower_dml When the_table is empty, we don't get an error with pg14 anymore so we replace it with generate_series so that we get the error.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	bd501b4d80	Enable pg12-pg14 upgrade test	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	b16dadbe7c	Avoid NOTICE message to avoid an alternative output with pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6ff609fa86	Add alternative output for data_types It seems like there is a problem with Postgres14 with SELECT DISTINCT COUNT. The issue is reported to Postgres and an alternative output is added. We can remove the alternative output when the issue is fixed on PG. If this is not an issue on PG(which is unlikely) we should consider some other solution.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2fa1e5ffe3	Use the default max_parallel_workers_per_gather for vanilla	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	4b951a2ed9	Add alternative output for multi-mx	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	96964aeee5	Turn off debug for one query to avoid adding an alternative output	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e7607b6bed	Add a helper function to check explain has a single task In order to avoid adding an alternative output, a function to check if a given explain plan has a single task is added. This doesn't change what the changed tests intend to do.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	e0faf34417	turn off costs in columnar_indexes explain query	2021-09-03 15:41:28 +03:00
Nils Dijk	e63302d012	update error messages for libpq 14beta3	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	2656d885f9	Rewrite AppendColumnNames for Pg14 Postgres changed stats expression types as of PG14. Hence we needed to write the AppendColumnNames method. Also they removed the error on PG side so we remove it as well. Relevant commits on pg14: a4d75c86bf15220df22de0a92c819ecef9db3849 388e75ad33489b77cfb9a8590a91e9287d8fb960	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	d1c0403055	Disable Query Identifier calculation in tests When queryId is not 0 and verbose is true, the query identifier is emitted to the explain output. This is breaking Postgres outputs. We disable the query identifier calculation in the tests. Commit on PG that introduced the query identifier in the explain output: 4f0b0966c866ae9f0e15d7cc73ccf7ce4e1af84b	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	7c0389a7a1	Update propagate extension commands test for pg12 The test file was changed slightly to avoid adding an alternative output. We update the existing alternative output for pg12 with the new changes.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	cd402b6a2b	Add alternative output for pg12 for window_functions	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	c31b0c2652	Sets next_shard_id at partition_wise_join test	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	9fc4c27b08	Readds deleted resultRelInfo changes for previous PG versions These changes were removed in commit: Introduces ExecSimpleRelationInsert_compat and modifyStateResultRelInfo macros We shouldn't have removed them but instead kept them for before PG14	2021-09-03 15:41:28 +03:00
Nils Dijk	b632dd9940	use pg14 image for pg upgrade tests	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	aca2b8b675	Add alternative output for isolation_master_update_node	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	f3fa133caa	Bind seg version to 1.3 in isolation_textension_commands	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	75fff14792	Turn off VERBOSE to avoid alternative output With VERBOSE option, as of PG14, we get a line with "Query Identifier".	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	6b65dbc492	Add partition_wise_join to avoid big alternative output There was a small part in multi_partitioning that would need an alternative output for pg14. Instead of adding an alternative for the whole file, we created a new file, called partition_wise_join.sql and added the alternative output for that.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	375a1adc9e	Check if extversion is the same for seg extension When we check the exact version of the seg extension, it becomes a problem when its version changes, such as from 1.3 to 1.4. So now we modified the checks to verify that the version is the same across the whole cluster.	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	ca0d4c3bde	Includes pg_version_constants.h in columnar_version_compat.h	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	7823e49219	Introduces pg_get_statisticsobj_worker_compat macro Relevant PG commit: a4d75c86bf15220df22de0a92c819ecef9db3849	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	f16d5e1833	Introduces make_simple_restrictinfo_compat and pull_varnos_compat macros make_simple_restrictinfo and pull_varnos functions now have a new parameter These new macros give us the ability to use this new parameter for PG14 and they don't give the parameter for previous versions Relevant PG commit: 55dc86eca70b1dc18a79c141b3567efed910329d	2021-09-03 15:41:28 +03:00
Nils Dijk	79d1b7d50b	add 14beta3 to CI	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	9b6ce10892	Removes password outputs from alter_role_propagation tests	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	20c32a7a1d	Add alternative output for multi_deparse_function Postgres tightened up its checks for invalid GUC names hence we started to get an alternative output for one of our tests. We add an alternative output since the file is relatively small. Commit on PG: 3db826bd55cd1df0dd8c3d811f8e5b936d7ba1e4	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	256e7d1540	Add alternative output for window_functions	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	df9b7149c3	Add some normalization rules for pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	dc81cae18f	Turn off COSTS to avoid alternative output for pg14	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	fb8671f291	Change pg13 test to not differ with pg14 to avoid adding alternative output	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	3f5c178c93	Remove VERBOSE output to make pg14 and pg13 output the same	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	abd3c1089b	Use oid_hash in write state management	2021-09-03 15:41:28 +03:00
Halil Ozan Akgul	8ef94dc1f5	Changes array_cat argument type from anyarray to anycompatiblearray Relevant PG commit: 9e38c2bb5093ceb0c04d6315ccd8975bd17add66 fix array_cat_agg for pg upgrades array_cat_agg now needs to take anycompatiblearray instead of anyarray because array_cat changed its type from anyarray to anycompatiblearray with pg14. To handle upgrades correctly, we drop the aggregate in citus_pg_prepare_upgrade. To be able to drop it, we first remove the dependency from pg_depend. Then we create the right aggregate in citus_finish_pg_upgrade and we also add the dependency back to pg_depend.	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	a1bfb4f31b	Fix unlimited copy size variable's value	2021-09-03 15:41:28 +03:00
Sait Talha Nisanci	29f5b99951	Use empty string instead of NULL for queryString Postgres doesn't accept NULL for queryStrings in explain plans anymore. Internally, there are some places in Postgres where they modified the NULLS to ""(the empty string). So we do the same on citus side. Commit on Postgres: 1111b2668d89bfcb6f502789158b1233ab4217a6	2021-09-03 15:27:25 +03:00
Sait Talha Nisanci	96833e2b8f	Use HASH_STRINGS explicitly in hash functions Postgres expects to set the HASH_STRINGS explicitly in case of the default behavior for string hash function. Postgres Commit b3817f5f774663d55931dd4fab9c5a94a15ae7ab	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	5930378f61	Renames shadowing ruleutils_14.c variables	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	b21a00e775	Introduces index_insert_compat macro index_insert function now has a new parameter, indexUnchanged This new macro gives us the ability to use this new parameter for PG14 and it doesn't give the parameter for previous versions Existing parameter is set to false Relevant PG commit: 9dc718bdf2b1a574481a45624d42b674332e2903	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	fd2ca2825b	Introduces ExecSimpleRelationInsert_compat and modifyStateResultRelInfo macros es_result_relation_info is removed from EState. In this commit we make some changes to handle that. resultRelationInfo field is added to ModifyState to support the removed field. Relevant PG commits: 1375422c7826a2bf387be29895e961614f69de4b a04daa97a4339c38e304cd6164d37da540d665a8	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	b644ac55c6	Introduces GetOldestNonRemovableTransactionId_compat macro GetOldestXmin function is removed so we use GetOldestNonRemovableTransactionId functions instead GetOldestNonRemovableTransactionId_compat picks the appropriate one Relevant PG commit: dc7420c2c9274a283779ec19718d2d16323640c0	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	cb3b76ed24	Introduces get_partition_parent_compat and RelationGetPartitionDesc_compat macros get_partition_parent and RelationGetPartitionDesc functions now have new parameters to also include detached partitions These new macros give us the ability to use these new parameters for PG14 and they don't give the parameters for previous versions Existing parameters are set to not accept detached partitions Relevant PG commit: 71f4c8c6f74ba021e55d35b1128d22fb8c6e1629	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	898d3bb8d3	Introduces proc_statusflags_compat macro In two commits vacuumFlags in PGXACT is moved and then renamed to status flags This macro uses the appropriate version of the flag Relevant PG commits: 5788e258bb26495fab65ff3aa486268d1c50b123 cd9c1b3e197a9b53b840dcc87eb41b04d601a5f9	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	287706b717	Introduces SetTuplestoreDestReceiverParams_compat macro SetTuplestoreDestReceiverParams function now has two new parameters This new macro gives us the ability to use these new parameters for PG14 and it doesn't give the parameters for previous versions Existing parameters are set to NULL to keep previous behavior Relevant PG commit: 2f48ede080f42b97b594fb14102c82ca1001b80c	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	b01e7e884c	Pass NULL for plannerInfo as we don't generate PlaceHolderVars	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	86d9260781	Uses lfirst_node in ruleutils_14.c Relevant PG commit: 2b00db4fb0c7f02f000276bfadaab65a14059168	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	3b7bcf7555	Adds missing include_out_argument parameter to func_get_detail in ruleutils_14.c Relevant PG commit: e56bce5d43789cce95d099554ae9593ada92b3b7	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	2990cfb6c9	Adds SQL-standard function body support to ruleutils_14.c Relevant PG commit: e717a9a18b2e34c9c40e5259ad4d31cd7e420750	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	84f0be56c3	Adds EXTRACT cases to get_func_sql_syntax in ruleutils_14.c Relevant PG commit: a2da77cdb4661826482ebf2ddba1f953bc74afe4	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	131062d6b5	Removes ModifyTable check from set_deparse_plan in ruleutils_14.c Relevant PG commit: 86dc90056dfdbd9d1b891718d2e5614e3e432f35	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	f557bae64c	Adds JOIN ... USING alias to ruleutils_14.c Relevant PG commit: 055fee7eb4dcc78e58672aef146334275e1cc40d	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	c3f0528607	Extends statistics on expressions in ruleutils_14.c Relevant PG commit: a4d75c86bf15220df22de0a92c819ecef9db3849	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	af2853d1de	Adds GROUP BY DISTINCT to ruleutils_14.c Relevant PG commit: be45be9c33a85e72cdaeb9967e9f6d2d00199e09	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	5bb538543d	Enhances cycle mark values at ruleutils_14.c Relevant PG commit: f4adc41c4f92cc91d507b19e397140c35bb9fd71	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	12b3c04fe3	Adds SEARCH and CYCLE clauses to ruleutils_14.c Relevant PG commit: 3696a600e2292d43c00949ddf0352e4ebb487e5b	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	1174046a33	Adds bytea equivalents of ltrim() and rtrim() to ruleutils_14.c Relevant PG commit: a6cf3df4ebdcbc7857910a67f259705645383e9f	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	71691ecf06	Adds HASH_STRINGS flag to ruleutils_14.c Relevant PG commit: b3817f5f774663d55931dd4fab9c5a94a15ae7ab	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	e72bd0c1a1	Removes dependency.h from ruleutils_14.c Relevant PG commit: 8b069ef5dca97cd737a5fd64c420df3cd61ec1c9	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	d4874f5ad2	Removes indexing.h header from ruleutils_14.c Relevant PG commit: bdc4edbea6fc847f806e1e7118d730e159512bfc	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	1cb865deb8	Adds SQL syntax function calls related changes to ruleutils_14.c Relevant PG commit: 40c24bfef92530bd846e111c1742c2a54441c62c	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	b4f76303c6	Updates F_ARRAY_UNNEST to F_UNNEST_ANYARRAY in ruleutils_14.c Relevant PG commit: 8e1f37c07aafd4bb7aa6e1e1982010af11f8b5c7	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	30f77b29a7	Fixes some appendStringInfos in ruleutils_14.c Relevant PG commit: 110d81728a0a006abcf654543fc15346f8043dc0	2021-09-03 15:27:25 +03:00
Halil Ozan Akgul	69aa240b99	Adds for_each_from to ruleutils_14.c Relevant PG commit: 56fe008996bc1a547ce60c8dddd2ca821cac163e	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	beb49f0d53	Updates AlternativeSubPlan comment in ruleutils_14.c Relevant PG commit: 41efb8340877e8ffd0023bb6b2ef22ffd1ca014d	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	e642f6c97f	Removes support for postfix operators from ruleutils_14.c Relevant PG commit: 1ed6b895634ce0dc5fd4bd040e87252b32182cba	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	a710b3b949	Removes some comments with printf %.*s format from ruleutils_14.c Relevant PG commit: c410af098c46949e36607eb13689e697fa2def97	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	e38b75799d	Fixes some indentation in ruleutils_14.c Relevant PG commit: fa27dd40d5c5f56a1ee837a75c97549e992e32a4	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	1d5053b652	Removes support for old protocols in Copy functions from PG14 Some Copy related functions copied from Postgres had support for both old and new protocols Postgres removed support for old version so we remove it too Relevant PG commit: 3174d69fb96a66173224e60ec7053b988d5ed4d9	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	82858ca8fe	Introduces ProcessUtility macros for readOnlyTree parameter New macros: standard_ProcessUtility_compat, ProcessUtility_compat, ColumnarProcessUtility_compat, PrevProcessUtilityHook_compat The functions now have a new bool parameter: readOnlyTree These new macros give us the ability to use this new parameter for PG14 and it doesn't give the parameter for previous versions In multi_ProcessUtility and ColumnarProcessUtility, before doing anything else, we check if readOnlyTree parameter is true and create a copy of pstmt Existing readOnlyTree parameters are set to false since we already handle the read only case at multi_ProcessUtility and ColumnarProcessUtility Relevant PG commit: 7c337b6b527b7052e6a751f966d5734c56f668b5	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	5df6251619	Removes CopyGetAttnums function definition for PG14 This function was copied from Postgres but it is not static at PG14 So we keep the definition only for previous versions Relevant PG commit: c532d15dddff14b01fe9ef1d465013cb8ef186df	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	db2d9af863	Introduces BeginCopyFrom_compat macro BeginCopyFrom function now has a new whereClause parameter. In the function this parameter is assigned to the whereClause field of the CopyFromState returned Currently in Postgres there is only one place where this argument isn't NULL, and in previous PG version the whereClause argument of copy state is set right after the function call Since we don't have such example all current whereClause parameters are set to NULL Relevant PG commit: c532d15dddff14b01fe9ef1d465013cb8ef186df	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	35cfa5d7b9	Introduces CopyFromState_compat macro CopyState struct is divided into parts and one of them is CopyFromState This macro uses the appropriate one for PG versions Relevant PG commit: c532d15dddff14b01fe9ef1d465013cb8ef186df	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	8f34f84ce6	Introduces IsReindexWithParam_compat macro In ReindexStmt concurrent field is moved to options and then options are converted to params list. This macro uses previous fields for previous versions and the new params list with a new function named IsReindexWithParam for PG14 Relevant PG commits: 844c05abc3f1c1703bf17cf44ab66351ed9711d2 b5913f6120792465f4394b93c15c2e2ac0c08376	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	37ae22ce3e	Introduces macros for vacuum options VacOptTernaryValue enum is renamed to VacOptValue. In the enum there were three values, VACOPT_TERNARY_DEFAULT, VACOPT_TERNARY_DISABLED, and VACOPT_TERNARY_ENABLED Now there are four values VACOPTVALUE_UNSPECIFIED, VACOPTVALUE_AUTO, VACOPTVALUE_DISABLED, and VACOPTVALUE_ENABLED New macros are VacOptValue_compat, VACOPTVALUE_UNSPECIFIED_COMPAT, VACOPTVALUE_DISABLED_COMPAT, and VACOPTVALUE_ENABLED_COMPAT The VACOPTVALUE_UNSPECIFIED_COMPAT matches VACOPT_TERNARY_DEFAULT and VACOPTVALUE_UNSPECIFIED. And there is no macro for VACOPTVALUE_AUTO. Relevant PG commit: 3499df0dee8c4ea51d264a674df5b5e31991319a	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	ebf1b7e23f	Introduces macros for functions that now have include_out_arguments argument New macros: FuncnameGetCandidates_compat and expand_function_arguments_compat The functions (the ones without _compat) now have a new bool include_out_arguments parameter These new macros give us the ability to use this new parameter for PG14 and it doesn't give the parameter for previous versions Existing include_out_arguments parameters are set to 'false' to keep current behavior Relevant PG commit: e56bce5d43789cce95d099554ae9593ada92b3b7	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	347ae2928f	Introduces stats_compat macro for MemoryContextMethods->stats stats function now has a new bool print_to_stderr parameter This new macro gives us the ability to use this new parameter for PG14 and it doesn't give the parameter for previous versions Existing print_to_stderr parameter is set to true to keep current behavior Relevant PG commit: 43620e328617c1f41a2a54c8cee01723064e3ffa	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	54ee93885a	Introduces getObjectTypeDescription_compat and getObjectIdentity_compat macros getObjectTypeDescription and getObjectIdentity functions now have a new bool missing_ok parameter These new macros give us the ability to use this new parameter for PG14 and they don't give the parameter for previous versions Currently all missing_ok parameters are set to false to keep current behavior Relevant PG commit: 2a10fdc4307a667883f7a3369cb93a721ade9680	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	f8d3e50f25	Introduces STATUS_WAITING_COMPAT macro. The STATUS_WAITING define is removed and an enum with PROC_WAIT_STATUS_WAITING is added instead. This macro uses the appropriate one. Relevant PG commit: a513f1dfbf2c29a51b0f7cbd5913ce2d2ee452c5	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	3c10e0f568	Introduces ROLE_MONITOR_COMPAT macro. DEFAULT_ROLE_MONITOR is renamed to ROLE_PG_MONITOR. This macro uses the appropriate one. Relevant PG commit: c9c41c7a337d3e2deb0b2a193e9ecfb865d8f52b	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	4bc0c80bba	Adds index_delete_tuples instead of compute_xid_horizon_for_tuples Relevant PG commit: d168b666823b6e0bcf60ed19ce24fb5fb91b8ccf	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	b790ecf180	Introduces F_NEXTVAL_COMPAT macro. F_NEXTVAL_OID is renamed to F_NEXTVAL. Relevant PG commit: 8e1f37c07aafd4bb7aa6e1e1982010af11f8b5c7	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	f933d2a57a	Includes defrem.h in index.c	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	63cdb4b70a	Adds AlterTableStmtObjType macro. AlterTableStmt's relkind field is changed into objtype. The new AlterTableStmtObjType macro uses the appropriate one. Relevant PG commit: cc35d8933a211d9965eb1c1d2749a903d5735db2	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	1b6c8348fb	Adds PG14 to version_compat.h and columnar_version_compat.h files	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	7a27d7cee3	Adds copy of ruleutils_13.c as ruleutils_14.c	2021-09-03 15:27:24 +03:00
Halil Ozan Akgul	5aace9bb37	Enables Postgres 14 in configure	2021-09-03 15:27:24 +03:00
jeff-davis	4718b6bcdf	Generate parameterized paths for columnar scans. (#5172 ) Allow ColumnarScans to push down join quals by generating parameterized paths. This significantly expands the utility of chunk group filtering, making a ColumnarScan behave similar to an index when on the inner of a nested loop join. Also, evaluate all parameters on beginscan/rescan, which also works for external parameters. Fixes #4488.	2021-09-02 22:22:48 -07:00
Onur Tirtir	e41854f590	Merge pull request #4818 from citusdata/col/show-projected-cols	2021-09-02 19:14:19 +03:00
Onur Tirtir	37d0ecfbb7	Show projected cols for columnar tables in EXPLAIN output	2021-09-02 19:05:32 +03:00
Onur Tirtir	42ba82fb67	Comment ColumnarAttrNeeded	2021-09-02 13:20:11 +03:00
Onur Tirtir	9cb5ef5007	Pass ColumnarScanDesc to ColumnarScanChunkGroupsFiltered	2021-09-02 13:20:11 +03:00
Naisila Puka	4fb05efabb	Distributes partition-to-be table before ProcessUtility (#5191 ) * Skip ALTER TABLE constraint checks while planning * Revert previous commit's solution, keep tests * Distribute partition-to-be table before ProcessUtility * Acquire locks in PreprocessAlterTableStmtAttachPartition	2021-09-02 13:07:42 +03:00
Onur Tirtir	889a2731cb	Split columnar stripe reservation into two phases (#5188 ) Previously, we were doing `first_row_number` reservation for the first row written to current `WriteState` but were doing `stripe_id` reservation when flushing the `WriteState` and were inserting the related record to `columnar.stripe` at that time as well. However, inserting `columnar.stripe` record at flush-time is problematic. This is because, as told in #5160, if relation has any index-based constraints and if there are two concurrent writes that are inserting conflicting key values for that constraint, then postgres relies on `tableAM->fetch_index_tuple` (=`columnar_fetch_index_tuple`) callback to return `true` when indexAM is checking against possible constraint violations. However, pending writes of other backends are not visible to concurrent sessions in columnar since we were not inserting the stripe metadata record until flushing the stripe. With this commit, we split stripe reservation into two phases: i) Reserve `stripe_id` and insert a "dummy" record to `columnar.stripe` at the very same time we reserve `first_row_number`, i.e. when writing the first row to the current `WriteState`. ii) At flush time, do the storage level allocation and complete the missing fields of the dummy record inserted into `columnar.stripe` during i). That way, any concurrent writes would be able to check against possible constraint violations by using `SnapshotDirty` when scanning `columnar.stripe`. Note that `columnar_fetch_index_tuple` still wouldn't be able to fill the output tupleslot for the requested tid but it would at least return `true` for such index look-up's and we believe this should be sufficient for the caller indexAM callback to make the concurrent writer block on prior one. That is how we fix #5160. 
Only downside of reserving `stripe_id` at the same time we reserve `first_row_number` is that now any aborted writes would also waste some amount of `stripe_id` as in the case of `first_row_number` but we are just wasting them one-by-one. Considering the fact that we waste `first_row_number` by the amount stripe row limit (=150k by default) in such cases, this shouldn't be important at all.	2021-09-02 11:49:14 +03:00
Onur Tirtir	0bf29200eb	Merge pull request #5154 from citusdata/col/use-correct-snapshot	2021-09-02 11:20:27 +03:00
Onur Tirtir	bf4dfad6f7	Update curcid of given snapshot if it is MVCC Before starting to scan a columnar table, we always flush the pending writes to disk. However, we increment command counter after modifying metadata tables. On the other hand, now that we _don't always use_ xact snapshot to scan a columnar table, writes that we just flushed might not be visible to the query that just flushed pending writes to disk since curcid of provided snapshot would become smaller than the command id being used when modifying metadata tables. To give an example, before this change, below was a possible scenario due to the changes that we made to use the correct snapshot. ```sql CREATE TABLE t(a int, b int) USING columnar; BEGIN; INSERT INTO t VALUES (5, 10); SELECT * FROM t; ┌───┬───┐ │ a │ b │ ├───┼───┤ └───┴───┘ (0 rows) SELECT * FROM t; ┌───┬────┐ │ a │ b │ ├───┼────┤ │ 5 │ 10 │ └───┴────┘ (1 row) ```	2021-09-02 11:11:59 +03:00
Onur Tirtir	6c26c67ea0	Flush write state when initializing read state. In the next commit, we will adjust curcid of the snapshot being used when scanning the columnar table. However, for index scan, the snapshot is provided not when beginning the scan but within the fetch-tuple call. For this reason, start flushing pending writes in init_columnar_read_state, since this seems to be a prerequisite step that needs to be done before scanning a columnar table regardless of the scan method being used.	2021-09-02 11:10:11 +03:00
Onur Tirtir	db0e4ce889	Increment command counter in FinishModifyRelation instead Seems that we always increment the command counter right after finishing metadata table modification. For this reason, it makes sense to call CommandCounterIncrement within FinishModifyRelation.	2021-09-02 11:10:11 +03:00
Onur Tirtir	0b4ed075b5	Use correct snapshot when reading a columnar table Instead of using xact snapshot, use the snapshot provided to columnarAM when scanning table.	2021-09-02 11:10:11 +03:00
Naisila Puka	bd91df298f	Fixes ConnectionModifiedPlacement output for a failed transaction (#5198 )	2021-08-31 18:58:46 +03:00
Naisila Puka	7755d5ed3a	Fixes order of citus_drop_all_shards arguments (#5200 )	2021-08-31 18:25:38 +03:00
Naisila Puka	acb5ae6ab6	Skip dropping shards when we know it's a partition (#5176 )	2021-08-31 17:41:37 +03:00
SaitTalhaNisanci	5ae01303d4	Use get_attnum to find the attribute number of target entry (#5220 ) * Use get_attnum to find the attribute number of target entry	2021-08-31 16:47:19 +03:00
Jelte Fennema	481f8be084	Fix crash in shard rebalancer when no distributed tables exist (#5205 ) The logging of the amount of ignored moves crashed when no distributed tables existed in a cluster. This also fixes in passing that the logging of ignored moves logs the correct number of ignored moves if there exist multiple colocation groups and all are rebalanced at the same time.	2021-08-31 14:15:24 +02:00
SaitTalhaNisanci	d50830d4cc	Update failure tests README (#5197 ) * Update failure tests README I keep finding this page when trying to run failure tests, so updating the README that way: https://github.com/pypa/pipenv/issues/3363#issuecomment-452171564 Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com>	2021-08-26 12:35:06 +03:00
Hanefi Onaldi	7e39c7ea83	Replace master with citus in logs and comments (#5210 ) I replaced - master_add_node, - master_add_inactive_node - master_activate_node with - citus_add_node, - citus_add_inactive_node - citus_activate_node respectively.	2021-08-26 11:31:17 +03:00
SaitTalhaNisanci	51fa7a2208	Store test results as artifact on failure (#5207 ) There are some libpq changes on the postgres side that give some extra outputs on pg13.4 and pg12.8. It is possible that we won't get these outputs locally, and in those cases it is useful to download those outputs from the CI. In order to do that, we store the test results as an artifact in CI. You can go to the artifacts tab in CI to download them.	2021-08-25 17:18:03 +03:00
SaitTalhaNisanci	b923d51fc6	Bump pg12 and pg13 images to pg12.8 and pg13.4 (#5208 ) In our testing infrastructure, even though we use pinned versions of postgres, the auxiliary libraries might pull in newer versions. This is for example the case for libpq, which will now use the libpq libraries from 14beta3. Many of the changes in this PR are due to the libpq changes. We have also changed the citus version that is used as a base for the citus upgrades, from 10.0 to 10.1. This caused columnar to enforce some extra limits on the settings, which conflicted with our upgrade tests. The changes in failure tests are due to the libpq changes. There are also a lot of changes in isolation test outputs, hence we updated all of them. Co-authored-by: Nils Dijk <nils@citusdata.com>	2021-08-25 16:04:57 +03:00
SaitTalhaNisanci	c8326df8c0	Fix missing comma in connection options (#5206 )	2021-08-25 13:40:42 +03:00
Jelte Fennema	a31429aae5	Allow configuring tcp_user_timeout using citus.node_conn_info (#5203 ) `tcp_user_timeout` is the awesome relatively unknown big brother of the TCP keepalive related options. Instead of depending on keepalives being sent, this determines that a socket is dead by waiting at most N seconds for an ack of data that it has sent. It's exposed in libpq starting from PG12.	2021-08-24 11:48:40 +03:00
Onur Tirtir	5af839ada0	Not print metapage.reserved_offset in regression tests (#5168 ) * We were anyway not testing reserved_offset in any of those tests but other fields. * This only happens with compressed columnar tables and is because the libzstd/liblz4 versions that we have on exttester ci image might be different than what we might have on our local environments.	2021-08-23 11:07:10 +03:00
Onur Tirtir	2b93f8af56	Merge pull request #5189 from citusdata/col/readme-updates Update columnar readme for temporary table & index support	2021-08-23 10:38:52 +03:00
Onur Tirtir	7dcd9380e7	Update index support section of columnar README	2021-08-23 10:35:11 +03:00
Onur Tirtir	3acd3ebae2	Remove temp table limitation from columnar README	2021-08-23 10:35:11 +03:00
Onur Tirtir	262f89359e	Merge pull request #5192 from citusdata/use-pg-functions Use relcache utils instead of scanning catalog tables for indexes and ext stats	2021-08-18 17:56:56 +03:00
Onur Tirtir	4e1201a333	Use RelationGetStatExtList instead of scanning pg_stats_ext	2021-08-18 17:50:58 +03:00
Onur Tirtir	4b03195c06	Use RelationGetStatExtList instead of GetExplicitStatisticsIdList	2021-08-18 17:50:57 +03:00
Onur Tirtir	91544d0191	Use PGIndexProcessor infra to find explicitly created indexes	2021-08-18 17:50:57 +03:00
Onur Tirtir	549ca4de6d	Use RelationGetIndexList instead of scanning pg_index	2021-08-18 17:50:57 +03:00
Onur Tirtir	fa9933daf3	Use get_am_name to find indexAM name	2021-08-18 00:44:37 +03:00
Nils Dijk	dfc950ce1e	Fix a segfault caused by use after free in ConnectionsPlacementHash (#5170 ) DESCRIPTION: Fix a segfault caused by use after free in ConnectionsPlacementHash Fix a segfault caused by retaining data in any of the hashmaps making up the Placement Connection Management. We have seen production systems segfault due to random data referenced from ConnectionPlacementHash. On investigation we found that the backends segfaulting on this had OOM errors shortly before the segfault. It turned out there are at least 15 places where an allocation can OOM that would cause ConnectionPlacementHash to retain pointers to memory from contexts that are subsequently freed. This would reproduce the segfault we have observed in production. Conditions for these allocations are: - allocated after first call to `AssociatePlacementWithShard`: https://github.com/citusdata/citus/blob/v10.0.3/src/backend/distributed/connection/placement_connection.c#L880-L881 - allocated before `StartNodeUserDatabaseConnection`: https://github.com/citusdata/citus/blob/v10.0.3/src/backend/distributed/connection/connection_management.c#L291 At least 15 points of memory allocation (which could fail) are between the callsites of both in a primary key lookup on a reference table - where we have seen an OOM cause a segfault moments later. Instead of leaving any references in ConnectionPlacementHash, ConnectionShardHash and ColocatedPlacementsHash that could retain any pointers that are freed due to the TopTransactionContext being reset, we clear all these hashes regardless of the state of CurrentCoordinatedTransactionState. Downside is that on any transaction abort we will now iterate through 4 hashmaps and clear their contents. Given that they are either already empty, which should cause a quick iteration, or non-empty, causing segfaults in subsequent executions, this overhead seems reasonable. 
A better solution would be to move the creation of these hashmaps so they would live in the TopTransactionContext themselves, assuming their contents would never outlive a transaction. This needs more investigation and is an involved refactor, hence fixing this quickly here.	2021-08-17 17:42:35 +02:00
jeff-davis	4f213f293e	Columnar: use generate_series for test rather than load. (#5181 )	2021-08-16 16:12:06 -07:00
Hanefi Onaldi	49be45ed00	Merge pull request #5178 from citusdata/changelog-updates	2021-08-16 17:22:50 +03:00
Hanefi Onaldi	41b15d8775	Add changelog entries for 9.5.7	2021-08-16 17:11:32 +03:00
Hanefi Onaldi	167a023770	Add changelog entries for 10.0.5	2021-08-16 17:11:32 +03:00
Hanefi Onaldi	da29a57837	Add changelog entries for 10.1.2	2021-08-16 17:11:31 +03:00
Onur Tirtir	e0ecec80a1	Merge pull request #5165 from citusdata/col/rewrite-mem-reset-bug Use right mem cxt for read state when re-writing a columnar table	2021-08-16 11:14:26 +03:00
Onur Tirtir	68f46c5dc9	Use scan context for intermediate mem allocs too	2021-08-16 11:06:03 +03:00
Onur Tirtir	b3d9fc91f8	Always use right mem cxt when creating ColumnarReadState All the callers except columnar_relation_copy_for_cluster were already switching to right memory context when creating ColumnarReadState. With this commit, we embed that logic into init_columnar_read_state to avoid further such bugs. That way, we start using the right memory context for columnar_relation_copy_for_cluster too.	2021-08-16 11:06:03 +03:00
Onur Tirtir	7fcecde203	Use init_columnar_read_state instead of lower level func. Functionally, this doesn't change anything. This is just a preparation before the next commit.	2021-08-16 11:06:03 +03:00
Burak Velioglu	a8435620c4	Merge pull request #5115 from citusdata/velioglu/partition_fixes Support for CREATE INDEX ONLY and ALTER INDEX ATTACH PARTITION	2021-08-13 13:18:36 +03:00
Burak Velioglu	4355ba0a38	Add CREATE INDEX ... ON ONLY and ALTER INDEX ... ATTACH PARTITION (#4938 #4980 ) - Add support for CREATE INDEX ... ON ONLY: Before this commit we were not sending the "ONLY" option to the worker nodes at all. With this commit, the "ONLY" parameter will be sent to the worker nodes if it is necessary. (#4938) - Add support for ALTER INDEX ... ATTACH PARTITION: Attach child_index to parent_index by creating the same inheritance on shard level in addition to table level. (#4980)	2021-08-13 13:12:45 +03:00
SaitTalhaNisanci	2ec4e37e45	Fix assert failure in FindReferencedTableColumn (#5175 )	2021-08-12 18:21:45 +03:00
Ahmet Gedemenli	9e90894f21	Synchronize hasmetadata flag on mx workers (#5086 ) * Synchronize hasmetadata flag on mx workers * Switch to sequential execution * Add test * Use SetWorkerColumn * Add test for stop_sync * Remove usage of UpdateHasmetadataOnWorkersWithMetadata * Remove MarkNodeMetadataSynced * Fix test for metadatasynced * Remove MarkNodeMetadataSynced * Style * Remove MarkNodeHasMetadata * Remove UpdateDistNodeBoolAttr * Refactor SetWorkerColumn * Use SetWorkerColumnLocalOnly when setting up dependencies * Use SetWorkerColumnLocalOnly in TriggerSyncMetadataToPrimaryNodes * Style * Make update command generator functions static * Set metadatasynced before syncing * Call SetWorkerColumn only if the sync is successful * Try to sync all nodes * Fix indexno * Update metadatasynced locally first * Break if a node fails to sync metadata * Send worker commands optional * Style & Rebase * Add raiseOnError param to SetWorkerColumn * Style * Set metadatasynced for all metadata nodes * Style * Introduce SetWorkerColumnOptional * Polish * Style * Dont send set command to not synced metadata nodes * Style * Polish * Add test for stop_sync * Add test for shouldhaveshards * Add test for isactive flag * Sort by placementid in the function verify_metadata * Cover edge cases for failing nodes * Add comments * Add nodeport to isactive test * Add warning if metadata out of sync * Update warning message	2021-08-12 14:16:18 +03:00
Naisila Puka	e5b32b2c3c	Acquire AccessShareLock before updating table statistics (#5155 )	2021-08-12 13:58:15 +03:00
Ahmet Gedemenli	6bb4c5e94f	Merge pull request #5173 from citusdata/missing_shouldhave_shards Make sure that shouldhaveshards is synced to workers	2021-08-11 17:15:14 +03:00
Onder Kalaci	d4368ff2b3	Make sure that shouldhaveshards is synced to workers	2021-08-11 15:53:31 +02:00
Hanefi Onaldi	dc67dbaa01	Merge pull request #5171 from citusdata/changelog-updates Add changelog entries for 9.4.6	2021-08-11 11:32:46 +03:00
Hanefi Onaldi	c6e428896a	Add changelog entries for 9.4.6	2021-08-11 10:53:31 +03:00
Önder Kalacı	272f4d7ce5	Merge pull request #5158 from citusdata/heap_on_master Guard against hard WaitEventSet errors	2021-08-10 09:41:05 +02:00
Onder Kalaci	86bd28b92c	Guard against hard WaitEventSet errors. In short, add wrappers around Postgres' AddWaitEventToSet() and ModifyWaitEvent(). AddWaitEventToSet()/ModifyWaitEvent*() may throw hard errors. For example, this happens when the underlying socket for a connection has been closed by the remote server and the OS already reflects that, but Citus hasn't had a chance to get this information. In that case, if replication factor is >1, Citus can failover to other nodes for executing the query. Even if replication factor = 1, Citus can give much nicer errors. So CitusAddWaitEventSetToSet()/CitusModifyWaitEvent() simply put AddWaitEventToSet()/ModifyWaitEvent() into a PG_TRY/PG_CATCH block in order to catch any hard errors, and return this information to the caller.	2021-08-10 09:35:03 +02:00
Önder Kalacı	2ac3cc07eb	Merge pull request #5144 from citusdata/transactional_start_metadata_node Make start/stop_metadata_sync_to_node transactional	2021-08-09 10:55:57 +02:00
Onder Kalaci	5f02d18ef8	transactional metadata sync for maintenance daemon. As we use the current user to sync the metadata to the nodes with #5105 (and many other PRs), there is no reason that prevents us from using the coordinated transaction for metadata syncing. This commit also renames a few functions to reflect their actual implementation.	2021-08-09 10:34:55 +02:00
Önder Kalacı	999a236540	Merge pull request #5131 from citusdata/fix_drop_column_part Dropped columns do not diverge distribution column for partitioned tables	2021-08-06 13:47:50 +02:00
Onder Kalaci	35964c6366	Dropped columns do not diverge distribution column for partitioned tables. Before this commit, creating a partition after a DROP column on the parent (position before dist. key) caused the partition to have the wrong distribution column.	2021-08-06 13:36:12 +02:00
Hanefi Onaldi	0722ec95bc	Merge pull request #5163 from citusdata/changelog-updates	2021-08-05 19:31:10 +03:00
Hanefi Onaldi	998b9ffcaa	Merge branch 'master' into changelog-updates	2021-08-05 19:27:13 +03:00
jeff-davis	deb7ec605b	Columnar: fix misleading comments and useless types. (#5162 ) CustomScan and CustomPath structures cannot be extended with additional fields. Fix comments and type structure that implied that they can.	2021-08-05 09:22:21 -07:00
Hanefi Onaldi	bc5553b5d1	Add changelog entries for 10.1.1	2021-08-05 17:32:31 +03:00
Ahmet Gedemenli	07ca8784cd	Merge pull request #5161 from citusdata/add-check-for-gucs-order Add check for alphabetically sorted gucs	2021-08-05 17:09:38 +03:00
Ahmet Gedemenli	51d410bb7b	Add check for alphabetically sorted gucs Move to a separate script Add the new script to readme	2021-08-05 16:37:49 +03:00
naisila	798a7902bf	Fix master_update_table_statistics scripts for 9.5	2021-08-03 18:15:56 +03:00
naisila	f9fa5a3d69	Fix master_update_table_statistics scripts for 9.4	2021-08-03 18:15:56 +03:00
Önder Kalacı	0d2f49fbce	Merge pull request #5130 from citusdata/get_ready_update_dist_table_colocation Introduce citus_internal_update_relation_colocation	2021-08-03 11:53:30 +02:00
Onder Kalaci	482b8096e9	Introduce citus_internal_update_relation_colocation update_distributed_table_colocation can be called by the relation owner, and internally it updates pg_dist_partition. With this commit, update_distributed_table_colocation uses an internal UDF to access pg_dist_partition. As a result, this operation can now be done by regular users on MX.	2021-08-03 11:44:58 +02:00
Onur Tirtir	ef6a8604ba	Merge pull request #5140 from citusdata/col/seq-path-costing Re-cost columnar table sequential scan paths With the changes in this pr, we adjust the cost estimates done by postgres for sequential scan paths for columnar tables. We want to make better decisions when columnar custom scan is disabled too. That means, there are cases where index scan is preferable to sequential scan for heapAM but not for columnarAM. For this reason, we want to make better decisions regarding whether to choose index scan or sequential scan when columnar custom scan is disabled. So with this pr, we re-estimate costs for sequential scan paths in a way that is quite similar to what we do for columnar custom scan. The idea is that columnar custom scan uses projection pushdown so the cost is directly proportional to column selectivity. However, for sequential scan, we re-estimate the cost considering all the columns since projection pushdown is not supported for plain sequential scan. One thing to note here is that we still don't consider chunk group filtering when estimating the cost for columnar custom scan. For this reason, we calculate the same costs for sequential scan & columnar custom scan if query reads all columns, regardless of the filters in the `where` clause. To avoid mistakenly choosing sequential scan in such cases, we still remove non `IndexPath`s if columnar custom scan is enabled. That way, even when we calculate the same cost for sequential scan and columnar scan, we will anyway remove sequential one and guarantee that we would choose either columnar custom scan or index scan.	2021-08-02 11:38:11 +03:00
Onur Tirtir	93ebbb0607	Re-cost SeqPath's as well for columnar tables	2021-08-02 11:32:25 +03:00
Onur Tirtir	453ac40725	Comment why we still remove non IndexPath's when custom scan is off	2021-08-02 11:25:18 +03:00
Onur Tirtir	a87405b6ba	Not adjust IndexPath cost if indexscan is off	2021-08-02 11:25:18 +03:00
Onur Tirtir	51691a8994	Rename RecostColumnarIndexPaths to RecostColumnarPaths	2021-08-02 11:25:18 +03:00
Onur Tirtir	734fa22272	Merge pull request #5090 from citusdata/col/path-costing Re-cost columnar table index scan paths With the changes in this pr, we adjust the cost estimate done by indexAM for `IndexPath` according to columnar tables when the index is on a columnar table. This is because, the way indexAM estimates the cost is not appropriate for indexes on columnar tables. The most basic reason is that indexAM assumes we will only need to read single page to access a single tuple of the table. On the other hand for columnar tables, we read the whole stripe from disk for a single tuple too, regardless of the optimization done in #5058. Note that we don't simply assign startup / total costs but we add the cost estimated by us to the cost estimated by indexAM. This is because we need to take "the cost due to index data-structure traversal" into account too. Before explaining the logic that we follow for `IndexPath`, let's first summarize what we were / are doing for `ColumnarCustomScan`: ```math X <- cost for reading single column of single stripe // 1 cost = X * (number of columns after projection pushdown) // 2 cost = cost * (number of stripes that relation has) // 3 ``` The logic that we follow to calculate the additional cost for index scan is as follows: ```math X <- cost for reading single column of single stripe // same as 1 above cost = X * (number of columns that relation has) // index scan cannot do projection pushdown, so different than 2 above cost = cost * (estimated number of stripes that we need to read) ``` where, we calculate `estimated number of stripes that we need to read` as follows: ```math indexCorrelation, indexSelectivity <- calculate by using amcostestimate_function estimatedReadRows = (relation row count) * indexSelectivity minEstimateStripeReads = estimatedReadRows / (average stripe row count) // full correlation, we will not do any redundant stripe reads maxEstimateStripeReads = estimatedReadRows // no correlation, we will read a 
different stripe for each tuple complementCorrelation = 1 - abs(indexCorrelation) estimatedStripeCount = minEstimateStripeReads + complementCorrelation * (maxEstimateStripeReads - minEstimateStripeReads) ```	2021-08-02 11:23:20 +03:00
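The stripe-read estimate described in the commit message above can be sketched in a few lines. This is only an illustrative Python transcription of the pseudocode, not the Citus C implementation: the function and parameter names are hypothetical, and the per-column stripe cost `X` is taken as an input rather than derived.

```python
def estimate_stripe_reads(relation_row_count, index_selectivity,
                          index_correlation, avg_stripe_row_count):
    """Estimated number of stripes an index scan has to read (hypothetical helper)."""
    estimated_read_rows = relation_row_count * index_selectivity
    # Full correlation: matching rows are adjacent, so no redundant stripe reads.
    min_stripe_reads = estimated_read_rows / avg_stripe_row_count
    # No correlation: assume a different stripe is read for every tuple.
    max_stripe_reads = estimated_read_rows
    complement_correlation = 1 - abs(index_correlation)
    return min_stripe_reads + complement_correlation * (
        max_stripe_reads - min_stripe_reads)


def additional_index_scan_cost(per_column_stripe_cost, relation_column_count,
                               estimated_stripe_reads):
    """Extra cost added on top of the indexAM estimate (hypothetical helper).

    An index scan cannot use projection pushdown, so all columns are counted.
    """
    return per_column_stripe_cost * relation_column_count * estimated_stripe_reads
```

With a perfectly correlated index the estimate collapses to the minimum (rows read divided by the average stripe row count), and with no correlation it degrades to one stripe read per matching row.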
Onur Tirtir	297f59a70e	Re-cost columnar table index paths	2021-08-02 11:16:37 +03:00
Onur Tirtir	8adcf2096b	Multiply ColumnarCustomScan cost by tblspace.seqpage cost	2021-08-02 11:16:37 +03:00
Onur Tirtir	dba8421453	Refactor ColumnarScanCost into ColumnarPerChunkGroupScanCost	2021-08-02 11:16:37 +03:00
Onur Tirtir	d8f92697f2	Free memory used for last stripe read when re-scanning a columnar table (#5143 ) Instead of setting stripeReadState to NULL, call ColumnarResetRead before re-scanning a columnar table since this function is already designed for doing the necessary clean up when finishing a stripe read. Note that this change shouldn't have a great effect on memory usage since AdvanceStripe was already doing the clean-up for all the stripes except the last one.	2021-08-02 11:16:01 +03:00
Onur Tirtir	38940ed2a6	Merge pull request #5058 from citusdata/col/optimize-index-read Use long-lasting mem cxt during columnar index scan & optimize correlated ones	2021-08-02 11:06:57 +03:00
Onur Tirtir	73058d35cc	Not free (stripe) chunk buffers after de-serializing Previously, we were only using chunk group reader for sequential scan. However, to support index scans on columnar tables, now we use very same low level functions for index scan too. Since those low-level functions were only used for sequential scan, it was guaranteed that we would never read the same chunk group more than once, so we were freeing chunk buffers after deserializing them into a separate buffer. Now that we use those low level functions for index scan, we cannot free chunk buffers since it's possible to read the same chunk group again, such that: - read chunk group 1 of stripe 5 - read chunk group 2 of stripe 5 - read chunk group 1 of stripe 5 again Here, when we decide to read chunk group 1 for a second time, chunk group 1 is not cached. Plus, before this commit, we were freeing the chunk buffers for chunk group 1 after the first read and then we were getting segfault or errors from low-level de-compression APIs.	2021-08-02 11:00:12 +03:00
Onur Tirtir	327ae43b83	Get rid of EndStripeRead, since we anyway reset mem cxt	2021-08-02 11:00:12 +03:00
Onur Tirtir	83f5d42365	Use long-lasting mem cxt & optimize correlated index scan	2021-08-02 11:00:12 +03:00
Onur Tirtir	c021b82a43	Introduce CreateColumnarScanMemoryContext	2021-08-02 11:00:12 +03:00
Onur Tirtir	a25d89e4cb	Merge pull request #5103 from citusdata/at-set-columnar-index Keep supported indexes when converting table to columnar. Previously, as indexes were not supported by columnar tables, we were ignoring all the indexes & index-based constraints of table when converting it to a columnar table. However, now that we support `btree` & `hash` indexAM's for columnar tables, we only ignore the indexAM's other than those two. However, the way we ignore the unsupported indexes is now a bit different than before. Previously we were just _not creating_ any index types after converting table to columnar as we didn't support any of the index types. Now that we support `btree` & `hash` indexAMs for columnar tables, now we really drop the unsupported index types since re-creating the remaining ones is easier than adding some code that creates only the supported indexes.	2021-07-30 17:01:30 +03:00
Onur Tirtir	84a49cc221	Improve error message for indexAMs not supported by columnar	2021-07-30 16:41:53 +03:00
Onur Tirtir	90e856d6bc	Keep supported indexes when converting table to columnar	2021-07-30 16:41:01 +03:00
Onur Tirtir	eeecbd2324	Introduce ColumnarSupportsIndexAM	2021-07-30 16:40:27 +03:00
Halil Ozan Akgül	d140ca1b0e	Merge pull request #5146 from citusdata/fix_ruleutils_13_endif_comment Corrects the ruleutils_13.c endif comment	2021-07-29 17:27:01 +03:00
Halil Ozan Akgul	286b0fe0e8	Corrects the endif comment	2021-07-29 17:22:31 +03:00
SaitTalhaNisanci	4559d02c41	Fix union pushdown issue (#5079 ) * Fix UNION not being pushed down Postgres optimizes away column fields that are not needed in the output. We were relying on these fields to understand if it is safe to push down a union query. This fix looks at the parse query, which has the original column fields, to detect if it is safe to push down a union query. * Add more tests * Simplify code and make it more robust * Process varlevelsup > 0 in FindReferencedTableColumn * Only look for outer vars in union path * Add more comments * Remove UNION ALL specific logic for pulling up childvars	2021-07-29 13:52:55 +03:00
Jelte Fennema	2aa67421a7	Fix showing target shard size in the rebalance progress monitor (#5136 ) The progress monitor wouldn't actually update the size of the shard on the target node when using "block_writes" as the `shard_transfer_mode`. The reason for this is that the CREATE TABLE part of the shard creation would only be committed once all data was moved as well. This caused our size calculation to always return 0, since the table did not exist yet in the session that the progress monitor used. This is fixed by first committing creation of the table, and only then starting the actual data copy. The test output changes slightly. Apparently splitting this up into two transactions instead of one increases the table size after the copy by about 40kB. The additional size used doesn't increase when the amount of data in the table is larger (it stays ~40kB per shard). So this small change in test output is not considered an actual problem.	2021-07-23 16:37:00 +02:00
Jelte Fennema	4c1066e463	Merge pull request #5133 from citusdata/add-cache-to-sequence-def-mx Include data_type and cache in sequence definition on workers	2021-07-22 11:57:03 +02:00
Jelte Fennema	7d0b6dc9be	Include data_type and cache in sequence definition on workers These two options were not included when creating the sequences on the workers as part of metadata syncing. The missing `data_type` part of the definition made finding the cause of #5126 harder than necessary, because of confusing errors.	2021-07-22 11:49:06 +02:00
Önder Kalacı	f52db0abab	Merge pull request #5127 from citusdata/get_ready_tenant_isolation Introduce citus_internal_delete_shard_metadata	2021-07-19 14:43:47 +02:00
Onder Kalaci	903489c763	Improve wording of an error message	2021-07-19 14:38:52 +02:00
Onder Kalaci	c8368e7929	Introduce citus_internal_delete_shard_metadata With this function, the owner of the table is allowed to remove shard metadata. This is going to be useful for tenant-isolation.	2021-07-19 13:25:05 +02:00
Önder Kalacı	87a51ae552	CLUSTER ON deparser should consider schemas (#5122 )	2021-07-16 19:13:18 +03:00
Hanefi Onaldi	38c139ba59	Merge pull request #5114 from citusdata/changelog-updates	2021-07-16 17:53:36 +03:00
Hanefi Onaldi	6b4996f47e	Add changelog entries for 10.1.0 This patch also moves the section to the top of the changelog	2021-07-16 16:51:12 +03:00
Jelte Fennema	adf17a8cf1	Add upgrade and downgrade tests for Citus 10.2 (#5120 ) It seems we forgot to add this when starting 10.2 development.	2021-07-16 14:39:04 +02:00
Önder Kalacı	644052ea58	Merge pull request #5105 from citusdata/regular_user_metadata_sync Use current user while syncing metadata	2021-07-16 14:00:32 +02:00
Onder Kalaci	2c349e6dfd	Use current user to sync metadata Before this commit, we always synced the metadata with superuser. However, that creates various edge cases such as visibility errors or self distributed deadlocks, or complicates user access checks. Instead, with this commit, we use the current user to sync the metadata. Note that `start_metadata_sync_to_node` still requires superuser because accessing certain metadata (like pg_dist_node) always requires superuser (e.g., the current user should be a superuser). However, metadata syncing operations regarding the distributed tables can now be done with regular users, as long as the user is the owner of the table. A table owner can still insert nonsense metadata, however it'd only affect its own table. So, we cannot do anything about that.	2021-07-16 13:25:27 +02:00
Hanefi Onaldi	b3cc9d63cb	Merge pull request #5111 from citusdata/changelog-updates	2021-07-14 15:42:43 +03:00
Hanefi Onaldi	45b72c204d	Add changelog entry for 10.0.4	2021-07-14 15:04:45 +03:00
Onur Tirtir	f00c63c33d	Support columnar table index builds with CONCURRENTLY option (#5032 ) With this commit, we add (`CREATE INDEX` / `REINDEX`) `CONCURRENTLY` support for columnar tables. For that, we implement `columnar_index_validate_scan` callback. The reasoning behind the implementation is as follows: * Postgres function `validate_index` provides all the TIDs that are currently in the index to `columnar_index_validate_scan` callback via a `tupleSort` object. * We start scanning the table by using `columnar_getnextslot` as usual. Before moving forward, note that `columnar_getnextslot` guarantees to return tuples in the order of their TIDs. * For us to use during table scan, postgres provides a snapshot guaranteeing that any tuples that are valid according to that snapshot but are not in the index must be added to the index. * Then for each tuple that we read from our table, we continue iterating given `tupleSort` to find the first TID that is greater than or equal to our tuple's TID. If both TIDs are equal to each other, then we skip the tuple since it's already indexed. If the TID that we read from tupleSort is greater than our tuple's TID, then we decide to insert this tuple into index.	2021-07-09 13:44:58 +03:00
Onur Tirtir	ea5fe022a4	Be more explicit when doing ordered scan on columnar cat. tables (#5026 ) systable_getnext already uses ForwardScanDirection if relation has any open indexes, but let's be more explicit doing ordered scan on columnar catalog tables.	2021-07-09 13:24:27 +03:00
Hanefi Onaldi	ab873c6b58	Merge pull request #5030 from citusdata/do-not-use-public-schema	2021-07-09 02:15:42 +03:00
Hanefi Onaldi	efc5776451	Remove public schema dependency for 10.1 upgrades This commit contains a subset of the changes that should be cherry picked to 10.1 releases.	2021-07-09 02:08:22 +03:00
Hanefi Onaldi	8e9cc229ff	Remove public schema dependency for 10.0 upgrades This commit contains a subset of the changes that should be cherry picked to 10.0 releases.	2021-07-09 02:08:22 +03:00
Ahmet Gedemenli	ed3b98a80b	Add failure test for stop_metadata_sync_to_node (#5102 )	2021-07-08 18:23:19 +03:00
Hanefi Onaldi	38c24ae0db	Merge pull request #5100 from citusdata/changelog-updates	2021-07-08 16:02:11 +03:00
Hanefi Onaldi	b68188cd3f	fixup! Add changelog entry for 9.5.6	2021-07-08 15:25:00 +03:00
Hanefi Onaldi	d96d730178	Add changelog entry for 9.5.6	2021-07-08 13:31:19 +03:00
Nils Dijk	2e60f5cf43	Merge pull request #5095 from citusdata/fix/prepare-upgrade-idempotent Fix: citus_prepare_pg_upgrade idempotency	2021-07-08 12:29:18 +02:00
Nils Dijk	18652ef9ff	fix 10.1-1 upgrade script to adhere to idempotency	2021-07-08 12:24:52 +02:00
Nils Dijk	e5517dc7b3	fix 9.5-2 upgrade script to adhere to idempotency	2021-07-08 12:24:52 +02:00
Nils Dijk	366796a72e	Add test for idempotency of citus_prepare_pg_upgrade	2021-07-08 12:24:51 +02:00
Hanefi Onaldi	79979f56cf	Merge pull request #5092 from citusdata/changelog-updates	2021-07-08 09:44:38 +03:00
Hanefi Onaldi	d3b6651403	Add changelog entry for 9.5.5	2021-07-07 16:34:30 +03:00
Hanefi Onaldi	80a5539671	Add changelog entry for 9.4.5	2021-07-07 16:34:30 +03:00
Onur Tirtir	dfcfa18edc	Merge pull request #5088 from citusdata/col/refactor-reader Remove stripeList (list of StripeMetadata) & currentStripe (stripeList index of the current stripe being read) from ColumnarReadState, introduce currentStripeMetadata.	2021-07-07 11:21:16 +03:00
Onur Tirtir	7bfd84bc70	Introduce StripeGetHighestRowNumber	2021-07-07 11:01:39 +03:00
Onur Tirtir	8942086506	Remove stripeList & currentStripe from ColumnarReadState	2021-07-07 11:01:39 +03:00
Onur Tirtir	16dee73b10	Refactor FindStripeByRowNumber into StripeMetadataLookupRowNumber Push most of the logic in FindStripeByRowNumber down to a helper function to re-use it in next commit.	2021-07-07 11:01:38 +03:00
Nils Dijk	2954fb0ee8	Merge pull request #5076 from citusdata/marcocitus/fix-pg-upgrade DESCRIPTION: Fixes an issue that could cause citus_finish_pg_upgrade to fail Rewiring Citus upgrade scripts to fix the prepare/finish PG upgrade scripts. Users who upgrade to the patch release of 9.4 will get the fix via 9.4-1--9.4-2. Users who upgrade to the patch release of 9.5 will get the fix via 9.5-1--9.5-2. Users who upgrade to the patch release of 10.0 will get the fix via 10.0-3--10.0-4. Users who upgrade to Citus 10.1 will also get it as part of the 10.0-3--10.0-4 upgrade path. Given that we use CREATE OR REPLACE, it's ok to get the fix multiple times. Fixes #5068, but not #5069.	2021-07-05 16:14:14 +02:00
Marco Slot	214c674989	Fix PG upgrade scripts for 10.1	2021-07-05 14:38:26 +02:00
Marco Slot	b14955c2bd	Fix PG upgrade scripts for 10.0	2021-07-05 14:38:20 +02:00
Marco Slot	3c0dfc12c0	Fix PG upgrade scripts for 9.5	2021-07-05 13:39:35 +02:00
Marco Slot	bee202aa39	Fix PG upgrade scripts for 9.4	2021-07-05 13:39:28 +02:00
Onur Tirtir	b118d4188e	Fix lower boundary calculation when pruning range dist table shards (#5082 ) This happens only when we have a "<" or "<=" filter on distribution column of a range distributed table and that filter falls in between two shards. When the filter falls in between two shards: If the filter is ">" or ">=", then UpperShardBoundary was returning "upperBoundIndex - 1", where upperBoundIndex is exclusive shard index used during binary search. This is expected since upperBoundIndex is an exclusive index. If the filter is "<" or "<=", then LowerShardBoundary was returning "lowerBoundIndex + 1", where lowerBoundIndex is inclusive shard index used during binary search. On the other hand, since lowerBoundIndex is an inclusive index, we should just return lowerBoundIndex instead of doing "+ 1". Before this commit, we were missing leftmost shard in such queries. * Remove useless conditional branches The branch that we delete from UpperShardBoundary was obviously useless. The other one in LowerShardBoundary became useless after we remove "+ 1" from there. This indeed is another proof of what & how we are fixing with this pr. * Improve comments and add more * Add some tests for upper bound calculation too	2021-07-02 14:48:21 +03:00
Ahmet Gedemenli	8bae58fdb7	Add parameter to cleanup metadata (#5055 ) * Add parameter to cleanup metadata * Set clear metadata default to true * Add test for clearing metadata * Separate test file for start/stop metadata syncing * Fix stop_sync bug for secondary nodes * Use PreventInTransactionBlock * Remove debugging logs * Remove relation not found logs from mx test * Revert localGroupId when doing stop_sync * Move metadata sync test to mx schedule * Add test with name that needs to be quoted * Add test for views and matviews * Add test for distributed table with custom type * Add comments to test * Add test with stats, indexes and constraints * Fix matview test * Add test for dropped column * Add notice messages to stop_metadata_sync * Add coordinator check to stop metadata sync * Revert local_group_id only if clearMetadata is true * Add a final check to see the metadata is sane * Remove the drop verbosity in test * Remove table description tests from sync test * Add stop sync to coordinator test * Change the order in stop_sync * Add test for hybrid (columnar+heap) partitioned table * Change error to notice for stop sync to coordinator * Sync at the end of the test to prevent any failures * Add test case in a transaction block * Remove relation not found tests	2021-07-01 16:23:53 +03:00
SaitTalhaNisanci	c932642e3b	Merge pull request #4994 from citusdata/fix/shardPlacementList Exclude orphaned shards while finding shard placements	2021-06-28 18:02:30 +03:00
Sait Talha Nisanci	e7ed16c296	Not include to-be-deleted shards while finding shard placements Ignore orphaned shards in more places Only use active shard placements in RouterInsertTaskList Use IncludingOrphanedPlacements in some more places Fix comment Add tests	2021-06-28 13:05:31 +03:00
Jelte Fennema	802225940e	Make clear that IsTableLocallyAccessible is only for citus local tables (#5075 ) The name and comment of this function did not indicate that it only really could detect locally accessible citus local tables. This fixes that, while also cleaning up the function a bit.	2021-06-28 11:47:21 +02:00
Naisila Puka	fe5907ad2d	Adds propagation of ALTER SEQUENCE and other improvements (#5061 ) * Alter seq type when we first use the seq in a dist table * Don't allow type changes when seq is used in dist table * ALTER SEQUENCE propagation * Tests for ALTER SEQUENCE propagation * Relocate AlterSequenceType and ensure dependencies for sequence * Support for citus local tables, and other fixes * Final formatting	2021-06-24 21:23:25 +03:00
Jelte Fennema	e9bfb8eddd	Fix check to always allow foreign keys to reference tables (#5073 ) With the previous version of this check we would disallow distributed tables that did not have a colocationid, to have a foreign key to a reference table. This fixes that, since there's no reason to disallow that.	2021-06-24 12:15:52 +02:00
Jelte Fennema	f4a2d99ce9	Harden ReplicateShardToNode to unexpected placements (#5071 ) Originally ReplicateShardToNode was meant for `upgrade_to_reference_table`, which required handling of existing inactive placements. These days `upgrade_to_reference_table` is deprecated and cannot be used anymore. Now that we have SHARD_STATE_TO_DELETE too, this leftover code seemed error prone. So this removes support for activating inactive reference table placements, since these should not be possible. If it finds a non-active reference table placement anyway, it now errors out. This also removes a few outdated comments related to `upgrade_to_reference_table`.	2021-06-24 13:11:02 +03:00
Jelte Fennema	d1d386a904	Only allow moves of shards of distributed tables (#5072 ) Moving shards of reference tables was possible in at least one case: ```sql select citus_disable_node('localhost', 9702); create table r(x int); select create_reference_table('r'); set citus.replicate_reference_tables_on_activate = off; select citus_activate_node('localhost', 9702); select citus_move_shard_placement(102008, 'localhost', 9701, 'localhost', 9702); ``` This would then remove the reference table shard on the source, causing all kinds of issues. This fixes that by disallowing all shard moves except for shards of distributed tables. Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-06-23 16:25:46 +02:00
Marco Slot	b2d51e6691	Add link to paper to README	2021-06-23 11:38:08 +02:00
Önder Kalacı	2939f10fd0	Merge pull request #5067 from citusdata/add_tests_mx Add regression tests for changing column type with fkey	2021-06-23 09:33:41 +03:00
Onder Kalaci	75847d10b5	Add regression tests for changing column type with fkey closes https://github.com/citusdata/citus/issues/2337 as it doesn't apply anymore.	2021-06-23 09:03:55 +03:00
Önder Kalacı	0a1a3e0dc0	Merge pull request #5066 from citusdata/fix_test fix regression tests to avoid any conflicts in enterprise	2021-06-22 08:49:13 +03:00
Onder Kalaci	55ed93bf0d	fix regression tests to avoid any conflicts in enterprise	2021-06-22 08:45:17 +03:00
Jelte Fennema	ca00b63272	Avoid two race conditions in the rebalance progress monitor (#5050 ) The first and main issue was that we were putting absolute pointers into shared memory for the `steps` field of the `ProgressMonitorData`. This pointer was being overwritten every time a process requested the monitor steps, which is the only reason why this even worked in the first place. To quote a part of a relevant stack overflow answer: > First of all, putting absolute pointers in shared memory segments is > terrible terible idea - those pointers would only be valid in the > process that filled in their values. Shared memory segments are not > guaranteed to attach at the same virtual address in every process. > On the contrary - they attach where the system deems it possible when > `shmaddr == NULL` is specified on call to `shmat()` Source: https://stackoverflow.com/a/10781921/2570866 In this case a race condition occurred when a second process overwrote the pointer in between the first process's write and read of the steps field. This issue is fixed by not storing the pointer in shared memory anymore. Instead we now calculate its position every time we need it. The second race condition I have not been able to trigger, but I found it while investigating this. This issue was that we published the handle of the shared memory segment before we initialized the data in the steps. This means that during initialization of the data, a call to `get_rebalance_progress()` could read partial data in an unsynchronized manner.	2021-06-21 14:03:42 +00:00
Önder Kalacı	206401b708	Merge pull request #5064 from citusdata/solidfy_prepared_statements Improve regression tests for prepared statements for local cached plans	2021-06-21 14:07:32 +03:00
Onder Kalaci	76ae5dd0db	Improve regression tests for prepared statements With a recent commit (`644b266dee`), the behaviour of prepared statements for local cached plans has slightly changed. Now, Citus caches the plans when they are re-used. This makes local plan caching trigger on the 7th execution, and the 8th execution is the first time the plan is used from the cache. So, the tests are improved to cover the 8th execution.	2021-06-21 13:34:44 +03:00
Önder Kalacı	4e632d9da3	Merge pull request #5053 from citusdata/fix_dropped_cached_plans Deparse/parse the local cached queries	2021-06-21 13:33:36 +03:00
Onder Kalaci	69ca943e58	Deparse/parse the local cached queries With local query caching, we try to avoid deparse/parse stages as the operation is too costly. However, we can do deparse/parse operations once per cached query, right before we put the plan into the cache. With that, we avoid edge cases like (4239) or (5038). In a sense, we are making the local plan caching behave similarly to non-cached local/remote queries, by forcing to deparse the query once.	2021-06-21 12:24:29 +03:00
Onur Tirtir	82e58c91f3	Use correct test schedule name in columnar vg test target (#5027 )	2021-06-18 11:31:16 +03:00
Onur Tirtir	b0ca823b4d	Merge pull request #5052 from citusdata/columnar-index Merge columnar metapage changes and basic index support	2021-06-17 14:55:40 +03:00
Onur Tirtir	6215a3aa93	Merge remote-tracking branch 'origin/master' into columnar-index	2021-06-17 14:31:12 +03:00
Hanefi Onaldi	c4f50185e0	Ignore pl/pgsql line numbers in regression outputs (#4411 )	2021-06-17 14:11:17 +03:00
SaitTalhaNisanci	3edef11a9f	Fix a test in hyperscale schedule (#5042 )	2021-06-17 13:40:05 +03:00
Önder Kalacı	e56f5909c9	Merge pull request #5054 from citusdata/base_for_enterprise_16_june Get ready for Improve index backed constraint creation for online rebalancer	2021-06-17 13:09:28 +03:00
Onder Kalaci	bc09288651	Get ready for Improve index backed constraint creation for online rebalancer See: https://github.com/citusdata/citus-enterprise/issues/616	2021-06-17 13:05:56 +03:00
Onur Tirtir	681f700321	Fix first_row_number test for stripe_row_limit enforcement	2021-06-17 10:51:43 +03:00
Onur Tirtir	18fe0311c0	Move rest of the schema changes to 10.2-1	2021-06-16 20:43:41 +03:00
Onur Tirtir	07117b0454	Move sql files for upgrade/downgrade_columnar_storage to 10.2-1	2021-06-16 20:40:26 +03:00
Onur Tirtir	3d11c0f9ef	Merge remote-tracking branch 'origin/master' into columnar-index Conflicts: src/test/regress/expected/columnar_empty.out src/test/regress/expected/multi_extension.out	2021-06-16 20:23:50 +03:00
Onur Tirtir	a2efe59e2f	Merge pull request #4950 from citusdata/col/index-support Add basic index support for columnar tables. This pr brings the support for following index/constraint types: * btree indexes * primary keys * unique constraints / indexes * exclusion constraints * hash indexes * partial indexes * indexes including additional columns (INCLUDE syntax), even if we don't properly support index-only scans	2021-06-16 20:11:54 +03:00
Onur Tirtir	b6b969971a	Error out for CLUSTER commands on columnar tables	2021-06-16 20:06:33 +03:00
Onur Tirtir	5adab2a3ac	Report progress when building index on columnar tables	2021-06-16 20:06:33 +03:00
Onur Tirtir	9b4dc2f804	Prevent using parallel scan for columnar index builds	2021-06-16 19:59:32 +03:00
Onur Tirtir	82ea1b5daf	Not remove all paths, keep IndexPath's	2021-06-16 19:59:32 +03:00
Onur Tirtir	1af50e98b3	Fix a comment in ColumnarMetapageRead	2021-06-16 19:59:32 +03:00
Onur Tirtir	10a762aa88	Implement columnar index support functions	2021-06-16 19:59:32 +03:00
Halil Ozan Akgül	27c7d28f7f	Merge pull request #5045 from citusdata/master-update-version-9e0729f4-4406-43ed-9942-d27aa5c398ec Bump Citus to 10.2devel	2021-06-16 19:26:41 +03:00
Halil Ozan Akgul	db03afe91e	Bump citus version to 10.2devel	2021-06-16 17:44:05 +03:00
Ahmet Gedemenli	5115100db0	Set table size to zero if no size is read (#5049 ) * Set table size to zero if no size is read * Add comment to relation size bug fix	2021-06-16 17:23:19 +03:00
SaitTalhaNisanci	2511c4c045	Merge pull request #5025 from citusdata/split_multi Split multi schedule	2021-06-16 15:30:15 +03:00
SaitTalhaNisanci	1784c7ef85	Merge branch 'master' into split_multi	2021-06-16 15:26:09 +03:00
Marco Slot	9797857967	Merge pull request #5048 from citusdata/marcocitus/fix-wcoar-null-input	2021-06-16 13:40:51 +02:00
Sait Talha Nisanci	c7d04e7f40	swap multi_schedule and multi_schedule_1	2021-06-16 14:40:14 +03:00
Sait Talha Nisanci	c55e44a4af	Drop table if exists	2021-06-16 14:19:59 +03:00
Sait Talha Nisanci	fc89487e93	Split check multi	2021-06-16 14:19:59 +03:00
Naisila Puka	e26b29d3bb	Fix nextval('seq_name'::text) bug, and schema for seq tests (#5046 )	2021-06-16 13:58:49 +03:00
Marco Slot	a7e4d6c94a	Fix a bug that causes worker_create_or_alter_role to crash with NULL input	2021-06-15 20:07:08 +02:00
Halil Ozan Akgül	72eb37095b	Merge pull request #5043 from citusdata/citus-10.1.0-changelog-1623733267 Update Changelog for 10.1.0	2021-06-15 17:21:19 +03:00
Halil Ozan Akgul	91db015051	Add changelog entry for 10.1.0	2021-06-15 14:28:15 +03:00
Jelte Fennema	4c3934272f	Improve performance of citus_shards (#5036 ) We were effectively joining on a calculated column because of our calls to `shard_name`. This caused a really bad plan to be generated. In my specific case it was taking ~18 seconds to show the output of citus_shards. It had this explain plan: ``` QUERY PLAN ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── Subquery Scan on citus_shards (cost=18369.74..18437.34 rows=5408 width=124) (actual time=18277.461..18278.509 rows=5408 loops=1) -> Sort (cost=18369.74..18383.26 rows=5408 width=156) (actual time=18277.457..18277.726 rows=5408 loops=1) Sort Key: ((pg_dist_shard.logicalrelid)::text), pg_dist_shard.shardid Sort Method: quicksort Memory: 1629kB CTE shard_sizes -> Function Scan on citus_shard_sizes (cost=0.00..10.00 rows=1000 width=40) (actual time=71.137..71.934 rows=5413 loops=1) -> Hash Join (cost=177.62..18024.42 rows=5408 width=156) (actual time=77.985..18257.237 rows=5408 loops=1) Hash Cond: ((pg_dist_shard.logicalrelid)::oid = (pg_dist_partition.logicalrelid)::oid) -> Hash Join (cost=169.81..371.98 rows=5408 width=48) (actual time=1.415..13.166 rows=5408 loops=1) Hash Cond: (pg_dist_placement.groupid = pg_dist_node.groupid) -> Hash Join (cost=168.68..296.49 rows=5408 width=16) (actual time=1.403..10.011 rows=5408 loops=1) Hash Cond: (pg_dist_placement.shardid = pg_dist_shard.shardid) -> Seq Scan on pg_dist_placement (cost=0.00..113.60 rows=5408 width=12) (actual time=0.004..3.684 rows=5408 loops=1) Filter: (shardstate = 1) -> Hash (cost=101.08..101.08 rows=5408 width=12) (actual time=1.385..1.386 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 318kB -> Seq Scan on pg_dist_shard (cost=0.00..101.08 rows=5408 width=12) (actual time=0.003..0.688 rows=5408 loops=1) -> Hash (cost=1.06..1.06 rows=6 width=40) (actual 
time=0.007..0.007 rows=6 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 9kB -> Seq Scan on pg_dist_node (cost=0.00..1.06 rows=6 width=40) (actual time=0.004..0.005 rows=6 loops=1) -> Hash (cost=5.69..5.69 rows=169 width=130) (actual time=0.070..0.071 rows=169 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 36kB -> Seq Scan on pg_dist_partition (cost=0.00..5.69 rows=169 width=130) (actual time=0.009..0.041 rows=169 loops=1) SubPlan 2 -> Limit (cost=0.00..3.25 rows=1 width=8) (actual time=3.370..3.370 rows=1 loops=5408) -> CTE Scan on shard_sizes (cost=0.00..32.50 rows=10 width=8) (actual time=3.369..3.369 rows=1 loops=5408) Filter: ((shard_name(pg_dist_shard.logicalrelid, pg_dist_shard.shardid) = table_name) OR (('public.'::text \|\| shard_name(pg_dist_shard.logicalrelid, pg_dist_shard.shardid)) = table_name)) Rows Removed by Filter: 2707 Planning Time: 0.705 ms Execution Time: 18278.877 ms ``` With the changes it only takes 180ms to show the same output: ``` QUERY PLAN ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────── Sort (cost=904.59..918.11 rows=5408 width=156) (actual time=182.508..182.960 rows=5408 loops=1) Sort Key: ((pg_dist_shard.logicalrelid)::text), pg_dist_shard.shardid Sort Method: quicksort Memory: 1629kB -> Hash Join (cost=418.03..569.27 rows=5408 width=156) (actual time=136.333..146.591 rows=5408 loops=1) Hash Cond: ((pg_dist_shard.logicalrelid)::oid = (pg_dist_partition.logicalrelid)::oid) -> Hash Join (cost=410.22..492.83 rows=5408 width=56) (actual time=136.231..140.132 rows=5408 loops=1) Hash Cond: (pg_dist_placement.groupid = pg_dist_node.groupid) -> Hash Right Join (cost=409.09..417.34 rows=5408 width=24) (actual time=136.218..138.890 rows=5408 loops=1) Hash Cond: ((((regexp_matches(citus_shard_sizes.table_name, '_(\d+)$'::text))[1])::integer) = pg_dist_shard.shardid) -> HashAggregate (cost=45.00..48.50 rows=200 width=12) 
(actual time=131.609..132.481 rows=5408 loops=1) Group Key: ((regexp_matches(citus_shard_sizes.table_name, '_(\d+)$'::text))[1])::integer Batches: 1 Memory Usage: 737kB -> Result (cost=0.00..40.00 rows=1000 width=12) (actual time=107.786..129.831 rows=5408 loops=1) -> ProjectSet (cost=0.00..22.50 rows=1000 width=40) (actual time=107.780..128.492 rows=5408 loops=1) -> Function Scan on citus_shard_sizes (cost=0.00..10.00 rows=1000 width=40) (actual time=107.746..108.107 rows=5414 loops=1) -> Hash (cost=296.49..296.49 rows=5408 width=16) (actual time=4.595..4.598 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 339kB -> Hash Join (cost=168.68..296.49 rows=5408 width=16) (actual time=1.702..3.783 rows=5408 loops=1) Hash Cond: (pg_dist_placement.shardid = pg_dist_shard.shardid) -> Seq Scan on pg_dist_placement (cost=0.00..113.60 rows=5408 width=12) (actual time=0.004..0.837 rows=5408 loops=1) Filter: (shardstate = 1) -> Hash (cost=101.08..101.08 rows=5408 width=12) (actual time=1.683..1.685 rows=5408 loops=1) Buckets: 8192 Batches: 1 Memory Usage: 318kB -> Seq Scan on pg_dist_shard (cost=0.00..101.08 rows=5408 width=12) (actual time=0.004..0.824 rows=5408 loops=1) -> Hash (cost=1.06..1.06 rows=6 width=40) (actual time=0.007..0.008 rows=6 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 9kB -> Seq Scan on pg_dist_node (cost=0.00..1.06 rows=6 width=40) (actual time=0.004..0.006 rows=6 loops=1) -> Hash (cost=5.69..5.69 rows=169 width=130) (actual time=0.079..0.079 rows=169 loops=1) Buckets: 1024 Batches: 1 Memory Usage: 36kB -> Seq Scan on pg_dist_partition (cost=0.00..5.69 rows=169 width=130) (actual time=0.011..0.046 rows=169 loops=1) Planning Time: 0.789 ms Execution Time: 184.095 ms ```	2021-06-14 13:32:30 +02:00
Onur Tirtir	a209999618	Enforce table opt constraints when using alter_columnar_table_set (#5029 )	2021-06-08 17:39:16 +03:00
Hanefi Onaldi	5c6069a74a	Do not rely on fk cache when truncating local data (#5018 )	2021-06-07 11:56:48 +03:00
Marco Slot	9770a1bf00	Merge pull request #5020 from citusdata/disable-dropping-shards	2021-06-04 14:48:11 +02:00
Jelte Fennema	c113cb3198	Merge pull request #5024 from citusdata/cleanup-old-shards-before-rebalance	2021-06-04 14:37:22 +02:00
Jelte Fennema	1a83628195	Use "orphaned shards" naming in more places We were not very consistent in how we named these shards.	2021-06-04 11:39:19 +02:00
Jelte Fennema	3f60e4f394	Add ExecuteCriticalCommandInDifferentTransaction function We use this pattern multiple times throughout the codebase now. Seems like a good moment to abstract it away.	2021-06-04 11:30:27 +02:00
Jelte Fennema	503c70b619	Cleanup orphaned shards before moving when necessary A shard move would fail if there was an orphaned version of the shard on the target node. With this change, before actually failing, we try to clean up orphaned shards to see if that fixes the issue.	2021-06-04 11:23:07 +02:00
Jelte Fennema	280b9ae018	Cleanup orphaned shards at the start of a rebalance In case the background daemon hasn't cleaned up shards yet, we do this manually at the start of a rebalance.	2021-06-04 11:23:07 +02:00
Jelte Fennema	7015049ea5	Add citus_cleanup_orphaned_shards UDF Sometimes the background daemon doesn't cleanup orphaned shards quickly enough. It's useful to have a UDF to trigger this removal when needed. We already had a UDF like this but it was only used during testing. This exposes that UDF to users. As a safety measure it cannot be run in a transaction, because that would cause the background daemon to stop cleaning up shards while this transaction is running.	2021-06-04 11:23:07 +02:00
Naisila Puka	0f37ab5f85	Fixes column default coming from a sequence (#4914 ) * Add user-defined sequence support for MX * Remove default part when propagating to workers * Fix ALTER TABLE with sequences for mx tables * Clean up and add tests * Propagate DROP SEQUENCE * Removing function parts * Propagate ALTER SEQUENCE * Change sequence type before propagation & cleanup * Revert "Propagate ALTER SEQUENCE" This reverts commit 2bef64c5a29f4e7224a7f43b43b88e0133c65159. * Ensure sequence is not used in a different column with different type * Insert select tests * Propagate rename sequence stmt * Fix issue with group ID cache invalidation * Add ALTER TABLE ALTER COLUMN TYPE .. precaution * Fix attnum inconsistency and add various tests * Add ALTER SEQUENCE precaution * Remove Citus hook * More tests Co-authored-by: Marco Slot <marco.slot@gmail.com>	2021-06-03 23:02:09 +03:00
Marco Slot	ec9664c5a4	Merge pull request #5021 from citusdata/marcocitus/fix-remove-node	2021-06-03 11:27:57 +02:00
Hanefi Onaldi	056005db4d	Improve tests for truncating local data (#5012 ) We have a slightly different behavior when using truncate_local_data_after_distributing_table UDF on metadata synced clusters. This PR aims to add tests to cover such cases. We allow distributing tables with data that have foreign keys to reference tables only on metadata synced clusters. This is the reason why some of my earlier tests failed when run on a single node Citus cluster.	2021-06-03 08:51:32 +03:00
Nils Dijk	5f76b93eac	fix link to codecov report from badge (#5022 ) links the codecov badge to the codecov report instead of the badge	2021-06-02 16:48:33 +02:00
Marco Slot	e81d25a7be	Refactor RelationIsAKnownShard to remove onlySearchPath argument	2021-06-02 14:30:27 +02:00
Ahmet Gedemenli	089ef35940	Disable dropping and truncating known shards Add test for disabling dropping and truncating known shards	2021-06-02 14:30:27 +02:00
Hanefi Onaldi	fa29d6667a	Accept invalidation before fk graph validity check (#5017 ) InvalidateForeignKeyGraph sends an invalidation via shared memory to all backends, including the current one. However, we might not call AcceptInvalidationMessages before reading from the cache below. It would be better to also add a call to AcceptInvalidationMessages in IsForeignConstraintRelationshipGraphValid.	2021-06-02 14:45:35 +03:00
Ahmet Gedemenli	f9c7d74623	Merge pull request #5019 from citusdata/sort-gucs-in-alphabetical-order Sort GUCs in alphabetical order	2021-06-02 13:01:25 +03:00
Ahmet Gedemenli	103cf34418	Sort GUCs in alphabetical order	2021-06-02 12:52:18 +03:00
Jelte Fennema	abbcf4099a	Merge pull request #5013 from citusdata/better-citus-version-check Move CheckCitusVersion to the top of each function	2021-06-02 10:03:42 +02:00
Jelte Fennema	b1cad26ebc	Move CheckCitusVersion to the top of each function Previously this was usually done after argument parsing. This can cause SEGFAULTs if the number or type of arguments changes in a new version. By checking that Citus version is correct before doing any argument parsing we protect against these types of issues. Issues like this have occurred in pg_auto_failover, so it's not just a theoretical issue. The main reason why these calls were not at the top of functions is really just historical. It was because in the past we didn't allow statements before declarations. Thus having this check before the argument parsing would have only been possible if we first declared all variables. In addition to moving existing CheckCitusVersion calls it also adds these calls to rebalancer related functions (they were missing there).	2021-06-01 17:43:46 +02:00
Ahmet Gedemenli	98081557fb	Merge pull request #5016 from citusdata/fix-test-shard-id-issue Fix shard id difference for enterprise	2021-06-01 17:44:24 +03:00
Ahmet Gedemenli	0fbddc740d	Fix shard id difference for enterprise	2021-06-01 17:17:46 +03:00
Jelte Fennema	4c20bf7a36	Remove pg_dist_rebalence_strategy_enterprise_check (#5014 ) This is not necessary anymore now that the rebalancer is open source.	2021-06-01 06:16:46 -07:00
Ahmet Gedemenli	e2704d9ad9	Merge pull request #5015 from citusdata/fix-relname-null-bug-when-parallel-execution Fix relname null bug when parallel execution	2021-06-01 15:05:32 +03:00
Ahmet Gedemenli	69d39c0e8b	Fix relname null bug when parallel execution	2021-06-01 14:14:35 +03:00
Ahmet Gedemenli	28b97c6c53	Merge pull request #5010 from citusdata/remove-func-generate-new-target-entries-for-sort-clauses Remove function GenerateNewTargetEntriesForSortClauses	2021-06-01 12:46:47 +03:00
Ahmet Gedemenli	9638933d9d	Remove function GenerateNewTargetEntriesForSortClauses	2021-06-01 12:35:36 +03:00
Jelte Fennema	d3feee37ea	Add a simple python script to generate a new test (#3972 ) The current default citus settings for tests are not really best practice anymore. However, we keep them because lots of tests depend on them. I noticed that I created the same test harness for new tests I added all the time. This is a simple script that generates that harness, given a name for the test. To run: src/test/regress/bin/create_test.py my_awesome_test	2021-06-01 11:22:11 +02:00
Marco Slot	c03729ad03	Only warn about reference tables when removing last node	2021-06-01 10:53:12 +02:00
Onur Tirtir	94f30a0428	Refactor index check in ColumnarProcessUtility	2021-06-01 11:12:28 +03:00
SaitTalhaNisanci	c72d2b479b	Add tests for union pushdown workaround (#5005 )	2021-05-31 20:02:20 +02:00
Jelte Fennema	3271f1bd13	Fix data race in get_rebalance_progress (#5008 ) To be able to report progress of the rebalancer, the rebalancer updates the state of a shard move in a shared memory segment. To then fetch the progress, `get_rebalance_progress` can be called which reads this shared memory. Without this change it did so without using any synchronization primitives, allowing for data races. This fixes that by using atomic operations to update and read from the parts of the shared memory that can be changed after initialization.	2021-05-31 15:27:32 +02:00
SaitTalhaNisanci	8c3f85692d	Not consider old placements when disabling or removing a node (#4960 ) * Not consider old placements when disabling or removing a node * update cluster test	2021-05-28 22:38:20 +02:00
SaitTalhaNisanci	40a229976f	Fix flaky test because of parallel metadata syncing (#5004 )	2021-05-28 13:19:15 +03:00
SaitTalhaNisanci	a20cc3b36a	Only consider shard state 1 in citus shards (#4970 )	2021-05-28 11:33:48 +03:00
SaitTalhaNisanci	a4944a2102	Rename CoordinatedTransactionShouldUse2PC (#4995 )	2021-05-21 18:57:42 +03:00
Hanefi Onaldi	8b8f0161c3	Merge pull request #4891 from citusdata/remove-replmodel-guc Deprecates the `citus.replication_model` GUC We used to have 2 different GUCs that decided shard replication models: - `citus.replication_model`: either set to "statement" or "streaming" - `citus.shard_replication_factor`: prevents us from using streaming replication when greater than 1. This PR aims to deprecate the `citus.replication_model` GUC and decide on the replication model solely based on the shard replication factor of distributed tables that are affected by queries.	2021-05-21 16:32:44 +03:00
Hanefi Onaldi	4941f00a95	Do not run ref2ref tests in parallel	2021-05-21 16:14:59 +03:00
Hanefi Onaldi	c160325d07	Use streaming replication when repl factor = 1	2021-05-21 16:14:59 +03:00
Hanefi Onaldi	878513f325	Remove all occurrences of replication_model GUC	2021-05-21 16:14:59 +03:00
SaitTalhaNisanci	87e3a5e24a	Use 2PC when using a node connection (#4997 )	2021-05-21 14:58:53 +03:00
SaitTalhaNisanci	82f34a8d88	Enable citus.defer_drop_after_shard_move by default (#4961 ) Enable citus.defer_drop_after_shard_move by default	2021-05-21 10:48:32 +03:00
Nils Dijk	d7dd247fb5	fix shared dependencies that are not resident in a database (#4992 ) DESCRIPTION: fix shared dependencies that are not resident in a database e.g. databases depend on users (their owners), and neither has a database it resides in. These dependencies are recorded in pg_shdepend with a `dbid` of `InvalidOid`. When we fetch our shared dependencies we don’t take these links into account. With this patch we use logic inspired by `classIdGetDbId` to decide when to use `MyDatabaseId` vs `InvalidOid` to correctly resolve dependencies between shared objects.	2021-05-20 08:55:02 -07:00
Jelte Fennema	b25d3e83ef	Merge pull request #4963 from citusdata/fill-rebalance-monitor-shardsize	2021-05-20 16:51:08 +02:00
Jelte Fennema	10f06ad753	Fetch shard size on the fly for the rebalance monitor Without this change the rebalancer progress monitor gets the shard sizes from the `shardlength` column in `pg_dist_placement`. This column needs to be updated manually by calling `citus_update_table_statistics`. However, `citus_update_table_statistics` could lead to distributed deadlocks while database traffic is on-going (see #4752). To work around this we don't use the `shardlength` column anymore. Instead for every rebalance we now fetch all shard sizes on the fly. Three additional things this does are: 1. It adds tests for the rebalance progress function. 2. If a shard move cannot be done because a source or target node is unreachable, then we error and stop the rebalance, instead of showing a warning and continuing. When using the by_disk_size rebalance strategy it's not safe to continue with other moves if a specific move failed. It's possible that the failed move would have made space for the next move, and because the failed move never happened this space now does not exist. 3. It adds two new columns to the result of `get_rebalance_progress` which show the size of the shard on the source and target node. Fixes #4930	2021-05-20 16:38:17 +02:00
Nils Dijk	a6c2d2a4c4	Feature: alter database owner (#4986 ) DESCRIPTION: Add support for ALTER DATABASE OWNER This adds support for changing the database owner. It achieves this by marking the database as a distributed object. By marking the database as a distributed object it will look for its dependencies and order the user creation commands (enterprise only) before the alter of the database owner. This is mostly important when adding new nodes. By having the database marked as a distributed object it can easily determine which `ALTER DATABASE ... OWNER TO ...` commands to propagate by resolving the object address of the database and verifying it is a distributed object, and hence should propagate changes of ownership to all workers. Given the ownership of the database might have implications on subsequent commands in transactions we force sequential mode for transactions that have an `ALTER DATABASE ... OWNER TO ...` command in them. This will fail the transaction with meaningful help when the transaction already executed parallel statements. By default the feature is turned off since roles are not automatically propagated; having it turned on would cause hard to understand errors for the user. It can be turned on by the user via setting the `citus.enable_alter_database_owner`.	2021-05-20 13:27:44 +02:00
Önder Kalacı	1ce607dd23	Merge pull request #4990 from citusdata/prevent_moving_to_non_existing_node Make sure that target node in shard moves is eligible for shard move	2021-05-20 10:57:56 +02:00
Onder Kalaci	d07db99ea4	Make sure that target node in shard moves is eligible for shard move	2021-05-20 10:51:01 +02:00
Önder Kalacı	4d8e3969ac	Merge pull request #4923 from citusdata/wait_until_connections_ready Wait until all connections are successfully established	2021-05-19 16:04:36 +02:00
Onder Kalaci	926069a859	Wait until all connections are successfully established Comment from the code: /* * Iterate until all the tasks are finished. Once all the tasks * are finished, ensure that all the connection initializations * are also finished. Otherwise, those connections are terminated * abruptly before they are established (or failed). Instead, we let * the ConnectionStateMachine() to properly handle them. * * Note that we could have the connections that are not established * as a side effect of slow-start algorithm. At the time the algorithm * decides to establish new connections, the execution might have tasks * to finish. But, the execution might finish before the new connections * are established. */ Note that the abruptly terminated connections lead to the following errors: 2020-11-16 21:09:09.800 CET [16633] LOG: could not accept SSL connection: Connection reset by peer 2020-11-16 21:09:09.872 CET [16657] LOG: could not accept SSL connection: Undefined error: 0 2020-11-16 21:09:09.894 CET [16667] LOG: could not accept SSL connection: Connection reset by peer To easily reproduce the issue: - Create a single node Citus - Add the coordinator to the metadata - Create a distributed table with shards on the coordinator - f.sql: select count(*) from test; - pgbench -f /tmp/f.sql postgres -T 12 -c 40 -P 1 or pgbench -f /tmp/f.sql postgres -T 12 -c 40 -P 1 -C	2021-05-19 15:59:13 +02:00
Önder Kalacı	61977a3c09	Merge pull request #4895 from citusdata/conservative_connection_establishment Executor takes connection establishment and task execution costs into account	2021-05-19 15:57:22 +02:00
Onder Kalaci	995adf1a19	Executor takes connection establishment and task execution costs into account With this commit, the executor becomes smarter about refraining from opening new connections. The very basic example is that, if connection establishments take 1000 ms and task executions take 5 ms, the executor becomes smart enough to not establish new connections.	2021-05-19 15:48:07 +02:00
Onder Kalaci	28b0b4ebd1	Move slow start increment to generic place	2021-05-19 14:31:20 +02:00
Marco Slot	54c9bf8342	Merge pull request #4849 from citusdata/marcocitus/fix-insert-mem	2021-05-19 14:19:02 +02:00
Jelte Fennema	924959fdb1	Include result type in upgrade diff test (#4987 ) We often change result types of functions slightly. Our downgrade tests wouldn't notice these changes. This change adds them to the description of these items. An example of an SQL change that isn't caught without this change and is caught with the get_rebalance_progress change in this PR: https://github.com/citusdata/citus/pull/4963	2021-05-18 16:25:39 +02:00
Marco Slot	715dce1eea	Reduce local insert memory usage during deparsing	2021-05-18 16:11:43 +02:00
Marco Slot	644b266dee	Only cache local plans when reusing a distributed plan	2021-05-18 16:11:43 +02:00
Marco Slot	00792831ad	Add execution memory contexts and free after local query execution	2021-05-18 16:11:43 +02:00
SaitTalhaNisanci	ff2a125a5b	Lookup hostname before execution (#4976 ) We lookup the hostname just before the execution so that even if there are cached entries in the prepared statement cache we use the updated entries.	2021-05-18 16:46:31 +03:00
SaitTalhaNisanci	eaa7d2bada	Not block maintenance daemon (#4972 ) It was possible to block the maintenance daemon by taking a SHARE ROW EXCLUSIVE lock on pg_dist_placement. Until the lock is released the maintenance daemon would be blocked. We should not block the maintenance daemon in any case, hence now we try to get the pg_dist_placement lock without waiting; if we cannot get it then we don't try to drop the old placements.	2021-05-17 03:22:35 -07:00
Hanefi Onaldi	b649dffabd	Improve CI checks for enterprise merges on master (#4981 )	2021-05-12 19:15:15 +03:00
Nils Dijk	c91f8d8a15	Feature: localhost guc (#4836 ) DESCRIPTION: introduce `citus.local_hostname` GUC for connections to the current node Citus once in a while needs to connect to itself for some systems operations. This used to be hardcoded to `localhost`. The hardcoded hostname causes some issues, for example in environments where `sslmode=verify-full` is required. It is not always desirable or even feasible to get `localhost` as an alt name on the certificate. By introducing a GUC to use when connecting to the current instance the user has more control what network path is used and what hostname is required to be present in the server certificate.	2021-05-12 16:59:44 +02:00
Hanefi Onaldi	acdde9fa2d	Merge pull request #4811 from citusdata/fix-gitignore	2021-05-12 11:50:38 +03:00
Hanefi Onaldi	6b2c9d3567	Remove ignored files from git tree	2021-05-12 09:49:07 +03:00
Hanefi Onaldi	13808b60cf	Update gitignore files	2021-05-12 09:49:07 +03:00
Hanefi Onaldi	c96b439a48	Introduce scripts to sync gitignore rules for .source files	2021-05-12 09:49:06 +03:00
Jelte Fennema	cbbd10b974	Implement an improvement threshold in the rebalancer (#4927 ) Every move in the rebalancer algorithm results in an improvement in the balance. However, even if the improvement in the balance was very small the move was still chosen. This is especially problematic if the shard itself is very big and the move will take a long time. This changes the rebalancer algorithm to take the relative size of the balance improvement into account when choosing moves. By default a move will not be chosen if it improves the balance by less than half of the size of the shard. An extra argument is added to the rebalancer functions so that the user can decide to lower the default threshold if the ignored move is wanted anyway.	2021-05-11 14:24:59 +02:00
Önder Kalacı	fa61eda7b9	Merge pull request #4974 from citusdata/minor_b_fix Remove wrong PG_USED_FOR_ASSERTS_ONLY	2021-05-11 14:13:52 +02:00
Onder Kalaci	cc4870a635	Remove wrong PG_USED_FOR_ASSERTS_ONLY	2021-05-11 12:58:37 +02:00
Önder Kalacı	398d2472f6	Merge pull request #4973 from citusdata/base_for_logical Get prepared for some improvements for online rebalancer	2021-05-11 12:19:36 +02:00
Onder Kalaci	a231ff29b0	Get prepared for some improvements for online rebalancer To see all the changes, see https://github.com/citusdata/citus-enterprise/pull/586/files	2021-05-10 19:54:31 +02:00
Onur Tirtir	4f3c672ebe	Re-consider VALID_ITEMPOINTER_OFFSETS wrt bitmap scan logic	2021-05-10 20:16:50 +03:00
Onur Tirtir	0f4c97e0d0	Improve the constants around row number mapping	2021-05-10 20:16:50 +03:00
Onur Tirtir	181848cc80	Implement ErrorIfInvalidRowNumber To use the same logic when mapping tids to row numbers	2021-05-10 20:16:50 +03:00
Onur Tirtir	7ae90b7f96	Rename ColumnarStripeIndexRelationId to ColumnarStripePKeyIndexRelationId Since now we have another index on columnar.stripe	2021-05-10 20:16:50 +03:00
Onur Tirtir	f846c16514	Implement BuildStripeMetadata	2021-05-10 20:16:50 +03:00
Onur Tirtir	2552aee404	Handle old versioned columnar metapage after binary upgrade (#4956 ) * Make VACUUM hint for upgrade scenario actually work * Suggest using VACUUM if metapage doesn't exist Plus, suggest upgrading sql version as another option. * Always force read metapage block * Fix two typos	2021-05-10 20:16:50 +03:00
Onur Tirtir	2e419ea177	Add first_row_number column to columnar.stripe for tid mapping	2021-05-10 20:16:50 +03:00
Onur Tirtir	9c1ac3127f	Implement ColumnarOverwriteMetapage	2021-05-10 20:16:50 +03:00
jeff-davis	7b9aecff21	Columnar: metapage changes. (#4907 ) * Columnar: introduce columnar storage API. This new API is responsible for the low-level storage details of columnar; translating large reads and writes into individual block reads and writes that respect the page headers and emit WAL. It's also responsible for the columnar metapage, resource reservations (stripe IDs, row numbers, and data), and truncation. This new API is not used yet, but will be used in subsequent commits. * Columnar: add columnar_storage_info() for debugging purposes. * Columnar: expose ColumnarMetadataNewStorageId(). * Columnar: always initialize metapage at creation time. This avoids the complexity of dealing with tables where the metapage has not yet been initialized. * Columnar: columnar storage upgrade/downgrade UDFs. Necessary upgrade/downgrade step so that new code doesn't see an old metapage. * Columnar: improve metadata.c comment. * Columnar: make ColumnarMetapage internal to the storage API. Callers should not have or need direct access to the metapage. * Columnar: perform resource reservation using storage API. * Columnar: implement truncate using storage API. * Columnar: implement read/write paths with storage API. * Columnar: add storage tests. * Revert "Columnar: don't include stripe reservation locks in lock graph." This reverts commit `c3dcd6b9f8`. No longer needed because the columnar storage API takes care of concurrency for resource reservation. * Columnar: remove unnecessary lock when reserving. No longer necessary because the columnar storage API takes care of concurrent resource reservation. * Add simple upgrade tests for storage/ branch * fix multi_extension.out Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2021-05-10 20:16:46 +03:00
Onur Tirtir	7def297a3b	Move the logic that builds relation col list into a function (#4964 )	2021-05-10 20:01:28 +03:00
Onur Tirtir	59fea712e2	Implement a helper to create memory cxt for stripe read (#4965 )	2021-05-10 19:55:47 +03:00
SaitTalhaNisanci	5a941814fd	Close connection after each shard move (#4967 )	2021-05-10 16:57:19 +03:00
Ahmet Gedemenli	8cb505d6e1	Fix matview access method change issue (#4959 ) * Fix matview access method change issue * Use pg function get_am_name * Split view generation command into pieces	2021-05-07 15:47:24 +03:00
SaitTalhaNisanci	6b1904d37a	When moving a shard to a new node ensure there is enough space (#4929 ) * When moving a shard to a new node ensure there is enough space * Add WairForMiliseconds time utility * Add more tests and increase readability * Remove the retry loop and use a single udf for disk stats * Address review * address review Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2021-05-06 17:28:02 +03:00
Ahmet Gedemenli	ff4098724a	Merge pull request #4957 from citusdata/log-remote-commands-for-shard-cost-func Log remote commands for shard cost func at multi_partitioninig	2021-05-06 17:17:34 +03:00
Ahmet Gedemenli	bc818e76e2	Add notice log message for skipping child tables for optimization	2021-05-06 16:49:37 +03:00
Ahmet Gedemenli	2fed133cf8	Merge pull request #4951 from citusdata/fix-nested-select-query-bug Fix nested select query with union bug	2021-05-05 22:59:16 +03:00
Ahmet Gedemenli	2e0bb5c0c8	Fix nested select query with union bug	2021-05-05 20:35:00 +03:00
Jelte Fennema	d0ba122061	Editorconfig: configure 4 spaces for python files (#4953 )	2021-05-05 10:32:47 +00:00
Jelte Fennema	0e6c080e81	Run copy_modified in upgrade tests (#4952 ) This allows running the following command to update the expected files with normalized output files for upgrade tests too: ```bash cp src/test/regress/{results,expected}/upgrade_rebalance_strategy_before.out ```	2021-05-05 12:28:05 +02:00
Jelte Fennema	50357db957	Simplify code that tests the shard rebalancer algorithm (#4925 ) This modifies the test code to use sane defaults instead of requiring all values to be specified in the test.	2021-05-03 15:47:19 +02:00
Hanefi Onaldi	23a505d41f	Bump PG versions in CI (#4941 ) Co-authored-by: Hanefi Onaldi <Hanefi.Onaldi@microsoft.com> Co-authored-by: Sait Talha Nisanci <s.talhanisanci@gmail.com>	2021-05-03 13:51:20 +03:00
SaitTalhaNisanci	2b70987341	Merge pull request #4919 from citusdata/continue_dropping_shards Continue to remove shards after first failure in DropMarkedShards	2021-04-30 18:22:39 +03:00
Jelte Fennema	2f29d4e53e	Continue to remove shards after first failure in DropMarkedShards The comment of DropMarkedShards described the behaviour that after a failure we would continue trying to drop other shards. However the code did not do this and would stop after the first failure. Instead of simply fixing the comment I fixed the code, because the described behaviour is more useful. Now a single shard that cannot be removed yet does not block others from being removed.	2021-04-30 15:42:09 +03:00
SaitTalhaNisanci	ca5d281784	Merge pull request #4940 from citusdata/reduce_memory_usage_rebalancer Decrease memory usage with rebalancer	2021-04-30 11:15:11 +03:00
Sait Talha Nisanci	8cabd2e822	Decrease memory usage with rebalancer We decrease memory usage by: - Freeing temporary buffers - Using a separate memory context for blocks that use a "small" amount of memory but can be repeated many times, such as loops	2021-04-29 13:40:47 +03:00
Hanefi Onaldi	2f90ce931b	Fix minor issues with makefile targets (#4717 )	2021-04-28 15:46:55 +03:00
Marco Slot	6a050ab6b9	Merge pull request #4865 from citusdata/marcocitus/fix-from-only Fix FROM ONLY queries on partitioned tables	2021-04-28 14:17:12 +02:00
Marco Slot	4b49cb112f	Fix FROM ONLY queries on partitioned tables	2021-04-27 16:10:07 +02:00
Ahmet Gedemenli	9c08ab49df	Merge pull request #4917 from citusdata/sort-gucs-alphabetically Sort GUCs in alphabetic order	2021-04-26 16:31:47 +03:00
Ahmet Gedemenli	fe65be993e	Sort GUCs in alphabetic order	2021-04-26 15:05:42 +03:00
Onur Tirtir	f8bacbedac	Merge pull request #4920 from citusdata/add-citus-10.0-upgrade Preparation for adding citus 10.0 to upgrade tests	2021-04-26 15:04:02 +03:00
Onur Tirtir	889ad6fa8c	Run some upgrade tests only when old version=9.0	2021-04-26 14:53:53 +03:00
Onur Tirtir	6afa4f2e62	Export upgrade_test_old_citus_version to use in some upgrade tests	2021-04-26 14:53:53 +03:00
Jelte Fennema	7ee9a0d1c4	Enable security flags in CI (#4924 )	2021-04-26 10:28:35 +02:00
Ahmet Gedemenli	332c5ce4ad	Fix worker partitioned size functions (#4922 )	2021-04-26 10:29:46 +03:00
Philip Dubé	8cd9b8d8af	Merge pull request #4926 from citusdata/diff-filter-full-search Fix diff-filter to search the whole line for matches	2021-04-23 13:08:36 +00:00
Jelte Fennema	763fa1cf41	Fix diff-filter to search the whole line for matches Recently two new normalization line deletion rules have been added that don't match the start of a line: ``` /local tables that are added to metadata but not chained with reference tables via foreign keys might be automatically converted back to postgres tables$/d /Consider setting citus.enable_local_reference_table_foreign_keys to 'off' to disable this behavior$/d ``` Because `diff-filter` used `regex.match` these lines were not removed when creating a new diff. This could cause some confusing diffs, where the wrong lines were shown as changed. This fixes that by using `regex.search` instead of `regex.match`.	2021-04-23 12:43:49 +02:00
Önder Kalacı	3dfb766f35	Merge pull request #4915 from citusdata/allow_values Allow constant VALUES clause in pushdown queries	2021-04-21 14:39:50 +02:00
Onder Kalaci	918838e488	Allow constant VALUES clauses in pushdown queries As long as the VALUES clause contains constant values, we should not recursively plan the queries/CTEs. This is a follow-up work of #1805. So, we can easily apply OUTER join checks as if VALUES clause is a reference table/immutable function.	2021-04-21 14:28:08 +02:00
SaitTalhaNisanci	93c2dcf3d2	Fix data-race with concurrent calls of DropMarkedShards (#4909 ) * Fix problems with concurrent calls of DropMarkedShards When trying to enable `citus.defer_drop_after_shard_move` by default it turned out that DropMarkedShards was not safe to call concurrently. This could especially cause big problems when also moving shards at the same time. During tests it was possible to trigger a state where a shard that was moved would not be available on any of the nodes anymore after the move. Currently DropMarkedShards is only called in production by the maintenance daemon. Since this is only a single process, triggering such a race is currently impossible in production settings. In future changes we will want to call DropMarkedShards from other places too though. * Add some isolation tests Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2021-04-21 10:59:48 +03:00
Ahmet Gedemenli	33c620f232	Optimize partitioned disk size calculation (#4905 ) * Optimize partitioned disk size calculation * Polish * Fix test for citus_shard_cost_by_disk_size Try optimizing if not CSTORE	2021-04-19 13:30:56 +03:00
Onur Tirtir	96278822d9	Move columnar test helpers to a separate file (#4908 ) * Move columnar test helpers to another file * Rename column_store_memory_stats to columnar_store_memory_stats	2021-04-16 18:56:21 +03:00
Önder Kalacı	31d4ed41d7	Merge pull request #4892 from citusdata/connection_execution_stats Keep more statistics about connection establishment times	2021-04-16 15:03:19 +02:00
Onder Kalaci	5482d5822f	Keep more statistics about connection establishment times When DEBUG4 is enabled, Citus now prints per connection establishment time.	2021-04-16 14:56:31 +02:00
Önder Kalacı	0a060b327b	Merge pull request #4859 from citusdata/task_execution_stats Keep more execution statistics	2021-04-16 14:51:24 +02:00
Onder Kalaci	5b78f6cd63	Keep more execution statistics When DEBUG4 is enabled, Citus now prints per task execution times.	2021-04-16 14:45:00 +02:00
jeff-davis	9ed56928d3	Columnar: fix use-after-free. (#4906 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-04-15 01:00:00 -07:00
Hanefi Onaldi	987137ef97	Merge pull request #4794 from citusdata/fix/4736	2021-04-14 18:43:13 +03:00
Hanefi Onaldi	9919fbe3f8	Switch to sequential mode on long partition names This commit adds support for long partition names for distributed tables: - ALTER TABLE dist_table ATTACH PARTITION .. - CREATE TABLE .. PARTITION OF dist_table .. Note: create_distributed_table UDF does not support long table and partition names, and is not covered in this commit	2021-04-14 15:27:50 +03:00
Ahmet Gedemenli	e445e3d39c	Introduce 3 partitioned size udfs (#4899 ) * Introduce 3 partitioned size udfs * Add tests for new partition size udfs * Fix type incompatibilities * Convert UDFs into pure sql functions * Fix function comment	2021-04-13 17:36:27 +03:00
Onur Tirtir	fe5c985e1d	Remove HAS_TABLEAM config since we dropped pg11 support (#4862 ) * Remove HAS_TABLEAM config * Drop columnar_ensure_objects_exist * Not call columnar_ensure_objects_exist in citus_finish_pg_upgrade	2021-04-13 10:51:26 +03:00
Onur Tirtir	716cc629f1	Refactor ColumnarReadNextRow for better readability (#4823 )	2021-04-13 10:44:00 +03:00
jeff-davis	3efdfdd791	Columnar: make projectedColumnList an integer list. (#4869 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-04-12 19:07:21 -07:00
Ahmet Gedemenli	d74d358a45	Refactor size queries with new enum SizeQueryType (#4898 ) * Refactor size queries with new enum SizeQueryType * Polish	2021-04-12 17:14:29 +03:00
SaitTalhaNisanci	b453563e88	Warm up connections params hash (#4872 ) ConnParams(AuthInfo and PoolInfo) gets a snapshot, which will block the remote connections to localhost. And the release of snapshot will be blocked by the snapshot. This leads to a deadlock. We warm up the conn params hash before starting a new transaction so that the entries will already be there when we start a new transaction. Hence GetConnParams will not get a snapshot.	2021-04-12 13:08:38 +03:00
Ahmet Gedemenli	a1a394dbc9	Merge pull request #4894 from citusdata/add-comment-to-postprocess-create-table Update func comment for PostprocessCreateTableStmt	2021-04-10 20:03:20 +03:00
Ahmet Gedemenli	caef0463b0	Update func comment for PostprocessCreateTableStmt	2021-04-09 13:41:59 +03:00
Ahmet Gedemenli	52e467a9a0	Error out if inheriting a distributed table (#4871 ) * Error out if inheriting a distributed table * Add test inheriting a distributed table	2021-04-07 11:21:06 +03:00
Ahmet Gedemenli	e4c4a9b683	Fix error message for local table joins (#4870 ) * Fix error message for local table joins * Fix error messages for regression tests expected outputs	2021-04-06 16:18:28 +03:00
Ahmet Gedemenli	b3ef3194e3	Merge pull request #4866 from citusdata/fix-shard-not-found-issue-for-public-schema Fix shard not found issue for public schema	2021-04-06 10:37:36 +03:00
Ahmet Gedemenli	48a6a5b128	Add test for public shard not found issue	2021-04-06 10:29:17 +03:00
Ahmet Gedemenli	d530d79d73	Fix tests for public schema	2021-04-06 10:29:17 +03:00
Ahmet Gedemenli	840c879572	Remove redundant if statement for schema name	2021-04-06 10:29:17 +03:00
jeff-davis	063e673038	Columnar: use clause Vars for chunk group filtering. (#4856 ) * Columnar: use clause Vars for chunk group filtering. This solves #4780 and also provides a cleaner separation between chunk group filtering and projection pushdown. * Columnar: sort and deduplicate Vars pulled from clauses. * Columnar: cleanup variable names. * Columnar: remove alternate test output. * Columnar: do not recurse when looking for whereClauseVars. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-04-01 12:27:28 -07:00
Onur Tirtir	57c3e226cb	Use pg13, not pg11 in CONTRIBUTING.md (#4864 )	2021-04-01 16:25:01 +03:00
Halil Ozan Akgül	67bc990c1c	Merge pull request #4827 from citusdata/shard-count-parameter-in-distribution Adds shard_count parameter to create_distributed_table	2021-03-30 12:15:54 +03:00
Halil Ozan Akgul	a5038046f9	Adds shard_count parameter to create_distributed_table	2021-03-29 16:22:49 +03:00
Hanefi Onaldi	76a1ddac94	Merge pull request #4612 from citusdata/clean-before-upgrade-tests	2021-03-29 13:17:19 +03:00
Hanefi Önaldı	797538750f	Delete all upgrade test artifacts before citus-upgrade-local	2021-03-27 00:46:06 +03:00
SaitTalhaNisanci	3c2efe287e	Merge pull request #4799 from citusdata/drop/pg11 Drop postgres 11 support	2021-03-25 12:46:05 +03:00
Onur Tirtir	7081690480	Add check-columnar-vg regression test target (#4737 )	2021-03-25 11:55:58 +03:00
SaitTalhaNisanci	9437d21601	Merge pull request #4848 from citusdata/ignore/modifiedVersions Ignore config and version modified	2021-03-25 11:48:43 +03:00
SaitTalhaNisanci	3a3171cd04	Ignore temporary output files	2021-03-25 09:59:21 +03:00
SaitTalhaNisanci	03832f353c	Drop postgres 11 support	2021-03-25 09:20:28 +03:00
jeff-davis	248c6cb91a	Columnar: do not bother building unnecessary RestrictInfo. (#4852 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-03-24 16:05:08 -07:00
Onur Tirtir	c01507a91b	Remove columnar/.gitignore (#4825 )	2021-03-24 13:04:14 +03:00
Hanefi Onaldi	1b7a1357f1	Merge pull request #4844 from citusdata/update-cl-833 Update Changelog for 8.3.3	2021-03-23 20:25:04 +03:00
Hanefi Onaldi	4a9655e833	Update Changelog for 8.3.3	2021-03-23 19:58:37 +03:00
Nils Dijk	1c1999ed7b	incorporate the fixopen fix for osx users on bigsur (#4837 ) comparable to https://github.com/citusdata/tools/pull/88 this patch adds checks to the perl script running the testing harness of citus to start the postgres instances via the fixopen binary when present to work around `Interrupted System` call errors on OSX Big Sur.	2021-03-22 16:22:08 +01:00
Nils Dijk	787ee97867	Tests: foreign key non colocated tests (#4841 ) Earlier versions of Citus (pre 9.0) had a bug where a user was able to get in a situation where a foreign key between two non-colocated tables was allowed. This was caused by wrongful scoping of a boolean variable that was only ever set to on in a loop, causing the `true` from an earlier iteration to leak into a new iteration. This was 'by accident' solved in a refactor that was executed in the preparation of the 9.0 release. Only recently we had a user running into this and it was tracked down to this behaviour. Given the dire situation users could get themselves into when running into this bug we have backported a fix to the latest 8.3 release branch. To make sure this regression does not happen anymore in the future I propose we add the tests from the backport to our mainline. For reference: https://github.com/citusdata/citus/pull/4840	2021-03-22 15:33:56 +01:00
Marco Slot	6876e87f31	Merge pull request #4838 from citusdata/dependabot/pip/src/test/regress/jinja2-2.11.3 Bump jinja2 from 2.11.2 to 2.11.3 in /src/test/regress	2021-03-22 11:03:55 +01:00
dependabot[bot]	a1aedc41f1	Bump jinja2 from 2.11.2 to 2.11.3 in /src/test/regress Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.2 to 2.11.3. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/master/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/2.11.2...2.11.3) Signed-off-by: dependabot[bot] <support@github.com>	2021-03-20 05:51:26 +00:00
Marco Slot	28719ae79b	Merge pull request #4835 from citusdata/marcocitus/fix-configure Apply #4834 to configure as well	2021-03-18 14:52:02 +01:00
Marco Slot	4ad88a7a89	Merge pull request #4834 from ProsperWorks/master More precise error messages for ./configure.	2021-03-18 14:22:09 +01:00
Marco Slot	1e11a34d00	Apply #4834 to configure as well	2021-03-18 01:35:34 +01:00
jhwillett	8bcf3b9887	More precise error messages for ./configure --without-lz4 and --without-zstd.	2021-03-17 12:52:04 -07:00
SaitTalhaNisanci	b4620bed87	Merge pull request #4822 from citusdata/changelog/10.0-3 Update CHANGELOG for 10.0.3	2021-03-17 17:54:32 +03:00
Önder Kalacı	b5f4320164	Make sure that single task local executions start coordinated transaction (#4831 ) With https://github.com/citusdata/citus/pull/4806 we enabled 2PC for any non-read-only local task. However, if the execution is a single task, enabling 2PC (CoordinatedTransactionShouldUse2PC) hits an assertion as we are not in a coordinated transaction. There is no downside of using a coordinated transaction for single task local queries.	2021-03-17 12:20:57 +01:00
Ahmet Gedemenli	4558132239	Merge pull request #4830 from citusdata/add-udf-citus-get-active-worker-nodes Add udf citus_get_active_worker_nodes	2021-03-17 13:39:30 +03:00
Ahmet Gedemenli	5e5db9eefa	Add udf citus_get_active_worker_nodes	2021-03-17 13:15:59 +03:00
Sait Talha Nisanci	92130ae2a2	Update CHANGELOG for 10.0.3	2021-03-17 11:21:36 +03:00
Marco Slot	946f529826	Merge pull request #4820 from citusdata/marcocitus/copy-guc Introduce citus.remote_copy_flush_threshold GUC	2021-03-16 19:05:10 +01:00
Marco Slot	43f600bc46	Merge pull request #4808 from citusdata/marcocitus/connection-lifetime	2021-03-16 14:58:06 +01:00
Marco Slot	fbc2147e11	Replace MAX_PUT_COPY_DATA_BUFFER_SIZE by citus.remote_copy_flush_threshold GUC	2021-03-16 06:00:38 +01:00
Marco Slot	1646fca445	Add GUC to set maximum connection lifetime	2021-03-16 01:57:57 +01:00
jeff-davis	3b12556401	Columnar: cleanup (#4814 ) * Columnar: fix misnamed file. * Columnar: make compression not dependent on columnar.h. * Columnar: rename columnar_metadata_tables.c to columnar_metadata.c. * Columnar: make customscan not depend on columnar.h. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-03-15 11:34:39 -07:00
Onur Tirtir	b2a7bafcc4	Fix flaky test in multi_foreign_key_relation_graph (#4819 )	2021-03-15 17:55:04 +03:00
Marco Slot	7d8d5cad98	Merge pull request #4817 from citusdata/marcocitus/fix-warning Remove unnecessary AtEOXact_Files call	2021-03-15 09:58:41 +01:00
Marco Slot	6c5d263b7a	Remove unnecessary AtEOXact_Files call	2021-03-15 09:34:02 +01:00
Onur Tirtir	1d3e075e62	Support temporary columnar tables (#4766 )	2021-03-12 12:01:36 +03:00
Önder Kalacı	56245d232d	Merge pull request #4806 from citusdata/fix_2pc_local_exec Do not trigger 2PC for reads on local execution	2021-03-12 09:34:34 +01:00
Onder Kalaci	e65e72130d	Rename use -> shouldUse Because setting the flag doesn't necessarily mean that we'll use 2PC. If connections are read-only, we will not use 2PC. In other words, we'll use 2PC only for connections that modified any placements.	2021-03-12 08:29:43 +00:00
Onder Kalaci	6a7ed7b309	Do not trigger 2PC for reads on local execution Before this commit, Citus used 2PC no matter what kind of local query execution happens. For example, if the coordinator has shards (and the workers as well), even a simple SELECT query could start 2PC: ```SQL WITH cte_1 AS (SELECT * FROM test LIMIT 10) SELECT count(*) FROM cte_1; ``` In this query, the local execution of the shards (and also intermediate result reads) triggers the 2PC. To prevent that, Citus now distinguishes local reads and local writes. And, Citus switches to 2PC only if a modification happens. This may still lead to unnecessary 2PCs when there is a local modification and remote SELECTs only. Though, we handle that separately via #4587.	2021-03-12 08:29:43 +00:00
Onur Tirtir	874d5fd962	Remove foreign keys between columnar metadata tables (#4791 ) Postgres keeps AFTER trigger state for each transaction, because we can have deferred AFTER triggers which will be fired at the end of a transaction. Postgres cleans up this state at the end of transaction. Postgres processes ON COMMIT triggers after cleaning-up the AFTER trigger states. So if we fire any triggers in ON COMMIT, the AFTER trigger state won't be cleaned-up properly and the transaction state will be left in an inconsistent state, which might result in assertion failure. So with this commit, we remove foreign keys between columnar metadata tables and enforce constraints between them manually when dropping columnar tables.	2021-03-12 11:28:17 +03:00
Naisila Puka	71a9f45513	Fix upgrade and downgrade paths for master/citus_update_table_statistics (#4805 )	2021-03-11 14:52:40 +03:00
Marco Slot	69f09556fd	Merge pull request #4809 from joeljuca/readme-code-highlights Fix incorrect language syntax on README.md	2021-03-10 23:44:45 +01:00
Joel Jucá	a770c5fe4e	Fix incorrect language syntax on README.md	2021-03-10 18:18:45 -03:00
Naisila Puka	196064836c	Skip 2PC for readonly connections in a transaction (#4587 ) * Skip 2PC for readonly connections in a transaction * Use ConnectionModifiedPlacement() function * Remove the second check of ConnectionModifiedPlacement() * Add order by to prevent flaky output * Test using pg_dist_transaction	2021-03-10 20:01:37 +03:00
Marco Slot	68a527ba17	Merge pull request #4800 from citusdata/marcocitus/fix-mod-cte	2021-03-09 21:12:46 +01:00
Marco Slot	9c0d7f5c26	Add tests for modifying CTE and SELECT without FROM	2021-03-09 10:39:33 +01:00
Marco Slot	58f85f55c0	Fixes a crash in queries with a modifying CTE and a SELECT without FROM	2021-03-09 10:39:33 +01:00
SaitTalhaNisanci	aef7fc3a51	Ignore columnar generated test files (#4796 )	2021-03-09 10:52:08 +03:00
Claire Giordano	742db1c5a2	Merge pull request #4789 from citusdata/claire-readme-edit1 Rm Performance H2 section title (temporarily)	2021-03-07 22:25:08 -08:00
Claire Giordano	33a6f763ea	Rm Performance H2 section title (temporarily)	2021-03-07 17:24:24 -08:00
Marco Slot	fe5494b72a	Merge pull request #4345 from citusdata/marcocitus/readme_updates Modernize Citus readme	2021-03-05 18:12:45 +01:00
Philip Dubé	ce296ac62e	Merge pull request #4779 from citusdata/typos Fix various typos due to zealous repetition	2021-03-05 13:16:12 +00:00
Philip Dubé	4e22f02997	Fix various typos due to zealous repetition	2021-03-04 19:28:15 +00:00
Marco Slot	61d7363eed	Rewrite the README	2021-03-04 10:29:11 +01:00
Onur Tirtir	1bb7a0a268	Fix chunk_group_consistency regression test view (#4765 )	2021-03-04 12:20:25 +03:00
Onur Tirtir	9728ce1167	Add tests for concurrent index deadlock issue (#4775 )	2021-03-04 11:56:54 +03:00
Marco Slot	d7880df4ab	Merge pull request #4767 from citusdata/marcocitus/fix-master-add-node Try to return earlier in idempotent citus_add_node	2021-03-03 23:30:37 +01:00
Hadi Moshayedi	6c409b5d3e	Merge pull request #4769 from citusdata/fix-4675 Populate DATABASEOID cache before CREATE INDEX CONCURRENTLY	2021-03-03 13:04:41 -08:00
Hadi Moshayedi	affe38eac6	Populate DATABASEOID cache before CREATE INDEX CONCURRENTLY	2021-03-03 12:59:46 -08:00
Halil Ozan Akgül	fc493547cd	Merge pull request #4772 from citusdata/update-cl-1002 Update CHANGELOG for 10.0.2	2021-03-03 17:03:35 +03:00
Halil Ozan Akgül	c2a9706203	Update CHANGELOG for 10.0.2	2021-03-03 16:18:00 +03:00
Önder Kalacı	857beb36fe	Merge pull request #4756 from citusdata/fix_on_top_of_pg13 Prevent infinite recursion for queries that involve UNION ALL and JOIN	2021-03-03 13:41:37 +01:00
Onder Kalaci	54ee96470e	Pass pointer of AttributeEquivalenceClass instead of pointer of pointer AttributeEquivalenceClass seems to be unnecessarily used with multiple pointers. Just use a single pointer for ease of read.	2021-03-03 12:27:26 +01:00
Onder Kalaci	d1cd198655	Prevent infinite recursion for queries that involve UNION ALL and JOIN With this commit, we make sure to prevent infinite recursion for queries in the format: [subquery with a UNION ALL] JOIN [table or subquery] Also, fixes a bug where we pushdown UNION ALL below a JOIN even if the UNION ALL is not safe to pushdown.	2021-03-03 12:27:26 +01:00
Hanefi Onaldi	697bbbd3c6	Do not use security flags by default (#4770 )	2021-03-03 12:51:16 +03:00
Hadi Moshayedi	1a05131331	Use chunk groups to read columnar data (#4768 )	2021-03-02 23:53:24 -08:00
Naisila Puka	2f30614fe3	Reimplement citus_update_table_statistics to detect dist. deadlocks (#4752 ) * Reimplement citus_update_table_statistics * Update stats for the given table not colocation group * Add tests for reimplemented citus_update_table_statistics * Use coordinated transaction, merge with citus_shard_sizes functions * Update the old master_update_table_statistics as well	2021-03-03 04:12:30 +03:00
Hanefi Onaldi	f87107eb6b	Add security flags in configure scripts (#4760 )	2021-03-03 01:55:29 +03:00
Marco Slot	f25de6a0e3	Try to return earlier in idempotent master_add_node	2021-03-02 21:22:47 +01:00
Marco Slot	f3b0d445b2	Merge pull request #4759 from citusdata/marcocitus/normalize-notices Normalize the ConvertTable notices	2021-03-02 12:49:37 +01:00
jeff-davis	9da9bd3dfd	Columnar: rename files and tests. (#4751 ) * Columnar: rename files and tests. * Columnar: Rename TableState to ColumnarState.	2021-03-01 08:34:24 -08:00
Marco Slot	dca615c5aa	Normalize the ConvertTable notices	2021-03-01 10:36:12 +01:00
SaitTalhaNisanci	feee25dfbd	Use translated vars in postgres 13 as well (#4746 ) * Use translated vars in postgres 13 as well Postgres 13 removed translated vars, so we had special logic for pg 13. However, that logic had a bug, so now we copy the translated vars before postgres deletes them. This also simplifies the logic. * fix rtoffset with pg >= 13	2021-02-26 19:41:29 +03:00
Halil Ozan Akgül	85c382a63b	Merge pull request #4744 from citusdata/grant_citus_tables_to_public Adds GRANT for public to citus_tables	2021-02-26 16:51:56 +03:00
Halil Ozan Akgul	5c5cb200f7	Adds GRANT for public to citus_tables	2021-02-26 16:24:33 +03:00
Önder Kalacı	0fe26a216c	Prevent cross join without any target list entries (#4750 ) /* * The physical planner assumes that all worker queries would have * target list entries based on the fact that at least the column * on the JOINs have to be on the target list. However, there is * an exception to that if there is a cartesian product join and * there is no additional target list entries belong to one side * of the JOIN. Once we support cartesian product join, we should * remove this error. */	2021-02-26 11:04:21 +01:00
Onur Tirtir	5e6030b87f	Merge pull request #4747 from citusdata/col/grant-access	2021-02-26 12:46:00 +03:00
Onur Tirtir	54ac924bef	Grant read access for columnar metadata tables to unprivileged user	2021-02-26 12:31:09 +03:00
Onur Tirtir	dcc0207605	Add 10.0-2 schema version	2021-02-26 12:31:09 +03:00
Onur Tirtir	5ed954844c	Ensure table owner when using alter_columnar_table_set/alter_columnar_table_reset (#4748 )	2021-02-26 12:27:51 +03:00
jeff-davis	fbeb747006	Columnar: refactor read path and fix zero-column tables. (#4668 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-25 09:04:54 -08:00
Naisila Puka	5ebd4eac7f	Preserve colocation with procedures in alter_distributed_table (#4743 )	2021-02-25 19:52:47 +03:00
Hanefi Onaldi	8820541fd4	Merge pull request #4709 from citusdata/sequential-mode-on-long-table-names	2021-02-24 17:33:02 +03:00
Hanefi Onaldi	5aff18b573	Fix flaky test	2021-02-24 17:09:08 +03:00
Hanefi Onaldi	9a792ef841	Remove length limitations for table renames	2021-02-24 03:35:27 +03:00
Hanefi Onaldi	7bebeb872d	Failing long table name tests	2021-02-24 03:35:27 +03:00
Onur Tirtir	495096ef5e	Remove useless pg version checks (#4741 )	2021-02-23 21:20:18 +03:00
Naisila Puka	dbb88f6f8b	Fix insert query with CTEs/sublinks/subqueries etc (#4700 ) * Fix insert query with CTE * Add more cases with deferred pruning but false fast path * Add more tests * Better readability with if statements	2021-02-23 18:00:47 +03:00
Naisila Puka	105bb580e1	Add columnar regression tests (#4727 ) * Add cursor tests for columnar tables * Add columnar tests for data types w/out comp. operators * Add more prepared statements with columnar tables * Add constraint tests for columnar tables * Add row level security, detach partition and rename columnar tests * Add some ORDER BYs	2021-02-23 14:16:38 +03:00
Hadi Moshayedi	b6f5d98bee	Merge pull request #4723 from citusdata/fix_warning Fix alignment issue in DatumToBytea	2021-02-22 16:15:27 -08:00
Hadi Moshayedi	2fca5ff3b5	Fix alignment issue in DatumToBytea	2021-02-22 16:04:30 -08:00
Onur Tirtir	bebca9ee79	Merge pull request #4733 from citusdata/update-cl-954 Update CHANGELOG for 9.5.4	2021-02-19 14:47:19 +03:00
Onur Tirtir	bb14c5267f	Update CHANGELOG for 9.5.4	2021-02-19 14:20:25 +03:00
SaitTalhaNisanci	dcf54eaf2a	Use PROCESS_UTILITY_QUERY in utility calls When we use PROCESS_UTILITY_TOPLEVEL it causes some problems when combined with other extensions such as pg_audit. With this commit we use PROCESS_UTILITY_QUERY in the codebase to fix those problems.	2021-02-19 13:55:59 +03:00
Sait Talha Nisanci	bbf6132226	Revert "wip (#4730 )" This reverts commit `62e6d54a4e`.	2021-02-19 13:55:59 +03:00
SaitTalhaNisanci	62e6d54a4e	wip (#4730 )	2021-02-19 13:42:19 +03:00
Onur Tirtir	6db5ecb97a	Merge pull request #4729 from citusdata/update-cl-10.0.1 Update CHANGELOG for 10.0.1	2021-02-19 12:03:18 +03:00
Onur Tirtir	9031a22e20	Update CHANGELOG for 10.0.1	2021-02-19 11:53:02 +03:00
Marco Slot	b51d3bf981	Merge pull request #4725 from citusdata/marcocitus/fix-time-partitions	2021-02-18 14:13:36 +01:00
Marco Slot	972a8bc0b7	Rewrite time_partitions join clause to avoid smallint[] operator	2021-02-18 12:01:18 +01:00
Ahmet Gedemenli	b0aeb41d4e	Merge pull request #4714 from citusdata/support-multi-drop-index Support dropping local table indexes along with a distributed index	2021-02-18 13:37:03 +03:00
Ahmet Gedemenli	a740fb5978	Merge branch 'master' into support-multi-drop-index	2021-02-18 13:31:08 +03:00
Ahmet Gedemenli	1f345f65b4	Support dropping local table indexes along with a distributed index	2021-02-18 13:30:12 +03:00
Onur Tirtir	1cbbeab405	Merge pull request #4719 from citusdata/master-update-version-1613472925 Bump Citus to 10.1devel	2021-02-17 12:25:12 +03:00
Onur Tirtir	676d9a9726	Bump Citus to 10.1devel	2021-02-17 11:54:33 +03:00
jeff-davis	0227317002	Columnar: better specification for microbenchmark. (#4711 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-16 15:28:25 -08:00
Onur Tirtir	530a284e51	Merge pull request #4720 from citusdata/9.5.3-cl Update CHANGELOG for 9.5.3	2021-02-16 16:10:48 +03:00
Onur Tirtir	a0de066996	Update CHANGELOG for 9.5.3	2021-02-16 15:16:06 +03:00
Onur Tirtir	41da3a3205	Merge pull request #4716 from citusdata/citus-10.0-changelog-1613462741 Update CHANGELOG for 10.0.0	2021-02-16 13:52:00 +03:00
Onur Tirtir	2d36707a82	Update CHANGELOG for 10.0.0	2021-02-16 13:36:01 +03:00
Onur Tirtir	d61fd6e478	Decide changing sequence dependencies on MX nodes according to resulting relation (#4713 ) When executing alter_table / undistribute_table udf's, we should not try to change sequence dependencies on MX workers if new table wouldn't require syncing metadata. Previously, we were checking that for input table. But in some cases, the fact that input table requires syncing metadata doesn't imply the same for resulting table (e.g when undistributing a Citus table). Even more, doing that was giving an unexpected error when undistributing a Citus table so this commit actually fixes that.	2021-02-15 19:20:26 +03:00
SaitTalhaNisanci	bcbd24f8de	Only consider pseudo constants for shortcuts (#4712 ) It seems that we need to consider only pseudo constants while doing some shortcuts in planning. For example there could be a false clause but it can contribute to the result in which case it will not be a pseudo constant.	2021-02-15 18:39:37 +03:00
SaitTalhaNisanci	0f1ce7a913	Not skip relation in conversion if it doesn't have RelationRestriction (#4685 ) We would exclude tables without relationRestriction from conversion candidates in local-distributed table joins. This could leave a leftover local table which should have been converted to a subquery. Ideally I would expect that in each call to CreateDistributedPlan we would pass a new plan id, but that seems like a bigger change.	2021-02-12 12:33:55 +03:00
Hadi Moshayedi	0f613ac31c	Merge pull request #4702 from citusdata/metadata_changes Move stripe.chunk_count to last position	2021-02-11 21:06:57 -08:00
Hadi Moshayedi	e690d8b79b	Move stripe.chunk_count to last position	2021-02-11 17:00:44 -08:00
Hadi Moshayedi	1c4081ea5f	Merge pull request #4699 from citusdata/readme Columnar: update README to compare with cstore_fdw.	2021-02-11 11:13:26 -08:00
Jeff Davis	b96673de69	Columnar: update README to compare with cstore_fdw.	2021-02-11 10:47:27 -08:00
Hadi Moshayedi	277a773f5e	Merge pull request #4698 from citusdata/chunk-group-num Columnar: rename chunk_num -> chunk_group_num.	2021-02-11 10:18:13 -08:00
Jeff Davis	1f1c3c362b	Columnar: rename chunk_num -> chunk_group_num.	2021-02-11 09:27:00 -08:00
Önder Kalacı	1b5244c410	Merge pull request #4683 from citusdata/int_results_on_separate_conns Do not re-use connections for colocated intermediate results during COPY	2021-02-11 16:40:28 +01:00
Onder Kalaci	f297c96ec5	Add regression tests for COPY into colocated intermediate results To add the tests without too much data, make the copy switchover configurable.	2021-02-11 15:41:06 +01:00
Onder Kalaci	5d5a357487	Do not re-use connections for intermediate results /* * Colocated intermediate results are just files and not required to use * the same connections with their co-located shards. So, we are free to * use any connection we can get. * * Also, the current connection re-use logic does not know how to handle * intermediate results as the intermediate results always truncate the * existing files. That's why, we use one connection per intermediate * result. */	2021-02-11 15:41:06 +01:00
Ahmet Gedemenli	01fb8f8124	Merge pull request #4674 from citusdata/fix-dropping-fkey Fix dropping fkey when distributing table	2021-02-11 15:59:00 +03:00
Ahmet Gedemenli	c8e83d1f26	Fix dropping fkey when distributing table	2021-02-11 15:48:35 +03:00
SaitTalhaNisanci	847b79078f	Not consider subplans in restriction list (#4679 ) * Not consider subplans in restriction list * Not consider sublink, alternative subplan in restrictions	2021-02-11 15:04:07 +03:00
Hadi Moshayedi	1d4b2e3fd0	Merge pull request #4680 from citusdata/fix_deadlock Don't include stripe reservation locks in lock graph	2021-02-10 13:27:10 -08:00
Hadi Moshayedi	c3dcd6b9f8	Columnar: don't include stripe reservation locks in lock graph.	2021-02-10 10:20:20 -08:00
Hadi Moshayedi	841d25bae9	Release metadata locks early	2021-02-10 10:20:12 -08:00
Onur Tirtir	7170ed287c	Merge pull request #4677 from citusdata/test-long-name-citus-local Test adding local table with long name to metadata	2021-02-10 18:17:20 +03:00
Onur Tirtir	ec7ab68f3b	Test adding local table with long name to metadata	2021-02-10 18:05:04 +03:00
Onur Tirtir	9f619a85d6	Fix EXPLAIN ANALYZE exec when query returns no cols (#4672 ) We do not include dummy column if original task didn't return any columns. Otherwise, number of columns that original task returned wouldn't match number of columns returned by worker_save_query_explain_analyze.	2021-02-10 17:59:47 +03:00
Hadi Moshayedi	29d340331e	Merge pull request #4630 from citusdata/fix_4626 Columnar: Fix zero column tables	2021-02-09 23:12:08 -08:00
Hadi Moshayedi	52297804ae	Fix zero column tables	2021-02-09 23:05:11 -08:00
Hadi Moshayedi	d06f6658da	Merge pull request #4681 from citusdata/metadata_changes Columnar metadata changes	2021-02-09 23:04:44 -08:00
Hadi Moshayedi	2d09c76b76	Rename storageid to storage_id	2021-02-09 19:57:04 -08:00
Hadi Moshayedi	8270b598b6	Rename stripeid, chunkid, and attnum	2021-02-09 19:50:50 -08:00
Hadi Moshayedi	9114fd4050	Move chunk.value_count to last position	2021-02-09 19:43:34 -08:00
Hadi Moshayedi	ba937bf316	Merge pull request #4659 from citusdata/chunk_group Columnar: add chunk_group metadata table	2021-02-09 14:21:36 -08:00
Hadi Moshayedi	be90c20457	Fix write path for zero column tables	2021-02-09 14:14:06 -08:00
Hadi Moshayedi	c8d61a31e2	Columnar: chunk_group metadata table	2021-02-09 14:11:58 -08:00
Önder Kalacı	c2480343c7	Merge pull request #4666 from citusdata/write_to_local Allow local execution for co-located intermediate results in COPY	2021-02-09 15:34:50 +01:00
Onder Kalaci	c804c9aa21	Allow local execution for intermediate results in COPY When COPY is used for copying into co-located files, it was not allowed to use local execution. The primary reason was Citus treating co-located intermediate results as co-located shards, and COPY into the distributed table was done via "format result". And, local execution of such COPY commands was not implemented. With this change, we implement support for local execution with "format result". To do that, we use the buffer for every file on shardState->copyOutState, similar to how local copy on shards is implemented. In fact, the logic is similar to local copy on shards, but instead of writing to the shards, Citus writes the results to a file. The logic relies on LOCAL_COPY_FLUSH_THRESHOLD, and flushes only when the size exceeds the threshold. But, unlike local copy on shards, in this case we write the headers and footers just once.	2021-02-09 15:00:06 +01:00
Hadi Moshayedi	1d3b866df5	Merge pull request #4667 from citusdata/private-structs Columnar: make read and write state private.	2021-02-08 10:24:02 -08:00
Jeff Davis	2ea31c899e	Columnar: make read and write state private.	2021-02-08 10:11:57 -08:00
Hanefi Onaldi	353b080474	Fix Semmle errors (#4636 ) Co-authored-by: Halil Ozan Akgül <hozanakgul@gmail.com>	2021-02-08 18:37:44 +03:00
SaitTalhaNisanci	e96da4886f	Sort results in citus_shards and give raw size (#4649 ) * Sort results in citus_shards and give raw size Sort results so that it is consistent and also similar to citus_tables. Use raw size in the output so that doing operations on the size is easier. * Change column ordering	2021-02-08 15:29:42 +03:00
Hadi Moshayedi	2a927522b9	Merge pull request #4655 from citusdata/fix_isolation Normalize isolation_metadata_sync_deadlock	2021-02-06 16:13:37 -08:00
Hadi Moshayedi	3e6b54b964	Normalize isolation_metadata_sync_deadlock	2021-02-06 15:59:28 -08:00
Hadi Moshayedi	eff8cffaf3	Columnar: improve naming of limit config variables. (#4653 ) * Rename chunk_row_count to chunk_group_row_limit * Rename stripe_row_count to stripe_row_limit * Undo couple of renames	2021-02-06 09:04:04 -08:00
Hadi Moshayedi	a7da38e71f	Merge pull request #4650 from citusdata/seq-permissions Columnar: Call nextval_internal instead of DirectFunctionCall.	2021-02-06 02:53:10 -08:00
Jeff Davis	b1882d4400	Columnar: Call nextval_internal instead of DirectFunctionCall.	2021-02-06 01:45:30 -08:00
Hadi Moshayedi	2c372b7b0e	Merge pull request #4654 from citusdata/fix_isolation Make isolation_metadata_sync_deadlock more resilient	2021-02-06 01:44:27 -08:00
Hadi Moshayedi	4e53314e3f	Make isolation_metadata_sync_deadlock more resilient	2021-02-06 01:05:24 -08:00
Hadi Moshayedi	75d9e4a206	Merge pull request #4645 from citusdata/fix-chunks Columnar: don't double count chunks filtered	2021-02-05 11:04:31 -08:00
Hadi Moshayedi	0a9fd91d8f	Use 'Chunk Groups' in EXPLAIN ANALYZE of columnar scan	2021-02-05 10:58:01 -08:00
Hadi Moshayedi	1d311b0709	Columnar: don't double count chunks filtered	2021-02-05 10:58:01 -08:00
Halil Ozan Akgül	cbb95af2c2	Merge pull request #4580 from citusdata/convert-relabeltype-into-collateexpr-in-deparser Convert relabeltype into collateexpr in deparser	2021-02-05 13:33:02 +03:00
Ahmet Gedemenli	5dd2a3da03	Convert RelabelTypes into CollateExprs in get_rule_expr function	2021-02-05 12:06:46 +03:00
Ahmet Gedemenli	f96e93ab67	Merge pull request #4631 from citusdata/rename-master-parameter-for-dist-stat-activity Rename master to citus for dist stat activity cols	2021-02-04 15:42:37 +03:00
Ahmet Gedemenli	503171d2f2	Merge branch 'master' into rename-master-parameter-for-dist-stat-activity	2021-02-04 15:37:13 +03:00
Ahmet Gedemenli	2443b20b2c	Rename master to distributed for worker stat activity	2021-02-04 12:20:06 +03:00
Önder Kalacı	fcb1b7f7d5	Merge pull request #4604 from citusdata/copy_single_node Adaptive connection management for COPY on local nodes	2021-02-04 10:14:06 +01:00
Onder Kalaci	fc9a23792c	COPY uses adaptive connection management on local node With #4338, the executor is smart enough to fail over to the local node if there is not enough space in max_connections for remote connections. For COPY, the logic is different. With #4034, we made COPY work with the adaptive connection management slightly differently. The cause of the difference is that COPY doesn't know which placements are going to be accessed and hence needs to get connections up-front. Similarly, COPY decides to use local execution up-front. With this commit, we change the logic for COPY on local nodes: Try to reserve a connection to local host. This logic follows the same logic (e.g., citus.local_shared_pool_size) as the executor because COPY also relies on TryToIncrementSharedConnectionCounter(). If the reservation to the local node fails, switch to local execution. Apart from this, if local execution is disabled, we follow the exact same logic as for multi-node Citus. It means that if we run out of connections, we'd give an error.	2021-02-04 09:45:07 +01:00
Ahmet Gedemenli	34840ddc5c	Rename master to citus for dist stat activity cols	2021-02-04 11:12:23 +03:00
Hadi Moshayedi	2afb806e7e	Merge pull request #4638 from citusdata/fix-cic Columnar: disallow CREATE INDEX CONCURRENTLY	2021-02-03 16:29:14 -08:00
Hadi Moshayedi	5fde617229	Columnar: disallow CREATE INDEX CONCURRENTLY	2021-02-03 12:10:00 -08:00
Hadi Moshayedi	569c0460c5	Merge pull request #4628 from citusdata/fix-inheritance Columnar: fix inheritance planning.	2021-02-03 10:46:13 -08:00
Jeff Davis	4043731c41	Columnar: fix inheritance planning.	2021-02-03 10:41:21 -08:00
Sait Talha Nisanci	ff82e85ea2	Replace workerNodeCount -> nodeCount	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	eb5be579e3	Set previous cell inside a for loop	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	9ba3f70420	Remove unused method	2021-02-03 20:02:03 +03:00
Sait Talha Nisanci	24e60b44a1	Consider coordinator in intermediate result optimization It seems that we were not considering the case where coordinator was added to the cluster as a worker in the optimization of intermediate results. This could lead to errors when coordinator was added as a worker.	2021-02-03 20:02:03 +03:00
Onur Tirtir	c0f2817b70	Disallow using alter_table udfs with tables having any identity cols (#4635 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating table from scratch. For this reason, we don't allow using alter_table udfs with tables having any identity cols.	2021-02-03 19:33:54 +03:00
Onur Tirtir	3a403090fd	Disallow adding local table with identity column to metadata (#4633 ) pg_get_tableschemadef_string doesn't know how to deparse identity columns so we cannot reflect those columns when creating shell relation. For this reason, we don't allow adding local tables -having identity cols- to metadata.	2021-02-03 19:05:17 +03:00
Onur Tirtir	5efb742f8a	Skip copying GENERATED ALWAYS AS STORED cols in ReplaceTable (#4616 ) Postgres doesn't allow inserting into columns having GENERATED ALWAYS AS (...) STORED expressions. For this reason, when executing undistribute_table or an alter_* udf, we should skip copying such columns. This is not bad since Postgres would already generate such columns.	2021-02-03 17:55:16 +03:00
jeff-davis	e03246dd45	Columnar: mark custom scan path parallel_safe. (#4619 ) Enables an overall plan to be parallel (e.g. over a partition hierarchy), even though an individual ColumnarScan is not parallel-aware. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-02 11:56:00 -08:00
jeff-davis	e195af7e72	Columnar: always disable parallel paths. (#4617 ) Previously, if columnar.enable_custom_scan was false, parallel paths could remain, leading to an unexpected error. Also, ensure that cheapest_parameterized_paths is cleared if a custom scan is used. Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-02-02 11:37:42 -08:00
Onur Tirtir	3ca0b6146b	Merge pull request #4613 from citusdata/fix/generated-cols-citus-local When adding local table to metadata, we are dropping DEFAULT expressions from shard relation. When finding columns having DEFAULT expressions, we shouldn't rely on atthasdef since it might be true if column has GENERATED ALWAYS AS (...) STORED expression. On the other hand, we should not actually drop such GENERATED expressions from shard relation since we don't evaluate such columns in coordinator and this would result in inserting NULL values to such columns.	2021-02-02 18:34:45 +03:00
Onur Tirtir	53b1888cac	Rename DropAndMoveDefaultSequenceOwnerships	2021-02-02 18:17:42 +03:00
Onur Tirtir	93c3f30024	Rename ExtractColumnsOwningSequences	2021-02-02 18:17:42 +03:00
Onur Tirtir	912d829757	Skip GENERATED ALWAYS AS STORED cols when processing cols owning sequences When finding columns owning sequences, we shouldn't rely on atthasdef since it might be true when column has GENERATED ALWAYS AS (...) STORED expression.	2021-02-02 18:17:42 +03:00
Onur Tirtir	c8a48c6eee	Not try to sync metadata for local tables (#4625 )	2021-02-02 15:12:12 +03:00
Onur Tirtir	c5d4e7081b	Fix invalid read issue in deprecated create_citus_local_table udf (#4611 ) Since create_citus_local_table doesn't specify cascadeViaForeignKeys option, we can't directly call citus_add_local_table_to_metadata from create_citus_local_table. Instead, implement an internal method and call it from deprecated udf too.	2021-02-02 12:53:27 +03:00
Hanefi Onaldi	e38c6ebb39	Add instructions to install lz4 and zstd packages (#4606 )	2021-02-02 11:46:35 +03:00
Hadi Moshayedi	03d2b614e2	Merge pull request #4622 from citusdata/fix-4621 Columnar: properly initialize rowNumber.	2021-02-01 21:23:31 -08:00
Jeff Davis	f417510a7f	Columnar: properly initialize rowNumber.	2021-02-01 21:15:14 -08:00
Hadi Moshayedi	d0317ec4d0	Merge pull request #4618 from citusdata/fix_4608	2021-02-01 20:17:37 -08:00
Hadi Moshayedi	bcb162976f	Fix #4608	2021-02-01 16:23:16 -08:00
Hadi Moshayedi	877d87e372	Merge pull request #4610 from citusdata/fix_4600 Columnar: Fix lateral joins	2021-02-01 12:09:28 -08:00
Hadi Moshayedi	f5b1e49b79	Columnar: Fix lateral joins	2021-02-01 11:59:36 -08:00
Hadi Moshayedi	e2afbc9283	Merge pull request #4607 from citusdata/fix_4602 Columnar: Fix ALTER TABLE ... ADD COLUMN.	2021-02-01 11:51:04 -08:00
Hadi Moshayedi	ef927688fa	Columnar: Fix ALTER TABLE ... ADD COLUMN.	2021-02-01 11:40:17 -08:00
Brian Bergeron	1253eeb9ff	Don't propagate ALTER ROLE SET when scoped to a different database (#4471 ) Co-authored-by: brberger <brberger@microsoft.com>	2021-02-01 15:49:26 +03:00
Hanefi Onaldi	31763ef079	Merge pull request #4410 from citusdata/fix-shardid-in-partition-constraints fix_partition_constraints() goes over all partitioned distributed tables and renames constraint names to their original values. fix_partition_constraints(partitioned_dist_table) goes over all shard placements of a partitioned distributed table and sends worker_fix_partition_constraints(...) to workers in a distributed transaction. worker_fix_partition_constraints(partitioned_dist_table, shardId, constraintName) checks if a shardId is appended to a constraint, and removes that suffix with an ALTER TABLE .. RENAME CONSTRAINT command.	2021-01-29 17:46:42 +03:00
Hanefi Önaldı	cab17afce9	Introduce UDFs for fixing partitioned table constraint names	2021-01-29 17:32:20 +03:00
Hanefi Önaldı	92cf49b7e9	Limit shardId in partitioned table constraint names to only CHECK	2021-01-29 17:29:53 +03:00
SaitTalhaNisanci	738825cc38	Fix partition column index issue (#4591 ) * Fix partition column index issue We send column names to worker_hash/range_partition_table methods, and in these methods we look up the column's index by name in the tuple descriptor. This index is then used to decide the bucket that the current row will be sent to for the repartition. This becomes a problem when the tupleDescriptor contains the same column name more than once: we can choose the wrong index, and the partitioned data will be sent to the wrong workers. The result could then miss some data because workers might contain different ranges of data. An example: TupleDescriptor contains "trip_id", "car_id", "car_id" for one table. It contains only "car_id" for the other table. And assuming that the tables will be partitioned by car_id, it is not certain what should be used for deciding the bucket number for the first table. Assuming value 2 goes to bucket 2 and value 3 goes to bucket 3, it is not certain which bucket "1 2 3" (trip_id, car_id, car_id) row will go to. As a solution we send the index of partition column in targetList instead of the column name. The old API is kept so that it still works during worker upgrades (though it will have the same bug) * Use the same method so that backporting is easier	2021-01-29 14:40:40 +03:00
SaitTalhaNisanci	1ba399f5ca	Fix a flaky behaviour in shared_connection_stats (#4596 ) With the previous query, we were not pushing down the pg_sleep hence the number of connections to a worker could be different from run to run.	2021-01-28 18:42:49 +03:00
Önder Kalacı	34ccae8478	Merge pull request #4588 from citusdata/copy_fix_connection When the pool size is reached, COPY sets the placement access	2021-01-28 13:08:47 +01:00
Onder Kalaci	c7ea46067f	Add regression tests	2021-01-28 12:45:57 +01:00
Onder Kalaci	04fcd73eb6	When the shared pool size is reached, COPY sets the placement access It looks like we forgot to set the placement accesses, and this could lead to self-deadlocks on complex transaction blocks.	2021-01-28 12:45:57 +01:00
Onder Kalaci	36bdeef1bb	When the executor pool size is reached, COPY sets the placement access It looks like we forgot to set the placement accesses, and this could lead to self-deadlocks on complex transaction blocks.	2021-01-28 12:45:57 +01:00
Onur Tirtir	bb5962ee79	Early error out when creating citus local from a temp table (#4592 )	2021-01-28 14:18:06 +03:00
Halil Ozan Akgül	e96db9b407	Merge pull request #4581 from citusdata/error-for-alter-table-am-pg11 Adds error message to AlterTableSetAccessMethod for below PG12	2021-01-28 12:30:38 +03:00
Halil Ozan Akgul	913aa91449	Adds error message to AlterTableSetAccessMethod for below PG12	2021-01-28 11:32:02 +03:00
jeff-davis	15297cab49	Columnar: add GUC to control qual pushdown. (#4586 )	2021-01-27 09:57:40 -08:00
jeff-davis	62e0383150	Columnar readme. (#4585 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-27 09:33:35 -08:00
Nils Dijk	07d3b4fd04	fix NaN cost estimate on empty columnar tables (#4593 ) Fixing a division by zero in the cost calculations for scanning a columnar table. Due to how the columns in a columnar table are counted an empty table would result in a division by zero. Instead this patch keeps the column selection ratio on zero when this happens, resulting in an accurate cost of zero pages to scan a columnar table. fixes #4589	2021-01-27 17:32:17 +01:00
Nils Dijk	07cf037b13	fix parse error on pg11.8 for extension creation (#4582 ) In pg11.8 it seemingly tries to parse the full sql file creating the extension, since we use syntax introduced in postgres 12 this fails. This patch rewrites the statement not recognized by pg11.8 to be dynamically executed from a string literal via `EXECUTE`.	2021-01-27 17:00:29 +01:00
Onur Tirtir	b20615cbbe	Advise dropping foreign key in addition to create_reference_table hint (#4590 )	2021-01-27 17:59:06 +03:00
Onur Tirtir	a18d4288e9	Merge pull request #4573 from citusdata/rename-create_citus_local_table Rename create_citus_local_table to citus_add_local_table_to_metadata	2021-01-27 17:45:46 +03:00
Onur Tirtir	8151c4b443	Merge remote-tracking branch 'origin/master' into rename-create_citus_local_table	2021-01-27 17:08:58 +03:00
Ahmet Gedemenli	1c7ee10de2	Merge pull request #4584 from citusdata/fix-dropping-mat-views-when-alter-table Fix dropping materialized views while doing alter table	2021-01-27 17:01:39 +03:00
Ahmet Gedemenli	b2c1bbddd4	Merge branch 'master' into fix-dropping-mat-views-when-alter-table	2021-01-27 16:33:10 +03:00
Ahmet Gedemenli	35043c56f1	Fix dropping materialized views while doing alter table	2021-01-27 16:32:09 +03:00
Onur Tirtir	93a83d5472	Rename create_citus_local_table.c to citus_add_local_table_to_metadata.c	2021-01-27 15:52:37 +03:00
Onur Tirtir	dfcdccd0e7	Rename udf in regression tests (as per prev commit)	2021-01-27 15:52:37 +03:00
Onur Tirtir	1a4482a37c	Get rid of the sql dir for new udf	2021-01-27 15:52:37 +03:00
Onur Tirtir	2f30be823e	Rename create_citus_local_table to citus_add_local_table_to_metadata For simplicity in downgrade test in multi_extension, didn't actually remove create_citus_local_table udf.	2021-01-27 15:52:36 +03:00
Onur Tirtir	cd6f381d3c	Merge pull request #4567 from citusdata/hide-notice-undis Hide notice messages when implicitly undistributing citus local tables	2021-01-27 13:54:40 +03:00
Onur Tirtir	c06fcc26e5	Hide notice messages when implicitly undistributing citus local tables	2021-01-27 13:42:06 +03:00
Onur Tirtir	458a81f93d	Add suppressNoticeMessages to TableConversionState	2021-01-27 12:53:58 +03:00
Onur Tirtir	cacb76d2c6	Not mention citus local tables in error messages (#4579 )	2021-01-27 12:36:53 +03:00
Naisila Puka	94bc2703bc	Make undistribute_table() and citus_create_local_table() work with columnar (#4563 ) * Make undistribute_table() and citus_create_local_table() work with columnar * Rename and use LocallyExecuteUtilityTask for UDF check * Remove 'local' references in ExecuteUtilityCommand	2021-01-27 01:17:20 +03:00
Halil Ozan Akgül	9b6ccb313d	Merge pull request #4583 from citusdata/alter-am-to-columnar-notice-names-of-indexes Adds error messages with names of indexes that will be dropped when converting to columnar	2021-01-26 18:46:01 +03:00
Halil Ozan Akgul	bafa692fc1	Adds error messages with names of indexes that will be dropped	2021-01-26 18:18:26 +03:00
Ahmet Gedemenli	5659a3b830	Merge pull request #4574 from citusdata/fix-renaming-index-citus-local-tables Fix index renaming when creating citus local tables	2021-01-26 17:32:45 +03:00
Ahmet Gedemenli	e99f052904	Fix index renaming when creating citus local tables	2021-01-26 15:52:48 +03:00
Ahmet Gedemenli	7952100f49	Merge pull request #4561 from citusdata/fix-maintenance-daemon-crash Remove failing assertions	2021-01-26 15:50:32 +03:00
Ahmet Gedemenli	6cba42a8bc	Merge branch 'master' into fix-maintenance-daemon-crash	2021-01-26 13:38:08 +03:00
SaitTalhaNisanci	499e7ed038	Update CHANGELOG for 9.5.2 (#4577 )	2021-01-26 13:26:30 +03:00
Ahmet Gedemenli	14bf9d85d6	Merge branch 'master' into fix-maintenance-daemon-crash	2021-01-26 12:52:28 +03:00
Hadi Moshayedi	54f0e8619a	Merge pull request #4566 from citusdata/write-opt Columnar: optimize write path.	2021-01-25 12:00:09 -08:00
Jeff Davis	d62e54dc09	Columnar: optimize write path.	2021-01-25 11:47:21 -08:00
Hadi Moshayedi	350e0c1d61	Merge pull request #4565 from citusdata/fix_4555 Read chunk row count from catalog tables	2021-01-25 09:04:36 -08:00
Hadi Moshayedi	639952ffa8	Read chunk row count from catalog tables	2021-01-25 08:53:52 -08:00
Onur Tirtir	690f54b4fd	Merge pull request #4570 from citusdata/fix/downgrade-9.5-notify-dropped Drop notify_constraint_dropped beforehand when downgrading	2021-01-25 19:15:32 +03:00
Onur Tirtir	6a28f62239	Remove stale comment	2021-01-25 18:55:57 +03:00
Onur Tirtir	9e0150e9e2	Drop notify_constraint_dropped beforehand when downgrading	2021-01-25 18:55:57 +03:00
Nils Dijk	d127516dc8	Mitigate segfault in connection statemachine (#4551 ) As described in the comment, we have observed crashes in production due to a segfault caused by the dereference of a NULL pointer in our connection statemachine. As a mitigation, preventing system crashes, we provide an error with a small explanation of the issue. Unfortunately the case is not reliably reproduced yet, hence the inability to add tests. DESCRIPTION: Prevent segfaults when SAVEPOINT handling cannot recover from connection failures	2021-01-25 15:55:04 +01:00
Onur Tirtir	eed7c17ddf	Merge pull request #4539 from citusdata/auto-citus-local-when-create-ref Convert postgres tables to citus local when creating reference table having fkeys	2021-01-25 11:11:29 +03:00
Onur Tirtir	215d6630c3	Update foreign_key_to_reference_table so that test output doesn't change	2021-01-25 11:03:39 +03:00
Onur Tirtir	b5ea033a0b	Convert postgres tables to citus local when creating reference table having fkeys	2021-01-25 11:02:50 +03:00
Onur Tirtir	8e02375aa3	Some refactor as a preparation	2021-01-25 11:01:33 +03:00
Onur Tirtir	253c19062a	Rename IsCitusInitiatedBackend to IsCitusInitiatedRemoteBackend (#4562 )	2021-01-23 01:07:43 +03:00
Hadi Moshayedi	a4b5da79dd	Merge pull request #4564 from citusdata/cleanup-cstore Columnar: clean up old references to cstore.	2021-01-22 11:25:53 -08:00
Jeff Davis	53f7b019d5	Columnar: clean up old references to cstore.	2021-01-22 11:08:36 -08:00
Onur Tirtir	941c8fbf32	Automatically undistribute citus local tables when no more fkeys with reference tables (#4538 )	2021-01-22 18:15:41 +03:00
Ahmet Gedemenli	5022fc8301	Remove failing assertions	2021-01-22 17:09:24 +03:00
Marco Slot	11083b9987	Merge pull request #4560 from citusdata/marcocitus/citus-tables-rename Rename citus_tables column names to be query-friendly	2021-01-22 14:13:49 +01:00
Ahmet Gedemenli	6e62a9ea74	Merge pull request #4529 from citusdata/remove-deprecated-gucs-udfs Remove unused GUCs/UDFs	2021-01-22 13:39:17 +03:00
Ahmet Gedemenli	63fab1b7d9	Merge branch 'master' into remove-deprecated-gucs-udfs	2021-01-22 13:29:07 +03:00
SaitTalhaNisanci	3d69ab5576	Choose the smallest colocation id among all matches (#4559 ) Currently we choose an arbitrary colocation id from all the matches for a colocation id. This could mean that 2 distributed tables, which have the same scheme could go into different colocation groups. This fix makes sure that the same match will go to the same colocation group.	2021-01-22 13:28:43 +03:00
Ahmet Gedemenli	3ac30ef9d8	Merge branch 'master' into remove-deprecated-gucs-udfs	2021-01-22 13:06:13 +03:00
Ahmet Gedemenli	790c22c0ce	Merge pull request #4536 from citusdata/fix-bug-create-citus-local-table-with-stats Fix bug creating citus local table with stats	2021-01-22 13:05:56 +03:00
Ahmet Gedemenli	76354ff563	Merge branch 'master' into remove-deprecated-gucs-udfs	2021-01-22 12:47:06 +03:00
Ahmet Gedemenli	887b67953b	Merge branch 'master' into fix-bug-create-citus-local-table-with-stats	2021-01-22 12:46:47 +03:00
Hadi Moshayedi	b53296b4e4	Merge pull request #4552 from citusdata/columnar_fix_names More meaningful columnar metadata table names	2021-01-21 21:39:52 -08:00
Hadi Moshayedi	ff38996645	More meaningful columnar metadata table names	2021-01-21 21:29:07 -08:00
Hadi Moshayedi	9051bf485f	Merge pull request #4557 from citusdata/columnar_rename_funcs Don't use 'cstore' in symbols	2021-01-21 21:11:27 -08:00
Hadi Moshayedi	222fb4d589	Don't use 'cstore' in function names	2021-01-21 18:32:21 -08:00
jeff-davis	0b5551faaf	Columnar: add explain info for chunk filtering (#4554 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-21 15:04:42 -08:00
jeff-davis	0581df23f4	Add columnar test for json (#4553 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-21 14:36:38 -08:00
Marco Slot	03328e9679	Rename citus_tables column names to be query-friendly	2021-01-21 18:58:30 +01:00
Önder Kalacı	9b39b25390	Prevent citus local table creation via remote execution (#4540 ) /* * Creating Citus local tables relies on functions that accesses * shards locally (e.g., ExecuteAndLogDDLCommand()). As long as * we don't teach those functions to access shards remotely, we * cannot relax this check. */	2021-01-21 11:26:45 +03:00
Onur Tirtir	433062e5d2	Add fkeys between citus local and reference tables in some tests (#4546 )	2021-01-20 19:30:20 +03:00
Ahmet Gedemenli	89a6fe83f7	Replace to update_distributed_table_colocation for tests	2021-01-20 17:30:06 +03:00
Ahmet Gedemenli	ceb6b503c0	Remove unused UDF mark_tables_colocated	2021-01-20 17:29:23 +03:00
Ahmet Gedemenli	2fa060a32d	Fix bug creating citus local table with stats	2021-01-20 17:17:13 +03:00
Önder Kalacı	64a1fddd9a	Merge pull request #4544 from citusdata/refactor_utility_hook Refactor utility hook	2021-01-20 16:02:48 +03:00
Onder Kalaci	8129ce472f	Refactor Utility Hook We want to be able to find the "top-level" DDL commands (not internal/cascading ones). To achieve that, we have some refactoring.	2021-01-20 15:54:00 +03:00
Onder Kalaci	8df58926c5	Rename CitusProcessUtility -> ProcessUtilityForNode	2021-01-20 15:54:00 +03:00
Halil Ozan Akgül	082899ffa4	Merge pull request #4545 from citusdata/alter-into-same-access-method Adds same access method check	2021-01-20 15:50:52 +03:00
Halil Ozan Akgul	434f5af030	Adds same access method check	2021-01-20 15:18:03 +03:00
Hadi Moshayedi	e1376ca106	Merge pull request #4541 from citusdata/normalize_tests Normalize citus_local_tables	2021-01-19 16:15:40 -08:00
Hadi Moshayedi	8a5b6a43fc	Normalize citus_local_tables	2021-01-19 15:56:42 -08:00
Hadi Moshayedi	131c981502	Merge pull request #4524 from citusdata/metadata_sync_reland Metadata sync reland	2021-01-19 08:20:14 -08:00
Hadi Moshayedi	0e0fd6599a	Faster logical replication tests. Logical replication status can take wal_receiver_status_interval seconds to get updated. Default is 10s, which means tests in which logical replication is used can take a long time to finish. We reduce it to 1 second to speed these tests up. Logical replication apply launcher launches workers every wal_retrieve_retry_interval, so if we have many shard moves with logical replication consecutively, they will be throttled by this parameter. Default is 5s, we reduce it to 1s so we finish tests faster.	2021-01-19 07:48:47 -08:00
Hadi Moshayedi	bc01c795a2	Reland #4419	2021-01-19 07:48:47 -08:00
SaitTalhaNisanci	745ffbc691	Separate schedules for mixed mode and normal mode in upgrade (#4420 )	2021-01-19 14:08:11 +03:00
Halil Ozan Akgül	fbcad34c26	Merge pull request #4522 from citusdata/include-indexes-in-statistics-command Moves Creation of ALTER INDEX STATISTICS Commands Next to Index Commands	2021-01-18 17:25:11 +03:00
Halil Ozan Akgul	27c2bd1599	Moves creation of ALTER INDEX STATISTICS commands next to index commands	2021-01-18 16:55:53 +03:00
Naisila Puka	7124a7715d	Skip 'already exists' in CREATE TABLE IF NOT EXISTS PARTITION OF (#4507 ) * Just skip 'already exists' in CT IF NOT EXISTS PARTITION OF * Generalize to tables that are not already distributed partitions	2021-01-18 15:56:02 +03:00
Onur Tirtir	f1ecbc3a53	Fix segfault when adding/dropping fkey from ref to citus local via remote exec (#4528 )	2021-01-17 20:43:33 +03:00
Onur Tirtir	5a3e8a6e24	Skip postgres tables for UndistributeTable(cascadeViaFKeys) (#4530 ) The reason behind skipping postgres tables is that we support foreign keys between postgres tables and reference tables (without converting postgres tables to citus local tables) when enable_local_reference_table_foreign_keys is false or when coordinator is not added to metadata.	2021-01-17 20:32:30 +03:00
Ahmet Gedemenli	2e61f93171	Merge pull request #4523 from citusdata/fix-assert-failure-create-statistics Fix assert failure when creating statistics	2021-01-15 19:49:58 +03:00
Ahmet Gedemenli	107097ee28	Fix assert failure when creating statistics	2021-01-15 19:36:58 +03:00
Onur Tirtir	7dddfa2d0b	Not invalidate fkey cache if citus not installed (#4521 )	2021-01-15 18:31:43 +03:00
Onur Tirtir	1e377ec699	Merge pull request #4480 from citusdata/create-table-define-fkey * Convert postgres tables to citus local tables for CREATE TABLE commands defining foreign keys * Introduce citus.enable_local_reference_table_foreign_keys guc	2021-01-15 18:23:05 +03:00
Onder Kalaci	c35e22d75d	Skip validation for foreign key creation commands For certain purposes, we drop and recreate the foreign keys. As we acquire exclusive locks on the tables in between drop and re-create, we can safely skip the validation phase of the foreign keys. The reason is purely performance, as foreign key validation could take a long time.	2021-01-15 18:04:52 +03:00
Onder Kalaci	ae0b92233d	Rename function	2021-01-15 18:04:52 +03:00
Onder Kalaci	30d0a65f40	Adds citus.enable_local_reference_table_foreign_keys When enabled, any foreign keys between local tables and reference tables are supported by converting the local table to a citus local table. When the coordinator is not in the metadata, the logic is disabled as foreign keys are not allowed in this configuration.	2021-01-15 18:04:52 +03:00
Onder Kalaci	ed58a404d5	Release lock on CoordinatorAddedAsWorkerNode() Because master_add_node(or others) might acquire ExclusiveLock and their initiated sessions may call CoordinatorAddedAsWorkerNode(). With this we prevent potential deadlocks.	2021-01-15 18:04:42 +03:00
Onur Tirtir	e718d24868	Add support for CREATE TABLE commands defining foreign keys	2021-01-15 17:46:06 +03:00
Ahmet Gedemenli	66ff912f5d	Merge pull request #4512 from citusdata/remove-unused-gucs-and-functions Remove unused GUCs and functions	2021-01-15 13:47:40 +03:00
Ahmet Gedemenli	9a100bcdb9	Remove unused GUCs Remove deprecated variables Remove GUC citus.sslmode Remove GUC citus.expire_cached_shards Remove GUC citus.task_tracker_delay Remove GUC citus.max_assign_task_batch_size Remove GUC citus.max_tracked_tasks_per_node Remove GUC citus.max_running_tasks_per_node Remove GUC citus.large_table_shard_count Remove GUC citus.max_task_string_size Remove GUC citus.binary_master_copy_format	2021-01-15 13:30:45 +03:00
Onur Tirtir	787ed643dd	Undistribute table when cascade_via_foreign_keys=true even if rel has no fkeys (#4516 ) If relation is not involved in any foreign key relationships, foreign key graph would not return any relations for given relationId as expected. But even if it's the case, we should still undistribute the table itself.	2021-01-15 12:45:44 +03:00
Halil Ozan Akgül	4d204320d3	Merge pull request #4513 from citusdata/fix-tableconversionreturn-warning Fixes tableConversionReturn Redefinition Warning	2021-01-15 11:58:14 +03:00
Halil Ozan Akgul	9407965817	Moves struct to the header	2021-01-15 11:50:11 +03:00
Onur Tirtir	4b9285353d	Merge pull request #4479 from citusdata/alter-table-add-fkey Convert postgres tables to citus local tables for ALTER TABLE commands defining foreign keys	2021-01-14 19:03:49 +03:00
Onur Tirtir	36b418982f	Add support for ALTER TABLE commands defining foreign keys	2021-01-14 17:12:00 +03:00
Onur Tirtir	05931b8fe2	Pass ProcessUtilityContext to .preprocess	2021-01-14 17:12:00 +03:00
Onur Tirtir	ac7bccd847	Skip citus tables for CreateCitusLocalTable(cascadeViaFKeys)	2021-01-14 17:12:00 +03:00
Nils Dijk	a655ef27bc	Test columnar recovery (#4485) DESCRIPTION: Add tests to verify crash recovery for columnar tables Based on the Postgres TAP tooling we add a new test suite to the array of test suites for citus. It is modelled after `src/test/recovery` in the postgres project and takes the same place in our repository. It uses the perl modules defined in the postgres project to control the postgres nodes. The tests we add here focus on crash recovery. Our follower tests should cover the streaming replication behaviour. It is hooked to our CI for both postgres 12 and postgres 13. We omit the recovery tests for postgres 11 as we do not have support for the columnar table access method.	2021-01-14 14:58:29 +01:00
Marco Slot	c3f46de421	Merge pull request #4504 from citusdata/marcocitus/alter-old-partitions Add alter_old_partitions_set_access_method procedure to compress old partitions	2021-01-14 14:18:41 +01:00
Marco Slot	b840e97cd6	Add an alter_old_partitions_set_access_method UDF	2021-01-14 10:44:14 +01:00
Ahmet Gedemenli	bb089c4344	Merge pull request #4503 from citusdata/recreate-invalidation-functions-for-citus10 Recreate invalidation functions for Citus10	2021-01-14 00:16:22 +03:00
Ahmet Gedemenli	9b56ad48cb	Recreate invalidation functions for Citus10 Fix multi_create_table Add schema name to altered functions Recreate invalidation functions when downgrading	2021-01-13 23:18:07 +03:00
jeff-davis	9cffd41389	Cleanup: use table_open, not heap_open. (#4506 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-13 12:08:46 -08:00
jeff-davis	ec319faa43	Only allow columnar tables with permanent storage (#4492 ). (#4495 ) Co-authored-by: Jeff Davis <jefdavi@microsoft.com>	2021-01-13 10:37:34 -08:00
jeff-davis	b49beda4c3	Stronger check for triggers on columnar tables (#4493 ). (#4494 ) * Stronger check for triggers on columnar tables (#4493). Previously, we used a simple ProcessUtility_hook. Change to use an object_access_hook instead. * Replace alter_table_set_access_method test on partition with foreign key Co-authored-by: Jeff Davis <jefdavi@microsoft.com> Co-authored-by: Marco Slot <marco.slot@gmail.com>	2021-01-13 10:30:53 -08:00
Marco Slot	b79af16dac	Merge pull request #4421 from citusdata/marcocitus/expand-sublink-recursive-planning	2021-01-13 17:33:18 +01:00
Marco Slot	de6aaaa648	Expand support for subqueries in target list through recursive planning	2021-01-13 17:26:09 +01:00
Onur Tirtir	bfc98e01d1	Merge pull request #4489 from citusdata/enable-create-ref-from-citus-local Enable reference/distributed table creation from citus local tables	2021-01-13 17:26:51 +03:00
Onur Tirtir	ccbc3de535	Enable reference/distributed table creation from citus local tables	2021-01-13 17:14:26 +03:00
Onur Tirtir	7180ef5df1	Increment command counter in UndistributeTable	2021-01-13 16:54:35 +03:00
Onur Tirtir	00da1eed20	Some refactor as a preparation	2021-01-13 16:50:09 +03:00
Halil Ozan Akgül	a4f377282e	Merge pull request #4387 from citusdata/alter-table-udfs Adds Alter Table UDFs	2021-01-13 16:40:40 +03:00
Halil Ozan Akgul	2be14cce2e	Adds alter_distributed_table and alter_table_set_access_method UDFs	2021-01-13 16:02:39 +03:00
Onur Tirtir	1299895e71	Give hint to use ref table for unsupported fkeys between citus local & ref (#4501 )	2021-01-13 15:33:46 +03:00
SaitTalhaNisanci	724d56f949	Add citus shard helper view (#4361 ) With citus shard helper view, we can easily see: - where each shard is, which node, which port - what kind of table it belongs to - its size With such a view, we can see shards that have a size bigger than some value, which could be useful. Also debugging can be easier in production as well with this view. Fetch shards in one go per node The previous implementation was slow because it would do a lot of round trips, one per shard to be exact. Hence it is improved so that we fetch all the shard_name, shard-size pairs per node in one go. Construct shards_names, sizes query on coordinator	2021-01-13 13:58:47 +03:00
Önder Kalacı	7e0826a06b	Make sure that materialized views that contain only (#4499) Make sure that materialized views that contain only intermediate results work fine.	2021-01-13 13:17:43 +03:00
Ahmet Gedemenli	436c9d9d79	Remove the word 'master' from Citus UDFs (#4472 ) * Replace master_add_node with citus_add_node * Replace master_activate_node with citus_activate_node * Replace master_add_inactive_node with citus_add_inactive_node * Use master udfs in old scripts * Replace master_add_secondary_node with citus_add_secondary_node * Replace master_disable_node with citus_disable_node * Replace master_drain_node with citus_drain_node * Replace master_remove_node with citus_remove_node * Replace master_set_node_property with citus_set_node_property * Replace master_unmark_object_distributed with citus_unmark_object_distributed * Replace master_update_node with citus_update_node * Replace master_update_shard_statistics with citus_update_shard_statistics * Replace master_update_table_statistics with citus_update_table_statistics * Rename master_conninfo_cache_invalidate to citus_conninfo_cache_invalidate Rename master_dist_local_group_cache_invalidate to citus_dist_local_group_cache_invalidate * Replace master_copy_shard_placement with citus_copy_shard_placement * Replace master_move_shard_placement with citus_move_shard_placement * Rename master_dist_node_cache_invalidate to citus_dist_node_cache_invalidate * Rename master_dist_object_cache_invalidate to citus_dist_object_cache_invalidate * Rename master_dist_partition_cache_invalidate to citus_dist_partition_cache_invalidate * Rename master_dist_placement_cache_invalidate to citus_dist_placement_cache_invalidate * Rename master_dist_shard_cache_invalidate to citus_dist_shard_cache_invalidate * Drop master_modify_multiple_shards * Rename master_drop_all_shards to citus_drop_all_shards * Drop master_create_distributed_table * Drop master_create_worker_shards * Revert old function definitions * Add missing revoke statement for citus_disable_node	2021-01-13 12:10:43 +03:00
Onur Tirtir	2ef5879bcc	Fix error thrown for foreign keys from citus local to dist tables (#4490 )	2021-01-13 10:15:12 +03:00
Onur Tirtir	dd55ab394e	Disallow cascade_via_foreign_keys if any partition rel has non-inherited fkeys (#4487 )	2021-01-11 21:50:09 +03:00
Naisila Puka	7b05777682	Add ALTER TABLE .. SET LOGGED/UNLOGGED support (#4486 )	2021-01-11 20:39:06 +03:00
Marco Slot	21fd2e2c92	Merge pull request #4434 from citusdata/marcocitus/single-node	2021-01-08 17:36:39 +01:00
Marco Slot	d900a7336e	Automatically add placeholder record for coordinator	2021-01-08 15:09:53 +01:00
Marco Slot	ce344a54cb	Merge pull request #4431 from citusdata/marcocitus/time-partitions	2021-01-08 14:07:11 +01:00
Marco Slot	597533b1ff	Add citus_set_coordinator_host	2021-01-08 13:36:26 +01:00
Onur Tirtir	5289785da4	Add cascade_via_foreign_keys option to create_citus_local_table (#4462 )	2021-01-08 15:13:26 +03:00
Marco Slot	e7f13978b5	Add a view for simple (time) partitions and their access methods	2021-01-08 11:28:15 +01:00
Marco Slot	9c851817f1	Merge pull request #4400 from citusdata/marcocitus/rebalancer Add the shard rebalancer implementation	2021-01-07 19:15:48 +01:00
Marco Slot	011283122b	Add the shard rebalancer implementation	2021-01-07 16:51:55 +01:00
Onur Tirtir	d9a3e26f20	Fix flaky test in multi_foreign_key_relation_graph (#4476 ) CREATE TABLE does not invalidate foreign key graph but some other set of ddl commands do. Previously, as we run multi_foreign_key & multi_foreign_key_relation_graph in parallel, it's possible that multi_foreign_key invalidates foreign key graph via some ddl commands and create table test in multi_foreign_key_relation_graph becomes flaky. So we un-parallelize those two tests.	2021-01-07 16:19:11 +03:00
Onur Tirtir	47cd1db209	Merge pull request #4457 from citusdata/cascade-udf * Add infrastructure to cascade citus table functions on foreign keys * Add cascade_via_foreign_keys option to undistribute_table	2021-01-07 15:52:08 +03:00
Onur Tirtir	f3801143fb	Add cascade option to undistribute_table	2021-01-07 15:41:49 +03:00
Onur Tirtir	2e3e680ba9	Add infra to cascade citus table functions	2021-01-07 15:41:48 +03:00
Marco Slot	952d1ee2cd	Merge pull request #4477 from citusdata/marcocitus/revert-metadata-sync-fix	2021-01-07 13:27:26 +01:00
Marco Slot	47c1b19174	Revert "Do metadata sync in a separate background worker." This reverts commit `4df723cf9b`.	2021-01-07 10:30:04 +01:00
Marco Slot	d9f175532b	Revert "Trigger metadata sync at transaction commit" This reverts commit `a2c73bef27`.	2021-01-07 10:30:00 +01:00
Marco Slot	75c533ca02	Merge pull request #4473 from citusdata/marcocitus/fix-insert-select-local-execution Support local execution for INSERT..SELECT with re-partitioning	2021-01-06 16:55:03 +01:00
Marco Slot	5de3337b2f	Support local execution for INSERT..SELECT with re-partitioning	2021-01-06 16:15:53 +01:00
Önder Kalacı	26c8f1632f	Merge pull request #4474 from citusdata/remove_warn_leak Remove "WarnAboutLeakedPreparedTransaction" function	2021-01-06 16:13:36 +03:00
Onder Kalaci	2fe158961b	Remove "WarnAboutLeakedPreparedTransaction" function We used to need WarnAboutLeakedPreparedTransaction() as we didn't have auto 2PC recovery. But we have long had 2PC recovery, since https://github.com/citusdata/citus/pull/1574, so we don't need it anymore.	2021-01-06 15:48:58 +03:00
Naisila Puka	bcfc0aa4e9	Rethrow original concurrent index creation failure message (#4469 ) * Rethrow original concurrent index creation failure message * Alter test outputs for concurrent index creation * Detect duplicate table failure in concurrent index creation * Add test for conc. index creation w/out duplicates	2021-01-06 15:27:13 +03:00
Onur Tirtir	0d7aea3a22	Move pre undistribute_table checks into C API (#4456)	2021-01-06 10:49:35 +03:00
Ahmet Gedemenli	1f36ff7c17	Prevent deadlock for long named partitioned index creation on single node (#4461 ) * Prevent deadlock for long named partitioned index creation on single node * Create IsSingleNodeCluster function * Use both local and sequential execution	2021-01-05 13:39:13 +03:00
Ahmet Gedemenli	f27649754b	Add alter index set statistics support (#4455 ) * Add alter index set statistics support * Use attNum instead of attName	2021-01-05 13:23:11 +03:00
Onur Tirtir	e91e745dbc	Implement ConstraintWithNameIsOfType (#4451 )	2020-12-29 11:53:06 +03:00
Onur Tirtir	c0e7f31eb0	Merge pull request #4452 from citusdata/implement-GetPgDependTuplesForDependingObjects Implement GetPgDependTuplesForDependingObjects	2020-12-29 01:13:38 +03:00
Onur Tirtir	e74acf11fe	Merge branch 'master' into implement-GetPgDependTuplesForDependingObjects	2020-12-29 00:34:31 +03:00
Onur Tirtir	87e5276bdd	Fix fkey graph test for self reference (#4450 )	2020-12-28 12:47:39 +03:00
Onur Tirtir	feda8bdd37	Now that we use tuples after closing pg_depend, don't release lock	2020-12-25 18:03:28 +03:00
Onur Tirtir	04a4167a8a	Implement GetPgDependTuplesForDependingObjects	2020-12-25 18:03:28 +03:00
Halil Ozan Akgül	a8626d1944	Fixes the table used in the error message (#4449 )	2020-12-25 16:48:50 +03:00
Naisila Puka	b9cd91ef08	Merge pull request #4409 from citusdata/issue4237 Prevent empty placement creation in the coordinator	2020-12-25 12:43:42 +03:00
Naisila Puka	04aeb6938b	Merge branch 'master' into issue4237	2020-12-25 12:36:40 +03:00
Hadi Moshayedi	52164450eb	Merge pull request #4419 from citusdata/metadata_sync Do metadata sync in a separate background worker.	2020-12-24 09:15:16 -08:00
Hadi Moshayedi	a2c73bef27	Trigger metadata sync at transaction commit	2020-12-24 08:28:38 -08:00
Hadi Moshayedi	4df723cf9b	Do metadata sync in a separate background worker.	2020-12-24 08:25:55 -08:00
Ahmet Gedemenli	299d3fcbc5	Merge pull request #4444 from citusdata/alter-statistics-propagation Propagate alter statistics	2020-12-24 18:35:52 +03:00
Naisila Puka	0bb2c991f9	Merge branch 'master' into issue4237	2020-12-24 18:05:27 +03:00
Ahmet Gedemenli	5af585269a	Add separate pg13 test for stats targets	2020-12-24 18:01:25 +03:00
naisila	59a81491e8	Add test for master_create_empty_shard on coordinator	2020-12-24 17:59:40 +03:00
Ahmet Gedemenli	d4bc17f6f0	Propagate statistics with altered targets	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	48ca1637a4	Propagate alter stats owner	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	f7c70f9a63	Propagate alter stats target	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	5a1607b6c0	Propagate alter stats schema	2020-12-24 17:10:12 +03:00
Ahmet Gedemenli	bdce4a7e67	Propagate rename statistics	2020-12-24 17:10:12 +03:00
Onur Tirtir	5ed9197041	Implement infra to get foreign key connected relations (#4439 ) On top of our foreign key graph, implement the infrastructure to get list of relations that are connected to input relation via a foreign key graph. We need this to support cascading create_citus_local_table & undistribute_table operations. Also add regression tests to see what our foreign key graph is able to capture currently.	2020-12-24 16:42:40 +03:00
Onur Tirtir	0db21bbe14	Remove fkey graph visited flags & rework GetConnectedListHelper (#4446 ) With this commit, we remove visited flags from ForeignConstraintRelationshipNode struct since keeping local state in global object is both dangerous and meaningless. Also to improve readability, this commit also converts needless recursion to iterative DFS to avoid passing local hash-map as another parameter to GetConnectedListHelper function.	2020-12-24 12:38:48 +03:00
SaitTalhaNisanci	1ac9cb3fd2	Update pg upgrade tester tag (#4447 )	2020-12-24 12:13:24 +03:00
Onur Tirtir	57e7defa3c	Support CREATE INDEX commands without index name on citus tables (#4273 )	2020-12-23 23:15:39 +03:00
Marco Slot	f7b182ebeb	Merge pull request #4445 from citusdata/marcocitus/remove-upgrade-to-reference-table Remove upgrade_to_reference_table UDF	2020-12-23 17:41:57 +01:00
Halil Ozan Akgül	9fd3f62cb6	Refactor foreign key functions to use table types (#4424 ) * Reuses extractReferencing/Referenced variables * Refactors GetForeignKeyOids function to check table types * Converts flags to inclusive	2020-12-23 17:05:09 +03:00
Onur Tirtir	d1b3eaf767	Refactor ColumnAppearsInForeignKeyToReferenceTable (#4441 )	2020-12-23 11:44:02 +03:00
jeff-davis	90d63cb792	Add columnar pg_dump test. (#4433 )	2020-12-22 15:57:35 -08:00
Marco Slot	e3dcc278e0	Remove upgrade_to_reference_table UDF	2020-12-23 00:40:14 +01:00
naisila	5234caecca	Prevent empty placement creation in the coordinator	2020-12-22 19:39:05 +03:00
Ahmet Gedemenli	00bd784783	Merge pull request #4436 from citusdata/propagate-drop-statistics Propagate Drop Statistics	2020-12-22 18:47:23 +03:00
Ahmet Gedemenli	874fa1fc09	Propagate Drop Statistics	2020-12-22 18:34:46 +03:00
Onur Tirtir	3f60b08b11	Refactor foreign_key_relationship.c (#4438 )	2020-12-22 18:12:02 +03:00
Marco Slot	dca83e5938	Merge pull request #4437 from citusdata/marcocitus/collapse-7 Collapse Citus 7.* scripts into Citus 8.0-1	2020-12-22 13:41:19 +01:00
Marco Slot	321cc784c7	Collapse Citus 7.* scripts into Citus 8.0-1	2020-12-21 22:55:51 +01:00
Hadi Moshayedi	dde0323b57	Columnar: enable zstd & lz4 compilation by default (#4402 ) * Columnar: enable zstd & lz4 compilation by default * Make zstd & lz4 tests more consistent * Don't require lz4 & zstd for postgres 11 Co-authored-by: Nils Dijk <nils@citusdata.com>	2020-12-21 12:11:58 -08:00
Onur Tirtir	cceaf31e4c	Add some more tests with views to test recursive planning on views (#4427 ) (cherry picked from commit `51f422f3c6`)	2020-12-21 11:53:37 +03:00
jeff-davis	49281202af	Add simple follower test for columnar. (#4432 )	2020-12-18 13:59:20 -08:00
jeff-davis	3e0f1aaaab	Prevent inserting into logically-replicated columnar table. (#4429 )	2020-12-18 12:29:30 -08:00
Marco Slot	f2056e553f	Expose partition column of subqueries in optimizer (#4355 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-12-18 20:32:52 +01:00
SaitTalhaNisanci	145112f3a0	Fix attribute numbers in subquery conversions (#4426 ) The attribute number in a subquery RTE and in a relation RTE means different things. In a relation, the attribute number points to the column number in the table definition, including dropped columns; in a subquery, it is the index in the target list. When we convert a relation RTE to a subquery RTE we should either correct all the relevant attribute numbers or just add a dummy column for the dropped columns. We choose the latter in this commit because it is practically too vulnerable to update all the vars in a query. Another thing this commit fixes is that in case a join restriction clause list contains a false clause, we should just return a false clause instead of the whole list, because the whole list will contain restrictions from other RTEs as well and this breaks the query, which can be seen from the output changes; now it is much simpler. Also, instead of adding single tests for dropped columns, we choose to run the whole mixed queries with tables with dropped columns; this already revealed some bugs, which are fixed in this commit.	2020-12-18 20:25:41 +03:00
Nils Dijk	9799db0567	Merge pull request #4417 from citusdata/ci/rework-ci-spec This doesn't change anything functionally for Citus or our CI. Instead it is a quality of life change to make it easier to maintain CI and how we build. In general this patch accomplishes the following - move build and test scripts into the Citus tree instead of our CI tree. - deduplicate convoluted jobs definition by a couple generic build and test runs - standardized the output - standardized coredumps - made all coredumps and regression diffs available for download A big part of this refactor is in the `the-process` repository. https://github.com/citusdata/the-process/pull/48	2020-12-18 18:12:39 +01:00
Nils Dijk	a748729998	rework ci	2020-12-18 18:04:45 +01:00
Ahmet Gedemenli	fb06f7e57e	Merge pull request #4395 from citusdata/propagate-create-statistics Propagate create statistics	2020-12-18 18:18:35 +03:00
Ahmet Gedemenli	770d3da1ca	Add dependencies for stat schemas	2020-12-18 17:04:13 +03:00
Ahmet Gedemenli	6c0465566a	Propagate create statistics	2020-12-17 20:38:36 +03:00
Marco Slot	1e2518f83c	Add tests for router queries with catalog tables (#4422 ) Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-12-17 15:07:50 +01:00
Marco Slot	61bf2fb477	Merge pull request #4385 from citusdata/marcocitus/correlated-subqueries	2020-12-16 11:55:43 +01:00
SaitTalhaNisanci	26284bf2a1	Merge pull request #4358 from citusdata/local_distributed_table_join Support local/citus local distributed/subquery joins	2020-12-15 18:29:39 +03:00
Sait Talha Nisanci	181a7e1d36	Skip dropped columns	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	7951273f74	Refactor WrapRteRelationIntoSubquery	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	0e53aa5d3b	Add more tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	d5b0f02a64	Decide what group to convert, then convert them all in one go	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	c4d3927956	Not allow local table updates with remote citus local tables	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	f5dd5379b2	Add more tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	f7c1509fed	Not check if the query is routable for converting It seems that there are only very few cases where that is useful, and for now we prefer not having that check. This means that we might perform some unnecessary checks, but that should be rare and not performance critical.	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	1d82972ff4	Increase the performance with a trick Instead of sending NULL's over a network, we now convert the subqueries in the form of: SELECT t.a, NULL, NULL FROM (SELECT a FROM table)t; And we recursively plan the inner part so that we don't send the NULL's over network. We still need the NULLs in the outer subquery because we currently don't have an easy way of updating all the necessary places in the query. Add some documentation for how the conversion is done	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	3aed6c3ad0	Rename containsOnlyLocalTable as isLocalTableModification Update error message in Modify View	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	13c43d5744	Improve table conversion logic in dist-local joins	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	5618f3a3fc	Use BaseRestrictInfo for finding equality columns Baseinfo also has pushed down filters etc, so it makes more sense to use BaseRestrictInfo to determine what columns have constant equality filters. Also RteIdentity is used for removing conversion candidates instead of rteIndex.	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	28c5b6a425	Convert some hard coded errors to deferred errors in router planner	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	69992d58f9	Add broken local-dist table modifications tests It seems that most of the updates were broken, we weren't aware of it because there wasn't any data in the tables. They are broken mostly because local tables do not have a shard id and some code paths should be updated with that information, currently when there is an invalid shard id, it is assumed to be pruned. Consider local tables in router planner In case there is a local table, the shard id will not be valid and there are some checks that rely on shard id, we should skip these in case of local tables, which is handled with a dummy placement. Add citus local table dist table join tests add local-dist table mixed joins tests	2020-12-15 18:18:36 +03:00
Sait Talha Nisanci	a34504d7bf	Move recursive planning related function to recursive_planning	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	2a44029aaf	Simplify ContainsTableToBeConvertedToSubquery AllDataLocallyAccessible and ContainsLocalTableSubqueryJoin are removed. We can possibly remove ModifiesLocalTableWithRemoteCitusLocalTable as well. Though this removal has a side effect that now when all the data is locally available, we could still wrap a relation into a subquery, I guess that should be resolved in the router planner itself. Add more tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	26d9f0b457	Use auto mode in tests and fix debug message	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	3bd53a24a3	Support update on postgres table from citus local table	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	4b6611460a	Support foreign table joins as well	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	7e9204eba9	Update vars in quals while wrapping RTE to subquery When we wrap an RTE to subquery we are updating the variables varno's as 1, however we should also update the varno's of vars in quals. Also some other small code quality improvements are done.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	0689f2ac1a	Recursively plan distributed tables only if all have unique filters The previous algorithm was not consistent and it could convert different RTEs based on the table orders in the query. Now we convert local tables if there is a distributed table which doesn't have a unique index. So if there are 4 tables, local1, local2, dist1, dist2_with_pkey then we will convert local1 and local2 in `auto` mode. Converting a distributed table is not that logical because as there is a distributed table without a unique index, we will need to convert the local tables anyway. So converting the distributed table with pkey is redundant.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	a008fc611c	Support materialized view joins as well	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	5f46abffd9	Update check multi tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	ff4f3b2f3c	Use PlannerRestrictionContext instead of RecursivePlannerContext	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	3fe3c55023	Use ShouldConvertLocalTableJoinsToSubqueries Remove FillLocalAndDistributedRTECandidates and use ShouldConvertLocalTableJoinsToSubqueries, which simplifies things as we rely on a single function to decide whether we should continue converting RTE to subquery.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	eebcd995b3	Add some more tests	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	5693cabc41	Not convert an already routable plannable query We should not recursively plan an already routable plannable query. An example of this is (SELECT * FROM local JOIN (SELECT * FROM dist) d1 USING(a)); So we let the recursive planner do all of its work and at the end we convert the final query to handle unsupported joins. While doing each conversion, we check if it is router plannable, if so we stop. Only consider range table entries that are in jointree If a range table is not in jointree then there is no point in considering that because we are trying to convert range table entries to subqueries for join use case.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	2ff65f3630	Enable partitioned distributed tables in local-dist table joins	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	44953579cf	Enable citus-local distributed table joins Check equality in quals We want to recursively plan distributed tables only if they have an equality filter on a unique column. So '>' and '<' operators will not trigger recursive planning of distributed tables in local-distributed table joins. Recursively plan distributed table only if the filter is constant If the filter is not a constant then the join might return multiple rows and there is a chance that the distributed table will return huge data. Hence if the filter is not constant we choose to recursively plan the local table.	2020-12-15 18:17:10 +03:00
Sait Talha Nisanci	f3d55448b3	Choose distributed table if it has a unique index in filter When doing local-distributed table joins we convert one of them to a subquery. The current policy is that we convert the distributed table to a subquery if it has a filter on a column that has a unique index (a primary key also has a unique index).	2020-12-15 18:17:10 +03:00
Onder Kalaci	f0aef67ed2	Update existing regression tests	2020-12-15 18:17:10 +03:00
Onder Kalaci	3f4952cc2b	Pushdown projections when relations are recursively planned This is important to limit the data transfer size.	2020-12-15 18:17:10 +03:00
Onder Kalaci	945193555b	add basic regression tests	2020-12-15 18:17:10 +03:00
Onder Kalaci	594e001f3b	Add filter pushdown regression tests Also handle WHERE false	2020-12-15 18:17:10 +03:00
Onder Kalaci	82a4830c7d	Adjust the existing regression tests	2020-12-15 18:17:10 +03:00
Onder Kalaci	7a4d6b2984	Handle modifications as well	2020-12-15 18:17:10 +03:00
Onder Kalaci	8f8390ed6e	Recursively plan local table joins The logical planner cannot handle joins between local and distributed table. Instead, we can recursively plan one side of the join and let the logical planner handle the rest. Our algorithm is a little smart, trying not to recursively plan distributed tables, but favors local tables.	2020-12-15 18:17:10 +03:00
Onder Kalaci	7cc25c9125	Add ability to fetch the restrictions per relation With this commit, we add the ability to add restrictions per relation. We simply rely on the restrictions that Postgres keeps per relation.	2020-12-15 18:17:10 +03:00
Marco Slot	100e5d3196	Address review feedback	2020-12-15 15:23:38 +01:00
Marco Slot	23dccd8941	Add some new tests for complex correlated subqueries in WHERE	2020-12-15 14:17:16 +01:00
Marco Slot	707a6554b1	Support co-located/recurring correlated subqueries	2020-12-15 14:17:16 +01:00
Onur Tirtir	0eb5701658	Not consider single shard hash dist. tables as replicated (#4413 )	2020-12-15 14:33:01 +03:00
Marco Slot	cc04fce10f	Merge pull request #4360 from citusdata/marcocitus/sublinks	2020-12-13 23:46:40 +01:00
Marco Slot	4985bda3bd	Merge pull request #4384 from citusdata/marcocitus/fix/citus-tables-failure Harden citus_tables against node failure	2020-12-13 23:17:15 +01:00
Marco Slot	f2538a456f	Support co-located/recurring sublinks in the target list	2020-12-13 15:45:24 +01:00
Marco Slot	8e8adcd92a	Harden citus_tables against node failure	2020-12-13 15:10:40 +01:00
Hadi Moshayedi	e24e7985f7	Merge pull request #4407 from citusdata/cstore_analyze Columnar: Fix ANALYZE for large number of rows.	2020-12-10 17:32:42 -08:00
Hadi Moshayedi	4dd22cc4e4	Columnar: Fix ANALYZE for large number of rows.	2020-12-10 09:52:33 -08:00
Hadi Moshayedi	df1ff60754	Merge pull request #4403 from citusdata/cstore_default_compression Columnar: set default compression as zstd if available	2020-12-09 23:36:08 -08:00
Hadi Moshayedi	b3dac5e9d1	Columnar: set default compression as zstd if available	2020-12-09 14:32:08 -08:00
Hadi Moshayedi	4985dcbafe	Merge pull request #4399 from citusdata/cstore_compression_level Columnar: Make compression level configurable	2020-12-09 08:57:50 -08:00
Hadi Moshayedi	4668fe51a6	Columnar: Make compression level configurable	2020-12-09 08:48:50 -08:00
Hadi Moshayedi	4501310b4c	Merge pull request #4388 from citusdata/cstore_zstd Add zstd compression to columnar	2020-12-09 08:37:30 -08:00
Hadi Moshayedi	9f559b37d0	Merge pull request #4337 from citusdata/cstore_lz4 Add LZ4 compression to columnar	2020-12-09 08:37:10 -08:00
Hadi Moshayedi	f5a4a4bc74	Columnar: Support zstd compression	2020-12-09 08:30:55 -08:00
Hadi Moshayedi	3f81ee26fd	Columnar: Support LZ4 compression	2020-12-09 08:29:07 -08:00
jeff-davis	260a02180b	Add tests for unsupported columnar storage features (#4397 ) Add negative tests: * Deletes * Sample scan * Special columns * Tuple locks * Indexes	2020-12-09 00:08:45 -08:00
jeff-davis	776c165843	Merge pull request #4396 from citusdata/rename2 Rename cstore->columnar in SQL objects and errors.	2020-12-07 15:43:13 -08:00
Jeff Davis	c91e5b052b	more test fixups	2020-12-07 13:43:27 -08:00
Jeff Davis	7169ba21c4	more test fixes	2020-12-07 13:36:46 -08:00
Jeff Davis	e26fdeb706	fixup tests some more	2020-12-07 13:22:16 -08:00
Jeff Davis	5b3c32eb38	fixup tests	2020-12-07 13:18:22 -08:00
Jeff Davis	068af7f38e	fixup upgrade tests	2020-12-07 13:11:51 -08:00
Jeff Davis	3758e83850	Rename cstore->columnar in SQL objects and errors.	2020-12-07 13:01:53 -08:00
jeff-davis	dee753ef05	Merge pull request #4394 from citusdata/test-update Tests for UPDATE and error message improvement.	2020-12-07 12:10:13 -08:00
Jeff Davis	ad919ff220	Tests for UPDATE and error message improvement. UPDATEs on partitioned tables that affect only row partitions should succeed, the rest should fail. Also rename CStoreScan to ColumnarScan to make the error message more relevant.	2020-12-07 11:25:30 -08:00
Ahmet Gedemenli	45ac491885	Merge pull request #4390 from citusdata/fix-transaction-name-length-calculation Fix transaction name length calculation	2020-12-07 12:56:13 +03:00
Ahmet Gedemenli	7577821920	Fix transaction name length calculation	2020-12-07 12:34:15 +03:00
Ahmet Gedemenli	3d8a7c1741	Merge pull request #4381 from citusdata/recover-transactions-when-removing-node Delete transactions when removing node	2020-12-07 11:53:26 +03:00
Ahmet Gedemenli	936775e8e3	Delete transactions when removing node With this commit, we delete entries in pg_dist_transaction for the primary nodes that are removed by `master_remove_node`.	2020-12-07 11:35:20 +03:00
Hadi Moshayedi	164d73ad8c	Merge pull request #4386 from citusdata/cstore_uncompressed_size Columnar: track decompressed length in metadata	2020-12-04 11:16:50 -08:00
Hadi Moshayedi	01da2a1c73	Columnar: track decompressed length in metadata	2020-12-04 09:09:39 -08:00
Önder Kalacı	d4f5d4a27b	Merge pull request #4364 from citusdata/add_some_data_type_tests Add regression tests with different data types	2020-12-04 10:40:34 +03:00
Onder Kalaci	bd9827aed9	Add regression tests with different data types We typically do not test Citus with these uncommon data types. Now, we already have the tests for ADF integration, add it to regression tests as well.	2020-12-04 10:25:00 +03:00
Hadi Moshayedi	c23bdb129d	Merge pull request #4379 from citusdata/cstore_chunk Columnar: rename block to chunk	2020-12-03 09:01:10 -08:00
Hadi Moshayedi	317cf44a56	Merge pull request #4375 from citusdata/cstore_empty Columnar: Fix VACUUM for empty tables	2020-12-03 09:00:47 -08:00
Hadi Moshayedi	4a9aebaa7b	Columnar: rename block to chunk	2020-12-03 08:50:19 -08:00
Hadi Moshayedi	24bfd368a9	Columnar: Fix VACUUM for empty tables	2020-12-03 08:46:09 -08:00
Marco Slot	c4f36c195f	Merge pull request #4309 from citusdata/features/add-citus-tables-view	2020-12-03 17:44:14 +01:00
Marco Slot	c9b658daea	Add a public.citus_tables view	2020-12-03 17:31:40 +01:00
Marco Slot	4098d33acb	Allow citus size functions on replicated tables	2020-12-03 16:33:24 +01:00
SaitTalhaNisanci	f164575524	Add a utility to process each table index (#4382 ) A utility function is added so that each caller can implement a handler for each index on a given table. This means that the caller doesn't need to worry about how to access each index; the only thing that it needs to do is to implement a function to which each index on the table is passed iteratively.	2020-12-03 16:33:13 +03:00
Marco Slot	746b36103e	Merge pull request #4380 from citusdata/marcocitus/fix-flappy Fix flappy failure test	2020-12-03 14:17:14 +01:00
Marco Slot	c69ea2512a	Fix flappy failure test	2020-12-03 13:54:02 +01:00
Önder Kalacı	9789a7005f	Merge pull request #4338 from citusdata/single_node_conn_mngmt_main Local node connection management	2020-12-03 14:35:35 +03:00
Onder Kalaci	c546ec5e78	Local node connection management When Citus needs to parallelize queries on the local node (e.g., the node executing the distributed query and the shards are the same), we need to be mindful about the connection management. The reason is that the client backends that are running distributed queries are competing with the client backends that Citus initiates to parallelize the queries in order to get a slot on the max_connections. In that regard, we implemented a "failover" mechanism where if the distributed queries cannot get a connection, the execution fails over the tasks to the local execution. The failover logic is as follows: - Ask the connection manager if it is OK to get a connection - If yes, we are good. - If no, we fail the workerPool and the failure triggers the failover of the tasks to local execution queue The decision of getting a connection is as follows: /* * For local nodes, solely relying on citus.max_shared_pool_size or * max_connections might not be sufficient. The former gives us * a preview of the future (e.g., we let the new connections to establish, * but they are not established yet). The latter gives us the close to * precise view of the past (e.g., the active number of client backends). * * Overall, we want to limit both of the metrics. The former limit typically * kicks in under regular loads, where the load of the database increases in * a reasonable pace. The latter limit typically kicks in when the database * is issued lots of concurrent sessions at the same time, such as benchmarks. */	2020-12-03 14:16:13 +03:00
Hadi Moshayedi	c2f60b6422	Columnar: pg_upgrade support (#4354 )	2020-12-02 08:46:59 -08:00
Nils Dijk	27113255e5	add codecov upload task for pg_upgrade tests (#4377 ) Small change that will upload lines hit during upgrades to codecov. It got to our attention we are not capturing the codecov during upgrades in #4354 .	2020-12-02 15:21:59 +01:00
Ahmet Gedemenli	5b0e60884e	Merge pull request #4373 from citusdata/propagate-alter-schema-rename Propagate alter schema rename	2020-12-02 15:29:26 +03:00
Ahmet Gedemenli	5242dcfe99	Add tests for propagating alter schema rename	2020-12-02 15:18:26 +03:00
Ahmet Gedemenli	514c6a76ac	Propagate alter schema rename	2020-12-02 15:18:26 +03:00
Nils Dijk	fde93072dd	Merge pull request #4335 from citusdata/fix/cstore-options-dist-tables columnar table options for distributed tables	2020-12-02 13:12:57 +01:00
Nils Dijk	6f9c040f76	DESCRIPTION: Propagate columnar table settings for distributed tables When distributing a columnar table, as well as changing options on a distributed columnar table, this patch will forward the settings from the coordinator to the workers. For propagating options changes on an already distributed table this change is pretty straightforward. Before applying the change in options locally we will create a `DDLJob` that contains a call to `alter_columnar_table_set(...)` for every shard placement with all settings of the current table. This goes both for setting an option as well as resetting. This will reset the values to the defaults configured on the coordinator. Having the effect that the coordinator is authoritative on the settings and makes sure the shards have the same settings set as the table on the coordinator. When a columnar table is distributed it is using the `TableDDLCommand` infrastructure to create a new kind of `TableDDLCommand`. This new type, called a `TableDDLCommandFunction` contains a context and 2 function pointers to execute. One function returns the command as applied on the table, the second function will return the sql command to apply to a shard with a given shard id. The schema name is ignored as it will use the fully qualified name of the shard in the same schema as the base table.	2020-12-02 13:02:42 +01:00
Halil Ozan Akgül	ef0914a7f8	Adds ORDER BY to flaky test (#4305 ) Co-authored-by: Önder Kalacı <onder@citusdata.com>	2020-12-02 14:24:05 +03:00
Önder Kalacı	48d6266fd4	Merge pull request #4374 from citusdata/sequential_execution_use_lcao Multi-row INSERTs use local execution when placements are local	2020-12-01 22:45:35 +03:00
Onder Kalaci	f7e1aa3f22	Multi-row INSERTs use local execution when placements are local Multi-row execution already uses sequential execution. When shards are local, using local execution is profitable as it avoids an extra connection establishment to the local node.	2020-12-01 21:37:59 +03:00
Onur Tirtir	ea79ca0e5e	Merge pull request #4372 from citusdata/update-cl-951 Update CHANGELOG for 9.5.1	2020-12-01 16:51:36 +03:00
Onur Tirtir	dd3453ced5	Update CHANGELOG for 9.5.1	2020-12-01 14:02:36 +03:00
Marco Slot	df3539710a	Merge pull request #4370 from citusdata/marcocitus/fix-flappy	2020-12-01 11:30:53 +01:00
Ahmet Gedemenli	cc9ea31c60	Merge pull request #4356 from citusdata/add-test-for-citus-size-func Add test for citus table size func in transaction with modification	2020-12-01 11:08:20 +03:00
Ahmet Gedemenli	8e5f0487eb	Add order by for flaky test	2020-12-01 10:54:52 +03:00
Ahmet Gedemenli	67761897ab	Add test for citus table size func in transaction with modification Add test for citus_relation_size	2020-12-01 10:38:15 +03:00
Hadi Moshayedi	feecb7b423	Columnar: few fixes (#4371 ) * Columnar: fix a memory issue * Columnar: no need for deferred triggers * Columnar: relax memory growth constraints	2020-11-30 18:09:43 -08:00
Hadi Moshayedi	a94e8c9cda	Associate column store metadata with storage id (#4347 )	2020-11-30 18:01:43 -08:00
Marco Slot	de22b633cb	Merge pull request #4365 from citusdata/marcocitus/fix-flappy Fix flappy test: Run subquery_prepared_statements by itself	2020-11-30 22:35:51 +01:00
Marco Slot	4a05b2ad77	Merge pull request #4367 from citusdata/isolate_join_test Isolate join test	2020-11-30 22:09:21 +01:00
Sait Talha Nisanci	8b0aed521f	Isolate join test Join test gets too many clients error too frequently hence we should not run anything concurrently with that. Hopefully this will fix the flakiness of test.	2020-12-01 00:00:17 +03:00
Marco Slot	04cffdd925	Run master_copy_shard_placement separately	2020-11-30 20:34:03 +01:00
Marco Slot	48caca4084	Improve regression test settings	2020-11-30 20:34:03 +01:00
SaitTalhaNisanci	c31a8df380	Call 6 times not 7 in subquery_prepared_statements (#4357 )	2020-11-30 21:20:51 +03:00
Onur Tirtir	03bcccdee0	Fix hostname length check in StartNodeUserDatabaseConnection (#4363 ) Copying string before hostname length check makes the check useless	2020-11-30 20:00:35 +03:00
Onur Tirtir	7f3d1182ed	Handle invalid connection hash entries (#4362 ) If MemoryContextAlloc errors out -e.g. during an OOM-, ConnectionHashEntry->connections stays as NULL. With this commit, we add isValid flag to ConnectionHashEntry that should be set to true right after we allocate & initialize ConnectionHashEntry->connections list properly, and we check it before accessing ConnectionHashEntry->connections.	2020-11-30 19:44:03 +03:00
SaitTalhaNisanci	8c3dd6338e	Run pg12 and pg13 separately (#4352 ) It seems that sometimes we get `too many clients errors` with this set of parallel tests, hence two of them are separated.	2020-11-30 19:32:49 +03:00
Marco Slot	ecbc1ab008	Run subquery_prepared_statements by itself	2020-11-30 08:53:06 +01:00
Hadi Moshayedi	7f43804dae	Normalize VACUUM VERBOSE output (#4353 ) This is to avoid flaky changes like the following in test outputs: -CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s. +CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.02 s.	2020-11-27 12:07:25 -08:00
Nils Dijk	383e334023	refactor options to their own table linked to the regclass (#4346 ) Columnar options were by accident linked to the relfilenode instead of the regclass/relation oid. This PR moves everything related to columnar options to their own catalog table.	2020-11-27 11:22:08 -08:00
SaitTalhaNisanci	af02ac6cf5	Refactor MultiRouterPlannableQuery (#4350 ) The name of the function does not match the implementation, because the function is designed to only consider SELECT queries. Also this replaces the assert with an error.	2020-11-27 18:44:38 +03:00
Nils Dijk	326e6afa53	refactor table ddl events scoped for shards (#4342 ) Refactor internals on how Citus creates the SQL commands it sends to recreate shards. Before Citus collected solely ddl commands as `char *`'s to recreate a table. If they were used to create a shard they were wrapped with `worker_apply_shard_ddl_command` and sent to the workers. On the workers the UDF wrapping the ddl command would rewrite the parsetree to replace table names with their shard name equivalent. This worked well, but poses an issue when adding columnar. Due to limitations in Postgres on creating custom options on table access methods we need to fall back on a UDF to set columnar specific options. Now, to recreate the table, we can no longer rely on having solely DDL statements to recreate a table. A prototype was made to run this UDF wrapped in `worker_apply_shard_ddl_command`. This became pretty messy, hard to understand and subsequently hard to maintain. This PR proposes a refactor of the internal representation of table ddl commands into a `TableDDLCommand` structure. The current implementation only supports a `char *` as its contents. Based on the use of the DDL statement (eg. creating the table -mx- or creating a shard) one of two different functions can be called to get the statement to send to the worker: - `GetTableDDLCommand(TableDDLCommand command)`: This function returns that ddl command to create the table. In this implementation it will just return the `char *`. This has the same functionality as getting the old list and not wrapping it. - `GetShardedTableDDLCommand(TableDDLCommand command, uint64 shardId, char schemaName)`: This function returns the ddl command wrapped in `worker_apply_shard_ddl_command` with the `shardId` as an argument. Due to backwards compatibility it also accepts a `schemaName`. The exact purpose is not directly clear. Ideally new implementations would work with fully qualified statements and ignore the `schemaName`. 
A future implementation could accept 2 function pointers and a `void *` for context to let the two pointers work on. This gives greater flexibility in controlling what commands get sent in which situations. Also, in the future, we could implement the intermediate step of creating the `parsetree` datastructure of statements based on the contents in the catalog with a corresponding deparser. For sharded queries a mutator could be run over the parsetree to rewrite the tablenames to the names with the shard identifier. This will completely omit the requirement for `worker_apply_shard_ddl_command`.	2020-11-26 13:31:59 +01:00
SaitTalhaNisanci	83020f444e	Initialize fast planner restriction context (#4349 ) We initialize fast planner restriction context so that code paths that rely on this being not NULL will operate without a problem.	2020-11-26 13:45:27 +03:00
Önder Kalacı	7539454ccb	Merge pull request #4312 from citusdata/single_node_conn_mngmt_backend_counter Add the infrastructure to count the number of client backends	2020-11-25 19:49:57 +01:00
Onder Kalaci	629ecc3dee	Add the infrastructure to count the number of client backends Considering the adaptive connection management improvements that we plan to roll soon, it makes it very helpful to know the number of active client backends. We are doing this addition to simplify the adaptive connection management for single node Citus. In single node Citus, both the client backends and Citus parallel queries would compete to get slots on Postgres' `max_connections` on the same Citus database. With adaptive connection management, we have the counters for Citus parallel queries. That helps us to adaptively decide on the remote executions pool size (e.g., throttle connections if necessary). However, we do not have any counters for the total number of client backends on the database. For single node Citus, we should consider all the client backends, not only the remote connections that Citus does. Of course Postgres internally knows how many client backends are active. However, to get that number Postgres iterates over all the backends. For example, see [pg_stat_get_db_numbackends](`8e90ec5580/src/backend/utils/adt/pgstatfuncs.c (L1240)`) where Postgres iterates over all the backends. For our purposes, we need this information on every connection establishment. That's why we cannot afford to do this kind of iteration.	2020-11-25 19:19:24 +01:00
SaitTalhaNisanci	180195b445	Remove unused parameter from VarConstOpExprClause (#4348 )	2020-11-25 21:00:22 +03:00
Ahmet Gedemenli	850b292886	Merge pull request #4326 from citusdata/constraint-key-name-fail Fix constraint name for local execution	2020-11-25 15:20:03 +03:00
Ahmet Gedemenli	a64dc8a72b	Fixes a bug preventing INSERT SELECT .. ON CONFLICT with a constraint name on local shards Separate search relation shard function Add tests	2020-11-25 15:10:46 +03:00
Onur Tirtir	46be63d76b	Refactor PreprocessIndexStmt (#4272 )	2020-11-25 12:19:37 +03:00
Önder Kalacı	ba300dcad8	Merge pull request #4344 from citusdata/improveCitusTableTypeIdList Do not cache all the distributed table metadata during CitusTableTypeIdList()	2020-11-24 17:51:53 +01:00
Onder Kalaci	7accbff3f6	Do not cache all the distributed table metadata during CitusTableTypeIdList() CitusTableTypeIdList() function iterates on all the entries of pg_dist_partition and loads all the metadata into the cache. This can be quite memory intensive, especially when there are lots of distributed tables. When partitioned tables are used, it is common to have many distributed tables given that each partition also becomes a distributed table. CitusTableTypeIdList() is used on every CREATE TABLE .. PARTITION OF.. command as well. It means that, anytime a partition is created, Citus loads all the metadata to the cache. Note that Citus typically only loads the accessed table's metadata to the cache.	2020-11-24 17:44:06 +01:00
Önder Kalacı	c760cd3470	Move local execution after remote execution (#4301 ) * Move local execution after the remote execution Before this commit, when both local and remote tasks exist, the executor was starting the execution with local execution. There are no strict requirements on this. Especially considering the adaptive connection management improvements that we plan to roll out soon, moving the local execution after the remote execution makes more sense. The adaptive connection management for single node Citus would look roughly as follows: - Try to connect back to the coordinator for running parallel queries. - If it succeeds, go on and execute tasks in parallel - If it fails, fall back to the local execution So, we'll use local execution as a fallback mechanism. And, moving it after the remote execution allows us to implement such further scenarios.	2020-11-24 13:43:38 +01:00
Onur Tirtir	d15a4c15cf	Merge pull request #4341 from citusdata/update-cl-943 Update CHANGELOG for 9.4.3	2020-11-24 13:29:59 +03:00
Onur Tirtir	76a429f19b	Update CHANGELOG for 9.4.3	2020-11-24 12:52:16 +03:00
Hadi Moshayedi	fc0ef8abba	Merge pull request #4336 from citusdata/cstore_memory_leaks Fix memory leaks in column store	2020-11-23 11:40:39 -08:00
Hadi Moshayedi	40b52ab757	Fix memory leaks in column store	2020-11-23 11:26:12 -08:00
Önder Kalacı	532b457554	Solidify the slow-start algorithm (#4318 ) The adaptive executor emulates TCP's slow start algorithm. Whenever the executor needs new connections, it doubles the number of connections established in the previous iteration. This approach is powerful. When the remote queries are very short (like index lookup with < 1ms), even a single connection is sufficient most of the time. When the remote queries are long, the executor can quickly establish the necessary number of connections. One missing piece in our implementation seems to be that the executor keeps doubling the number of connections even if the previous connection attempts have not been finalized. Instead, we should wait until all the attempts are finalized. This is how TCP's slow-start works. Plus, it decreases the unnecessary pressure on the remote nodes.	2020-11-23 19:20:13 +01:00
jeff-davis	2e70dbe40a	Merge pull request #4330 from citusdata/remove-fdw remove columnar FDW code	2020-11-20 10:19:20 -08:00
Jeff Davis	ba6ec610e2	address review comment	2020-11-20 10:03:12 -08:00
Jeff Davis	8cee2b092b	remove columnar FDW code	2020-11-20 10:03:12 -08:00
Jelte Fennema	b2def22ab1	Fix possible uninitialized variable warning (#4334 ) I got this warning when compiling citus: ``` ../columnar/write_state_management.c: In function ‘PendingWritesInUpperTransactions’: ../columnar/write_state_management.c:364:20: warning: ‘entry’ may be used uninitialized in this function [-Wmaybe-uninitialized] if (found && entry->writeStateStack != NULL) ~~~~~^~~~~~~~~~~~~~~~ ``` I fixed this by always initializing entry, using an early return if `WriteStateMap` didn't exist. Instead of using the `found` variable to check for existence of the key, I now simply check the `entry` variable itself. To quote the postgres comment on the hash_enter function: > If foundPtr isn't NULL, then *foundPtr is set true if we found an > existing entry in the table, false otherwise. This is needed in the > HASH_ENTER case, but is redundant with the return value otherwise.	2020-11-20 16:02:03 +01:00
Önder Kalacı	856e5c85cf	Merge pull request #4331 from citusdata/pre_executor_run Do not execute subplans multiple times with cursors	2020-11-20 13:36:07 +01:00
Onder Kalaci	c433c66f2b	Do not execute subplans multiple times with cursors Before this commit, we let AdaptiveExecutorPreExecutorRun() take effect on every FETCH on cursors, that is, multiple times. That does not affect the correctness of the query results, but adds significant overhead.	2020-11-20 10:43:56 +01:00
Önder Kalacı	b0ddbbd33a	Enable parallel query on EXPLAIN ANALYZE (#4325 ) It seems that we forgot to pass the relevant flag to enable Postgres' parallel query capabilities on the shards when the user does EXPLAIN ANALYZE on a distributed table.	2020-11-20 09:54:04 +01:00
Hadi Moshayedi	c35f38459b	Merge pull request #4320 from citusdata/cstore_alter_table Fix ALTER COLUMN ... TYPE for columnar	2020-11-19 15:58:15 -08:00
Hadi Moshayedi	b182a95389	Fix ALTER COLUMN ... SET TYPE for columnar	2020-11-19 15:36:45 -08:00
jeff-davis	4e035b6044	Merge pull request #4328 from citusdata/rename rename cstore_tableam -> columnar	2020-11-19 13:37:31 -08:00
Jeff Davis	cef1d0e915	fixup test output	2020-11-19 12:45:52 -08:00
Jeff Davis	91015deb9d	rename UDFs also	2020-11-19 12:27:40 -08:00
Jeff Davis	a2b698a766	rename cstore_tableam -> columnar	2020-11-19 12:15:51 -08:00
SaitTalhaNisanci	05390729f9	Merge pull request #4327 from citusdata/initializeVariable Initialize entry variable as NULL	2020-11-19 16:37:24 +03:00
Sait Talha Nisanci	ddc8e6c702	Initialize entry variable as NULL	2020-11-19 15:23:39 +03:00
SaitTalhaNisanci	09f737d942	Merge pull request #4283 from citusdata/component_governance_config Add component governance config	2020-11-19 13:27:46 +03:00
SaitTalhaNisanci	3dca29a4c3	Merge branch 'master' into component_governance_config	2020-11-19 13:16:01 +03:00
Sait Talha Nisanci	5f436e10d0	Add the NOTICE file	2020-11-18 17:49:01 +03:00
SaitTalhaNisanci	9c44911226	Improve error messages in shard pruning (#4324 )	2020-11-18 17:16:06 +03:00
Hadi Moshayedi	021ed07f12	Merge pull request #4322 from citusdata/cstore_tests Test more of SQL features with column store	2020-11-17 20:28:03 -08:00
Hadi Moshayedi	2747fd80ff	Add prepared materialized view tests for columnar	2020-11-17 20:13:20 -08:00
Hadi Moshayedi	6711340ea6	Add prepared xact & stmt tests for columnar	2020-11-17 20:00:57 -08:00
Hadi Moshayedi	3088ccd62a	Merge pull request #4319 from citusdata/cstore_write_state_management Implements write state management for tuple inserts.	2020-11-17 12:17:51 -08:00
Hadi Moshayedi	97cba2d5b6	Implements write state management for tuple inserts. TableAM API doesn't allow us to pass around a state variable along all of the tuple inserts belonging to the same command. We require this in columnar store, since we batch them, and when we have enough rows we flush them as stripes. To do that, we keep a (relfilenode) -> stack of (subxact id, TableWriteState) global mapping. Inserts: Whenever we want to insert a tuple, we look up the relation's relfilenode in this mapping. If the top of the stack matches the current subtransaction, we use the existing TableWriteState. Otherwise, we allocate a new TableWriteState and push it on top of the stack. (Sub)Transaction Commit/Aborts: When the subtransaction or transaction is committed, we flush and pop all entries matching the current SubTransactionId. When the subtransaction or transaction is aborted, we pop all entries matching the current SubTransactionId and discard them without flushing. Reads: Since we might have unwritten rows which need to be read by a table scan, we flush write states on SELECTs. Since flushing the write state of upper transactions in a subtransaction will cause metadata to be written in the wrong subtransaction, we ERROR out if any of the upper subtransactions have unflushed rows. Table Drops: We record in which subtransaction the table was dropped. When committing a subtransaction in which the table was dropped, we propagate the drop to the upper transaction. When aborting a subtransaction in which the table was dropped, we mark the table as not deleted.	2020-11-17 12:07:16 -08:00
Nils Dijk	2e09116b30	Merge pull request #4311 from citusdata/merge-cstore Merge cstore into the citus repo	2020-11-17 19:10:32 +01:00
Nils Dijk	725f4a37d0	change configure to not have options	2020-11-17 19:01:54 +01:00
Nils Dijk	22df8027b0	add extra output for multi_extension targeting pg11	2020-11-17 19:01:54 +01:00
Nils Dijk	7c891a01a9	create missing objects during upgrade path	2020-11-17 19:01:51 +01:00
Nils Dijk	2987535172	add pg upgrade tests verifying table am is created	2020-11-17 18:55:36 +01:00
Hadi Moshayedi	691fdb2c64	Don't grab additional locks in cstore code when truncating	2020-11-17 18:55:36 +01:00
Nils Dijk	d065bb495d	Prepare downgrade script and bump development version to 10.0-1	2020-11-17 18:55:35 +01:00
Nils Dijk	3e5df81e89	remove use of banned api	2020-11-17 18:55:35 +01:00
Nils Dijk	b6d4a1bbe2	fix style	2020-11-17 18:55:35 +01:00
Nils Dijk	3bb6554976	make tests run	2020-11-17 18:55:35 +01:00
Nils Dijk	213eb93e6d	make columnar compile and functionally working	2020-11-17 18:55:34 +01:00
Nils Dijk	f89bd3eeb5	move columnar test files	2020-11-17 18:55:34 +01:00
Nils Dijk	30fbd877e7	remove readme that has outdated info	2020-11-17 18:55:34 +01:00
Nils Dijk	527d3ce0bb	move headers to include directory	2020-11-17 18:55:34 +01:00
Nils Dijk	5fe4c12d49	Add 'src/backend/columnar/' from commit '4339e911933ca2109db46014befdaccf77c5c13f' git-subtree-dir: src/backend/columnar git-subtree-mainline: `34de1f645c` git-subtree-split: `4339e91193`	2020-11-17 18:55:06 +01:00
SaitTalhaNisanci	34de1f645c	Update failure test dependencies (#4284 ) * Update failure test dependencies There was a security alert for cryptography. The vulnerability was fixed in 3.2.0. The vulnerability: "RSA decryption was vulnerable to Bleichenbacher timing vulnerabilities, which would impact people using RSA decryption in online scenarios." The fix: `58494b41d6` It wasn't enough to only update cryptography because mitm was incompatible with the new version, so mitm is also upgraded. The steps to do in local: python -m pip install -U cryptography python -m pip install -U mitmproxy	2020-11-17 19:16:08 +03:00
Önder Kalacı	0c0fc69f2a	Remove unused field (#4275 )	2020-11-17 11:41:57 +01:00
Nils Dijk	d0c6950d43	Merge pull request #4310 from citusdata/fix/enterprise-modules add placeholder for enterprise modules	2020-11-11 16:22:23 +01:00
Nils Dijk	7d14800071	add placeholder for enterprise modules	2020-11-11 15:43:04 +01:00
Onur Tirtir	601e6baa96	Merge pull request #4307 from citusdata/citus-9.5.0-changelog-1604924343 Update CHANGELOG for 9.5.0	2020-11-11 16:42:10 +03:00
Onur Tirtir	52a5ab0751	Update CHANGELOG for 9.5.0	2020-11-11 16:01:52 +03:00
Onur Tirtir	4bf754b245	Fix location of citus--10.0-1--9.5-1.sql downgrade script (#4306 )	2020-11-09 16:43:56 +03:00
Onur Tirtir	65c7827cab	Merge pull request #4304 from citusdata/master-update-version-1604913966 Bump Citus to 10.0devel	2020-11-09 14:47:23 +03:00
Onur Tirtir	5e3dc9d707	Bump citus version to 10.0devel	2020-11-09 13:16:54 +03:00
Hanefi Onaldi	d3019f1b6d	Introduce foreach_ptr_modify macro (#4303 ) If one wishes to iterate through a List and insert list elements in PG13, it is not safe to use for_each_ptr, as the List representation in PostgreSQL is no longer a linked list but an array, and it is possible that the whole array is repalloc'ed if there is not sufficient space available. See postgres commit 1cff1b95ab6ddae32faa3efe0d95a820dbfdc164 for more information	2020-11-09 12:03:59 +03:00
Onur Tirtir	5d5966f700	Fix a flaky test in mixed_relkind_tests (#4300 )	2020-11-06 14:53:30 +03:00
Önder Kalacı	1f723cabd2	Merge pull request #4292 from citusdata/fix_local_join Do not rely on set_rel_pathlist_hook for finding local relations	2020-11-06 11:26:03 +01:00
Onder Kalaci	e0d2ac7620	Do not rely on set_rel_pathlist_hook for finding local relations When a relation is used on an OUTER JOIN with FALSE filters, set_rel_pathlist_hook may not be called for the table. There might be other cases as well, so do not rely on the hook for classification of the tables.	2020-11-06 11:14:30 +01:00
Onur Tirtir	0556952607	Normalize partitioned table aliases in explain output (#4295 ) Aliases that postgres chooses for partitioned tables in explain output might change in different pg versions, so normalize them and remove the alternative test output	2020-11-06 10:44:01 +03:00
Onur Tirtir	d912d4bc38	Print full file path in valgrind testing (#4299 )	2020-11-06 10:26:53 +03:00
Onur Tirtir	cc8be422ce	Fix relkind checks in planner for relkinds other than RELKIND_RELATION (#4294 ) We were qualifying relations with relkind != RELKIND_RELATION as non-relations due to the strict checks around RangeTblEntry->relkind in planner.	2020-11-05 14:21:02 +03:00
SaitTalhaNisanci	25de5b1290	Fix uninitialized variable (#4293 ) Valgrind found that we were doing an if check on an uninitialized variable, and it seems that this is on context.appendparents. `ac22929a26/src/backend/utils/adt/ruleutils.c (L1054)`	2020-11-04 12:08:15 +03:00
Hanefi Onaldi	96913f6530	Merge pull request #4286 from citusdata/prevent-undistribute-partitions	2020-11-04 10:35:08 +03:00
jeff-davis	4339e91193	Merge pull request #31 from citusdata/upgrade Handle case of partially-present metadata.	2020-11-03 12:15:51 -08:00
Jeff Davis	630e579912	Handle case of partially-present metadata.	2020-11-03 10:39:39 -08:00
Hanefi Önaldı	d6f19e2298	Honor error message conventions	2020-11-03 18:11:18 +03:00
Hanefi Önaldı	85a4b61a0e	Prevent undistribute_table calls for partitions	2020-11-03 18:10:20 +03:00
Hanefi Onaldi	feca381500	Merge pull request #4279 from citusdata/prevent-undistribute-foreign-tables Prevent undistribute_table calls for foreign tables	2020-11-03 18:08:05 +03:00
Hanefi Önaldı	5db380f33a	Prevent undistribute_table calls for foreign tables	2020-11-03 17:33:29 +03:00
Nils Dijk	d03e9ca861	Feature: cstore table options (#25 ) DESCRIPTION: Add UDFs to maintain cstore table options This PR adds two UDFs and a view to interact with and maintain the cstore table options. - ``alter_cstore_table_set(relid REGCLASS, [ options ... ])`` - ``alter_cstore_table_reset(relid REGCLASS, [ options ... ])`` - ``cstore.cstore_options`` The `set` function takes options and their specific types. When specified it will change the option associated with the table to the provided value. When omitted no action is taken. The `reset` function takes options as booleans. When set to `true` the value of the option associated with the table will be reset to the current default as specified by the associated GUCs. The options view contains a record for every cstore table with its associated settings as columns.	2020-11-03 13:39:46 +01:00
jeff-davis	8909769975	Merge pull request #29 from citusdata/w-error Use -Werror	2020-11-02 08:00:07 -08:00
Jeff Davis	653dbc615a	Use -Werror	2020-11-02 07:55:19 -08:00
jeff-davis	d455ef6785	Merge pull request #30 from citusdata/v13 Support for v13	2020-11-02 07:52:32 -08:00
Nils Dijk	288025d9ea	add pg13 on CI	2020-11-02 13:04:18 +01:00
Sait Talha Nisanci	7c11aa124b	Add component governance config This config is used to generate components on ADO (Azure DevOps). Currently this might not be super useful because we don't really use ADO, but when it is run, we can see the warnings/issues with the components that we use. A component is basically like a dependency.	2020-11-02 11:11:24 +03:00
Jeff Davis	acd49b68aa	Support for v13	2020-11-01 16:59:10 -08:00
Hadi Moshayedi	65cf9f0a6c	Merge pull request #27 from citusdata/fix-clean fix "make clean"	2020-10-30 21:08:26 -07:00
Jeff Davis	a3caa5ff0f	fix "make clean"	2020-10-30 19:27:42 -07:00
Hadi Moshayedi	efb7cf9bda	Merge pull request #22 from citusdata/concurrent_writes Implement concurrent writes	2020-10-30 15:22:59 -07:00
Hadi Moshayedi	c92ea1de96	Implement concurrent writes	2020-10-30 15:21:13 -07:00
Halil Ozan Akgül	5fcddfa2c6	Merge pull request #4254 from citusdata/outer-join-geqo-bug Fixes geqo outer join bug	2020-10-22 14:16:27 +03:00
Halil Ozan Akgul	77b3be8b6d	Replace RelOptInfos with their only used field, relids, to be able to copy them	2020-10-22 13:42:28 +03:00
Onur Tirtir	ef49b75cd6	Fix memory issues around deparsing index commands (#4270 )	2020-10-22 13:17:13 +03:00
Onur Tirtir	f3d3381220	Merge pull request #4267 from citusdata/update-cl-942 Update CHANGELOG for 9.4.2	2020-10-21 16:03:18 +03:00
Onur Tirtir	c7755103f1	Update CHANGELOG for 9.4.2	2020-10-21 15:05:17 +03:00
Önder Kalacı	808f30c1a2	Merge pull request #4264 from citusdata/remove_remove_duplicate Remove RemoveDuplicateJoinRestrictions() function	2020-10-21 11:34:15 +02:00
Onder Kalaci	5c4c9304ba	Remove RemoveDuplicateJoinRestrictions() function RemoveDuplicateJoinRestrictions() function was introduced with the aim of decreasing the overall planning times by eliminating the duplicate JOIN restriction entries (#1989). However, it turns out that the function itself is so CPU intensive, with a very high algorithmic complexity, that it hurts a lot more than it helps. The function is a clear example of premature optimization. The table below shows the difference clearly (columns: query | "distributed query planning time master" | RemoveDuplicateJoinRestrictions() execution time on master | "Remove the function RemoveDuplicateJoinRestrictions() this PR"): 5 table INNER JOIN | 9 msec | 2 msec | 7 msec; 10 table INNER JOIN | 227 msec | 194 msec | 29 msec; 20 table INNER JOIN | 1 sec 235 msec | 1 sec 139 msec | 90 msec; 50 table INNER JOIN | 24 seconds | 21 seconds | 1.5 seconds; 100 table INNER JOIN | 2 minutes 16 seconds | 1 minute 53 seconds | 23 seconds; 250 table INNER JOIN | Bottleneck on JoinClauseList | 18 minutes 52 seconds | Bottleneck on JoinClauseList; 5 table INNER JOIN in subquery | 9 msec | 0 msec | 6 msec; 10 table INNER JOIN subquery | 33 msec | 10 msec | 32 msec; 20 table INNER JOIN subquery | 132 msec | 67 msec | 123 msec; 50 table INNER JOIN subquery | 1.2 seconds | 900 msec | 500 msec; 100 table INNER JOIN subquery | 6 seconds | 5 seconds | 2 seconds; 250 table INNER JOIN subquery | 54 seconds | 37 seconds | 20 seconds; 5 table LEFT JOIN | 5 msec | 0 msec | 5 msec; 10 table LEFT JOIN | 11 msec | 0 msec | 13 msec; 20 table LEFT JOIN | 26 msec | 2 msec | 30 msec; 50 table LEFT JOIN | 150 msec | 15 msec | 193 msec; 100 table LEFT JOIN | 757 msec | 71 msec | 722 msec; 250 table LEFT JOIN | 8 seconds | 600 msec | 8 seconds; 5 JOINs among 2 table JOINs | 37 msec | 11 msec | 25 msec; 10 JOINs among 2 table JOINs | 536 msec | 306 msec | 352 msec; 20 JOINs among 2 table JOINs | 794 msec | 181 msec | 640 msec; 50 JOINs among 2 table JOINs | 25 seconds | 2 seconds | 22 seconds; 100 JOINs among 2 table JOINs | Bottleneck on JoinClauseList | 9 seconds | Bottleneck on JoinClauseList; 150 JOINs among 2 table JOINs | Bottleneck on JoinClauseList | 46 seconds | Bottleneck on JoinClauseList. On top of the performance penalty, the function had a critical bug #4255, and with #4254 we hit one more important bug. It should be fixed by adding the following check to the ContextCoversJoinRestriction(): ``` static bool JoinRelIdsSame(JoinRestriction leftRestriction, JoinRestriction rightRestriction) { Relids leftInnerRelIds = leftRestriction->innerrel->relids; Relids rightInnerRelIds = rightRestriction->innerrel->relids; if (!bms_equal(leftInnerRelIds, rightInnerRelIds)) { return false; } Relids leftOuterRelIds = leftRestriction->outerrel->relids; Relids rightOuterRelIds = rightRestriction->outerrel->relids; if (!bms_equal(leftOuterRelIds, rightOuterRelIds)) { return false; } return true; } ``` However, adding this eliminates all the benefits that RemoveDuplicateJoinRestrictions() brings. I've used the commands here to generate the JOINs mentioned in the PR: https://gist.github.com/onderkalaci/fe8654f9df5916c7af4c7c5eb892561e#file-gistfile1-txt Inner and outer JOINs behave roughly the same, to simplify the table only added INNER joins.	2020-10-21 10:29:39 +02:00
Onur Tirtir	790beea59f	Add intermediate result tests with unsupported outer joins (#4262 )	2020-10-20 12:11:18 +03:00
Hadi Moshayedi	4303758a28	Merge pull request #23 from citusdata/triggers trigger fix and tests	2020-10-19 10:21:14 -07:00
SaitTalhaNisanci	0f209377c4	Fix incorrect join related fields (#4242 ) * Fix incorrect join related fields Ruleutils expect to give the original index of join columns hence we should consider the dropped columns while setting the fields in SetJoinRelatedFieldsCompat. * add some more tests for joins * Move tests to join.sql and create a utility function	2020-10-19 18:28:39 +03:00
Onur Tirtir	c49077d594	Disallow outer joins `ON TRUE` with ref & dist tables when ref table is outer relation (#4255 ) Disallow `ON TRUE` outer joins with reference & distributed tables when reference table is outer relation by fixing the logic bug made when calling `LeftListIsSubset` function. Also, be more defensive when removing duplicate join restrictions when join clause is empty for non-inner joins as they might still contain useful information for non-inner joins.	2020-10-19 16:58:11 +03:00
Onur Tirtir	6e493624af	Merge pull request #4103 from citusdata/remove-unused-functions Remove unused functions that cppcheck found	2020-10-19 14:58:29 +03:00
Onur Tirtir	f80f4839ad	Remove unused functions that cppcheck found	2020-10-19 13:50:52 +03:00
Önder Kalacı	25e43a4aa6	Merge pull request #4253 from citusdata/improve_perf_for_queries Improve the relation restriction counters	2020-10-19 09:22:03 +02:00
Onder Kalaci	bbedfca761	Improve the relation restriction counters It seems like Postgres could call set_rel_pathlist() for the same relation multiple times. This breaks the logic where we assume relationCount equals the number of entries in relationRestrictionList. In summary, relationRestrictionList may contain duplicate entries.	2020-10-19 08:51:16 +02:00
Hadi Moshayedi	4708dc04f1	Merge pull request #4257 from citusdata/tableam_hadi Set explicit transfer_mode in tableam tests	2020-10-16 12:54:52 -07:00
Hadi Moshayedi	663549db33	Set explicit transfer_mode in tableam tests	2020-10-16 12:40:37 -07:00
Hadi Moshayedi	db96b9f861	Merge pull request #4250 from citusdata/tableam_hadi Support "CREATE TABLE ... USING table_access_method" for distributed tables	2020-10-16 12:18:37 -07:00
Nils Dijk	caabbf4b84	Table access method support for distributed tables	2020-10-16 12:02:25 -07:00
Onur Tirtir	7cb07c70fa	Move hasSemiJoin to JoinRestrictionContext (#4256 )	2020-10-16 18:37:39 +03:00
Marco Slot	3261fc7eef	Merge pull request #4251 from citusdata/fix/ref-view-mod Support view in reference table modification	2020-10-16 11:37:41 +02:00
Marco Slot	8976f245ab	Support reference table view in reference table modification	2020-10-16 11:31:24 +02:00
Onur Tirtir	de6f2d3f42	Refactor JoinRestrictionListExistsInContext to improve readability (#4249 )	2020-10-16 12:24:56 +03:00
Önder Kalacı	212adfb26f	Merge pull request #4245 from citusdata/add_single_node_tests_more Add more regression tests for single node Citus	2020-10-15 17:49:03 +02:00
Onder Kalaci	596f7bf4a9	Add more regression tests for single node Citus Tests on commands with SCHEMA.	2020-10-15 17:32:32 +02:00
Önder Kalacı	3e5a92d33b	Merge pull request #4236 from citusdata/fix_intermediate_size Local execution enforces citus.max_intermediate_result_size	2020-10-15 17:26:51 +02:00
Onder Kalaci	fe3caf3bc8	Local execution considers intermediate result size limit With this commit, we make sure that local execution accounts for the intermediate result size the same way distributed execution does. Plus, it enforces the citus.max_intermediate_result_size value.	2020-10-15 17:18:55 +02:00
Marco Slot	ded4561661	Merge pull request #4246 from citusdata/fix/table-exists Check table existence in EnsureRelationKindSupported	2020-10-15 17:18:05 +02:00
Jeff Davis	4355ca4945	trigger fix and tests	2020-10-15 08:05:35 -07:00
Marco Slot	31858c8a29	Check table existence in EnsureRelationKindSupported	2020-10-15 17:05:06 +02:00
SaitTalhaNisanci	b5a3526c07	Merge pull request #4233 from citusdata/introduce_get_local_execution_status Introduce GetCurrentLocalExecutionStatus wrapper	2020-10-15 15:55:46 +03:00
Sait Talha Nisanci	ecde6c6eef	Introduce GetCurrentLocalExecutionStatus wrapper We should not access CurrentLocalExecutionStatus directly because that would mean that we could also set it directly, which we shouldn't because we have checks to see if the new state is possible, otherwise we error.	2020-10-15 15:38:19 +03:00
Marco Slot	619b8b7654	Merge pull request #4247 from citusdata/fix/idempotent-upgrade	2020-10-15 14:08:46 +02:00
Simon Kelly	4f94e544b7	create 9.5-1 udfs and update citus--9.4-1--9.5-1.sql	2020-10-15 13:50:36 +02:00
Simon Kelly	2a6c867cb0	Make citus_prepare_pg_upgrade idempotent https://github.com/citusdata/citus/issues/3527	2020-10-15 13:49:50 +02:00
Önder Kalacı	291154665f	Merge pull request #3597 from citusdata/refactor_outer_join_tests Refactor outer join checks	2020-10-14 15:23:55 +02:00
Onder Kalaci	15e724c073	Add regression tests for outer/cross JOINs	2020-10-14 15:17:30 +02:00
Onder Kalaci	de33079065	Improve outer join checks Before this commit, the logic was: - As long as the outer side of the JOIN is not a JOIN (e.g., relation or subquery etc.), we check for the existence of any recurring tuples. There were two implications of this decision. First, even if a subquery which is on the outer side contains distributed table JOIN reference table, Citus would unnecessarily throw an error. Note that the JOIN inside the subquery would already be tested recursively. But, as long as that check passes, there is no reason for the upper JOIN to fail. An example, which used to fail and now works: SELECT * FROM (SELECT * FROM dist JOIN ref) as foo LEFT JOIN dist; Second, certain JOINs, especially with ON (true) conditions, were not represented in the format that DeferredErrorIfUnsupportedRecurringTuplesJoin() expects.	2020-10-14 15:17:30 +02:00
Onur Tirtir	1a28858c47	Disallow field indirection in INSERT/UPDATE queries (#4241 )	2020-10-14 14:11:59 +03:00
Nils Dijk	5fc7f61936	Projection pushdown (#11 ) DESCRIPTION: add pushdown support for projections and quals in table access method scan This implementation uses custom scans to push projections into the scans on a columnar table. The custom scan replaces all access paths to a table to force the projection of the columns.	2020-10-13 13:36:02 +02:00
Onur Tirtir	8efca3b60a	Fix a crash with inserting domain composite types in coord. evaluation (#4231 ) Use short lived per-tuple context in citus_evaluate_expr like (pg) evaluate_expr does. We should not use planState->ExprContext when evaluating expressions as it might lead to freeing the same executor twice (first one happens in citus_evaluate_expr itself and the other one happens when postgres does clean-up for the top level executor state), which in turn might cause seg.faults. However, now as we don't have the necessary planState info to evaluate prepared statements, we also add planState->es_param_list_info to the per-tuple ExprContext.	2020-10-13 14:19:59 +03:00
Halil Ozan Akgül	df185179c3	Merge pull request #4201 from citusdata/support-with-ties Adds support for WITH TIES option	2020-10-12 19:44:48 +03:00
Halil Ozan Akgul	e2736c25bd	Adds support for WITH TIES option	2020-10-12 19:34:18 +03:00
Hadi Moshayedi	685d5c9d4c	Merge pull request #15 from citusdata/vacuum Initial support for VACUUM (without FULL option)	2020-10-09 21:10:34 -07:00
Hadi Moshayedi	c4eb36dfd2	Merge pull request #13 from citusdata/vacuum_analyze Support VACUUM FULL	2020-10-09 21:10:11 -07:00
Hadi Moshayedi	102b7670d4	Fix tautological compare issue (#19 )	2020-10-09 13:08:03 -07:00
Önder Kalacı	93764a3782	Merge pull request #4230 from citusdata/do_not_copy Do not copy bit map set unnecessarily	2020-10-09 18:29:26 +02:00
Onder Kalaci	e29aa51a87	Do not copy bms	2020-10-09 16:41:36 +02:00
SaitTalhaNisanci	0919d90cf8	Merge pull request #4229 from citusdata/use_pg13.0 Use pg 13.0 in tests	2020-10-09 13:53:54 +03:00
Sait Talha Nisanci	b27ed05f1a	Use pg 13.0 in tests	2020-10-09 11:51:14 +03:00
SaitTalhaNisanci	58d7c1613a	Merge pull request #4221 from citusdata/fix/vacuum_stuck Commit transaction for VACUUM on shell table	2020-10-09 11:50:31 +03:00
SaitTalhaNisanci	c7ceabc44a	Merge branch 'master' into fix/vacuum_stuck	2020-10-09 11:32:39 +03:00
Jelte Fennema	d57bbfd3f9	Add uuid-dev to Ubuntu deps in CONTRIBUTING (#4218 ) This is needed to compile postgres with --with-uuid=e2fs.	2020-10-09 10:27:47 +02:00
Sait Talha Nisanci	dc40758355	Return early if there is no citus table in VACUUM	2020-10-09 11:10:00 +03:00
Sait Talha Nisanci	99bb79745a	Commit transaction for VACUUM on shell table With postgres 13, there is a global lock that prevents multiple VACUUMs happening in the current database. This global lock is taken for a short time but this creates a problem because of the following: - We execute the VACUUM for the shell table through the standard process utility. In this step the global lock is taken for the current database. - If the current node has shard placements then it tries to execute VACUUM over a connection to localhost with ExecuteUtilityTaskList. - the VACUUM on shard placements cannot proceed because it is waiting for the global lock for the current database to be released. - The acquired lock from the VACUUM for shell table will not be released until the transaction is committed. - So there is a deadlock. As a solution, we commit the current transaction in case of VACUUM after the VACUUM is executed for the shell table. Executing the VACUUM on a shell table is not important because the data there will probably be truncated. PostprocessVacuumStmt takes the necessary locks on the shell table so we don't need to take any extra locks after we commit the current transaction.	2020-10-09 10:57:44 +03:00
Hadi Moshayedi	e481e73d18	Encapsulate snapshot used for reading stripes in cstore_metadata_tables	2020-10-08 16:02:45 -07:00
Hadi Moshayedi	76a71aa61a	Use SnapshotDirty for reading metadata in truncation	2020-10-08 14:46:42 -07:00
Hadi Moshayedi	55885c81dd	log stats on verbose	2020-10-08 14:42:49 -07:00
Hadi Moshayedi	37e3845e6a	Address Nils feedback	2020-10-08 14:42:49 -07:00
Hadi Moshayedi	74dd1facf3	add isolation tests	2020-10-08 14:42:49 -07:00
Hadi Moshayedi	2ede755107	Initial version of VACUUM	2020-10-08 14:42:49 -07:00
Hadi Moshayedi	aa3032cfdd	Address feedback	2020-10-08 14:42:31 -07:00
Hadi Moshayedi	eeb25aca85	Add a test which checks for resource clean-up	2020-10-08 14:42:31 -07:00
Hadi Moshayedi	7cc8c8c155	Support VACUUM FULL	2020-10-08 14:42:31 -07:00
Hadi Moshayedi	ad78260c3d	Merge pull request #21 from citusdata/warn_shadows Remove shadowed variable definitions	2020-10-08 13:17:30 -07:00
Hadi Moshayedi	d1c7d9f09d	address feedback	2020-10-08 11:50:32 -07:00
Hadi Moshayedi	92e1603443	Remove shadowed variables	2020-10-08 11:03:07 -07:00
Nils Dijk	9b9b9e2cf0	remove double declaration of stripeMetadata (#20 ) Compilers seem to behave differently with variable shadowing, as both I and the marlin deployment have segfaults when querying a cstore table today; however, CI seems not to care :D This removes a double declaration that was not caught in #10	2020-10-08 19:07:18 +02:00
Marco Slot	fd40605745	Merge pull request #4222 from citusdata/fix/multiple-maintenanced	2020-10-08 16:45:39 +02:00
Marco Slot	881e5df780	Fix a bug that could lead to multiple maintenance daemons	2020-10-08 16:18:14 +02:00
Marco Slot	18219843d0	Add maintenance daemon error tests	2020-10-08 16:17:33 +02:00
Marco Slot	2e02a30e37	Merge pull request #4226 from snopoke/patch-2	2020-10-08 13:59:48 +02:00
Simon Kelly	03e007e4eb	Merge branch 'master' into patch-2	2020-10-08 13:00:11 +02:00
Simon Kelly	50fa4af7e4	update migration script	2020-10-08 12:52:27 +02:00
Metin Döşlü	6f394e8b1e	Update CLA link (#4227 )	2020-10-08 12:57:26 +03:00
Simon Kelly	6fffee7616	Drop backup table after upgrade The prepare for upgrade script creates the `public.pg_dist_rebalance_strategy` table which is not dropped when the upgrade is finished. This may block future upgrades.	2020-10-08 09:48:04 +02:00
Marco Slot	f904ce1726	Merge pull request #4215 from citusdata/fix/rls-moves	2020-10-06 14:16:40 +02:00
Marco Slot	73fc054c27	Rename DDL command functions	2020-10-06 11:30:56 +02:00
Marco Slot	4f69298d90	Fix RLS and replica identity propagation on shard move	2020-10-06 11:30:03 +02:00
Marco Slot	bce11514b9	Merge pull request #4203 from citusdata/fix/sequence-drop	2020-10-06 11:22:21 +02:00
Marco Slot	dbc348b7e0	Create sequence dependency during metadata syncing	2020-10-06 10:57:39 +02:00
Marco Slot	9bba8bb4e8	Remove master_drop_sequences	2020-10-06 10:57:33 +02:00
SaitTalhaNisanci	b271a4890b	Merge pull request #4216 from citusdata/write_to_postgres_config Write settings to postgres configuration file directly	2020-10-05 22:43:11 +03:00
Sait Talha Nisanci	078dcae18c	Write settings to postgres configuration file directly In our test structure, we have been passing postgres configurations from the terminal, which causes problems once the command exceeds a certain length: the server cannot start, and understanding why it failed is not easy because there isn't a nice error message. This commit changes this to write the settings directly to the postgres configuration file. This way we can add as many postgres settings as we want without needing to worry about the length problem.	2020-10-05 22:09:08 +03:00
Hadi Moshayedi	434275d46b	Merge pull request #17 from citusdata/truncate_cleanup Implement nontransactional TRUNCATE + resource clean-up on TRUNCATE	2020-10-05 11:41:09 -07:00
Hadi Moshayedi	62fc59202c	Implement nontransactional truncate	2020-10-05 10:09:19 -07:00
Hadi Moshayedi	b72a4d8d19	Clean-up old metadata on TRUNCATE	2020-10-05 10:08:26 -07:00
Hadi Moshayedi	2e47bf5172	Merge pull request #18 from citusdata/rollback Fix writes after rollback	2020-10-05 09:53:13 -07:00
Hadi Moshayedi	a8da9acc63	Fix writes after rollback	2020-10-05 09:51:24 -07:00
Hadi Moshayedi	e5a3bd18ae	Merge pull request #14 from citusdata/resource_cleanup Resource cleanup	2020-10-05 09:31:48 -07:00
Hadi Moshayedi	a70b0c362e	Rename cstore_tables to cstore_data_files	2020-10-05 09:28:40 -07:00
Hadi Moshayedi	a87c15a1e1	Address feedback	2020-10-05 09:28:40 -07:00
Ahmet Gedemenli	889fc2db5f	Merge pull request #4214 from citusdata/degrade-gracefully-when-no-background-workers Degrade gracefully when no background workers available	2020-10-05 17:26:44 +03:00
Ahmet Gedemenli	81db4dca5c	Degrade gracefully when no background workers available	2020-10-05 16:55:00 +03:00
Onur Tirtir	2cd0a69dfb	Fix multi-row & router INSERT crash with local exec. when def. cols not specified (#4197 ) Multi-row & router INSERT's were crashing with local execution if at least one of the DEFAULT columns were not specified in VALUES list. This was because, the changes we make on query->values_lists and query->targetList was sufficient for deparsing given INSERT for remote execution but not sufficient for local execution. With this commit, DEFAULT value normalization for multi-row & router INSERT's is fixed by adding dummy column references for unspecified DEFAULT columns.	2020-10-05 10:45:17 +03:00
Hanefi Onaldi	ba88ed3f0b	Merge pull request #4207 from citusdata/no-worker-hash-in-insert-select	2020-10-02 18:27:36 +03:00
Hanefi Önaldı	6d8e83d24f	Replace worker_hash calls with partkey IS NOT NULL filters	2020-10-02 18:16:24 +03:00
Önder Kalacı	df5aa0f0cc	Switch to sequential execution if the index name is long (#4209 ) Citus has the logic to truncate the long shard names to prevent various issues, including self-deadlocks. However, for partitioned tables, when index is created on the parent table, the index names on the partitions are auto-generated by Postgres. We use the same Postgres function to generate the index names on the shards of the partitions. If the length exceeds the limit, we switch to sequential execution mode.	2020-10-02 13:39:34 +03:00
SaitTalhaNisanci	45bb0fb587	Do initial cleanup only once in pg_init (#4213 ) In the postmaster's execution of _PG_init, IsUnderPostmaster will be false and we want to do the cleanup at that time only, otherwise there is a chance that there will be parallel queries and we might do a cleanup for things that are already in use.	2020-10-02 09:12:39 +03:00
Ahmet Gedemenli	6a341b6ab8	Merge pull request #4196 from citusdata/support-explain-analyze-wal Support EXPLAIN(ANALYZE, WAL)	2020-10-01 14:43:42 +03:00
Ahmet Gedemenli	70e9edb4f2	Add subplan test with insert	2020-10-01 13:58:55 +03:00
Jelte Fennema	13ef8252e7	Add broken distributed subplan test	2020-10-01 13:52:42 +03:00
Ahmet Gedemenli	3357eea46b	Add regression tests for PG13 WAL	2020-10-01 13:52:42 +03:00
Ahmet Gedemenli	d268aa7bc8	Support EXPLAIN(ANALYZE, WAL)	2020-10-01 13:52:42 +03:00
Önder Kalacı	f3962fc7f6	Merge pull request #4199 from citusdata/terminate_connection Forcefully terminate connections after citus.node_connection_timeout	2020-10-01 08:56:39 +02:00
Onder Kalaci	56ca256374	Forcefully terminate connections after citus.node_connection_timeout After the connection timeout, we fail the session/pool. However, the underlying connection can still be trying to connect. That is dangerous because the new placement executions have already been in place. The executor cannot handle the situation where multiple EXECUTION_ORDER_ANY task executions succeed. Adding a regression test doesn't seem easily doable. To reproduce the issue - Add 2 worker nodes - create a reference table - set citus.node_connection_timeout to 1ms (requires code change) - Continuously execute `SELECT count(*) FROM ref_table` - Sometime later, you hit an out-of-array access in `ScheduleNextPlacementExecution()` hence crashing. - The reason for that is sometimes the first connection is successfully established while the executor is already trying to execute the query on the second node.	2020-09-30 18:24:24 +02:00
Hanefi Onaldi	2894002211	Merge pull request #4208 from citusdata/cleanup-pgoptions Remove some pgoptions to prevent hitting bash command character limits	2020-09-30 17:04:27 +03:00
Hanefi Önaldı	9ec85f1283	Remove some pgoptions to prevent hitting bash command character limits	2020-09-30 15:04:40 +03:00
Onur Tirtir	3f8ac527c9	Merge pull request #4205 from citusdata/update-cl-941 Update CHANGELOG for 9.4.1	2020-09-30 10:47:45 +03:00
Onur Tirtir	bc29238546	Update CHANGELOG for 9.4.1	2020-09-30 10:09:54 +03:00
Hanefi Onaldi	85d32bcf35	Merge pull request #4198 from citusdata/disallow-volatile-subquery-in-updates Disallow volatile functions on single shard update subqueries	2020-09-29 16:27:13 +03:00
Hanefi Önaldı	b0a2c1ee5c	Disallow volatile functions on single shard update queries We currently do not support volatile functions in update/delete statements because the function evaluation logic does not know how to distinguish volatile functions (that need to be evaluated per row) from stable functions (that need to be evaluated per query), and it is also not safe to push the volatile functions down on replicated tables.	2020-09-29 15:40:21 +03:00
Marco Slot	12ecdea790	Merge pull request #4173 from citusdata/fix/create-index-concurrently-local	2020-09-29 10:15:40 +02:00
Hadi Moshayedi	d37c717e14	Clean-up resources on drop	2020-09-28 22:49:24 -07:00
Hadi Moshayedi	cf0ba6103e	Associate metadata with rel filenode	2020-09-28 22:43:33 -07:00
Hadi Moshayedi	207eedc35a	Merge pull request #16 from citusdata/analyze Initial implementation of ANALYZE	2020-09-28 06:57:15 -07:00
Hadi Moshayedi	ec1e277e8e	Initial implementation of ANALYZE	2020-09-26 23:55:46 -07:00
Hadi Moshayedi	d352a987fa	Merge pull request #12 from citusdata/memcxt reset memory context at end of execution	2020-09-26 23:37:09 -07:00
Hadi Moshayedi	5a077f2308	Remove the unused drop event trigger	2020-09-25 13:10:32 -07:00
Hadi Moshayedi	1d69519bd8	Delete autogenerated expected files	2020-09-25 13:03:34 -07:00
Marco Slot	b905c8043d	Fix create index concurrently crash with local execution	2020-09-25 11:49:09 +02:00
Ahmet Gedemenli	e892e253b1	Merge pull request #4191 from citusdata/sort-explain-analyze-output-by-time Sort explain analyze output by task time	2020-09-24 14:38:06 +03:00
Ahmet Gedemenli	abfb79bda6	Sort explain analyze output by task time Add sort method parameter for regression tests Fix check-style Change sorting method parameters to enum Polish Add task fields to OutTask Add test into multi_explain Fix isolation test	2020-09-24 11:38:40 +03:00
Jeff Davis	7714b60e5e	reset memory context at end of execution	2020-09-23 22:53:49 -07:00
Hadi Moshayedi	398394056c	Merge pull request #10 from citusdata/cleanup_metadata Metadata simplification and some refactoring	2020-09-23 10:42:48 -07:00
Hadi Moshayedi	a34cdeb83c	Remove StripeFooter	2020-09-23 10:40:55 -07:00
Jeff Davis	1b45cfb52e	remove generated sql test files	2020-09-23 09:53:32 -07:00
Hadi Moshayedi	db5287069f	Make block offsets relative to stripe start	2020-09-23 09:21:13 -07:00
Hadi Moshayedi	bc585be3ed	Save blockRowCount in StripeMetadata	2020-09-23 09:21:13 -07:00
jeff-davis	be5a586843	Merge pull request #9 from citusdata/tableam Tableam	2020-09-22 08:00:59 -07:00
Onur Tirtir	64d5ac6a10	Do not downgrade if a citus local table exists (#4174 ) As the previous versions of Citus don't know how to handle citus local tables, we should prevent downgrading from 9.5 to older versions if any citus local tables exists.	2020-09-22 14:19:50 +03:00
Jeff Davis	8af9c91540	address review comments	2020-09-21 18:13:14 -07:00
SaitTalhaNisanci	dba7e052df	Merge enterprise branch if it exists (#4181 ) * Merge enterprise branch if it exists We should merge the enterprise branch if it exists in the check enterprise merge job, otherwise the following can happen: - there is some change on community that breaks the compilation on enterprise without creating any conflicts - we fix the compilation issue by opening a branch on enterprise - the job doesn't see the enterprise specific fix because it doesn't try to merge enterprise branch if there are no conflicts * Update ci/check_enterprise_merge.sh Co-authored-by: Jelte Fennema <github-tech@jeltef.nl> * Simplify the steps Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2020-09-21 19:31:10 +03:00
Önder Kalacı	bc293d9d5e	Merge pull request #4167 from citusdata/metadata_improvements Improve the robustness of function call delegation	2020-09-21 15:14:21 +02:00
Onder Kalaci	5d017cd123	Improve node metadata when coordinator is added Coordinator should always be active, hasmetadata and metadatasynced. Prevent changing those fields.	2020-09-21 14:53:41 +02:00
Onder Kalaci	6fc1dea85c	Improve the robustness of function call delegation Pushing down the CALLs to the node that the CALL is executed is dangerous and could lead to infinite recursion. When the coordinator added as worker, Citus was by chance preventing this. The coordinator was marked as "not metadatasynced" node in pg_dist_node, which prevented CALL/function delegation to happen. With this commit, we do the following: - Fix metadatasynced column for the coordinator on pg_dist_node - Prevent pushdown of function/procedure to the same node that the function/procedure is being executed. Today, we do not sync pg_dist_object (e.g., distributed functions metadata) to the worker nodes. But, even if we do it now, the function call delegation would prevent the infinite recursion.	2020-09-21 14:53:30 +02:00
SaitTalhaNisanci	e7cd1ed0ee	Not take ShareUpdateExclusiveLock on pg_dist_transaction (#4184 ) * Not take ShareUpdateExclusiveLock on pg_dist_transaction We were taking ShareUpdateExclusiveLock on pg_dist_transaction during recovery to prevent multiple recoveries happening concurrently. VACUUM (not FULL) also takes ShareUpdateExclusiveLock, and they can conflict. It seems that VACUUM will skip the table if there is a conflicting lock already taken unless it is doing the vacuum to prevent id wraparound, in which case there can be a deadlock. I guess the deadlock happens if: - VACUUM takes a lock on pg_dist_transaction and is done for id wraparound problem - The transaction in the maintenance daemon tries to take a lock but cannot as that conflicts with the lock acquired by VACUUM - The transaction in the maintenance daemon has a very old xid hence VACUUM cannot proceed. If we take a row exclusive lock in transaction recovery then it wouldn't conflict with VACUUM hence it could proceed so the deadlock would be resolved. To prevent concurrent transaction recoveries happening, an advisory lock is taken with ShareUpdateExclusiveLock as before. * Use CITUS_OPERATIONS tag	2020-09-21 15:20:38 +03:00
Jeff Davis	c303f0f135	improve rel size estimate	2020-09-18 12:15:08 -07:00
Jeff Davis	a05e75a6d1	fixup	2020-09-18 11:59:28 -07:00
Jeff Davis	06f1c96975	almost works	2020-09-18 11:37:39 -07:00
Onur Tirtir	e69ee407e1	Merge pull request #4176 from citusdata/refactor/id_list_functions Refactor the functions that return OID lists for citus tables	2020-09-18 20:49:05 +03:00
Jeff Davis	0f43534845	fixup guc	2020-09-18 09:26:20 -07:00
Jeff Davis	fbe4728287	use GUCs	2020-09-18 09:19:41 -07:00
Jeff Davis	9f9bb64c4c	fixup	2020-09-18 09:18:03 -07:00
Jeff Davis	12daf4c317	add GUCs	2020-09-18 09:09:02 -07:00
Jeff Davis	d7f40f3be6	address review comments	2020-09-18 08:59:45 -07:00
Onur Tirtir	1b31b22635	Refactor the functions that return OID lists for citus tables	2020-09-18 16:42:46 +03:00
SaitTalhaNisanci	dae2c69fd7	Not allow removing a single node with ref tables (#4127 ) * Not allow removing a single node with ref tables We should not allow removing a node if it is the only node in the cluster and there is data on it. We have this check for distributed tables but we didn't have it for reference tables. * Update src/test/regress/expected/single_node.out Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> * Update src/test/regress/sql/single_node.sql Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com>	2020-09-18 15:35:59 +03:00
SaitTalhaNisanci	6e316d46a2	Remove unused variable (#4172 )	2020-09-18 11:25:07 +03:00
Ahmet Gedemenli	e076d2a14e	Merge pull request #4168 from citusdata/shorten-insert-select-test Shorten insert select connection leak test	2020-09-18 10:43:02 +03:00
Ahmet Gedemenli	1cf11b4632	Shorten insert_select_connection_leak_test	2020-09-18 10:07:15 +03:00
Önder Kalacı	8d3f353746	Add more tests for single node citus - distributed tables (#4166 )	2020-09-17 17:50:35 +02:00
Marco Slot	966718c76a	Merge pull request #4171 from citusdata/fix/explain-analyze-truncation Fix EXPLAIN ANALYZE truncation	2020-09-17 14:56:30 +02:00
Marco Slot	c9d46c618b	Fix EXPLAIN ANALYZE truncation	2020-09-17 14:42:21 +02:00
Onur Tirtir	d81559b7f8	Use "table" instead of "reference table" in sequential truncate log (#4164 ) We might get this debug message for citus local tables as well	2020-09-17 14:37:36 +03:00
SaitTalhaNisanci	5723038f74	Comment user provided input memory allocation (#4163 )	2020-09-17 13:18:13 +03:00
Jeff Davis	b9f2b410b5	fix am_alter test	2020-09-16 15:29:24 -07:00
Jeff Davis	d352cd07dd	citus indent and Makefile fixup	2020-09-16 11:51:23 -07:00
Jeff Davis	4dfec401ce	more Makefile cleanup	2020-09-16 11:10:40 -07:00
Jeff Davis	ec8afe0a5d	better makefile	2020-09-16 11:10:40 -07:00
Jeff Davis	3b3d1b1f89	11 and 12 both pass	2020-09-16 11:10:40 -07:00
Jeff Davis	248a2db970	fixup	2020-09-16 11:10:40 -07:00
Jeff Davis	ada9da609e	fixup mod.c	2020-09-16 11:10:40 -07:00
Jeff Davis	a3b513167c	disable a few tests	2020-09-16 11:10:40 -07:00
Jeff Davis	c49acc948a	more test fixes........	2020-09-16 11:10:40 -07:00
Jeff Davis	fd6b4aeba2	more tests...	2020-09-16 11:10:39 -07:00
Jeff Davis	7ba75fc2a6	more tests pass	2020-09-16 11:10:39 -07:00
Jeff Davis	83f2d4aef2	more fixes	2020-09-16 11:10:39 -07:00
Jeff Davis	18f6829621	more fixes	2020-09-16 11:10:39 -07:00
Jeff Davis	a57b9004a4	tests WIP	2020-09-16 11:10:39 -07:00
Jeff Davis	f886fb33e5	add AM tests	2020-09-16 11:10:39 -07:00
Jeff Davis	aa422f2da0	fixup rebase	2020-09-16 11:10:39 -07:00
Jeff Davis	b06f48a2a7	tableAM updates	2020-09-16 11:10:39 -07:00
Jeff Davis	b6ca8fcd70	extension control	2020-09-16 11:10:39 -07:00
Jeff Davis	48e9c17b50	stubs for table access method	2020-09-16 11:10:39 -07:00
jeff-davis	30b78d6f54	Merge pull request #8 from citusdata/circleci-project-setup Circleci project setup	2020-09-16 09:42:14 -07:00
Nils Dijk	1e93e15a8d	fix indentation via citus_indent	2020-09-16 15:22:09 +02:00
Nils Dijk	20a8bca426	add integration files for circle ci This is based on the circle ci integration we have for citus, albeit highly simplified.	2020-09-16 15:21:42 +02:00
Nils Dijk	09208986ba	remove travis	2020-09-16 15:21:42 +02:00
Jeff Davis	fe7ab6df84	Rename tests to be FDW-specific.	2020-09-15 12:51:15 -07:00
Jeff Davis	f7f59933f8	fix v11 tests	2020-09-15 12:48:44 -07:00
Hadi Moshayedi	d69bff7621	Use schema config in control file	2020-09-15 10:06:11 -07:00
Onur Tirtir	4118560b75	Prevent citus local table creation from a catalog table (#4158 )	2020-09-15 14:30:48 +03:00
Nils Dijk	00cb58135d	Merge pull request #7 from citusdata/add-wal Add wal support	2020-09-15 12:41:41 +02:00
Nils Dijk	a94bbcc7ef	write wal entries when writing to the buffers	2020-09-15 12:38:50 +02:00
Hadi Moshayedi	139da88ad9	Remove some unnecessary code & fix compiler warnings	2020-09-14 15:08:50 -07:00
Hadi Moshayedi	c1cf3fe6e7	Merge pull request #5 from citusdata/skiplist_to_metadata_tables Move skipnodes to metadata tables	2020-09-14 15:00:29 -07:00
Hadi Moshayedi	2737686fd0	Move skipnodes to metadata tables	2020-09-14 14:57:13 -07:00
jeff-davis	c570932712	Merge pull request #6 from citusdata/smgr Smgr	2020-09-14 13:52:25 -07:00
Hadi Moshayedi	fb110446be	Fix compilation in pg 11	2020-09-14 13:13:36 -07:00
Önder Kalacı	e7079d1384	Add orderbys to some tests (#4162 )	2020-09-14 16:59:22 +02:00
Jeff Davis	573555747f	address review comments	2020-09-11 16:28:57 -07:00
jeff-davis	b8b5d3aeee	Merge pull request #4 from citusdata/fdw-relfilenode create relfilenode for FDW	2020-09-11 16:16:55 -07:00
Jeff Davis	dee408248c	Replace file access with Smgr	2020-09-11 16:14:36 -07:00
Jeff Davis	a2f7eadeb9	lock while initializing relfilenode	2020-09-11 16:02:00 -07:00
Jeff Davis	b18c9c8060	drop storage for DROP command	2020-09-11 15:04:46 -07:00
Jeff Davis	e9045227cd	create relfilenode for FDW	2020-09-11 12:48:00 -07:00
Marco Slot	94736ce78d	Merge pull request #3938 from citusdata/fix/extension-dist-tables	2020-09-11 12:24:35 +02:00
Onur Tirtir	9a56c22917	Add udf tests with citus local tables (#4154 )	2020-09-11 12:36:53 +03:00
Hadi Moshayedi	407892a9dd	Merge pull request #2 from citusdata/table_footer_to_metadata_tables Move table footer to metadata tables	2020-09-09 22:27:23 -07:00
Marco Slot	b82f6ee163	Add tests for distributing catalog tables	2020-09-10 04:46:11 +02:00
Marco Slot	bd12555b16	Fix distributing tables owned by extensions	2020-09-10 04:46:11 +02:00
Hadi Moshayedi	0d4e249c97	Reuse the same state for multiple inserts	2020-09-09 14:17:30 -07:00
Hadi Moshayedi	35a52a6fe1	Use cstore namespace instead of pg_catalog.	2020-09-09 11:04:27 -07:00
Onur Tirtir	5e5ba46793	Merge pull request #4143 from citusdata/single-placement-table/master-cache-entry-rebased DESCRIPTION: Introduce citus local tables The commits in this pr are merged from other sub-pr's: * community/#3852: Brings lazy&fast table creation logic for create_citus_local_table udf * community/#3995: Brings extended utility command support for citus local tables * community/#4133: Brings changes in planner and in several places to integrate citus local tables into our distributed execution logic We are introducing citus local tables, which are a new table type in citus. To be able to create a citus local table, first we need to add coordinator as a worker node. Then, we can create a citus local table via SELECT create_citus_local_table(<tableName>). Calling this udf from coordinator will actually create a single-shard table whose shard is on the coordinator. Also, from the citus metadata perspective, for citus local tables: * partitionMethod is set to DISTRIBUTE_BY_NONE (like reference tables) and * replicationModel is set to the current value of citus.replication_model, which already can't be equal to REPLICATION_MODEL_2PC, which is only used for reference tables internally. Note that currently we support creating citus local tables only from postgres tables living in the coordinator. That means, it is not allowed to execute this udf from worker nodes or it is not allowed to move shard of a citus local table to any other nodes. Also, run-time complexity of calling create_citus_local_table udf does not depend on the size of the relation, that means, creating citus local tables is actually a non-blocking operation. 
This is because, instead of copying the data to a new shard, this udf just does the following: * convert input postgres table to the single-shard of the citus local table by suffixing the shardId to its name, constraints, indexes and triggers etc., * create a shell table for citus local table in coordinator and in mx-worker nodes when metadata sync is enabled. * create necessary objects on shell table. Here, we should also note we can execute queries/dml's from mx worker nodes as citus local tables are already first class citus tables. Even more, we brought trigger support for citus local tables. That means, we can define triggers on citus local tables so that users can define trigger objects to perform execution of custom functions that might even modify other citus tables and other postgres tables. Other than trigger support, citus local tables can also be involved in foreign key relationships with reference tables. Here the only restriction is, foreign keys from reference tables to citus local tables cannot have behaviors other than RESTRICT & NO ACTION behavior. Other than that, foreign keys between citus local tables and reference tables just work fine. All in all, citus local tables are actually just local tables living in the coordinator, but natively accessible from other nodes like other first class citus tables and this enables us to set foreign key constraints between very big coordinator tables and reference tables without having to do any data replication to worker nodes for local tables.	2020-09-09 13:02:42 +03:00
Onur Tirtir	3a73fba810	Apply planner changes for citus local tables	2020-09-09 11:51:18 +03:00
Onur Tirtir	0b1cc118a9	Adapt other cache entry changes for citus local tables	2020-09-09 11:50:55 +03:00
Onur Tirtir	a58a4395ab	Extend citus local table utility command support This commit brings following features: Foreign key support from citus local tables to reference tables * Foreign key support from reference tables to citus local tables (only with RESTRICT & NO ACTION behavior) * ALTER TABLE ENABLE/DISABLE trigger command support * CREATE/DROP/ALTER trigger command support and disallows: * ALTER TABLE ATTACH/DETACH PARTITION commands * CREATE TABLE <postgres table> ATTACH PARTITION <citus local table> commands * Foreign keys from postgres tables to citus local tables (the other way was already disallowed) for citus local tables.	2020-09-09 11:50:55 +03:00
Onur Tirtir	17cc810372	Implement "citus local table" creation logic	2020-09-09 11:50:48 +03:00
Hadi Moshayedi	10fd94a9e3	Address feedback	2020-09-08 19:05:07 -07:00
Hadi Moshayedi	9e247cdf40	Move table footer to metadata tables	2020-09-07 21:53:28 -07:00
Hadi Moshayedi	85a51fb2ef	Merge pull request #3 from citusdata/add_reindent Add 'make reindent'	2020-09-07 15:51:09 -07:00
Hadi Moshayedi	b74de68ce3	Add 'make reindent'	2020-09-07 15:48:23 -07:00
Hadi Moshayedi	4b1e80a19f	Merge pull request #1 from citusdata/use_metadata_tables Move StripeFooter to metadata tables.	2020-09-07 15:33:26 -07:00
Hadi Moshayedi	f691576f13	Move StripeFooter to metadata tables.	2020-09-07 15:22:52 -07:00
Hadi Moshayedi	406bebe4b8	update .gitignore	2020-09-07 15:22:52 -07:00
Onur Tirtir	ba208eae4d	Record non-distributed table accesses in local executor (#4139 )	2020-09-07 18:19:08 +03:00
Nils Dijk	959629d3f3	Merge pull request #4136 from citusdata/fix/ensure-reference-transfer-mode expose transfer mode for ensure reference table existence	2020-09-03 16:18:17 +02:00
Nils Dijk	bbf42063a7	export LookupShardTransferMode	2020-09-03 16:06:38 +02:00
Nils Dijk	6e4862c57f	expose transfermode for ensure reference table existence	2020-09-03 16:06:37 +02:00
SaitTalhaNisanci	366461ccdb	Introduce cache entry/table utilities (#4132 ) Introduce table entry utility functions Citus table cache entry utilities are introduced so that we can easily extend existing functionality with minimum changes, specifically changes to these functions. For example IsNonDistributedTableCacheEntry can be extended for citus local tables without the need to scan the whole codebase and update each relevant part. * Introduce utility functions to find the type of tables A table type can be a reference table, a hash/range/append distributed table. Utility methods are created so that we don't have to worry about how a table is considered as a reference table etc. This also makes it easy to extend the table types. * Add IsCitusTableType utilities * Rename IsCacheEntryCitusTableType -> IsCitusTableTypeCacheEntry * Change citus table types in some checks	2020-09-02 22:26:05 +03:00
Jeff Davis	3089c92103	header file and include cleanup	2020-09-02 11:41:01 -07:00
Jeff Davis	59d5d96170	move _PG_* declarations to mod.h	2020-09-02 10:31:10 -07:00
Jeff Davis	ba506acd35	Refactor the FDW API to take code out of cstore_fdw.c.	2020-09-01 21:26:46 -07:00
Citus Team	abc9fbe1c3	Squash of original cstore_fdw	2020-11-05 14:17:20 +01:00
Jelte Fennema	f38d24f8fb	Merge pull request #4006 from citusdata/orerror-instead-of-force Rename ForceXxx functions to XxxOrError	2020-09-01 11:56:52 +02:00
Jelte Fennema	451ea04508	Rename ForceXxx functions to XxxOrError This clearer naming was suggested in https://github.com/citusdata/citus/pull/4001	2020-09-01 11:19:17 +02:00
Hanefi Onaldi	b7aad903e8	Merge pull request #4117 from citusdata/delegate-reference-procedures	2020-09-01 07:46:53 +03:00
Hanefi Önaldı	024d398cd7	Allow distribution of functions that read from reference tables create_distributed_function(function_name, distribution_arg_name, colocate_with text) This UDF did not allow colocate_with parameters when there was no distribution_arg_name supplied. This commit changes the behaviour to allow missing distribution_arg_name parameters when the function should be colocated with a reference table.	2020-09-01 07:28:34 +03:00
Jelte Fennema	d0f4c19f15	Add python to OSX CONTRIBUTING requirements (#4083 )	2020-08-31 18:14:21 +02:00
SaitTalhaNisanci	2baf3e0bae	Fail if merge to enterprise causes compilation issues (#4027 ) * check compilation of enterprise job * test that enterprise merge job fails with compilation error * Revert "test that enterprise merge job fails with compilation error" This reverts commit 0eaccd58c207a4c15365186017bf47601cc95552. * Update readme and use citus extbuilder:13beta3	2020-08-31 13:56:15 +03:00
Jelte Fennema	1c9dc80e8f	merge-enterprise script instructions to resolve outdated branches (#4073 ) Co-authored-by: Onur Tirtir <onurcantirtir@gmail.com> Co-authored-by: SaitTalhaNisanci <s.talhanisanci@gmail.com>	2020-08-31 11:10:32 +02:00
Önder Kalacı	983206c5e1	Hide `citus.subquery_pushdown` flag and NOTICE when enabled (#4124 ) * Hide citus.subquery_pushdown flag This flag is dangerous and is likely to let queries return wrong results. The flag has a very specific purpose for a very specific data distribution and query structure. In those cases, when the flag is set, the user can skip recursive planning altogether at their own risk. The meaning of the flag is that "I know what I'm doing such that the query structure/data distribution is under my control, so Citus can skip many correctness checks". For regular users, enabling this flag is discouraged. We have to keep the support only for backward compatibility for some users. In addition to that, give a NOTICE to discourage new users from using it.	2020-08-28 14:53:09 +02:00
SaitTalhaNisanci	2459ba6eca	Update docker images (#4122 ) * Update and separate test images The build image was a single one and it would contain pg11, pg12 and pg13. Now it is separated so that we can build each pg major independently. Tags are used as full postgres versions so that we can know which version we use by looking at the tag. For example exttester:11.9 would mean we are using pg11.9. pg11 is updated from 11.5 to 11.9. pg12 is updated from 12rc to 12.4. * Ignore memory usage in pg13 explain * Use citus instead of personal repo	2020-08-26 16:23:59 +03:00
SaitTalhaNisanci	f7c2af0411	Rename RemoveCoordinatorPlacement (#4125 ) RemoveCoordinatorPlacement does not do what it says. It removes the coordinator placement only if there are other placements, so it is not a single node, and only if the coordinator has a placement.	2020-08-26 13:12:10 +03:00
Onur Tirtir	2ca8d2fb33	Update codecov orb to 1.1.1 (#4112 )	2020-08-21 12:38:29 +03:00
Hanefi Onaldi	f47b3a7e7d	Remove unused parameters from round robin reordering and friends (#4120 )	2020-08-20 12:45:01 +03:00
SaitTalhaNisanci	20c39fae9a	Loosen the requirement to pushdown a subquery with ref tables (#4110 ) AllTargetExpressionsAreColumnReferences would return false if a query had an entry that is referencing the outer query. It seems safe to not have this for non-distributed tables, such as reference tables. We already have separate checks for other cases such as having limits.	2020-08-14 12:11:15 +03:00
SaitTalhaNisanci	679bf0d2b2	Create CanPushdownSubqery wrapper for better readability (#4108 )	2020-08-12 17:28:20 +03:00
SaitTalhaNisanci	73ef40886b	Rename FindNodeCheckXXX functions (#4106 ) FindNodeCheck is not clear about what the function is doing. They are renamed to FindNodeMatchingCheckFunctionXXX. Also for choosing elements in these functions, CheckNodeFunc type is introduced.	2020-08-11 15:01:23 +03:00
Hadi Moshayedi	b8d826a113	Merge pull request #4068 from citusdata/explain_analyze_execute Support EXPLAIN ANALYZE EXECUTE and EXPLAIN EXECUTE	2020-08-10 13:59:40 -07:00
Hadi Moshayedi	7b74eca22d	Support EXPLAIN EXECUTE ANALYZE.	2020-08-10 13:44:30 -07:00
SaitTalhaNisanci	e500779ddd	Merge pull request #4098 from citusdata/isolateTests Isolate each test schedule	2020-08-10 15:31:00 +03:00
Sait Talha Nisanci	4cb77da9d4	Use full names in jobs for make targets	2020-08-10 15:07:05 +03:00
Sait Talha Nisanci	1de6a8e8fb	Isolate each test schedule Since we don't have any limitation on parallelism now, it makes sense to isolate each test schedule so that: - we can use more parallelism - we will wait less on retries because if a job fails with multiple schedules, we needed to rerun all of them.	2020-08-10 15:07:05 +03:00
Philip Dubé	7d5c85657e	Merge pull request #4097 from citusdata/fix-case-insensitive-test-on-old-icu Fix non deterministic collation test to work with ancient libicu versions	2020-08-07 12:46:11 +00:00
Philip Dubé	212ae7163f	Fix non deterministic collation test to work with ancient libicu versions CentOS 7's libicu is too old for und-u-ks-level2 @colStrength=secondary works with both older & newer versions of libicu	2020-08-07 12:34:32 +00:00
Marco Slot	320a840b3f	Merge pull request #3819 from citusdata/fix/mx_multi_shard_lock	2020-08-07 12:11:18 +02:00
Hanefi Onaldi	5be8287989	Fix comments of helper functions that set local config values (#4100 )	2020-08-07 11:20:38 +03:00
Marco Slot	768d8b232c	Do not take multi-shard locks on workers	2020-08-06 21:48:25 +02:00
Halil Ozan Akgül	b7efb51e63	Merge pull request #3913 from citusdata/undistribute-table Undistribute Table	2020-08-05 15:30:52 +03:00
Halil Ozan Akgul	375310b7f1	Adds support for table undistribution	2020-08-05 14:36:03 +03:00
SaitTalhaNisanci	7378ab6bf8	Add postgres 13 to configure.in (#4088 )	2020-08-05 11:36:55 +03:00
SaitTalhaNisanci	3d1fd08fcf	Merge pull request #3900 from citusdata/enh/pg13Support Add PG13 support	2020-08-04 23:53:47 +03:00
Sait Talha Nisanci	195c2a91e2	Use citus docker repo instead of personal account	2020-08-04 23:07:00 +03:00
Sait Talha Nisanci	fe4ac51d8c	Normalize Output:.. since it changes with pg13 Fix indentation for better readability	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	283b1db6a4	add pg13 to CI	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	33406598e3	Add ruleutils changes from 3977 and 4011	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	63ed126ad4	Set buffer usage with explain It seems that currently we process even postgres tables in explain commands. This is because we register a hook for explain and we don't have any check to see if the query has any citus table. With this commit, we now send the buffer usage as well to the relevant API. There is some duplicate in the code but it is because of the existing structure, we can refactor this separately.	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	fe1e1c9b68	Replace Set_ptr_value as SetListCellPtr to be more explicit Move header to right place and fix comment style	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	8e9b52971c	Use new var field names in the codebase The codebase is updated to use varattnosyn and varnosyn and we defined the macros for older versions. This way we can just remove the macros when we drop an older version.	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	b641f63bfd	Use CMDTAG_SELECT_COMPAT CMDTAG_SELECT exists in PG12 hence defining a MACRO such as CMDTAG_SELECT -> "SELECT" is not possible. I chose CMDTAG_SELECT_COMPAT because with the COMPAT suffix it is explicit that it maps to different things in different versions and also has less chance of mapping to something irrelevant. For example if we used SELECT as a macro, then it would map every SELECT to whatever it is mapping to, which might have unexpected/undesired behaviour.	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	d68bfc5687	Improve error for index operator class parameters The error message when index has opclassopts is improved and the commit from postgres side is also included for future reference. Also some minor style related changes are applied.	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	288aa58603	add alternative out for pg13 test	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	d0b0c88920	Changelog: error out if index has opclassopts Error out if index has opclassopts. Changelog entry on PG13: Allow CREATE INDEX to specify the GiST signature length and maximum number of integer ranges (Nikita Glukhov)	2020-08-04 15:38:13 +03:00
Sait Talha Nisanci	f7a1971361	Changelog: Alter type options It seems that we don't support propagating commands related to base types. Therefore Alter TYPE options doesn't seem to apply to us. I have added a test to verify that we don't propagate them. Changelog entry on pg13: Add ALTER TYPE options useful for extensions, like TOAST and I/O functions control (Tomas Vondra, Tom Lane)	2020-08-04 15:38:11 +03:00
Sait Talha Nisanci	00633165fc	Changelog: Test unicode escapes Unicode escapes work as expected, related tests are added. Changelog entry on PG13: Allow Unicode escapes, e.g., E'\u####', U&'\####', to specify any character available in the database encoding, even when the database encoding is not UTF-8 (Tom Lane)	2020-08-04 15:36:30 +03:00
Sait Talha Nisanci	79dcb80140	Changelog: Test IS NORMALIZED for pg13 Tests for is_normalized and normalize are added. One issue, which seems to stem from an existing bug, is that when we don't give the optional second argument to normalize or is_normalized, it crashes, because the expression in the executor does not have the default argument. Changelog entry in PG-13: Add SQL functions NORMALIZE() to normalize Unicode strings, and IS NORMALIZED to check for normalization (Peter Eisentraut) Commit on Postgres: 2991ac5fc9b3904ca4582be6d323497d7c3d17c9	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	ebabca16b7	Changelog: Test row suffix notation It seems that row suffix notation is working fine with our code, a test is added. Changelog entry in PG13: Allow ROW values to have their members extracted with suffix notation (Tom Lane)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	275ccd0400	Changelog: Test that alter view rename column works Changelog entry in PG13: Add ALTER VIEW syntax to rename view columns (Fujii Masao)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	920d7211e4	Changelog: Test that we error out for DROP EXPRESSION PG13 now supports dropping expression from a column such as generated columns. We error out with this currently. Changelog entry in postgres: Add ALTER TABLE clause DROP EXPRESSION to remove generated properties from columns (Peter Eisentraut)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	87088d92bc	Changelog: handle VACUUM PARALLEL option Postgres 13 added a new VACUUM option, PARALLEL. It is now supported in our code as well. Relevant changelog message on postgres: Allow VACUUM to process indexes in parallel (Masahiko Sawada, Amit Kapila)	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	1070828465	update cte inline output for pg13 Make some macros in version_compat more robust Remove commented code in ruleutils Remove unnecessary variable assignments	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	157af140e4	ignore concurrent root page split debugs	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	1112b254a7	adapt recently added code for pg13 This commit mostly adds pg_get_triggerdef_command to our ruleutils_13. This doesn't add anything extra for ruleutils 13 so it is basically a copy of the change on ruleutils_12	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	ff7a563c57	decrease log level to debug1 to prevent flaky debug	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	6ff4e42706	Add alternative output for multi_function_in_join With pg13, constant functions in the "FROM" clause are replaced. This means that on the citus side, we will see the constraints in restriction info, instead of the function call. For example: SELECT * FROM table1 JOIN add(3,5) sum ON (id = sum) ORDER BY id ASC; Assuming that the function `add` returns a constant, it will be evaluated on the postgres side. This means that this query will be routable because there will be only one shard after pruning with the restrictions. However before pg13, this would be a multi shard query. And it would go into recursive planning, the function would be evaluated on the coordinator because it can be. This means that with pg13, users will need to distribute the function because when the query is router executable, it will currently also send the function call to the worker in the query. So the function should exist in the worker. It could be better to replace the constant in the query tree as well so that the query string sent to the worker has the constant value and therefore it doesn't need the function. However I feel like users would already have the function in workers if they have any multi shard query. Commit on Postgres side: 7266d0997dd2a0632da38a594c78e25ff21df67e	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	a34a1126ec	add alternative output for pg13 in some tests	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	108a2972c2	Introduce a workaround for join aliases When there is a join alias, var->varnosyn will point to the alias and var->varno will point to the table itself, but we need to use the alias when deparsing the query. Hence a workaround is introduced to solve this problem in ruleutils. Normally this case can be understood with dpns->plan == NULL check but in our case, dpns->plan is always NULL. We should sync our ruleutils at some point with postgres ruleutils. This could be a wrong solution as well but the tests pass.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	c5c9ec288f	fix multi_mx_create_table test	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	76c7b3d1c6	Remove unused steps in isolation tests PG13 gives a warning for unused steps therefore we should remove the unused steps in isolation tests.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	17388e2e91	update some tests	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	6ad708642e	Fix rte index with pg >=13 Rte index is increased by range table index offset in pg >= 13. The offset is removed with the pg >= 13. Currently pushdown for union all is disabled because translatedVars is set to nil on postgres side, and we were using translatedVars to figure out if partition key has the same index in both sides of union all. This should be fixed. Commit on postgres side: 6ef77cf46e81f45716ec981cb08781d426181378 fix union all pushdown logic for pg13 Before pg 13, there was a field, translatedVars, and we were using that to understand if the partition key has the same index on both sides of the union all. With pg13 there is a parent_colnos field in appendRelInfo and we can use that to get the attribute numbers(varattnos) in union all vars. We make use of parent_colnos instead of translatedVars in pg >=13.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	de82d0ff79	add output for pg13 for propagate extension commands CREATE EXTENSION <name> FROM <old_version> is not supported anymore with postgres 13. An alternative output is added for pg13 where we basically error for that statement.	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	80d2bc2317	normalize some output and sort test result	2020-08-04 15:18:27 +03:00
Sait Talha Nisanci	0f6c21d418	sort result in ch_bench_having_mx test	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	70f27c10e5	Add some normalization rules for tests The not-null constraint message changed with pg13 slightly hence a normalization rule is added for that, which converts it to pg < 13 output. Commit on postgres: 05f18c6b6b6e4b44302ee20a042cedc664532aa2 An extra debug message is added related to indexes on postgres, these are safe to be ignored, so we can delete them from tests. Commit on Postgres side: 612a1ab76724aa1514b6509269342649f8cab375 varnoold is renamed as varnosyn and varoattno is renamed as varattnosyn so in the output we normalize the values as the old ones to simply pass the tests.	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	3cc7717e64	Fill new join fields for PG>=13 For joins 3 new fields are added, joinleftcols, joinrightcols, and joinmergedcols. We are not interested in joinmergedcols because we always expand the column used in joins. Therefore joinmergedcols is always 0 in our case. For filling joinleftcols and joinrightcols we basically construct the lists with sequences so either list is of the form: [1 2 3 4 .... n] Ruleutils is not completely synced with postgres ruleutils and the most important part is the identify_join_columns function change, which now uses joinleftcols and joinrightcols. Commit on postgres side: 9ce77d75c5ab094637cc4a446296dc3be6e3c221 A useful email thread: https://www.postgresql.org/message-id/flat/7115.1577986646%40sss.pgh.pa.us#0ae1d66feeb400013fbaa67a7cccd6ca	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	bc20920252	introduce SetJoinRelatedColumnsCompat PG13 uses joinmergedcols, joinleftcols and joinrightcols for finding join order now. The relevant fields are set on citus side. Postgres side commit: 9ce77d75c5ab094637cc4a446296dc3be6e3c221	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	135af84859	Update ruleutils for join related changes of postgres Postgres changed some join related fields and therefore they also changed ruleutils, this commit applies those changes to our copy of ruleutils. Related commit on postgres side: 9ce77d75c5ab094637cc4a446296dc3be6e3c221	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	38aaf1faba	use QueryCompletion struct Postgres introduced QueryCompletion struct. Hence a compat utility is added to finish query completion for older versions and pg >= 13. The commit on Postgres side: 2f9661311b83dc481fc19f6e3bda015392010a40	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	9f1ec792b3	add queryString to distributed_planner distributed_planner now takes query string as a parameter. related commit on PG side: 6aba63ef3e606db71beb596210dd95fa73c44ce2	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	1a7ccac6ef	Add RangeTableEntryFromNSItem macro addRangeTableEntryXXX methods return a ParseNamespaceItem with pg >= 13. RangeTableEntryFromNSItem macro is added so that we return the range table entry from the ParseNamespaceItem in pg>=13 and for pg < 13 rte would already be returned with addRangeTableEntryXXX methods. Commit on Postgres side: 5815696bc66b3092f6361f53e0394909647042c8	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	4ed30a0824	create Set_ptr_value Since PG13 changed the list, a listcell doesn't contain data anymore. Therefore Set_ptr_value macro is created, so that depending on the version it will either use cell->data.ptr_value or cell->ptr_value. Commit on Postgres side: 1cff1b95ab6ddae32faa3efe0d95a820dbfdc164	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	ab85a8129d	map varoattno and varnoold fields in Var With PG13 varoattno and varnoold fields were renamed as varattnosyn and varnosyn. A macro is defined for these. Commit on Postgres side: 9ce77d75c5ab094637cc4a446296dc3be6e3c221 Command on Postgres side: git log --all --grep="varoattno"	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	688ab16bba	Introduce ExplainOnePlanCompat Since ExplainOnePlan expects BufferUsage as well with PG >= 13, ExplainOnePlanCompat is added. Commit on Postgres side: ed7a5095716ee498ecc406e1b8d5ab92c7662d10	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	6314eba5df	introduce standard_planner_compat standard_planner now takes the query string as a parameter as well with pg >= 13. Commit on Postgres Side: 66888f7424f7d6c7cea2c26e181054d1455d4e7a	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	991f49efc9	introduce getOwnedSequencesCompat macro Commit on Postgres side: 19781729f789f3c6b2540e02b96f8aa500460322	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	01632c56a0	Change utils/hashutils.h to common/hashfn.h for PG >= 13 Commit on postgres side: 05d8449e73694585b59f8b03aaa087f04cc4679a Command on postgres side: git log --all --grep="hashutils" include common/hashfn.h for pg >= 13 tag_hash was moved from hsearch.h to hashutils.h then to hashfn.h Commits on Postgres side: 9341c783cc42ffae5860c86bdc713bd47d734ffd	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	00e7386007	introduce PortalDefineQuerySelectCompat PortalDefineQuery doesn't accept char* for command tag anymore with PG >= 13. We are currently only using it with Select, therefore a Portal define query compat for select is created. Commit on PG side: 2f9661311b83dc481fc19f6e3bda015392010a40	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	62879ee8c1	introduce planner_compat and pg_plan_query_compat macros As the new planner and pg_plan_query methods expect the query string as well, macros are defined to be compatible in different versions of postgres. Relevant commit on Postgres: 6aba63ef3e606db71beb596210dd95fa73c44ce2 Command on Postgres: git log --all --grep="pg_plan_query"	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	bf831d2e59	Use table_openXXX methods in the codebase With PG13 heap_* (heap_open, heap_close etc) are replaced with table_* (table_open, table_close etc). It is better to use the new table access methods in the codebase and define the macros for the previous versions as we can easily remove the macro without having to change the codebase when we drop the support for the old version. Commits that introduced this change on Postgres: f25968c49697db673f6cd2a07b3f7626779f1827 e0c4ec07284db817e1f8d9adfb3fffc952252db0 4b21acf522d751ba5b6679df391d5121b6c4a35f Command to see relevant commits on Postgres side: git log --all --grep="heap_open"	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	0819b79631	introduce list compat macros Pass the list to lnext API lnext API now expects the list as well. The commit on Postgres that introduced the change: 1cff1b95ab6ddae32faa3efe0d95a820dbfdc164 lnext_compat and list_delete_cell_compat macros are introduced so that we can use these macros in the codebase without having to use #if directives in the codebase. Related commit on postgres: 1cff1b95ab6ddae32faa3efe0d95a820dbfdc164 Command to search in postgres: git log --all --grep="list_delete_cell" add ListCellAndListWrapper When iterating a list in separate function calls, we need both the list and the current cell starting from PG13, therefore ListCellAndListWrapper is added to store both as a wrapper. Use ListCellAndListWrapper in foreign key test udfs As we iterate a list in these udfs using a functionContext, we need to use the wrapper to be able to access both the list and the current cell.	2020-08-04 15:10:22 +03:00
Sait Talha Nisanci	8ce8683ac4	Update ruleutils_13.c with postgres ruleutils Some manual updates are done for ruleutils_13 based on the difference between pg12 ruleutils and pg13 ruleutils.	2020-08-04 13:34:13 +03:00
Sait Talha Nisanci	30549dc0e2	add copy of ruleutils_12 as ruleutils_13	2020-08-04 13:34:13 +03:00
Sait Talha Nisanci	58643a4098	Enable postgres 13 in configure	2020-08-04 13:34:13 +03:00
Önder Kalacı	c79c6506b9	Merge pull request #4034 from citusdata/copy_shared_pool_size_with_reservations Implement shared connection count reservation & enable `citus.max_shared_pool_size` for COPY	2020-08-03 19:03:24 +02:00
Onder Kalaci	eeb8c81de2	Implement shared connection count reservation & enable `citus.max_shared_pool_size` for COPY With this patch, we introduce `locally_reserved_shared_connections.c/h` files which are responsible for reserving some space in shared memory counters upfront. We sometimes need to reserve connections, but not necessarily establish them. For example: - COPY command should reserve connections as it cannot know which connections it needs in which order. COPY establishes connections as any input data hits the workers. For example, for router COPY command, it only establishes 1 connection. As discussed here (https://github.com/citusdata/citus/pull/3849#pullrequestreview-431792473), COPY needs to reserve connections up-front, otherwise we can end up with resource starvation/un-detected deadlocks.	2020-08-03 18:51:40 +02:00
Onur Tirtir	066860a98a	Merge pull request #3966 from citusdata/citus-9.4.0-changelog-1593605812 Add changelog entry for 9.4.0	2020-07-28 16:01:31 +03:00
Onur Tirtir	c7f97a9e01	Update CHANGELOG for 9.4.0	2020-07-28 14:40:45 +03:00
nukoyluoglu	38987431e7	propagation of CHECK statements to workers with parentheses (#4039 ) * ensure propagation of CHECK statements to workers with parentheses & adjust regression test outputs * add tests for distributing tables with simple CHECK constraints * added test for CHECK on bool variable	2020-07-27 15:08:37 +03:00
Benjamin Satzger	a35a15a513	Distribute custom aggregates with multiple arguments (#4047 ) Enable custom aggregates with multiple parameters to be executed on workers. #2921 introduces distributed execution of custom aggregates. One of the limitations of this feature is that only aggregate functions with a single aggregation parameter can be pushed to worker nodes. Aim of this change is to remove that limitation and support handling of multi-parameter aggregates. Resolves: #3997 See also: #2921	2020-07-24 15:16:00 -07:00
Onur Tirtir	c73563c340	Merge pull request #4065 from citusdata/update-cl-935 Update CHANGELOG for 9.3.5	2020-07-24 18:49:43 +03:00
Onur Tirtir	c2ba9a4844	Update CHANGELOG for 9.3.5	2020-07-24 15:58:52 +03:00
Halil Ozan Akgül	a8c7a3e2ac	Merge pull request #4060 from citusdata/create-index-concurrently-local-table-bug Fixes CREATE INDEX CONCURRENTLY bug	2020-07-24 14:32:48 +03:00
Halil Ozan Akgul	38b72ddd66	Fixes create index concurrently bug	2020-07-24 12:14:14 +03:00
SaitTalhaNisanci	ef841115de	Fix int32 overflow and use PG macros for INT32_XX (#4061 ) * Use CalculateUniformHashRangeIndex in HashPartitionId INT32_MIN definition can change among different platforms hence it is possible to get overflow, we would see crashes because of this in debian distros. We have already solved a similar problem with introducing CalculateUniformHashRangeIndex method, hence to solve it we can use the same method, this also removes some duplication and has a single place to decide that. * Use PG_INT32_XX instead of INT32_XX to be safer	2020-07-23 18:30:08 +03:00
Halil Ozan Akgül	e9f89ed651	Fixes the non existing table bug (#4058 )	2020-07-23 18:01:21 +03:00
Önder Kalacı	770610ab11	Merge pull request #4055 from citusdata/improve_find_available_connection Make FindAvailableConnection() more strict	2020-07-23 16:09:41 +02:00
Onder Kalaci	a2f53dff74	Make FindAvailableConnection() more strict With adaptive connection management, we might have some connections which are not fully initialized. Those connections should not be qualified as available.	2020-07-23 15:59:50 +02:00
Önder Kalacı	20a46f8f57	Merge pull request #4054 from citusdata/rename_connection_flag Minor refactorings in COPY command execution	2020-07-23 15:58:16 +02:00
Onder Kalaci	cfb633601d	Minor refactorings in COPY command execution 1) Rename CONNECTION_PER_PLACEMENT to REQUIRE_CLEAN_CONNECTION. This is mostly to make things clear as the new name reveals more. 2) We also make sure to mark all the copy connections critical, even if they are accessed earlier in the transaction	2020-07-23 15:36:19 +02:00
SaitTalhaNisanci	64469708af	separate the logic in ManageWorkerPool (#3298 )	2020-07-23 13:47:35 +03:00
Önder Kalacı	2c8066a313	Merge pull request #4052 from citusdata/refactor_adaptive_flags Move executor specific logic to a function	2020-07-22 16:31:15 +02:00
Onder Kalaci	52c0fccb08	Move executor specific logic to a function Because as we're planning to use the same logic, it'd be nice to use the exact same functions.	2020-07-22 15:09:47 +02:00
Önder Kalacı	d03c4aff2d	Merge pull request #4053 from citusdata/unify_node_comparisions Unify node sort ordering	2020-07-22 11:23:18 +02:00
Onder Kalaci	ff6555299c	Unify node sort ordering The executor relies on WorkerPool, and many other places rely on WorkerNode. With this commit, we make sure that they are sorted via the same function/logic.	2020-07-22 11:03:25 +02:00
SaitTalhaNisanci	40405cd978	Merge pull request #4042 from citusdata/cleanup/task-tracker Clean up task-tracker related comments documentation tests	2020-07-21 16:52:22 +03:00
Sait Talha Nisanci	01c23b0df2	update test outputs with task-tracker removal	2020-07-21 16:25:08 +03:00
Sait Talha Nisanci	1dbd545cf4	replace task-tracker with adaptive in tests	2020-07-21 16:21:01 +03:00
Sait Talha Nisanci	4308d867d9	remove task-tracker in comments, documentation	2020-07-21 16:21:01 +03:00
Sait Talha Nisanci	a3dc8fe2b5	remove occurrences of task-tracker from gucs	2020-07-21 16:19:46 +03:00
Onur Tirtir	6aa29abd86	Merge pull request #4049 from citusdata/update-cl-934 Update CHANGELOG for 9.3.4	2020-07-21 14:03:11 +03:00
Onur Tirtir	9e12a39cb7	Update CHANGELOG for 9.3.4	2020-07-21 10:26:06 +03:00
Hanefi Onaldi	61bc47e6a8	Merge pull request #4035 from citusdata/fix-4012 Split list of configuration values properly	2020-07-21 04:24:17 +03:00
Hanefi Önaldı	e534dbae4a	Accept list of values in a supported ALTER ROLE .. SET statement Some GUCs support a list of values which is indicated by GUC_LIST_INPUT flag. When an ALTER ROLE .. SET statement is executed, the new configuration default for affected users and databases are stored in the setconfig(text[]) column in a pg_db_role_setting record. If a GUC that supports a list of values is used in an ALTER ROLE .. SET statement, we need to split the text into items delimited by commas.	2020-07-21 03:49:57 +03:00
Nils Dijk	00a4a15d95	fix sorting on string literal (#4045 ) As noted by Talha https://github.com/citusdata/citus/pull/4029#issuecomment-660466972 there was still some sort order flappiness in the test. The root cause is that sorting on `1::text` sorts on the literal `'1'` which causes sorting to be nondeterministic. This behaviour is consistent with Postgres' behaviour, so no bug on Citus' side.	2020-07-20 17:39:27 +02:00
Önder Kalacı	32d9cce8a2	Merge pull request #4041 from citusdata/remove_router_executable_flag Remove `routerExecutable` flag from `DistributedPlan`	2020-07-20 16:03:37 +02:00
Onder Kalaci	c25de2cf22	Remove `routerExecutable` flag from `DistributedPlan` As it doesn't make any sense anymore	2020-07-20 12:45:05 +02:00
SaitTalhaNisanci	b3af63c8ce	Remove task tracker executor (#3850 ) * use adaptive executor even if task-tracker is set * Update check-multi-mx tests for adaptive executor Basically repartition joins are enabled where necessary. For parallel tests max adaptive executor pool size is decreased to 2, otherwise we would get too many clients error. * Update limit_intermediate_size test It seems that when we use adaptive executor instead of task tracker, we exceed the intermediate result size less in the test. Therefore updated the tests accordingly. * Update multi_router_planner It seems that there is one problem with multi_router_planner when we use adaptive executor, we should fix the following error: +ERROR: relation "authors_range_840010" does not exist +CONTEXT: while executing command on localhost:57637 * update repartition join tests for check-multi * update isolation tests for repartitioning * Error out if shard_replication_factor > 1 with repartitioning As we are removing the task tracker, we cannot switch to it if shard_replication_factor > 1. In that case, we simply error out. * Remove MULTI_EXECUTOR_TASK_TRACKER * Remove multi_task_tracker_executor Some utility methods are moved to task_execution_utils.c. * Remove task tracker protocol methods * Remove task_tracker.c methods * remove unused methods from multi_server_executor * fix style * remove task tracker specific tests from worker_schedule * comment out task tracker udf calls in tests We were using task tracker udfs to test permissions in multi_multiuser.sql. We should find some other way to test them, then we should remove the commented out task tracker calls. 
* remove task tracker test from follower schedule * remove task tracker tests from multi mx schedule * Remove task-tracker specific functions from worker functions * remove multi task tracker extra schedule * Remove unused methods from multi physical planner * remove task_executor_type related things in tests * remove LoadTuplesIntoTupleStore * Do initial cleanup for repartition leftovers During startup, task tracker would call TrackerCleanupJobDirectories and TrackerCleanupJobSchemas to clean up leftover directories and job schemas. With adaptive executor, while doing repartitions it is possible to leak these things as well. We don't retry cleanups, so it is possible to have leftover in case of errors. TrackerCleanupJobDirectories is renamed as RepartitionCleanupJobDirectories since it is repartition specific now, however TrackerCleanupJobSchemas cannot be used currently because it is task tracker specific. The thing is that this function is a no-op currently. We should add cleaning up intermediate schemas to DoInitialCleanup method when that problem is solved(We might want to solve it in this PR as well) * Revert "remove task tracker tests from multi mx schedule" This reverts commit `03ecc0a681`. * update multi mx repartition parallel tests * not error with task_tracker_conninfo_cache_invalidate * not run 4 repartition queries in parallel It seems that when we run 4 repartition queries in parallel we get too many clients error on CI even though we don't get it locally. Our guess is that, it is because we open/close many connections without doing some work and postgres has some delay to close the connections. Hence even though connections are removed from the pg_stat_activity, they might still not be closed. 
If the above assumption is correct, it is unlikely for it to happen in practice because: - There is some network latency in clusters, so this leaves some time for connections to be able to close - Repartition joins return some data and that also leaves some time for connections to be fully closed. As we don't get this error locally, we currently assume that it is not a bug. Ideally this wouldn't happen when we get rid of the task-tracker repartition methods because they don't do any pruning and might be opening more connections than necessary. If this still gives us "too many clients" error, we can try to increase the max_connections in our test suite(which is 100 by default). Also there are different places where this error is given in postgres, but adding some backtrace it seems that we get this from ProcessStartupPacket. The backtraces can be found in this link: https://circleci.com/gh/citusdata/citus/138702 * Set distributedPlan->relationIdList when it is needed It seems that we were setting the distributedPlan->relationIdList after JobExecutorType is called, which would choose task-tracker if replication factor > 1 and there is a repartition query. However, it uses relationIdList to decide if the query has a repartition query, and since it was not set yet, it would always think it is not a repartition query and would choose adaptive executor when it should choose task-tracker. * use adaptive executor even with shard_replication_factor > 1 It seems that we were already using adaptive executor when replication_factor > 1. So this commit removes the check. 
* remove multi_resowner.c and deprecate some settings * remove TaskExecution related leftovers * change deprecated API error message * not recursively plan single relation repartition subquery * recursively plan single relation repartition subquery * test deprecated task tracker functions * fix overlapping shard intervals in range-distributed test * fix error message for citus_metadata_container * drop task-tracker deprecated functions * put the implementation back to worker_cleanup_job_schema_cache since citus cloud uses it * drop some functions, add downgrade script Some deprecated functions are dropped. Downgrade script is added. Some gucs are deprecated. A new guc for repartition joins bucket size is added. * order by a test to fix flappiness	2020-07-18 13:11:36 +03:00
Hadi Moshayedi	339d43357c	Merge pull request #4037 from citusdata/remove_per_placement_query Refactor: Use TupleDestination API for partitioning in insert/select.	2020-07-17 10:20:07 -07:00
Hadi Moshayedi	13003d8d05	Use TupleDestination API for partitioning in insert/select.	2020-07-17 09:43:46 -07:00
Marco Slot	f323033ce8	Merge pull request #4036 from citusdata/fix/overflow	2020-07-16 14:57:38 +02:00
Marco Slot	b823f2127d	Prevent integer overflow in FindShardIntervalIndex	2020-07-16 14:30:56 +02:00
Nils Dijk	d0b6e62c9a	change wording to allowlist and the likes (#3906 ) In the same line as #3904 Change wording to better reflect use and remove words that enforce/maintain bias.	2020-07-15 16:24:40 +02:00
Marco Slot	1baf6c3a45	Merge pull request #3976 from citusdata/fix/foreign-key-to-local-table-hint Improve error message when creating a foreign key to a local table	2020-07-15 14:30:19 +02:00
Marco Slot	e09860e9e3	Merge pull request #3991 from citusdata/fix/remove-level-assert Remove executor/planner level asserts in abort handler	2020-07-14 22:43:24 +02:00
SaitTalhaNisanci	bc011a6286	Add IsCitusTable check to citus table utilities (#4028 )	2020-07-14 18:29:33 +03:00
Nils Dijk	23d44eba9f	fix flappy tests due to nondeterministic order of test output (#4029 ) As reported on #4011 https://github.com/citusdata/citus/pull/4011/files#r453804702 some of the tests were flapping due to a nondeterministic order for test outputs. This PR makes the test output ordered for all tests returning non-zero rows. Needs to be backported to 9.2, 9.3, 9.4	2020-07-14 15:47:29 +02:00
Hanefi Onaldi	8189415731	Merge pull request #4004 from citusdata/move-downgrades	2020-07-14 13:56:33 +03:00
Hanefi Önaldı	315b323d47	Introduce new make targets for downgrade scripts Here are the updated make targets: - install: install everything except downgrade scripts. - install-downgrades: build and install only the downgrade migration scripts. - install-all: install everything along with the downgrade migration scripts.	2020-07-14 13:10:18 +03:00
SaitTalhaNisanci	ab5be77709	test coordinator reference-distributed table join (#3698 )	2020-07-14 11:43:03 +03:00
SaitTalhaNisanci	fd760fa4b3	Merge pull request #4005 from citusdata/fix/coordinator_repartition_join Send commands to coordinator when it is added as a worker	2020-07-13 20:22:43 +03:00
Sait Talha Nisanci	1b5ed45a58	add multi follower repartition tests	2020-07-13 19:50:50 +03:00
Sait Talha Nisanci	510535f558	address feedback	2020-07-13 19:45:02 +03:00
Sait Talha Nisanci	41ec76a6ad	use ActiveReadableNodeList in JobExecutorType and task tracker The reason we should use ActiveReadableNodeList instead of ActiveReadableNonCoordinatorNodeList is that if coordinator is added to cluster as a worker, it should be counted as well. Otherwise if there is only coordinator in the cluster, the count will be 0, hence we get a warning. In MultiTaskTrackerExecute, we should connect to coordinator if it is added to the cluster because it will also be assigned tasks.	2020-07-13 19:45:02 +03:00
Sait Talha Nisanci	d97d03ec65	use ActivePrimaryNodeList to include coordinator ActiveReadableWorkerNodeList doesn't include coordinator, however if coordinator is added as a worker, we should also include that while planning. The current methods are very easily misusable and this requires a refactoring to make the distinction between methods that include coordinator and that don't very explicit as they can introduce subtle/major bugs pretty easily.	2020-07-13 19:20:15 +03:00
Sait Talha Nisanci	db1b78148c	send schema creation/cleanup to coordinator in repartitions We were using ALL_WORKERS TargetWorkerSet while sending temporary schema creation and cleanup. We (well, mostly I) thought that ALL_WORKERS would also include coordinator when it is added as a worker. It turns out that it was FILTERING OUT the coordinator even if it is added as a worker to the cluster. So to have some context here, in repartitions, for each jobId we create (at least we were supposed to) a schema in each worker node in the cluster. Then we partition each shard table into some intermediate files, which is called the PARTITION step. So after this partition step each node has some intermediate files having tuples in those nodes. Then we fetch the partition files to necessary worker nodes, which is called the FETCH step. Then from the files we create intermediate tables in the temporarily created schemas, which is called a MERGE step. Then after evaluating the result, we remove the temporary schemas (one for each job ID in each node) and files. If node 1 has file1, and node 2 has file2 after PARTITION step, it is enough to either move file1 from node1 to node2 or vice versa. So we prune one of them. In the MERGE step, if the schema for a given jobID doesn't exist, the node tries to use the `public` schema if it is a superuser, which is actually added for testing in the past. So when we were not sending schema creation commands for each job ID to the coordinator (because we were using ALL_WORKERS flag, and it doesn't include the coordinator), we would basically not have any schemas for repartitions in the coordinator. The PARTITION step would be executed on the coordinator (because the tasks are generated in the planner part) and it wouldn't give us any error because it doesn't have anything to do with the temporary schemas (that we didn't create). 
But later two things would happen: - If by chance the fetch is pruned on the coordinator side, the other nodes would fetch the partitioned files from the coordinator and execute the query as expected, because it has all the information. - If the fetch tasks are not pruned in the coordinator, in the MERGE step, the coordinator would either error out saying that the necessary schema doesn't exist, or it would try to create the temporary tables under public schema (if it is a superuser). But then if we had the same task ID with different jobID it would fail saying that the table already exists, which is an error we were getting. In the first case, the query would work okay, but it would still not do the cleanup, hence we would leave the partitioned files from the PARTITION step there. Hence ensure_no_intermediate_data_leak would fail. To make things more explicit and prevent such bugs in the future, ALL_WORKERS is renamed to ALL_NON_COORD_WORKERS. And a new flag to return all the active nodes is added as ALL_DATA_NODES. For repartition case, we don't use the only-reference table nodes but this version makes the code simpler and there shouldn't be any significant performance issue with that.	2020-07-13 19:20:15 +03:00
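The distinction described in the commit above can be illustrated with a small sketch. This is not Citus code (Citus is written in C); it is a Python illustration that only borrows the two flag names from the commit message. The `Node` type, its `is_coordinator` field, and the `target_nodes` helper are hypothetical names introduced for this example.

```python
from dataclasses import dataclass
from enum import Enum, auto


class TargetWorkerSet(Enum):
    # Flag names taken from the commit message; semantics as described there.
    ALL_NON_COORD_WORKERS = auto()  # formerly ALL_WORKERS: skips the coordinator
    ALL_DATA_NODES = auto()         # all active nodes, coordinator included


@dataclass(frozen=True)
class Node:
    # Hypothetical node record used only for this illustration.
    name: str
    is_coordinator: bool


def target_nodes(active_nodes, target_set):
    """Return the nodes a command would be sent to for the given target set."""
    if target_set is TargetWorkerSet.ALL_NON_COORD_WORKERS:
        return [node for node in active_nodes if not node.is_coordinator]
    return list(active_nodes)


# A cluster where the coordinator has also been added as a worker.
cluster = [Node("coordinator", True), Node("worker-1", False)]
old_targets = target_nodes(cluster, TargetWorkerSet.ALL_NON_COORD_WORKERS)
new_targets = target_nodes(cluster, TargetWorkerSet.ALL_DATA_NODES)
```

With the old flag the coordinator never receives the schema creation and cleanup commands, which is the gap the commit closes by switching to the set that includes every active node.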
SaitTalhaNisanci	76ddb85545	improve error message in secondaries (#4025 )	2020-07-13 19:18:57 +03:00
Nils Dijk	449d1f0e91	force aliases in deparsing for queries with anonymous column references (#4011 ) DESCRIPTION: Force aliases in deparsing for queries with anonymous column references Fixes: #3985 The root cause has to do with discrepancies in the query tree we create. I think in the future we should spend some time on categorising all changes we made to ruleutils and see if we can change the data structure `query` we pass to the deparser to have an actual valid postgres query for the deparser to render. For now the fix is to also keep track of whether we have a reference to an anonymous column, besides changing the names of the entries in the target list. If there are anonymous columns we set the `printaliases` flag to true which forces the deparser to add the aliases.	2020-07-13 16:29:24 +02:00
Marco Slot	9cb8dc9d12	Improve error message when creating a foreign key to a local table	2020-07-13 13:57:22 +02:00
Marco Slot	5fbb925df1	Remove level asserts in abort handler	2020-07-12 22:54:35 +02:00
Onur Tirtir	50b2c5a7aa	Merge pull request #4023 from citusdata/dont-check-merge-to-enterprise-release Don't check-merge-to-enterprise for release branches	2020-07-10 19:59:32 +03:00
Onur Tirtir	1c6439d1af	Don't run check-merge-to-enterprise for release branches	2020-07-10 18:28:35 +03:00
Onur Tirtir	f3a01482b4	Merge pull request #4021 from citusdata/update-cl-0710-93 Update CHANGELOG for 9.3.3	2020-07-10 17:47:28 +03:00
Onur Tirtir	4c26bb5ffc	Update CHANGELOG for 9.3.3	2020-07-10 15:01:42 +03:00
SaitTalhaNisanci	b8830d063f	remove no-op check in TaskListRequires2PC (#4018 ) We already return true if replication model is REPLICATION_MODEL_2PC at the very beginning of the function, hence the check later is not used.	2020-07-10 14:16:23 +03:00
SaitTalhaNisanci	15290bc43b	remove unused worker methods (#4017 )	2020-07-10 13:45:55 +03:00
SaitTalhaNisanci	3f50165365	rename TargetWorkerSet enums (#4015 ) Rename TargetWorkerSet enums to make them more explicit about what they mean. Ideally it would be good to treat everything as a node without the 'worker' concept because it makes things complicated. Another improvement could be to rename TargetWorkerSet as TargetNodeSet but it goes to renaming many occurrences of Worker, which is probably too big for this PR.	2020-07-10 11:21:27 +03:00
Hadi Moshayedi	b642ed10e9	Merge pull request #4000 from citusdata/fix_subxact_memory_leak Fix a memory leak in subtransaction handling	2020-07-09 12:43:49 -07:00
Hadi Moshayedi	3651fc64ee	Fix Subtransaction memory leak	2020-07-09 12:33:39 -07:00
Jelte Fennema	4c68ed4c33	Make static analysis happier (#4008 ) Some small non-functional changes to make static analysis happy.	2020-07-09 16:04:27 +02:00
Jelte Fennema	759e628dd5	Handle some NULL issues that static analysis found (#4001 ) Static analysis found some issues where we used the result from ExtractResultRelationRTE, without checking that it wasn't NULL. It seems like in all these cases it can never actually be NULL, since we have checked before that it isn't a SELECT query. So, this PR is mostly to make static analysis happy (and protect a bit against future changes of the code).	2020-07-09 15:46:42 +02:00
SaitTalhaNisanci	96adce77d6	rename node/worker utilities (#4003 ) The names were not explicit about what they do, and we have many misusages in the codebase, so they are renamed to be more explicit.	2020-07-09 15:30:35 +03:00
Jelte Fennema	16242d5264	Fix write queries with const expressions and COLLATE in various places (#3973 )	2020-07-08 18:19:53 +02:00
Jelte Fennema	ab01571c9e	Fix crash with single node dummy placement (#3993 ) Static analysis found an issue where we could dereference `NULL`, because `CreateDummyPlacement` could return `NULL` when there were no workers. This PR changes it so that it never returns `NULL`, which was intended by @marcocitus when doing this change: https://github.com/citusdata/citus/pull/3887/files#r438136433 While adding tests for citus on a single node I also added some more basic tests and it turns out we error out on repartition joins. This has been present since `shouldhaveshards` was introduced and is not trivial to fix. So I created a separate issue for this: https://github.com/citusdata/citus/issues/3996	2020-07-08 17:11:25 +02:00
Jelte Fennema	f6e2f1b1cb	Replace words that have bad associations (#3992 ) We had a few words in our codebase that static analysis flagged as having bad associations.	2020-07-08 14:57:48 +02:00
Onur Tirtir	844221bb9f	Refactor utility hook global state changes (#3990 )	2020-07-08 10:44:00 +03:00
Hadi Moshayedi	cf9a72c3c8	Merge pull request #3988 from citusdata/fix_memory_issue Fix task->fetchedExplainAnalyzePlan memory issue.	2020-07-07 08:14:02 -07:00
Hadi Moshayedi	23fa421639	Fix task->fetchedExplainAnalyzePlan memory issue.	2020-07-07 07:58:02 -07:00
Philip Dubé	1c9810e395	Merge pull request #3977 from citusdata/fix-routing-unreferenced-modifying-ctes ruleutils: use get_rtable_name for deparsing resultRelation	2020-07-07 12:35:39 +00:00
Philip Dubé	444472ffc6	ruleutils: use get_rtable_name for deparsing resultRelation	2020-07-07 12:20:41 +00:00
Marco Slot	0b32a80f58	Merge pull request #3922 from citusdata/fix/coordinator-evaluation	2020-07-07 10:59:54 +02:00
citus bot	f0693e2f75	Remove unused MaxMasterConnectionCount function	2020-07-07 10:37:57 +02:00
citus bot	bdfeb380d3	Fix some more master->coordinator comments	2020-07-07 10:37:53 +02:00
Marco Slot	b4fec63bc0	Rename master evaluation to coordinator evaluation	2020-07-07 10:37:41 +02:00
Hadi Moshayedi	23ffaabe52	Merge pull request #3978 from citusdata/fix/explain_analyze Fix explain subplan duration	2020-07-03 21:57:45 -07:00
Sait Talha Nisanci	4d217819ff	Fix explain subplan duration	2020-07-03 20:39:55 +03:00
Jelte Fennema	8ab47f4f37	Add a CI check to see if all tests are part of a schedule (#3959 ) I recently forgot to add tests to a schedule in two of my PRs. One of these was caught by review, but the other one was not. This adds a script that causes CI to ensure that each test in the repo is included in at least one schedule. Three tests were found that were currently not part of a schedule. This PR adds those three tests to a schedule as well and it also fixes some small issues with these tests.	2020-07-03 11:34:55 +02:00
Jelte Fennema	c19957f90c	Merge pull request #3958 from citusdata/document-ci-tooling-better We keep accumulating more and more scripts to flag issues in CI. This is good, but we are currently missing consistent documentation for them. This commit moves all these scripts to the `ci` directory and adds some documentation for all of them in the README. It also makes sure that the last line of output of a failed script points to this documentation.	2020-07-03 11:11:25 +02:00
Jelte Fennema	9311978487	Add README for CI scripts We keep accumulating more and more scripts to flag issues in CI. This is good, but we are currently missing consistent documentation for them. This commit moves all these scripts to the `ci` directory and adds some documentation for all of them in the README. It also makes sure that the last line of output of a failed script points to this documentation.	2020-07-03 10:22:48 +02:00
Önder Kalacı	c0325d55d3	Merge pull request #3968 from citusdata/fix_bin_protocol Fix default value of EnableBinaryProtocol	2020-07-02 14:18:20 +02:00
Onder Kalaci	aa8a2866f3	Fix default value of EnableBinaryProtocol	2020-07-02 13:44:56 +02:00
Onur Tirtir	14c6861e5d	Merge pull request #3963 from citusdata/master-update-version-1593590518 Bump Citus to 9.5devel	2020-07-01 15:54:35 +03:00
Onur Tirtir	be17ebb334	Bump citus version to 9.5devel	2020-07-01 14:46:55 +03:00
Hanefi Onaldi	8913d63ae2	Merge pull request #3927 from citusdata/downgrade-paths	2020-07-01 10:48:50 +03:00
Hanefi Önaldı	ca2ececb3b	Downgrade path from 9.4 to 9.3 to 9.2	2020-07-01 10:38:11 +03:00
Hadi Moshayedi	dd5277418f	Merge pull request #3961 from citusdata/fix/constant-pushdown Don't push expressions to workers when aggregating without GROUP BY.	2020-06-30 14:06:15 -07:00
Sait Talha Nisanci	e5a21f07cb	test aggregates with expressions	2020-06-30 11:41:16 -07:00
Marco Slot	eeffbde8bd	Fix pushdown of constants in aggregate queries	2020-06-30 11:41:16 -07:00
Jelte Fennema	392c5e2c34	Fix wrong cancellation message about distributed deadlocks (#3956 )	2020-06-30 14:57:46 +02:00
Marco Slot	634d6cf9d7	Improve performance of metadata cache (#3924 ) #3866 removed the shard ID hash in metadata_cache.c to simplify cache management, but we observed a significant performance regression that was being masked by the performance improvement provided by #3654 in our benchmarks, but #3654 only applies to specific workloads. This PR brings back the shard ID cache as it existed before #3866 with some extra measures to handle invalidation. When we load a table entry, we overwrite ShardIdCacheEntry->tableEntry pointers for all the shards in that table, though it's possible that the table no longer contains the old shard ID or the table entry is never reloaded, which would leave a dangling pointer once the table entry is freed. To handle that case, we remove all shard ID cache entries that point exactly to that table entry when a table is freed (at the end of the transaction or any call to CitusTableCacheFlushInvalidatedEntries). Co-authored-by: SaitTalhaNisanci <s.talhanisanci@gmail.com> Co-authored-by: Marco Slot <marco.slot@gmail.com> Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2020-06-30 12:10:10 +02:00
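The invalidation rule described in the commit above (drop every shard ID cache entry that points exactly to the table entry being freed) can be sketched as follows. This is a Python illustration of the idea, not the C code in metadata_cache.c; `TableEntry`, `shard_id_cache`, `load_table_entry`, and `free_table_entry` are hypothetical names, and the shard and relation IDs are made up.

```python
class TableEntry:
    """Hypothetical stand-in for a cached table entry."""

    def __init__(self, relation_id, shard_ids):
        self.relation_id = relation_id
        self.shard_ids = list(shard_ids)


# Shard ID -> table entry that currently owns it (hypothetical structure).
shard_id_cache = {}


def load_table_entry(entry):
    """Point every shard of the table at the freshly loaded entry."""
    for shard_id in entry.shard_ids:
        shard_id_cache[shard_id] = entry


def free_table_entry(entry):
    """Remove only the shard entries that still point at this exact entry."""
    stale = [sid for sid, cached in shard_id_cache.items() if cached is entry]
    for sid in stale:
        del shard_id_cache[sid]


old = TableEntry(relation_id=16384, shard_ids=[102008, 102009])
load_table_entry(old)

# The table is reloaded and no longer contains shard 102008.
new = TableEntry(relation_id=16384, shard_ids=[102009, 102010])
load_table_entry(new)

# Freeing the old entry must not leave 102008 pointing at freed memory,
# and must not disturb the shards that were already re-pointed.
free_table_entry(old)
```

Comparing by identity (`is`) rather than by relation ID is what keeps the shards already overwritten by the newer entry intact.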
Jelte Fennema	02fa942be1	Fix assertion error when rolling back to savepoint (#3868 ) It was possible to get an assertion error, if a DML command was cancelled that opened a connection and then "ROLLBACK TO SAVEPOINT" was used to continue the transaction. The reason for this was that canceling the transaction might leave the `claimedExclusively` flag on for (some of) its connections. This caused an assertion failure because `CanUseExistingConnection` would return false and a new connection would be opened, and then there would be two connections doing DML for the same placement, which is disallowed. That this situation caused an assertion failure instead of an error means that without asserts this could possibly result in some visibility bugs, similar to the ones described in https://github.com/citusdata/citus/issues/3867	2020-06-30 11:31:46 +02:00
SaitTalhaNisanci	e28683a025	Upgrade codecov orb in circleci (#3945 ) The only reason for this upgrade is to see if it will fix codecov pushing the coverage many times to PRs, which is cluttering the PRs. The reason for this change is that it is possible that "pushing many times" is related to codecov internals so upgrading can help.	2020-06-30 11:33:21 +03:00
Hadi Moshayedi	d022f80340	Merge pull request #3943 from citusdata/fix_explain_2 Report correct INSERT/SELECT method in EXPLAIN	2020-06-26 08:21:50 -07:00
Hadi Moshayedi	4ed59d2db3	Move more from insert_select_executor to insert_select_planner	2020-06-26 08:08:26 -07:00
Hadi Moshayedi	d34c21890f	Rename CoordinatorInsertSelect... to NonPushableInsertSelect	2020-06-25 08:55:48 -07:00
Hadi Moshayedi	cd25a27174	Fix crash caused by EXPLAIN EXECUTE INSERT ... SELECT	2020-06-25 08:55:48 -07:00
Hadi Moshayedi	4e8d79998e	Save INSERT/SELECT method in DistributedPlan. This is so we don't need to calculate it twice in insert_select_executor.c and multi_explain.c, which can cause discrepancy if an update in one of them is not reflected in the other site.	2020-06-25 08:55:48 -07:00
Jelte Fennema	64506143e4	Replace flaky repartition analyze test with a non flaky one (#3950 ) The flaky test was introduced in #3941. This removes that flaky test and adds a new one that fails in the same manner when removing the fix in #3941. An example of a random failure can be found here: https://app.circleci.com/pipelines/github/citusdata/citus/9558/workflows/de76e7a5-6558-46c9-97e7-8b1dae1f173b/jobs/135876/steps	2020-06-25 15:19:15 +02:00
SaitTalhaNisanci	50e115fe3a	test task tracker repartition with replication >1 (#3944 )	2020-06-24 14:54:20 +03:00
SaitTalhaNisanci	f458d1fd1c	Fix/task execution (#3941 ) * Not set TaskExecution with adaptive executor Adaptive executor is using a utility method from task tracker for repartition joins, however adaptive executor doesn't need taskExecution. It is only used by task tracker. This causes a problem when explain analyze is used because what taskExecution is pointing to might be random. We solve this by not setting taskExecution from adaptive executor. So it will stay NULL as set by CreateTask. * use same memory context as task for taskExecution Co-authored-by: Jelte Fennema <github-tech@jeltef.nl>	2020-06-24 12:10:00 +03:00
Philip Dubé	ac3c646ed5	Merge pull request #3942 from citusdata/fix-default-func-param-evaluation citus_evaluate_expression: call expand_function_arguments beforehand to avoid segfaulting on implicit parameters	2020-06-23 18:37:40 +00:00
Philip Dubé	cd0b2ad5b5	citus_evaluate_expression: call expand_function_arguments beforehand to avoid segfaulting on implicit parameters	2020-06-23 18:06:46 +00:00
Jelte Fennema	a98226842d	Use rename to make sure no files are inserted while deleting (#3912 ) As suggested by @marcocitus in https://github.com/citusdata/citus/pull/3911#issuecomment-643978531, there was a regression in #3893. If another backend would write a file during deletion of the intermediate results directory, this file would not necessarily be deleted. The approach used in `CitusRemoveDirectory` is to try recursive removal of the directory again if it has failed. This does not work here, since when a file can not be removed for other reasons (e.g. `EPERM`) it will not throw an error anymore. So then we would get into an infinite removal loop. Instead I now `rename` the directory before removing it. That way other backends will not write files to it anymore.	2020-06-23 10:38:44 +02:00
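The rename-then-remove approach from the commit above can be sketched in a few lines. This is a Python illustration of the general technique, not the C implementation; `remove_directory_via_rename` and the `.removing.` suffix are hypothetical, and the sketch assumes that other writers open files by the original path.

```python
import os
import shutil
import tempfile
import uuid


def remove_directory_via_rename(path):
    """Rename `path` to a unique sibling name, then delete the renamed tree.

    After the rename, writers that open files under the original path can no
    longer add files to the tree being deleted.
    """
    doomed = f"{path}.removing.{uuid.uuid4().hex}"
    try:
        os.rename(path, doomed)
    except FileNotFoundError:
        return None  # nothing to remove
    # A single pass, errors ignored: no retry loop, so a file that cannot be
    # removed (e.g. EPERM) cannot cause an infinite removal loop.
    shutil.rmtree(doomed, ignore_errors=True)
    return doomed


base = tempfile.mkdtemp()
results_dir = os.path.join(base, "intermediate_results")
os.mkdir(results_dir)
with open(os.path.join(results_dir, "result_1"), "w") as f:
    f.write("tuple data")

removed_as = remove_directory_via_rename(results_dir)
```

The rename is a single step on the same filesystem, so the window in which another backend could slip a new file into the directory being deleted is closed before any deletion starts.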
Hanefi Onaldi	0e0695481c	Merge pull request #3935 from citusdata/disallow-long-changelog	2020-06-22 23:55:50 +03:00
Hanefi Önaldı	e93c47f003	Fix long changelog items	2020-06-22 23:45:47 +03:00
Hanefi Önaldı	e61ced53e3	Disallow long changelog entries	2020-06-22 23:45:46 +03:00
Önder Kalacı	f41e1b1a60	Merge pull request #3923 from citusdata/assert_order Sort WorkerPool in executions	2020-06-22 18:27:54 +02:00
Onder Kalaci	88c473e007	Sort WorkerPool in executions We sort the workerList because adaptive connection management (e.g., OPTIONAL_CONNECTION) requires any concurrent executions to wait for the connections in the same order to prevent any starvation. If we don't sort, we might end up with: Execution 1: Get connection for worker 1, wait for worker 2 Execution 2: Get connection for worker 2, wait for worker 1 and, none could proceed. Instead, we enforce every execution establish the required connections to workers in the same order.	2020-06-22 16:39:27 +02:00
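The ordering argument in the commit above is the classic "acquire resources in one global order" rule. A minimal Python sketch, using locks as a stand-in for per-worker connection slots; the function names and the use of `threading.Lock` are assumptions made for illustration and do not mirror the executor's C code.

```python
import threading


def acquire_in_order(worker_ids, locks):
    """Acquire per-worker locks in sorted order and return that order."""
    ordered = sorted(worker_ids)
    for worker_id in ordered:
        locks[worker_id].acquire()
    return ordered


def release_all(held, locks):
    """Release the locks in reverse acquisition order."""
    for worker_id in reversed(held):
        locks[worker_id].release()


def run_execution(name, worker_ids, locks, log):
    held = acquire_in_order(worker_ids, locks)
    try:
        log.append((name, held))
    finally:
        release_all(held, locks)


locks = {1: threading.Lock(), 2: threading.Lock()}
log = []

# The two executions list the workers in opposite orders, as in the commit
# message; sorting makes both of them request worker 1 first.
t1 = threading.Thread(target=run_execution, args=("execution 1", [1, 2], locks, log))
t2 = threading.Thread(target=run_execution, args=("execution 2", [2, 1], locks, log))
t1.start()
t2.start()
t1.join(timeout=5)
t2.join(timeout=5)
```

Without the `sorted` call, execution 1 could hold worker 1 while waiting for worker 2 and execution 2 could hold worker 2 while waiting for worker 1, which is exactly the situation the commit describes.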
Onur Tirtir	fb46ef1d17	Merge pull request #3930 from citusdata/update-cl-0622 Update CHANGELOG for 9.2.6 & 9.3.2	2020-06-22 16:23:05 +03:00
Onur Tirtir	d41ad47579	Update CHANGELOG for 9.3.2	2020-06-22 14:20:16 +03:00
Onur Tirtir	4a38685744	Update CHANGELOG for 9.2.6	2020-06-22 14:19:56 +03:00
Hanefi Onaldi	ebd8de88d5	Merge pull request #3829 from citusdata/migrations-disallow-c-comment	2020-06-22 13:36:57 +03:00
Hanefi Önaldı	618453a2ba	Disallow C-style comments in migration files	2020-06-22 12:51:16 +03:00
Hanefi Önaldı	56285e6470	Use citus docker hub org	2020-06-22 12:51:16 +03:00
Jelte Fennema	b3ec6fbe7a	Make check_enterprise_merge script stricter (#3918 ) We've had two issues with merge conflicts to enterprise in the last week, that suddenly happened. Because of this CI check this actually blocks all community PRs from being merged. This PR tries to improve on the previous script we had, by putting tougher constraints on when a merge is allowed. Previously the check would pass in two cases: 1. This PR can be merged without conflicts into `enterprise-master` 2. A branch exists with the same name as this PR on enterprise and that can be merged into `enterprise-master`. The first case stays the same, but I've changed the second case to require the following instead: 1. A branch exists on enterprise with the same name as this PR 2. NEW: This branch contains the last commit of the community PR branch 3. This branch can be merged into enterprise-master This makes sure the enterprise branch is actually up to date and not forgotten about. If we still get problems with this change, future improvements could be: 1. Check that the PR on enterprise passes CI 2. Check that the PR on enterprise has been approved 3. Require the enterprise PR branch to be merged before merging community.	2020-06-19 12:45:36 +02:00
SaitTalhaNisanci	3a789352b6	rename citus hammerdb branch prefix as citus_github_push (#3925 ) When we are using hammerdb jobs, the job creates a branch on test automation, since that branch should be deleted, it would have `delete_me` prefix, however since the result branch on release-test-results will have the test automation branch as prefix, it will also have `delete_me` prefix, which seems a bit confusing. This PR updates it as citus_github_push	2020-06-18 21:11:58 +03:00
Onur Tirtir	c61e84c14b	Merge pull request #3921 from citusdata/update-cl-0617 Update CHANGELOG for 9.2.5 & 9.3.1	2020-06-17 19:05:45 +03:00
Onur Tirtir	4640f90933	Update CHANGELOG for 9.3.1	2020-06-17 18:45:54 +03:00
Onur Tirtir	74f20149cd	Update CHANGELOG for 9.2.5	2020-06-17 18:45:54 +03:00
Marco Slot	004e0e4617	Merge pull request #3919 from citusdata/fix/combine-query Rename masterQuery to combineQuery	2020-06-17 16:12:13 +02:00
Marco Slot	2a3234ca26	Rename masterQuery to combineQuery	2020-06-17 14:14:37 +02:00
Jelte Fennema	0259815d3a	Fix EXPLAIN ANALYZE received data counter issues (#3917 ) In #3901 the "Data received from worker(s)" sections were added to EXPLAIN ANALYZE. After merging @pykello posted some review comments. This addresses those comments as well as fixing other issues that I found while addressing them. The things this does: 1. Fix `EXPLAIN ANALYZE EXECUTE p1` to not increase received data on every execution 2. Fix `EXPLAIN ANALYZE EXECUTE p1(1)` to not always return 0 bytes as received data. 3. Move `EXPLAIN ANALYZE` specific logic to `multi_explain.c` from `adaptive_executor.c` 4. Change naming of new explain sections to `Tuple data received from node(s)`. Firstly because a task can reference the coordinator too, so "worker(s)" was incorrect. Secondly to indicate that this is tuple data and not all network traffic that was performed. 5. Rename `totalReceivedData` in our codebase to `totalReceivedTupleData` to make it clearer that it's a tuple data counter, not all network traffic. 6. Actually add `binary_protocol` test to `multi_schedule` (woops) 7. Fix a randomly failing test in `local_shard_execution.sql`.	2020-06-17 11:33:38 +02:00
Marco Slot	7bd93c8f2f	Merge pull request #3904 from citusdata/fix/remove-master Remove master terminology from file hierarchy	2020-06-16 18:00:25 +02:00
Marco Slot	d1bab78d79	Remove master from file hierarchy	2020-06-16 17:49:09 +02:00
Jelte Fennema	b71f82b31e	Use 5 second isolation test timeout (#3907 ) Sometimes isolation tests get stuck in CI and we cannot see why, because the job is killed by the CI runner. With this change the test suite instead continues, but the stuck step is marked as a failure like this in the diff output: ```diff +isolationtester: canceling step s2-ddl-create-index-concurrently after 5 seconds step s2-ddl-create-index-concurrently: CREATE INDEX CONCURRENTLY select_append_index ON select_append(id); +ERROR: CONCURRENTLY-enabled index command failed ``` We should detect blockages very quickly and the queries we run are also very fast, so 5 seconds should be more than enough to catch any random slowness. The default from Postgres is 5 minutes, which is way too much for us.	2020-06-16 14:57:49 +02:00
Jelte Fennema	799bfdab56	Temporarily disable connection leak tests that fail a lot (#3911 ) MX connection leak failures: 1. https://app.circleci.com/pipelines/github/citusdata/citus/9296/workflows/e36d1088-662a-4f60-acec-293132632c2f/jobs/131908/steps 2. https://app.circleci.com/pipelines/github/citusdata/citus/9258/workflows/37659d82-2c5b-495e-b0e7-905811e30444/jobs/131299 Failure connection leak failures: 1. https://app.circleci.com/pipelines/github/citusdata/citus/9297/workflows/c0ebc326-8c93-468f-8b70-f470bd492fb9/jobs/131920 2. https://app.circleci.com/pipelines/github/citusdata/citus/9283/workflows/9af154d0-ff96-4c5d-ae19-81faae1e0c18/jobs/131668	2020-06-16 13:48:48 +02:00
Philip Dubé	56eb5ee305	Merge pull request #3866 from citusdata/release-cache-entry-deferred Deferred release of metadata cache entries	2020-06-15 16:41:02 +00:00
Philip Dubé	39400319e6	Defer freeing CitusTableCacheEntry, as there were memory safety issues before Shard id to index mapping stored in cache entry as there may now be multiple entries alive for a given relation insert_select_executor: revert copying cache entry, which was a hack added to avoid memory safety issues	2020-06-15 16:20:50 +00:00
Jelte Fennema	927de6d187	Show amount of data received in EXPLAIN ANALYZE (#3901 ) Sadly this does not actually work yet for binary protocol data, because when doing EXPLAIN ANALYZE we send two commands at the same time. This means we cannot use `SendRemoteCommandParams`, and thus cannot use the binary protocol. This can still be useful though when using the text protocol, to find out that a lot of data is being sent.	2020-06-15 16:01:05 +02:00
SaitTalhaNisanci	077c784fe9	Create EnsureTableCanBeCreated for some checks (#3839 )	2020-06-14 14:25:58 +03:00
Hadi Moshayedi	b090dcd530	Merge pull request #3887 from citusdata/local-router-joins Implement local table joins in router planner	2020-06-12 18:45:13 -07:00
Hadi Moshayedi	ef778c1cd7	address feedback from Sait Talha & Hadi	2020-06-12 18:36:02 -07:00
Marco Slot	4f7989ad8e	Rename WorkersContainingAllShards to PlacementsForWorkersContainingAllShards	2020-06-12 18:36:02 -07:00
Marco Slot	080f711e62	Remove useless debug message in router planner	2020-06-12 18:36:02 -07:00
Marco Slot	d953f084db	Rename FindRouterWorkerList to CreateTaskPlacementListForShardIntervals	2020-06-12 18:36:01 -07:00
Marco Slot	24feadc230	Handle joins between local/reference/cte via router planner	2020-06-12 18:36:01 -07:00
Nils Dijk	f57711b3d2	fix test output for tdigest (#3909 ) Due to the problem described in #3908 we don't cover the tdigest integration (and other extensions) on CI. Due to this a bug got in the patch due to a change in `EXPLAIN VERBOSE` being merged concurrently with the tdigest integration. This PR fixes the test output that missed the newly added information.	2020-06-12 20:54:27 +02:00
Halil Ozan Akgül	8c5eb6b7ea	Insert Select Into Local Table (#3870 ) * Insert select with master query * Use relid to set custom_scan_tlist varno * Reviews * Fixes null check Co-authored-by: Marco Slot <marco.slot@gmail.com>	2020-06-12 17:06:31 +03:00
Jelte Fennema	0e12d045b1	Support use of binary protocol in between nodes (#3877 ) This can reduce the amount of data sent in some cases, thus improving performance for queries whose bottleneck is the bandwidth between nodes. There are some issues with enabling this by default, so that's currently not done.	2020-06-12 15:02:51 +02:00
Nils Dijk	da8f2b0134	Feature: tdigest aggregate (#3897 ) DESCRIPTION: Adds support to partially push down tdigest aggregates tdigest extensions: https://github.com/tvondra/tdigest This PR implements the partial pushdown of tdigest calculations when possible. The extension adds a tdigest type which can be combined into the same structure. There are several aggregate functions that can be used to get: - a quantile - a list of quantiles - the quantile of a hypothetical value - a list of quantiles for a list of hypothetical values These functions can work on both values and tdigest types. Since we can create tdigest values either by combining them, or based on a group of values, we can rewrite the aggregates in such a way that most of the computation gets delegated to the compute on the shards. This both speeds up the percentile calculations, because the values don't have to be sorted, and makes the transfer size from the shards to the coordinator significantly smaller.	2020-06-12 13:50:28 +02:00
Philip Dubé	f69037c192	Merge pull request #3903 from citusdata/remove-misleading-iscitustable-check IsReferenceTable, ShardIntervalCount: remove misleading isCitusTable check	2020-06-11 18:50:36 +00:00
Philip Dubé	8faaaee6a5	IsReferenceTable, ShardIntervalCount: remove misleading isCitusTable check GetCitusTableCacheEntry raises an error if relationId is not distributed	2020-06-11 15:35:02 +00:00
Philip Dubé	f344c1a4bc	Merge pull request #3654 from citusdata/2776_modifying_ctes Modifying ctes in router planner	2020-06-11 15:26:00 +00:00
Philip Dubé	1722d8ac8b	Allow routing modifying CTEs We still recursively plan some cases, eg: - INSERTs - SELECT FOR UPDATE when reference tables in query - Everything must be same single shard & replication model	2020-06-11 15:14:06 +00:00
Hadi Moshayedi	e37c385d6c	Merge pull request #3899 from citusdata/explain_analyze_sort_by_time Include execution duration in worker_last_saved_explain_analyze	2020-06-11 04:05:41 -07:00
Hadi Moshayedi	0e3140c14d	Include execution duration in worker_last_saved_explain_analyze	2020-06-11 02:54:54 -07:00
Hadi Moshayedi	93b79880fe	Merge pull request #3864 from citusdata/explain_analyze_cte CTE statistics in EXPLAIN ANALYZE	2020-06-11 02:49:06 -07:00
Hadi Moshayedi	7c52c6edb0	CTE statistics in EXPLAIN ANALYZE	2020-06-11 02:39:59 -07:00
Hadi Moshayedi	fe8a9c721c	Merge pull request #3891 from citusdata/explain_analyze_worker_query Show query text in EXPLAIN output	2020-06-11 02:38:07 -07:00
Hadi Moshayedi	1f6d6ee4a5	Show query text in EXPLAIN output	2020-06-11 02:19:55 -07:00
Hadi Moshayedi	9a49f10c49	Merge pull request #3890 from citusdata/explain_analyze_exec_once Do EXPLAIN ANALYZE at the same time as execution to avoid executing twice.	2020-06-11 02:17:38 -07:00
Hadi Moshayedi	bb96ef5047	Do the EXPLAIN ANALYZE at the same time as execution, which avoids executing twice. We wrap worker tasks in worker_save_query_explain_analyze() so we can fetch their explain output later by a call to worker_last_saved_explain_analyze(). Fixes #3519 Fixes #2347 Fixes #2613 Fixes #621	2020-06-11 01:55:57 -07:00
Hadi Moshayedi	8551affc1e	Merge pull request #3892 from citusdata/explain_execute Test we don't support multi-shard EXPLAIN EXECUTE	2020-06-10 17:19:10 -07:00
Hadi Moshayedi	6ca621bd16	Test we don't support multi-shard EXPLAIN EXECUTE	2020-06-10 17:11:27 -07:00
Jelte Fennema	6f2eb4cdb6	Remove FlattenJoinVars (#3880 ) This code is not needed anymore since #3668 was merged. It's actually causing some issues when using the binary Postgres protocol, because postgres thinks it gets a `bigint` from the worker, but actually gets a normal `int`. The query in question that fails is this: ```sql CREATE TABLE test_table_1(id int, val1 int); CREATE TABLE test_table_2(id int, val1 bigint); SELECT create_distributed_table('test_table_1', 'id'); SELECT create_distributed_table('test_table_2', 'id'); INSERT INTO test_table_1 VALUES(1,1),(2,2),(3,3); INSERT INTO test_table_2 VALUES(1,1),(3,3),(4,5); SELECT val1 FROM test_table_1 LEFT JOIN test_table_2 USING(id, val1) ORDER BY 1; ``` The difference in queries that is sent to the workers after this change is this, for this query: ```diff --- query_old.sql 2020-06-09 09:51:21.460000000 +0200 +++ query_new.sql 2020-06-09 09:51:39.500000000 +0200 @@ -1 +1 @@ -SELECT worker_column_1 AS val1 FROM (SELECT test_table_1.val1 AS worker_column_1 FROM (public.test_table_1_102015 test_table_1(id, val1) LEFT JOIN public.test_table_2_102019 test_table_2(id, val1) USING (id, val1))) worker_subquery +SELECT worker_column_1 AS val1 FROM (SELECT val1 AS worker_column_1 FROM (public.test_table_1_102015 test_table_1(id, val1) LEFT JOIN public.test_table_2_102019 test_table_2(id, val1) USING (id, val1))) worker_subquery ```	2020-06-10 17:24:53 +02:00
Jelte Fennema	f4791fcb10	Remove SwallowErrors by using PathNameDeleteTemporaryDir (#3893 ) This is a different version of #3634. It also removes SwallowErrors, but instead of modifying our own functions to not throw errors, it uses the postgres built in `PathNameDeleteTemporaryDir` function. This function does not throw errors. Since this change is for a bugfix, I tried to minimize the changes. PRs with the following changes would be good to do separately from this PR: 1. Use PathName(Create\|Open\|Delete)Temporary(File\|Dir) to open and remove all files/dirs instead of our own custom file functions. 2. Prefix our outmost files/directories with `PG_TEMP_FILE_PREFIX` so that they are identified by Postgres as temporary files, which will be removed at postmaster start. This way we do not have to do this cleanup ourselves. 3. Store the files in the temporary table space if it exists. Fixes #3634 Fixes #3618	2020-06-10 17:04:07 +02:00
Hanefi Onaldi	6e9324e99d	Merge pull request #3841 from citusdata/copy_max_adaptive_executor_pool_size	2020-06-10 17:25:12 +03:00
Onder Kalaci	640717bea2	Copy doesn't use more than MaxAdaptiveExecutor Co-authored-by: Hanefi Önaldı <Hanefi.Onaldi@Microsoft.com>	2020-06-10 16:46:21 +03:00
Jelte Fennema	b87bae71bb	Error out when using different users in the same transaction (#3869 ) Fixes #3867 As described in the issue above we return incorrect results when changing user within a transaction. This causes us to error out instead.	2020-06-10 14:07:40 +02:00
Marco Slot	02a70df656	Merge pull request #3889 from citusdata/fix/stage_generates_utility_commands Execute shard creation as utility tasks	2020-06-10 11:51:24 +02:00
Marco Slot	1243b6a948	Execute shard creation as utility tasks	2020-06-10 11:29:49 +02:00
Önder Kalacı	34554a2957	Merge pull request #3886 from citusdata/fix_coercion Coerce types properly for distribution keys when necessary	2020-06-10 10:50:47 +02:00
Onder Kalaci	06461ca55f	Coerce types properly for INSERT Also, unify similar code-paths to rely on more accurate function.	2020-06-10 10:40:28 +02:00
Hadi Moshayedi	08d2b9b40b	Merge pull request #3881 from citusdata/explain_analyze_udfs Implement EXPLAIN ANALYZE udfs.	2020-06-09 10:12:21 -07:00
Hadi Moshayedi	5cdfa9f571	Implement EXPLAIN ANALYZE udfs. Implements worker_save_query_explain_analyze and worker_last_saved_explain_analyze. worker_save_query_explain_analyze executes and returns results of query while saving its EXPLAIN ANALYZE to be fetched later. worker_last_saved_explain_analyze returns the saved EXPLAIN ANALYZE result.	2020-06-09 10:02:05 -07:00
Onur Tirtir	a4f1c41391	Implement GetQueryLockMode helper (#3860 ) If we want to get necessary lockmode for a relation RangeVar within a query, we can get the lockmode easily from the RangeVar itself (if pg version >= 12). However, if we want to decide the lockmode appropriate for the "query", we can derive this information by using GetQueryLockMode according to the code comment from RangeTblEntry->rellockmode.	2020-06-09 13:08:44 +03:00
Hadi Moshayedi	5781aaf6c7	Merge pull request #3883 from citusdata/circ_deps typedef TupleDestination only once	2020-06-09 01:36:41 -07:00
Hadi Moshayedi	198d5d8b0f	typedef TupleDestination once	2020-06-08 20:38:28 -07:00
Hadi Moshayedi	6869d7bfb2	Merge pull request #3878 from citusdata/explain_analyze_initial_cleanup Explain Analyze tests & cleanup	2020-06-07 00:06:11 -07:00
Hadi Moshayedi	45a41e249f	Test EXPLAIN ANALYZE doesn't show repartition join tasks	2020-06-06 23:24:45 -07:00
Hadi Moshayedi	02cff1a7c6	Test that EXPLAIN ANALYZE is not supported for some forms of INSERT/SELECT	2020-06-06 23:24:45 -07:00
Hadi Moshayedi	f54a8e53c0	Remove unused consts from multi_explain.c	2020-06-06 23:24:45 -07:00
Hadi Moshayedi	797405f3d1	Merge pull request #3871 from citusdata/tupledest Implement TupleDestination to allow custom processing of task results.	2020-06-06 10:50:02 -07:00
Hadi Moshayedi	0bfd39ea52	Implement TupleDestination interface. Implements a new `TupleDestination` interface to allow custom tuple processing per task. This can be especially useful if a task contains multiple queries. An example of this is EXPLAIN ANALYZE, which needs to add some UDF calls to the query to fetch the explain output from the worker after fetching the actual query results.	2020-06-05 17:47:40 -07:00
SaitTalhaNisanci	d0f47eb338	Check the removeType in IsDropCitusStmt (#3859 ) We should check the remove type in IsDropCitusStmt because if the remove type is not OBJECT_EXTENSION then the stored objects in dropStmt->objects may not be of type Value. This was crashing PG-13. Also rename the method as IsDropCitusExtensionStmt.	2020-06-05 20:49:54 +03:00
Onur Tirtir	f7224a12f2	Implement PushOverrideEmptySearchPath (#3874 ) To reduce code duplication, implement function that pushes search_path to be NIL and sets addCatalog to true so that all objects outside of pg_catalog will be schema-prefixed.	2020-06-05 19:23:59 +03:00
Onur Tirtir	8b39d12846	Append IF NOT EXISTS to deparsed CREATE SERVER commands (#3875 ) Append IF NOT EXISTS to CREATE SERVER commands generated by the pg_get_serverdef_string function when deparsing an existing server object that a foreign table depends on.	2020-06-05 18:04:33 +03:00
Onur Tirtir	741e808049	Merge pull request #3873 from citusdata/refactor/implement-explicit-index Implement IndexIsImpliedByAConstraint	2020-06-05 16:05:08 +03:00
Onur Tirtir	f3f711e097	Implement IndexIsImpliedByAConstraint	2020-06-05 15:33:54 +03:00
Philip Dubé	4f63443b49	Merge pull request #3861 from citusdata/remove-null-plannerrestrictioncontext-check multi_router_planner: Remove NULL check which would've segfaulted earlier	2020-06-02 13:18:30 +00:00
Philip Dubé	25f86bca3f	multi_router_planner: Remove NULL check which would've segfaulted earlier	2020-06-02 13:08:38 +00:00
Philip Dubé	1b0776f98e	Merge pull request #3863 from citusdata/remove-getupdateordeleterte multi_router_planner: replace GetUpdateOrDeleteRTE with ExtractResultRelationRTE	2020-06-02 12:58:42 +00:00
Philip Dubé	2623aefe38	multi_router_planner: replace GetUpdateOrDeleteRTE with ExtractResultRelationRTE	2020-06-02 00:22:30 +00:00
Onur Tirtir	738a4ddb58	Merge pull request #3858 from citusdata/copy_paste_pg_get_triggerdef * Error out if creating a citus table from a table having triggers already. * Error out for CREATE TRIGGER commands that are run on citus tables. * Introduce the ability to deparse CREATE TRIGGER commands needed to recreate triggers on a table.	2020-06-01 10:35:45 +03:00
Onur Tirtir	dfcc18468c	Error out for unsupported trigger objects Error out if creating a citus table from a table having triggers. Error out for CREATE TRIGGER commands that are run on citus tables.	2020-05-31 23:10:01 +03:00
Onur Tirtir	6e6bc155a9	Implement methods to process & recreate triggers on citus tables	2020-05-31 15:28:17 +03:00
Onur Tirtir	5af64084ea	Copy & paste pg_get_triggerdef_worker from Postgres	2020-05-31 15:25:07 +03:00
Philip Dubé	dca09e998f	Merge pull request #3857 from citusdata/use_RelationGetPartitionDesc use RelationGetPartitionDesc to be more safe	2020-05-29 15:12:57 +00:00
Sait Talha Nisanci	dec2b28d49	use RelationGetPartitionDesc to be more safe For getting the partition desc, we should use RelationGetPartitionDesc method so that even if it is NULL, it will be created in the method.	2020-05-29 10:55:52 +03:00
Philip Dubé	a3c470b2b8	Merge pull request #3834 from citusdata/prep-routing-modifying-ctes Prep routing modifying ctes	2020-05-20 17:35:36 +00:00
Philip Dubé	c0515dcd67	This prepares for routing modifying CTEs, where modLevel should not be used to infer whether a plan is a select or not SELECT_TASK is renamed to READ_TASK as a SELECT with modifying CTEs will be a MODIFYING_TASK RouterInsertJob: Assert originalQuery->commandType == CMD_INSERT CreateModifyPlan: Assert originalQuery->commandType != CMD_SELECT Remove unused function IsModifyDistributedPlan DistributedExecution, ExecutionParams, DistributedPlan: Rename hasReturning to expectResults SELECTs set expectResults to true Rename CreateSingleTaskRouterPlan to CreateSingleTaskRouterSelectPlan	2020-05-20 17:26:12 +00:00
Onur Tirtir	780c72bdc8	Merge pull request #3846 from citusdata/implement-table-constraints-internal Refactor the methods accessing to pg_constraint	2020-05-20 17:45:59 +03:00
Onur Tirtir	98a660d0b7	Don't release lock on pg_constraint until the xact ends Do not release AccessShareLock when closing pg_constraint, to prevent modifications to pg_constraint and to make sure that the caller processes valid foreign key constraints throughout the transaction.	2020-05-20 17:27:17 +03:00
Onur Tirtir	79a688ffe0	Refactor the methods accessing to pg_constraint Implement internal functions to access pg_constraint and utilize them in existing foreign key checks.	2020-05-20 17:27:17 +03:00
SaitTalhaNisanci	80e34382cf	Rename AppropriateReplicationModel -> DecideReplicationModel (#3842 )	2020-05-17 10:24:14 +03:00
Onur Tirtir	8f9ef63e8a	Implement get_relation_constraint_oid_compat helper (#3836 )	2020-05-15 17:36:59 +03:00
Philip Dubé	4ad10adbc4	Merge pull request #3832 from citusdata/fix/varchar-type-propagation Fix composite types lacking typemod when distributed	2020-05-15 13:25:19 +00:00
MoYi	9e1f198155	Fix composite create type deparsing to preserve typmod	2020-05-15 13:12:54 +00:00
Onur Tirtir	249550b815	Refactor EnsureLocalTableEmptyIfNecessary (#3830 )	2020-05-15 14:20:33 +03:00
Onur Tirtir	8f3373c702	Remove unused parameter from RecordDistributedRelationDependencies (#3831 )	2020-05-15 10:34:35 +03:00
SaitTalhaNisanci	625f0d034c	Merge pull request #3773 from citusdata/enh/hammerdbJob Add optional ch_benchmark and tpcc_benchmark job	2020-05-14 16:08:43 +03:00
Sait Talha Nisanci	41fceb7849	Add optional ch_benchmark and tpcc_benchmark job With this commit: You can trigger two types of hammerdb benchmark jobs: -ch_benchmark (analytical and transactional queries) -tpcc_benchmark (only transactional queries) Your branch will be run against `master` branch. In order to trigger the jobs prepend `ch_benchmark/` or `tpcc_benchmark/` to your branch and push it. For example if you were running on a feature/improvement branch with name `improve/adaptive_executor`. In order to trigger a tpcc benchmark, you can do the following: ```bash git checkout improve/adaptive_executor git checkout -b tpcc_benchmark/improve/adaptive_executor git push origin tpcc_benchmark/improve/adaptive_executor # the tpcc benchmark job will be triggered. ``` You will see the results in a branch in [https://github.com/citusdata/release-test-results](https://github.com/citusdata/release-test-results). The branch name will be something like: `delete_me/citusbot_tpcc_benchmark_rg/<date>/<date>`. The resource groups will be deleted automatically but if the benchmark fails, they won't be deleted(If you don't see the results after a reasonable time, it might mean it failed, you can check the resource usage from portal, if it is almost 0 and you didn't see the results, it means it probably failed). In that case, you will need to delete the resource groups manually from portal, the resource groups are `citusbot_ch_benchmark_rg` and `citusbot_tpcc_benchmark_rg`.	2020-05-14 16:01:48 +03:00
SaitTalhaNisanci	cf98b9d6d5	not wait forever for metadata sync in tests (#3760 ) We shouldn't wait forever for metadata sync in tests; otherwise, when a test gets stuck, we don't know which line causes the problem.	2020-05-14 10:51:24 +03:00
Onur Tirtir	a024389ab6	Use exactly matching tag in citus_version output (#3828 ) Instead of using the git tag that is reachable from the HEAD commit, use the exactly matching tag in citus_version(). For the other branches (including the master), just use name of the current branch that we are installing citus and sha of the HEAD. This is because, we do not tag the branches except the release ones. That way, we won't see v9.2.3-DO-NOT-INSTALL tag anymore in the output of the citus_version();	2020-05-13 15:05:07 +03:00
Onur Tirtir	ac1ec40bfb	Add changelog entry for 9.3.0 (#3823 )	2020-05-07 16:04:07 +03:00
SaitTalhaNisanci	22c903b151	remove ExecuteUtilityTaskListWithoutResults (#3696 ) This PR removes ExecuteUtilityTaskListWithoutResults and uses the same path for local execution via ExecuteTaskListExtended. ExecuteUtilityTaskList is added. ExecuteLocalTaskListExtended now has a parameter for utility commands so that it can call the right method. In order not to change the existing calls, ExecuteTaskListExtendedInternal is added, which is the main method that runs the execution, via local and remote execution.	2020-05-07 13:30:50 +03:00
Nils Dijk	105de7beb8	Fix for pruned target list entries (#3818 ) DESCRIPTION: Ignore pruned target list entries in coordinator plan The postgres planner has the ability to prune target list entries that are proven not used in the output relation. When this happens at the `CitusCustomScan` boundary we need to _not_ return these pruned columns to not upset the rest of the planner. By using the target list the planner asks us to return we fix issues that lead to Assertion failures, and potentially could be runtime errors when they hit in a production build. Fixes #3809	2020-05-06 13:56:02 +02:00
Marco Slot	bfa7177352	Merge pull request #3814 from citusdata/fix/any_value	2020-05-06 11:48:50 +02:00
Marco Slot	6ce2803777	Make sure we don't wrap GROUP BY expressions in any_value	2020-05-05 05:12:45 +02:00
Hadi Moshayedi	d4943cee55	Merge pull request #3815 from citusdata/fix_maintenaced Don't error out when cannot create maintenanced	2020-05-04 10:29:33 -07:00
Hadi Moshayedi	dbf509bbdd	Don't error out when cannot create maintenanced	2020-05-04 09:53:52 -07:00
SaitTalhaNisanci	4a9d516f1b	Add a job to check if merge to enterprise master would fail (#3777 ) * add a job to check if merge to enterprise master would fail Add a job to check if merge to enterprise master would fail. The job does the following: - It checks if there is already a branch with the same name on enterprise, if so it tries to merge it to enterprise master, if the merge fails the job fails. - If the branch doesn't exist on the enterprise, it tries to merge the current branch to enterprise master, it fails if there is any conflict while merging. The motivation is that if a branch on community would create a conflict on enterprise-master, until we create a PR on enterprise that would solve this conflict, we won't be able to merge the PR on community. This way we won't have many conflicts when merging to enterprise master and the author, who has the most context will be responsible for resolving the conflict when he has the most context, not after 1 month. * Improve test suite to be able to easily run locally * Add documentation on how to resolve conflicts to enterprise master * Improve enterprise merge script * Improve merge conflict job README * Improve merge conflict job README * Improve merge conflict job README * Improve merge conflict job README Co-authored-by: Nils Dijk <nils@citusdata.com>	2020-05-04 17:08:17 +03:00
Önder Kalacı	6fd87e1aac	Merge pull request #3816 from citusdata/fix_false_subquery Remove assertion for subqueries in WHERE clause ANDed with FALSE	2020-05-04 15:54:13 +02:00
Onder Kalaci	f9d4a9cf38	Remove assertion for subqueries in WHERE clause ANDed with FALSE In the code, we had the assumption that if restriction information is NULL, it means that we cannot have any distributed tables in the subquery. However, for subqueries in WHERE clause, that is not the case when the subquery is ANDed with FALSE. In that case, Citus operates on the originalQuery (which doesn't go through the standard_planner()), and relies on the restriction information generated by standard_planner(). As Postgres is smart enough to not generate restriction information for subqueries ANDed with FALSE, we hit the assertion.	2020-05-04 10:52:15 +02:00
Önder Kalacı	6bcf6d3411	Merge pull request #3813 from citusdata/add_order_by_19912312 Add order by to some tests to make the output consistent	2020-05-01 13:22:34 +02:00
Onder Kalaci	891d99efaf	add order by to some tests to make the output consistent	2020-05-01 12:41:51 +02:00
Önder Kalacı	30d7765d0e	Merge pull request #3812 from citusdata/rebuid_when_socket_channges Rebuild WaitEventSet if socket changes after calling `PQconnectPoll `	2020-05-01 09:54:52 +02:00
Onder Kalaci	77c397e9ae	Rebuild wait event sets after PQconnectPoll() if socket changes The reason is that PQconnectPoll() may change the underlying socket. If we don't rebuild the wait event set, the low level APIs (such as epoll_ctl()) may fail due to invalid sockets. Instead, rebuilding ensures that we'll use accurate/active sockets.	2020-05-01 09:44:21 +02:00
Jelte Fennema	c6f5d5fe88	Add some asserts to pass static analysis (#3805 )	2020-04-29 11:19:11 +02:00
SaitTalhaNisanci	cbda951395	Fix task copy and appending empty task in ExtractLocalAndRemoteTasks (#3802 ) * Not append empty task in ExtractLocalAndRemoteTasks ExtractLocalAndRemoteTasks extracts the local and remote tasks. If we do not have a local task the localTaskPlacementList will be NIL, in this case we should not append anything to local tasks. Previously we would first check if a task contains a single placement or not, now we first check if there is any local task before doing anything. * fix copy of node task Task node has task query, which might contain a list of strings in its fields. We were using postgres copyObject for these lists. Postgres assumes that each element of list will be a node type. If it is not a node type it will error. As a solution to that, a new macro is introduced to copy a list of strings.	2020-04-29 11:05:34 +03:00
Philip Dubé	3fecf0b732	Merge pull request #3799 from citusdata/fix-copy-generated Fix COPY TO's COPY (SELECT) with distributed table having generated columns	2020-04-28 14:53:34 +00:00
Philip Dubé	b6b3c1bc17	Fix COPY TO's COPY (SELECT) with distributed table having generated columns It's necessary to omit generated columns from output	2020-04-28 14:40:47 +00:00
SaitTalhaNisanci	164c00cf08	Fix typo: longer visible -> no longer visible (#3803 )	2020-04-27 16:32:46 +03:00
Önder Kalacı	50346d0b42	Merge pull request #3797 from citusdata/increase_the_timeout Increase the default value of `citus.node_connection_timeout`	2020-04-24 16:08:50 +02:00
Onder Kalaci	bc54c5125f	Increase the default value of citus.node_connection_timeout The previous default was 5 seconds, and we change it to 30 seconds. The main motivation for this is that for busy clusters, 5 seconds can be too aggressive. Especially with connection throttling, the servers might be kept busy for a really long time, and users may see the connection errors more frequently. We've done some sanity checks, for really quick queries (like `SELECT count(*) from table`), 30 seconds is a decent value even if users execute 300 distributed queries on the coordinator. We've verified this on Hyperscale(Citus).	2020-04-24 15:16:42 +02:00
Önder Kalacı	30a0a955d1	Merge pull request #3794 from citusdata/fix_custom_type_select Explicitly mark queries in physical planner for [not] having parameters	2020-04-24 12:58:39 +02:00
Onder Kalaci	0cb7ab2d05	Explicitly mark queries in physical planner for [not] having parameters Physical planner doesn't support parameters. If the parameters have already been resolved when the physical planner is handling the queries, mark it. The reason is that the executor is unaware of this, and sends the parameters along with the worker queries, which fails for composite types. (See `DissuadePlannerFromUsingPlan()` for the details of parameter resolving)	2020-04-24 12:49:43 +02:00
Önder Kalacı	4372954f31	Merge pull request #3795 from citusdata/improve_test Re-enable isolation test for reference tables + distributed deadlock detection	2020-04-24 12:07:28 +02:00
Onder Kalaci	f517fa2e2a	Re-enable isolation test for reference tables + distributed deadlock detection	2020-04-24 11:53:03 +02:00
SaitTalhaNisanci	07cbd84631	Add base isolation schedule (#3784 ) We should do some setup steps in check-isolation-base target. This PR adds base_isolation_schedule which will set up the cluster.	2020-04-24 12:38:37 +03:00
Onur Tirtir	b8dd8f50d1	Fix build issue in GCC 10 (#3790 ) As reported in #3787, we were having issues while building citus with "GCC Red Hat 10" (maybe in some other versions of gcc as well). Fixes "multiple definition of 'CitusNodeTagNames'" error by explicitly specifying storage of CitusNodeTagNames to be extern.	2020-04-22 16:41:34 +03:00
Onur Tirtir	2e927bd6b7	Bump Citus to 9.4devel (#3788 )	2020-04-22 12:50:00 +03:00
Hanefi Onaldi	2e0cb6160c	Merge pull request #3786 from citusdata/coord-skip-dep-setup Skip dependency setup on coordinator node	2020-04-21 15:29:26 +03:00
Hanefi Önaldı	e85b835065	Skip dependency setup on coordinator node	2020-04-21 12:06:31 +03:00
Philip Dubé	2e5b1bfa41	Merge pull request #3756 from citusdata/fix-maintenanced-error-restart maintenanced: use before_shmem_exit to clear workerPid	2020-04-20 14:57:30 +00:00
Philip Dubé	9093d51a22	maintenanced: handle before_shmem_exit, assert workerPid == 0 on start	2020-04-20 14:41:40 +00:00
Jelte Fennema	1423433531	Fix running check-isolation-base (#3782 )	2020-04-20 15:36:09 +02:00
Önder Kalacı	793c65b539	Merge pull request #3606 from citusdata/improve_error_messages Improve connection error message from the worker nodes	2020-04-20 13:47:43 +02:00
Onder Kalaci	e182215d96	Improve connection error message from the worker nodes We currently put the actual error message in the detail part. However, many drivers don't show the detail part. As connection errors are fairly common and hard to trace back, we added the detail to the message itself. In addition to that, we changed the "connection error" message, as it was confusing to users who thought that the error was happening while connecting to the coordinator. In fact, this error shows up when the coordinator fails to connect to remote nodes.	2020-04-20 13:32:55 +02:00
Hadi Moshayedi	797180e0e3	Merge pull request #3778 from citusdata/more_replicate Replicate reference tables before master_create_empty_shard	2020-04-17 16:54:59 -07:00
Hadi Moshayedi	1250d691d3	Replicate reference tables before master_create_empty_shard	2020-04-17 16:47:03 -07:00
Philip Dubé	c03d3714b3	Merge pull request #3779 from citusdata/insert-select-copy-cache-entry Try copying shard intervals out of cache for long lived borrow	2020-04-17 22:49:58 +00:00
Philip Dubé	8e79672839	Try copying shard intervals out of cache for long lived borrow	2020-04-17 22:00:41 +00:00
Philip Dubé	a461ef20d9	Merge pull request #3769 from citusdata/avoid-invalidating-live-cache-entries Avoid invalidating live cache entries	2020-04-17 15:22:10 +00:00
Philip Dubé	c00d57a955	CreateDistributedInsertSelectPlan: avoid calling GetCitusTableCacheEntry in a way that would invalidate live ShardInterval pointers	2020-04-17 14:44:23 +00:00
SaitTalhaNisanci	1d0f4bdcd2	invalidate plan cache in master_update_node (#3758 ) * invalidate plan cache in master_update_node If a plan is cached by postgres but a user uses master_update_node, then when the plan cache is used for the updated node, they will get the old nodename/nodeport in the plan. This is because the plan cache doesn't know about the master_update_node. This could be a problem in prepared statements or anything that goes into plancache. As a solution the plan cache is invalidated inside master_update_node. * add invalidate_inactive_shared_connections test function We introduce invalidate_inactive_shared_connections udf to be used in testing. It is possible that a connection count for an inactive node will be greater than 0 and in that case it will not be removed at the time of invalidation. However, later we don't have a mechanism to remove it, which means that it will stay in the hash. For this not to cause a problem, we use this udf in testing. * move invalidate_inactive_shared_connections to udfs from test as it will be used in mx * remove the test udf * remove the IsInactive check	2020-04-17 17:43:48 +03:00
Philip Dubé	ae391c4f4b	Merge pull request #3768 from citusdata/avoid-long-lived-metadata Copy data from CitusTableCacheEntry more often	2020-04-17 14:25:51 +00:00
Philip Dubé	c0a95a3adb	Copy data from CitusTableCacheEntry more often This copies over fixes from reference counting branch, all CitusTableCacheEntry data may be freed when a GetCitusTableCacheEntry call occurs for its relationId This fix is not complete, but reference counting is being deferred until 9.4 CopyShardInterval: remove dest parameter, always return newly allocated object	2020-04-17 14:17:18 +00:00
Önder Kalacı	a919f09c96	Remove the entries from the shared connection counter hash when no connections remain (#3775 ) We initially considered removing entries just before any change to pg_dist_node. However, that ended-up being very complex and making MX even more complex. Instead, we're switching to a simpler solution, where we remove entries when the counter gets to 0. With certain workloads, this may have some performance penalty. But, two notes on that: - When counter == 0, it implies that the cluster is not busy - With cached connections, that's not possible	2020-04-17 17:14:58 +03:00
Philip Dubé	79f6f3c02c	Merge pull request #3757 from citusdata/fix-window-function-assertion-failure Avoid setting hasWindowFuncs true after window functions have been optimized out of query	2020-04-17 12:39:32 +00:00
Philip Dubé	e4a4707f4a	Avoid setting hasWindowFuncs true after window functions have been optimized out of query	2020-04-17 12:22:48 +00:00
SaitTalhaNisanci	a9a3be15cc	introduce TASK_QUERY_NULL task type (#3774 ) When we call SetTaskQueryString we would set the task type to TASK_QUERY_TEXT, and some parts of the codebase rely on the fact that if TASK_QUERY_TEXT is set, the data can be read safely. However if SetTaskQueryString is called with a NULL taskQueryString this can cause crashes. In that case taskQueryType will simply be set to TASK_QUERY_NULL.	2020-04-17 14:59:22 +03:00
Hanefi Onaldi	2d50f63841	Merge pull request #3752 from citusdata/local-truncate UDF to truncate local data after distributing table	2020-04-17 13:50:58 +03:00
Hanefi Önaldı	0c5d0cfee9	Notice message to help truncate local data after distribution	2020-04-17 13:21:34 +03:00
Hanefi Önaldı	d535121f8d	Introduce truncate_local_data_after_distributing_table()	2020-04-17 13:21:34 +03:00
Marco Slot	c3324f8962	Merge pull request #3772 from citusdata/fixes_from_enterprise Use block_writes for replicate_reference_tables	2020-04-17 12:07:57 +02:00
Hadi Moshayedi	61198251fd	Use block_writes for replicate_reference_tables	2020-04-16 19:25:41 -07:00
Nils Dijk	1d6ba1d09e	Refactor alter role to work on distributed roles (#3739 ) DESCRIPTION: Alter role only works for citus managed roles Alter role was implemented before we implemented good role management that hooks into the object propagation framework. This is a refactor of all alter role commands that have been implemented to - be on by default - only work for supported roles - make the citus extension owner a supported role Instead of distributing the alter role commands for roles at the beginning of the node activation role it now _only_ executes the alter role commands for all users in all databases and in the current database. In preparation of full role support small refactors have been done in the deparser. Earlier tests targeting other roles than the citus extension owner have been either slightly changed or removed to be put back where we have full role support. Fixes #2549	2020-04-16 12:23:27 +02:00
Hadi Moshayedi	e0eba87b6c	Merge pull request #3764 from citusdata/fix_stuck Detect deadlocks in replicate_reference_tables()	2020-04-15 13:18:38 -07:00
Hadi Moshayedi	59b9a4e5a1	Detect deadlocks in replicate_reference_tables()	2020-04-15 11:06:18 -07:00
SaitTalhaNisanci	df9048ebaa	update outdated comments related to local_execution (#3759 )	2020-04-15 16:15:43 +03:00
Marco Slot	5bd4970fac	Merge pull request #3017 from citusdata/fix/notices Propagate notices from queries as notices	2020-04-15 11:50:56 +02:00
Marco Slot	8b83306a27	Issue worker messages with the same log level	2020-04-14 21:08:25 +02:00
SaitTalhaNisanci	132efdbc56	add execution params struct (#3747 ) We had 9+ parameters in some of the functions related to execution. Execution params is created to simplify this a bit so that we can set only the fields that we are interested in and it is easier to read.	2020-04-14 14:32:40 +03:00
SaitTalhaNisanci	d58b5e67c1	not run multi_router_planner_fast_path in parallel (#3744 )	2020-04-14 13:14:23 +03:00
Önder Kalacı	9229db2081	Merge pull request #3692 from citusdata/shared_connection_counter Throttle connections to the worker nodes	2020-04-14 10:37:57 +02:00
Onder Kalaci	aa6b641828	Throttle connections to the worker nodes With this commit, we're introducing a new infrastructure to throttle connections to the worker nodes. This infrastructure is useful for multi-shard queries; router queries are not affected by this. The goal is to prevent establishing more than citus.max_shared_pool_size number of connections per worker node in total, across sessions. To do that, we've introduced a new connection flag OPTIONAL_CONNECTION. The idea is that some connections are optional, such as the second (and further) connections for the adaptive executor. A single connection is enough to finish the distributed execution; the others are useful to execute the query faster. Thus, they can be considered optional connections. When an optional connection is not allowed to the adaptive executor, it simply skips it and continues the execution with the already established connections. However, it'll keep retrying to establish optional connections, in case some slots are open again.	2020-04-14 10:27:48 +02:00
Onder Kalaci	38b8a9ad62	Add citus_remote_connection_stats() function This function is intended to be used for monitoring the remote connections.	2020-04-14 10:03:27 +02:00
Onder Kalaci	0dbfbe0c37	Add the necessary shared memory infrastructure - The hashmap in the shared memory - The lock to access the hashmap - The GUC to control the size	2020-04-14 10:03:26 +02:00
Hadi Moshayedi	4e3d402473	Merge pull request #3742 from citusdata/fix_sync Ensure metadata is synced on master_copy_shard_placement(..., do_repair := false)	2020-04-13 12:57:11 -07:00
Hadi Moshayedi	2639a9a19d	Test master_copy_shard_placement errors on foreign constraints	2020-04-13 12:45:27 -07:00
Hadi Moshayedi	f9de734329	Ensure metadata is synced on ReplicateColocatedShardPlacement	2020-04-13 11:45:21 -07:00
Hadi Moshayedi	2218b7e38d	Refactor ReplicateColocatedShardPlacement	2020-04-13 11:07:26 -07:00
SaitTalhaNisanci	2b2a146af4	update gitignores with new files in test folder (#3749 )	2020-04-13 17:09:18 +03:00
SaitTalhaNisanci	2438e80a58	use CURSOR_OPT_PARALLEL_OK flag in local execution (#3745 ) We currently don't use any cursor flags in local execution, but we can use CURSOR_OPT_PARALLEL_OK flag to potentially benefit from parallelism when possible.	2020-04-12 19:49:22 +03:00
Philip Dubé	c8d0e45dd4	Merge pull request #3489 from citusdata/fix-having-some-not-recursively-planned Fix subquery arguments in aggregates	2020-04-10 13:37:53 +00:00
Philip Dubé	30f10984e1	Defer get_agg_clause_costs, it happens later & avoids errors	2020-04-10 13:26:05 +00:00
Philip Dubé	b054911466	Merge pull request #3740 from citusdata/avoid-freeconn-segfault GetConnParams: Set runtimeParamStart before setting keywords/values to avoid out of bounds access	2020-04-10 13:25:32 +00:00
Philip Dubé	ab0b59ad3b	GetConnParams: Set runtimeParamStart before setting keywords/values to avoid out of bounds access	2020-04-10 13:14:06 +00:00
Halil Ozan Akgül	475c98a62a	Merge pull request #3546 from citusdata/connection-string-tests-9.2 Regression Tests on an Existing Cluster	2020-04-10 16:12:14 +03:00
Halil Ozan Akgul	34c2b7e056	Fixes the psql connection bug	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	56e814a333	Adds public host to only hyperscale tests	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	d574ac33a8	Adds next shard ids to multi_create_table tests	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	a701fc774a	Adds multi_schedule_hyperscale schedule	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	5bf350faf9	Removes failing tests This task just removes the failing tests. It doesn't mean these tests cannot be saved. It's just a starting point	2020-04-10 15:54:47 +03:00
Halil Ozan Akgul	1aa1f55d8e	Adds check_multi_hyperscale_superuser schedule	2020-04-10 13:05:07 +03:00
Halil Ozan Akgul	c2edf989cf	Adds public host parameters	2020-04-10 13:04:24 +03:00
Halil Ozan Akgul	4b9705f714	Adds worker host parameters	2020-04-10 13:03:28 +03:00
Halil Ozan Akgul	119bf590c8	Creates normalize_modified.sed	2020-04-10 13:03:19 +03:00
Halil Ozan Akgul	c8a81ef1ce	Changes copy to \copy	2020-04-10 13:03:15 +03:00
Halil Ozan Akgul	93b97248b2	Adds a connection string to run tests on that connection	2020-04-10 13:03:03 +03:00
SaitTalhaNisanci	17373d51da	not wait forever in upgrade distributed function before (#3731 )	2020-04-10 09:43:42 +03:00
SaitTalhaNisanci	07f9a442b0	Refactor CopyLocalDataIntoShards (#3693 ) This PR: - Declares variables when they are needed. - Creates DoCopyFromLocalTableIntoShards for better readability. - Doesn't use a hardcoded value, instead use a variable for better readability.	2020-04-10 09:25:26 +03:00
Philip Dubé	d99043fe0c	Merge pull request #3690 from citusdata/fix/limit_non_const Correctly handle non-constant LIMIT/OFFSET clauses	2020-04-09 20:25:27 +00:00
Marco Slot	a4b2197450	Correctly handle non-constant LIMIT/OFFSET clauses	2020-04-09 19:59:50 +00:00
SaitTalhaNisanci	3dc7cad754	use an enum for local execution status (#3733 ) We have two variables that are related to local execution status: TransactionAccessedLocalPlacement and TransactionConnectedToLocalGroup. Only one of these fields should be set, however we didn't have any check for this constraint and it was error prone. Those two variables are used to determine whether we should use local execution (the current session) or a connection to execute the current query, and therefore the tasks. In the enum, it is now clearer what these variables mean. Also, now we have a method to change the local execution status. The method will error if we are trying to transition from a state to a wrong state. This will help us avoid problems.	2020-04-09 19:11:04 +03:00
SaitTalhaNisanci	24dcb02bca	enable local table join with reference table (#3697 ) * enable local table join with reference table * test different cases with local table and reference join	2020-04-09 15:25:54 +03:00
SaitTalhaNisanci	ebda3eff61	read database name inside the function (#3730 )	2020-04-09 13:11:13 +03:00
SaitTalhaNisanci	233e4a24d1	use local execution within transaction block (#3714 ) * use local execution when in a transaction block When we are inside a transaction block, there could be other methods that need local execution, therefore we will use local execution in a transaction block. * update test outputs with transaction block local execution * add a test to verify we dont leak intermediate schemas	2020-04-09 12:41:58 +03:00
SaitTalhaNisanci	fa88046ce1	test that we don't leak intermediate schemas (#3737 ) * test that we don't leak intermediate schemas We have tests to make sure that we don't leak any intermediate files, tables etc but we don't test if we are leaking schemas. It makes sense to test this as well. * remove all repartition schemas in case of error This solution is not an ideal one but it seems to be doing the job. We should have a more generic solution for the cleanup but it seems that putting the cleanup in the abort handler is dangerous and it was crashing.	2020-04-09 12:17:41 +03:00
SaitTalhaNisanci	362d72853c	return early in ExecuteTaskListExtended (#3738 ) It is possible to return an error in ExecuteTaskListExtended after performing local execution with the current structure. However there is no point in executing the local tasks if we are going to return an error later. So the local execution is moved after the error check.	2020-04-09 10:10:49 +03:00
Hadi Moshayedi	117233c1e0	Merge pull request #3736 from citusdata/remove_todo Remove todo from reference_table_utils	2020-04-08 12:54:48 -07:00
Hadi Moshayedi	cd877f3fdd	Merge pull request #3637 from citusdata/defer_reference_table_replication_copy Defer reference table replication	2020-04-08 12:54:04 -07:00
Hadi Moshayedi	9b8802ba2d	Remove todo from reference_table_utils	2020-04-08 12:46:55 -07:00
Hadi Moshayedi	dda53a0bba	GUC for replicate reference tables on activate.	2020-04-08 12:42:45 -07:00
Hadi Moshayedi	c168a53ebc	Tests for replicate_reference_tables	2020-04-08 12:41:36 -07:00
Hadi Moshayedi	acfa850c38	Make multi_replicate_reference_table check-base friendly	2020-04-08 12:41:36 -07:00
Hadi Moshayedi	0758a81287	Prevent reference tables being dropped when replicating reference tables	2020-04-08 12:41:36 -07:00
Marco Slot	924cd7343a	Defer reference table replication to shard creation time	2020-04-08 12:41:36 -07:00
Philip Dubé	76a8a3c7c9	Merge pull request #3719 from citusdata/stricter-trigger-checks Verify trigger relation before reading old/new tuples	2020-04-07 16:18:36 +00:00
Philip Dubé	26797bfb94	Verify trigger relation before reading old/new tuples master_dist_placement_cache_invalidate: bail when triggering on pg_dist_shard_placement	2020-04-07 15:39:31 +00:00
Önder Kalacı	9fb83d6e5d	Merge pull request #3703 from citusdata/get_rid_of_side_channel Move connection establishment for intermediate results after query execution	2020-04-07 17:21:30 +02:00
Önder Kalacı	70012dfd33	Do not error when an intermediate file does not exist (#3707 ) When the file does not exist, it could mean two different things. The first -- and much more common -- case is that a failure happened in a concurrent backend on the same distributed transaction. And, one of the backends in that transaction has already been rolled back, which has already removed the file. If we throw an error here, the user might see this error instead of the actual error message. Instead, we prefer to WARN the user and pretend that the file has no data in it. In the end, the user would see the actual error message for the failure. Second, in case of any bugs in intermediate result broadcasts, we could try to read a non-existing file. That is most likely to happen during development. Thus, when asserts are enabled, we throw an error instead of WARNING so that the developers cannot miss it.	2020-04-07 17:06:55 +02:00
Onder Kalaci	a695b44ce9	Add new regression tests	2020-04-07 17:06:55 +02:00
Onder Kalaci	4b3d17f466	Make sure that tests are not failing randomly	2020-04-07 17:06:55 +02:00
Onder Kalaci	4f7c902c6c	Move connection establishment for intermediate results after query execution When we have a query like the following: ```SQL WITH a AS (SELECT * FROM foo LIMIT 10) SELECT max(x) FROM a JOIN bar 2 USING (y); ``` Citus currently opens side channels for doing the `COPY "1_1"` FROM STDIN (format 'result') before starting the execution of `SELECT * FROM foo LIMIT 10` Since we need at least 1 connection per worker to do `SELECT * FROM foo LIMIT 10` We need to have 2 connections to worker in order to broadcast the results. However, we don't actually send a single row over the side channel until the execution of `SELECT * FROM foo LIMIT 10` is completely done (and connections unclaimed) and the results are written to a tuple store. We could actually reuse the same connection for doing the `COPY "1_1"` FROM STDIN (format 'result'). This also fixes the issue that Citus doesn't obey `citus.max_adaptive_executor_pool_size` when the query includes an intermediate result.	2020-04-07 17:06:55 +02:00
Onder Kalaci	721daec9a5	Move the logic that initializes connections/local files into a function	2020-04-07 17:06:55 +02:00
Onder Kalaci	9b29a32d7a	Remove all references for side channel connections We don't need any side channel connections. That is actually problematic in the sense that it creates extra connections. Say, citus.max_adaptive_executor_pool_size equals to 1, Citus ends up using one extra connection for the intermediate results. Thus, not obeying citus.max_adaptive_executor_pool_size. In this PR, we remove the following entities from the codebase to allow further commits to implement not requiring extra connection for the intermediate results: - The connection flag REQUIRE_SIDECHANNEL - The function GivePurposeToConnection - The ConnectionPurpose struct and related fields	2020-04-07 17:06:55 +02:00
Hanefi Onaldi	e31dcff178	Merge pull request #3666 from citusdata/size-functions-without-locks Remove metadata locks from size functions	2020-04-07 18:02:39 +03:00
Hanefi Onaldi	1d22d0c2ff	Remove metadata locks from size functions	2020-04-07 17:37:15 +03:00
SaitTalhaNisanci	0430b568be	explicitly return false if transaction connected to local node (#3715 ) * explicitly return false if transaction connected to local node * not set TransactionConnectedToLocalGroup if we are writing to a file We use TransactionConnectedToLocalGroup to prevent local execution from happening as that might cause visibility problems. As files are visible to all transactions, we shouldn't set this variable if we are writing to a file.	2020-04-07 17:30:34 +03:00
Marco Slot	225adbc7ac	Merge pull request #3720 from citusdata/fix/intermediate_result_pruning Simplify and fix issues in intermediate result pruning	2020-04-07 11:20:07 +02:00
Marco Slot	2632343f64	Fix intermediate result pruning for INSERT..SELECT	2020-04-07 11:07:49 +02:00
Marco Slot	84672c3dbd	Simplify intermediate result pruning logic	2020-04-07 10:53:29 +02:00
SaitTalhaNisanci	a710b3cdc5	fix null tupleStoreState case in ExecuteLocalTaskListExtended (#3711 ) In case we don't care about the tupleStoreState in ExecuteLocalTaskListExtended, it could be passed as null. In that case we will get a segmentation fault. This changes it so that a dummy tuple store will be created when it is null. Do not use local execution in ExecuteTaskListOutsideTransaction. As we are going to run the tasks outside transaction, we shouldn't use local execution. However, there is some problem when using local execution related to repartition joins; when we solve that problem, we can execute the tasks coming to this path with local execution. Also logging the local command is simplified. normalize job id in worker_hash_partition_table in test outputs.	2020-04-07 11:47:09 +03:00
SaitTalhaNisanci	a369f9001d	fix incorrect groupid or nodeid (#3710 ) For shardplacements, we were setting nodeid, nodename, nodeport and nodegroup manually. This makes it very error prone, and it seems that we already forgot to set some of them. This would mean that they would have their default values, e.g group id would be 0 when its group id is not 0. So the implication is that we would have inconsistent worker metadata. A new method is introduced, and we call the method to set those fields now, so that as long as we call this method, we won't be setting inconsistent metadata. It probably makes sense to have a struct for these fields. We already have NodeMetadata but it doesn't have nodename or nodeport. So that could be done over another refactor to make things simpler.	2020-04-07 11:14:14 +03:00
Philip Dubé	ec734a643b	Merge pull request #3722 from citusdata/optimistic-duplicate-grouping Duplicate grouping on worker whenever possible	2020-04-06 21:31:23 +00:00
Philip Dubé	4860e11561	Duplicate grouping on worker whenever possible This is possible whenever we aren't pulling up intermediate rows We want to do this because this was done in 9.2, some queries rely on the performance of grouping causing distinct values This change was introduced when implementing window functions on coordinator	2020-04-06 18:51:30 +00:00
Philip Dubé	6a6d5af8a3	Merge pull request #3403 from citusdata/fix-rollback-savepoint-hang Check connections from connection_placement before polling	2020-04-06 18:04:03 +00:00
Philip Dubé	b01bae5937	Check connections from connection_placement before polling	2020-04-06 17:45:44 +00:00
SaitTalhaNisanci	cd3e499834	not log in debug level in null parameters (#3718 ) The purpose of null_parameters is to make sure that citus doesn't crash with null parameters. (The related issue is #3493.) The logs in this file are not that important and they are flaky. The flakiness is related to postgres part as well so it is hard to reproduce them. Therefore it makes sense to decrease the log level.	2020-04-06 17:59:46 +03:00
SaitTalhaNisanci	3d3605be80	simplify vacuum test and fix the flakiness (#3704 ) look at sent commands to simplify complex logic in vacuum test also normalize connection id as that can differ when we don't have to choose a specific connection.	2020-04-03 21:39:54 +03:00
Onur Tirtir	a0f95c5b70	Merge pull request #3701 from citusdata/refactor/planner-ref-rte Do not traverse query tree one more time in distributed planner	2020-04-03 18:40:10 +03:00
Onur Tirtir	4c95ad1579	do not traverse parse tree in distributed planner one more time	2020-04-03 18:24:48 +03:00
Onur Tirtir	abdabbedb2	refactor distributed_planner.c	2020-04-03 18:24:41 +03:00
Onur Tirtir	13a35c6813	implement GetOnlyShardOidOfReferenceTable and some refactor in shard_utils	2020-04-03 18:24:13 +03:00
Jelte Fennema	459a4829ae	Fix isolation tests on OSX (#3706 ) * Don't print out comments in make output * Remove empty lines with sed	2020-04-03 16:28:06 +02:00
SaitTalhaNisanci	32156dbf5c	fix flaky log statement in null_parameters (#3705 ) It seems that sometimes the pruning is deferred and sometimes not with this statement. What we care in this test is to see that it doesn't crash. I think we don't care about the log statement for this line. So it makes sense to not log this statement, and care about the result.	2020-04-03 17:01:59 +03:00
Hanefi Onaldi	7e682cd5e8	Merge pull request #3700 from citusdata/bump-migration-version Remove migration paths to 9.3-1, introduce 9.3-2	2020-04-03 13:02:58 +03:00
Hanefi Önaldı	d1223bd6cc	Remove migration paths to 9.3-1, introduce 9.3-2	2020-04-03 12:50:45 +03:00
SaitTalhaNisanci	710970407f	not wait forever in multi_extension test (#3702 )	2020-04-03 12:21:02 +03:00
SaitTalhaNisanci	659283c9a7	fix multi utilities vacuum test (#3699 )	2020-04-03 11:50:00 +03:00
Marco Slot	9b26dcaf31	Merge pull request #3680 from citusdata/fix/nextval Evaluate nextval in the target list on the coordinator	2020-04-02 16:23:32 +02:00
Marco Slot	fd8cdb92f4	Evaluate nextval in the target list on the coordinator	2020-04-02 02:53:19 +02:00
SaitTalhaNisanci	d80baa3557	Merge pull request #3636 from citusdata/enh/localShardCreationExecution add local shard creation support	2020-04-01 18:30:50 +03:00
SaitTalhaNisanci	df88ab71b6	normalize assign_distributed_transaction_id in tests	2020-04-01 18:23:16 +03:00
SaitTalhaNisanci	0aebd78ea7	use localExecution in ExecuteTaskListExtended ExecuteTaskListExtended is the common method for different codepaths, and instead of writing separate local execution logics in different codepaths, it makes more sense to have the logic here. We still need to do some refactoring, this is an initial step. After this commit, we can run create shard commands locally. There is a special case with shard creation commands. A create shard command might have a concatenated query string, however local execution did not know how to execute a task with multiple query strings. This is also implemented in this commit. We go over each query in the concatenated query string and plan/execute them one by one. A more clean solution to this would be to make sure that each task has a single query. We currently cannot do that because we need to ensure the task dependencies. However, it would make sense to do that at some point and it would simplify the code a lot.	2020-04-01 18:23:16 +03:00
SaitTalhaNisanci	ba01f3457a	use macros for pg versions instead of hardcoded values (#3694 ) 3 Macros are defined for removing the hardcoded pg versions. PG_VERSION_11, PG_VERSION_12 and PG_VERSION_13.	2020-04-01 17:01:52 +03:00
Philip Dubé	4fa06388e3	Merge pull request #3689 from citusdata/fix-upgrade-type-after-ordering upgrade_type_after: ORDER BY	2020-04-01 04:58:28 +00:00
Philip Dubé	3bb4f14efd	upgrade_type_after: ORDER BY	2020-04-01 01:07:21 +00:00
Hadi Moshayedi	b4ba832ae9	Merge pull request #3686 from citusdata/fix-typos tests: remove stale comment, fix typo	2020-03-31 15:05:18 -07:00
Philip Dubé	d155149c18	tests: remove stale comment, fix typo	2020-03-31 20:13:51 +00:00
Philip Dubé	4552a990e9	Merge pull request #3685 from citusdata/assert-shard-index-valid Assert bounds checks on two array reads which rely on data not being out of bounds	2020-03-31 20:13:21 +00:00
Philip Dubé	ddc3377026	Assert bounds checks on two array reads which rely on data not being out of bounds	2020-03-31 18:58:35 +00:00
Hadi Moshayedi	bab4cff1c9	Merge pull request #3679 from citusdata/fix/table_type Allow table type to be used in target list	2020-03-31 11:38:06 -07:00
Marco Slot	252abcce16	Allow table type to be used in target list	2020-03-31 11:11:01 -07:00
SaitTalhaNisanci	5bf9f32dd3	disable one of deadlock detection test (#3682 ) It seems that one of the deadlock detection tests fails way too often in our CI. The difference is only ordering. Currently it seems that it is a good idea to disable this test for the sake of development.	2020-03-31 19:47:58 +03:00
SaitTalhaNisanci	6cd32b0db1	refactor ExecuteLocalTaskList (#3617 ) ExecuteLocalTaskList doesn't need scanState as it only uses paramListInfo, distributedPlan and tupleStoreState. It is better to pass only the variables that the function needs, so that we can call this function from other places when we dont have scanState.	2020-03-31 19:19:54 +03:00
SaitTalhaNisanci	96358079ac	decrease circleci no output timeout to 2 mins from 10 mins (#3681 ) We sometimes get no output timeout in our circleci jobs. There is no point in waiting for 10 minutes to get those timeouts. Ideally we should see some output, this will also prevent adding a test that takes longer than 2 mins to run.	2020-03-31 18:53:59 +03:00
SaitTalhaNisanci	e04a307a4d	Merge pull request #3659 from citusdata/refactor/queryString refactor query string of task	2020-03-31 16:12:24 +03:00
SaitTalhaNisanci	b5591b1b28	use taskQuery as a struct to simplify the code	2020-03-31 15:47:55 +03:00
SaitTalhaNisanci	8806c4d697	move queryStringList into taskQuery Also allocate task query in the memory context of task.	2020-03-31 15:47:55 +03:00
SaitTalhaNisanci	c796ac335d	add TaskQuery struct to abstract query string related fields We had many fields in task related to query strings. It was kind of complex, and only one of them could be set at a time. Therefore it makes more sense to abstract this and use a union so that it is clear that only one of them should be set. We have three fields that could have query related strings: - queryForLocation - queryStringLazy - perPlacementQueryStrings Respectively, they can be set with: - SetTaskQueryString - SetTaskQueryIfShouldLazyDeparse - SetTaskPerPlacementQueryStrings The direct usage of the query related fields is also removed. Rename queryForLocalExecution Currently queryForLocalExecution is only used for deparsing purposes, therefore it makes sense to rename it to what it is doing.	2020-03-31 15:47:55 +03:00
SaitTalhaNisanci	98f95e2a5e	add TaskQueryStringForPlacement TaskQueryStringForPlacement simplifies how the executor gets the query string for a given placement. Task will use the necessary fields to return the correct query placement string. Executor doesn't need to know the details for this. rename TaskQueryString as TaskQueryStringAllPlacements TaskQueryString returns the query string that will be the same for all the placements. In INSERT..SELECT the query string can be different for each placement. Adaptive executor uses TaskQueryStringForPlacement, which returns the query string for a placement. It makes sense to rename TaskQueryString as TaskQueryStringAllPlacements as it is returning the query string for all placements. rename SetTaskQuery as SetTaskQueryIfShouldLazyDeparse SetTaskQuery does not always set the task query. It can set the query string as well. So it is clearer to name it SetTaskQueryIfShouldLazyDeparse, since it will set the query, not the query string, only when we should deparse the query in a lazy way.	2020-03-31 15:47:55 +03:00
SaitTalhaNisanci	982b5fbabf	add SetTaskPerPlacementStrings It is possible that a task will have different query string for each placement. This is the case in INSERT..SELECT via repartitioning. When we are setting task->perPlacementQueryString, we should set queryStringLazy to NULL. Therefore a method for that purpose is created.	2020-03-31 15:47:55 +03:00
Marco Slot	157f4599c3	Merge pull request #3668 from citusdata/fix/left_join_pk Fix error when using LEFT JOIN with GROUP BY on primary key	2020-03-31 10:30:53 +02:00
SaitTalhaNisanci	e1802c5c00	extract local plan cache related methods into a file (#3667 )	2020-03-31 11:11:34 +03:00
SaitTalhaNisanci	8dfc2cb122	not append ; if end of the list in StringJoin (#3672 )	2020-03-31 10:01:28 +03:00
Philip Dubé	67d2ad4e37	Fixes flaky test in multi_reference_table: ORDER BY (#3676 ) Fixes app.circleci.com/pipelines/github/citusdata/citus/7744/workflows/0848f36c-af9e-46b7-9dda-a421df54ba56/jobs/109503	2020-03-30 23:31:10 +02:00
Philip Dubé	ae1e92e337	Merge pull request #3670 from citusdata/avoid-stale-metadata multi_copy.c: remove tableMetadata	2020-03-30 20:00:25 +00:00
Philip Dubé	4eb2c33f38	multi_copy.c: remove tableMetadata	2020-03-30 19:26:44 +00:00
Onur Tirtir	aedfc99b62	Update CHANGELOG for 9.2.4 (#3675 )	2020-03-30 20:45:26 +03:00
Marco Slot	331b45348c	Fix error when using LEFT JOIN with GROUP BY on primary key	2020-03-30 16:42:22 +02:00
Jelte Fennema	3be665269f	Reintroduce ForceSearchShardPlacementInList (#3664 ) This was added to silence static analysis errors. It was removed accidentally in #3591. This reintroduces it again.	2020-03-27 14:28:50 +01:00
Hanefi Onaldi	c0930d157e	Merge pull request #3510 from citusdata/alter-role-set-propagation In PostgreSQL, user defaults for config parameters can be changed by ALTER ROLE .. SET statements. We wish to propagate those defaults across the Citus cluster so that the behavior will be similar in different workers. The defaults can either be set in a specific database, or the whole cluster, similarly they can be set for a single role or all roles. We propagate the ALTER ROLE .. SET if all the conditions below are met: - The query affects the current database, or all databases - The user is already created in worker nodes	2020-03-27 14:04:39 +03:00
Hanefi Onaldi	0e8103b101	Propagate ALTER ROLE .. SET statements In PostgreSQL, user defaults for config parameters can be changed by ALTER ROLE .. SET statements. We wish to propagate those defaults across the Citus cluster so that the behaviour will be similar in different workers. The defaults can either be set in a specific database, or the whole cluster, similarly they can be set for a single role or all roles. We propagate the ALTER ROLE .. SET if all the conditions below are met: - The query affects the current database, or all databases - The user is already created in worker nodes	2020-03-27 13:02:48 +03:00
Philip Dubé	bda1f1d530	Merge pull request #3661 from citusdata/fix/agg_evaluation Fixes a bug that causes some DML queries containing aggregates to fail	2020-03-26 16:14:42 +00:00
Marco Slot	a65ffee266	Fixes a bug that causes some DML queries containing aggregates to fail	2020-03-26 16:08:34 +00:00
SaitTalhaNisanci	d3fdade2e8	add missing perPlacementQueryStrings to copy and out funcs (#3657 )	2020-03-26 17:16:29 +03:00
Marco Slot	6bc3895b02	Merge pull request #3651 from citusdata/fix/srf_evaluation Fix a bug which caused queries with SRFs and function evaluation to fail	2020-03-26 14:48:26 +01:00
SaitTalhaNisanci	dd1a456407	store query command list in task (#3649 ) Sometimes we have concatenated query strings for a task. However, when we want to find each query string, it is not a trivial task. Therefore, it makes sense to store this in task so that when we need each query string we can easily get it.	2020-03-26 12:04:08 +03:00
Philip Dubé	4686133bf2	Merge pull request #3653 from citusdata/fix-grouping-sets-segfault Don't segfault on queries using GROUPING	2020-03-25 17:43:26 +00:00
Philip Dubé	0ad1956551	Merge pull request #3537 from citusdata/master-window-functions Master window functions	2020-03-25 17:27:31 +00:00
Philip Dubé	917cb6ae93	Don't segfault on queries using GROUPING GROUPING will always return 0 outside of GROUPING SETS, CUBE, or ROLLUP Since we don't support those, it makes sense to reject GROUPING in queries	2020-03-25 15:46:43 +00:00
Philip Dubé	720525cfda	Add support for window functions on coordinator Some refactoring: Consolidate expression which decides whether GROUP BY/HAVING are pushed down Rename early pullUpIntermediateRows to hasNonDistributableAggregates Create WorkerColumnName to handle formatting WORKER_COLUMN_FORMAT Ignore NULL StringInfo pointers to SafeToPushdownWindowFunction Fix bug where SubqueryPushdownMultiNodeTree mutates supplied Query, SafeToPushdownWindowFunction requires the original query as it relies on rtable	2020-03-25 15:31:20 +00:00
Jelte Fennema	36ff150465	Update CHANGELOG for v9.2.3 (#3648 )	2020-03-25 14:32:55 +01:00
Nils Dijk	4e611cfc25	Refactor dependency resolution and resolve from pg_shdepend (#3633 ) DESCRIPTION: Refactor dependency resolution and resolve from pg_shdepend This PR refactors how dependencies are resolved by not assuming solely a `pg_depend` record describing the dependency. Instead we keep a definition of the dependency around which records how the dependency is resolved. This can be one of the following ways - `pg_depend`, data will contain a copy of the `pg_depend` record - `pg_shdepend`, data will contain a copy of the `pg_shdepend` record - `ObjectAddress`, data will contain only an `ObjectAddress` describing a dependency Regardless of the way the dependency was found, it will always be possible to get to the address of the dependency, as that is the most important property. For some checks we can inspect the source where the dependency was found and perform a deep inspection to decide if we want to follow the dependency. This is important to not distribute dependencies coming from extensions for example.	2020-03-25 13:38:25 +01:00
Onur Tirtir	eaaf302795	Merge pull request #3644 from citusdata/refactor/small-typos-etc Move MakeNameListFromRangeVar function and some other small changes	2020-03-25 11:36:50 +03:00
Onur Tirtir	52fd58d51f	move MakeNameListFromRangeVar function to a more appropriate file	2020-03-25 11:01:50 +03:00
Onur Tirtir	2396b66ac5	remove an outdated comment in local executor	2020-03-25 11:01:40 +03:00
Onur Tirtir	8ebb8ef31d	use PG_USED_FOR_ASSERTS_ONLY	2020-03-25 11:01:33 +03:00
Onur Tirtir	81d48d3466	fix some typos	2020-03-25 11:01:26 +03:00
Marco Slot	b89e9dc158	Fix a bug which caused queries with SRFs and function evaluation to fail	2020-03-25 06:55:53 +01:00
Jelte Fennema	149f0b2122	Use Microsoft approved cipher string (#3639 ) This cipher string is approved by the Microsoft security team and only enables TLSv1.2 ciphers.	2020-03-24 15:51:44 +01:00
Jelte Fennema	2aabe3e2ef	Mark all connections for shutdown when citus.node_conninfo chan… (#3642 ) We cache connections between nodes in our connection management code. This is good for speed. For security this can be a problem though. If the user changes settings related to TLS encryption they want those to be applied to future queries. This is especially important when they did not have TLS enabled before and now they want to enable it. This can normally be achieved by changing citus.node_conninfo. However, because connections are not reopened there will still be old connections that might not be encrypted at all. This commit changes that by marking all connections to be shutdown at the end of their current transaction. This way running transactions will succeed, even if placement requires connections to be reused for this transaction. But after this transaction completes any future statements will use a connection created with the new connection options. If a connection is requested and a connection is found that is marked for shutdown, then we don't return this connection. Instead a new one is created. This is needed to make sure that if there are no running transactions, then the next statement will not use an old cached connection, since connections are only actually shutdown at the end of a transaction.	2020-03-24 15:31:41 +01:00
Hadi Moshayedi	b166105f16	Merge pull request #3591 from citusdata/copy_shard_placement Allow master_copy_shard_placement to replicate to new nodes	2020-03-23 08:45:21 -07:00
Hadi Moshayedi	b46b9a68ae	Tests for master_copy_shard_placement	2020-03-23 08:33:55 -07:00
Marco Slot	ede176d849	Implement shard placement copying	2020-03-23 08:33:08 -07:00
Philip Dubé	f77c71a9bd	Merge pull request #3625 from citusdata/avoid-execinitexpr-sublink PartiallyEvaluateExpression: Avoid unrecognized paramkind: 2	2020-03-23 14:25:28 +00:00
Philip Dubé	dd2bd53e5b	PartiallyEvaluateExpression: Avoid unrecognized paramkind: 2	2020-03-23 14:14:01 +00:00
SaitTalhaNisanci	3b7959a763	not run local shard copy test in parallel (#3640 ) It seems that when logging is enabled we should not run local shard copy in parallel with other tests. The reason is that it adds coordinator for reference tables and if the parallel test creates a schema before this test is run, the schema will be logged. So it is not deterministic.	2020-03-23 14:38:18 +03:00
SaitTalhaNisanci	c5c446f84f	not run local_shard_copy in parallel (#3635 )	2020-03-23 13:56:25 +03:00
SaitTalhaNisanci	3df578010e	add a UDF to update colocation (#3623 ) If two tables have the same distribution column type, we implicitly colocate them. This is useful since colocation has a big performance impact in most applications. When a table is rebalanced, all of the colocated tables are also rebalanced. If table A and table B are colocated and we want to rebalance table A, table B will also be rebalanced. We need replica identity so that logical replication can replicate updates and deletes during rebalancing. If table B does not have a replica identity we error out. A solution to this is to introduce a UDF so that colocation can be updated. The remaining tables in the colocation group will stay colocated. For example, if tables A, B and C are colocated, then after updating table B's colocation, table A and table C stay colocated. The "updating colocation" step does not move any data around, it only updates the pg_dist_partition and pg_dist_colocation tables. Specifically it creates a new colocation group for the table and updates the entry in pg_dist_partition while invalidating any cache.	2020-03-23 13:22:24 +03:00
Önder Kalacı	3e980c81e9	Merge pull request #3631 from citusdata/improve_at_exit Properly terminate connections at the end session	2020-03-20 18:01:16 +01:00
Onder Kalaci	7b4eb9611b	Properly terminate connections at the end session Citus coordinator (or MX nodes) caches `citus.max_cached_conns_per_worker` connections per node. This means that those connections are not terminated after each statement. Instead, they are cached to avoid the cost of re-establishment. This is crucial for OLTP performance. The problem with that approach is that we never properly handle the termination of those cached connections. For instance, when a session on the coordinator disconnects, you'd see the following logs on the workers: ``` 2020-03-20 09:13:39.454 CET [64028] LOG: could not receive data from client: Connection reset by peer ``` With this patch, we're terminating the cached connections properly at the end of the connection.	2020-03-20 17:34:34 +01:00
Jelte Fennema	8deb805338	Ignore safestringlib sourcefiles in coverage (#3632 ) This is not our code, so we don't care about the coverage our tests generate for it.	2020-03-20 14:26:52 +01:00
Jelte Fennema	56863e8f0b	Really ignore -Wgnu-variable-sized-type-not-at-end (#3627 )	2020-03-20 11:53:28 +01:00
Jelte Fennema	ed0376bb41	Unparallelize tests (#3629 ) We're getting a lot of random failures on CI regarding connection errors. This works around that by not running tests that create lots of connections in parallel.	2020-03-20 10:31:34 +01:00
Jelte Fennema	30ada54f6a	Merge pull request #3626 from citusdata/vendor-new-directory Compile safestringlib using regular configure	2020-03-19 12:36:38 +01:00
Jelte Fennema	a3513c8902	Ignore symlinks and directories editorconfig CI script	2020-03-19 11:53:05 +01:00
Jelte Fennema	605b901637	Update cherry-pick hash in vendor README	2020-03-19 11:53:05 +01:00
Jelte Fennema	dc2a371d9f	Fix compilation issues with safestringlib Based on `92d7a40d1d`	2020-03-19 11:52:20 +01:00
Jelte Fennema	9a79935f1f	Update safestringlib	2020-03-19 11:52:20 +01:00
Jelte Fennema	6db7d87618	Compile safestringlib using regular configure This is needed to automatically generate .bc (bitcode) files when postgres is compiled with llvmjit support. It also has the advantage that cmake is not required for the build anymore.	2020-03-19 11:52:20 +01:00
Nils Dijk	6ff79c5ea9	Revert: Semmle: Protect against theoretical race in recursive d… (#3619 ) As discussed with @JelteF; #3559 caused consistent errors on BSD (OSX). Given a group of people use this environment to develop on it is an undesirable change. This reverts commit `ca8f7119fe`.	2020-03-18 13:48:05 +01:00
SaitTalhaNisanci	e5a2bbb2bd	Merge pull request #3557 from citusdata/enh/localExecutionCopy add local copy execution	2020-03-18 09:43:57 +03:00
SaitTalhaNisanci	2eaf7bba69	not use local copy if we are copying into intermediate results file We have special logic to copy into intermediate results and we use a custom format for that, "result" copy format. Postgres internally does not know this format and if we use this locally it will error saying that it does not know this format. Files are visible to all transactions, which means that we can use any connection to access files. In order to use the existing logic, it makes sense that in case we have intermediate results, which means we will write the results to a file, we preserve the same behavior, which is opening connections to localhost. Therefore if we have intermediate results we return false in ShouldExecuteCopyLocally.	2020-03-18 09:35:20 +03:00
SaitTalhaNisanci	9d2f3c392a	enable local execution in INSERT..SELECT and add more tests We can use local copy in INSERT..SELECT, so the check that disables local execution is removed. Also a test for local copy where the data size > LOCAL_COPY_FLUSH_THRESHOLD is added. use local execution with insert..select	2020-03-18 09:34:39 +03:00
SaitTalhaNisanci	42cfc4c0e9	apply review items log shard id in local copy and add more comments	2020-03-18 09:33:55 +03:00
SaitTalhaNisanci	c22068e75a	use the right partition for partitioned tables	2020-03-18 09:28:59 +03:00
SaitTalhaNisanci	1df9601e13	not use local copy if current transaction is connected to local group If current transaction is connected to local group we should not use local copy, because we might not see some of the changes that are made over the connection to the local group.	2020-03-18 09:28:59 +03:00
SaitTalhaNisanci	39bbec0f30	add tests for local copy execution	2020-03-18 09:28:59 +03:00
SaitTalhaNisanci	f9c4431885	add the support to execute copy locally A copy will be executed locally if - Local execution is enabled and current transaction accessed a local placement - Local execution is enabled and we are inside a transaction block. So even if local execution is enabled but we are not in a transaction block, the copy will not be run locally. This will not run locally: ``` COPY distributed_table FROM STDIN; .... ``` This will run locally: ``` SET citus.enable_local_execution to 'on'; BEGIN; COPY distributed_table FROM STDIN; COMMIT; .... ``` There are 3 ways to do a copy in postgres programmatically: - from a file - from a program - from a callback function I have chosen to implement it with a callback function, which means that we write the rows of copy from a callback function to the output buffer, which is used to insert tuples into the actual table. For each shard id, we have a buffer that keeps the current rows to be written, we perform the actual copy operation either when: - copy buffer for the given shard id reaches a threshold, which is currently 512KB - we reach the end of the copy The buffer size (512KB) is debatable. At a given time, we might allocate (local placement * buffer size) memory at most. The local copy uses the same copy format as remote copy, which means that we serialize the data in the same format as remote copy and send it locally. There was also the option to use ExecSimpleRelationInsert to insert slots one by one, which would avoid the extra serialization/deserialization, but based on some benchmarks it seems that using buffers is significantly better in terms of performance. You can see this comment for more details: https://github.com/citusdata/citus/pull/3557#discussion_r389499054	2020-03-18 09:28:59 +03:00
Jelte Fennema	99c5b0add7	Make building safestringlib on some distros easier (#3616 ) On some distros (e.g. Redhat 7) there is cmake version 2 and cmake version 3, safestringlib requires cmake version 3. On those distros the binary is called cmake3, so try to use that one before falling back to regular cmake binary.	2020-03-16 11:34:30 +01:00
Philip Dubé	f3d2265d80	Merge pull request #3614 from citusdata/copyobject-is-a-deepcopy multi_logical_optimizer: replace ListCopyDeep with copyObject	2020-03-13 17:14:44 +00:00
Philip Dubé	7b382e43bc	multi_logical_optimizer: replace ListCopyDeep with copyObject, stack allocate WorkerAggregateWalkerContext	2020-03-13 15:46:01 +00:00
Nils Dijk	e5237b9e20	Fix left join shard pruning (#3569 ) DESCRIPTION: Fix left join shard pruning in pushdown planner Due to #2481 which moves outer join planning through the pushdown planner we caused a regression on the shard pruning behaviour for outer joins. In the pushdown planner we make a union of the placement groups for all shards accessed by a query based on the filters we see during planning. Unfortunately implicit filters for left joins are not available during this part. This causes the inner part of an outer join to not prune any shards away. When we take the union of the placement groups it shows the behaviour of not having any shards pruned. Since the inner part of an outer query will not return any rows if the outer part does not contain any rows we have observed we do not have to add the shard intervals of the inner part of an outer query to the list of shard intervals to query. Fixes: #3512	2020-03-13 15:20:45 +01:00
Onur Tirtir	a14739f808	Local execution of ddl/drop/truncate commands (#3514 ) * reimplement ExecuteUtilityTaskListWithoutResults for local utility command execution * introduce new functions for local execution of utility commands * change ErrorIfTransactionAccessedPlacementsLocally logic for local utility command execution * enable local execution for TRUNCATE command on distributed & reference tables * update existing tests for local utility command execution * enable local execution for DDL commands on distributed & reference tables * enable local execution for DROP command on distributed & reference tables * add normalization rules for cascaded commands * add new tests for local utility command execution	2020-03-13 15:39:32 +03:00
Jelte Fennema	ca8f7119fe	Semmle: Protect against theoretical race in recursive directory… (#3559 ) In between stat at the start of the loop and unlink/rmdir at the end the item that the filename references might have changed. In some cases this can be a security bug, but since we only delete the file/directory it should not be for us as far as I can tell. It could in theory still cause errors though if a file is changed into a directory by some other process. This commit makes the code robust against that, by not using stat and only relying on error codes and retries.	2020-03-13 10:37:13 +01:00
SaitTalhaNisanci	77f96a1f87	retry vanilla tests if they fail once more (#3611 )	2020-03-12 12:50:06 +03:00
Jelte Fennema	c7aa6eddf3	Fix some bugs in string to int functions (#3602 ) This fixes 3 bugs: 1. `strtoul` never underflows, so that branch was useless 2. `strtoul` has ULONG_MAX instead of LONG_MAX when it overflows 3. `long` and `unsigned long` are not necessarily 64bit, they can be either more or less. So now `strtoll` and `strtoull` are used and 64 bit bounds are checked.	2020-03-11 23:03:02 +01:00
Jelte Fennema	c4cc26ed37	Semmle: Ensure stack memory is not leaked through uninitialized… (#3561 ) New stack memory can contain anything including passwords/private keys. In these functions we return structs that can have their padding bytes uninitialized. By first zeroing out the struct fully, we try to ensure that any data that is in these padding bytes is at least overwritten once. It might not be zero anymore after setting the fields, but at least it shouldn't be private data anymore.	2020-03-11 20:05:36 +01:00
Philip Dubé	7eb678f0f7	Merge pull request #3600 from citusdata/typecheck-agg-combine Add runtime type checking to AGGREGATE_CUSTOM_COMBINE helper functions	2020-03-11 17:31:17 +00:00
Philip Dubé	11b968bc30	Add runtime type checking to AGGREGATE_CUSTOM_COMBINE helper functions	2020-03-11 17:20:30 +00:00
Jelte Fennema	e0bbe1ca38	Semmle: Actively check one possible NULL deref case (#3560 ) Calling ErrorIfUnsupportedConstraint was still giving errors on Semmle. This makes sure that we check for NULL at runtime. This way we can safely ignore all errors created by this function.	2020-03-11 18:11:56 +01:00
Philip Dubé	468319e638	Merge pull request #3608 from citusdata/fix-non-pushdownable-agg-in-having Also check aggregates in havingQual when scanning for non pushdownable aggregates	2020-03-11 15:55:12 +00:00
Philip Dubé	4b68ee12c6	Also check aggregates in havingQual when scanning for non pushdownable aggregates Came across this while coming up with test cases, 'result "68_1" does not exist' I'll seek to address in a future PR, for now avoid segfault	2020-03-11 15:47:04 +00:00
Önder Kalacı	63ced3d901	Improve master evaluation tests (#3609 ) * Add third column to master_evaluation_modify table It was already added in some tests, but now make it globally applicable to the test file. * Add third column to master_evaluation_select table As we'll use the column in some tests * Add modify regression tests For the combinations of: local/remote, router/fast-path: - Distribution key is a const. - Contains a function - A column which is not dist. key is parametrized * Add select regression tests For the combinations of: local/remote, router/fast-path: - Distribution key is a const. - Contains a function - A column which is not dist. key is parametrized * Make some tests consistent to check-base	2020-03-11 15:38:08 +01:00
Önder Kalacı	afc942c6af	Remove non-adaptive test schedules (#3605 ) As we don't have any other executors to run them. These schedules were added when we had both the adaptive executor and the real-time/router executors in the code. Since we only have the adaptive executor now, we can remove these.	2020-03-11 09:58:49 +01:00
Önder Kalacı	f7f0fff304	Merge pull request #3604 from citusdata/prevent_worker_mx_create_dist_f Prevent create_distributed_function() from the workers	2020-03-11 09:47:41 +01:00
Onder Kalaci	7d787e3d5e	Prevent create_distributed_function() from the workers As this could cause weird edge cases.	2020-03-10 18:24:20 +01:00
Onur Tirtir	e902581cb6	implement DropTaskList before introducing local DROP table execution (#3603 )	2020-03-10 19:12:44 +03:00
Marco Slot	c26f99ea82	Merge pull request #3584 from citusdata/simplify_insert_logic Simplify INSERT logic in router planner	2020-03-10 16:45:38 +01:00
Marco Slot	cb3d90bdc8	Simplify INSERT logic in router planner	2020-03-10 15:54:40 +01:00
Philip Dubé	d0d51bb8c3	Merge pull request #3601 from citusdata/maintenanced-dont-proc-exit-from-term-handler maintenanced: Don't call proc_exit in SIGTERM handler	2020-03-10 13:50:48 +00:00
Philip Dubé	2b4ea33a2b	maintenanced: Don't call proc_exit in SIGTERM handler Instead set got_SIGTERM to true to signal mainloop to exit	2020-03-09 23:22:19 +00:00
Philip Dubé	877687cc64	Merge pull request #3568 from citusdata/fix-having-subquery-ref First phase of addressing HAVING subquery issues	2020-03-09 18:23:02 +00:00
Philip Dubé	81cfa05d3d	First phase of addressing HAVING subquery issues Add failing tests, make changes to avoid crashes at least Fix HAVING subquery pushdown ignoring reference table only subqueries, also include HAVING in recursive planning Given that we have a function IsDistributedTable which includes reference tables, it seems best to have IsDistributedTableRTE & QueryContainsDistributedTableRTE reflect that they do not include reference tables in their check Similarly SublinkList's name should reflect that it only scans WHERE contain_agg_clause asserts that we don't have SubLinks, use contain_aggs_of_level as suggested by pg sourcecode	2020-03-09 17:58:30 +00:00
Önder Kalacı	7793d19b71	Merge pull request #3578 from citusdata/fix_wrong_left_join Improve definition of RelationInfoContainsOnlyRecurringTuples	2020-03-09 17:42:51 +01:00
Onder Kalaci	2ed19181fe	Improve definition of RelationInfoContainsOnlyRecurringTuples Before this commit, we considered !ContainsRecurringRTE() enough for NotContainsOnlyRecurringTuples. However, instead, we can check for existence of any distributed table. DESCRIPTION: Fixes a bug that causes wrong results with complex outer joins	2020-03-09 17:28:33 +01:00
SaitTalhaNisanci	321d0152c1	add a utility to get shard oid from relation oid and shard id (#3596 )	2020-03-09 15:50:29 +03:00
SaitTalhaNisanci	4509d9a72b	Create a variable SLOW_START_DISABLED (#3593 ) When ExecutorSlowStartInterval is set to 0, it has a special meaning that we do not want to use slow start. Therefore, in the code we have checks such as ExecutorSlowStartInterval > 0 to understand if it is enabled or not. However, this is kind of subtle, and it creates an extra mapping in our mind. Therefore, I thought that using a variable for the special value removes the mapping and makes it easier to understand.	2020-03-09 14:54:01 +03:00
Hanefi Onaldi	2595b4864b	Remove all GetWorkerNodeCount() references As @onderkalaci suggested, remove the definition of GetWorkerNodeCount(), which can potentially cause misunderstandings. ActiveReadableWorkerNodeCount(), which returns the number of active primaries, is a safer alternative to GetWorkerNodeCount(), which returns the total number of workers including inactive nodes, primaries, and unavailable nodes. I introduced a bug #3556 and in the bugfix #3564 removed the single usage of said function	2020-03-09 13:35:18 +03:00
Philip Dubé	426b8ff1a9	Merge pull request #3592 from citusdata/rename-lookuplookup Rename LookupCitusTableCacheEntry to GetCitusTableCacheEntry, LookupLookupCitusTableCacheEntry back to LookupCitusTableCacheEntry	2020-03-08 14:20:24 +00:00
Philip Dubé	7cdfa1daab	Rename LookupCitusTableCacheEntry to GetCitusTableCacheEntry, LookupLookupCitusTableCacheEntry back to LookupCitusTableCacheEntry	2020-03-08 14:08:23 +00:00
Philip Dubé	70436ec279	Merge pull request #3587 from citusdata/fix-typos Fix typos, rename isDistributedRelation to isCitusRelation	2020-03-07 14:21:03 +00:00
Philip Dubé	a7cca1bcde	Rename DistTableCacheEntry to CitusTableCacheEntry	2020-03-07 14:08:03 +00:00
Philip Dubé	b514ab0f55	Fix typos, rename isDistributedRelation to isCitusRelation	2020-03-06 19:20:34 +00:00
Philip Dubé	00a7bc3044	Merge pull request #3586 from citusdata/rename-distributed-to-citus Try to create clear distinction between DistributedTable vs CitusTable	2020-03-06 19:10:16 +00:00
Philip Dubé	bec58000d6	Given IsDistributedTableRTE, there's ambiguity in what DistributedTable means Elsewhere we used DistributedTable to include reference tables Marco suggested we use CitusTable for distributed & reference tables So renaming: - IsDistributedTable -> IsCitusTable - IsDistributedTableViaCatalog -> IsCitusTableViaCatalog - DistributedTableCacheEntry -> CitusTableCacheEntry - DistributedTableList -> CitusTableList - isDistributedTable -> isCitusTable - InsertSelectIntoDistributedTable -> InsertSelectIntoCitusTable - ExtractFirstDistributedTableId -> ExtractFirstCitusTableId	2020-03-06 18:57:55 +00:00
Onur Tirtir	a381074787	Update CHANGELOG for 9.0.2 (#3585 ) (cherry picked from commit `de6068b2c4`) Co-authored-by: Hanefi Onaldi <hanefionaldi@gmail.com>	2020-03-06 18:26:04 +03:00
Marco Slot	fa29fb8c52	Merge pull request #3579 from citusdata/disable_postgres_parallelism Disable Postgres parallelism by default in tests	2020-03-06 15:54:57 +01:00
Onur Tirtir	50e59f1a61	Update CHANGELOG for 9.2.2 (#3582 ) Co-authored-by: Hanefi Onaldi <hanefionaldi@gmail.com>	2020-03-06 16:09:34 +03:00
Marco Slot	5b1d1dd413	Remove unnecessary use of max_parallel_workers_per_gather	2020-03-06 13:18:58 +01:00
Marco Slot	d0fead6691	Disable Postgres parallelism by default in tests	2020-03-06 13:18:58 +01:00
Onur Tirtir	c5007bc93c	Merge pull request #3563 from citusdata/refactor/local-group-id-and-fkey Refactor around foreign key constraints and GetLocalGroupId	2020-03-05 20:32:10 +03:00
Onur Tirtir	bdce9acc30	some refactor around foreign key constraints	2020-03-05 20:20:41 +03:00
Onur Tirtir	88bfd2e4b7	refactor around local group id checks Mostly optimizes the calls made to GetLocalGroupId and refactors its usages	2020-03-05 20:20:41 +03:00
Onur Tirtir	1e128a6ee4	fix a potential infinite loop	2020-03-05 20:20:41 +03:00
SaitTalhaNisanci	a75436a54b	refactor CoordinatedTransactionCallback (#3571 )	2020-03-05 18:36:12 +03:00
Hanefi Onaldi	6e6763678c	Merge pull request #3564 from citusdata/fix-early-exits-on-subplan-pruning Fix early exits on intermediate result pruning There are 2 problems with our early exit strategy that this commit fixes: 1- When we decide that a subplan results are sent to all worker nodes, we used to skip traversing the whole distributed plan, instead of skipping only the subplan. 2- We used to consider all available nodes in the cluster (secondaries and inactive nodes as well as active primaries) when deciding on early exit strategy. This resulted in failures to early exit when there are secondaries or inactive nodes.	2020-03-05 16:51:15 +03:00
Hanefi Onaldi	c0ad44f975	Fix early exit bug on intermediate result pruning There are 2 problems with our early exit strategy that this commit fixes: 1- When we decide that a subplan results are sent to all worker nodes, we used to skip traversing the whole distributed plan, instead of skipping only the subplan. 2- We used to consider all available nodes in the cluster (secondaries and inactive nodes as well as active primaries) when deciding on early exit strategy. This resulted in failures to early exit when there are secondaries or inactive nodes.	2020-03-05 16:41:44 +03:00
Marco Slot	241c186603	Merge pull request #3553 from citusdata/refactor/begin_scan Refactor CitusBeginScan into separate SELECT / DML paths	2020-03-05 13:06:10 +01:00
Onder Kalaci	f72916875f	Expand test coverage for combinations of master evaluation, deferred pruning, parameters, local execution - Router & Remote & Requires Master Evaluation & With Param & Without Param - Fast Path Router & Remote & Requires Master Evaluation & With Param & Without Param	2020-03-05 12:37:22 +01:00
Marco Slot	dc4c0c032e	Refactor CitusBeginScan into separate DML / SELECT paths	2020-03-05 12:37:22 +01:00
Nils Dijk	268ad741a9	Refactor the deparsing of a CREATE EXTENSION to prevent NULL POINTER dereferences (#3518 ) DESCRIPTION: satisfy static analysis tool for a nullptr dereference During the static analysis project on the codebase this code has been flagged as having the potential for a null pointer dereference. Funnily enough, the author had already made a comment in the code that this was not possible, due to us setting the schema name before we pass in the statement. If we want to reuse this code in a later setting this comment might not always apply and we could actually run into a null pointer dereference. This patch changes a bit of the code around to first of all make sure there is no NULL pointer dereference in this code anymore. Secondly we allow for better deparsing by setting and adhering to the `if_not_exists` flag on the statement. And finally we add support for all syntax described in the documentation of postgres (FROM was missing).	2020-03-04 16:47:07 +01:00
Önder Kalacı	9096c650f6	Merge pull request #3562 from citusdata/add_type_to_deparse For composite types, add cast to the parameter to ease remote node detect the type	2020-03-04 13:12:05 +01:00
Marco Slot	27f23d2c89	Add some distribution column = composite type prepared statement tests	2020-03-04 05:01:43 +01:00
Onder Kalaci	087f6eb4c0	For composite types, add cast to the parameter to ease remote node detect the type.	2020-03-04 11:27:45 +01:00
Onur Tirtir	c9c6e58c53	Merge pull request #3554 from citusdata/refactor/vacuum-and-local-executor Refactor vacuumTaskList function and local_executor.c line lengths	2020-03-02 12:04:55 +03:00
Onur Tirtir	ff9c9d1808	make VacuumTaskList even with other taskList functions and some safety changes Makes VacuumTaskList function even with other TaskList creator functions. Also, previously we were generating per-shard vacuum command strings via unconventional usage of StringInfo struct (setting the stringInfo->len field manually) which could cause unexpected memory errors (that I cannot foresee now).	2020-03-02 10:25:28 +03:00
Onur Tirtir	cf718ffe77	safely error out in DistributedTableCacheEntry function	2020-03-02 10:25:12 +03:00
Onur Tirtir	17d9b934c3	refactor local_executor.c lines with >78 characters	2020-02-29 15:04:34 +03:00
Philip Dubé	6dbb48c9f1	Merge pull request #3550 from citusdata/fix-generated-halfway Fix create_distributed_table on a table using GENERATED ALWAYS AS	2020-02-28 17:56:16 +00:00
Philip Dubé	34f241af16	Fix create_distributed_table on a table using GENERATED ALWAYS AS If the generated column does not come at the end of the column list, columnNameList doesn't line up with the column indexes. Seek past CREATE TABLE test_table ( test_id int PRIMARY KEY, gen_n int GENERATED ALWAYS AS (1) STORED, created_at TIMESTAMPTZ NOT NULL DEFAULT now() ); SELECT create_distributed_table('test_table', 'test_id'); Would raise ERROR: cannot cast 23 to 1184	2020-02-28 09:34:26 -08:00
Philip Dubé	2fae132e45	repartition_join_execution: Don't store 64 bit integers as poin… (#3551 ) Pointers are not necessarily 64bit	2020-02-28 15:06:06 +01:00
Philip Dubé	20abc4d2b5	Replace foreach with foreach_ptr/foreach_oid (#3544 )	2020-02-27 16:54:49 +01:00
Philip Dubé	99589de5f9	Merge pull request #3545 from citusdata/make-implicit-cell-harder Make bad refactors to foreach_xxx error out	2020-02-27 13:16:59 +00:00
Jelte Fennema	c48f0ca7e5	Make bad refactors to foreach_xxx error out Without this commit you could still use varCell in the body of the loop. This makes it easy for bad refactors that still use the ListCell to slip through unnoticed, because the new ListCell will be named the same as the one used in the old code. By renaming the ListCell to varCellDoNotUse this will not happen.	2020-02-27 10:59:45 +01:00
Jelte Fennema	685b54b3de	Semmle: Check for NULL in some places where it might occur (#3509 ) Semmle reported quite some places where we use a value that could be NULL. Most of these are not actually a real issue, but better to be on the safe side with these things and make the static analysis happy.	2020-02-27 10:45:29 +01:00
Jelte Fennema	f6a89bcd12	Merge pull request #3541 from citusdata/jelte_fix We got some errors for safestringlib builds on OSX. The fixes are as follows: 1. Change name of memset_s to memset8_s 2. Disable some linker flags on OSX 3. Reorder warning flags so -Wall does not override an ignore for clang Also adds clean-full target to clean our compiled code and also vendored artifacts. Usually it's not needed to clean vendored artifacts once they are built correctly, so they are not cleaned during regular clean to keep full recompiles of our code faster.	2020-02-26 17:59:03 +01:00
Jelte Fennema	0cad263c82	Add new vendor README update instructions	2020-02-26 17:46:37 +01:00
Jelte Fennema	eb8e099f09	Fix Makefile so that it builds safestringlib correctly on OSX	2020-02-26 17:44:44 +01:00
Jelte Fennema	8e7eaaf949	Add clean-full to also clean full builds of vendored libraries	2020-02-26 17:44:44 +01:00
Jelte Fennema	92d7a40d1d	Fix safestringlib build on OSX	2020-02-26 17:44:44 +01:00
Hadi Moshayedi	8ca55c739f	Merge pull request #3543 from citusdata/MarkusSintonen-improve-shard-pruning (MarkusSintonen) Improve shard pruning logic to understand OR-conditions	2020-02-26 07:37:26 -08:00
Hadi Moshayedi	e7cce40e6e	Address pykello's feedback	2020-02-26 07:17:32 -08:00
Hadi Moshayedi	1b3e58f0c3	Merge branch 'improve-shard-pruning' of https://github.com/MarkusSintonen/citus into MarkusSintonen-improve-shard-pruning	2020-02-26 07:13:33 -08:00
SaitTalhaNisanci	82d22b34fe	create temp schemas in parallel (#3540 )	2020-02-26 16:20:08 +03:00
SaitTalhaNisanci	d94c3fd43d	send repartition cleanup jobs in parallel to all workers (#3485 ) * send repartition cleanup jobs in parallel to all workers * add review items	2020-02-26 13:44:06 +03:00
Marco Slot	1b6020e2d6	Merge pull request #3539 from citusdata/unlogged_merge_tables Make merge tables during re-partitioning unlogged	2020-02-26 10:55:15 +01:00
Marco Slot	c7f123947e	Make merge tables during re-partitioning unlogged	2020-02-26 10:46:07 +01:00
Jelte Fennema	5d601bb45a	Merge pull request #3465 from citusdata/safestringlib Use safestringlib for safe buffer interaction	2020-02-25 16:33:44 +01:00
Jelte Fennema	62bf571ced	Make SafeSnprintf work on PG11	2020-02-25 15:39:27 +01:00
Jelte Fennema	7d24cebc80	Add pg11 snprintf file to repo for use in pg11 when it's not compiled	2020-02-25 15:39:27 +01:00
Jelte Fennema	8de8b62669	Convert unsafe APIs to safe ones	2020-02-25 15:39:27 +01:00
Jelte Fennema	b7841267dc	vendor github.com/intel/safestringlib	2020-02-25 15:39:27 +01:00
Nils Dijk	a77ed9cd23	Refactor master query to be planned by postgres' planner (#3326 ) DESCRIPTION: Replace the query planner for the coordinator part with the postgres planner Closes #2761 Citus had a simple rule based planner for the query executed on the query coordinator. This planner grew over time with the addition of SQL support till it was getting close to the functionality of the postgres planner. Except the code was brittle and its complexity rose which made it hard to add new SQL support. Given its resemblance with the postgres planner it was a long outstanding wish to replace our hand crafted planner with the well supported postgres planner. This patch replaces our planner with a call to postgres' planner. Due to the functionality of the postgres planner we needed to support both projections and filters/quals on the citus custom scan node. When a sort operation is planned above the custom scan it might require fields to be reordered in the custom scan before returning the tuple (projection). The postgres planner assumes every custom scan node implements projections. Because we controlled the plan that was created we prevented reordering in the custom scan and had never implemented it before. A similar optimisation applies to having clauses that could have been where clauses. Instead of applying the filter as a having on the aggregate it will push it down into the plan which could reach a custom scan node. For both filters and projections we have implemented them when tuples are read from the tuple store. If no projections or filters are required it will directly return the tuple from the tuple store. Otherwise it will loop tuples from the tuple store through the filter and projection until a tuple is found and returned. Besides filters being pushed down a side effect of having quals that could have been a where clause is that a call to read intermediate result could be called before the first tuple is fetched from the custom scan.
This failed because the intermediate result would only be pulled to the coordinator on the first tuple fetch. To overcome this problem we do run the distributed subplans now before we run the postgres executor. This ensures the intermediate result is present on the coordinator in time. We do account for total time instrumentation by removing the instrumentation before handing control to the postgres executor and update the timings ourselves. For future SQL support it is enough to create a valid query structure for the part of the query to be executed on the query coordinating node. As a utility we do serialise and print the query at debug level4 for engineers to inspect what kind of query is being planned on the query coordinator.	2020-02-25 14:39:56 +01:00
Philip Dubé	0c4f9e230d	Merge pull request #3453 from citusdata/fix-stray-files Fix multi_task_string_size sometimes leaking intermediate files	2020-02-24 17:36:08 +00:00
Philip Dubé	025cb94159	Fix multi_task_string_size sometimes leaking intermediate files	2020-02-24 16:33:34 +00:00
Onur Tirtir	2e096d4eb9	Merge pull request #3531 from citusdata/refactor/utility-local Refactor some pieces of code before implementing local drop & truncate execution	2020-02-24 18:35:07 +03:00
Onur Tirtir	873e9fd604	Refactor DropShards before introducing local DROP execution	2020-02-24 17:52:20 +03:00
Onur Tirtir	3c99db40b9	Some small typos & cleanup	2020-02-24 16:37:55 +03:00
Jelte Fennema	2a9fccc7a0	Remove READFUNCs (#3536 ) We don't actually use these functions anymore since merging #1477. Advantages of removing: 1. They add work whenever we add a new node. 2. They contain some usage of stdlib APIs that are banned by Microsoft. Removing it means we don't have to replace those with safe ones.	2020-02-24 12:43:28 +01:00
Philip Dubé	c291fd5d11	Merge pull request #3522 from citusdata/fix-flaky-multi-extension-2 Address a couple issues with maintenance daemon management	2020-02-21 17:10:25 +00:00
Philip Dubé	bcf54c5014	Address a couple issues with maintenance daemon management: - Stop the daemon when citus extension is dropped - Bail on maintenance daemon startup if myDbData is started with a non-zero pid - Stop maintenance daemon from spawning itself - Don't use postgres die, just wrap proc_exit(0) - Assert(myDbData->workerPid == MyProcPid) The two issues were that multiple daemons could be running for a database, or that a daemon would be leftover after DROP EXTENSION citus	2020-02-21 16:49:01 +00:00
Nils Dijk	6ee82c381e	Add missing pieces for version bump of #3482 (#3523 )	2020-02-21 12:35:29 +01:00
Jelte Fennema	00d667c41d	Semmle: Fix obvious issues (#3502 ) Fixes some obvious issues found by the Semmle static analysis tool.	2020-02-21 10:16:00 +01:00
Onur Tirtir	2c51057013	Merge pull request #3521 from citusdata/null-relationname Fix null relation name due to DROP on distributed table in a transaction block	2020-02-20 10:18:44 +03:00
Onur Tirtir	926a1a61b9	change "relation" with "table" in error messages related with foreign keys on reference tables	2020-02-20 09:58:47 +03:00
Onur Tirtir	001089783c	Fix null relation name issue in CheckConflictingRelationAccesses	2020-02-19 19:10:35 +03:00
Philip Dubé	d66f011f71	Merge pull request #3494 from citusdata/use-instr-time Prefer instr_time to TimestampTz when we want CLOCK_MONOTONIC	2020-02-19 00:42:52 +00:00
Philip Dubé	52042d4a00	Prefer instr_time to TimestampTz when we want CLOCK_MONOTONIC	2020-02-19 00:34:17 +00:00
Philip Dubé	36bb85e5c0	Merge pull request #3495 from citusdata/fix-information-schema-join Add test for issue 2717, does not reproduce issue	2020-02-19 00:33:38 +00:00
Philip Dubé	d7a4ffdc46	Add test for issue, does not reproduce issue	2020-02-18 23:45:17 +00:00
Philip Dubé	6d6ef54775	Merge pull request #3517 from citusdata/fix-typos Fix typos	2020-02-18 23:43:56 +00:00
Philip Dubé	08f6842d50	Fix typos Equivalance -> Equivalence utillity -> utility shorted lived one -> shortly lived one elegible -> eligible	2020-02-18 17:14:40 +00:00
Marco Slot	3e7d4fd739	Merge pull request #3361 from citusdata/copy_out Implement direct COPY table TO stdout	2020-02-17 17:34:29 +01:00
Marco Slot	038e5999cb	Implement direct COPY table TO stdout	2020-02-17 15:15:10 +01:00
Jelte Fennema	3f7c5a5cf6	Semmle: Fix possible infinite loops caused by overflow (#3503 ) Comparison between differently sized integers in loop conditions can cause infinite loops. This can happen when doing something like this: ```c int64 very_big = MAX_INT32 + 1; for (int32 i = 0; i < very_big; i++) { // do something } // never reached because i overflows before it can reach the value of very_big ```	2020-02-17 14:35:10 +01:00
Jelte Fennema	15f1173b1d	Semmle: Ensure permissions of private keys are 0600 (#3506 ) When using --allow-group-access option from initdb our keys and certificates would be created with 0640 permissions. Which is a pretty serious security issue: This changes that. This would not be exploitable though, since postgres would not actually enable SSL and would output the following message in the logs: ``` DETAIL: File must have permissions u=rw (0600) or less if owned by the database user, or permissions u=rw,g=r (0640) or less if owned by root. ``` Since citus still expected the cluster to have SSL enabled handshakes between workers and coordinator would fail. So instead of a security issue the cluster would simply be unusable.	2020-02-17 12:58:40 +01:00
SaitTalhaNisanci	2b08916f93	Merge pull request #3491 from citusdata/refactor/runDistributedExecution refactor RunDistributedExecution	2020-02-17 14:27:14 +03:00
SaitTalhaNisanci	9302e6e699	apply review items	2020-02-17 14:16:49 +03:00
SaitTalhaNisanci	1b78045867	rename AssignTasksToConnections with AssignTasksToConnectionsOrWorkerPool	2020-02-17 14:16:20 +03:00
SaitTalhaNisanci	355805c7d8	create ProcessWaitEvents for separating the logic of handling events	2020-02-17 14:16:20 +03:00
SaitTalhaNisanci	c35981f9de	create UpdateWaitEventSet for better readability	2020-02-17 14:16:20 +03:00
SaitTalhaNisanci	a7e735a648	use a utility method to get event size	2020-02-17 14:16:20 +03:00
SaitTalhaNisanci	71f1aa48a3	remove unnecessary if check (#3500 )	2020-02-17 14:15:36 +03:00
Markus Sintonen	099e266a6c	Force task executor	2020-02-16 01:32:52 +02:00
Markus Sintonen	cf8319b992	Add comment, add subquery NOT tests	2020-02-16 01:21:10 +02:00
Markus Sintonen	3d3d615040	Add comment about NOT_EXPR. Treat it as invalid constraint for safety.	2020-02-15 16:54:38 +02:00
Philip Dubé	7382c8be00	Clean up from code review Only change to behavior is: - don't ignore array const's constcollid in SAORestrictions - don't end lines with commas in DebugLogPruningInstance	2020-02-14 17:58:23 +00:00
Markus Sintonen	cdedb98c54	Improve shard pruning logic to understand OR-conditions. Previously a limitation in the shard pruning logic caused multi distribution value queries to always go into all the shards/workers whenever query also used OR conditions in WHERE clause. Related to https://github.com/citusdata/citus/issues/2593 and https://github.com/citusdata/citus/issues/1537 There was no good workaround for this limitation. The limitation caused quite a bit of overhead with simple queries being sent to all workers/shards (especially with setups having a lot of workers/shards). An example of a previous plan which was inadequately pruned: ``` EXPLAIN SELECT count(*) FROM orders_hash_partitioned WHERE (o_orderkey IN (1,2)) AND (o_custkey = 11 OR o_custkey = 22); QUERY PLAN --------------------------------------------------------------------- Aggregate (cost=0.00..0.00 rows=0 width=0) -> Custom Scan (Citus Adaptive) (cost=0.00..0.00 rows=0 width=0) Task Count: 4 Tasks Shown: One of 4 -> Task Node: host=localhost port=xxxxx dbname=regression -> Aggregate (cost=13.68..13.69 rows=1 width=8) -> Seq Scan on orders_hash_partitioned_630000 orders_hash_partitioned (cost=0.00..13.68 rows=1 width=0) Filter: ((o_orderkey = ANY ('{1,2}'::integer[])) AND ((o_custkey = 11) OR (o_custkey = 22))) (9 rows) ``` After this commit the task count is what one would expect from the query defining multiple distinct values for the distribution column: ``` EXPLAIN SELECT count(*) FROM orders_hash_partitioned WHERE (o_orderkey IN (1,2)) AND (o_custkey = 11 OR o_custkey = 22); QUERY PLAN --------------------------------------------------------------------- Aggregate (cost=0.00..0.00 rows=0 width=0) -> Custom Scan (Citus Adaptive) (cost=0.00..0.00 rows=0 width=0) Task Count: 2 Tasks Shown: One of 2 -> Task Node: host=localhost port=xxxxx dbname=regression -> Aggregate (cost=13.68..13.69 rows=1 width=8) -> Seq Scan on orders_hash_partitioned_630000 orders_hash_partitioned (cost=0.00..13.68 rows=1 width=0) Filter: ((o_orderkey = ANY ('{1,2}'::integer[])) AND ((o_custkey = 11) OR (o_custkey = 22))) (9 rows) ``` "Core" of the pruning logic works as previously where it uses `PrunableInstances` to queue ORable valid constraints for shard pruning. The difference is that now we build a compact internal representation of the query expression tree with PruningTreeNodes before actual shard pruning is run. Pruning tree nodes represent boolean operators and their associated constraints. This internal format allows us to have a compact representation of the query WHERE clauses which allows "core" pruning logic to work with OR-clauses correctly. For example, a query having `WHERE (o_orderkey IN (1,2)) AND (o_custkey=11 OR (o_shippriority > 1 AND o_shippriority < 10))` gets transformed into: 1. AND(o_orderkey IN (1,2), OR(X, AND(X, X))) 2. AND(o_orderkey IN (1,2), OR(X, X)) 3. AND(o_orderkey IN (1,2), X) Here X is any set of unknown condition(s) for shard pruning. This allows the final shard pruning to correctly recognize that shard pruning is done with the valid condition of `o_orderkey IN (1,2)`. Another example, with an unprunable condition: a query having `WHERE (o_orderkey IN (1,2)) OR (o_custkey=11 AND o_custkey=22)` gets transformed into: 1. OR(o_orderkey IN (1,2), AND(X, X)) 2. OR(o_orderkey IN (1,2), X) Which is recognized as unprunable due to the OR condition between distribution column and unknown constraint -> goes to all shards. Issue https://github.com/citusdata/citus/issues/1537 originally suggested transforming the query conditions into a full disjunctive normal form (DNF), but this process of transforming into DNF is quite a heavy operation. It may "blow up" into a really large DNF form with complex queries having non trivial `WHERE` clauses. I think the logic for shard pruning could be simplified further but I decided to leave the "core" of the shard pruning untouched.	2020-02-14 17:58:13 +00:00
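The tree reduction described in the commit above can be illustrated with a small sketch. This is not the Citus implementation (which is C and built around `PruningTreeNode`); the `Node` class, `simplify`, and `is_prunable` below are hypothetical names, and the sketch assumes the caller has already replaced every condition that does not constrain the distribution column with the placeholder `X`. It also collapses one step further than the commit message shows, returning the lone usable constraint instead of `AND(constraint, X)`.

```python
from dataclasses import dataclass
from typing import List, Union

UNKNOWN = "X"  # placeholder for any condition that cannot drive shard pruning


@dataclass
class Node:
    op: str  # "AND" or "OR"
    children: List[Union["Node", str]]


def _is_unknown(item) -> bool:
    return isinstance(item, str) and item == UNKNOWN


def simplify(node):
    """Collapse unknown conditions bottom-up (hypothetical helper)."""
    if isinstance(node, str):
        return node
    kids = [simplify(child) for child in node.children]
    known = [k for k in kids if not _is_unknown(k)]
    if node.op == "AND":
        # An unknown conjunct can only narrow the result, so it is safe to drop.
        if not known:
            return UNKNOWN
    elif len(known) != len(kids):
        # OR with an unknown branch may match any shard: the whole OR is unknown.
        return UNKNOWN
    return known[0] if len(known) == 1 else Node(node.op, known)


def is_prunable(node) -> bool:
    return not _is_unknown(simplify(node))
```

With the two examples from the commit message, `AND(o_orderkey IN (1,2), OR(X, AND(X, X)))` reduces to the single usable constraint, while `OR(o_orderkey IN (1,2), AND(X, X))` reduces to `X` and therefore goes to all shards.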
Jelte Fennema	3d8efe303e	Fix flaky test introduced by #3374 (#3504 ) Since #3374 multi_utilities is not safe to run in parallel anymore. This is because it now also shows locks on shards created outside its own test. This is not really possible to fix. Example of flaky test: - https://circleci.com/gh/citusdata/citus/89995 - https://circleci.com/gh/citusdata/citus/90017	2020-02-14 16:07:33 +01:00
Jelte Fennema	5ef3e83ce4	Make multi_utilities test take 2 seconds instead of 20 (#3507 ) On worker 2 it was waiting for dustbunnies_990001 to be vacuumed/analyzed. This table doesn't actually exist, so that never happened. Now it waits for the correct table and throws an error if it waits more than 10 seconds.	2020-02-14 15:38:51 +01:00
Onur Tirtir	e4dd5ac2ad	Update CHANGELOG for 9.2.1 (#3501 )	2020-02-14 11:18:40 +03:00
SaitTalhaNisanci	72d1850b4e	enhance local executor description (#3499 )	2020-02-13 20:19:08 +03:00
Önder Kalacı	6225f37b91	Merge pull request #3498 from citusdata/fix_null_crash Do not prune shards if the distribution key is NULL	2020-02-13 17:21:31 +01:00
Onder Kalaci	975c4c2264	Do not prune shards if the distribution key is NULL The root of the problem is that, standard_planner() converts the following qual ``` {OPEXPR :opno 98 :opfuncid 67 :opresulttype 16 :opretset false :opcollid 0 :inputcollid 100 :args ( {VAR :varno 1 :varattno 1 :vartype 25 :vartypmod -1 :varcollid 100 :varlevelsup 0 :varnoold 1 :varoattno 1 :location 45 } {CONST :consttype 25 :consttypmod -1 :constcollid 100 :constlen -1 :constbyval false :constisnull true :location 51 :constvalue <> } ) :location 49 } ``` To ``` ( {CONST :consttype 16 :consttypmod -1 :constcollid 0 :constlen 1 :constbyval true :constisnull true :location -1 :constvalue <> } ) ``` So, Citus doesn't deal with NULL values in real-time or non-fast path router queries. And, in the FastPathRouter planner, we check constisnull in DistKeyInSimpleOpExpression(). However, in deferred pruning case, we do not check for isnull for const. Thus, the fix consists of two parts: - Let PruneShards() not crash when NULL parameter is passed - For deferred shard pruning in fast-path queries, explicitly check that we have CONST which is not NULL	2020-02-13 15:00:31 +01:00
Onur Tirtir	cd8210d516	Bump citus version to 9.3devel (#3482 )	2020-02-13 16:22:05 +03:00
Hadi Moshayedi	fc1fe0244e	Merge pull request #3488 from citusdata/fix-typos Fix typos noticed while reading through code trying to understand HAVING	2020-02-11 15:40:13 -08:00
Philip Dubé	3a906b8210	Fix typos noticed while reading through code trying to understand HAVING	2020-02-11 19:55:10 +00:00
Onur Tirtir	ab0b49db82	fix uninitialized variable warning (#3483 )	2020-02-11 15:44:31 +01:00
Onur Tirtir	e660f4f854	Add changelog entry for 9.2.0 (#3463 )	2020-02-10 11:03:39 +03:00
Onur Tirtir	39df51e903	Introduce objects to dist. infrastructure when updating Citus (#3477 ) Mark existing objects that are not included in distributed object infrastructure in older versions of Citus (but now should be) as distributed, after updating Citus successfully.	2020-02-07 18:07:59 +03:00
Nils Dijk	d5433400f9	Fix: Unnecessary repartition on joins with more than 4 tables (#3473 ) DESCRIPTION: Fix unnecessary repartition on joins with more than 4 tables In 9.1 we have introduced support for all CH-benCHmark queries by widening our definitions of joins to include joins with expressions in them. This had the undesired side effect of Q5 regressing on its plan by implementing a repartition join. It turned out this regression was not directly related to widening of the join clause, nor the schema employed by CH-benCHmark. Instead it had to do with 4 or more tables being joined in a chain. A chain meaning: ```sql SELECT * FROM a,b,c,d WHERE a.part = b.part AND b.part = c.part AND .... ``` Due to how our join order planner was implemented it would only keep track of 1 of the partition columns when comparing if the join could be executed locally. This manifested in a join chain of 4 tables to _always_ be executed as a repartition join. 3 tables joined in a chain would have the middle table shared by the two outer tables causing the local join possibility to be found. With this patch we keep a unique list (or set) of all partition columns participating in the join. When a candidate table is checked for a possibility to execute a local join it will check if there is any partition column in that set that matches an equality join clause on the partition column of the candidate table. By taking into account all partition columns in the left relation it will now find the local join path on >= 4 tables joined in a chain. fixes: #3276	2020-02-06 15:07:07 +01:00
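The join-chain behaviour described in the commit above can be reproduced with a toy model. This is a sketch under stated assumptions, not the Citus join order planner: `chain_joins_locally` and its parameters are hypothetical, every table is assumed to have one partition column, and `edges` holds only the explicit equality clauses between partition columns. With `track_all=False` the model remembers only the starting table's partition column (the old behaviour as described); with `track_all=True` it keeps the set of all partition columns already joined.

```python
from typing import FrozenSet, List, Set


def chain_joins_locally(tables: List[str],
                        edges: Set[FrozenSet[str]],
                        track_all: bool) -> bool:
    """Return True if some join order joins every table without repartitioning."""
    for start in tables:
        tracked = {start}  # partition columns known on the left side of the join
        remaining = [t for t in tables if t != start]
        progress = True
        while remaining and progress:
            progress = False
            for candidate in remaining:
                # A local join needs an equality clause between the candidate's
                # partition column and one of the tracked partition columns.
                if any(frozenset((t, candidate)) in edges for t in tracked):
                    remaining.remove(candidate)
                    if track_all:
                        tracked.add(candidate)
                    progress = True
                    break
        if not remaining:
            return True
    return False


def chain(tables: List[str]) -> Set[FrozenSet[str]]:
    """Equality clauses a.part = b.part AND b.part = c.part AND ..."""
    return {frozenset(pair) for pair in zip(tables, tables[1:])}
```

In this model a chain of three tables finds a local plan either way (starting from the middle table), while a chain of four only does so when all partition columns are tracked.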
Philip Dubé	345455d765	Merge pull request #3461 from citusdata/fix-adaptive-repartition-join-leak Fix adaptive repartition join leak	2020-02-05 17:43:23 +00:00
Philip Dubé	ecad4aa5e6	Fill in jobIdList field of DistributedExecution Pass down jobIdList from ExecuteTasksInDependencyOrder Also clean up comment for ExecuteTaskListOutsideTransaction	2020-02-05 17:32:22 +00:00
Philip Dubé	c252811884	dont: don't, wont: won't, acylic: acyclic	2020-02-05 17:32:22 +00:00
Halil Ozan Akgül	fff3866844	Merge pull request #3472 from citusdata/grant_on_public_schema Fixes the bug of grants on public schema propagation	2020-02-05 18:40:45 +03:00
Halil Ozan Akgul	8ce4f20061	Fixes the bug of grants on public schema propagation	2020-02-05 18:05:58 +03:00
SaitTalhaNisanci	89dc7d5e41	remove outdated information in citus upgrade readme (#3471 )	2020-02-05 13:31:02 +03:00
Marco Slot	8c972dc614	Merge pull request #3470 from citusdata/insert_select_issue Rename discarded target list items in repartitioned INSERT/SELECT	2020-02-05 11:21:05 +01:00
Marco Slot	64ca5c9acb	Add additional INSERT..SELECT repartition tests	2020-02-05 11:06:44 +01:00
Hadi Moshayedi	9dd14fa90d	Rename discarded target list items in repartitioned INSERT/SELECT	2020-02-05 11:06:44 +01:00
Önder Kalacı	1aa89d3242	Merge pull request #3467 from citusdata/fix_crash_numeric Improve single hash-repartitioning with numeric (or non-int) types	2020-02-05 09:12:41 +01:00
Onder Kalaci	c7e2309f4c	Improve single hash-repartitioning with numeric (or non-int) types We used to treat the shard interval array that we passed as numeric[]. However, it should be int[], as the shard ranges are int[].	2020-02-04 20:30:04 +01:00
Hadi Moshayedi	3826e81056	Merge pull request #3460 from citusdata/fix_permissions Create merge task temporary schemas with current user	2020-02-04 10:05:02 -08:00
Hadi Moshayedi	bc1a800f70	Use current user for repartition join temp schemas. Otherwise when using a less privileged user we might get errors when trying to create the schema.	2020-02-04 09:48:20 -08:00
Hadi Moshayedi	13d27cb280	Merge pull request #3451 from citusdata/insert_select_partitioned_joins_2 Don't error out when subquery in INSERT/SELECT is not router plannable.	2020-02-03 13:29:04 -08:00
Hadi Moshayedi	890e23e734	Update multi_insert_select_non_pushable_queries	2020-02-03 13:13:30 -08:00
Hadi Moshayedi	5818bcd27e	Update with_dml	2020-02-03 13:13:30 -08:00
Hadi Moshayedi	46f60e1ac0	Update multi_insert_select_conflict	2020-02-03 13:13:30 -08:00
Hadi Moshayedi	05f58c9ec5	Update multi_insert_select	2020-02-03 13:13:30 -08:00
Hadi Moshayedi	264530311a	Don't use distributed insert/select for repartitioned joins	2020-02-03 13:13:30 -08:00
Marco Slot	2e8c118a8f	Make connection assignment more liberal after parallel join wit… (#3456 ) Make connection assignment more liberal after parallel join with reference table	2020-02-03 20:11:20 +01:00
Onder Kalaci	8be1b0112d	Add failure test for parallel reference table join	2020-02-03 19:35:07 +01:00
Marco Slot	be77d3304f	Fixup	2020-02-03 11:59:55 +01:00
Marco Slot	a6bd6c657e	Add tests that exercise parallel reference table join logic	2020-02-03 11:54:29 +01:00
Marco Slot	b0fd6aa006	If a reference table was read over multiple connections, do not assign a connection	2020-02-03 11:54:29 +01:00
Önder Kalacı	508b392304	Merge pull request #3454 from citusdata/recursively_check_params Make sure to recursively go into the functions to search for PARAMs	2020-02-03 11:27:12 +01:00
Onder Kalaci	2f274a4fce	Make sure to go deeper into the functions to search for PARAMs For example, a PARAM might reside inside a function just because of a casting of a type such as the following: ``` {FUNCEXPR :funcid 1740 :funcresulttype 1700 :funcretset false :funcvariadic false :funcformat 2 :funccollid 0 :inputcollid 0 :args ( {PARAM :paramkind 0 :paramid 15 :paramtype 23 :paramtypmod -1 :paramcollid 0 :location 356 } ) ``` We should recursively check the expression before bailing out.	2020-02-03 09:36:12 +01:00
Hadi Moshayedi	1adc293286	Merge pull request #3450 from citusdata/fix-ci-locale-issues diff-filter: use utf8 encoding, not ascii	2020-01-30 21:57:18 -08:00
Philip Dubé	db2eac5658	diff-filter: use utf8 encoding, not ascii	2020-01-31 00:03:17 +00:00
Hadi Moshayedi	b0f9f94a52	Merge pull request #3448 from citusdata/insert_select_leak Add insert/select connection leak tests	2020-01-30 14:20:14 -08:00
Hadi Moshayedi	9d988b3437	Add insert/select connection leak tests	2020-01-30 14:09:07 -08:00
Philip Dubé	461facb149	Merge pull request #3447 from citusdata/fix-group-by-distribution-no-group-by Intermediate row pull up should be false whenever we can fully push down grouping	2020-01-30 21:31:34 +00:00
Philip Dubé	d43c80d4d8	pullUpIntermediateRows should not be true when groupedByDisjointPartitionColumn is true This was causing 'SELECT id, stddev(y_int) FROM tbl GROUP BY id' to push down stddev without group by	2020-01-30 21:18:08 +00:00
Philip Dubé	d7204c9696	Merge pull request #3423 from citusdata/remove-directory-even-if-new-files-added CitusRemoveDirectory: loop when directory is not empty	2020-01-30 20:21:47 +00:00
Philip Dubé	84a500ffc6	CitusRemoveDirectory: loop when directory is not empty Sometimes during errors workers will create files while we're deleting intermediate directories example: DEBUG: could not remove file "base/pgsql_job_cache/10_0_431": Directory not empty DETAIL: WARNING from localhost:57637	2020-01-30 20:02:08 +00:00
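The retry idea in the commit above (keep looping while the directory refills during deletion) can be sketched as follows. This is an illustration only, not the `CitusRemoveDirectory` C code: `remove_directory` and `max_attempts` are hypothetical, and treating an already missing path as success is an assumption of this sketch.

```python
import errno
import os


def remove_directory(path: str, max_attempts: int = 10) -> None:
    """Delete path recursively, retrying if new files appear meanwhile."""
    for _ in range(max_attempts):
        try:
            entries = os.listdir(path)
        except FileNotFoundError:
            return  # someone else already removed it
        for name in entries:
            child = os.path.join(path, name)
            try:
                if os.path.isdir(child) and not os.path.islink(child):
                    remove_directory(child, max_attempts)
                else:
                    os.unlink(child)
            except FileNotFoundError:
                pass  # vanished between listdir and removal
        try:
            os.rmdir(path)
            return
        except FileNotFoundError:
            return
        except OSError as exc:
            # A concurrent writer added a file after we listed the directory:
            # go around again instead of failing.
            if exc.errno not in (errno.ENOTEMPTY, errno.EEXIST):
                raise
    raise OSError(errno.ENOTEMPTY, "directory kept refilling", path)
```

Bounding the number of attempts is a design choice of the sketch so that a writer that never stops cannot make the cleanup spin forever.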
Philip Dubé	6b43fab325	Merge pull request #3406 from citusdata/fix-limit-approx Expand the set of aggregates which cannot have LIMIT approximated	2020-01-30 18:00:40 +00:00
Philip Dubé	5fccc56d3e	Expand the set of aggregates which cannot have LIMIT approximated Previously we only prevented AVG from being pushed down, but this is incorrect: - array_agg, while somewhat nonsensical to order by, will potentially be missing values - combinefunc aggregation will raise errors about cstrings not being comparable (while we also can't know if the aggregate is commutative) This commit limits approximating LIMIT pushdown when ordering by aggregates to: min, max, sum, count, bit_and, bit_or, every, any Which means of those we previously supported, we now exclude: avg, array_agg, jsonb_agg, jsonb_object_agg, json_agg, json_object_agg, hll_add, hll_union, topn_add, topn_union	2020-01-30 17:45:18 +00:00
Önder Kalacı	8584cb005b	Do not evaluate functions on the coordinator for SELECT queries (#3440 ) Previously, the logic for evaluating the functions and the parameters was the same. That ended up evaluating the functions inaccurately on the coordinator. Instead, split the function evaluation logic from parameter evaluation logic.	2020-01-30 08:47:28 +01:00
Önder Kalacı	e9c17b71a4	Add missing ORDER BY (#3441 ) As it causes some random failures	2020-01-29 17:36:32 +01:00
Önder Kalacı	412fe719f7	Hide citus.enable_ddl_propagation setting (#3437 ) As it is powerful and can cause metadata inconsistency. See the following steps: (Note that we cannot use PGC_SUSET because on Citus MX we need this flag for non- superusers as well) ```SQL CREATE TABLE test_ref_table(key int); SELECT create_reference_table('test_ref_table'); SELECT logicalrelid, logicalrelid::oid FROM pg_dist_partition; ┌────────────────┬──────────────┐ │ logicalrelid │ logicalrelid │ ├────────────────┼──────────────┤ │ test_ref_table │ 16831 │ └────────────────┴──────────────┘ (1 row) Time: 0.929 ms SELECT relname FROM pg_class WHERE oid = 16831; ┌────────────────┐ │ relname │ ├────────────────┤ │ test_ref_table │ └────────────────┘ (1 row) Time: 0.785 ms SET citus.enable_ddl_propagation TO off; DROP TABLE test_ref_table ; SELECT logicalrelid, logicalrelid::oid FROM pg_dist_partition; ┌──────────────┬──────────────┐ │ logicalrelid │ logicalrelid │ ├──────────────┼──────────────┤ │ 16831 │ 16831 │ └──────────────┴──────────────┘ (1 row) Time: 0.972 ms SELECT relname FROM pg_class WHERE oid = 16831; ┌─────────┐ │ relname │ ├─────────┤ └─────────┘ (0 rows) Time: 0.908 ms SELECT master_add_node('localhost', 9703); server closed the connection unexpectedly This probably means the server terminated abnormally before or while processing the request. The connection to the server was lost. Attempting reset: Failed. Time: 5.028 ms !> ```	2020-01-29 10:17:53 +01:00
Philip Dubé	1ca108feda	Merge pull request #3429 from citusdata/diff-filter-handle-normalize Update diff-filter to handle lines removed by normalization	2020-01-28 16:23:32 +00:00
Philip Dubé	40ce531850	Update diff-filter to handle lines removed by normalization Add a script to test our diff logic pg_regress_multi updated to rely on $PATH having copy_modified to fix testing with VPATH	2020-01-28 15:39:40 +00:00
Jelte Fennema	b9eee70fa5	Fix random output ordering in CTE inlining test (#3434 )	2020-01-27 16:38:27 +01:00
SaitTalhaNisanci	94bd563ff0	switch back to old memory context in cache local plan for task (#3428 )	2020-01-27 13:00:46 +03:00
Philip Dubé	ca85b2b4ff	Merge pull request #3425 from citusdata/make-test-copying-work-again Replace denormalized test output with normalized at the end of the run	2020-01-24 17:42:54 +00:00
Jelte Fennema	c38446b5f5	Replace denormalized test output with normalized at the end of the run	2020-01-24 11:42:38 +01:00
Önder Kalacı	4519d3411d	Improve the representation of used sub plans (#3411 ) Previously, we identified the usedSubPlans by looking only at the subPlanId. With this commit, we're expanding it to also include information on the location of the subPlan. This is useful to distinguish the cases where the subPlan is used either only in HAVING or in both HAVING and any other part of the query.	2020-01-24 10:47:14 +01:00
Philip Dubé	87e6352d5b	Merge pull request #3422 from citusdata/return-const-when-borrowing CurrentDatabaseName: return const char* as we're borrowing from cache	2020-01-23 23:06:39 +00:00
Philip Dubé	50c5e814c8	CurrentDatabaseName: return const char* as we're borrowing from cache	2020-01-23 22:49:35 +00:00
Philip Dubé	cc1c398d87	Merge pull request #3401 from citusdata/test-extension-owner See what flaky multi_extension test is doing with roles	2020-01-23 22:05:45 +00:00
Philip Dubé	69dde460de	See what flaky multi_extension test is doing with roles	2020-01-23 21:50:40 +00:00
Philip Dubé	5e6a5a3ed0	Merge pull request #3419 from citusdata/less-normal-diffs-when-different Avoid obscuring regression test diffs with normalization	2020-01-23 19:02:48 +00:00
Philip Dubé	5e55a36172	Avoid obscuring regression test diffs with normalization First, diff is updated to not update the files in-place For some reason diff is being called multiple times, so $file1.unmodified becomes normalized on second invocation Secondly, diff-filter updates output to come from the unmodified version Normalization is serving two purposes: - avoid diff noise in regressions - avoid diff noise in commits when expected result is updated The first purpose only wants to reduce the lines which diff registers, whereas the second wants those changes to be committed	2020-01-23 18:51:23 +00:00
Hadi Moshayedi	65dfcaa54b	Merge pull request #3421 from citusdata/allow_noent_in_remove Don't error for ENOENT in CitusRemoveDirectory.	2020-01-23 10:16:26 -08:00
Hadi Moshayedi	1dc19215eb	Don't error for ENOENT in CitusRemoveDirectory. For concurrency reasons, this can happen even if initial stat succeeded.	2020-01-23 10:07:54 -08:00
Hadi Moshayedi	4fbf9a290a	Merge pull request #3414 from citusdata/small_fixes Change DistributedResultFragment::nodeId to uint32.	2020-01-23 10:05:04 -08:00
Hadi Moshayedi	3e1004c232	Change DistributedResultFragment::nodeId to uint32. This is to match the type of WorkerNode::nodeId.	2020-01-23 09:33:15 -08:00
Önder Kalacı	ef7d1ea91d	Locally execute queries that don't need any data access (#3410 ) * Update shardPlacement->nodeId to uint As the source of the shardPlacement->nodeId is always workerNode->nodeId, and that is uint32. We had this hack because of: `0ea4e52df5 (r266421409)` And, that is gone with: `90056f7d3c (diff-c532177d74c72d3f0e7cd10e448ab3c6L1123)` So, we're safe to do it now. * Relax the restrictions on using the local execution Previously, whenever any local execution happens, we disabled further commands to do any remote queries. The basic motivation for doing that is to prevent any accesses in the same transaction block to access the same placements over multiple sessions: one is local session the other is remote session to the same placement. However, the current implementation does not distinguish local accesses being to a placement or not. For example, we could have local accesses that only touch intermediate results. In that case, we should not implement the same restrictions as they become useless. So, this is a pre-requisite for executing the intermediate result only queries locally. * Update the error messages As the underlying implementation has changed, reflect it in the error messages. * Keep track of connections to local node With this commit, we're adding infrastructure to track if any connection to the same local host is done or not. The main motivation for doing this is that we were previously more conservative about not choosing local execution. Simply, we disallowed local execution if any connection to any remote node is done. However, if we want to use local execution for intermediate result only queries, this'd be annoying because we expect all queries to touch remote node before the final query. Note that this approach is still limiting in Citus MX case, but for now we can ignore that. * Formalize the concept of Local Node Also some minor refactoring while creating the dummy placement * Write intermediate results locally when the results are only needed locally Before this commit, Citus used to always broadcast all the intermediate results to remote nodes. However, it is possible to skip pushing the results to remote nodes always. There are two notable cases for doing that: (a) When the query consists of only intermediate results (b) When the query is a zero shard query In both of the above cases, we don't need to access any data on the shards. So, it is a valuable optimization to skip pushing the results to remote nodes. The pattern mentioned in (a) is actually a common pattern that Citus users use in practice. For example, if you have the following query: WITH cte_1 AS (...), cte_2 AS (....), ... cte_n AS (...) SELECT ... FROM cte_1 JOIN cte_2 .... JOIN cte_n ...; The final query could be operating only on intermediate results. With this patch, the intermediate results of the ctes are not unnecessarily pushed to remote nodes. * Add specific regression tests As there are edge cases in Citus MX and with round-robin policy, use the same queries on those cases as well. * Fix failure tests By forcing not to use local execution for intermediate results since all the tests expect the results to be pushed remotely. * Fix flaky test * Apply code-review feedback Mostly style changes * Limit the max value of pg_dist_node_seq to reserve for internal use	2020-01-23 18:28:34 +01:00
Önder Kalacı	a227e34c41	Merge pull request #3413 from citusdata/int_to_uint Update shardPlacement->nodeId from `int` to `uint`	2020-01-23 17:31:16 +01:00
Onder Kalaci	a0dff301c7	Update shardPlacement->nodeId to uint As the source of the shardPlacement->nodeId is always workerNode->nodeId, and that is uint32. We had this hack because of: `0ea4e52df5 (r266421409)` And, that is gone with: `90056f7d3c (diff-c532177d74c72d3f0e7cd10e448ab3c6L1123)` So, we're safe to do it now.	2020-01-23 13:00:24 +01:00
Philip Dubé	d42b0f7c19	Merge pull request #3416 from citusdata/test_improvements Output filenames in ensure_no_intermediate_data_leak	2020-01-22 19:46:39 +00:00
Hadi Moshayedi	be647ad944	Output filenames in ensure_no_intermediate_data_leak This can be helpful in guiding us where to look when this test fails. For example, if the result file has repartitioned_results_ prefix, then we need to look into repartitioned insert/select. Otherwise it is probably a CTE or a subquery.	2020-01-22 11:12:16 -08:00
Jelte Fennema	c62b756f34	Fix new method of locking shard distribution metadata (#3407 ) In #3374 a new way of locking shard distribution metadata was implemented. However, this was only done in the function `LockShardDistributionMetadata` and not in `TryLockShardDistributionMetadata`. This is bad, since it causes these locks to not block each other in some cases. This commit fixes this issue by sharing the code that sets the locktag between the two functions.	2020-01-22 16:44:17 +01:00
Jelte Fennema	cd5259a25a	Do not place new shards with shards in TO_DELETE state (#3408 ) When creating a new distributed table, the shards would colocate with shards in SHARD_STATE_TO_DELETE (shardstate = 4). This means that if that state was caused by a shard move, the new shard would be created on two nodes and it would not get deleted, since its shard state would be 1.	2020-01-22 14:52:12 +01:00
Philip Dubé	77589b2b08	Merge pull request #3400 from citusdata/fix_ref_table Avoid marking reference table shards unhealthy	2020-01-20 19:20:54 +00:00
Onder Kalaci	4be69bbf6f	Fix reference table issue	2020-01-20 18:45:18 +00:00
Halil Ozan Akgül	b2a17f5f67	Merge pull request #3385 from citusdata/schema_grant Grant On Schema Propagation	2020-01-20 15:01:30 +03:00
Halil Ozan Akgul	b40f067d05	Adds propagation for grant on schema commands	2020-01-20 14:51:28 +03:00
Philip Dubé	c436f1b668	Merge pull request #3399 from citusdata/cleanup-during-avoid-marking-reference-shard-unhealthy Code cleanup of adaptive_executor, connection_management, placement_connection	2020-01-17 18:51:02 +00:00
Philip Dubé	fdcc413559	Code cleanup of adaptive_executor, connection_management, placement_connection adaptive_executor: sort includes, use foreach_ptr, remove lies from FinishDistributedExecution docs connection_management: rename msecs, which isn't milliseconds placement_connection: small typos	2020-01-17 17:44:47 +00:00
Önder Kalacı	5f34399e1f	Merge pull request #3388 from citusdata/local_prepared_on_top_lazy_deparse Cache local plans on shards for Citus MX	2020-01-17 17:17:41 +01:00
Onder Kalaci	2f0ef8bc36	Apply feedback 1	2020-01-17 16:06:04 +01:00
Onder Kalaci	fd17e4578e	Improve tests	2020-01-17 16:02:57 +01:00
Onder Kalaci	0bf1e81e33	Cache local plans on BeginScan	2020-01-17 16:02:57 +01:00
Onder Kalaci	08d148d43e	Make TaskAccessesLocalNode external function	2020-01-17 16:02:57 +01:00
Onder Kalaci	5dc454cdad	Exclude localPlannedStatements from copy distributedPlan	2020-01-17 16:02:57 +01:00
Onder Kalaci	ff12df411b	Add LocalPlannedStatement struct	2020-01-17 16:02:57 +01:00
Önder Kalacı	4b5241c7b2	Merge pull request #3397 from citusdata/cte_inline_pg_11 Fix issues for CTE inlining on Postgres 11	2020-01-17 14:39:21 +01:00
Onder Kalaci	016f561e45	Ingest data for cte_inline tests	2020-01-17 12:46:00 +01:00
Onder Kalaci	3833a7e686	Fix issues for CTE inlining on Postgres 11 Comment from code: /* * We had to implement this hack because on Postgres11 and below, the originalQuery * and the query would have significant differences in terms of CTEs where CTEs * would not be inlined on the query (as standard_planner() wouldn't inline CTEs * on PG 11 and below). * * Instead, we prefer to pass the inlined query to the distributed planning. We rely * on the fact that the query includes subqueries, and it'd definitely go through * query pushdown planning. During query pushdown planning, the only relevant query * tree is the original query. */	2020-01-17 11:59:02 +01:00
Jelte Fennema	246435be7e	Lazy query deparsing executable queries (#3350 ) Deparsing and parsing a query can be heavy on CPU. When locally executing the query we don't need to do this in theory most of the time. This PR is the first step in allowing to skip deparsing and parsing the query in these cases, by lazily creating the query string and storing the query in the task. Future commits will make use of this and not deparse and parse the query anymore, but use the one from the task directly.	2020-01-17 11:49:43 +01:00
Hadi Moshayedi	60a2bc5ec2	Merge pull request #3376 from citusdata/insert_select INSERT...SELECT with re-partitioning	2020-01-17 01:36:36 -08:00
Hadi Moshayedi	6cf1c01660	Don't use repartitioned INSERT/SELECT for repartition joins	2020-01-16 23:40:31 -08:00
Hadi Moshayedi	5eeb07124f	Repartitioned INSERT/SELECT: include job id in result id prefix	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	a079278b0c	Repartitioned INSERT/SELECT: Add a GUC to enable/disable it	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	ce5eea4885	INSERT/SELECT: make SELECT column names unique	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	3258d87f3e	Isolation tests for INSERT/SELECT repartition	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	8b27a9a195	More range partitioned tests	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	8635396cea	Repartitioned INSERT/SELECT: Test rollback behaviour	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	43218eebf6	Failure tests for INSERT/SELECT repartition	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	665b33dca1	MX tests for INSERT/SELECT repartition	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	af2349f21f	Repartitioned INSERT/SELECT: Add a prepared statement test	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	97072c9eb1	INSERT/SELECT: show method in EXPLAIN output	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	b143d9588a	Repartitioned INSERT/SELECT: Test GROUP BY	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	fe548b762f	Repartitioned INSERT/SELECT: Test CTEs	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	494cc383cc	Repartitioned INSERT/SELECT: Enable RETURNING	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	4b14347fc3	Tests for DML followed by insert/select repartition	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	44a2aede16	Don't start a coordinated transaction on workers. Otherwise transaction hooks of Citus kick in and might cause unwanted errors.	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	42c3c03b85	Handle extra columns added in ExpandWorkerTargetEntry() in repartitioned INSERT/SELECT	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	89463f9760	Repartitioned INSERT/SELECT: cast columns in SELECT targets	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	d67a384350	Enable repartitioned INSERT/SELECT ON CONFLICT.	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	b4e5f4b10a	Implement INSERT ... SELECT with repartitioning	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	ced876358d	INSERT/SELECT: Refactor out AddInsertSelectCasts	2020-01-16 23:24:52 -08:00
Hadi Moshayedi	d449c1857c	INSERT/SELECT: Use ExecutePlan* instead of ExecuteSelect*	2020-01-16 23:24:52 -08:00
Philip Dubé	a53b844939	Merge pull request #3393 from citusdata/order_by_multirow_insert Add ORDER BY to multi_row_insert.sql	2020-01-17 00:31:32 +00:00
Hadi Moshayedi	e30580e2bd	Add ORDER BY to multi_row_insert.sql	2020-01-16 15:20:39 -08:00
Jelte Fennema	062bda29fb	Fix bug causing errors when planning a query with multiple subq… (#3389 ) Our checks to find subqueries in the rewritten query were not sufficient. When multiple subqueries are present in the original query and some would be replaced by a join, we could miss other subqueries that were not rewritten. This in turn caused us not to go into the subquery planner, causing some queries that were planning fine before to suddenly not plan anymore. This was a regression introduced by #3171.	2020-01-16 19:01:13 +01:00
Jelte Fennema	0ee1eab070	Make tests fail with a useful error message	2020-01-16 18:30:30 +01:00
Jelte Fennema	cb5154cf03	Add more failing tests, of which some have bad error messages	2020-01-16 18:30:30 +01:00
Marco Slot	82f1fffa28	Fix epoll_ctl() error message on connection error	2020-01-16 06:40:57 +01:00
Önder Kalacı	89d5bed88d	Merge pull request #3369 from citusdata/move_fast_path_pruning_to_executor Defer shard pruning for fast-path router queries to execution	2020-01-16 17:35:33 +01:00
Onder Kalaci	dc17c2658e	Defer shard pruning for fast-path router queries to execution This is purely to enable better performance with prepared statements. Before this commit, the fast path queries with prepared statements where the distribution key includes a parameter always went through distributed planning. After this change, we only go through distributed planning on the first 5 executions.	2020-01-16 16:59:36 +01:00
Onder Kalaci	933d666c0d	Do not forget to copy fastPathRouterPlan@DistributedPlan	2020-01-16 16:39:20 +01:00
Halil Ozan Akgül	023f40ca60	Merge pull request #3373 from citusdata/alter_table_schema_propagation Adds alter table schema propagation	2020-01-16 17:18:01 +03:00
Halil Ozan Akgul	c5539d20d9	Adds alter table schema propagation	2020-01-16 17:04:16 +03:00
Nils Dijk	b6e09eb691	Fix: distributed function with table reference in declare (#3384 ) DESCRIPTION: Fixes a problem when adding a new node due to tables referenced in a function's body Fixes #3378 It was reported that `master_add_node` would fail if a distributed function has a table name referenced in its declare section of the body. By default postgres validates the body of a function on creation. This is not a problem in the normal case as tables are replicated to the workers when we distribute functions. However when a new node is added we first create dependencies on the workers before we try to create any tables, and the original tables get created out of band when the metadata gets synced to the new node. This causes the function body validator to raise an error that the table is not on the worker. To mitigate this issue we set `check_function_bodies` to `off` right before we are creating the function. The added test shows this does resolve the issue. (issue can be reproduced on the commit without the fix)	2020-01-16 14:21:54 +01:00
Jelte Fennema	e76281500c	Replace shardId lock with lock on colocation+shardIntervalIndex (#3374 ) This new locking pattern makes sure that some deadlocks that could happen during rebalancing cannot occur anymore.	2020-01-16 13:14:01 +01:00
Jelte Fennema	86876c0473	CTE pushdown via CTE inlining in distributed planning (#3161 ) Before this patch, Citus used to always recursively plan CTEs. In PostgreSQL 12, there is [logic](https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=608b167f9f9c4553c35bb1ec0eab9ddae643989b) for inlining CTEs, which basically converts certain CTEs to subqueries. With this patch, Citus becomes capable of doing the same and no longer needs to recursively plan all the CTEs. Instead, the pushdown-able ones would simply be converted to subquery pushdown. If the inlined CTE query cannot be pushed down, it'd simply follow the recursive planning logic. See an example below: ```SQL -- the query that users pass WITH some_users AS (SELECT users_table.user_id FROM users_table JOIN events_table USING (user_id) WHERE event_type = 5) SELECT count(*) FROM users_table JOIN some_users USING (user_id); -- worker query SELECT count(*) AS COUNT FROM ((users_table_102039 users_table JOIN users_table_102039 users_table_1 ON ((users_table_1.user_id OPERATOR(pg_catalog.=) users_table.user_id))) JOIN events_table_102071 events_table ON ((users_table.user_id OPERATOR(pg_catalog.=) events_table.user_id))) WHERE (events_table.event_type OPERATOR(pg_catalog.=) 5) ``` There are a few things to call out for future reference and to help the reviewer(s) understand the patch more easily: 1) On top of Postgres' restrictions to inline CTEs, Citus enforces one more. This is to prevent regressing on the SQL support. For example, the following cte is OK to inline by Postgres. However, if inlined, Citus cannot plan the whole query, so we prefer to skip inlining that cte: ```SQL -- Citus should not inline this CTE because otherwise it cannot -- plan the query WITH cte_1 AS (SELECT * FROM test_table) SELECT *, row_number() OVER () FROM cte_1; ``` 2) Some exotic queries with multiple colocation groups involved could become repartition joins. 
Basically, after the CTE inlining happens, ShouldRecursivelyPlanNonColocatedSubqueries() fails to detect that the query is a non-colocated subquery. We should improve that check to fix it. But, since we fall back to planning again, the query is successfully executed by Citus. ```SQL SET citus.shard_count TO 4; CREATE TABLE colocation_1 (key int, value int); SELECT create_distributed_table('colocation_1', 'key'); SET citus.shard_count TO 8; CREATE TABLE colocation_2 (key int, value int); SELECT create_distributed_table('colocation_2', 'key'); -- which used to work because the cte was recursively planned -- now the cte becomes a repartition join since --- (a) the cte is replaced to a subquery --- (b) since the subquery is very simple, postgres pulled it to become --- a simple join WITH cte AS (SELECT * FROM colocation_1) SELECT count(*) FROM cte JOIN colocation_2 USING (key); ... message: the query contains a join that requires repartitioning detail: hint: Set citus.enable_repartition_joins to on to enable repartitioning ... ┌───────┐ │ count │ ├───────┤ │ 0 │ └───────┘ (1 row) ``` 3) We decided to implement inlining CTEs even after standard planner. In Postgres 12+, the restriction information in CTEs is generated because the CTEs are actually treated as subqueries via Postgres' inline capabilities. In Postgres 11-, the restriction information is not generated for CTEs. Because of that, some queries work differently on pg 11 vs pg 12. To see such queries, see cte_inline.sql file, where the file has two output files. 4) As a side-effect of (2), we're now able to inline CTEs for INSERT .. SELECT queries as well. Postgres prevents it, but I cannot see a reason to prevent it. With this capability, some of the INSERT ... SELECT queries where the cte is in the SELECT query could become pushdownable. See an example: ```SQL INSERT INTO test_table WITH fist_table_cte AS (SELECT * FROM test_table) SELECT key, value FROM fist_table_cte; ``` 5) A new class of queries can now be supported. 
Previously, if a CTE was used in the outer part of an outer join, Citus would complain about that. So, the following query: ```SQL WITH cte AS ( SELECT * FROM users_table WHERE user_id = 1 ORDER BY value_1 ) SELECT cte.user_id, cte.time, events_table.event_type FROM cte LEFT JOIN events_table ON cte.user_id = events_table.user_id ORDER BY 1,2,3 LIMIT 5; ERROR: cannot pushdown the subquery DETAIL: Complex subqueries and CTEs cannot be in the outer part of the outer join ``` Becomes ```SQL -- cte LEFT JOIN distributed_table should error out WITH cte AS ( SELECT * FROM users_table WHERE user_id = 1 ORDER BY value_1 ) SELECT cte.user_id, cte.time, events_table.event_type FROM cte LEFT JOIN events_table ON cte.user_id = events_table.user_id ORDER BY 1,2,3 LIMIT 5; user_id \| time \| event_type ---------+---------------------------------+------------ 1 \| Wed Nov 22 22:51:43.132261 2017 \| 0 1 \| Wed Nov 22 22:51:43.132261 2017 \| 0 1 \| Wed Nov 22 22:51:43.132261 2017 \| 1 1 \| Wed Nov 22 22:51:43.132261 2017 \| 1 1 \| Wed Nov 22 22:51:43.132261 2017 \| 2 (5 rows) ```	2020-01-16 12:43:48 +01:00
Jelte Fennema	86343bcc8f	Re-add test that broke with GUC workaround	2020-01-16 12:34:50 +01:00
Jelte Fennema	6b9b633695	Add more tests for prepared statements	2020-01-16 12:28:15 +01:00
Jelte Fennema	43a3fdd12f	Fix comment	2020-01-16 12:28:15 +01:00
Jelte Fennema	fe3827e499	Add tests for [NOT] MATERIALIZED	2020-01-16 12:28:15 +01:00
Onder Kalaci	326dfab44a	Fix a query which triggers an existing bug, see https://github.com/citusdata/citus/issues/3189#issuecomment-571497051	2020-01-16 12:28:15 +01:00
Onder Kalaci	81d8178625	Note that we'll drop the GUC after PG 11 support dropped	2020-01-16 12:28:15 +01:00
Onder Kalaci	c653923960	Update regression tests 6 Local execution and CTE pushdown	2020-01-16 12:28:15 +01:00
Onder Kalaci	3818be45a6	Update regression tests-5 Failure tests that rely on intermediate results	2020-01-16 12:28:15 +01:00
Onder Kalaci	1e85938b46	Update regression tests-4 Update the MX tests. Similar to the previous commits, prevent CTE inlining in some cases to prevent divergent test outputs.	2020-01-16 12:28:15 +01:00
Onder Kalaci	fc07bd7c5b	Update regression tests-3 Update the regression tests which only change in PG 12.	2020-01-16 12:28:15 +01:00
Onder Kalaci	64560b07be	Update regression tests-2 In this commit, we're introducing a way to prevent CTE inlining via a GUC. The GUC is used in all the tests where PG 11 and PG 12 tests would diverge otherwise. Note that, in PG 12, the restriction information for CTEs is generated. It means that for some queries involving CTEs, Citus planner (router planner/ pushdown planner) may behave differently. So, via the GUC, we prevent tests from diverging on PG 11 vs PG 12. When we drop PG 11 support, we should get rid of the GUC, and mark relevant ctes as MATERIALIZED, which does the same thing.	2020-01-16 12:28:15 +01:00
Onder Kalaci	5cb203b276	Update regression tests-1 This set of tests has changed in both PG 11 and PG 12. The changes are only about CTE inlining kicking in on both versions, and yielding the exact same distributed planning.	2020-01-16 12:28:15 +01:00
Onder Kalaci	421bf68516	Add the specific regression tests With this commit, we're adding the specific tests for CTE inlining. The test has a different output file for pg 11, because as mentioned in the previous commits, PG 12 generates more restriction information for CTEs.	2020-01-16 12:28:15 +01:00
Onder Kalaci	efb1577d06	Handle CTE aliases accurately Basically, make sure to update the column name with the CTEs alias if we need to do so.	2020-01-16 12:28:15 +01:00
Onder Kalaci	05d600dd8f	Call CTE inlining in Citus planner The idea is simple: Inline CTEs (if any), try distributed planning. If the planning yields a successful distributed plan, simply return it. If the planning fails, fall back to distributed planning on the query tree where CTEs are not inlined. In that case, if the planning failed just because of the CTE inlining, via recursive planning, the same query would yield a successful plan. A very basic set of examples: WITH cte_1 AS (SELECT * FROM test_table) SELECT *, row_number() OVER () FROM cte_1; or WITH a AS (SELECT * FROM test_table), b AS (SELECT * FROM test_table) SELECT * FROM a JOIN b ON (a.value> b.value);	2020-01-16 12:28:15 +01:00
Onder Kalaci	01a5800ee8	Add Citus' CTE inlining functions With this commit we add the necessary Citus function to inline CTEs in a queryTree. You might ask, why do we need to inline CTEs if Postgres is already going to do it? A few reasons behind this decision: - One technical note here is that Citus does the recursive CTE planning by checking the originalQuery which is the query that has not gone through the standard_planner(). CTEs in Citus are super powerful. They are practically key for full SQL coverage for multi-shard queries. With CTEs, you can always reduce any multi-shard query into a router query via recursive planning (thus full SQL coverage). We cannot let CTE inlining break that. The main idea is Citus should be able to retry planning if anything goes wrong after CTE inlining. So, by taking ownership of CTE inlining on the originalQuery, Citus can fall back to recursive planning of CTEs if the planning with the inlined query fails. It could have been a lot harder if we had relied on standard_planner() to have the inlined CTEs on the original query. - We want to have this feature in PostgreSQL 11 as well, but Postgres only inlines in version 12	2020-01-16 12:28:15 +01:00
Onder Kalaci	1856ab6cdd	Copy & paste code from Postgres source All the code in this commit is direct copy & paste from Postgres source code. We can classify the copy&paste code into two: - Copy paste from CTE inline patch from postgres (https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=608b167f9f9c4553c35bb1ec0eab9ddae643989b) These include the functions inline_cte(), inline_cte_walker(), contain_dml(), contain_dml_walker(). It also includes the code in function PostgreSQLCTEInlineCondition(). We prefer to extract that code into a separate function, because (a) we'll re-use the logic later (b) we added one check for PG_11 Finally, the struct "inline_cte_walker_context" is also copied from the same Postgres commit. - Copy paste from the other parts of the Postgres code In order to implement CTE inlining in Postgres 12, the hackers modified the query_tree_walker()/range_table_walker() with commit `18c0da88a5`. Since Citus needs to support the same logic in PG 11, we copy & pasted those functions (and related flags) with the names pg_12_query_tree_walker() and pg_12_range_table_walker()	2020-01-16 12:28:15 +01:00
Philip Dubé	1cfebf9f41	Merge pull request #3387 from citusdata/multi_row_insert_bug Multi row insert bug	2020-01-16 05:48:56 +00:00
Philip Dubé	4d9a733c2f	Fix inserting multiple values with row expression partition column causing the insert to be ignored Raise an error instead of silently inserting nothing if we hit this condition in the future	2020-01-15 21:10:50 +00:00
Philip Dubé	f6d4df6da9	Merge pull request #3382 from citusdata/fix-error-on-repeated-placement-done PlacementExecutionDone: We may mark placements as failed multiple times	2020-01-15 18:52:58 +00:00
Philip Dubé	4989c9a15c	PlacementExecutionDone: We may mark placements as failed multiple times, but should only act the first time.	2020-01-15 18:20:01 +00:00
Marco Slot	fd5935d798	Always use NOTICE in log_remote_commands and avoid redaction wh… (#3339 ) Always use NOTICE in log_remote_commands and avoid redaction when possible	2020-01-14 11:36:56 +01:00
Marco Slot	f0d6ea1afb	Merge pull request #3261 from citusdata/remove_copy_from_worker Remove copy from worker for append-partitioned table	2020-01-14 09:21:19 +01:00
Marco Slot	90056f7d3c	Remove copy from worker for append-partitioned table	2020-01-13 23:03:40 -08:00
Philip Dubé	5ec644c691	Merge pull request #3381 from citusdata/mitm-threadsafe mitmscripts/fluent.py: use atomic increment	2020-01-14 06:32:31 +00:00
Philip Dubé	62524d152d	mitmscripts/fluent.py: use atomic increment	2020-01-13 20:35:08 +00:00
Marco Slot	f1a0582973	Make ApplyLogRedaction a macro and redefine ereport	2020-01-13 18:24:36 +01:00
Marco Slot	06709ee108	Always use NOTICE in log_remote_commands and avoid redaction when possible	2020-01-13 18:24:36 +01:00
Philip Dubé	b6975c7dcf	Merge pull request #3367 from citusdata/propagate-routine Propagate DROP ROUTINE, ALTER ROUTINE	2020-01-13 15:42:50 +00:00
Philip Dubé	ccabf19090	Propagate DROP ROUTINE, ALTER ROUTINE In two places I've made code more straightforward by using ROUTINE in our own codegen. Two changes which may seem extraneous: AppendFunctionName was updated to not use pg_get_function_identity_arguments. This is because that function includes ORDER BY when printing an aggregate like my_rank. While ALTER AGGREGATE my_rank(x "any" ORDER BY y "any") is accepted by postgres, ALTER ROUTINE my_rank(x "any" ORDER BY y "any") is not. Tests were updated to use macaddr over integer. Using integer is flaky, our logic could sometimes end up on tables like users_table. I originally wanted to use money, but money isn't hashable.	2020-01-13 15:37:46 +00:00
Philip Dubé	8b4429e2dd	Merge pull request #3375 from citusdata/rename-relayfilestate Rename RelayFileState to ShardState	2020-01-12 06:18:36 +00:00
Philip Dubé	4b5d6c3ebe	Rename RelayFileState to ShardState Replace FILE_ prefix with SHARD_STATE_	2020-01-12 05:57:53 +00:00
Philip Dubé	f1a4b97450	Merge pull request #3372 from citusdata/dont-palloc-walkercontext Replace ARRAY_OUT_FUNC_ID with postgres's F_ARRAY_OUT	2020-01-10 17:06:37 +00:00
Philip Dubé	e71386af33	Replace ARRAY_OUT_FUNC_ID with postgres's F_ARRAY_OUT Also use stack allocation for walkerContext in multi_logical_optimizer	2020-01-10 16:54:00 +00:00
Hadi Moshayedi	c7efbf9711	Merge pull request #3355 from citusdata/redistribute_results Redistribute task list results to correspond to a target relation's distribution	2020-01-09 23:52:28 -08:00
Hadi Moshayedi	40ba2cdd6e	Test RedistributeTaskListResult	2020-01-09 23:47:25 -08:00
Hadi Moshayedi	527d7d41c1	Implement RedistributeTaskListResult	2020-01-09 23:47:25 -08:00
Philip Dubé	d855faf2b2	Merge pull request #3371 from citusdata/fix-task-tracker-row-gather-subquery Fix row-gather for subqueries being handled by task-tracker	2020-01-10 02:03:38 +00:00
Philip Dubé	281aacce9b	Fix row-gather for subqueries being handled by task-tracker. task-tracker has specific logic for MultiPartition when GROUP BY is missing. We were ending up in this code path because row-gather removes GROUP BY	2020-01-10 01:51:37 +00:00
Hadi Moshayedi	e185b54cbc	Merge pull request #3363 from citusdata/redistribute_failure PartitionTasklistResults: Use different queries per placement	2020-01-09 11:20:44 -08:00
Hadi Moshayedi	e1e383cb59	Don't override xact id assigned by coordinator on workers. We might need to send commands from workers to other workers. In these cases we shouldn't override the xact id assigned by coordinator, or otherwise we won't read the consistent set of result files across the nodes.	2020-01-09 11:09:11 -08:00
Hadi Moshayedi	bb65669186	Failure tests for PartitionTasklistResults	2020-01-09 10:55:58 -08:00
Hadi Moshayedi	c7c460e843	PartitionTasklistResults: Use different queries per placement We need to know which placement succeeded in executing the worker_partition_query_result() call. Otherwise we wouldn't know which node to fetch from. This change allows that by introducing Task::perPlacementQueryStrings.	2020-01-09 10:55:58 -08:00
Hadi Moshayedi	08b5145765	Merge pull request #3353 from citusdata/partition_task_list_results Partitioned task list results. Implements PartitionTasklistResults(), which partitions results of given SELECT tasks based on shard ranges of a given relation.	2020-01-09 10:53:11 -08:00
Hadi Moshayedi	f38d0e5b3f	Partitioned task list results.	2020-01-09 10:32:58 -08:00
Philip Dubé	893b4538c2	Merge pull request #3364 from citusdata/fp-deparse Refactor deparsing/planning to use DistributeObjectOps struct	2020-01-09 18:29:32 +00:00
Philip Dubé	73c06fae3b	Introduce GetDistributeObjectOps to organize dispatch of logic dependent on node/object type	2020-01-09 18:24:29 +00:00
Önder Kalacı	22cc5b1240	Merge pull request #3366 from citusdata/normalize-plan-numbers-insert-select Normalize plan numbers in insert_select output	2020-01-07 10:02:01 +00:00
Jelte Fennema	9724e25065	Normalize plan numbers in insert_select output	2020-01-07 10:34:08 +01:00
Philip Dubé	0e227e391a	Merge pull request #3324 from citusdata/gather-row-aggregation Pull up intermediate rows to coordinator for aggregates we cannot push down	2020-01-07 01:26:39 +00:00
Philip Dubé	bf7d86a3e8	Fix typo: aggragate -> aggregate	2020-01-07 01:16:09 +00:00
Philip Dubé	863bf49507	Implement pulling up rows to coordinator when aggregates cannot be pushed down. Enabled by default	2020-01-07 01:16:04 +00:00
Jelte Fennema	16b4140dc8	Use fewer CPU cycles on fast-path planning (#3332 ) Fast-path queries were introduced with #2606. The basic idea is that for very simple queries like SELECT count(*) FROM table WHERE dist_key = X, we can skip some parts of the distributed planning. The most notable thing to skip is standard_planner(), which was already done in #2606. With this commit, we do some further optimizations. First, we used to call the function which decides whether the query is fast path twice, which can be reduced to one. Second, we used to do shard pruning for every query, now we'll optimize it for some cases. Finally, since the definition of fast-path queries is very strict, we can skip some query traversals.	2020-01-06 14:54:11 +01:00
Jelte Fennema	5b0baea72c	Refactor distributed_planner for better understandability	2020-01-06 14:23:38 +01:00
Onder Kalaci	5a1e752726	Apply feedback - add fastPath field to plan	2020-01-06 12:42:43 +01:00
Onder Kalaci	13a9b55695	Skip expensive checks when fast-path query The definition of fast-path query is very strict. So, we don't need to do some extra checks.	2020-01-06 12:42:43 +01:00
Onder Kalaci	7f3ab7892d	Skip shard pruning when possible We're already traversing the queryTree and finding the distribution key value, so pass it to the later stages of the planning.	2020-01-06 12:42:43 +01:00
Onder Kalaci	ca293116fa	Reduce calls to FastPathRouterQuery() Before this commit, we called it twice during planning. Instead, we save the information and pass it.	2020-01-06 12:42:43 +01:00
Önder Kalacı	270571c106	Merge pull request #3333 from citusdata/fix_wrong_data Make sure to update shard states of partitions on failures	2020-01-06 11:37:40 +00:00
Onder Kalaci	c8f14c9f6c	Make sure to update shard states of partitions on failures Fixes #3331 In #2389, we've implemented support for partitioned tables with rep > 1. The implementation is limiting the use of modification queries on the partitions. In fact, we error out when any partition is modified via EnsurePartitionTableNotReplicated(). However, we seem to have forgotten an important case, where the parent table's partition is marked as INVALID. In that case, at least one of the partitions becomes INVALID. However, we do not mark partitions as INVALID ever. If the user queries the partition table directly, Citus could happily send the query to INVALID placements -- which are not marked as INVALID. This PR fixes it by marking the placements of the partitions as INVALID as well. The shard placement repair logic already re-creates all the partitions, so should be fine in that front.	2020-01-06 12:26:08 +01:00
Jelte Fennema	3c770516eb	Commenting out flaky intermediate data leak test (#3359 ) check-multi apparently has an intermediate data leak, so commenting out that test for now. This was introduced by #3349 Examples: - https://app.circleci.com/jobs/github/citusdata/citus/74675 - https://app.circleci.com/jobs/github/citusdata/citus/74683 - https://app.circleci.com/jobs/github/citusdata/citus/74763	2020-01-06 11:55:01 +01:00
Jelte Fennema	d29ce8965c	Actually check that test output normalization is applied in CI (#3358 ) Fixup of an issue with #3336 that caused CI not to check correctly that normalized test output was committed.	2020-01-06 10:37:34 +01:00
Jelte Fennema	de75243000	Commit normalized test output for better diffs (#3336 ) We have a `normalize.sed` script that before diffing test output normalizes the expected file and the actual file. This makes sure that we don't have random test failures and that we don't have to update our test output all the time. This PR takes that one step further and actually commits the normalized files. That way whenever we DO have to update our test output files only relevant changes will be visible in the diff. The other change that this PR makes is that it strips trailing whitespace during normalization. This works well with our editorconfig settings. As an added benefit of committing these files it's also much more visible what new normalization rules will result in. The original changes that were proposed here were a bit too wide and were changing output that was not intended to be changed: https://github.com/citusdata/citus/pull/3161#discussion_r360928922 Because these changes are now in the diff of the commit they are much easier to spot. Finally the Plan number normalization rules were also added to this PR, because they are useful even without the CTE inlining PR.	2020-01-06 09:56:31 +01:00
Jelte Fennema	4a20ba3bfc	Merge remote-tracking branch 'origin/master' into normalized-test-output	2020-01-06 09:36:04 +01:00
Jelte Fennema	2e4e1c030f	Make sure the expected .out file always exists when running diff on it	2020-01-06 09:32:03 +01:00
Jelte Fennema	16bcf15e16	Remove unused normalization rule	2020-01-06 09:32:03 +01:00
Jelte Fennema	634ea80009	Add a basic testing README including normalization explanation	2020-01-06 09:32:03 +01:00
Jelte Fennema	7c3e8e150e	Normalize tests: s/Subplan [0-9]+\_/Subplan XXX\_/g	2020-01-06 09:32:03 +01:00
Jelte Fennema	acd12a6de5	Normalize tests: s/read_intermediate_result\('[0-9]+_/read_intermediate_result('XXX_/g	2020-01-06 09:32:03 +01:00
Jelte Fennema	21dbd4e55d	Normalize tests: s/generating subplan [0-9]+\_/generating subplan XXX\_/g	2020-01-06 09:32:03 +01:00
Jelte Fennema	58723dd8b0	Normalize tests: s/DEBUG: Plan [0-9]+/DEBUG: Plan XXX/g	2020-01-06 09:32:03 +01:00
Jelte Fennema	34c5532e9c	Add commented out rules to normalize Plan numbers	2020-01-06 09:32:03 +01:00
Jelte Fennema	38ac28b4b8	Normalize tests: intermediate_results	2020-01-06 09:32:03 +01:00
Jelte Fennema	0c6983a80e	Normalize tests: pg12 changes	2020-01-06 09:32:03 +01:00
Jelte Fennema	7730bd449c	Normalize tests: Remove trailing whitespace	2020-01-06 09:32:03 +01:00
Jelte Fennema	6353c9907f	Normalize tests: Line info varies between versions	2020-01-06 09:32:03 +01:00
Jelte Fennema	bf2c203908	Normalize tests: isolation_ref2ref_foreign_keys	2020-01-06 09:32:03 +01:00
Jelte Fennema	7b2c769a5d	Normalize tests: normalize file names for partitioned files	2020-01-06 09:32:03 +01:00
Jelte Fennema	98bab9caab	Normalize tests: ignore WAL warnings	2020-01-06 09:32:03 +01:00
Jelte Fennema	5c0f955ab9	Normalize tests: ignore could not consume warnings	2020-01-06 09:32:03 +01:00
Jelte Fennema	dc3cff991f	Normalize tests: normalize failed task ids	2020-01-06 09:32:03 +01:00
Jelte Fennema	d0ade90cd0	Normalize tests: pkey constraints for multi_insert_select	2020-01-06 09:32:03 +01:00
Jelte Fennema	704e1d2bc8	Normalize tests: shard table names for multi_name_lengths	2020-01-06 09:32:03 +01:00
Jelte Fennema	1c4ea6836b	Normalize tests: shard table names for multi_insert_select_conflict	2020-01-06 09:32:03 +01:00
Jelte Fennema	27997c054e	Normalize tests: shard table names for foreign_key_restriction_enforcement	2020-01-06 09:32:03 +01:00
Jelte Fennema	432b5baac7	Normalize tests: shard table names for custom_aggregate_support	2020-01-06 09:32:03 +01:00
Jelte Fennema	0c23caeb75	Normalize tests: shard table names for multi_subtransactions	2020-01-06 09:32:03 +01:00
Jelte Fennema	883ee9121f	Normalize tests: shard table names in foreign_key_to_reference_table	2020-01-06 09:32:03 +01:00
Jelte Fennema	7f3de68b0d	Normalize tests: header separator length	2020-01-06 09:32:03 +01:00
Philip Dubé	51a7e661f9	Merge pull request #3349 from citusdata/ensure-no-intermediate-data-leak-at-end End regression tests with ensure_no_intermediate_data_leak	2020-01-03 19:20:28 +00:00
Philip Dubé	566246ecd4	End regression tests with ensure_no_intermediate_data_leak Also update tests to clean up jobs when they're directly testing job udfs	2020-01-03 18:59:02 +00:00
Önder Kalacı	0c70a5470e	Allow RETURNING in fast-path queries (#3352 ) * Allow RETURNING in fast-path queries Because there is no specific reason for that.	2020-01-03 13:42:50 +00:00
Önder Kalacı	a174eb4f7b	Do not go through standard_planner() for INSERTs (#3348 ) That seems unnecessary. We already have the notion of FastPath queries, simply add it there.	2020-01-03 12:15:22 +00:00
Jelte Fennema	75a9c25acd	Normalize tests: s/node group [12] (but\|does)/node group \1/	2020-01-03 11:46:01 +01:00
Jelte Fennema	96434e898f	Normalize tests: s/assigned task [0-9]+ to node/assigned task to node/	2020-01-03 11:45:22 +01:00
Jelte Fennema	7b833466ba	Normalize tests: s/shard [0-9]+/shard xxxxx/g	2020-01-03 11:44:30 +01:00
Jelte Fennema	8b5fe8aa17	Normalize tests: s/placement [0-9]+/placement xxxxx/g	2020-01-03 11:42:48 +01:00
Jelte Fennema	f21f00544e	Normalize tests: s/ port=[0-9]+ / port=xxxxx /g	2020-01-03 11:42:09 +01:00
Jelte Fennema	8c5c0dd74c	Normalize tests: s/localhost:[0-9]+/localhost:xxxxx/g	2020-01-03 11:40:50 +01:00
Jelte Fennema	a1ff2117bf	Ignore .modified and .unmodified files in git	2020-01-03 11:38:12 +01:00
Jelte Fennema	7630029a7f	Keep an .unmodified file for debugging	2020-01-03 11:30:08 +01:00
Jelte Fennema	9a819d401a	Ensure that only normalized test output is committed	2020-01-03 11:30:08 +01:00
Jelte Fennema	8fae3ed800	Remove trailing whitespace during normalization in test output	2020-01-03 11:30:08 +01:00
Jelte Fennema	b815425d2c	Make diff normalize our test output files in place	2020-01-03 11:30:08 +01:00
Philip Dubé	3cfb9b64bf	Merge pull request #3351 from citusdata/uncomment-working-tests Uncomment local execution EXPLAIN ANALYZE tests	2020-01-02 19:01:42 +00:00
Jelte Fennema	5fee9d04c9	Uncomment local execution EXPLAIN ANALYZE tests	2020-01-02 18:56:32 +00:00
Marco Slot	5a9d31f136	Fix union (all) pushdown issue (#3306 ) Fix union (all) pushdown issue	2020-01-02 13:56:06 +01:00
Marco Slot	ba39d72fe1	Fix incorrect union all pushdown issue	2020-01-01 09:03:50 +01:00
Hanefi Onaldi	7a909fc807	Add changelog entry for 9.1.2	2019-12-30 11:33:10 +03:00
Jelte Fennema	0cd5d6ac49	Support any inner join on a reference table (#3323 ) This PR works by doing two things: 1. Expand the notion of a join condition to any expression that contains columns from two or more tables. 2. Support cartesian products on reference tables. Cartesian products on reference tables are considered in the join order planner as the least desirable join (except for normal cartesian products). That way they will be done at the end of the join. This is preferable since the cartesian product multiplies the rows. By doing it at the end at least these multiplications of rows will not be sent over the network when doing repartitioning, only when sending to the master. Fixes #3079 Fixes #3198	2019-12-27 15:14:50 +01:00
Jelte Fennema	cf88bdf833	Add tests for complex joins on reference tables	2019-12-27 15:05:51 +01:00
Jelte Fennema	3a042e4611	Allow cartesian products on reference tables	2019-12-27 15:05:51 +01:00
Jelte Fennema	61e2501645	Make any expression with two or more tables a join expression	2019-12-27 15:05:51 +01:00
Jelte Fennema	4233cd0d9d	Allow non equi joins on reference tables	2019-12-27 15:05:51 +01:00
Jelte Fennema	7642928be1	Makefile fix DESTDIR together with cleanup (#3342 ) This should fix this build issue: redmine.postgresql.org/issues/5032	2019-12-27 10:34:57 +01:00
Philip Dubé	e91755f73c	Merge pull request #3307 from citusdata/group_by_speedup Do not repeat GROUP BY distribution_column on coordinator	2019-12-25 01:39:56 +00:00
Marco Slot	b21b6905ae	Do not repeat GROUP BY distribution_column on coordinator Allow arbitrary aggregates to be pushed down in these scenarios	2019-12-25 01:33:41 +00:00
Philip Dubé	11368451f4	Merge pull request #3344 from citusdata/fix-extension-already-exists-test Fix tests when hll/topn installed	2019-12-24 21:45:36 +00:00
Philip Dubé	a6ffcab59d	CREATE EXTENSION is propagated now	2019-12-24 21:04:37 +00:00
Marco Slot	ee71b24538	Fix inconsistent shard metadata issue (#3334 ) Fix inconsistent shard metadata issue	2019-12-24 13:28:17 +01:00
Hadi Moshayedi	10605f8a26	Merge pull request #3329 from citusdata/predistribute Partitioned intermediate results	2019-12-24 04:00:43 -08:00
Hadi Moshayedi	d7aea7fa10	Implement partitioned intermediate results.	2019-12-24 03:53:39 -08:00
Marco Slot	1aef63abfb	Fix error in distributed queries when shards are on the coordin… (#3308 ) Fix error in distributed queries when shards are on the coordinator	2019-12-24 12:14:55 +01:00
Marco Slot	a2ddfecd86	Fix inconsistent shard metadata issue	2019-12-24 08:01:32 +01:00
Marco Slot	b37ef0e394	Fix error in distributed queries when shards are on the coordinator	2019-12-24 06:36:43 +01:00
Philip Dubé	2349f838a1	Merge pull request #3316 from citusdata/fix-empty-agg-combine Fix handling of empty intermediate results when distributing custom aggregates	2019-12-23 17:38:52 +00:00
Philip Dubé	e9bbdb8f31	Fix handling of empty intermediate results when distributing custom aggregates	2019-12-23 17:27:52 +00:00
Hadi Moshayedi	bb6ba89708	Merge pull request #3327 from citusdata/fix_reindent Fix reindent version inconsistencies.	2019-12-20 08:38:15 -08:00
Philip Dubé	f007b7f91d	Also fix reindent inconsistencies with fake_fdw.c	2019-12-20 08:27:47 +00:00
Hadi Moshayedi	08eb0ade31	Fix reindent version inconsistencies. Different versions of reindent tool reformatted citus_custom_scan.c and citus_copyfuncs.c differently. So some developers spent some extra attention not to commit these two files after reindent. This PR tries to address this.	2019-12-19 23:10:34 -08:00
Jelte Fennema	b655c02352	Add the necessary changes for rebalance strategies on enterprise (#3325 ) This commit adds the SQL and C changes necessary to support custom rebalance strategies in the Enterprise version of Citus.	2019-12-19 15:23:08 +01:00
Hadi Moshayedi	c9ceff7d78	Merge pull request #3318 from citusdata/fetch_intermediate_results Implement fetch_intermediate_results	2019-12-18 10:51:56 -08:00
Hadi Moshayedi	ef487e0792	Implement fetch_intermediate_results	2019-12-18 10:46:35 -08:00
Onur TIRTIR	eb3c1b4eb4	Add changelog entry for 9.1.1 (#3321 )	2019-12-18 15:32:48 +03:00
Hadi Moshayedi	e96201c609	Merge pull request #3304 from citusdata/read_intermediate_results Implement read_intermediate_results	2019-12-17 14:08:39 -08:00
Hadi Moshayedi	249508d267	Estimate cost of read_intermediate_results()	2019-12-17 13:51:51 -08:00
Hadi Moshayedi	113bd1e5f1	Implement read_intermediate_results	2019-12-17 13:51:16 -08:00
SaitTalhaNisanci	7ff4ce2169	Add adaptive executor support for repartition joins (#3169 ) * WIP * wip * add basic logic to run a single job with repartitioning joins with adaptive executor * fix some warnings and return in ExecuteDependedTasks if there is none * Add the logic to run dependent jobs in adaptive executor The execution of dependent tasks logic is changed. With the current logic: - All tasks are created from the top level task list. - At one iteration: - CurTasks whose dependencies are executed are found. - CurTasks are executed in parallel with adaptive executor main logic. - The iteration is repeated until all tasks are completed. * Separate adaptive executor repartitioning logic * Remove duplicate parts * cleanup directories and schemas * add basic repartition tests for adaptive executor * Use the first placement to fetch data In task tracker, when there are replicas, we try to fetch from a replica for which a map task has succeeded. TaskExecution is used for this, however TaskExecution is not used in adaptive executor. So we cannot use the same thing as task tracker. Since adaptive executor fails when a map task fails (there is no retry logic yet), we know that if we try to execute a fetch task, all of its map tasks already succeeded, so we can just use the first one to fetch from. * fix clean directories logic * do not change the search path while creating a udf * Enable repartition joins with adaptive executor with only enable_repartition_joins guc * Add comments to adaptive_executor_repartition * don't run adaptive executor repartition test in parallel with other tests * execute cleanup only in the top level execution * do cleanup only in the top level execution * not begin a transaction if repartition query is used * use new connections for repartition specific queries New connections are opened to send repartition specific queries. The opened connections will be closed at the FinishDistributedExecution. While sending repartition queries no transaction is begun so that we can see all changes. * error if a modification was done prior to repartition execution * not start a transaction if a repartition query and sql task, and clean temporary files and schemas at each subplan level * fix cleanup logic * update tests * add missing function comments * add test for transaction with DDL before repartition query * do not close repartition connections in adaptive executor * rollback instead of commit in repartition join test * use close connection instead of shutdown connection * remove unnecessary connection list, ensure schema owner before removing directory * rename ExecuteTaskListRepartition * put fetch query string in planner not executor as we currently support only replication factor = 1 with adaptive executor and repartition query and we know the query string in the planner phase in that case * split adaptive executor repartition to DAG execution logic and repartition logic * apply review items * apply review items * use an enum for remote transaction state and fix cleanup for repartition * add outside transaction flag to find connections that are unclaimed instead of always opening a new transaction * fix style * wip * rename removejobdir to partition cleanup * do not close connections at the end of repartition queries * do repartition cleanup in pg catch * apply review items * decide whether to use transaction or not at execution creation * rename isOutsideTransaction and add missing comment * not error in pg catch while doing cleanup * use replication factor of the creation time, not current time to decide if task tracker should be chosen * apply review items * apply review items * apply review item	2019-12-17 19:09:45 +03:00
Marco Slot	8cea662f17	Use any available non-data connection for intermediate results (#3301 ) Use any available non-data connection for intermediate results	2019-12-17 12:22:59 +01:00
Onur TIRTIR	8092529a2c	Split propagate extension test and add alternative output (#3314 ) * Split extension name tests from propagate_extension_commands.sql * Add alternative output for escape_extension_name.sql	2019-12-17 13:49:16 +03:00
Marco Slot	2f568ad5a5	Forbid using connections that sent intermediate results for data access and vice versa	2019-12-17 11:49:13 +01:00
Marco Slot	5aec71855a	Clean up transaction block usage logic in adaptive executor (#3288 ) Clean up transaction block usage logic in adaptive executor	2019-12-17 11:35:57 +01:00
Marco Slot	f4031dd477	Clean up transaction block usage logic in adaptive executor	2019-12-17 10:48:19 +01:00
Nils Dijk	bfc3d2eb90	make sure to correctly decrement ExecutorLevel (#3311 ) DESCRIPTION: Fix counter that keeps track of internal depth in executor While reviewing #3302 I ran into the `ExecutorLevel` variable which used a variable to keep the original value to restore on successful exit. I haven't explored the full space and if it is possible to get into an inconsistent state. However using `PG_TRY`/`PG_CATCH` seems generally more correct. Given very bad things will happen if this level is not reset, I kept the failsafe of setting the variable back to 0 on the `XactCallback` but I did add an assert to treat it as a developer bug.	2019-12-16 20:50:13 +01:00
Marco Slot	f90bbc64f6	Fix a crash when calling a distributed function from PL/pgSQL (#3302 ) Fix a crash when calling a distributed function from PL/pgSQL	2019-12-16 19:03:45 +01:00
SaitTalhaNisanci	97bfd0bba0	add circleci build status (#3310 ) (#3309 )	2019-12-16 19:25:32 +03:00
Marco Slot	5f656e22db	Fix issue in IsMultiStatementTransaction detection	2019-12-16 17:01:43 +01:00
SaitTalhaNisanci	e3db433ec1	add circleci build status (#3310 )	2019-12-16 17:46:36 +03:00
SaitTalhaNisanci	2829c601dd	replace Begin words in coordinated transactions with use (#3293 )	2019-12-16 10:40:31 +03:00
SaitTalhaNisanci	a2f2107e6a	refactor MapTaskList in multi physical planner (#3297 )	2019-12-13 22:41:49 +03:00
Marco Slot	3b6b3f8c48	Fix crash in IN (.., NULL) queries (#3299 ) Fix crash in IN (.., NULL) queries	2019-12-13 18:49:36 +01:00
Marco Slot	1633123d78	Fix crash in IN (NULL) queries	2019-12-13 08:35:54 +01:00
Hadi Moshayedi	7666d02537	Merge pull request #3294 from citusdata/fix_typos Fix some typos from #3280	2019-12-12 13:35:38 -08:00
Hadi Moshayedi	e7a6cc0801	Fix some typos from #3280	2019-12-12 13:29:26 -08:00
SaitTalhaNisanci	420e21919b	refactor extract distributed insert values rte (#3287 )	2019-12-12 23:47:44 +03:00
Marco Slot	7447dfe156	Fix error in DML with NULL expression in where clause (#3262 ) Fix error in DML with NULL expression in where clause	2019-12-12 17:25:23 +01:00
SaitTalhaNisanci	2c040d2c8f	use a function for duplicate code in connection state machine (#3209 )	2019-12-12 17:55:38 +03:00
SaitTalhaNisanci	a0fe8646e0	add IsHoldOffCancellationReceived utility function (#3290 )	2019-12-12 17:32:59 +03:00
SaitTalhaNisanci	053fe18404	not continue in sequential execution if a cancellation is received (#3289 )	2019-12-12 17:22:30 +03:00
Hadi Moshayedi	0cd14449f3	Merge pull request #3280 from citusdata/fix_drop_column Fix the way we check for local/reference table joins in the executor	2019-12-12 05:13:36 -08:00
Marco Slot	e7a8db5493	Fix issue with some zero-shard modifications	2019-12-12 07:19:10 +01:00
Hadi Moshayedi	383d34f51b	Tests for multi-statement transactions with subqueries or ctes	2019-12-11 19:54:15 -08:00
Hadi Moshayedi	939d3c955b	Don't plan function joins locally	2019-12-11 16:53:29 -08:00
Hadi Moshayedi	067d92a7f6	Don't plan joins between ref tables and views locally	2019-12-11 14:31:34 -08:00
Hadi Moshayedi	e3e174f30f	Fix the way we check for local/reference table joins in the executor	2019-12-11 12:50:20 -08:00
SaitTalhaNisanci	13204487e9	remove copyright years (#3286 )	2019-12-11 21:14:08 +03:00
SaitTalhaNisanci	3422e79f97	update contributing (#3284 )	2019-12-11 20:55:21 +03:00
SaitTalhaNisanci	c2823c9349	remove unused targets from makefile (#3283 )	2019-12-11 20:37:56 +03:00
Marco Slot	7a1817370b	rename REMOTE_TRANS_INVALID to REMOTE_TRANS_IDLE (#3285 ) rename REMOTE_TRANS_INVALID to REMOTE_TRANS_IDLE	2019-12-11 15:06:45 +01:00
SaitTalhaNisanci	d10f97998c	rename REMOTE_TRANS_INVALID to REMOTE_TRANS_NOT_STARTED	2019-12-11 15:24:18 +03:00
Önder Kalacı	fecf61ef1f	Add missing ORDER BY in a CTE (#3282 ) Otherwise, the query output might not be consistent.	2019-12-11 10:24:54 +01:00
Hadi Moshayedi	90568a87d0	Merge pull request #3113 from citusdata/refactor/insert_select Move coordinator insert..select logic into executor	2019-12-10 11:37:38 -08:00
Marco Slot	133b8e1e0e	Move coordinator insert..select logic into executor	2019-12-10 11:21:35 -08:00
Marco Slot	5d08ac3720	Fix inserts into local tables with distributed subqueries (#3271 ) Fix inserts into local tables with distributed subqueries	2019-12-10 16:58:54 +01:00
SaitTalhaNisanci	8e5041885d	Refactor isolation tests (#3062 ) Currently in mx isolation tests the setup is the same except the creation of tables. Isolation framework lets us define multiple `setup` stages, therefore I thought that we can put the `mx_setup` to one file and prepend this prior to running tests. How the structure works: - cpp is used before running isolation tests to preprocess spec files. This way we can include any file we want to. Currently this is used to include mx common part. - spec files are put to `/build/specs` for clear separation between generated files and template files - a symbolic link is created for `/expected` in `build/expected/`. - when running isolation tests, as the `inputdir`, `build` is passed so it runs the spec files from `build/specs` and checks the expected output from `build/expected`. `/specs` is renamed as `/spec` because postgres first looks at the `specs` file under the current directory, so this is renamed to avoid that since we are running the isolation tests from `build/specs` now. Note: now we use `//` instead of `#` in comments in spec files, because cpp interprets `#` as a directive and it ignores `//`.	2019-12-10 16:12:54 +01:00
Önder Kalacı	5395ce6480	We don't support PG 10 anymore, so the sed rule can go away (#3277 )	2019-12-10 12:40:43 +01:00
Marco Slot	486c620a3c	Fix inserts into local tables with distributed subqueries	2019-12-10 10:17:18 +01:00
Önder Kalacı	f027e9dd77	Improve Recursive CTE tests (#3274 ) Postgres keeps track of recursive CTEs in the queryTree in two ways: - queryTree->hasRecursive is set to true, whenever a RECURSIVE CTE is used in the SQL. Citus checks for it. - If the CTE is actually a recursive one (a.k.a., references itself) Postgres marks CommonTableExpr->cterecursive as true as well. The tests that are changed in the PR don't cover the second case, and this becomes an issue with CTE inlining (#3161). In that case, Citus/Postgres can inline such CTEs, and the queries work with Citus. However, these tests intend to check if there is any recursive CTE in the queryTree. So, we're actually making the CTEs recursive CTEs by having them refer to themselves. We'll add cases where a recursive CTE works by inlining in #3161.	2019-12-10 09:38:45 +01:00
Philip Dubé	768912e82b	Merge pull request #2907 from citusdata/test_non_deterministic_collation pg_dist_colocation: distributioncolumncollation	2019-12-09 20:25:01 +00:00
Philip Dubé	fcf2fd819b	Add distributioncolumncollation to pg_dist_colocation Use partition column's collation for range distributed tables Don't allow non deterministic collations for hash distributed tables CoPartitionedTables: don't compare unequal types	2019-12-09 19:51:40 +00:00
SaitTalhaNisanci	91f8be76e1	Add a script that fixes style related things (#3234 ) * Add a script that fixes style related things It is kind of tedious that we need to make sure that every style check passes with any change we make. A script is added, which does all the things for us so that we don't have to run separate commands. * run fix style string in reindent target	2019-12-09 14:23:53 +03:00
Philip Dubé	10f2d7c078	Merge pull request #3196 from citusdata/propagate_create_collation Propagate collations	2019-12-09 04:48:19 +00:00
Philip Dubé	d138bb89bf	Support creating collations as part of dependency resolution. Propagate ALTER/DROP on distributed collations Propagate CREATE COLLATION when outside transaction	2019-12-09 04:42:51 +00:00
Jelte Fennema	6340fc1171	Fix editorconfig syntax (#3272 ) The comma needs to be contained in curly braces otherwise it does not work	2019-12-06 17:05:04 +01:00
Alexander Pyhalov	6174a4d3d6	Fix build on illumos	2019-12-06 14:40:47 +01:00
Marco Slot	b0ac70f1f4	Fix strange errors in DML with unreachable sublinks (#3263 ) Fix strange errors in DML with unreachable sublinks	2019-12-06 14:40:28 +01:00
Marco Slot	6a9c0ea7fe	Fix errors in DML with sublinks hidden by null expressions	2019-12-06 14:25:04 +01:00
Hadi Moshayedi	9c254859bf	Merge pull request #3257 from citusdata/fix_sql_udf_calls Detect SQL UDF Calls.	2019-12-05 14:40:00 -08:00
Hadi Moshayedi	d28beb3711	Detect SQL UDF Calls.	2019-12-05 14:31:05 -08:00
Philip Dubé	9463509e4a	Merge pull request #3220 from citusdata/test-coordinator-coherence Test coordinator coherence	2019-12-03 22:34:36 +00:00
Philip Dubé	5a17fd6d9d	Test more reference/local cases, also ALTER ROLE Test ALTER ROLE doesn't deadlock when coordinator added, or propagate from mx workers Consolidate wait_until_metadata_sync & verify_metadata to multi_test_helpers	2019-12-03 22:23:14 +00:00
Philip Dubé	3433fd0068	Merge pull request #3249 from citusdata/aggregation-directives aggregate_support test: test DISTINCT, ORDER BY, FILTER, & no intermediate results	2019-12-03 15:56:09 +00:00
Philip Dubé	1597fbb369	aggregate_support test: test DISTINCT, ORDER BY, FILTER, & no intermediate results Previously, - we'd push down ORDER BY, but this doesn't order intermediate results between workers - we'd keep FILTER on master aggregate, which would raise an error about unexpected cstrings	2019-12-03 15:46:01 +00:00
Philip Dubé	ffacefc2ad	Merge pull request #3258 from citusdata/more-depended Stray depended to dependent tidy up	2019-12-03 15:36:37 +00:00
Philip Dubé	5fcc169a3a	Stray depended to dependent tidy up	2019-12-03 15:28:32 +00:00
Marco Slot	0b71697d88	Fix segfault in column_to_column_name (#3260 ) Fix segfault in column_to_column_name	2019-12-03 14:30:40 +01:00
Marco Slot	33f3fa0eb9	Fix segfault when executing DDL via UDF (#3259 ) Fix segfault when executing DDL via UDF	2019-12-03 14:06:55 +01:00
Marco Slot	bb3bc10f0c	Fix segfault in column_to_column_name	2019-12-01 23:57:25 +01:00
Marco Slot	b1b13e394e	Fix segfault when executing DDL via UDF	2019-12-01 22:54:41 +01:00
Marco Slot	5957d731ec	Merge pull request #3248 from citusdata/bump92 Bump repo version to 9.2devel	2019-12-01 22:20:14 +01:00
Nils Dijk	1ef1667ddb	add gitref to the output of citus_version (#3246 ) DESCRIPTION: add gitref to the output of citus_version During debugging of custom builds it is hard to know the exact version of the citus build you are using. This patch will add a human readable/understandable git reference to the build of citus which can be retrieved by calling `citus_version();`.	2019-11-29 15:54:09 +01:00
Marco Slot	cb7105d1dd	Remove distinction between SQL_TASK and ROUTER_TASK (#3243 ) Remove distinction between SQL_TASK and ROUTER_TASK	2019-11-29 14:54:50 +01:00
Marco Slot	4c8d43c5d0	Bump repo version to 9.2devel	2019-11-29 07:33:39 +01:00
Marco Slot	16d1ad3666	Remove distinction between SQL_TASK and ROUTER_TASK	2019-11-29 05:58:29 +01:00
SaitTalhaNisanci	aeec3d1544	fix typo in dependent jobs and dependent task (#3244 )	2019-11-28 23:47:28 +03:00
Onur TIRTIR	ec9392e729	Update CHANGELOG.md (#3241 )	2019-11-28 16:13:58 +03:00
Philip Dubé	dd5570810e	Merge pull request #3237 from citusdata/support-more-record-expressions RECORD: Add support for more expression types	2019-11-27 17:58:37 +00:00
Philip Dubé	0d04ff1692	RECORD: Add support for more expression types - OpExpr - NullIfExpr - MinMaxExpr - CoalesceExpr - CaseExpr Also fix case where ARRAY[(1,2), NULL] was rejected	2019-11-27 17:07:22 +00:00
Philip Dubé	6d14f63f81	Merge pull request #3211 from citusdata/support-recordarray Implement support for RECORD[] where we support RECORD	2019-11-27 15:27:13 +00:00
Philip Dubé	168e11cc9b	Implement support for RECORD[] where we support RECORD Support for ARRAY[] expressions is limited to having a consistent shape, eg ARRAY[(int,text),(int,text)] as opposed to ARRAY[(int,text),(float,text)] or ARRAY[(int,text),(int,text,float)]	2019-11-27 15:02:43 +00:00
Hadi Moshayedi	2268a9cae6	Error for metadata commands if any metadata node is out-of-sync (#3226 ) * Error for metadata commands if any metadata node is out-of-sync * Make the functions have separate APIs for all workers/metadata workers	2019-11-27 09:52:57 +01:00
Marco Slot	2329157406	Swap aggregate_support tests to simplify enterprise merge	2019-11-26 13:39:18 +01:00
Önder Kalacı	1cfbeb89ec	Make NodeCanHaveDistTablePlacements() public (#3229 ) Since it is required in rebalancer.	2019-11-26 12:15:38 +01:00
Marco Slot	60b741927f	Add missing include to deparse_function_stmts.c	2019-11-24 06:04:22 +01:00
Hadi Moshayedi	5a9d5d213a	Merge pull request #3227 from citusdata/nnig Fix typos	2019-11-25 16:46:10 -08:00
Philip Dubé	261a9de42d	Fix typos: VAR_SET_VALUE_KIND -> VAR_SET_VALUE kind beginnig -> beginning plannig -> planning the the -> the er then -> er than	2019-11-25 23:24:13 +00:00
Marco Slot	4b0ac4b0dd	Properly escape ALTER FUNCTION .. SET deparsing. Also test	2019-11-25 23:01:30 +00:00
Philip Dubé	3c10c27b13	GetFunctionAlterOwnerCommand: use format_procedure_qualified distributed_functions: test a function with a quote in name AppendDefElemSet: quote variable names	2019-11-25 23:01:30 +00:00
Philip Dubé	a81e6a81ab	Fix distributed aggregation for non superuser roles Moves support functions to pg_catalog for now. We'd prefer a different solution for when we're creating these support functions dynamically	2019-11-25 20:46:25 +00:00
Khashayar Fereidani	f81785ad14	Fix underflow initialization of default values The "memset" initialization of queryWindowClause and queryOrderByLimit underflows these variables. Because of the invalid sizeof usage, this part of the program could cause a buffer overflow and function return data corruption in future changes.	2019-11-25 19:25:51 +00:00
Onur TIRTIR	bef32624c3	Escape extension name in extension command propagation (#3218 )	2019-11-24 12:16:10 +03:00
Philip Dubé	99164398bf	Fix potential segfault from standard_planner inlining functions	2019-11-21 18:47:36 +00:00
Philip Dubé	c563e0825c	Strip trailing whitespace and add final newline (#3186 ) This brings files in line with our editorconfig file	2019-11-21 14:25:37 +01:00
Jelte Fennema	1d8dde232f	Automatically convert useless declarations using regex replace (#3181 ) * Add declaration removal to CI * Convert declarations	2019-11-21 13:47:29 +01:00
Onur TIRTIR	9961297d7b	Improve extension command propagation logic and tests * Improve extension command propagation tests * patch for hardcoded citus extension name (cherry picked from commit 0bb3dbac0afabda10e8928f9c17eda048dc4361a)	2019-11-21 11:24:39 +03:00
Marco Slot	7d2813f799	Update .codecov.yml after moving ruleutils files	2019-11-16 14:25:35 +01:00
Marco Slot	38748159ec	Merge pull request #3089 from citusdata/move_files Move C files into the appropriate directories	2019-11-20 19:38:44 +01:00
Hanefi Onaldi	d82f3e9406	Introduce intermediate result broadcasting In plain words, each distributed plan pulls the necessary intermediate results to the worker nodes that the plan hits. This is primarily useful in three ways. (i) If the distributed plan that uses intermediate result(s) is a router query, then the intermediate results are only broadcasted to a single node. (ii) If a distributed plan consists of only intermediate results, which is not uncommon, the intermediate results are broadcasted to a single node only. (iii) If a distributed query hits a sub-set of the shards in multiple workers, the intermediate results will be broadcasted to the relevant node(s). The final item (iii) becomes crucial for append/range distributed tables where typically the distributed queries hit a small subset of shards/workers. To do this, for each query for which Citus creates a distributed plan, we keep track of the subPlans used in the queryTree, and save them in the distributed plan. Just before Citus executes each subPlan, Citus first keeps track of every worker node that the distributed plan hits, and marks that every subPlan should be broadcasted to these nodes. Later, for each subPlan which is a distributed plan, Citus does this operation recursively since these distributed plans may access different subPlans, and those have to be recorded as well.	2019-11-20 15:26:36 +03:00
Philip Dubé	b7fef5c31a	Miscellaneous cleanup in prep for collation propagation	2019-11-19 17:28:59 +00:00
Jelte Fennema	1ed05be82c	Flaky test: Fix recover_prepared_transactions (#3205 ) Failed test: https://app.circleci.com/jobs/github/citusdata/citus/35994 We now always take a new connection	2019-11-19 17:49:13 +01:00
Jelte Fennema	1ac96f228b	Flaky test: Force correct plan (#3203 ) Failing test: https://app.circleci.com/jobs/github/citusdata/citus/23148	2019-11-19 17:11:05 +01:00
Onur TIRTIR	26c306d188	Add extensions to distributed object propagation infrastructure (#3185 )	2019-11-19 17:56:28 +03:00
SaitTalhaNisanci	2cb82ae9bd	create a utility method to mark tasks as failed (#3150 )	2019-11-19 16:35:56 +03:00
SaitTalhaNisanci	306d159072	refactor AfterXacthodtConnectionHandling (#3202 )	2019-11-19 14:50:23 +03:00
Jelte Fennema	87f57eb92b	Fix verify_metadata not returning consistent results (#3199 ) Failing test: https://app.circleci.com/jobs/github/citusdata/citus/58827	2019-11-19 11:02:35 +01:00
Hanefi Onaldi	e3ad4aba94	Bump 9.1devel * Add Changelog entry for 9.0.1 * Bump citus version to 9.1devel	2019-11-19 10:35:57 +03:00
Marco Slot	18843af688	Return early in CitusHasBeenLoaded when creating a different ex… (#3178 ) Return early in CitusHasBeenLoaded when creating a different extension	2019-11-18 22:10:43 +01:00
Önder Kalacı	40fa3862ce	Prevent Citus extension becoming distributed object (#3197 ) Prevent Citus extension being distributed Because that could prevent doing rolling upgrades, where users may prefer to upgrade the version on the coordinator but not the workers. There could be some other edge cases, so I'd prefer to keep Citus extension outside the picture for now.	2019-11-18 16:57:10 +01:00
Halil Ozan Akgül	c5c31e6093	Merge pull request #3184 from citusdata/alter_role_propagation Alter Role Propagation	2019-11-18 18:43:15 +03:00
Halil Ozan Akgul	5ae7b219ff	Create the ALTER ROLE propagation	2019-11-18 18:31:28 +03:00
Nils Dijk	217890af5f	Feature: Expression in reference join (#3180 ) DESCRIPTION: Expression in reference join Fixed: #2582 This patch allows arbitrary expressions in the join clause when joining to a reference table. An example of such joins could be found in CHbenCHmark queries 7, 8, 9 and 11; `mod((s_w_id * s_i_id),10000) = su_suppkey` and `ascii(substr(c_state,1,1)) = n2.n_nationkey`. Since the join is on a reference table these queries are able to be pushed down to the workers. To implement these queries we will widen the `IsJoinClause` predicate to not check if the expressions are a type `Var` after stripping the implicit coercions. Instead we define a join clause when the `Var`'s in a clause come from more than 1 table. This allows more clauses to pass into the logical planner's `MultiNodeTree(...)` planning function. To compensate for this we tighten down the `LocalJoin`, `SinglePartitionJoin` and `DualPartitionJoin` to check for direct column references when planning. This allows the planner to work with arbitrary join expressions on reference tables.	2019-11-18 16:25:46 +01:00
Önder Kalacı	a4c90b6ee1	Make distributed object dependency logic follow up to extensions (#3195 ) With this commit, we're slightly changing the dependency traversal logic to enable extension propagation. The main idea is to "follow" the extension dependencies, but do not "apply" them. Since some extension dependencies are base types, and base types could have circular dependencies, we implement a logic to prevent revisiting an already visited object.	2019-11-17 17:21:21 +01:00
Marco Slot	e0cccf7f9a	Move C files into the appropriate directory	2019-11-16 11:36:17 +01:00
Hadi Moshayedi	1f46d47f36	Merge pull request #3188 from citusdata/repref_planner Plan reference<->local table joins locally	2019-11-15 09:30:21 -08:00
Hadi Moshayedi	d9dcba25e3	Plan reference/local table joins locally	2019-11-15 07:36:50 -08:00
Hadi Moshayedi	4230b96247	Merge pull request #3170 from citusdata/fix_round_robin Do not include coordinator shards when round-robin is selected	2019-11-15 06:11:41 -08:00
Onder Kalaci	90943a6ce6	Do not include coordinator shards when round-robin is selected When the user picks "round-robin" policy, the aim is that the load is distributed across nodes. However, for reference tables on the coordinator, since local execution kicks in immediately, round-robin is ignored. With this change, we're excluding the placement on the coordinator. Although the approach seems a little bit invasive because of modifications in the placement list, that sounds acceptable. We could have done this in some other ways such as: 1) Add a field to "Task->roundRobinPlacement" (or such), which is updated as the first element after RoundRobinPolicy is applied. During the execution, if that placement is local to the coordinator, skip it and try the other remote placements. 2) On TaskAccessesLocalNode()@local_execution.c, check task_assignment_policy, if round-robin selected and there is local placement on the coordinator, skip it. However, task assignment is done on planning, but this decision is happening on the execution, which could create weird edge cases.	2019-11-15 06:03:32 -08:00
Hadi Moshayedi	f8459f81a8	Merge pull request #3155 from citusdata/repref_base Replicate reference tables to coordinator, except planner changes	2019-11-15 05:59:19 -08:00
Hadi Moshayedi	c8c68d719b	Merge pull request #3164 from citusdata/propagate_activate Propagate isactive to metadata nodes.	2019-11-15 05:57:35 -08:00
Hadi Moshayedi	15af1637aa	Replicate reference tables to coordinator.	2019-11-15 05:50:19 -08:00
Hadi Moshayedi	cb011bb30f	Propagate isactive to metadata nodes.	2019-11-15 05:48:42 -08:00
SaitTalhaNisanci	b9b7fd7660	add IsLoggableLevel utility function (#3149 ) * add IsLoggableLevel utility function * add function comment for IsLoggableLevel * put ApplyLogRedaction to logutils	2019-11-15 14:59:13 +03:00
Jelte Fennema	1b2c438e69	Rename variables to not shadow globals in RHEL6 (#3194 ) Fixes #2839	2019-11-15 12:12:24 +01:00
Jelte Fennema	a8bd2d58f5	Update SQL definitions to prepare for drain node functionality (#3179 )	2019-11-15 10:11:56 +01:00
Jelte Fennema	4b9b4b0995	Don't warn for declaration-after-statement since we only support GNU99 (#3132 ) This change was actually already intended in #3124. However, the postgres Makefile manually enables this warning too. This way we undo that. To confirm that it works two functions were changed to make use of not having the warning anymore.	2019-11-15 09:46:06 +01:00
Marco Slot	622462cad7	Return early in CitusHasBeenLoaded when creating a different extension	2019-11-15 03:00:20 +01:00
Philip Dubé	495c0f5117	Phase 1 implementation of custom aggregates Phase 1 seeks to implement minimal infrastructure, so does not include: - dynamic generation of support aggregates to handle multiple arguments - configuration methods to direct aggregation strategy, or mark an aggregate's serialize/deserialize as safe to operate across nodes Aggregates can be distributed when: - they have a single argument - they have a combinefunc - their transition type is not a pseudotype	2019-11-14 19:01:24 +00:00
Philip Dubé	edc7a2ee38	Improve RECORD support	2019-11-14 18:32:22 +00:00
Philip Dubé	eb35743c3f	Remove citus.worker_list_file & master_initialize_node_metadata	2019-11-13 00:49:58 +00:00
Philip Dubé	48552bfffe	Call DestReceiver rDestroy before it goes out of scope CitusCopyDestReceiverDestroy: call hash_destroy on shardStateHash & connectionStateHash	2019-11-12 15:03:07 +00:00
Jelte Fennema	adc6ca6100	Make simple in queries on unique columns work with repartition join (#3171 ) This is necessary to support Q20 of the CHbenCHmark: #2582. To summarize the fix: The subquery is converted into an INNER JOIN on a table. This fixes the issue, since an INNER JOIN on a table is already supported by the repartition planner. The replacement happens as follows: 1. Postgres replaces `col in (subquery)` with a SEMI JOIN (subquery) on col = subquery_result 2. If this subquery is simple enough Postgres will replace it with a regular read from a table 3. If the subquery returns unique results (e.g. a primary key) Postgres will convert the SEMI JOIN into an INNER JOIN during the planning. It will not change this in the rewritten query though. 4. We check if Postgres sends us any SEMI JOINs during its join order planning, if it doesn't we replace all SEMI JOINs in the rewritten query with INNER JOIN (which we already support).	2019-11-11 13:44:28 +01:00
SaitTalhaNisanci	57380fd668	remove duplicated method in multi_logical_optimizer (#3166 )	2019-11-11 13:51:21 +03:00
Önder Kalacı	460f000218	Remove failure tests related to real-time executor (#3174 ) Since we've removed the executor, we don't need the specific tests. Since the tests are already using adaptive executor, they were passing. But, we've plenty of extra tests for adaptive executor, so seems safe to remove.	2019-11-11 10:18:37 +01:00
Philip Dubé	ad86c1b866	AcquireDistributedLockOnRelations: escape relation names	2019-11-08 21:23:01 +00:00
Philip Dubé	e8ecbbfcb3	Escape transaction names	2019-11-08 21:23:01 +00:00
Jelte Fennema	9fb897a074	Fix queries with repartition joins and group by unique column (#3157 ) Postgres doesn't require you to add all columns that are in the target list to the GROUP BY when you group by a unique column (or columns). It even actively removes these group by clauses when you do. This is normally fine, but for repartition joins it is not. The reason for this is that the temporary tables don't have these primary key columns. So when the worker executes the query it will complain that it is missing columns in the group by. This PR fixes that by adding an ANY_VALUE aggregate around each variable in the target list that is not contained in the group by or in an aggregate. This is done only for repartition joins. The ANY_VALUE aggregate chooses the value from an undefined row in the group.	2019-11-08 15:36:18 +01:00
SaitTalhaNisanci	02b359623f	remove duplicate code in citus_dist_stat_activity (#3165 )	2019-11-08 15:41:32 +03:00
Önder Kalacı	0b3d4e55d9	Local execution should not change hasReturning for distributed tables (#3160 ) It looks like the logic to prevent RETURNING in reference tables to have duplicate entries that comes from local and remote executions leads to missing some tuples for distributed tables. With this PR, we're ensuring to kick in the logic for reference tables only.	2019-11-08 12:49:56 +01:00
Philip Dubé	9a31837647	isolation_create_restore_point: test reference tables too	2019-11-07 17:50:22 +00:00
Philip Dubé	72c3d64ead	Rename OpenConnectionsToAllNodes to OpenConnectionsToAllWorkerNodes	2019-11-07 17:50:22 +00:00
Philip Dubé	2fc45e5897	create_distributed_function: accept aggregates Adds support for OCLASS_PROC to worker_create_or_replace_object	2019-11-06 18:23:37 +00:00
Hadi Moshayedi	622ee54c95	Merge pull request #3152 from citusdata/remove-ref-table-replication Don't maintain replicationfactor of reference tables in pg_dist_colocation	2019-11-05 09:27:19 -08:00
Hadi Moshayedi	e00d1546f3	Don't maintain replicationfactor of reference tables	2019-11-05 07:23:14 -08:00
Onder Kalaci	471703bfaf	DEBUG only when the function is distributed Otherwise, we're seeing this message way too often.	2019-11-05 15:08:35 +00:00
Önder Kalacı	960cd02c67	Remove real time router executors (#3142 ) * Remove unused executor code All of the code of the real-time executor. Some functions in the router executor still remain there because they are common functions. We'll move them to the appropriate places in follow-up commits. * Move GUCs to transaction mngnt and remove unused struct * Update test output * Get rid of references to real-time executor from code * Warn if real-time executor is picked * Remove lots of unused connection code * Remove unused code for connection restrictions Real-time and router executors cannot handle re-use of existing connections within a transaction block. Adaptive executor and COPY can re-use the connections. So, there is no reason to keep the code around for applying the restrictions in the placement connection logic.	2019-11-05 12:48:10 +01:00
Jelte Fennema	f0c35ad134	Include fmgr.h, don't duplicate FunctionCallInfo typedef	2019-11-04 17:10:33 +00:00
Jelte Fennema	4ba5619bb2	Make implicit declarations and wrong return types hard errors This makes three warnings hard errors: 1. `implicit-int` > Warn when a declaration does not specify a type. 2. `implicit-function-declaration` > Give a warning whenever a function is used before being declared. 3. `return-type` > Warn whenever a function is defined with a return type that defaults to > "int". Also warn about any "return" statement with no return value in > a function whose return type is not "void" (falling off the end of the > function body is considered returning without a value). > > For C only, warn about a "return" statement with an expression in a > function whose return type is "void", unless the expression type is > also "void". The compiler behaviour when these warnings occur is not the behaviour the developer expects. So even during development it makes sense that they are errors.	2019-11-04 16:48:27 +00:00
SaitTalhaNisanci	7c410e3cd7	pass CitusCustomState directly to adaptive executor (#3151 )	2019-11-01 19:57:32 +03:00
Önder Kalacı	ffd89e4e01	Include all relevant relations in the ExtractRangeTableRelationWalker (#3135 ) We've changed the logic for pulling RTE_RELATIONs in #3109 and non-colocated subquery joins and partitioned tables. @onurctirtir found this steps where I traced back and found the issues. While looking into it in more detail, we decided to expand the list in a way that the callers get all the relevant RTE_RELATIONs RELKIND_RELATION, RELKIND_PARTITIONED_TABLE, RELKIND_FOREIGN_TABLE and RELKIND_MATVIEW. These are all relation kinds that Citus planner is aware of.	2019-11-01 16:06:58 +01:00
Onur TIRTIR	d3f68bf44f	Fix view is not distributed error when view is used in modify statements (#3104 )	2019-11-01 16:34:01 +03:00
SaitTalhaNisanci	c7ceca3216	update outdated comment in JobExecutorType (#3148 )	2019-11-01 11:36:56 +03:00
SaitTalhaNisanci	70e46703aa	Fix debug1 message in JobExecutorType (#3147 ) When the citus.enable_repartition_joins GUC is set to on and the adaptive executor is in use, there was a typo in the debug message, which said "realtime executor" instead of "adaptive executor".	2019-11-01 11:14:19 +03:00
Hadi Moshayedi	ab4cf8525b	Merge pull request #3146 from citusdata/fix/metadata_sync_on_standby Do not try to sync metadata on standby coordinator	2019-10-31 14:56:51 -07:00
SaitTalhaNisanci	dadbe86af1	refactor some of hard coded values in citus gucs (#3137 ) * refactor some of hard coded values in citus gucs * rename GUC_ALLOW_ALL to GUC_STANDARD	2019-10-30 10:35:39 +03:00
Marco Slot	51c64c70c9	Do not try to sync metadata on standby coordinator	2019-10-30 05:15:45 +01:00
Jelte Fennema	341feb21ca	Clean old citus install before installing extension (#3134 ) One case where this would have been useful recently was when we changed the `citus--9.0-1--9.1-1.sql` file to `citus--9.0-2--9.1-1.sql`. If you had the old file installed it would never be cleaned. Any changes in the 9.0-2 version would not be applied, because postgres tries to find the shortest migration path, so it would always run the 9.0-1--9.1-1 version, because then it could skip 9.0-1--9.0-2.	2019-10-29 16:11:43 +01:00
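The shortest-path effect described in 341feb21ca can be illustrated with a small breadth-first search over the installed upgrade scripts. This is a simplified sketch rather than PostgreSQL's actual path search; `UpgradeScript`, `VersionIndex`, and `ShortestUpgradePathLength` are hypothetical names introduced only for this illustration, and the sketch handles at most `MAX_VERSIONS` distinct versions.

```c
#include <assert.h>
#include <string.h>

#define MAX_VERSIONS 16

/* One upgrade script "from--to", e.g. citus--9.0-1--9.0-2.sql. */
typedef struct UpgradeScript
{
	const char *from;
	const char *to;
} UpgradeScript;

/* Returns the index of `name` in `versions`, registering it if it is new. */
static int
VersionIndex(const char **versions, int *count, const char *name)
{
	for (int i = 0; i < *count; i++)
	{
		if (strcmp(versions[i], name) == 0)
		{
			return i;
		}
	}
	assert(*count < MAX_VERSIONS);
	versions[*count] = name;
	return (*count)++;
}

/*
 * Breadth-first search over the scripts. Returns the number of scripts on
 * the shortest path from `source` to `target`, or -1 if no path exists.
 */
static int
ShortestUpgradePathLength(const UpgradeScript *scripts, int scriptCount,
						  const char *source, const char *target)
{
	const char *versions[MAX_VERSIONS];
	int distance[MAX_VERSIONS];
	int queue[MAX_VERSIONS];
	int versionCount = 0;
	int head = 0;
	int tail = 0;

	int sourceIndex = VersionIndex(versions, &versionCount, source);
	int targetIndex = VersionIndex(versions, &versionCount, target);

	/* register every version up front so the queue can never overflow */
	for (int i = 0; i < scriptCount; i++)
	{
		VersionIndex(versions, &versionCount, scripts[i].from);
		VersionIndex(versions, &versionCount, scripts[i].to);
	}
	for (int i = 0; i < MAX_VERSIONS; i++)
	{
		distance[i] = -1;
	}

	distance[sourceIndex] = 0;
	queue[tail++] = sourceIndex;
	while (head < tail)
	{
		int current = queue[head++];
		for (int i = 0; i < scriptCount; i++)
		{
			if (strcmp(scripts[i].from, versions[current]) != 0)
			{
				continue;
			}
			int next = VersionIndex(versions, &versionCount, scripts[i].to);
			if (distance[next] == -1)
			{
				distance[next] = distance[current] + 1;
				queue[tail++] = next;
			}
		}
	}

	return distance[targetIndex];
}
```

With only `9.0-1--9.0-2` and `9.0-2--9.1-1` installed, the path from 9.0-1 to 9.1-1 uses two scripts. If a stale `9.0-1--9.1-1` file is still present, the shortest path has length one and the 9.0-2 changes are skipped, which is the situation the cleanup step avoids.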
Marco Slot	ed7c55a7af	Merge pull request #3129 from citusdata/fix_distributed_function Disallow distributed function creation when replication_model is 'statement'	2019-10-28 16:53:17 +01:00
SaitTalhaNisanci	29d45bd1b9	Do not assign InvalidOid for local execution while extracting parameters (#3131 ) * do not assign InvalidOid for local execution while extracting parameters * rename functions * rename parameter and replace function	2019-10-28 14:28:22 +03:00
Marco Slot	03cae27782	Add tests for distributing functions with replication_model statement	2019-10-26 23:57:59 +02:00
Marco Slot	067657af26	Disallow distributed functions with distribution arguments unless replication_model is streaming	2019-10-26 23:57:59 +02:00
Önder Kalacı	dceaddbe4d	Remove real-time/router executors (step 1) (#3125 ) See #3125 for details on each item. * Remove real-time/router executor tests-1 These are the ones which doesn't have '_%d' in the test output files. * Remove real-time/router executor tests-2 These are the ones which has in the test output files. * Move the tests outputs to correct place * Make sure that single shard commits use 2PC on adaptive executor It looks like we've messed the tests in #2891. Fixing back. * Use adaptive executor for all router queries This becomes important because when task-tracker is picked, we used to pick router executor, which doesn't make sense. * Remove explicit references to real-time/router executors in the tests * JobExecutorType never picks real-time/router executors * Make sure to go incremental in test output numbers * Even users cannot pick real-time anymore * Do not use real-time/router custom scans * Get rid of unnecessary normalizations * Reflect unneeded normalizations * Get rid of unnecessary test output file	2019-10-25 10:54:54 +02:00
Jelte Fennema	96b8c36723	Better check for compiler flag capability When adding a `-Wno-some-error` flag, instead of trying to check for `-Wno-some-error` support this checks for `-Wsome-error` support. This is done because currently there's a problem when compiling citus with `./configure CFLAGS=-Werror`. If there's a warning (which is then converted to an error) in addition to this error, the output will contain the following message with GCC 7: ``` cc1: error: unrecognized command line option ‘-Wno-gnu-variable-sized-type-not-at-end’ [-Werror] ``` This is because of the following behaviour of GCC in case of unknown warnings: > When an unrecognized warning option is requested (e.g., -Wunknown-warning), GCC > emits a diagnostic stating that the option is not recognized. However, if the > -Wno- form is used, the behavior is slightly different: no diagnostic is > produced for -Wno-unknown-warning unless other diagnostics are being produced. > This allows the use of new -Wno- options with old compilers, but if something > goes wrong, the compiler warns that an unrecognized option is present. By changing the check to `-Wsome-error`, the check will actually fail when the warning is not supported, instead of being silently ignored during the check and then surfacing when another error happens.	2019-10-24 17:05:04 +00:00
Marco Slot	86a4c0925b	Revoke usage from the citus schema from public (#3123 ) Revoke usage from the citus schema from public	2019-10-24 14:22:29 +02:00
Jelte Fennema	32676f9233	Use -std=gnu99 (C99 + GNU extensions) (#3124 ) Everything supports this, works in clang 6 and gcc 4.7 Also removes -Wdeclaration-after-statement, since declarations after statements are only not supported in ISO C90. Fixes #3122	2019-10-24 14:01:24 +02:00
Jelte Fennema	a5010e5b17	Add extra foreach convenience macros (#3117 ) This completely hides `ListCell` from the user of the loop Example usage: ```c WorkerNode *workerNode = NULL; foreach_ptr(workerNode, workerNodeList) { // Do stuff with workerNode } ``` Instead of: ```c ListCell *workerNodeCell = NULL; foreach(workerNodeCell, workerNodeList) { WorkerNode *workerNode = lfirst(workerNodeCell); // Do stuff with workerNode } ```	2019-10-23 16:49:12 +02:00
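How a loop macro can hide the cell variable, as a5010e5b17 describes, can be sketched on a toy linked list. The sketch assumes nothing about the real Citus macro or PostgreSQL's `List` type: `DemoCell`, `DemoList`, `demo_foreach_ptr`, `DemoWorkerNode`, and `SumWorkerPorts` are hypothetical, and the loop-scoped declaration relies on C99, in line with the `-std=gnu99` switch made in 32676f9233.

```c
#include <assert.h>
#include <stddef.h>

/* Minimal stand-in for a list of pointers; NOT PostgreSQL's List/ListCell. */
typedef struct DemoCell
{
	void *ptr;
	struct DemoCell *next;
} DemoCell;

typedef struct DemoList
{
	DemoCell *head;
} DemoList;

/*
 * Iterates over `list`, assigning each element pointer to `var`. The cell is
 * declared inside the for statement (named <var>Cell), so callers never
 * declare or touch it. The comma expression assigns the element and then
 * yields 1 so the loop continues while a cell is available.
 */
#define demo_foreach_ptr(var, list) \
	for (DemoCell *var##Cell = (list)->head; \
		 var##Cell != NULL && (((var) = var##Cell->ptr), 1); \
		 var##Cell = var##Cell->next)

typedef struct DemoWorkerNode
{
	int port;
} DemoWorkerNode;

/* Sums the ports of all nodes in the list using the convenience macro. */
static int
SumWorkerPorts(DemoList *workerNodeList)
{
	int total = 0;
	DemoWorkerNode *workerNode = NULL;

	assert(workerNodeList != NULL);

	demo_foreach_ptr(workerNode, workerNodeList)
	{
		total += workerNode->port;
	}

	return total;
}
```

The caller only declares the element variable; the cell bookkeeping stays inside the macro, which removes the mismatch between cell and element names that the longer form makes easy to introduce.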
Önder Kalacı	54de466876	Merge pull request #3067 from citusdata/add_upgrade_test_function Add upgrade test for distributed functions	2019-10-23 16:16:43 +02:00
Onder Kalaci	c2460a1c31	Add upgrade test for distributed functions Simply make sure that Citus can pushdown functions after pg upgrade.	2019-10-23 12:07:51 +02:00
Philip Dubé	b2f084d7f5	UnsetMetadataSyncedForAll: use CatalogTupleUpdateWithInfo	2019-10-23 00:45:11 +00:00
Philip Dubé	2a969fe4bb	ssl_by_default: remove stray PG10 check	2019-10-23 00:27:54 +00:00
Marco Slot	b8c8fd4612	Fix run_command_on_colocated_placements tests	2019-10-23 00:08:17 +02:00
Marco Slot	a1162b2023	Rename 9.1 upgrade script to upgrade from 9.0-2	2019-10-23 00:08:17 +02:00
Marco Slot	04040e0a37	Revoke usage from the citus schema	2019-10-23 00:08:17 +02:00
Philip Dubé	2204a17dbd	isolation_multiuser_locking: reorder GRANT to avoid deadlock on enterprise	2019-10-22 21:10:55 +00:00
Önder Kalacı	f0f93d9c45	Merge pull request #3114 from citusdata/fix_leak_more_generic Fix memory leak on ReceiveResults	2019-10-22 17:29:52 +02:00
Onder Kalaci	a208f8b151	Fix memory leak on ReceiveResults It turns out that TupleDescGetAttInMetadata() allocates quite a lot of memory. And, if the target list is long and there are too many rows returning, the leak becomes apparent. You can reproduce the issue without the fix with the following commands: ```SQL CREATE TABLE users_table (user_id int, time timestamp, value_1 int, value_2 int, value_3 float, value_4 bigint); SELECT create_distributed_table('users_table', 'user_id'); insert into users_table SELECT i, now(), i, i, i, i FROM generate_series(0,99999)i; -- load faster -- 200,000 INSERT INTO users_table SELECT * FROM users_table; -- 400,000 INSERT INTO users_table SELECT * FROM users_table; -- 800,000 INSERT INTO users_table SELECT * FROM users_table; -- 1,600,000 INSERT INTO users_table SELECT * FROM users_table; -- 3,200,000 INSERT INTO users_table SELECT * FROM users_table; -- 6,400,000 INSERT INTO users_table SELECT * FROM users_table; -- 12,800,000 INSERT INTO users_table SELECT * FROM users_table; -- making the target list entry wider speeds up the leak to show up select ,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, FROM users_table ; ```	2019-10-22 17:22:26 +02:00
Jelte Fennema	78e495e030	Add shouldhaveshards to pg_dist_node (#2960 ) This is an improvement over #2512. This adds the boolean shouldhaveshards column to pg_dist_node. When it's false, create_distributed_table for new colocation groups will not create shards on that node. Reference tables will still be created on nodes where it is false.	2019-10-22 16:47:16 +02:00
Jelte Fennema	5001c44990	Remove trailing whitespace	2019-10-22 11:26:08 +02:00
Hanefi Onaldi	7ebda04494	Update all c-style comments in migration files	2019-10-21 16:05:53 +03:00
Halil Ozan Akgül	210a6cc04b	Merge pull request #3111 from citusdata/refresh_materialized_view_with_subquery Refresh Materialized View with Subquery	2019-10-17 16:07:35 +03:00
Halil Ozan Akgul	5f04ac774f	Adds the tests for refresh materialized views	2019-10-17 16:00:56 +03:00
Önder Kalacı	e065933928	Remove all the non-adaptive test schedules (#3112 ) Just because we'll remove the executors soon, and it doesn't make sense to keep them running. I'll remove the tests files with a follow-up commit, but it seems safe to remove them from circleci now.	2019-10-17 10:39:31 +03:00
Jelte Fennema	7abedc38b0	Support subqueries in HAVING (#3098 ) Areas for further optimization: - Don't save subquery results to a local file on the coordinator when the subquery is not in the having clause - Push the HAVING with subquery to the workers if there's a group by on the distribution column - Don't push down the results to the workers when we don't push down the HAVING clause, only the coordinator needs it Fixes #520 Fixes #756 Closes #2047	2019-10-16 16:40:14 +02:00
Onur TIRTIR	3bfb2a078b	Make changes on if-statement in ExtractRangeTableList for further walker types (#3110 )	2019-10-16 15:50:09 +03:00
Onur TIRTIR	d5f83dc110	Refactor range table walkers (#3109 )	2019-10-16 01:20:49 +03:00
SaitTalhaNisanci	94a7e6475c	Remove copyright years (#2918 ) * Update year as 2012-2019 * Remove copyright years	2019-10-15 17:44:30 +03:00
Jelte Fennema	9b2f4d71ac	Make sure some MX tests use defined shard_ids (#3103 )	2019-10-12 22:46:14 +02:00
Philip Dubé	74cb168205	Remove Postgres 10 support	2019-10-11 21:56:56 +00:00
SaitTalhaNisanci	95633416f7	remove pg10 from pg upgrade tests (#3102 )	2019-10-11 10:30:06 +02:00
Hadi Moshayedi	3fb9c63890	Merge pull request #3100 from citusdata/fix_typo Fix a typo	2019-10-10 11:16:11 -07:00
Hadi Moshayedi	b50d216536	Fix a typo	2019-10-10 10:44:41 -07:00
Philip Dubé	4063e7ca67	CALL delegation: apply strip_implicit_coercions to distribution argument	2019-10-10 17:42:43 +00:00
Philip Dubé	7ffd78b6e0	isolation_multiuser_locking Introduce a test which checks that locks are only acquired when a user has necessary permissions Currently tests REINDEX, CREATE INDEX, TRUNCATE	2019-10-10 16:58:41 +00:00
Philip Dubé	dd490b6376	Cache whether an object is in pg_dist_object. Avoids redundant lookups for non-distributed objects	2019-10-10 14:50:38 +00:00
Hanefi Onaldi	2a47cff7b7	Bump citus to 9.0.0	2019-10-09 11:33:58 +03:00
Jelte Fennema	f0f702c604	Add sane editorconfig settings (#3051 ) - UNIX line endings - utf8 - newline at end of file - trim trailing whitespace	2019-10-08 15:07:01 +02:00
SaitTalhaNisanci	83667436f7	add support to run citus upgrade tests locally (#3083 ) * add support to run citus upgrade tests locally * dont build tars if they already exist * use current code instead of master for upgrade * always build the current code * copy the current citus code to have isolated citus upgrade tests * fix configure and simplify copy	2019-10-08 15:32:44 +03:00
Nils Dijk	4a4a220945	Fix enum add value order and pg12 (#3082 ) DESCRIPTION: Fix order for enum values and correctly support pg12 PG 12 introduces `ALTER TYPE ... ADD VALUE ...` during transactions. Earlier versions would error out when called in a transaction, hence we connect to workers outside of the transaction which could cause inconsistencies on pg12 now that postgres doesn't error with this syntax anymore. During the implementation of this fix it became apparent there was an error with the ordering of enum labels when the type was recreated. A patch and test have been included.	2019-10-07 17:16:19 +02:00
Jelte Fennema	01da11f264	Change citus truncate trigger to AFTER and add more upgrade tests (#3070 ) * Add more upgrade tests * Fix citus trigger generation after upgrade citus_truncate_trigger runs before truncate when created by create_distributed_table: `492d1b2cba/src/backend/distributed/commands/create_distributed_table.c (L1163)` * Remove pg_dist_jobid_seq	2019-10-07 16:43:04 +02:00
SaitTalhaNisanci	bffd110446	use citus instead of citusdata repo (#3077 )	2019-10-07 12:03:42 +03:00
Onder Kalaci	3be72ce42f	Make sure that distributed functions always have the correct user Objectives: (a) both super user and regular user should have the correct owner for the function on the worker (b) The transactional semantics would work fine for both super user and regular user (c) non-super-user and non-function owner would get a reasonable error message if tries to distribute the function Co-authored-by: @serprex	2019-10-04 21:38:49 +00:00
SaitTalhaNisanci	c547664fae	Add Citus upgrade tests with its job (#3003 ) * Add initial citus upgrade test * Add restart databases and run tests in all nodes * Add output for citus versions 8.0 8.1 8.2 and 8.3 * Add verify step for citus upgrade * Add target for citus upgrade test in makefile * Add check citus upgrade job * Fix installation file path and add missing tar * Run citus upgrade for v8.0 v8.1 v8.2 and v8.3 * Create upgrade_common file and rename upgrade check * Add pg version to citus upgrade test * Test with postgres 10 and 11 in citus upgrade tests * Add readme for citus upgrade test * Add some basic tests to citus upgrade tests * Add citus upgrade mixed mode test * Remove citus artifacts before installing another one * Refactor citus upgrade test according to reviews * quick and dirty rewrite of citus upgrade tests to support local execution. I think we need to change the makefile in such a way that the tar files can be injected from the circle ci config file. Also I removed some of the citus version checks you had to not have the requirement to pass that in separately from the pre tar file. I am not super happy with it, but two flags that need to be kept in sync is also not desirable. Instead I print out the citus version that is installed per node. This will not cause a failure if they are not what one would expect but it lets us verify we are running the expected version. * use latest citusupgradetester in circleci * update readme and use common alias for upgrade_common import	2019-10-04 17:44:49 +03:00
Marco Slot	144a4f4cfa	Merge pull request #3075 from citusdata/fix/citus_schema_grant Grant usage on schema citus to public	2019-10-04 13:22:57 +02:00
Marco Slot	1a3a174f67	Grant usage on schema citus to public	2019-10-04 12:26:08 +02:00
Marco Slot	c15ddfb63f	Merge pull request #3068 from citusdata/fix_metadata_sync_locks Don't block for locks in SyncMetadataToNodes()	2019-10-04 12:24:57 +02:00
Marco Slot	89377ee578	Move RowExclusiveLock to start in SyncMetadataToNodes	2019-10-04 12:07:41 +02:00
Hadi Moshayedi	217db2a03e	Don't block for locks in SyncMetadataToNodes()	2019-10-03 16:53:36 -07:00
Hadi Moshayedi	ae915493e6	Don't send metadata commands to not-synced workers. Otherwise some of the dependencies might not exist yet and commands will error out.	2019-10-03 16:52:25 -07:00
Marco Slot	7a49f158f3	Merge pull request #3072 from citusdata/drop_rebalancer Drop the rebalancer before creating new UDFs	2019-10-03 18:14:47 +02:00
Marco Slot	0b4b63e647	Drop the rebalancer before creating new UDFs	2019-10-03 16:08:58 +02:00
Marco Slot	fd5c2409a5	Merge pull request #3071 from citusdata/check_command_type Check command type in TryToDelegateFunctionCall	2019-10-03 15:49:02 +02:00
Marco Slot	2e50306cf8	Check command type in TryToDelegateFunctionCall	2019-10-03 15:37:15 +02:00
Jelte Fennema	9833c07070	Improve upgrade test runner - Update certifi in regress pipenv - Use normal test ports for upgrade tests - Make diff behave correctly for upgrade tests	2019-10-03 13:10:11 +02:00
Halil Ozan Akgül	e32592221a	Merge pull request #3060 from citusdata/mx_isolation_dis2ref_foreign_key MX Isolation Tests for Foreign keys from Distribution to Reference tables vs DML	2019-10-03 09:52:02 +03:00
Halil Ozan Akgul	bda8f6f87b	Created tests for distribution to reference table foreign keys on mx	2019-10-03 09:31:13 +03:00
SaitTalhaNisanci	19bdca14d8	Add jobs to run tests with pg 12 (#3033 ) * Add PG12 test outputs * Add jobs to run tests with pg 12 * use POSIX collate for compatibility between pg10/pg11/pg12 * do not override the new default value when running vanilla tests * fix 2 problems with pg12 tests * update pg12 images with pg12 rc1 * remove pg10 jobs * Revert "Add PG12 test outputs" This reverts commit `f3545b92ef`. * change images to use latest instead of dev * add missing coverage flags	2019-10-02 15:33:12 +03:00
Halil Ozan Akgül	3f83b726e0	Merge pull request #3044 from citusdata/mx_ref_isolation_update_delete_upsert MX Isolation Tests for Update, Delete, Upsert on Reference Tables	2019-10-02 10:27:47 +03:00
Halil Ozan Akgul	e5906bead2	Created isolation tests for update, delete, upsert on reference tables with MX.	2019-10-02 10:11:21 +03:00
Hanefi Onaldi	bd416ef68f	Fix empty FROM clauses in PG12	2019-10-01 19:54:11 +00:00
Halil Ozan Akgül	9fa4a20148	Merge pull request #3045 from citusdata/mx_ref_isolation_select_for_update MX Isolation Tests for SELECT FOR UPDATE on Reference Tables	2019-10-01 16:48:56 +03:00
Halil Ozan Akgul	1d7030a651	Created isolation tests for select for update on reference tables with MX.	2019-10-01 16:29:15 +03:00
Jelte Fennema	ec4a165eec	Improve isolation test block detection (#3055 )	2019-10-01 14:10:15 +02:00
Jelte Fennema	40f785e6d8	Move citus_isolation_test_session_is_blocked to separate udf sql file	2019-10-01 14:10:15 +02:00
Philip Dubé	89d35e9692	Attempt to force custom plans for prepared statements when trying to delegate function calls We discern between PARAM_EXEC & PARAM_EXTERN: `d52eaa0948/src/include/nodes/primnodes.h (L211)` According to primnodes.h we should only run into PARAM_EXEC or PARAM_EXTERN	2019-09-30 23:49:14 +00:00
Philip Dubé	29f1ea079b	PG_VERSION_NUM > 110000 should be PG_VERSION_NUM >= 110000 Also fix a > 12000 typo	2019-09-30 23:37:43 +00:00
Hadi Moshayedi	1989ff85b5	Merge pull request #3059 from citusdata/cte_function_calls Don't push down queries when in subqueries/ctes	2019-09-30 15:56:24 -07:00
Hadi Moshayedi	5e97e5c98e	Don't push down queries when in subqueries/ctes	2019-09-30 14:22:05 -07:00
Marco Slot	2d1639ad9d	Merge pull request #3052 from citusdata/fix/internal_connection_caching Avoid caching connections from backends that service internal connections	2019-09-30 17:07:03 +02:00
Nils Dijk	01b26cf91a	Disallow distributed functions for functions depending on an extension (#3049 ) DESCRIPTION: Disallow distributed functions for functions depending on an extension Functions depending on an extension cannot (yet) be distributed by citus. If we would allow this it would cause issues with our dependency following mechanism as we stop following objects depending on an extension. By not allowing functions to be distributed when they depend on an extension as well as not allowing to make distributed functions depend on an extension we won't break the ability to add new nodes. Allowing functions depending on extensions to be distributed at the moment could cause problems in that area.	2019-09-30 15:19:47 +02:00
Jelte Fennema	4d991f281c	Update CircleCI codecov orb (#3050 ) Hopefully this fixes the long runtime for coverage uploads that we sometimes have (> 1 minute).	2019-09-30 15:04:49 +02:00
Nils Dijk	473cbc0115	Propagate CREATE OR REPLACE FUNCTION to workers for distributed functions (#3043 ) DESCRIPTION: Propagate CREATE OR REPLACE FUNCTION Distributed functions could be replaced, which should be propagated to the workers to keep the function in sync between all nodes. Due to the complexity of deparsing the `CreateFunctionStmt` we actually produce the plan during the processing phase of our utilityhook. Since the changes have already been made in the catalog tables we can reuse `pg_get_functiondef` to get us the generated `CREATE OR REPLACE` sql.	2019-09-30 12:41:17 +02:00
Jelte Fennema	82ec918b29	Add explain summary support (#3046 ) Fixes #2922 and also adds explain analyze regression tests	2019-09-30 10:58:49 +02:00
Marco Slot	35bef0f3db	Avoid caching connections from backends that service internal connections	2019-09-28 08:32:10 +02:00
Nils Dijk	9c2c50d875	Hookup function/procedure deparsing to our utility hook (#3041 ) DESCRIPTION: Propagate ALTER FUNCTION statements for distributed functions Using the implemented deparser for function statements to propagate changes to both functions and procedures that are previously distributed.	2019-09-27 22:06:49 +02:00
Philip Dubé	363409a0c2	Propagate REINDEX TABLE & REINDEX INDEX	2019-09-27 18:14:53 +00:00
Hanefi Onaldi	66b9f2e887	Deparsing and qualifying for FUNCTION/PROCEDURE statements (#3014 ) This PR aims to add all the necessary logic to qualify and deparse all possible `{ALTER\|DROP} .. {FUNCTION\|PROCEDURE}` queries. As Procedures are introduced in PG11, the code contains many PG version checks. I tried my best to make it easy to clean up once we drop PG10 support. Here are some caveats: - I assumed that the parse tree is a valid one. There are some queries that are not allowed, but are still parsed successfully by postgres. Such queries will result in errors at execution time. (e.g. `ALTER PROCEDURE p STRICT` -> `STRICT` action is valid for functions but not procedures. Postgres decides to parse them nevertheless.)	2019-09-27 19:02:52 +02:00
Hadi Moshayedi	28acab9d02	Merge pull request #3021 from citusdata/distribute-select Distribute select function	2019-09-27 09:35:24 -07:00
Marco Slot	2868e02a3d	Implement SELECT function call delegation. When a function is marked as colocated with a distributed table, we try delegating queries of kind "SELECT func(...)" to workers. We currently only support this simple form, and don't delegate forms like "SELECT f1(...), f2(...)", "SELECT f1(...) FROM ...", or function calls inside transactions. As a side effect, we also fix the transactional semantics of DO blocks. Previously we didn't consider a DO block a multi-statement transaction. Now we do. Co-authored-by: Marco Slot <marco@citusdata.com> Co-authored-by: serprex <serprex@users.noreply.github.com> Co-authored-by: pykello <hadi.moshayedi@microsoft.com>	2019-09-27 09:13:25 -07:00
Jelte Fennema	dab16be283	Set default threshold on get_rebalance_table_shards_plan to 0, like rebalance_table_shards (#3039 ) In this PR the default `threshold` of `rebalance_table_shards` was set to 0: https://github.com/citusdata/shard_rebalancer/pull/73 However, the default for get_rebalance_table_shards_plan was not updated. This can cause the confusing situation where the actual steps run by `rebalance_table_shards` are not the same as the ones returned by `get_rebalance_table_shards_plan`.	2019-09-27 17:21:36 +02:00
Jelte Fennema	6adf64efdb	Hotfix for circleci problem where directory is owned by root (#3048 )	2019-09-27 17:07:30 +02:00
Halil Ozan Akgül	bbe3ec0493	Merge pull request #3015 from citusdata/mx_isolation_test_insert_select MX Isolation tests for Insert Select	2019-09-26 18:15:22 +03:00
Halil Ozan Akgul	824a69587c	Created isolation tests for insert select on MX	2019-09-26 17:40:36 +03:00
Marco Slot	32a11bdf6c	Return early for common commands in the utility hook (#3031 ) We started copying parse trees by default further on in `multi_ProcessUtility`. That's not a problem for maintenance command, but might register for things like `PREPARE` and `EXECUTE`, which might happen thousands of times per second. Add a few common commands to the check at the start.	2019-09-26 11:43:35 +02:00
Halil Ozan Akgül	3e465a6449	Merge pull request #3028 from citusdata/mx_isolation_test_drop_alter_index_select_for_update MX Isolation Tests for Drop, Alter, Index and Select For Update	2019-09-26 10:56:08 +03:00
Halil Ozan Akgul	d56ab6274c	Created isolation tests for drop, alter, index and select for update on MX.	2019-09-26 10:47:14 +03:00
SaitTalhaNisanci	e3dcc9504f	Update all docker images with pg 11.5 (#3012 )	2019-09-26 10:44:18 +03:00
SaitTalhaNisanci	24a56d2257	Add postgres upgrade job (#2973 )	2019-09-25 17:31:19 +03:00
Halil Ozan Akgül	cc5d68577a	Merge pull request #3025 from citusdata/mx_isolation_test_truncate MX Isolation Tests for Truncate	2019-09-25 17:14:42 +03:00
Halil Ozan Akgul	d426fb2159	Created isolation tests for truncate on MX.	2019-09-25 16:51:20 +03:00
Halil Ozan Akgül	198535b752	Merge pull request #3010 from citusdata/mx_isolation_test_copy MX Isolation Tests for Copy	2019-09-25 15:54:45 +03:00
Halil Ozan Akgul	62b6852923	Created isolation tests for copy on MX.	2019-09-25 15:36:05 +03:00
Önder Kalacı	cefedf5b00	Merge pull request #3026 from citusdata/improve_test Improve some tests around local execution and CTE inlining on pg 12	2019-09-25 11:07:28 +02:00
Onder Kalaci	219f3676a0	Improve some tests around local execution and CTE inlining on pg 12	2019-09-25 10:53:19 +02:00
Philip Dubé	4f60e3a149	Feedback	2019-09-24 17:31:09 +00:00
Marco Slot	c1e43b25da	Use the new create_distributed_function API in some call tests	2019-09-24 17:31:09 +00:00
Marco Slot	ca478defeb	Deparse CALL statement instead of using original query string	2019-09-24 17:31:09 +00:00
Philip Dubé	90e1f1442a	Annotated tests for multi_mx_call. Co-authored-by: pykello <hadi.moshayedi@microsoft.com>	2019-09-24 17:31:09 +00:00
Marco Slot	e269d990c9	Cast the distribution argument value when possible	2019-09-24 17:31:09 +00:00
Philip Dubé	c95d46b4f3	Extend multi_mx_call with some of Hadi's suggestions for better test coverage	2019-09-24 17:31:09 +00:00
Philip Dubé	432a8ef85b	Hadi's feedback Co-authored-by: pykello <hadi.moshayedi@microsoft.com> Co-authored-by: serprex <serprex@users.noreply.github.com>	2019-09-24 17:31:09 +00:00
Philip Dubé	16b8d17aba	Test: multi_mx_call	2019-09-24 17:31:09 +00:00
Philip Dubé	bc1ad67eb5	Distribute CALL on distributed procedures to metadata workers Lots taken from https://github.com/citusdata/citus/pull/2829	2019-09-24 17:31:09 +00:00
Önder Kalacı	932a407f07	Merge pull request #3029 from citusdata/relax_colocation_checks Relax the colocation checks for distributed functions	2019-09-24 16:37:36 +02:00
Onder Kalaci	18de78f386	Relax the colocation checks for distributed functions As long as the types can be coerced, it is safe to pushdown functions.	2019-09-24 16:31:08 +02:00
Jelte Fennema	7172c7f727	Add editorconfig settings for yaml files (#3027 )	2019-09-24 16:09:20 +02:00
Jelte Fennema	0f90c2497e	Use synchronous replication for follower tests	2019-09-24 15:51:49 +02:00
Jelte Fennema	78ccc323d1	Remove stuff needed only for PG 9.6 from test runner	2019-09-24 15:51:49 +02:00
Jelte Fennema	bd2103e597	Remove flappy test	2019-09-24 14:15:33 +02:00
Jelte Fennema	897ec1bdeb	Revert "Temporarily disable flappy test" This reverts commit `4b4459ee62`.	2019-09-24 14:15:33 +02:00
Marco Slot	4acca9b9fe	Merge pull request #3016 from citusdata/fix/swap_sequences Swap pg_dist_node groupid and nodeid sequences	2019-09-24 12:46:16 +02:00
Marco Slot	42be8afd74	Swap pg_dist_node groupid and nodeid sequences	2019-09-24 12:03:44 +02:00
Marco Slot	0dea485c68	Fix misspelling in multi_colocation_utils	2019-09-24 11:27:30 +02:00
Marco Slot	4b4459ee62	Temporarily disable flappy test	2019-09-24 11:02:34 +02:00
Hadi Moshayedi	e293230712	Merge pull request #3020 from citusdata/fix-pg12 Fix pg12	2019-09-23 15:18:43 -07:00
Hadi Moshayedi	48078a30e6	Fix wait_until_metadata_sync() for postgres 12. Postgres 12 now has an assertion that the calls to WaitLatchOrSocket handle postmaster death.	2019-09-23 14:15:35 -07:00
Philip Dubé	06faba91c0	Include ifdefs for pg12 API changes, update local_shard_execution test to avoid CTE inlining	2019-09-23 20:22:35 +00:00
Önder Kalacı	ec9fee1c92	Merge pull request #3005 from citusdata/sync_metadata_to_node Sync metadata to worker nodes after create_distributed_function	2019-09-23 19:01:38 +02:00
Onder Kalaci	d37745bfc7	Sync metadata to worker nodes after create_distributed_function Since distributed functions are useful when the workers have metadata, we automatically sync it. We also do so after master_add_node(). We do it lazily and let the daemon sync it. That's mainly because the metadata syncing cannot be done in transaction blocks, and we don't want to add lots of transactional limitations to master_add_node() and create_distributed_function().	2019-09-23 18:30:53 +02:00
Marco Slot	59fe461d4a	Merge pull request #3009 from citusdata/small_serial Support serial and smallserial when syncing metadata	2019-09-23 17:53:40 +02:00
Marco Slot	b749d4fb65	Merge pull request #3008 from citusdata/fix/select_for_update Fix assert failure in bare SELECT FROM reference table FOR UPDATE in MX	2019-09-23 17:41:04 +02:00
Marco Slot	5f23b951c7	Support serial and smallserial when syncing metadata	2019-09-23 17:39:21 +02:00
Marco Slot	e58d76c5f6	Fix assert failure in bare SELECT FROM reference table FOR UPDATE in MX	2019-09-23 17:00:09 +02:00
SaitTalhaNisanci	71e7047e65	Enhance pg upgrade tests (#3002 ) * Enhance pg upgrade tests * Add a specific upgrade test for pg_dist_partition We store the index of distribution column, and when a column with an index that is smaller than distribution column index is dropped before an upgrade, the index should still match the distribution column after an upgrade	2019-09-23 17:37:14 +03:00
SaitTalhaNisanci	7bf04a999c	Refactor circleci config for better readability (#3013 )	2019-09-23 16:21:21 +03:00
Marco Slot	9474bee98b	Merge pull request #3006 from citusdata/router_row_types Support anonymous composite types on the target list in router queries	2019-09-23 15:02:19 +02:00
Marco Slot	d85d77634d	Handle anonymous composite types on the target list	2019-09-23 14:53:02 +02:00
Önder Kalacı	03fa3628f1	Merge pull request #2989 from citusdata/mx_isolation_test_update_delete_upsert MX Isolation Tests for Update, Delete and Upsert	2019-09-23 14:43:21 +02:00
Halil Ozan Akgul	b55b275a30	Created isolation tests for update, delete and upsert on MX	2019-09-23 14:13:29 +03:00
Önder Kalacı	900f5a61fc	Merge pull request #2990 from citusdata/add_function_arguments Add arguments to `create_distributed_function()`	2019-09-23 08:35:32 +02:00
Onder Kalaci	d7e2968120	Add parameters to create_distributed_function() With this commit, we're changing the API for create_distributed_function() such that users can provide the distribution argument and the colocation information.	2019-09-22 21:53:33 +02:00
Önder Kalacı	ff100b2720	Merge pull request #3001 from citusdata/fix_add_node_func Make sure that functions are also listed in `SupportedDependencyByCitus`	2019-09-20 11:09:16 +02:00
Onder Kalaci	e1fe8d60b4	Make sure that functions are also listed in SupportedDependencyByCitus We've recently merged two commits, `db5d03931d` and `eccba1d4c3`, which operate on very similar places. It turns out that we have an integration issue, where master_add_node() fails to replicate the functions to a newly added node.	2019-09-20 11:02:50 +02:00
Hadi Moshayedi	4875c3c81c	Merge pull request #2997 from citusdata/fix_master_update_node Set current snapshot in maintenance daemon.	2019-09-19 09:50:50 -07:00
Hadi Moshayedi	d24cefd055	Set active snapshot before SyncMetadataToNodes().	2019-09-19 09:00:25 -07:00
Philip Dubé	46866066cf	Merge pull request #3000 from citusdata/fix/disable_object_propagation-test-pg12 fix disable_object_propagation test for pg12	2019-09-19 15:55:06 +00:00
Nils Dijk	72015faeb2	fix disable_object_propagation test for pg12	2019-09-19 17:40:24 +02:00
Hanefi Onaldi	eccba1d4c3	Create previously distributed functions in new workers (#2985 ) Add distributed func creation queries in dependency replication logic	2019-09-18 20:23:33 +03:00
Hanefi Onaldi	ed11b9590c	Add distributed func creation queries in dependency replication logic	2019-09-18 20:07:45 +03:00
Hadi Moshayedi	09d4efadcf	Merge pull request #2928 from citusdata/master_update_node Propagate metadata for master_update_node	2019-09-18 09:40:28 -07:00
Hadi Moshayedi	d2f2acc4b2	Make master_update_node citus-ha friendly.	2019-09-18 09:32:54 -07:00
Hadi Moshayedi	76f3933b05	Add metadatasynced, and sync on master_update_node() Co-authored-by: pykello <hadi.moshayedi@microsoft.com> Co-authored-by: serprex <serprex@users.noreply.github.com>	2019-09-18 09:32:54 -07:00
Nils Dijk	db5d03931d	Feature disable object propagation (#2986 ) DESCRIPTION: Provide a GUC to turn off the new dependency propagation functionality In case the dependency propagation functionality introduced in 9.0 causes issues for a user's cluster, they can turn it off almost completely. The only dependency that will still be propagated and kept track of is the schema, to emulate the old behaviour. The GUC to change is `citus.enable_object_propagation`. When set to `false` the functionality will be mostly turned off. Be aware that objects marked as distributed in `pg_dist_object` will still be kept in the catalog as distributed objects. Alter statements to these objects will not be propagated to workers and may cause desynchronisation.	2019-09-18 17:16:22 +02:00
SaitTalhaNisanci	c9ec98852e	Ignore .vscode (#2969 )	2019-09-18 17:08:22 +03:00
Philip Dubé	1c7e009de3	Merge pull request #2987 from citusdata/dont-fatal pg12 doesn't support client_min_messages as 'fatal'	2019-09-17 20:49:23 +00:00
Philip Dubé	ac14f1dd49	pg12 doesn't support client_min_messages as 'fatal'	2019-09-17 20:37:06 +00:00
Nils Dijk	2b7f5552c8	Fix: rename remote type on conflict (#2983 ) DESCRIPTION: Rename remote types during type propagation To prevent data from being destroyed when a remote type differs from the type on the coordinator during type propagation, we wanted to rename the type instead of using `DROP CASCADE`. This patch removes the `DROP` logic and adds the creation of a rename statement to a free name.	2019-09-17 18:54:10 +02:00
Nils Dijk	0a3152d09c	Add feature flag to turn off create type propagation (#2982 ) DESCRIPTION: Add feature flag to turn off create type propagation When `citus.enable_create_type_propagation` is set to `false` citus will not propagate `CREATE TYPE` statements to the workers. Types are still distributed when tables that depend on these types are distributed.	2019-09-17 15:50:06 +02:00
Önder Kalacı	47d703c911	Merge pull request #2981 from citusdata/mx_isolation_test_select MX Isolation Tests for Select	2019-09-17 15:11:17 +02:00
Halil Ozan Akgul	5333296a54	Created isolation tests for select on MX	2019-09-17 12:44:45 +03:00
Hadi Moshayedi	c0d736ce91	Merge pull request #2980 from citusdata/fix_2979 Merge two conflicting pg_dist_object headers	2019-09-16 12:44:36 -07:00
Philip Dubé	964020097d	Merge two conflicting pg_dist_object headers	2019-09-16 19:19:21 +00:00
Philip Dubé	72dd439ca7	Merge pull request #2979 from citusdata/function_args Add columns to pg_dist_object for distributed functions	2019-09-16 15:44:48 +00:00
Onder Kalaci	cde6b02858	Add columns to pg_dist_object for distributed functions This PR simply adds the columns to pg_dist_object and implements the necessary metadata changes to keep track of distribution argument of the functions/procedures.	2019-09-16 17:28:04 +02:00
Jelte Fennema	af9fb9f785	Fix depend arguments for OSX clang cpp (#2978 ) A better fix for #2975. Apparently for OSX cpp -MF and -MT shouldn't have a space in between the flag and their value. Without the space it still works for gcc as well.	2019-09-16 15:22:07 +02:00
Halil Ozan Akgül	301febbd2c	Merge pull request #2967 from citusdata/mx_isolation_test_insert MX isolation test insert	2019-09-16 15:56:09 +03:00
Halil Ozan Akgul	7cde785031	Added the MX isolation tests for insert	2019-09-16 15:49:43 +03:00
Jelte Fennema	31fac3b90e	Don't generate SQL files twice by not making directories a target (#2977 )	2019-09-16 12:53:17 +02:00
Önder Kalacı	13947a63ce	Don't use flags that mac clang doesn't support as it does on other platforms (#2975 )	2019-09-16 11:44:06 +02:00
Hanefi Onaldi	8f2a3a0604	Introduce create_distributed_function(regproc) UDF (#2961 ) This PR aims to add the minimal set of changes required to start distributing functions. You can use create_distributed_function(regproc) UDF to distribute a function. SELECT create_distributed_function('add(int,int)'); The function definition should include the param types to properly identify the correct function that we wish to distribute	2019-09-13 23:27:46 +03:00
Philip Dubé	012595da11	Merge pull request #2927 from citusdata/fix_2909 ActivePrimaryNodeList: Lock DistNodeRelationId()	2019-09-13 18:22:23 +00:00
Philip Dubé	fb10edcb9d	isolation_add_node_vs_reference_table_operations: test add in parallel with create_reference_table	2019-09-13 18:13:58 +00:00
Philip Dubé	492d1b2cba	ActivePrimaryNodeList: add lockMode parameter	2019-09-13 17:44:56 +00:00
Philip Dubé	482f3b1474	Merge pull request #2971 from citusdata/fix-pg12 Fix pg12 compile	2019-09-13 17:34:57 +00:00
Philip Dubé	5e5f4628a0	Fix pg12 compile	2019-09-13 17:25:30 +00:00
Jelte Fennema	4bbf65d913	Change SQL migration build process for easier reviews (#2951 ) @thanodnl told me it was a bit of a problem that it's impossible to see the history of a UDF in git. The only way to do so is by reading all the sql migration files from new to old. Another problem is that it's also hard to review the changed UDF during code review, because to find out what changed you have to do the same. I thought of a IMHO better (but not perfect) way to handle this. We keep the definition of a UDF in sql/udfs/{name_of_udf}/latest.sql. That file we change whenever we need to make a change to the UDF. On top of that you also make a snapshot of the file in sql/udfs/{name_of_udf}/{migration-version}.sql (e.g. 9.0-1.sql) by copying the contents. This way you can easily view what the actual changes were by looking at the latest.sql file. There's still the question of how to use these files then. Sadly postgres doesn't allow inclusion of other sql files in the migration sql file (it does in psql using \i). So instead I used the C preprocessor + make to compile a sql/xxx.sql to a build/sql/xxx.sql file. This final build/sql/xxx.sql file has every occurrence of #include "somefile.sql" in sql/xxx.sql replaced by the contents of somefile.sql.	2019-09-13 18:44:27 +02:00
Nils Dijk	2879689441	Distribute Types to worker nodes (#2893 ) DESCRIPTION: Distribute Types to worker nodes When to propagate ============== There are two logical moments that types could be distributed to the worker nodes - When they get used ( just in time distribution ) - When they get created ( proactive distribution ) The just in time distribution follows the model used for schemas, which get created right before we are going to create a table in that schema; for types this would be when the table uses a type as its column. The proactive distribution is suitable for situations where it is beneficial to have the type on the worker nodes directly. They can later on be used in queries where an intermediate result gets created with a cast to this type. Just in time creation is always the last resort, you cannot create a distributed table before the type gets created. A good example use case is: you have an existing postgres server that needs to scale out. You add the citus extension, add some nodes to the cluster, and distribute the table. The type got created before citus existed. There was no moment where citus could have propagated the creation of a type. Proactive is almost always a good option. Types are not resource intensive objects, there is no performance overhead of having 100's of types. If you want to use them in a query to represent an intermediate result (which happens in our test suite) they just work. There is however a moment when proactive type distribution is not beneficial; in transactions where the type is used in a distributed table. Let's assume the following transaction: ```sql BEGIN; CREATE TYPE tt1 AS (a int, b int); CREATE TABLE t1 (a int PRIMARY KEY, b tt1); SELECT create_distributed_table('t1', 'a'); \copy t1 FROM bigdata.csv ``` Types are node scoped objects; meaning the type exists once per worker. Shards however have best performance when they are created over their own connection. 
For the type to be visible on all connections it needs to be created and committed before we try to create the shards. Here the just in time situation is most beneficial and follows how we create schemas on the workers. Outside of a transaction block we will just use 1 connection to propagate the creation. How propagation works ================= Just in time ----------- Just in time propagation hooks into the infrastructure introduced in #2882. It adds types as a supported object in `SupportedDependencyByCitus`. This will make sure that any object being distributed by citus that depends on types will now cascade into types. When types themselves depend on other objects, those objects will get created first. Creation later works by getting the ddl commands to create the object by its `ObjectAddress` in `GetDependencyCreateDDLCommands` which will dispatch types to `CreateTypeDDLCommandsIdempotent`. For the correct walking of the graph we follow array types; when later asked for the ddl commands for array types we return `NIL` (empty list), which means the object will not be recorded as distributed (it's an internal type, dependent on the user type). Proactive distribution --------------------- When the user creates a type (composite or enum) we will have a hook running in `multi_ProcessUtility` after the command has been applied locally. Running after the command has been applied locally means we already have an `ObjectAddress` for the type. This is required to mark the type as being distributed. Keeping the type up to date ==================== For types that are recorded in `pg_dist_object` (e.g. `IsObjectDistributed` returns true for the `ObjectAddress`) we will intercept the utility commands that alter the type. - `AlterTableStmt` with `relkind` set to `OBJECT_TYPE` encapsulates changes to the fields of a composite type. - `DropStmt` with removeType set to `OBJECT_TYPE` encapsulates `DROP TYPE`. - `AlterEnumStmt` encapsulates changes to enum values. 
Enum types cannot be changed transactionally. When the execution on a worker fails, a warning will be shown to the user that the propagation was incomplete due to worker communication failure. An idempotent command is shown for the user to re-execute when the worker communication is fixed. Keeping types up to date is done via the executor. Before the statement is executed locally we create a plan on how to apply it on the workers. This plan is executed after we have applied the statement locally. All changes to types need to be done in the same transaction for types that have already been distributed and will fail with an error if parallel queries have already been executed in the same transaction. Much like foreign keys to reference tables.	2019-09-13 17:46:07 +02:00
Önder Kalacı	6e4fbeb8b9	Merge pull request #2962 from citusdata/fix-2958 Correctly add schema when distributing sequence definitions	2019-09-13 17:26:51 +02:00
Jelte Fennema	e4cfea3751	Correctly add schema when distributing sequence definitions Fixes 2958	2019-09-13 17:19:35 +02:00
Jelte Fennema	579a40dfa5	Add make check-base-mx	2019-09-13 17:19:35 +02:00
Jelte Fennema	389086102a	Refactor 9 argument function to use a struct (#2952 ) For another PR I needed to add another column which would require adding another argument to an already 9 argument function signature. In this case it would be a boolean flag and there were already two boolean flags in there. In my experience it becomes really easy to mess up the order of these flags at that point. Especially because the type system doesn't distinguish between the 3 different booleans with completely different meanings. So I refactored these signatures to receive a struct containing most of these arguments. That way you don't mess up the ordering, because the meaning of the boolean is not order dependent but fieldname dependent. It also makes it possible to set good shared defaults for this struct.	2019-09-13 15:49:53 +02:00
Önder Kalacı	48b7fbb9e5	Merge pull request #2968 from citusdata/insert_isolation_duplicate_test Changed the duplicate test into missing test	2019-09-13 15:31:54 +02:00
Halil Ozan Akgul	4d34b79b87	There were two multi insert - single insert tests but no multi insert - multi insert test. Fixed it.	2019-09-13 16:09:11 +03:00
Nils Dijk	05f0668cdc	Fix: schema leak onto create index statement cache (#2964 ) DESCRIPTION: Fix schema leak on CREATE INDEX statement When a CREATE INDEX is cached between executions we might leak the schema name onto the cached statement of an earlier execution, preventing the right index from being created. Even though the cache is cleared when the search_path changes we can trigger this behaviour by having the schema already on the search path before a colliding table is created in a schema earlier on the `search_path`. When calling an unqualified create index via a function (used to trigger the caching behaviour) we see that the index is created on the wrong table after the schema leaked onto the statement. By copying the complete `PlannedStmt` and `utilityStmt` during our planning phase for distributed ddls we make sure we are not leaking the schema name onto a cached data structure. Caveat: COPY statements already have a lot of parse tree copying ongoing without directly putting it back on the `pstmt`. We should verify that copies modify the statement and potentially copy the complete `pstmt` there already.	2019-09-13 14:04:23 +02:00
Hadi Moshayedi	1f84056b83	Merge pull request #2963 from citusdata/update_udfs Return nodeid instead of record in some UDFs	2019-09-12 14:54:16 -07:00
Hadi Moshayedi	48ff4691a0	Return nodeid instead of record in some UDFs	2019-09-12 12:46:21 -07:00
Philip Dubé	d23185d077	Merge pull request #2957 from citusdata/dont-distribute-aggregate-named-invalid Begin searching AggregateNames from 1, not 0	2019-09-12 17:02:06 +00:00
Philip Dubé	ae1171a373	Test invalid aggregate	2019-09-12 16:55:05 +00:00
Philip Dubé	2aa6852dea	Begin searching AggregateNames from 1, not 0	2019-09-12 16:55:05 +00:00
Jelte Fennema	d6deb062aa	Add shard rebalancer stubs	2019-09-12 16:40:25 +02:00
Jelte Fennema	58012054c9	Add an extra advisory lock tag class	2019-09-12 16:40:25 +02:00
Jelte Fennema	eb7e45d556	Make LookupNodeForGroup extern	2019-09-12 16:40:25 +02:00
Jelte Fennema	257406fda7	Fix ArrayObjectCount for zero sized arrays	2019-09-12 16:40:25 +02:00
Jelte Fennema	de5174f763	include postgres.h into some of our .h files to silence warnings	2019-09-12 16:40:25 +02:00
Jelte Fennema	ea2e010d42	Better editorconfig	2019-09-12 16:40:25 +02:00
Jelte Fennema	4ebdf5989b	Add check-minimal to test Makefile	2019-09-12 16:40:25 +02:00
Önder Kalacı	07cca85227	Merge pull request #2938 from citusdata/local_execution_2 Introduce the concept of Local Execution	2019-09-12 12:18:43 +02:00
Onder Kalaci	0b0c779c77	Introduce the concept of Local Execution /* * local_executor.c * * The scope of the local execution is locally executing the queries on the * shards. In other words, local execution does not deal with any local tables * that are not shards on the node that the query is being executed. In that sense, * the local executor is only triggered if the node has both the metadata and the * shards (e.g., only Citus MX worker nodes). * * The goal of the local execution is to skip the unnecessary network round-trip * happening on the node itself. Instead, identify the locally executable tasks and * simply call PostgreSQL's planner and executor. * * The local executor is an extension of the adaptive executor. So, the executor uses * adaptive executor's custom scan nodes. * * One thing to note is that Citus MX is only supported with replication factor = 1, so * keep that in mind while continuing the comments below. * * On the high level, there are 3 slightly different ways of utilizing local execution: * * (1) Execution of local single shard queries of a distributed table * * This is the simplest case. The executor kicks in at the start of the adaptive * executor, and since the query is only a single task the execution finishes * without going to the network at all. * * Even if there is a transaction block (or recursively planned CTEs), as long * as the queries hit the shards on the same node, the local execution will kick in. * * (2) Execution of local single queries and remote multi-shard queries * * The rule is simple. If a transaction block starts with a local query execution, * all the other queries in the same transaction block that touch any local shard * have to use the local execution. Although this sounds restrictive, we prefer to * implement it this way, otherwise we'd end up with scenarios as complex as those we * have in the connection management due to foreign keys. 
* * See the following example: * BEGIN; * -- assume that the query is executed locally * SELECT count(*) FROM test WHERE key = 1; * -- at this point, all the shards that reside on the * -- node are executed locally one-by-one. After those finish * -- the remaining tasks are handled by adaptive executor * SELECT count(*) FROM test; * * (3) Modifications of reference tables * * Modifications to reference tables have to be executed on all nodes. So, after the * local execution, the adaptive executor keeps continuing the execution on the other * nodes. * * Note that for read-only queries, after the local execution, there is no need to * kick in adaptive executor. * * There are also a few limitations/trade-offs that are worth mentioning. First, the * local execution on multiple shards might be slow because the execution has to * happen one task at a time (e.g., no parallelism). Second, if a transaction * block/CTE starts with a multi-shard command, we do not use local query execution * since local execution is sequential. Basically, we do not want to lose parallelism * across local tasks by switching to local execution. Third, the local execution * currently only supports queries. In other words, any utility commands like TRUNCATE * fail if the command is executed after a local execution inside a transaction block. * Fourth, the local execution cannot be mixed with the executors other than adaptive, * namely task-tracker, real-time and router executors. Finally, related to the * previous item, COPY command cannot be mixed with local execution in a transaction. * The implication is that any part of INSERT..SELECT via coordinator cannot happen * via the local execution. */	2019-09-12 11:51:25 +02:00
Marco Slot	d69be38932	Merge pull request #2933 from citusdata/drop_poolinfo_fk Drop foreign key from pg_dist_poolinfo to pg_dist_node	2019-09-12 11:50:05 +02:00
SaitTalhaNisanci	e132d579f2	Change --new-bindir flag description to be consistent (#2950 )	2019-09-11 15:36:39 +03:00
SaitTalhaNisanci	0f170cb75f	Use variables instead of hardcoded tmp dirs (#2944 )	2019-09-11 13:25:18 +03:00
Jelte Fennema	c591a135f1	Update ubuntu dependencies in CONTRIBUTING (#2941 )	2019-09-11 09:49:43 +02:00
Önder Kalacı	dd4e767702	Merge pull request #2942 from citusdata/fix_adaptive_bug Make sure that lost connections are handled properly in adaptive executor	2019-09-10 18:01:17 +02:00
Onder Kalaci	485189c0b6	Make sure that lost connections are handled properly Before this patch, when a connection is lost, we'd have the following situation: - Pop a task execution from readyQueue - Lost connection - Fail the session/pool. -> This step was not acting properly because we've popped the task, but not set to session->currentTask yet After the patch: - Pop a task execution from readyQueue - Immediately set it to session->currentTask - Lost connection - Fail the session/pool. -> At this step, failing the session would trigger query failures (or failovers) properly.	2019-09-10 17:54:27 +02:00
SaitTalhaNisanci	d99deab7d9	Add upgrade postgres version test (#2940 ) * Add creating a citus cluster script Creating a citus cluster is automated. Before running this script: - Citus should be installed and its control file should be added to postgres. (make install) - Postgres should be installed. * Initialize upgrade test table and fill * Finalize the layout of upgrade tests Postgres upgrade function is added. The newly added UDFs(citus_prepare_pg_upgrade, citus_finish_pg_upgrade) are used to perform upgrade. * Refactor upgrade test and add config file * Add schedules for upgrade testing * Use pg_regress for upgrade tests pg_regress is used for creating a simple distributed table in upgrade tests. After upgrading another schedule is used to verify that the distributed table exists. Router and realtime queries are used for verifying. * Run upgrade tests as a postgres user in a temp dir postgres user is used for psql to be consistent at running tests. A temp dir is created and the temp dir's permissions are changed so that postgres user can access it. All psql commands are now run with postgres user. "Select * from t" query is changed as "Select * from t order by a" so that the result is always in the same order. * Add docopt and arguments for the upgrade script Docopt dependency is added to parse flags in script. Some refactoring in variable names is done. * Add readme for upgrade tests * Refactor upgrade tests Use relative data path instead of absolute assuming that this script will always be run from 'src/test/regress' Remove 'citus-path' flag Use specific version for docopt instead of * Use named args in string formatting * Resolve a security problem Instead of using string formatting in subprocess.call, arguments list is used. Otherwise users could do shell injection. Shell = True is removed from subprocess call as it is not recommended to use this. 
* Add how the test works to readme * Refactor some variables to be consistent * Update upgrade script based on the reviews It was possible that postgres server would stay running even when the script crashes, atexit library is used to ensure that we always do a teardown where we stop the databases. Some formatting is done in the code for better readability. Config class is used instead of a dictionary. A target for upgrade test is added to makefile. Unused flags/functions/variables are removed. * Format commands and remove unnecessary flag from readme	2019-09-10 17:56:04 +03:00
Marco Slot	810aca8d41	Drop foreign key from pg_dist_poolinfo to pg_dist_node	2019-09-10 09:52:19 +02:00
Philip Dubé	b4a1a0fb80	Merge pull request #2911 from citusdata/test_merge_files_and_query_more Extend tests from release testing	2019-09-05 16:57:54 +00:00
Philip Dubé	b301cf628a	Test worker_cleanup_job_schema_cache actually drops schemas	2019-09-05 16:52:24 +00:00
Philip Dubé	8979fd038b	worker_check_invalid_arguments: invalid task/job ids	2019-09-05 16:52:24 +00:00
Philip Dubé	5f9e88b260	multi_multiuser: test that worker_merge_files_and_query doesn't allow privilege escalation	2019-09-05 16:52:24 +00:00
Philip Dubé	60dc42a3ae	Merge pull request #2929 from citusdata/fix_pg12_distobject get_catalog_object_by_oid requires an extra parameter in pg12	2019-09-05 16:46:04 +00:00
Philip Dubé	a28b82d67d	get_catalog_object_by_oid requires an extra parameter in pg12	2019-09-05 16:38:07 +00:00
Nils Dijk	511e715ee3	Remove early escape in walking pg_depend (#2930 ) This is a bug that got in when we inlined the body of a function into this loop. Earlier revisions had two loops, hence a function that would be reused. With a return instead of a continue the list of dependencies being walked is dependent on the order in which we find them in pg_depend. This became apparent during pg12 compatibility. The order of entries in pg12 was luckily different causing a random test to fail due to this return. By changing it to a continue we only skip the entries that we don’t want to follow instead of skipping all entries that happen to be found later. sidefix for more stable isolation tests around ensure dependency	2019-09-05 18:03:34 +02:00
Philip Dubé	f90fb10b5f	Merge pull request #2879 from citusdata/pg12_generatedcolumns Pg12 generated columns	2019-09-04 15:07:18 +00:00
Philip Dubé	bdd30bb181	Don't allow distributing by a generated column	2019-09-04 14:50:17 +00:00
Philip Dubé	41dca121e2	Support GENERATED ALWAYS AS STORED	2019-09-04 14:50:17 +00:00
Nils Dijk	936d546a3c	Refactor Ensure Schema Exists to Ensure Dependencies Exists (#2882 ) DESCRIPTION: Refactor ensure schema exists to dependency exists Historically we only supported schemas as table dependencies to be created on the workers before a table gets distributed. This PR puts infrastructure in place to walk pg_depend to figure out which dependencies to create on the workers. Currently only schemas are supported as objects to create before creating a table. We also keep track of dependencies that have been created in the cluster. When we add a new node to the cluster we use this catalog to know which objects need to be created on the worker. A side effect of knowing which objects are already distributed is that we don't have debug messages anymore when creating schemas that are already created on the workers.	2019-09-04 14:10:20 +02:00
Philip Dubé	bc97523940	Merge pull request #2925 from citusdata/remove_check_for_updates Remove CheckForUpdates	2019-09-03 21:28:17 +00:00
Philip Dubé	28d964240f	Remove CheckForUpdates https://reports.citusdata.com/v1/releases/latest We haven't updated the version CheckForUpdates sees since 7.1.0	2019-09-03 21:11:25 +00:00
Philip Dubé	077f5e26af	Merge pull request #2926 from citusdata/normalize_all_the_tests Normalize all tests	2019-09-03 21:10:40 +00:00
Philip Dubé	4d26829d50	Remove normalized_tests.lst, don't normalize check-vanilla	2019-09-03 17:25:00 +00:00
Philip Dubé	169d2f193f	Merge pull request #2914 from citusdata/propagate_column_collate create_distributed_table: include COLLATE on columns	2019-08-29 14:31:21 +00:00
Philip Dubé	da00c62eea	create_distributed_table: include COLLATE on columns	2019-08-29 14:22:54 +00:00
Philip Dubé	dd57232ba3	Merge pull request #2912 from citusdata/MaxBackends_max_wal_senders Update TotalProcCount to match update in InitializeMaxBackends in pg12	2019-08-29 14:16:38 +00:00
Philip Dubé	32ef459025	backend_data.c: include max_wal_senders in calculating maxBackend, matches changes in pg12's InitializeMaxBackends	2019-08-28 21:24:33 +00:00
Jelte Fennema	cbecf97c84	Move tuplestore setup to a helper function (#2898 ) * Add tuplestore helpers * More detailed error messages in tuplestore * Add CreateTupleDescCopy to SetupTuplestore * Use new SetupTuplestore helper function * Remove unnecessary copy * Remove comment about undefined behaviour	2019-08-27 09:11:08 +02:00
Philip Dubé	b354644c56	Merge pull request #2908 from citusdata/sort_colocatedshardintervallist Sort ColocatedShardIntervalList	2019-08-26 17:53:47 +00:00
Philip Dubé	eba3828ef7	ColocatedShardIntervalList: sort	2019-08-26 17:42:41 +00:00
Philip Dubé	c1587cc00a	Merge pull request #2906 from citusdata/add-rls-SET-LOCAL-GUC-test Test SET LOCAL propagation when GUC is used in RLS policy	2019-08-22 20:36:05 +00:00
Matthias Kurz	fc069dc611	Test SET LOCAL propagation when GUC is used in RLS policy	2019-08-22 20:29:52 +00:00
Philip Dubé	d3be6cd0a6	Merge pull request #2844 from citusdata/postgres12 Postgres 12	2019-08-22 19:36:24 +00:00
Philip Dubé	6b0d8ed83d	SortList in FinalizedShardPlacementList, makes 3 failure tests consistent between 11/12	2019-08-22 19:30:56 +00:00
Philip Dubé	693d4695d7	Create a test 'pg12' for pg12 features & error on unsupported new features Unsupported new features: COPY FROM WHERE, GENERATED ALWAYS AS, non-heap table access methods	2019-08-22 19:30:56 +00:00
Philip Dubé	e84fcc0b12	Modify tests to be consistent between versions Normalize UNION to prevent optimization Remove WITH OIDS Sort ddl events client_min_messages no longer accepts FATAL	2019-08-22 19:30:50 +00:00
Philip Dubé	e5cd298a98	pg12 revised layout of FunctionCallInfoData See `a9c35cf85c` clang raises a warning due to FunctionCall2InfoData technically being variable sized This is fine, as the struct is the size we want it to be. So silence the warning	2019-08-22 19:02:35 +00:00
Philip Dubé	bee779e7d4	planner/distributed_planner.c: get_func_cost replaced with add_function_cost in pg12	2019-08-22 19:02:10 +00:00
Philip Dubé	be3285828f	Collations matter for hashing strings in pg12 See https://www.postgresql.org/docs/12/collation.html#COLLATION-NONDETERMINISTIC	2019-08-22 18:58:37 +00:00
Philip Dubé	fe10ca453d	Implement FileCompat to abstract pg12 requiring API consumer to track file offsets	2019-08-22 18:57:47 +00:00
Philip Dubé	018ad1c58e	pg12: version_compat.h, tuples, oids, misc	2019-08-22 18:57:23 +00:00
Philip Dubé	9643ff580e	Update commands/vacuum.c with pg12 changes Adds support for SKIP_LOCKED, INDEX_CLEANUP, TRUNCATE Removes broken assert	2019-08-22 18:56:54 +00:00
Philip Dubé	68c4b71f93	Fix up includes with pg12 changes	2019-08-22 18:56:21 +00:00
Philip Dubé	fbc3e346e8	ruleutils_12.c Produced this file by copying ruleutils_11.c, then comparing postgres ruleutils.c changes between REL_11_STABLE & REL_12_STABLE	2019-08-22 18:56:05 +00:00
Philip Dubé	b7e2908fc2	configure: don't prevent pg12	2019-08-22 18:55:55 +00:00
Hadi Moshayedi	0b939b0455	Merge pull request #2894 from citusdata/fix_locks_3 Fix distributed deadlock in TRUNCATE	2019-08-22 11:19:27 -07:00
Hadi Moshayedi	6be1bacddd	Fix distributed deadlock for TRUNCATE	2019-08-22 11:03:53 -07:00
Hadi Moshayedi	036b4216a8	Merge pull request #2864 from citusdata/ref2ref_fkey Foreign key between reference tables	2019-08-22 03:22:33 -07:00
Hadi Moshayedi	a5b087c89b	Support FKs between reference tables	2019-08-21 16:11:27 -07:00
Hadi Moshayedi	3de851d3c5	Merge pull request #2904 from citusdata/sort_load_shard_placement_array Sort load_shard_placement_array by worker name/port	2019-08-21 14:44:37 -07:00
Hadi Moshayedi	a3578a6e60	Sort load_shard_placement_array by worker name/port	2019-08-21 14:35:05 -07:00
Philip Dubé	4bbea6e3d8	Merge pull request #2903 from citusdata/fix_assertion_error_in_2900 commands/index.c: Fix assertion typo	2019-08-21 19:57:23 +00:00
Philip Dubé	7bf7e41594	commands/index.c: Fix assertion typo	2019-08-21 18:54:05 +00:00
Philip Dubé	f0a79800d2	Merge pull request #2900 from citusdata/reindex-error Raise an error when REINDEX TABLE or INDEX is invoked on a distributed relation	2019-08-21 17:17:58 +00:00
Philip Dubé	f4b90419ae	Raise an error when REINDEX TABLE or INDEX is invoked on a distributed relation	2019-08-21 17:03:14 +00:00
Philip Dubé	560c9ba4e9	Merge pull request #2897 from citusdata/task_tracker_fix_error_message Task Tracker: fix error being copy pasted from above block	2019-08-21 15:53:55 +00:00
Philip Dubé	db5a7f49a7	Task Tracker: fix error being copy pasted from above block	2019-08-21 15:44:01 +00:00
Philip Dubé	bc7a76d139	Merge pull request #2890 from citusdata/check_shard_interval_search_fail Avoid invalid array accesses to partitionFileArray	2019-08-20 18:07:02 +00:00
Philip Dubé	f62d4a6712	citus_rm_job_directory for multi_query_directory_cleanup	2019-08-19 17:04:42 +00:00
Philip Dubé	9777f22e1e	Avoid invalid array accesses to partitionFileArray	2019-08-19 17:04:42 +00:00
Önder Kalacı	519dc8329b	Merge pull request #2896 from citusdata/single_shard_commit_no_show single_shard_commit_protocol: GUC_NO_SHOW_ALL	2019-08-19 10:20:39 +02:00
Philip Dubé	f4ca02664a	single_shard_commit_protocol: GUC_NO_SHOW_ALL	2019-08-18 12:54:32 +00:00
Hadi Moshayedi	7fccb9d2aa	Merge pull request #2855 from citusdata/fix_locks_2 Add some missing locks.	2019-08-15 12:41:53 -07:00
Hadi Moshayedi	c582eb89c8	Add some missing locks.	2019-08-15 12:34:31 -07:00
Philip Dubé	130e999ac7	Merge pull request #2891 from citusdata/guc_to_disable_2pc_for_single_shard_modify Introduce citus.single_shard_commit_protocol for if users want 1PC on writes to replicas	2019-08-15 19:03:06 +00:00
Philip Dubé	f4e513b3d4	Introduce citus.single_shard_commit_protocol for if users want 1PC on writes to replicas	2019-08-15 18:49:40 +00:00
Philip Dubé	86b2ddc9ae	Merge pull request #2884 from citusdata/avoidduplicatereferencetablecolocationrecords Avoid multiple pg_dist_colocation records being created for reference tables	2019-08-14 15:12:01 +00:00
Philip Dubé	cd951fa9ca	Avoid multiple pg_dist_colocation records being created for reference tables master_deactivate_node is updated to decrement the replication factor Otherwise deactivation could have create_reference_table produce a second record UpdateColocationGroupReplicationFactor is renamed UpdateColocationGroupReplicationFactorForReferenceTables & the implementation looks up the record based on distributioncolumntype == InvalidOid, rather than by id Otherwise the record's replication factor fails to be maintained when there are no reference tables	2019-08-13 17:21:02 +00:00
Nils Dijk	be6b7bec69	Add UDF citus_(prepare\|finish)_pg_upgrade to aid with upgrading citus (#2877 ) DESCRIPTION: Add functions to help with postgres upgrades Currently there is [a list of manual steps](https://docs.citusdata.com/en/v8.2/admin_guide/upgrading_citus.html?highlight=upgrade#upgrading-postgresql-version-from-10-to-11) to perform during a postgres upgrade. These steps guarantee our catalog tables are kept and counter values are maintained across upgrades. Having more than 1 command in our docs for users to manually execute during upgrades is error prone for both the user, and our docs. There are already 2 catalog tables that have been introduced to citus that have not been added to our docs for backing up during upgrades (`pg_authinfo` and `pg_dist_poolinfo`). As we add more functionality to citus we run into situations where there are more steps required either before or after the upgrade. At the same time, when we move catalog tables to a place where the contents will be maintained automatically during upgrades we could have fewer steps in our docs. This will lead to a hard to maintain matrix of citus versions and steps to be performed. Instead we could take ownership of these steps within the extension itself. This PR introduces two new functions for the user to use instead of long lists of error prone instructions to follow. - `citus_prepare_pg_upgrade` This function should be called by the user right before shutting down the cluster. This will ensure all citus catalog tables are backed up in a location where the information will be retained during an upgrade. - `citus_finish_pg_upgrade` This function should be called right after a pg_upgrade of the cluster. This will restore the catalog tables to the state before the upgrade happened. Both functions need to be executed both on the coordinator and on all the workers, in the same fashion our current documentation instructs to do. 
There are two known problems with this function in its current form, which is also a problem with our docs. We should schedule time in the future to improve on this, but having it automated now is better as we are about to add extra steps to take after upgrades. - When you install citus in a clean cluster we do enable ssl for communication between the coordinator and the workers. If an upgrade to a clean cluster is performed we do not set up ssl on the new cluster causing the communication to fail. - There are no automated tests added in this PR to execute an upgrade test during every build. Our current test infrastructure does not allow for 2 versions of postgres to exist in the same environment. We will need to invest time to create a new testing harness that could run the following scenario: 1. Create cluster 2. Run extensible scripts to execute arbitrary statements on this cluster 3. Perform an upgrade by preparing, upgrading and finishing 4. Run extensible scripts to verify all objects created by earlier scripts exist in correct form in the upgraded cluster Given the non trivial amount of work involved for such a suite I'd like to land this before we have automated testing. On a side note; As the reviewer noticed, the tables created in the public namespace are not visible in `psql` with `\d`. The backup catalog tables have the same name as the tables in `pg_catalog`. Due to postgres internals `pg_catalog` is first in the search path and therefore the non-qualified name would always resolve to `pg_catalog.pg_dist_*`. Internally this is called a non-visible table as it would resolve to a different table without a qualified name. Only visible tables are shown with `\d`.	2019-08-13 15:53:10 +02:00
Hadi Moshayedi	6395289456	Merge pull request #2883 from citusdata/minor_cleanup Some cleanup	2019-08-12 15:45:26 -07:00
Hadi Moshayedi	009d8b7401	Some cleanup	2019-08-12 15:38:52 -07:00
Philip Dubé	03ef456c50	Merge pull request #2871 from citusdata/pg12-test-prep Update tests in preparation for pg12	2019-08-09 15:32:19 +00:00
Philip Dubé	5459c01956	multi_partitioning_utils: version_above_ten	2019-08-09 15:25:59 +00:00
Philip Dubé	e0f19fb58c	multi_partitioning_1.out	2019-08-09 15:25:59 +00:00
Philip Dubé	5e835e7565	Fix multi_repair_shards. There's already a group/shardid entry, pg11 gives us back the inserted one, pg12 gives us the preexisting one	2019-08-09 15:25:59 +00:00
Philip Dubé	66ce2d2d2d	Materialize c1 to keep subplan ids in sync	2019-08-09 15:25:59 +00:00
Philip Dubé	9065ef429c	foreign_key_to_reference_table: terse to avoid differing order of drop cascade details	2019-08-09 15:25:59 +00:00
Philip Dubé	0d9e5bde9c	window_functions: 'ORDER BY time' when using lag(time) & coordinator_plan	2019-08-09 15:25:59 +00:00
Philip Dubé	7992077fd9	multi_modifying_xacts: don't differ in output if reference table select tries broken worker first	2019-08-09 15:25:59 +00:00
Philip Dubé	546b71ac18	multi_router_planner: be terse for ctes with false wheres	2019-08-09 15:25:59 +00:00
Philip Dubé	a523a5b773	multi_null_minmax_value_pruning: no versioning & coordinator_plan	2019-08-09 15:25:59 +00:00
Philip Dubé	871dabdc63	Force CTE materialization in pg12	2019-08-09 15:25:59 +00:00
Philip Dubé	667c67891e	intermediate_results: COSTS OFF	2019-08-09 15:25:59 +00:00
Philip Dubé	b2ea806d8a	extra_float_digits=0	2019-08-09 15:25:59 +00:00
Philip Dubé	705d1bf0e0	Use PG_JOB_CACHE_DIR	2019-08-09 15:25:59 +00:00
Hanefi Onaldi	ef33282de4	Update Changelog for v8.3.2	2019-08-09 12:32:38 +03:00
Önder Kalacı	e21578f3af	Merge pull request #2866 from citusdata/fix_83_regression Do not record relation accesses unnecessarily	2019-08-08 18:47:58 +02:00
Onder Kalaci	060ac11476	Do not record relation accesses unnecessarily Before this commit, we've recorded the relation accesses in 3 different places - FindPlacementListConnection -- applies to all executors in tx block - StartPlacementExecutionOnSession() -- adaptive executor only - StartPlacementListConnection() -- router/real-time only This is different than Citus 8.2, and could considerably increase query execution times on multi-shard commands in transaction blocks that are on partitioned tables. Benchmarks: ``` 1+8 c5.4xlarge cluster Empty distributed partitioned table with 365 partitions: https://gist.github.com/onderkalaci/1edace4ed6bd6f061c8a15594865bb51#file-partitions_365-sql ./pgbench -f /tmp/multi_shard.sql -c10 -j10 -P 1 -T 120 postgres://citus:w3r6KLJpv3mxe9E-NIUeJw@c.fy5fkjcv45vcepaogqcaskmmkee.db.citusdata.com:5432/citus?sslmode=require cat /tmp/multi_shard.sql BEGIN; DELETE FROM collections_list; DELETE FROM collections_list; DELETE FROM collections_list; COMMIT; cat /tmp/single_shard.sql BEGIN; DELETE FROM collections_list WHERE key = :aid; DELETE FROM collections_list WHERE key = :aid; DELETE FROM collections_list WHERE key = :aid; COMMIT; cat /tmp/mix.sql BEGIN; DELETE FROM collections_list WHERE key = :aid; DELETE FROM collections_list WHERE key = :aid; DELETE FROM collections_list WHERE key = :aid; DELETE FROM collections_list; DELETE FROM collections_list; DELETE FROM collections_list; COMMIT; ``` The table shows `latency average` of pgbench runs explained above, so we have a pretty solid improvement even over 8.2.2. 
\| Test \| Citus 8.2.2 \| Citus 8.3.1 \| Citus 8.3.2 (this branch) \| Citus 8.3.1 (FKEYs disabled via GUC) \| \| ------------- \| ------------- \| ------------- \|------------- \| ------------- \| \|multi_shard \| 2370.083 ms \|3605.040 ms \|1324.094 ms \|1247.255 ms \| \| single_shard \| 85.338 ms \|120.934 ms \|73.216 ms \| 78.765 ms \| \| mix \| 2434.459 ms \| 3727.080 ms \|1306.456 ms \| 1280.326 ms \|	2019-08-08 18:42:08 +02:00
Onder Kalaci	35ee896f3d	Get rid of an unnecessary parameter targetPoolSize parameter for ExecuteUtilityTaskListWithoutResults becomes obsolete, just remove it.	2019-08-07 19:35:56 +02:00
Onder Kalaci	b2e01d0745	Refactor switching to sequential mode We don't need to wait until the execution. As soon as we realize that we need sequential execution, we should do it.	2019-08-07 19:35:56 +02:00
Hanefi Onaldi	263faffb27	Update CONTRIBUTING.md (#2865 ) * Update dependency versions * Add libcurl and autoconf to required dependencies * Add Clang/LLVM instructions for CentOS/RHEL setup	2019-08-06 17:52:44 +03:00
Hadi Moshayedi	f9efb21f1b	Merge pull request #2863 from citusdata/fix_typo Fix a typo in foreign_key_restriction_enforcement	2019-08-02 16:13:43 -07:00
Hadi Moshayedi	b1ab805ce2	Fix a typo in foreign_key_restriction_enforcement	2019-08-02 16:06:52 -07:00
Hadi Moshayedi	a1a7d95c0a	Merge pull request #2861 from citusdata/less_polymorphic_plan_router_query PlanRouterQuery: don't store list of list of shard intervals in relationShardList	2019-08-02 09:16:09 -07:00
Philip Dubé	b77c52f95b	PlanRouterQuery: don't store list of list of shard intervals in relationShardList	2019-08-02 14:08:57 +00:00
Philip Dubé	9b4ba2f5b2	Merge pull request #2858 from citusdata/multi_modifications_bug Use 2PC in adaptive executor when dealing with replication factors above 1	2019-08-02 00:20:22 +00:00
Philip Dubé	fdc0ef6392	Adaptive executor: use 2PC when replication_factor > 1	2019-08-01 23:55:12 +00:00
Philip Dubé	19bcb1b4f7	multi_modifications: extend to demonstrate issue in adaptive executor	2019-08-01 23:55:04 +00:00
Hadi Moshayedi	b81d5947e4	Merge pull request #2859 from citusdata/no_null_percent_s Avoid segfault in logging queries	2019-08-01 09:36:52 -07:00
Philip Dubé	064bd66a20	Avoid segfault in logging queries	2019-07-31 15:28:46 +00:00
Hanefi Onaldi	e88cb8335f	Update Changelog for v8.3.1	2019-07-29 13:10:28 +03:00
Philip Dubé	dc67fa36c6	Merge pull request #2854 from citusdata/compare_shard_intervals_id_tie_breaker CompareShardIntervals: if intervals are equal, compare id	2019-07-26 16:27:35 +00:00
Philip Dubé	3982b4635f	CompareShardIntervals: if intervals are equal, compare id. Works around sort being unstable	2019-07-26 16:13:36 +00:00
Philip Dubé	6f1a8dfdbe	Merge pull request #2852 from citusdata/update_tests_colocation_utils_copy Update two tests, useful fallout from pg12 branch	2019-07-25 17:36:16 +00:00
Marco Slot	c471d9680c	Merge pull request #2848 from citusdata/adaptive_executor_tuning Adaptive executor performance improvements	2019-07-25 16:52:36 +02:00
Philip Dubé	0e233c63a3	multi_colocation_utils: sort by nodeport, not placementid multi_copy: replace smgr with aclitem, smgr is removed in pg12	2019-07-25 14:33:43 +00:00
Hadi Moshayedi	cd2905ec23	Merge pull request #2845 from citusdata/squash_migrations_56 Squash migrations for versions 5/6, don't use WITH OIDS	2019-07-24 11:10:15 -07:00
Philip Dubé	50144b75d0	Add check-empty to testing Makefile Don't create functions multiple times Move ALTER TABLEs to their declaration Remove DROP FUNCTIONS IF EXISTS, OR REPLACE	2019-07-24 11:03:54 -07:00
Philip Dubé	acbaa38a62	Squash migrations for versions 5/6, don't use WITH OIDS	2019-07-24 11:03:29 -07:00
Philip Dubé	3e6f3e4f3b	Merge pull request #2849 from citusdata/sort-list-is-not-in-place Update workerNodeList after sorting	2019-07-23 21:04:22 +00:00
Hanefi Onaldi	8127297999	update workerNodeList after sorting	2019-07-23 20:57:07 +00:00
Philip Dubé	6c5866cc4d	Merge pull request #2850 from citusdata/fix_a_couple_shardid_tests Fix multi_prune_shard_list	2019-07-23 20:19:55 +00:00
Philip Dubé	6598c68993	Fix multi_prune_shard_list & don't set next_shard_id unnecessarily in multi_null_minmax_value_pruning	2019-07-23 19:44:18 +00:00
Marco Slot	e2bc09838e	Use ereport instead of elog in adaptive executor	2019-07-23 20:40:32 +02:00
Marco Slot	bd111366b0	Skip CheckConnectionTimeout when checkForPoolTimeout is false	2019-07-23 20:40:32 +02:00
Marco Slot	a3811b1e55	Avoid FindWorkerNode calls in adaptive executor	2019-07-23 20:40:32 +02:00
Marco Slot	4444d92dbc	Set initial pool size to cached connection count	2019-07-23 20:40:32 +02:00
Marco Slot	4c0c33365e	Avoid creating a redundant event set at the start	2019-07-23 20:40:32 +02:00
Marco Slot	32e7a80960	Avoid unnecessary calls to PQconsumeInput	2019-07-23 20:40:32 +02:00
Marco Slot	71ad5c095b	Use ModifyWaitEvent when only wait flags changed	2019-07-23 20:40:32 +02:00
Marco Slot	efbe58eab2	Fix SQL schema version, we skipped 8.3	2019-07-17 16:05:25 +02:00
Hadi Moshayedi	86b30ee094	Merge pull request #2807 from citusdata/2776_preparation DistributedPlan: replace operation with modLevel	2019-07-16 14:04:10 -07:00
Philip Dubé	0915027389	DistributedPlan: replace operation with modLevel This causes no behavioral changes, only organizes better to implement modifying CTEs Also rename ExtactInsertRangeTableEntry to ExtractResultRelationRTE, as the source of this function didn't match the documentation Remove Task's upsertQuery in favor of ROW_MODIFY_NONCOMMUTATIVE Split up AcquireExecutorShardLock into more internal functions Tests: Normalize multi_reference_table multi_create_table_constraints	2019-07-16 13:58:18 -07:00
Hanefi Onaldi	0bdec52761	Fix default_version in citus.control file (#2840 )	2019-07-11 14:24:51 +03:00
Hadi Moshayedi	e3ab6388c1	Merge pull request #2838 from citusdata/normalize_sql_procedure_and_custom_aggregate_support Tests: normalize sql_procedure and custom_aggregate_support	2019-07-10 16:21:39 -07:00
Philip Dubé	befd0caddd	Tests: normalize sql_procedure and custom_aggregate_support Also fix typo in multi_insert_select	2019-07-10 14:36:17 +00:00
Hanefi Onaldi	5a6eba6ba9	Bump Citus to 8.4devel	2019-07-10 15:26:10 +03:00
Hanefi Onaldi	fbfc0660d2	Bump citus to 8.3.0 Add changelog entry for 8.3.0	2019-07-10 14:49:11 +03:00
Nils Dijk	3d815f240c	Merge pull request #2831 from citusdata/tests/multi-user-manual-automation Fix an issue with subquery map merge jobs as non-root	2019-07-10 12:48:23 +02:00
Nils Dijk	791cc26a86	Fix an issue with subquery map merge jobs as non-root Also automated all manual tests around multi user isolation for internal citus udf's automate upgrade_to_reference_table tests add negative tests for lock_relation_if_exists add tests for permissions on worker_cleanup_job_schema_cache add tests for worker_fetch_partition_file add tests for worker_merge_files_into_table fix problem with worker_merge_files_and_run_query when run as non-super user and add tests for behaviour	2019-07-10 12:40:05 +02:00
Marco Slot	9453645860	Merge pull request #2833 from citusdata/fix_relation_shard_list Don't modify cache entry in RelationShardListForShardCreate()	2019-07-10 09:37:00 +02:00
Hadi Moshayedi	46608e42f9	Add hyperscale tutorial to the regression tests.	2019-07-10 10:47:55 +02:00
Hadi Moshayedi	91d8a41ecd	Don't modify cache entry in RelationShardListForShardCreate()	2019-07-09 12:44:48 -07:00
Marco Slot	70434bc716	Increase slow start time in test to make valgrind tests pass	2019-07-08 06:04:13 +02:00
Marco Slot	b09ee85408	Merge pull request #2825 from citusdata/fix_set Fix crash in RESET and make it behave properly	2019-07-05 23:25:00 +02:00
Hadi Moshayedi	032167c553	Fix Assert() in ProcessVariableSetStmt()	2019-07-05 14:11:22 -07:00
Marco Slot	07d2266e11	Fix RESET and other types of SET	2019-07-05 19:30:48 +02:00
Marco Slot	8617838fd6	Merge pull request #2822 from citusdata/fix_find_worker_node Copy WorkerNode before returning in FindWorkerNode	2019-07-05 18:13:11 +02:00
Marco Slot	97334ff1ec	Copy WorkerNode before returning in FindWorkerNode	2019-07-05 09:35:53 +02:00
Marco Slot	99f18c55d9	Merge pull request #2815 from citusdata/fix_valgrind_stack_errors Increase valgrind's max-stackframe	2019-07-04 15:25:24 +02:00
Hadi Moshayedi	5d59aab38d	Increase valgrind's max-stackframe	2019-07-04 14:19:41 +02:00
Marco Slot	ea630c5070	Merge pull request #2817 from citusdata/fix_multi_extension Fix multi_extension in check-multi-vg	2019-07-04 12:53:22 +02:00
Hadi Moshayedi	d233887d68	Fix multi_extension in check-multi-vg	2019-07-04 13:03:46 +02:00
Marco Slot	ce2d4a216d	Merge pull request #2816 from citusdata/fix_null_dereference Fix a NULL dereference.	2019-07-04 11:44:52 +02:00
Hadi Moshayedi	47aa95d00d	Fix a NULL dereference.	2019-07-03 16:26:49 -07:00
Marco Slot	3359a7e6f0	Merge pull request #2812 from citusdata/fix_memory_issues Fix a use after free in adaptive executor	2019-07-03 10:43:29 +02:00
Hadi Moshayedi	805a2ac602	Fix a use after free in adaptive executor	2019-07-02 10:12:13 -07:00
Marco Slot	d6c667946c	Fix citus_executor_name mapping by reimplementing it in C	2019-06-29 22:38:29 +02:00
Marco Slot	70c0d96507	Track partition key for adaptive executor in CitusEndScan	2019-06-29 21:37:15 +02:00
Önder Kalacı	40da78c6fd	Introduce the adaptive executor (#2798 ) With this commit, we're introducing the Adaptive Executor. The commit message consists of two distinct sections. The first part explains how the executor works. The second part consists of the commit messages of the individual smaller commits that resulted in this commit. Readers can search for each of the smaller commit messages on https://github.com/citusdata/citus and can learn more about the history of the change. /------------------------------------------------------------------------- * adaptive_executor.c * * The adaptive executor executes a list of tasks (queries on shards) over * a connection pool per worker node. The results of the queries, if any, * are written to a tuple store. * * The concepts in the executor are modelled in a set of structs: * * - DistributedExecution: * Execution of a Task list over a set of WorkerPools. * - WorkerPool * Pool of WorkerSessions for the same worker which opportunistically * executes "unassigned" tasks from a queue. * - WorkerSession: * Connection to a worker that is used to execute "assigned" tasks * from a queue and may execute unassigned tasks from the WorkerPool. * - ShardCommandExecution: * Execution of a Task across a list of placements. * - TaskPlacementExecution: * Execution of a Task on a specific placement. * Used in the WorkerPool and WorkerSession queues. * * Every connection pool (WorkerPool) and every connection (WorkerSession) * have a queue of tasks that are ready to execute (readyTaskQueue) and a * queue/set of pending tasks that may become ready later in the execution * (pendingTaskQueue). The tasks are wrapped in a ShardCommandExecution, * which keeps track of the state of execution and is referenced from a * TaskPlacementExecution, which is the data structure that is actually * added to the queues and describes the state of the execution of a task * on a particular worker node. 
* * When the task list is part of a bigger distributed transaction, the * shards that are accessed or modified by the task may have already been * accessed earlier in the transaction. We need to make sure we use the * same connection since it may hold relevant locks or have uncommitted * writes. In that case we "assign" the task to a connection by adding * it to the task queue of specific connection (in * AssignTasksToConnections). Otherwise we consider the task unassigned * and add it to the task queue of a worker pool, which means that it * can be executed over any connection in the pool. * * A task may be executed on multiple placements in case of a reference * table or a replicated distributed table. Depending on the type of * task, it may not be ready to be executed on a worker node immediately. * For instance, INSERTs on a reference table are executed serially across * placements to avoid deadlocks when concurrent INSERTs take conflicting * locks. At the beginning, only the "first" placement is ready to execute * and therefore added to the readyTaskQueue in the pool or connection. * The remaining placements are added to the pendingTaskQueue. Once * execution on the first placement is done the second placement moves * from pendingTaskQueue to readyTaskQueue. The same approach is used to * fail over read-only tasks to another placement. * * Once all the tasks are added to a queue, the main loop in * RunDistributedExecution repeatedly does the following: * * For each pool: * - ManageWorkPool evaluates whether to open additional connections * based on the number unassigned tasks that are ready to execute * and the targetPoolSize of the execution. * * Poll all connections: * - We use a WaitEventSet that contains all (non-failed) connections * and is rebuilt whenever the set of active connections or any of * their wait flags change. 
* * We almost always check for WL_SOCKET_READABLE because a session * can emit notices at any time during execution, but it will only * wake up WaitEventSetWait when there are actual bytes to read. * * We check for WL_SOCKET_WRITEABLE just after sending bytes in case * there is not enough space in the TCP buffer. Since a socket is * almost always writable we also use WL_SOCKET_WRITEABLE as a * mechanism to wake up WaitEventSetWait for non-I/O events, e.g. * when a task moves from pending to ready. * * For each connection that is ready: * - ConnectionStateMachine handles connection establishment and failure * as well as command execution via TransactionStateMachine. * * When a connection is ready to execute a new task, it first checks its * own readyTaskQueue and otherwise takes a task from the worker pool's * readyTaskQueue (on a first-come-first-serve basis). * * In cases where the tasks finish quickly (e.g. <1ms), a single * connection will often be sufficient to finish all tasks. It is * therefore not necessary that all connections are established * successfully or open a transaction (which may be blocked by an * intermediate pgbouncer in transaction pooling mode). It is therefore * essential that we take a task from the queue only after opening a * transaction block. * * When a command on a worker finishes or the connection is lost, we call * PlacementExecutionDone, which then updates the state of the task * based on whether we need to run it on other placements. When a * connection fails or all connections to a worker fail, we also call * PlacementExecutionDone for all queued tasks to try the next placement * and, if necessary, mark shard placements as inactive. If a task fails * to execute on all placements, the execution fails and the distributed * transaction rolls back. 
* * For multi-row INSERTs, tasks are executed sequentially by * SequentialRunDistributedExecution instead of in parallel, which allows * a high degree of concurrency without high risk of deadlocks. * Conversely, multi-row UPDATE/DELETE/DDL commands take aggressive locks * which forbids concurrency, but allows parallelism without high risk * of deadlocks. Note that this is unrelated to SEQUENTIAL_CONNECTION, * which indicates that we should use at most one connection per node, but * can run tasks in parallel across nodes. This is used when there are * writes to a reference table that has foreign keys from a distributed * table. * * Execution finishes when all tasks are done, the query errors out, or * the user cancels the query. * ------------------------------------------------------------------------- / All the commits involved here: * Initial unified executor prototype * Latest changes * Fix rebase conflicts to master branch * Add missing variable for assertion * Ensure that master_modify_multiple_shards() returns the affectedTupleCount * Adjust intermediate result sizes The real-time executor uses COPY command to get the results from the worker nodes. Unified executor avoids that which results in less data transfer. Simply adjust the tests to lower sizes. * Force one connection per placement (or co-located placements) when requested The existing executors (real-time and router) always open 1 connection per placement when parallel execution is requested. That might be useful under certain circumstances: (a) User wants to utilize as much as CPUs on the workers per distributed query (b) User has a transaction block which involves COPY command Also, lots of regression tests rely on this execution semantics. So, we'd enable few of the tests with this change as well. 
* For parameters to be resolved before using them For the details, see PostgreSQL's copyParamList() * Unified executor sorts the returning output * Ensure that unified executor doesn't ignore sequential execution of DDLJob's Certain DDL commands, mainly creating foreign keys to reference tables, should be executed sequentially. Otherwise, we'd end up with a self distributed deadlock. To overcome this situation, we set a flag `DDLJob->executeSequentially` and execute it sequentially. Note that we have to do this because the command might not be called within a transaction block, and we cannot call `SetLocalMultiShardModifyModeToSequential()`. This fixes at least two tests: multi_insert_select_on_conflit.sql and multi_foreign_key.sql Also, I wouldn't mind scattering local `targetPoolSize` variables within the code. The reason is that we'll soon have a GUC (or a global variable based on a GUC) that'd set the pool size. In that case, we'd simply replace `targetPoolSize` with the global variables. * Fix 2PC conditions for DDL tasks * Improve closing connections that are not fully established in unified execution * Support foreign keys to reference tables in unified executor The idea for supporting foreign keys to reference tables is simple: Keep track of the relation accesses within a transaction block. - If a parallel access happens on a distributed table which has a foreign key to a reference table, one cannot modify the reference table in the same transaction. Otherwise, we're very likely to end up with a self-distributed deadlock. - If an access to a reference table happens, and then a parallel access to a distributed table (which has a fkey to the reference table) happens, we switch to sequential mode. Unified executor misses the function calls that mark the relation accesses during the execution. Thus, simply add the necessary calls and let the logic kick in. 
* Make sure to close the failed connections after the execution * Improve comments * Fix savepoints in unified executor. * Rebuild the WaitEventSet only when necessary * Unclaim connections on all errors. * Improve failure handling for unified executor - Implement the notion of errorOnAnyFailure. This is similar to Critical Connections that the connection management APIs provide - If the nodes inside a modifying transaction expand, activate 2PC - Fix a few bugs related to wait event sets - Mark placement INACTIVE during the execution as much as possible as opposed to doing so in the COMMIT handler - Fix a few bugs related to scheduling next placement executions - Improve decision on when to use 2PC Improve the logic to start a transaction block for distributed transactions - Make sure that only reference table modifications are always executed with distributed transactions - Make sure that stored procedures and functions are executed with distributed transactions * Move waitEventSet to DistributedExecution This could also be local to RunDistributedExecution(), but in that case we had to mark it as "volatile" to avoid PG_TRY()/PG_CATCH() issues, and cast it to non-volatile when doing WaitEventSetFree(). We thought that would make code a bit harder to read than making this non-local, so we move it here. See comments for PG_TRY() in postgres/src/include/elog.h and "man 3 siglongjmp" for more context. * Fix multi_insert_select test outputs Two things: 1) One complex transaction block is now supported. Simply update the test output 2) Due to dynamic nature of the unified executor, the order of the errors coming from the shards might change (e.g., all of the queries on the shards would fail, but which one appears on the error message?). To fix that, we simply added it to our shardId normalization tool which happens just before diff. 
* Fix subquery_and_cte test The error message is updated from: failed to execute task To: more than one row returned by a subquery or an expression which is a lot clearer to the user. * Fix intermediate_results test outputs Simply update the error message from: could not receive query results to result "squares" does not exist which makes a lot more sense. * Fix multi_function_in_join test The error message is updated from: Failed to execute task XXX To: function f(..) does not exist * Fix multi_query_directory_cleanup test The unified executor does not create any intermediate files. * Fix with_transactions test A test case that just started to work fine * Fix multi_router_planner test outputs The error message is updated from: Could not receive query results To: Relation does not exist which is a lot clearer for the users * Fix multi_router_planner_fast_path test The error message is updated from: Could not receive query results To: Relation does not exist which is a lot clearer for the users * Fix isolation_copy_placement_vs_modification by disabling select_opens_transaction_block * Fix ordering in isolation_multi_shard_modify_vs_all * Add executor locks to unified executor * Make sure to allocate enough WaitEvents The previous code was missing the waitEvents for the latch and postmaster death. * Fix rebase conflicts for master rebase * Make sure that TRUNCATE relies on unified executor * Implement true sequential execution for multi-row INSERTS Execute the individual tasks one by one. Note that this is different than MultiShardConnectionType == SEQUENTIAL_CONNECTION case (e.g., sequential execution mode). In that case, running the tasks across the nodes in parallel is acceptable and implemented in that way. However, the executions that are qualified here would perform poorly if the tasks across the workers are executed in parallel. We currently qualify only one class of distributed queries here, multi-row INSERTs. 
If we do not enforce true sequential execution, concurrent multi-row upserts could easily form a distributed deadlock when the upserts touch the same rows. * Remove SESSION_LIFESPAN flag in unified_executor * Apply failure test updates We've changed the failure behaviour a bit, and also the error messages that show up to the user. This PR covers the majority of the updates. * Unified executor honors citus.node_connection_timeout With this commit, unified executor errors out if even a single connection cannot be established within citus.node_connection_timeout. And, as a side effect this fixes failure_connection_establishment test. * Properly increment/decrement pool size variables Before this commit, the idle and active connection counts were not properly calculated. * insert_select_executor goes through unified executor. * Add missing file for task tracker * Modify ExecuteTaskListExtended()'s signature * Sort output of INSERT ... SELECT ... RETURNING * Take partition locks correctly in unified executor * Alternative implementation for force_max_query_parallelization * Fix compile warnings in unified executor * Fix style issues * Decrement idleConnectionCount when idle connection is lost * Always rebuild the wait event sets In the previous implementation, on waitFlag changes, we were only modifying the wait events. However, we've realized that it might be an over-optimization since (a) we couldn't see any performance benefits (b) we see some errors on failures and because of (a) we prefer to disable it now. * Make sure to allocate a large enough waitEventSet With multi-row INSERTs, we might have more sessions than taskworkerCount after a few calls of RunDistributedExecution() because the previous sessions would also be alive. Instead, re-allocate events when the connection set changes. 
* Implement SELECT FOR UPDATE on reference tables On master branch, we do two extra things on SELECT FOR UPDATE queries on reference tables: - Acquire executor locks - Execute the query on all replicas With this commit, we're implementing the same logic on the new executor. * SELECT FOR UPDATE opens transaction block even if SelectOpensTransactionBlock is disabled Otherwise, users would be very confused and their logic is very likely to break. * Fix build error * Fix the newConnectionCount calculation in ManageWorkerPool * Fix rebase conflicts * Fix minor test output differences * Fix citus indent * Remove duplicate sorts that were added with rebase * Create distributed table via executor * Fix wait flags in CheckConnectionReady * failure_savepoints output for unified executor. * failure_vacuum output (pg 10) for unified executor. * Fix WaitEventSetWait timeout in unified executor * Stabilize failure_truncate test output * Add an ORDER BY to multi_upsert * Fix regression test outputs after rebase to master * Add executor.c comment * Rename executor.c to adaptive_executor.c * Do not schedule tasks if the failed placement is not ready to execute Before the commit, we were blindly scheduling the next placement executions even if the failed placement is not on the ready queue. Now, we're ensuring that if failed placement execution is on a failed pool or session where the execution is on the pendingQueue, we do not schedule the next task. Because the other placement execution should be already running. * Implement a proper custom scan node for adaptive executor - Switch between the executors, add GUC to set the pool size - Add non-adaptive regression test suites - Enable CIRCLE CI for non-adaptive tests - Adjust test output files * Add slow start interval to the executor * Expose max_cached_connection_per_worker to user * Do not start slow when there are cached connections * Consider ExecutorSlowStartInterval in NextEventTimeout * Fix memory issues with ReceiveResults(). 
* Disable executor via TaskExecutorType * Make sure to execute the tests with the other executor * Use task_executor_type to enable-disable adaptive executor * Remove useless code * Adjust the regression tests * Add slow start regression test * Rebase to master * Fix test failures in adaptive executor. * Rebase to master - 2 * Improve comments & debug messages * Set force_max_query_parallelization in isolation_citus_dist_activity * Force max parallelization for creating shards when asked to use exclusive connection. * Adjust the default pool size * Expand description of max_adaptive_executor_pool_size GUC * Update warnings in FinishRemoteTransactionCommit() * Improve session clean up at the end of execution Explicitly list all the states that the execution might end, otherwise warn. * Remove MULTI_CONNECTION_WAIT_RETRY which is not used at all * Add more ORDER BYs to multi_mx_partitioning	2019-06-28 14:04:40 +02:00
Önder Kalacı	2d57899130	Merge pull request #2806 from citusdata/isolation_consistent_finish_step Isolation tests: consistently name COMMIT '-commit'	2019-06-27 11:57:23 +02:00
Philip Dubé	4e54c1525d	Isolation tests: consistently name COMMIT '-commit'	2019-06-27 07:32:39 +02:00
Önder Kalacı	66b4225d0a	Merge pull request #2777 from citusdata/create-all-schemas-as-superuser Create all distributed schemas as superuser on a separate connection	2019-06-26 17:19:02 +02:00
Hanefi Onaldi	4e08477fed	Add test case for issue 2575	2019-06-26 17:12:28 +02:00
Hanefi Onaldi	7e8fd49b94	Create Schemas as superuser on all shard/table creation UDFs - All the schema creations on the workers will now be via superuser connections - If a shard is being repaired or a shard is replicated, we will create the schema only in the relevant worker; and in all the other cases where a schema creation is needed, we will block operations until we ensure the schema exists in all the workers	2019-06-26 17:12:28 +02:00
Philip Dubé	6ae9158216	Merge pull request #2778 from citusdata/2400_modifying_ctes Support CTEs in router planner for modification queries	2019-06-26 16:47:00 +02:00
Philip Dubé	aa0c47848e	subquery_and_cte: test rejecting volatile ctes Also update isolation_citus_dist_activity from after merge	2019-06-26 16:27:07 +02:00
Philip Dubé	db7fdb1854	Router planner: bail on volatile functions in CTEs	2019-06-26 10:32:01 +02:00
Philip Dubé	5c62f9935a	Router planner: reject SELECT FOR UPDATE ctes	2019-06-26 10:32:01 +02:00
Philip Dubé	18575ccfd3	Add tests to subquery_and_cte, update check-multi-mx expected results	2019-06-26 10:32:01 +02:00
Philip Dubé	77efec04a0	Router Planner: accept SELECT_CMD ctes in modification queries	2019-06-26 10:32:01 +02:00
Philip Dubé	84fe626378	multi_router_planner: refactor error propagation	2019-06-26 10:32:01 +02:00
Philip Dubé	9ed6dd5570	Ignore compile_commands.json, fix typo	2019-06-26 10:32:01 +02:00
Önder Kalacı	c37198f0fa	Merge pull request #2797 from citusdata/normalize_name_lengths Normalize multi_name_lengths.	2019-06-25 16:13:30 +02:00
Hadi Moshayedi	25a984bab4	Normalize multi_name_lengths.	2019-06-25 14:18:33 +02:00
Hadi Moshayedi	c7745486b9	Merge pull request #2793 from citusdata/coordinator_plan Show just coordinator plan in some test outputs.	2019-06-24 14:11:49 +02:00
Hadi Moshayedi	3d0a521295	Show just coordinator plan in some test outputs.	2019-06-24 12:24:30 +02:00
Önder Kalacı	9a137e9486	Merge pull request #2790 from citusdata/fix_fkey_error Change the order of placement access added to the placement access list	2019-06-24 09:49:58 +02:00
Onder Kalaci	ad93d6feea	Change the order of placement access added to the list This is to make sure that the error messages related to foreign keys to reference tables shows the exact placement access name instead of SELECT.	2019-06-23 11:32:58 +02:00
Nils Dijk	eb98f2d13a	Fix null pointer caused by partial initialization of ConnParamsHashEntry (#2789 ) It has been reported a null pointer dereference could be triggered in FreeConnParamsHashEntryFields. Likely cause is an error in GetConnParams which will leave the cached ConnParamsHashEntry in a state that would cause the null pointer dereference in a subsequent connection establishment to the same server. This has been simulated by inserting ereport(ERROR, ...) at certain places in the code. Not only would ConnParamsHashEntry be in a state that would cause a crash, it was also leaking memory in the ConnectionContext due to the loss of pointers as they are only stored on the ConnParamsHashEntry at the end of the function. This patch rewrites both the GetConnParams to store pointers 'durably' at every point in the code so that an error would not lose the pointer as well as FreeConnParamsHashEntryFields in a way that it can clear half initialised ConnParamsHashEntry's in a safer manner.	2019-06-21 18:16:43 +02:00
Hanefi Onaldi	7a6eb2aba0	Fix one regression test that fails on enterprise (#2786 ) GRANT queries are propagated on Enterprise. If a user attempts to create a user and run a GRANT query before creating it on workers, we fail. This issue does not happen in community as the user needs to run the GRANTs on the workers manually.	2019-06-21 15:46:28 +03:00
Nils Dijk	5df1b49bed	Feature: optionally force master_update_node during failover (#2773 ) When `master_update_node` is called to update a node's location it waits for appropriate locks to become available. This is useful during normal operation as new operations will be blocked till after the metadata update while running operations have time to finish. When `master_update_node` is called after a node failure it is less useful to wait for running operations to finish as they can't. The lock being held indicates an operation that once attempted to commit will fail as the machine already failed. Now the downside is the failover is postponed till the termination point of the operation. This has been observed by users to take a significant amount of time causing the rest of the system to be observed unavailable. With this patch it is possible in such situations to invoke `master_update_node` with 2 optional arguments: - `force` (bool defaults to `false`): When called with true the update of the metadata will be forced to proceed by terminating conflicting backends. A cancel is not enough as the backend might be in idle time (eg. an interactive session, or going back and forth between an application), therefore a more intrusive solution of termination is used here. - `lock_cooldown` (int defaults to `10000`): This is the time in milliseconds before conflicting backends are terminated. This is to allow the backends to finish cleanly before terminating them. This allows the user to set an upper bound to the expected time to complete the metadata update, eg. performing the failover. The functionality is implemented by spawning a background worker that has the task of helping a certain backend in acquiring its locks. The backend is either terminated on successful execution of the metadata update, or once the memory context of the expression gets reset, eg. on a cancel of the statement.	2019-06-21 12:03:15 +02:00
Jason Petersen	70055098af	Merge pull request #2689 from citusdata/prop_set_local Add logic to propagate SET LOCAL at xact start cr: @marcocitus	2019-06-20 16:28:35 -07:00
Jason Petersen	d4e1172247	Implement propagation of SET LOCAL commands Adds support for propagation of SET LOCAL commands to all workers involved in a query. For now, SET SESSION (i.e. plain SET) is not supported whatsoever, though this code is intended as somewhat of a base for implementing such support in the future. As SET LOCAL modifications are scoped to the body of a BEGIN/END xact block, queries wishing to use SET LOCAL propagation must be within such a block. In addition, subsequent modifications after e.g. any SAVEPOINT or ROLLBACK statements will correspondingly push or pop variable modifications onto an internal stack such that the behavior of changed values across the cluster will be identical to such behavior on e.g. single-node PostgreSQL (or equivalently, what values are visible to the end user by running SHOW on such variables on the coordinator). If nodes enter the set of participants at some point after SET LOCAL modifications (or SAVEPOINT, ROLLBACK, etc.) have occurred, the SET variable state is eagerly propagated to them upon their entrance (this is identical to, and indeed just augments, the existing logic for the propagation of the SAVEPOINT "stack"). A new GUC (citus.propagate_set_commands) has been added to control this behavior. Though the code suggests the valid settings are 'none', 'local', 'session', and 'all', only 'none' (the default) and 'local' are presently implemented: attempting to use other values will result in an error.	2019-06-20 16:15:43 -07:00
Jason Petersen	1dec6c5163	Change BeginCoordinatedTransaction to internal linkage It's only ever called from a single file, so having it be extern didn't make a whole lot of sense.	2019-06-20 13:44:06 -07:00
Jason Petersen	2349e8e75c	Remove extraneous comments around PG header change	2019-06-20 13:37:53 -07:00
Hadi Moshayedi	602f3cd551	Merge pull request #2780 from citusdata/master_super_copy Make COPY adapt to connection use behaviour of previous commands in transaction	2019-06-20 20:00:30 +02:00
Hadi Moshayedi	4bbae02778	Make COPY compatible with unified executor.	2019-06-20 19:53:40 +02:00
Hadi Moshayedi	17d4d3e5ea	Merge pull request #2781 from citusdata/refactor_ExecuteModifyTasksSequentially Refactor ExecuteModifyTasksSequentially.	2019-06-20 18:45:33 +02:00
Hadi Moshayedi	2e6d04df7b	Refactor ExecuteModifyTasksSequentially.	2019-06-20 18:38:57 +02:00
Hadi Moshayedi	6741ffd716	Merge pull request #2775 from citusdata/remove_unneeded_expected_file Use normalization for multi_subtransaction output	2019-06-19 18:00:24 +02:00
Hadi Moshayedi	d4f3e2809d	Use normalization for multi_subtransaction output	2019-06-19 17:54:33 +02:00
Hadi Moshayedi	adb6afe8b9	Merge pull request #2774 from citusdata/fix_subxact_release Fix subxact release crash.	2019-06-19 17:51:21 +02:00
Hadi Moshayedi	83f6c7dab4	Fix subxact release crash	2019-06-19 17:43:10 +02:00
Önder Kalacı	dabe1e0add	Merge pull request #2769 from citusdata/refactor_create_dist_table Refactor shard creation logic	2019-06-19 16:07:38 +02:00
Onder Kalaci	2b0c4accda	Apply feedback	2019-06-19 10:03:58 +02:00
Onder Kalaci	3a04374a9e	Refactor relation shard list creation during placement creation This change is to make further refactoring even simpler such as using the executor for shard creation.	2019-06-19 10:03:58 +02:00
Onder Kalaci	4fd1fcbbef	Refactor shard creation logic This is a preparation for the new executor, where creating shards would go through the executor. So, explicitly generate the commands for further processing.	2019-06-19 10:03:58 +02:00
Jason Petersen	96d9847aa4	Merge pull request #2757 from citusdata/werror Enable Werror for all warnings cr: @jasonmp85	2019-06-18 14:51:56 -07:00
Jason Petersen	cdaca7297c	Switch to werror-enabled CircleCI image	2019-06-18 14:43:54 -07:00
Philip Dubé	4bfcf5b665	Enable Werror for all warnings Changes to ruleutils match changes made upstream to silence gcc fallthrough warnings	2019-06-18 14:43:54 -07:00
Hadi Moshayedi	04abc1137f	Merge pull request #2772 from citusdata/cancel Use SendCancelationRequest() in ShutdownConnection()	2019-06-18 12:16:40 +02:00
Hadi Moshayedi	b240854b8c	Use SendCancelationRequest() in ShutdownConnection()	2019-06-18 12:10:05 +02:00
Hadi Moshayedi	ee37e3da89	Merge pull request #2765 from citusdata/fix_diff Fix test name detection in bin/diff	2019-06-17 12:39:34 +02:00
Hadi Moshayedi	c42b22f8fd	Fix test name detection in bin/diff	2019-06-17 11:31:42 +02:00
Philip Dubé	ab15a214e0	Merge pull request #2733 from citusdata/fix_2642_joinalias Fix join alias resolution	2019-06-12 17:34:39 -07:00
Philip Dubé	342d423725	Fix join alias resolution FROM (query) alias ignored renaming In nested subqueries the select list would rename, while the join alias would not respect that	2019-06-12 17:25:07 -07:00
Hanefi Onaldi	b613403d87	update changelog for v8.2.2	2019-06-11 15:27:14 +03:00
Marco Slot	c045c9c8eb	Merge pull request #2750 from citusdata/stats_collection_off enable_statistics_collection defaults to off (opt-in)	2019-06-06 12:12:52 +02:00
Marco Slot	c1ac794b77	enable_statistics_collection defaults to off	2019-06-05 18:43:26 +02:00
Hadi Moshayedi	674b7ce29a	Merge pull request #2748 from citusdata/ScanStateGetTupleDescriptor Refactor some scan state info into their own functions.	2019-06-05 09:22:00 -07:00
Hadi Moshayedi	85325e0098	Refactor ScanStateGetExecutorState into its own function.	2019-06-05 09:16:43 -07:00
Hadi Moshayedi	0b01c59fa6	Refactor ScanStateGetTupleDescriptor() into a function.	2019-06-04 15:19:49 -07:00
Hadi Moshayedi	7abd28d3e8	Merge pull request #2654 from citusdata/fix_lateral_joins Search all outer node levels for lateral join params.	2019-06-04 10:18:18 -07:00
Hadi Moshayedi	8e2d328530	Search all outer node levels for lateral join params.	2019-06-04 10:14:05 -07:00
Demur Rumed	5cc8049caa	Merge pull request #2742 from citusdata/fix_2739_outer_join_subquery_error Also check rewrittenQuery jointree for outer join	2019-06-04 07:52:59 -07:00
Philip Dubé	b5ced403d8	Also check rewrittenQuery jointree for outer join	2019-06-04 07:47:35 -07:00
Önder Kalacı	b7f5819281	Merge pull request #2745 from citusdata/refactor_copy Refactor ShardIdForTuple() to a separate function.	2019-06-03 10:24:17 +02:00
Hadi Moshayedi	dee5bc31b4	Refactor ShardIdForTuple() to a separate function.	2019-06-02 09:48:15 -07:00
Önder Kalacı	27b0f0023c	Merge pull request #2716 from citusdata/max_cached_connections Replace session lifespan flag with a configurable number of connections	2019-05-29 15:05:09 +02:00
Marco Slot	c1566d464b	Fix failure and isolation tests On top of citus.max_cached_conns_per_worker GUC, with this commit we're updating the regression tests to comply with the new behaviour.	2019-05-29 14:42:31 +02:00
Marco Slot	bb3a96eacb	Cache a configurable number of connections at xact end	2019-05-29 13:24:31 +02:00
Önder Kalacı	caa8fffbd0	Merge pull request #2736 from citusdata/order_by_fix_9 Make sure that the regression tests are resistant to execution order changes	2019-05-28 12:27:45 +02:00
Onder Kalaci	d46b92d79a	Add order by to multi_mx_schema_support	2019-05-28 12:23:28 +02:00
Onder Kalaci	fa2a6e4d8f	Add order by to multi_mx_router_planner	2019-05-28 12:23:28 +02:00
Onder Kalaci	0a7a173eee	Add order by to multi_mx_reference_table	2019-05-28 12:23:28 +02:00
Onder Kalaci	1553e12ee4	Add order by to multi_subquery_complex_reference_clause	2019-05-28 12:06:57 +02:00
Hadi Moshayedi	d4dbe8f008	Merge pull request #2732 from citusdata/fix_a_typo Fix a typo: WITH CARDINALITY -> WITH ORDINALITY	2019-05-24 15:58:31 -07:00
Hadi Moshayedi	23207a43e0	Fix a typo: WITH CARDINALITY -> WITH ORDINALITY	2019-05-24 15:49:17 -07:00
Demur Rumed	aa74eea955	Merge pull request #2726 from citusdata/fix_2548_alterforeign Propagate more ALTER FOREIGN TABLE commands to workers	2019-05-24 19:59:09 +00:00
Philip Dubé	b8871d9ff4	Propagate more ALTER FOREIGN TABLE to workers	2019-05-24 12:54:05 -07:00
Marco Slot	dff1a8db08	Merge pull request #2725 from citusdata/deprecate_mmms Deprecate master_modify_multiple_shards	2019-05-24 14:33:49 +02:00
Marco Slot	b3fcf2a48f	Deprecate master_modify_multiple_shards	2019-05-24 15:22:06 +02:00
Marco Slot	7a2e3124f7	Merge pull request #2724 from citusdata/truncate_cleanup Stop using master_modify_multiple_shards in TRUNCATE	2019-05-24 13:47:01 +02:00
Marco Slot	7fa5d36057	Stop using master_modify_multiple_shards in TRUNCATE	2019-05-24 14:35:46 +02:00
exialin	59e54de54d	Minor code clean-up	2019-05-24 14:26:26 +02:00
Hanefi Onaldi	b31fbcb28d	Merge pull request #2723 from citusdata/simplify-round-robin-on-router-queries Simplify round robin logic on router queries	2019-05-24 14:24:05 +03:00
Hanefi Onaldi	7443191397	Improve tests for round robin & router queries	2019-05-24 14:16:56 +03:00
Hanefi Onaldi	4d737177e6	Remove redundant active placement filters and unneeded sort operations If a query is router executable, it hits a single shard and therefore has a single task associated with it. Therefore there is no need to sort the task list that has a single element. Also, we already have a list of active shard placements, so we pass it as a parameter and reuse it.	2019-05-24 14:16:50 +03:00
Hanefi Onaldi	b935dfb8c8	Cleanup deleted function declaration	2019-05-24 14:04:26 +03:00
Demur Rumed	af16ed7308	Merge pull request #2727 from citusdata/spellcheck Fix misc typos (2)	2019-05-24 00:36:23 +00:00
Philip Dubé	16886b3c63	Fix misc typos	2019-05-23 17:23:27 -07:00
Önder Kalacı	178142fe01	Merge pull request #2721 from citusdata/fix_test_2pc Fix wrong transaction recovery test output	2019-05-22 08:39:46 +02:00
Onder Kalaci	f1a80a609f	Fix wrong test output If the replication factor equals 2 and there are two worker nodes, even if two modifications hit different shards, Citus doesn't use 2PC. The reason is that it doesn't fit into the definition of "expanding participating worker nodes". Thus, we're simply fixing the test to fit in the comment on top of it.	2019-05-21 19:12:37 +03:00
Hadi Moshayedi	56708efc87	Merge pull request #2719 from citusdata/fix_some_comments Fix comments for RemoteFileDestReceiverStartup and CitusCopyDestReceiverStartup	2019-05-21 08:08:34 -08:00
Hadi Moshayedi	8ae47e1244	Fix comments for RemoteFileDestReceiverStartup and CitusCopyDestReceiverStartup	2019-05-21 09:03:22 -07:00
Önder Kalacı	21f772030f	Merge pull request #2720 from citusdata/order_by_fix_8 Make sure that the regression tests are resistant to execution order changes	2019-05-21 15:00:13 +02:00
Onder Kalaci	f76abfe470	Add ORDER BY to multi_router_planner	2019-05-21 15:54:33 +03:00
Onder Kalaci	f06a79563d	Add ORDER BY to multi_foreign_key	2019-05-21 15:54:03 +03:00
Hadi Moshayedi	332ccbf8c1	Merge pull request #2718 from citusdata/fix_include Fix an include in recusive_planning.c	2019-05-20 18:01:56 -08:00
Hadi Moshayedi	dce9260c0e	Fix an include in recusive_planning.c	2019-05-20 18:57:03 -07:00
Murat Tuncer	e29bc808b7	Merge pull request #2703 from citusdata/fix_dist_table_cache_initialization Fix DistShardCacheHash initialization	2019-05-15 16:57:01 +03:00
Murat Tuncer	3fe482adbc	Fix DistShardCacheHash initialization InitializeCaches() method may prematurely set performedInitialization without actually creating DistShardCacheHash. Fix makes sure flag is set only if DistShardCacheHash is created successfully. Also introduced a new memory context to allocate aforementioned hash tables. If allocation/initialization fails for any reason we make sure memory is reclaimed by deleting the memory context.	2019-05-15 16:47:44 +03:00
Hanefi Onaldi	986ef6651a	Merge pull request #2707 from citusdata/correct_anchor_shardids_on_round_robin Prevent anchoring reference table shard ids when distributed tables are in join clauses	2019-05-15 09:31:24 +03:00
Hanefi Onaldi	4030d603eb	Merge pull request #2691 from citusdata/update_changelog Add 8.1.2 and 8.2.1 changelog entries	2019-05-15 09:18:58 +03:00
Hadi Moshayedi	1b4dc44996	Merge pull request #2706 from citusdata/simplify_EndRemoteCopy Remove stopOnFailure flag from EndRemoteCopy()	2019-05-13 12:51:15 -08:00
Hadi Moshayedi	b5c0ca45f1	Remove stopOnFailure flag from EndRemoteCopy()	2019-05-11 06:18:34 -07:00
Hadi Moshayedi	39e806c276	Merge pull request #2701 from citusdata/fix_warnings Fix mixed declarations and code warnings	2019-05-08 11:56:10 -08:00
Hadi Moshayedi	e584961267	Fix mixed declarations and code warnings	2019-05-08 12:51:40 -07:00
Hanefi Onaldi	f7081f3119	Merge pull request #2691 from citusdata/update_changelog Add 8.1.2 and 8.2.1 changelog entries	2019-05-07 09:42:51 +03:00
velioglu	b6bbee2cac	Add 8.1.2 and 8.2.1 changelog entries	2019-05-07 09:09:47 +03:00
Claire Giordano	ce53671616	Merge pull request #2699 from citusdata/may-readme Updates to README.md on May 6th	2019-05-06 08:30:55 -07:00
Claire Giordano	c429bc4ced	updated Agari & MixRank user story links	2019-05-06 00:53:38 -07:00
Claire Giordano	61bda2a9e4	fix typo	2019-05-06 00:49:56 -07:00
Claire Giordano	076a9a3232	Updated descr, use case, customer, & get started	2019-05-06 00:47:20 -07:00
Önder Kalacı	53d0dcd659	Merge pull request #2696 from citusdata/order_by_fix_7 Add some more ORDER BYs	2019-05-02 19:15:31 +02:00
Onder Kalaci	5d68a13139	Add order by to multi_shard_update_delete	2019-05-02 20:09:33 +03:00
Onder Kalaci	2c76b4bc46	Add order by to multi_function_in_join test	2019-05-02 20:05:25 +03:00
Önder Kalacı	febe412108	Merge pull request #2688 from citusdata/unify_fkey_to_ref_recording Refactor Parallel Relation Access Recording	2019-05-02 17:19:06 +02:00
Onder Kalaci	495b6e9b62	Refactor Parallel Relation Access Recording Instead of scattering the code around, we move all the logic into a single function. This will help supporting foreign keys to reference tables in the unified executor with a single line of change, just calling this function.	2019-05-02 18:12:33 +03:00
Önder Kalacı	2f55d61800	Merge pull request #2695 from citusdata/order_by_fix_6 Add some ORDER BYs to make the test output consistent	2019-05-02 17:11:47 +02:00
Onder Kalaci	3d871c5334	Add some ORDER BYs to make the test output consistent	2019-05-02 18:00:46 +03:00
Hadi Moshayedi	5205bc4be9	Merge pull request #2673 from citusdata/fix_multishard_transactions Fix savepoint rollback after multi-shard modify/copy failure.	2019-05-01 08:38:25 -08:00
Hadi Moshayedi	32ecb6884c	Test ROLLBACK TO SAVEPOINT with multi-shard CTE failures	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	aafd22dffa	Fix savepoint rollback for INSERT INTO ... SELECT.	2019-05-01 09:33:43 -07:00
Hadi Moshayedi	b69a762e0b	Fix savepoint rollback after multi-shard update failure.	2019-05-01 09:33:43 -07:00
Jason Petersen	5b98f26984	Merge pull request #2669 from citusdata/fix_self_strncmp Fix self strncmp cr: @marcocitus	2019-04-30 15:18:58 -05:00
Jason Petersen	71d5d1c865	Enable variable shadowing warnings; fix all Rather than wait for another place like the previous commit to bite us, I think we should turn on this warning.	2019-04-30 13:24:25 -06:00
Jason Petersen	1125fc9da0	Fix self-strncmp in ConstrIsFKToReferenceTable Make the function do what I assume was intended.	2019-04-30 13:24:25 -06:00
Hadi Moshayedi	885d48f87d	Merge pull request #2690 from citusdata/fix_diff Normalize test results and expected files before comparing them.	2019-04-30 09:44:37 -08:00
Hadi Moshayedi	a9f7c1e8cb	Normalize test result/expected files before doing diff.	2019-04-30 10:19:23 -07:00
Hadi Moshayedi	4cb8ed0f9a	Merge pull request #2664 from citusdata/fix-ActivePlacementList Don't schedule tasks on inactive nodes.	2019-04-26 10:11:58 -07:00
Hadi Moshayedi	c9b1d9c2d1	Check all placements aren't inactive	2019-04-26 10:04:55 -07:00
Hadi Moshayedi	7b1d03772d	Don't schedule tasks on inactive nodes.	2019-04-26 10:04:54 -07:00
Önder Kalacı	116c255d3d	Merge pull request #2683 from citusdata/order_by_fix_5 Add ORDER BYs to multi_subquery and subqueries_deep tests	2019-04-25 10:52:10 +02:00
Onder Kalaci	82813a8796	Add ORDER BYs to multi_subquery and subqueries_deep tests	2019-04-24 13:36:11 +03:00
Önder Kalacı	796141334d	Merge pull request #2677 from citusdata/implicitly_order_returning Sort output of RETURNING	2019-04-24 11:24:42 +02:00
Onder Kalaci	004f28e18c	Sort output of RETURNING The feature is only intended for getting consistent outputs for the regression tests. RETURNING does not have any ordering guarantees and with unified executor, the ordering of query executions on the shards is also becoming unpredictable. Thus, we're enforcing ordering when a GUC is set. We implicitly add an `ORDER BY`, something equivalent to ` RETURNING expr1, expr2, .. ,exprN ORDER BY expr1, expr2, .. ,exprN ` As described in the code comments as well, this is probably not the most performant approach we could implement. However, since we're only targeting regression tests, I don't see any issues with that. If we decide to expand this to a feature for users, we should revisit the implementation and improve the performance.	2019-04-24 11:51:19 +03:00
Önder Kalacı	6362c40865	Merge pull request #2678 from citusdata/order_by_another Add some more ORDER BYs	2019-04-24 09:56:32 +02:00
Onder Kalaci	64b323d9eb	Add ORDER BY to set_operations	2019-04-23 11:51:58 +03:00
Onder Kalaci	913ffc9dcd	Add ORDER BY to multi_subquery_in_where_clause	2019-04-23 11:46:00 +03:00
Önder Kalacı	41f98f9c02	Merge pull request #2671 from citusdata/fix_more_orderbys Fix more order bys	2019-04-18 08:31:16 +02:00
Onder Kalaci	753163b4d8	Be less verbose for printing worker ports in intermediate_results	2019-04-17 14:57:20 +03:00
Onder Kalaci	b3af5b2cc4	Add order by multi_mx_modifications	2019-04-17 14:57:20 +03:00
Onder Kalaci	a159bd9aed	Add order by window_functions	2019-04-17 14:57:20 +03:00
Jason Petersen	bcf393eea8	Merge pull request #2651 from citusdata/fix_constraint_naming Address constraint naming issue cr: @pykello	2019-04-16 14:32:18 -06:00
Jason Petersen	4b9519e7d6	Check for non-extended constraint before extending This will only apply to DROP and VALIDATE commands; see the lengthy comment in multi_create_table_constraints.sql for more explanation.	2019-04-15 23:14:21 -06:00
Jason Petersen	5a017c684c	Add repro case for #2484	2019-04-15 23:14:11 -06:00
Önder Kalacı	5e9dd629a2	Merge pull request #2661 from citusdata/add_orderby_subquery Add order by subquery_complex_target_list	2019-04-11 13:08:04 +02:00
Onder Kalaci	6d81fc518c	Add order by subquery_complex_target_list	2019-04-10 19:55:41 +03:00
Hadi Moshayedi	1706813dd7	Merge pull request #2659 from citusdata/fix_more_order_bys Add missing ORDER BYs	2019-04-09 14:00:22 -07:00
Onder Kalaci	58e90ad60d	Add order by multi_outer_join	2019-04-09 12:53:57 +03:00
Onder Kalaci	298e95c441	Add order by multi_shard_update_delete	2019-04-09 12:41:46 +03:00
Onder Kalaci	6a8e2c260a	Add order by multi_insert_select	2019-04-09 12:28:57 +03:00
Onder Kalaci	af096a898c	Add order by subquery_and_cte	2019-04-09 12:19:10 +03:00
Onder Kalaci	56a1a39fd4	Add order by multi_subquery_complex_queries	2019-04-09 12:12:26 +03:00
Onder Kalaci	4effa8c1f8	Add order by multi_schema_support	2019-04-09 11:52:08 +03:00
Önder Kalacı	9c097c9f01	Merge pull request #2657 from citusdata/get_ready_for_unified_executor_order_bys Make sure that the regression tests are resistant to execution order changes	2019-04-08 10:54:20 +02:00
Onder Kalaci	92e87738dd	Make sure that the regression test output is durable to different execution orders Mostly add order bys and suppress worker node ports in the test outputs.	2019-04-08 11:48:08 +03:00
Jason Petersen	358ca53696	Separate follower tests and enable core dumps	2019-04-08 01:05:36 -06:00
Jason Petersen	25eece427f	Remove Travis config, etc.	2019-04-07 22:44:08 -06:00
Önder Kalacı	085b3dd6cb	Merge pull request #2656 from citusdata/get_ready_for_unified_executor Rename MultiConnectionState to MultiConnectionPollState	2019-04-05 15:38:23 +02:00
Onder Kalaci	7d872a343a	Rename MultiConnectionState to MultiConnectionPollState	2019-04-05 11:50:11 +03:00
Önder Kalacı	87db7a7578	Merge pull request #2647 from citusdata/fix_alloca_bug Ensure that stack resizing logic works as expected	2019-04-03 12:24:14 +02:00
Onder Kalaci	fb38dc3136	Ensure that stack resizing logic works as expected This commit has two goals: (a) Ensure that both edges of the allocated stack are accessed (b) Prevent compiler optimizations from optimizing the function away. Stack size after the patch: sudo grep -A 1 stack /proc/2119/smaps 7ffe305a6000-7ffe307a9000 rw-p 00000000 00:00 0 [stack] Size: 2060 kB Stack size before the patch: sudo grep -A 1 stack /proc/3610/smaps 7fff09957000-7fff09978000 rw-p 00000000 00:00 0 [stack] Size: 132 kB	2019-04-03 10:58:19 +03:00
Burak Velioglu	4a982358a6	Merge pull request #2644 from citusdata/citus-8.2.0-changelog-1553759261 Bump citus to 8.2.0	2019-03-28 14:48:18 +03:00
Burak Velioglu	c5a7827b48	Add changelog entry for 8.2.0	2019-03-28 13:45:17 +03:00
Murat Tuncer	e803eb8a02	Merge pull request #2631 from citusdata/fix_column_alias Fix column references to aliased joins	2019-03-26 13:17:56 +03:00
Murat Tuncer	1424f75ec9	Support columns referencing an aliased join We used to rely on PG function flatten_join_alias_vars to resolve actual columns referenced in target entry list. The function goes deep and finds the actual relation. This logic usually works fine. However, when joins are given an alias, inner relation names are not visible to the target entry. Thus relation resolving should stop when the target entry column refers to an RTE of an aliased join. We stopped using the PG function and provided our own flatten function.	2019-03-26 09:46:22 +03:00
Jason Petersen	f218549572	Merge pull request #2640 from citusdata/fix_bad_pruning Address unsafe coercion removal in pruning logic cr: @onderkalaci	2019-03-25 23:05:41 -05:00
Jason Petersen	4c7f78bd7e	Code review feedback	2019-03-25 22:07:27 -05:00
Jason Petersen	6a0dc7756e	Formatting fixes Noticed a lot of weird lines wrapped at 80; our standard is 90.	2019-03-22 20:32:19 -06:00
Jason Petersen	6acf52660c	Always coerce RHS of pruning op to part. key type Our assumption that strip_implicit_coercions would leave us with a binary-compatible type to that of the partition key was wrong. Instead, we should ensure the RHS of the comparison we perform is proactively coerced into a compatible type (at least binary compatible).	2019-03-22 20:32:19 -06:00
Jason Petersen	5baa257c91	Add second assert to guard against future changes This isn't entirely necessary but I feel safer with it here.	2019-03-22 20:32:19 -06:00
Jason Petersen	69adb627c3	Add Assert that will crash before coercion fix is in	2019-03-22 20:32:19 -06:00
Hadi Moshayedi	ff1d4f697a	Ignore test_times.log (#2638 )	2019-03-22 10:29:01 -07:00
Nils Dijk	feaac69769	Implementation for async FinishConnectionListEstablishment (#2584 )	2019-03-22 17:30:42 +01:00
Marco Slot	7a094edc4c	Merge pull request #2635 from citusdata/rescan_withhold Allow rescan in DECLARE .. WITH HOLD	2019-03-22 16:09:01 +01:00
Marco Slot	e3b7e74f43	Allow rescan in DECLARE .. WITH HOLD	2019-03-22 11:25:55 +01:00
Jason Petersen	1a7c73c37b	Merge pull request #2632 from citusdata/fix_conninfo_memory_bugs Fix conninfo memory bugs cr: @onderkalaci, @marcocitus	2019-03-21 12:47:12 -06:00
Jason Petersen	a2c6f596f9	Address code review comments	2019-03-21 11:59:52 -06:00
Jason Petersen	04aa34da68	Invalidate ConnParamsHash at config reload At configuration reload, we free all "global" (i.e. GUC-set) connection parameters, but these may still have live references in the connection parameters hash. By marking the entries as invalid, we can ensure they will not be used after free.	2019-03-21 00:03:35 -06:00
Jason Petersen	00d836e5a3	alloc non-global conn. params in provided context Having DATA-segment string literals made blindly freeing the keywords/values difficult, so I've switched to allocating all in the provided context; because of this (and with the knowledge of the end point of the global parameters), we can safely pfree non-global parameters when we come across an invalid connection parameter entry.	2019-03-21 00:03:35 -06:00
Önder Kalacı	67ecbe821a	Merge pull request #2633 from citusdata/trivial_parts_of_faster_all_things Decrease CPU overhead of some of the planner functions	2019-03-20 11:27:43 +01:00
Marco Slot	e8152d9b6d	Only look in top-level rtable in ExtractFirstDistributedTableId	2019-03-20 12:14:46 +03:00
Marco Slot	ee6a0b6943	Speed up RTE walkers Do it in two ways (a) re-use the rte list as much as possible instead of re-calculating over and over again (b) Limit the recursion to the relevant parts of the query tree	2019-03-20 12:14:46 +03:00
Marco Slot	5ff1821411	Cache the current database name Purely for performance reasons.	2019-03-20 12:14:46 +03:00
Marco Slot	0ea4e52df5	Add nodeId to shardPlacements and use it for shard placement comparisons Before this commit, shardPlacements were identified with shardId, nodeName and nodeport. Instead of using nodeName and nodePort, we now use nodeId since it apparently has performance benefits in several places in the code.	2019-03-20 12:14:46 +03:00
Önder Kalacı	32ee0217d5	Merge pull request #2617 from citusdata/add_more_tests Add some more regression tests for outer join pushdown	2019-03-19 11:01:20 +01:00
Onder Kalaci	41d8c4030a	Add some more regression tests for outer join pushdown	2019-03-19 11:49:38 +03:00
Önder Kalacı	7914a039a7	Merge pull request #2628 from citusdata/fix_infinite_recursion Some queries lead to infinite recursion during recursive planning	2019-03-18 15:15:25 +01:00
Onder Kalaci	ad5ff1d01a	Some queries lead to infinite recursion with recursive planning The rule for infinite recursion is the following: - If the query contains a subquery which is recursively planned, and no other subqueries can be recursively planned due to correlation (e.g., LATERAL joins), the planner keeps recursing again and again. One interesting thing here is that even if a subquery contains only intermediate result(s), we re-recursively plan that. In the end, the logic in the code does the following: - Try recursive planning any of the subqueries in the query tree - If any subquery is recursively planned, call the planner again where the subquery is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. - Try recursively planning any of the queries - If any subquery is recursively planned, call the planner again where the subquery (in this case it is already intermediate result) is replaced with the intermediate result. ......	2019-03-18 10:35:00 +03:00
Jason Petersen	8787cb3199	Merge pull request #2587 from citusdata/xact_functions Treat functions as transaction blocks cr: @jasonmp85	2019-03-15 16:54:20 -06:00
Marco Slot	f2abf2b8e5	Functions are treated as transaction blocks	2019-03-15 16:34:08 -06:00
Marco Slot	4b9bd54ae0	Remove create_insert_proxy_for_table	2019-03-15 14:13:03 -06:00
exialin	84b853e1b5	Fix some typos (#2620 )	2019-03-14 16:48:31 -07:00
Hadi Moshayedi	cd00e92cbc	Merge pull request #2629 from citusdata/disable_constraint_checking_on_coordinator Don't execute ALTER TABLE constraint checks in coordinator.	2019-03-14 16:14:30 -07:00
Hadi Moshayedi	a9e6d06a98	Skip execution of ALTER TABLE constraint checks on the coordinator	2019-03-14 15:40:56 -07:00
Hadi Moshayedi	cdd3b15ac8	Fix distributed deadlock for ALTER TABLE ... ATTACH PARTITION. Following scenario resulted in distributed deadlock before this commit: CREATE TABLE partitioning_test(id int, time date) PARTITION BY RANGE (time); CREATE TABLE partitioning_test_2009 (LIKE partitioning_test); CREATE TABLE partitioning_test_reference(id int PRIMARY KEY, subid int); SELECT create_distributed_table('partitioning_test_2009', 'id'), create_distributed_table('partitioning_test', 'id'), create_reference_table('partitioning_test_reference'); ALTER TABLE partitioning_test ADD CONSTRAINT partitioning_reference_fkey FOREIGN KEY (id) REFERENCES partitioning_test_reference(id) ON DELETE CASCADE; ALTER TABLE partitioning_test_2009 ADD CONSTRAINT partitioning_reference_fkey_2009 FOREIGN KEY (id) REFERENCES partitioning_test_reference(id) ON DELETE CASCADE; ALTER TABLE partitioning_test ATTACH PARTITION partitioning_test_2009 FOR VALUES FROM ('2009-01-01') TO ('2010-01-01');	2019-03-14 15:28:37 -07:00
Hadi Moshayedi	f19feb742c	Remove never assigned colocatedRelation from CreateDistributedTable (#2479 )	2019-03-12 14:50:18 -07:00
Hanefi Onaldi	2e0860489f	Merge pull request #2602 from citusdata/improve-mitmproxy-documentation Also: - migrate mitmproxy readme to markdown - create failure test contribution guidelines	2019-03-12 08:26:35 -07:00
Hanefi Onaldi	419f52884f	Merge branch 'master' into improve-mitmproxy-documentation	2019-03-12 07:16:01 -07:00
Murat Tuncer	e813df4d7f	Merge pull request #2601 from citusdata/fix_column_alias Add support for column aliases on join clauses	2019-03-07 13:39:11 +03:00
Murat Tuncer	2681231c98	Create column aliases for shard tables in worker queries when requested	2019-03-07 12:54:42 +03:00
Hadi Moshayedi	f4d3b94e22	Fix some of the casts for groupId (#2609 ) A small change which partially addresses #2608.	2019-03-05 12:06:44 -08:00
Burak Velioglu	900ffa76f5	Merge pull request #2597 from citusdata/full_outer_pushdown Fix full outer join with subquery pushdown	2019-03-05 17:08:08 +03:00
velioglu	faf50849d7	Enhance pushdown planning logic to handle full outer joins with using clause Since flattening query may flatten outer joins' columns into coalesce expr that is in the USING part, and that was not expected before this commit, these queries were erroring out. It is fixed by this commit with considering coalesce expression as well.	2019-03-05 11:49:30 +03:00
Önder Kalacı	6594ffafa1	Merge pull request #2618 from citusdata/fix_relation_size_leak Make sure to clear `PGresult` in missing places	2019-03-01 14:44:43 +01:00
Onder Kalaci	26f569abd8	Make sure to clear PGresult in a few places This leads to a memory leak otherwise.	2019-02-28 13:44:34 +03:00
Jason Petersen	bf9a119b6f	Merge pull request #2616 from citusdata/circleci Enable CircleCI	2019-02-26 23:48:11 -07:00
Jason Petersen	dceaae3b95	Remove codecov push from Travis build	2019-02-26 23:01:40 -07:00
Jason Petersen	6c3f7b665f	Squelch indentation errors (uncrustify is old in Travis)	2019-02-26 23:01:40 -07:00
Jason Petersen	3df2f51881	Turn on style-checking, fix lingering violations We'd been ignoring updating uncrustify for some time now because I'd thought these were misclassifications that would require an update in our rules to address. Turns out they're legit, so I'm checking them in.	2019-02-26 23:01:40 -07:00
Jason Petersen	383871af7e	Upload Codecov results after test runs Our first orb use!	2019-02-26 23:01:40 -07:00
Jason Petersen	5817bc3cce	Add test-timing script Through some clever stream redirections and options, we can get decent timing data for each of our tests.	2019-02-26 23:01:40 -07:00
Jason Petersen	1b605a6109	Modernize coverage options These hadn't been looked at in a while, and I'm somewhat certain they actually were running with optimization on, which is pretty bad. Swapped out the lower-level flags for `--coverage`, which will work with both `clang` and `gcc`. On some platforms, linker flags are needed as well.	2019-02-26 22:20:31 -07:00
Jason Petersen	5db45bac45	Enable CircleCI The configuration for the build is in the YAML file; the changes to the regression runner are backward-compatible with Travis and just add the logic to detect whether our custom (isolation- and vanilla-enabled) pkg is present.	2019-02-26 22:17:26 -07:00
Önder Kalacı	25b5fc9d14	Merge pull request #2610 from citusdata/improve_round_robin Add transactionId based round robin policy	2019-02-25 13:12:24 +01:00
Onder Kalaci	f706772b2f	Round-robin task assignment policy relies on local transaction id Before this commit, round-robin task assignment policy was relying on the taskId. Thus, even inside a transaction, the tasks were assigned to different nodes. This was especially problematic while reading from reference tables within transaction blocks. Because, we had to expand the distributed transaction to many nodes that are not necessarily already in the distributed transaction.	2019-02-22 19:26:38 +03:00
Önder Kalacı	acc2b0a387	Merge pull request #2606 from citusdata/fast_path_router_planner Introduce fast path router planning	2019-02-22 17:17:17 +01:00
Onder Kalaci	e521e7e39c	Apply feedback	2019-02-22 18:14:30 +03:00
Onder Kalaci	407d0e30f5	Fix selectForUpdate bug	2019-02-21 18:21:41 +03:00
Onder Kalaci	f144bb4911	Introduce fast path router planning In this context, we define "Fast Path Planning for SELECT" as trivial queries where Citus can skip relying on the standard_planner() and handle all the planning. For router planner, standard_planner() is mostly important to generate the necessary restriction information. Later, the restriction information generated by the standard_planner is used to decide whether all the shards that a distributed query touches reside on a single worker node. However, standard_planner() does a lot of extra things such as cost estimation and execution path generations which are completely unnecessary in the context of distributed planning. There are certain types of queries where Citus could skip relying on standard_planner() to generate the restriction information. For queries in the following format, Citus does not need any information that the standard_planner() generates: SELECT ... FROM single_table WHERE distribution_key = X; or DELETE FROM single_table WHERE distribution_key = X; or UPDATE single_table SET value_1 = value_2 + 1 WHERE distribution_key = X; Note that the queries might not be as simple as the above such that GROUP BY, WINDOW FUNCTIONS, ORDER BY or HAVING etc. are all acceptable. The only rule is that the query is on a single distributed (or reference) table and there is a "distribution_key = X;" in the WHERE clause. With that, we can determine the shard that the distributed query touches, which resides on a single worker node.	2019-02-21 13:27:01 +03:00
Marco Slot	fbc22aa6d3	Merge pull request #2521 from citusdata/citus-sql-auto-target Simplify make file for citus sql files	2019-02-20 12:06:27 +01:00
Nils Dijk	1623c44fc7	Simplify make file for citus sql files	2019-02-19 21:29:20 -05:00
Hanefi Onaldi	d6767ad521	Merge pull request #2572 from citusdata/execute-functions-on-coordinator Wrap function calls in joins inside subqueries	2019-02-04 23:27:04 +03:00
Hanefi Onaldi	148dcad0bb	More documentation and stale comments rewritten	2019-02-04 20:21:51 +03:00
Hanefi Onaldi	825666f912	Query samples in docs and better errors	2019-02-04 19:20:02 +03:00
Hanefi Onaldi	574b071113	Add wrapper function introduced in PG11 for compatibility	2019-02-04 19:20:02 +03:00
Hanefi Onaldi	1106e14385	Wrap functions in subqueries remove debug logs to fix travis tests Support RowType functions in joins Regression tests for a custom type function in join	2019-02-04 19:19:29 +03:00
Hanefi Onaldi	c5c3d6d0a3	Update the mitmscripts readme and migrate readme to markdown and create contribution guidelines	2019-02-04 11:30:05 +03:00
Hanefi Onaldi	588e7e673d	Merge pull request #2535 from citusdata/failure_mx_metadata_sync Failure/cancellation tests for mx metadata sync	2019-02-01 12:17:05 +03:00
Hanefi Onaldi	4dd1f5784b	Failure&cancellation tests for mx metadata sync Failure&Cancellation tests for initial start_metadata_sync() calls to worker and DDL queries that send metadata syncing messages to an MX node Also adds message type definitions for messages that are exchanged during metadata syncing -	2019-02-01 11:50:25 +03:00
Murat Tuncer	967b369f10	Merge pull request #2590 from citusdata/relax_reference_union_pushdown Relax reference table restrictions in subquery union pushdowns	2019-01-31 16:04:06 +03:00
Murat Tuncer	b36b59dd4f	Relax reference table restrictions in subquery union pushdowns We used to error out if there is a reference table in the query participating in a union. This has caused pushdownable queries to be evaluated in coordinator. Now we allow reference tables inside union queries as long as there is a distributed table in the from clause. Existing join checks (reference table on the outer part) are sufficient, so we do not need to check the join relation of reference tables.	2019-01-31 15:34:29 +03:00
Önder Kalacı	501eaebe77	Merge pull request #2598 from citusdata/fix_router_errors Queries with only intermediate results do not rely on task assignment policy	2019-01-28 16:39:29 +01:00
Onder Kalaci	ec67381ba2	Queries with only intermediate results do not rely on task assignment policy Previously we allowed task assignment policy to have an effect on router queries with only intermediate results. However, that is erroneous since the code-path that assigns placements relies on shardIds and placements, which don't exist for intermediate results. With this commit, we do not apply task assignment policies when a router query hits only intermediate results.	2019-01-28 17:59:17 +03:00
Murat Tuncer	913cac2391	Merge pull request #2599 from citusdata/fix_fk_from_partition_to_reference Fix partitioned table operations involving foreign key to reference table	2019-01-28 17:01:43 +03:00
Murat Tuncer	cd5213abee	Set sequential mode execution GUC for alter partitioned table PG recently started propagating foreign key constraints to partition tables. This came with a select query to validate the constraint. We are already setting sequential mode execution for this command. In order for validation select query to respect this setting we need to explicitly set the GUC. This commit also handles detach partition part.	2019-01-25 15:28:07 +03:00
Burak Velioglu	1f4f6ea041	Merge pull request #2585 from citusdata/plan_recursive_exception Reset planner context instead of popping with recursive planning	2019-01-17 17:24:19 +03:00
velioglu	1bb0ec316a	Reset planner restriction context instead of popping with recursive planning	2019-01-17 14:35:16 +03:00
Jason Petersen	339e6e661e	Remove 9.6 (#2554 ) Removes support and code for PostgreSQL 9.6 cr: @velioglu	2019-01-16 13:11:24 -07:00
Nils Dijk	0de756559c	Merge pull request #2576 from citusdata/test/base-valgrind Add make target to run regression tests in isolation with valgrind	2019-01-16 12:26:14 +01:00
Nils Dijk	3f2bac18df	Add make target to run regression tests in isolation with valgrind Also allow `multi_alter_table_add_constraints` to run in isolation	2019-01-16 11:41:09 +01:00
Jason Petersen	183b2d6c06	Remove 9.6 To spare people the pain while I finish my PR feedback.	2019-01-14 23:46:19 -07:00
Hanefi Onaldi	21f72f5002	Add changelog entry for 8.0.3 (#2581 )	2019-01-09 15:21:45 +03:00
Marco Slot	d7ee6f2127	Merge pull request #2481 from citusdata/outer_join_pushdown Plan outer joins through pushdown planning	2019-01-08 10:46:59 +01:00
Hanefi Onaldi	ad05634444	Add changelog entry for 8.1.1	2019-01-07 18:44:20 +03:00
Marco Slot	1656b519c4	Plan outer joins through pushdown planning	2019-01-05 20:55:27 +01:00
Murat Tuncer	cb77fcce85	Merge pull request #2569 from citusdata/fix_having_with_joins Fix having clause bug for complex joins	2019-01-04 13:32:28 +03:00
Murat Tuncer	b389bebda1	Move repeated code to a function	2019-01-03 17:19:01 +03:00
Murat Tuncer	a72d959735	Fix multi_view tests	2019-01-03 17:07:26 +03:00
Murat Tuncer	2ed7d24591	Fix having clause bug for complex joins We update column attributes of various clauses for a query including target columns, select clauses when we introduce new range table entries in the query. It seems having clause column attributes were not updated. This fix resolves the issue	2019-01-03 17:07:26 +03:00
Murat Tuncer	1d421e60f9	Merge pull request #2573 from citusdata/fix_more_spinlocks Move function calls that can fail to outside of spinlock	2019-01-03 16:55:45 +03:00
Murat Tuncer	ec36030fae	Move function calls that can fail to outside of spinlock We had recently fixed a spinlock issue due to functions failing while the spinlock was not being released. This is the continuation of that work to eliminate possible regression of the issue. Function calls that are moved out of spinlock scope are macros and plain type casting. However, depending on the configuration they have an alternate implementation in PG source that performs memory allocation. This commit moves the last bits of code out of the spinlock for completeness.	2019-01-03 15:59:56 +03:00
Murat Tuncer	1dbbc6664f	Merge pull request #2568 from citusdata/fix_spinlock_use Make sure spinlock is not left unreleased when an exception is thrown	2018-12-25 16:27:49 +03:00
Murat Tuncer	3b95a03c3e	Merge branch 'master' into fix_spinlock_use	2018-12-25 14:41:21 +03:00
Hadi Moshayedi	38579d52d0	Speed-up run_command_on_shards(). (#2564 ) We were establishing connections synchronously. Establishing connections asynchronously results in some parallelization, saving hundreds of milliseconds. In a test I did, this decreased the query time from 150ms to 40ms.	2018-12-24 08:47:01 -05:00
Murat Tuncer	9671bc3cbb	Make sure spinlock is not left unreleased when an exception is thrown A spinlock is not released when an exception is thrown after spinlock is acquired. This has caused infinite wait and eventual crash in maintenance daemon. This work moves the code that can fail to the outside of spinlock scope so that in the case of failure spinlock is not left locked since it was not locked in the first place.	2018-12-24 15:47:21 +03:00
Hanefi Onaldi	fb497ddad1	Bump 8.2devel on master (#2567 )	2018-12-24 13:49:50 +03:00
Jason Petersen	9da2254bfb	Merge pull request #2541 from citusdata/disable_appveyor Disable appveyor cr: @jasonmp85	2018-12-21 16:40:40 -07:00
Onder Kalaci	8dee92bad2	Revert `adb4669`	2018-12-21 15:36:41 -07:00
Onder Kalaci	9fff7d28a7	Revert `4925521`	2018-12-21 15:36:40 -07:00
Jason Petersen	ca83c48097	Merge pull request #2559 from citusdata/concurrent_concurrently Execute CREATE INDEX CONCURRENTLY in parallel cr: @jasonmp85	2018-12-21 15:36:17 -07:00
Marco Slot	2e4029973c	Remove sequential create index concurrently test	2018-12-21 14:03:00 -07:00
Marco Slot	1b1c6374f7	Execute CREATE INDEX CONCURRENTLY concurrently	2018-12-21 14:02:59 -07:00
Hadi Moshayedi	eb398580f7	Git ignore LLVM bitcode files. (#2565 )	2018-12-21 14:55:24 -05:00
Marco Slot	6ca91c1332	Merge pull request #2563 from citusdata/transactions_pgmonitor Restrict visibility of get_*_active_transactions functions to pg_monitor	2018-12-21 14:40:34 +01:00
Marco Slot	8a54999c5c	Merge pull request #2561 from citusdata/run-time-bound-check Move an assert-only array-bound check to run-time.	2018-12-20 17:09:04 +01:00
Marco Slot	3ff2b47366	Restrict visibility of get_*_active_transactions functions to pg_monitor	2018-12-19 18:32:42 +01:00
Dimitri Fontaine	6a1a2b8458	Move an assert-only array-bound check to run-time. When the bound-check fails at run-time, better abort with an error message rather than trying to use memory we did not allocate.	2018-12-19 06:12:05 +01:00
Marco Slot	13f4a0ac9f	Stabilize failure test shard IDs	2018-12-19 04:26:46 +01:00
Marco Slot	5b9376a7f8	Check ownership before taking locks in distributed table creation	2018-12-18 15:32:07 +01:00
Hanefi Onaldi	88717f31b3	Citus 8.1.0 changelog 1545053355 (#2553 ) * Add changelog entry for 8.1.0	2018-12-18 14:48:34 +03:00
Hanefi Onaldi	878af61fa6	Add changelog entry for 8.0.2 (#2546 )	2018-12-14 16:54:31 +03:00
Nils Dijk	595179706c	Merge pull request #2540 from citusdata/fix/enforce-tls upgrade default ssl_ciphers to more restrictive on extension creation	2018-12-12 15:56:59 +01:00
Nils Dijk	694992e946	upgrade default ssl_ciphers to more restrictive on extension creation Show ssl_ciphers in ssl_by_default_test	2018-12-12 15:33:15 +01:00
Marco Slot	02c144378c	Add DESCRIPTION to PR template	2018-12-12 05:35:12 +01:00
Jason Petersen	92893e9601	Fix control file version	2018-12-11 18:50:20 -07:00
Jason Petersen	bd0d1f05e7	Bump SQL version Should have been done when the release-8.0 branch was created…	2018-12-11 10:40:15 -07:00
Hanefi Onaldi	f12676e7b3	Added changelog entry for 7.5.4 (#2538 )	2018-12-11 16:27:37 +03:00
Burak Velioglu	7aaf6b2cb3	Merge pull request #2533 from citusdata/fix_function_oid Fix function oid	2018-12-10 14:33:50 +03:00
velioglu	90704d9a52	Fix getting function oid to get hll_add_agg id	2018-12-10 14:16:19 +03:00
velioglu	3e0cff94a6	Add FunctionOidExtended function	2018-12-10 11:59:41 +03:00
Nils Dijk	4af40eee76	Enable SSL by default during installation of citus	2018-12-07 11:23:19 -07:00
Burak Velioglu	fd3b0044b4	Merge pull request #2523 from citusdata/disable_hashagg_hll Adds support for disabling hash agg with hll functions on coordinator	2018-12-07 19:16:19 +03:00
velioglu	8764a19464	Adds support for disabling hash agg with hll functions on coordinator query	2018-12-07 18:49:25 +03:00
Marco Slot	298613824e	Merge pull request #2496 from citusdata/limit_transmit Only allow transmit from pgsql_job_cache directory	2018-12-06 16:25:47 +01:00
Marco Slot	9cf91c438b	Only allow transmit from pgsql_job_cache directory	2018-12-05 10:18:27 +01:00
Marco Slot	2967d8e65f	Merge pull request #2520 from citusdata/remove_memcpy Remove odd memcpy usage in BuildCachedShardList	2018-12-04 19:32:33 +01:00
Marco Slot	7c2a2d08af	Merge pull request #2511 from citusdata/planner_readme Expand planner readme	2018-12-04 14:29:37 +01:00
Marco Slot	70fb9c851b	Remove odd memcpy usage in BuildCachedShardList	2018-12-04 14:09:10 +01:00
Marco Slot	96b091a1e5	Merge pull request #2516 from citusdata/security-audit Review some strcpy/memcpy/strprintf/sscanf usage in the code.	2018-12-04 13:08:18 +01:00
Önder Kalacı	cb119f4f73	Merge pull request #2514 from citusdata/fix_total_procs Ensure to use initialized MaxBackends	2018-12-04 11:41:30 +01:00
Marco Slot	0388324fbe	Expand planner readme	2018-12-04 09:55:19 +01:00
Dimitri Fontaine	d1b182de7d	Replace calls to unsafe functions like memcpy and sscanf In answer to a security audit, we double check buffer sizes and avoid known-dangerous operations such as sscanf.	2018-12-04 08:54:43 +01:00
Onder Kalaci	621ccf3946	Ensure to use initialized MaxBackends Postgresql loads shared libraries before calculating MaxBackends. However, Citus relies on MaxBackends being set. Thus, with this commit we use the same steps to calculate MaxBackends while Citus is being loaded (e.g., PG_Init is called). Note that this is safe since all the elements that are used to calculate MaxBackends are PGC_POSTMASTER gucs and a constant value.	2018-12-03 13:25:51 +03:00
Onder Kalaci	b6ebd791a6	Sort task list for multi-task explain outputs This is purely for ensuring that regression tests do not randomly fail.	2018-11-30 11:19:37 -07:00
Önder Kalacı	89d32af3ad	Merge pull request #2509 from citusdata/fix_partitioning_test Make sure the explain output for partition wise join is stable	2018-11-30 15:16:01 +01:00
Onder Kalaci	18c9badff5	Make sure the explain output for partition wise join is stable We disable a bunch of planning options on the workers. This might be risky if any concurrent test relies on EXPLAIN OUTPUT as well. Still, we want to keep this test, so we should try to not parallelize this test with such tests.	2018-11-30 16:44:57 +03:00
Burak Velioglu	e2c4efbaa4	Merge pull request #2305 from citusdata/insert_select_onconflict INSERT...SELECT via coordinator with ON CONFLICT/RETURNING	2018-11-30 16:23:49 +03:00
Marco Slot	8893cc141d	Support INSERT...SELECT with ON CONFLICT or RETURNING via coordinator Before this commit, Citus supported INSERT...SELECT queries with ON CONFLICT or RETURNING clauses only for pushdownable ones, since queries supported via coordinator were utilizing COPY infrastructure of PG to send selected tuples to the target worker nodes. After this PR, INSERT...SELECT queries with ON CONFLICT or RETURNING clauses will be performed in two phases via coordinator. In the first phase selected tuples will be saved to the intermediate table which is colocated with target table of the INSERT...SELECT query. Note that a utility function to save results to the colocated intermediate result is also implemented as a part of this commit. In the second phase, INSERT...SELECT query is directly run on the worker node using the intermediate table as the source table.	2018-11-30 15:29:12 +03:00
Hanefi Onaldi	a9c299473b	Merge pull request #2507 from citusdata/error_out_grouping_sets_in_subqueries Error out when a subquery has grouping set clause	2018-11-30 15:14:45 +03:00
Hanefi Onaldi	088a2ef66a	throw an error when a subquery has grouping set clause	2018-11-30 13:11:32 +03:00
Önder Kalacı	5a8c79430e	Merge pull request #2508 from citusdata/fix_citus_stat_activity Ensure that citus_dist_activity test outputs do not change	2018-11-30 10:19:27 +01:00
Onder Kalaci	a15f168ce4	Ensure that citus_dist_activity test outputs do not change Since there is no lock ordering among the query that is executed and the select from the view, we prefer to add a timeout before printing the activity.	2018-11-30 11:46:17 +03:00
Nils Dijk	e17d98b0e3	Merge pull request #2486 from citusdata/fix/create-distributed-table-as-owner Fix create_distributed_table as non-table-owner	2018-11-29 16:16:34 +01:00
Nils Dijk	9309e63156	create_distributed_table as user, change table ownership during create	2018-11-29 14:20:42 +01:00
Nils Dijk	6aa191f72c	remove table_ddl_command_array and test master_get_table_ddl_events	2018-11-29 14:20:42 +01:00
Murat Tuncer	cc2422efee	Merge pull request #2505 from citusdata/fix_stat_statements_view Fix citus_stat_statements view	2018-11-29 16:16:52 +03:00
Murat Tuncer	fd868ec268	Fix citus_stat_statements view Join between pg_stat_statements and citus_query_stats should include queryid, dbid, userid instead of just queryid.	2018-11-29 14:49:16 +03:00
Marco Slot	a378a11da0	Merge pull request #2493 from citusdata/allow-adding-node-in-any-group Refrain from having a strong opinion on maxGroupId.	2018-11-28 16:25:15 +01:00
Marco Slot	435a036328	Merge pull request #2494 from citusdata/notsolocky Relax multi-shard modify locks when enable_deadlock_prevention is disabled	2018-11-28 15:21:44 +01:00
Jason Petersen	e8f3b32b64	Add changelog entry for 8.0.1	2018-11-27 23:09:41 -07:00
Jason Petersen	70a65c0d10	Add changelog entry for 7.5.3	2018-11-27 21:28:50 -07:00
Dimitri Fontaine	5ae2d03881	Refrain from having a strong opinion on maxGroupId. When initializing a Citus formation automatically from an external piece of software such as Citus-HA, the following process may be used: - decide on the groupId in the external software - SELECT * FROM master_add_inactive_node('localhost', 9701, groupid => X) When Citus checks for maxGroupId, it forbids other software to pick their own group Ids to use with the master_add_inactive_node() API. This patch removes the extra testing around maxGroupId.	2018-11-28 04:29:15 +01:00
Marco Slot	0393910c65	Shard IDs in isolation_citus_dist_stat_activity output changed	2018-11-28 02:59:50 +01:00
Marco Slot	aff37cf1bc	Control multi-shard modify locks with enable_deadlock_prevention	2018-11-28 02:59:50 +01:00
Marco Slot	6fd5c73444	Merge pull request #2492 from citusdata/udf_cleanup Clean up UDFs and revoke unnecessary permissions	2018-11-26 14:21:34 +01:00
Marco Slot	1ec5b6c890	Remove old worker_hash_partition_table API	2018-11-26 14:40:37 +01:00
Marco Slot	5a63deab2e	Clean up UDFs and remove unnecessary permissions	2018-11-26 14:40:37 +01:00
hanefi	27930aa462	Merge pull request #2488 from citusdata/validate_constraint Validate Constraint Support	2018-11-26 15:20:01 +03:00
Hanefi Onaldi	448b241ab4	validate query isolation tests	2018-11-26 14:04:51 +03:00
Hanefi Onaldi	4edb193f25	make the tests parallelizeable helper view table_fkeys_in_workers now allows filtering by schema so that a test case can print out foreign keys in its schema only	2018-11-26 14:04:51 +03:00
Hanefi Onaldi	b3d897039a	constraint validation regression tests	2018-11-26 14:04:51 +03:00
Hanefi Onaldi	7db6991dc0	propagate validate queries to workers	2018-11-26 14:04:51 +03:00
Marco Slot	2afbb89673	Merge pull request #2490 from citusdata/fix_tt_protocol Check ownership in task-tracker protocol functions	2018-11-26 11:45:50 +01:00
Marco Slot	e8e956aa9f	Require superuser when using non-existent job schema in worker_merge_files_into_table	2018-11-24 02:57:16 +01:00
Marco Slot	711eef611f	Merge pull request #2489 from citusdata/task_tracker_suffix Add user ID suffix to intermediate files in task-tracker	2018-11-23 14:17:37 +01:00
Marco Slot	c4ad899dd8	Check schema ownership in worker_merge_* functions	2018-11-23 11:05:09 +01:00
Marco Slot	e9a7295ead	Add multi-user tests for task-tracker protocol functions	2018-11-23 11:05:09 +01:00
Marco Slot	8e93fe5870	Check schema owner in task_tracker_assign_task	2018-11-23 11:05:09 +01:00
Marco Slot	ec957a833a	Check permission in task_tracker_task_status	2018-11-23 11:04:58 +01:00
Marco Slot	4245032849	Add user ID suffixes to filenames in check-worker tests	2018-11-23 08:36:12 +01:00
Marco Slot	6aa5592e52	Add user ID suffix to intermediate files in re-partition jobs	2018-11-23 08:36:11 +01:00
Marco Slot	f608739b4f	Merge pull request #2487 from citusdata/task_result_udf Execute SQL tasks using worker_execute_sql_task UDF when using task-tracker	2018-11-22 18:10:04 +01:00
Marco Slot	a59bf31c76	Use worker_execute_sql_task UDF in task-tracker executor	2018-11-22 18:15:33 +01:00
Marco Slot	30bad7e66f	Add worker_execute_sql_task UDF	2018-11-22 18:15:33 +01:00
Marco Slot	e3521ce320	Test current user in task-tracker queries	2018-11-22 18:15:33 +01:00
Marco Slot	caf402d506	COPY to a task file no longer switches to superuser	2018-11-22 18:15:33 +01:00
Marco Slot	9ff6f1c552	Merge pull request #2483 from citusdata/fix_udf_permissions Fix permissions checks in lesser-known UDFs	2018-11-20 14:51:57 +01:00
Marco Slot	e17025e1d4	Check table ownership in mark_tables_colocated	2018-11-18 00:11:38 +01:00
Marco Slot	18acd00553	Check permissions in lock_relation_if_exists	2018-11-18 00:11:38 +01:00
Marco Slot	aab9f623eb	Check table ownership in upgrade_to_reference_table	2018-11-16 23:27:34 +01:00
Önder Kalacı	fc9f981525	Merge pull request #2475 from citusdata/fix_some_user_permission_bugx Fix permissions checks in UDFs called from the drop table trigger	2018-11-15 16:46:38 +01:00
Onder Kalaci	052ba21b19	Make sure to prevent unauthorized users to drop sequences in Citus MX	2018-11-15 18:08:04 +03:00
Onder Kalaci	7f0a57a153	Make sure to prevent unauthorized users to drop tables in Citus MX	2018-11-15 18:07:03 +03:00
Nils Dijk	f9520be011	Round robin queries to reference tables with task_assignment_policy set to `round-robin` (#2472 ) Description: Support round-robin `task_assignment_policy` for queries to reference tables. This PR allows users to query multiple placements of shards in a round robin fashion. When `citus.task_assignment_policy` is set to `'round-robin'` the planner will use a round robin scheduling feature when multiple shard placements are available. The primary use-case is spreading the load of reference table queries to all the nodes in the cluster instead of hammering only the first placement of the reference table. Since reference tables share the same path for selecting the shards with single shard queries that have multiple placements (`citus.shard_replication_factor > 1`) this setting also allows users to spread the query load on these shards. For modifying queries we do not apply a round-robin strategy. This would be negated by an extra reordering step in the executor for such queries where a `first-replica` strategy is enforced.	2018-11-15 15:11:15 +01:00
Murat Tuncer	5f821e6f64	Merge pull request #2468 from citusdata/update_who_uses_citus Expand who is using Citus section in readme	2018-11-15 15:11:02 +03:00
Murat Tuncer	58cb473958	Expand who is using Citus section in readme - add new users (Pex, Algolia, Copper) to who is using Citus part - add a reference to more use cases in Citus web page	2018-11-15 14:42:54 +03:00
Marco Slot	586f398b47	Merge pull request #2473 from citusdata/node_udf_permissions Use function permissions to guard node metadata functions	2018-11-15 12:23:53 +01:00
Marco Slot	2de8ef29c3	Revoke function permissions for node metadata functions	2018-11-15 06:25:07 +01:00
Burak Velioglu	ef24895add	Merge pull request #2478 from citusdata/master-8.0.0-14112018 Add changelog entry for 7.5.2	2018-11-14 23:31:19 +03:00
velioglu	9b5e4942a0	Add changelog entry for 7.5.2	2018-11-14 21:26:29 +03:00
Nils Dijk	bafec7ea48	Merge pull request #2469 from citusdata/refactor/commands-extract Refactor the UtilityProcess function into its own module	2018-11-14 15:08:06 +01:00
Marco Slot	f383e4f307	Description: Refactor code that handles DDL commands from one file into a module The file handling the utility functions (DDL) for citus organically grew over time and became unreasonably large. This refactor takes that file and splits the functionality into separate files per command, initially modeled after the directory and file layout that can be found in postgres. Although the size of the change is quite big there are barely any code changes. Only two functions have been added for readability purposes: - PostProcessIndexStmt which is extracted from PostProcessUtility - PostProcessAlterTableStmt which is extracted from multi_ProcessUtility A README.md has been added to `src/backend/distributed/commands` describing the contents of the module and every file in the module. We need more documentation around the overloading of the COPY command, for now the boilerplate has been added for people with better knowledge to fill out.	2018-11-14 13:36:27 +01:00
Burak Yücesoy	ce463e9812	Merge pull request #2467 from citusdata/fix-crashes-on-ooms Fix crashes caused by stack size increase under high memory load	2018-11-14 10:49:06 +03:00
Burak Yucesoy	f8e0d37ba1	Fix crashes caused by stack size increase under high memory load Each PostgreSQL backend starts with a predefined amount of stack and this stack size can be increased if there is a need. However, stack size increase during high memory load may cause unexpected crashes, because if there is not enough memory for the stack size increase, there is nothing the process can do apart from crashing. Interestingly, the process would get an OOM error instead of a crash if it had made an explicit memory request (with palloc, for example). However, in the case of stack size increase, there is no system call to get an OOM error, so the process simply crashes. With this change, we are increasing the stack size explicitly by requesting extra memory from the stack, so that, even if there is not enough memory, we can at least get an OOM instead of a crash.	2018-11-14 01:27:53 +03:00
Nils Dijk	2cc803afe0	Merge pull request #2474 from citusdata/fix/min-client-message-fatal Fix failures of tests on recent postgres builds	2018-11-13 17:40:21 +01:00
Nils Dijk	97da44558b	Description: Fix failures of tests on recent postgres builds In recent postgres builds you cannot set client_min_messages to values higher than ERROR; it will silently be set to ERROR if you try. During some tests we would set it to fatal to hide random values (e.g. PIDs of processes) from the test output. This patch uses different tactics for hiding these values.	2018-11-13 16:53:05 +01:00
Murat Tuncer	22e516a4c4	Merge pull request #2471 from citusdata/update_hll_topn_versions Update topn and hll versions used in travis	2018-11-07 17:27:12 +03:00
Murat Tuncer	02bb44794a	Update topn and hll versions used in travis	2018-11-07 16:56:52 +03:00
Murat Tuncer	95ba630002	Merge pull request #2470 from citusdata/add_function_utils Create function_utils for pg function call related utilities	2018-11-07 16:00:49 +03:00
Murat Tuncer	cc401a2616	Create function_utils for pg function call related utilities	2018-11-07 15:29:38 +03:00
Hadi Moshayedi	d3e284dcd6	Use heap_deform_tuple() instead of calling heap_getattr(). (#2464 ) After Fast ALTER TABLE ADD COLUMN with a non-NULL default in PG11, physical heaps might not contain all attributes after an ALTER TABLE ADD COLUMN happens. heap_getattr() returns NULL when the physical tuple doesn't contain an attribute. So we should use heap_deform_tuple() in these cases, which fills in the missing attributes. Our catalog tables evolve over time, and an upgrade might involve some ALTER TABLE ADD COLUMN commands. Note that we don't need to worry about postgres catalog tables and we can use heap_getattr() for them, because they only change between major versions. This also fixes #2453.	2018-11-05 15:11:01 -05:00
Burak Velioglu	3616939dfa	Merge pull request #2466 from citusdata/changelog-8.0.0 Add changelog entry for 8.0.0	2018-10-31 14:24:47 +03:00
velioglu	9d0865e4dd	Add changelog entry for 8.0.0	2018-10-31 13:40:55 +03:00
Önder Kalacı	6d08a6b947	Merge pull request #2463 from citusdata/multi_row_insert_failure Add failure and cancellation tests for multi row inserts	2018-10-30 12:34:20 +03:00
Onder Kalaci	7aa2af8975	Add failure and cancellation tests for multi row inserts	2018-10-29 11:36:02 +03:00
Önder Kalacı	46120df7a9	Merge pull request #2462 from citusdata/cancellation_vacuum Add cancellation tests for VACUUM/ANALYZE	2018-10-26 20:11:04 +03:00
Onder Kalaci	7b4d912904	Add cancellation tests for VACUUM/ANALYZE	2018-10-26 16:25:11 +03:00
Önder Kalacı	c58bb37ad7	Merge pull request #2461 from citusdata/cancellation_multi_shard Add cancellation tests for multi shard modification queries	2018-10-26 15:35:31 +03:00
Onder Kalaci	85d7d074c3	Add cancellation tests for multi shard modification queries	2018-10-26 15:07:52 +03:00
Önder Kalacı	0926c9994a	Merge pull request #2458 from citusdata/add_router_cancel Add cancellation tests for router selects	2018-10-26 15:07:42 +03:00
Onder Kalaci	18eee6d9c8	Add cancellation tests for router selects	2018-10-26 14:29:56 +03:00
Nils Dijk	13226a9a3b	Merge pull request #2440 from citusdata/savepoint_failures Add savepoint failure tests	2018-10-26 11:59:37 +01:00
Jason Petersen	a37a809d49	Add savepoint failure tests Tests at each significant point (i.e. SAVEPOINT, ROLLBACK, RELEASE) that correct semantics are preserved (using both no and statement replication).	2018-10-26 11:12:40 +01:00
Önder Kalacı	0f768349e0	Merge pull request #2457 from citusdata/fix_valgrind_issue Make sure to access PARAM_EXTERN accurately in PG 11	2018-10-26 11:25:21 +03:00
Onder Kalaci	9e2e2a7300	Make sure to access PARAM_EXTERN accurately in PG 11 PG 11 has changed the way that PARAM_EXTERN is processed. This commit ensures that Citus follows the same pattern. For details see the related Postgres commit: `6719b238e8`	2018-10-25 21:55:03 +03:00
Murat Tuncer	c7891115ef	Merge pull request #2454 from citusdata/remove_cloudflare Remove CloudFlare from who uses citus section	2018-10-24 14:50:10 +03:00
Murat Tuncer	1e79492f35	Remove CloudFlare from who uses citus section	2018-10-24 13:58:54 +03:00
Önder Kalacı	0d84287a42	Merge pull request #2452 from citusdata/add_multi_shard_commands Processes that are blocked on advisory locks show up in wait edges	2018-10-24 13:57:52 +03:00
Onder Kalaci	6e05921736	Processes that are blocked on advisory locks show up in wait edges Assign the distributed transaction id before trying to acquire the executor advisory locks. This is useful to show this backend in citus lock graphs (e.g., dump_global_wait_edges() and citus_lock_waits).	2018-10-24 13:32:13 +03:00
Nils Dijk	980edac314	Merge pull request #2437 from citusdata/single_shard_mod_failure Add single-shard modification failure tests	2018-10-24 01:10:31 +02:00
Jason Petersen	98c8267a37	Add single-shard modification failure tests I'm pretty sure a lot of this test functionality may be covered in some of our existing regression tests, but I've included them to ensure we put all failure-based tests under our new testing method for that kind of test. Didn't include lower replication factor, as (for a single-shard mod.), it's indistinguishable from modifying a reference table. So these all test modifications which hit a single, replicated shard.	2018-10-23 23:31:40 +01:00
Hadi Moshayedi	5a274aa64c	Merge pull request #2450 from citusdata/fix_dropdb_if_exists Don't throw error for DROP DATABASE IF EXISTS	2018-10-23 10:10:47 -04:00
Hadi Moshayedi	3e00bf1c0d	Don't throw error for DROP DATABASE IF EXISTS	2018-10-23 09:45:03 -04:00
Murat Tuncer	28d307850c	Merge pull request #2446 from citusdata/dont_allow_failure make PG11 builds mandatory again	2018-10-19 15:49:30 +03:00
Murat Tuncer	081594ad03	Don't allow PG11 travis failures anymore We made PG11 builds optional when we had an issue with mx isolation test that we could not solve back then. This commit solves the issue with a workaround by running start_metadata_sync_to_node outside the transaction block.	2018-10-19 15:20:53 +03:00
Metin Döşlü	3a97988ada	Merge pull request #2444 from citusdata/memory_fix_attempt Attempt to address planner context crashes	2018-10-19 10:15:55 +03:00
Jason Petersen	ae9a98c2d1	Attempt to address planner context crashes Both of these are a bit of a shot in the dark. In one case, we noticed a stack trace where a caller received a null pointer and attempted to dereference the memory context field (at 0x010). In the other, I saw that any error thrown from within AdjustParseTree could keep the stack from being cleaned up (presumably if we push we should always pop). Both stack traces were collected during times of high memory pressure and reproducing the problem locally or otherwise has been very tricky (i.e. it hasn't been reproduced reliably at all).	2018-10-18 08:41:51 -06:00
Murat Tuncer	38c4059a16	Merge pull request #2431 from citusdata/mt_failure_insert_select_pushdown Add failure test for insert/select pushdown	2018-10-18 00:16:14 -07:00
Murat Tuncer	c7efd8aff0	Add failure test for insert/select pushdown	2018-10-18 09:09:26 +03:00
Sumedh Pathak	b15d1eee62	Update Contributing to install PG10 instead of 9.6 (#2436 ) * Update Contributing to note PG10 install * Updated suggested link to PG10	2018-10-16 12:31:17 -07:00
Hadi Moshayedi	431ac80563	Keep track of cached entries in case of interruption. (#2433 ) * Keep track of cached entries in case of interruption. Previously we set DistTableCacheEntry->sortedShardIntervalArray and DistTableCacheEntry->shardIntervalArrayLength after we entered all related shard entries into DistShardCacheHash. The drawback was that if populating DistShardCacheHash was interrupted, ResetDistTableCacheEntry() didn't see the shard hash entries created, so was unable to clean them up. This patch fixes that by setting sortedShardIntervalArray earlier, and incrementing shardIntervalArrayLength as we enter shards into the cache.	2018-10-15 14:06:56 -04:00
Marco Slot	a9f183a284	Merge pull request #2432 from citusdata/fix_typos Fix user-facing typos	2018-10-10 14:25:59 -07:00
Jason Petersen	9fb951c312	Fix user-facing typos Lintian found these (presumably by looking in the text section and running them through e.g. aspell).	2018-10-09 16:54:03 -07:00
Burak Velioglu	8b9aeb374b	Merge pull request #2425 from citusdata/real-time-select-failure Add failure tests for real time select queries	2018-10-09 14:27:45 -07:00
velioglu	5713019058	Add failure tests for real time select queries	2018-10-09 14:12:02 -07:00
Önder Kalacı	d5ebf22ba1	Merge pull request #2424 from citusdata/clear_intermediate_results Make sure not to leak intermediate result folders on the workers	2018-10-09 23:50:27 +03:00
Onder Kalaci	73696a03e4	Make sure not to leak intermediate result folders on the workers	2018-10-09 22:47:56 +03:00
Marco Slot	5886e69a3a	Merge pull request #2423 from citusdata/writable_standby_coordinator Allow simple DML commands from hot standby	2018-10-09 11:43:08 -07:00
Jason Petersen	1cb48416eb	Add reference table failure tests Fairly straightforward; verified that modifications fail atomically if a worker is down or fails mid-transaction (i.e. all workers need to ack modifications to reference tables in order to persist changes).	2018-10-09 09:39:30 -07:00
Jason Petersen	9bcf2873a7	Add single-shard router select failure tests Including several examples from #1926. I couldn't understand why the recover_prepared_transactions "should be an error", and EXPLAIN has changed since the original bug (so that it runs EXPLAINs in txns, I think for EXPLAIN ANALYZE to not have side effects); other than that, most of the reported bugs now error out rather than crash or return an empty result set.	2018-10-09 08:51:10 -07:00
Jason Petersen	8f2aa00951	Add failure tests for VACUUM/ANALYZE VACUUM runs outside of a transaction, so the failure modes for it are somewhat straightforward, though ANALYZE runs in a 1pc transaction and multi-table VACUUM can fail between statements (PG 11 and higher).	2018-10-09 08:50:37 -07:00
Jason Petersen	ee4114bc7a	Failure tests for modifying multiple shards in txn Tests various failure points during a multi-shard modification within a transaction with multiple statements. Verifies three cases: * Reference tables (single shard, many placements) * Normal table with replication factor two * Multi-shard table with no replication In the replication-factor case, we expect shard health to be affected in some transactions; most others fail the transaction entirely and all we need verify is that no effects of the transaction are visible. Had trouble testing the final PREPARE/COMMIT/ROLLBACK phase of the 2pc, in particular because the error message produced includes the PID of the backend, which is unpredictable.	2018-10-09 09:17:32 -06:00
Murat Tuncer	b45754a4d0	Merge pull request #2428 from citusdata/fix_mx_drop_schema_with_partitions Fix drop schema in mx with partitioned tables	2018-10-09 03:50:32 +03:00
Murat Tuncer	4f8042085c	Fix drop schema in mx with partitioned tables Drop schema command fails in mx mode if there is a partitioned table with active partitions. This is due to the fact that the sql drop trigger receives all the dropped objects including partitions. When we call drop table on the parent partition, it also drops the partitions on the mx node. This causes the drop table command on partitions to fail on the mx node because they were already dropped when the partition parent was dropped. With this change we no longer require the table to exist in worker_drop_distributed_table.	2018-10-08 17:01:54 -07:00
Murat Tuncer	24e247c1b9	Merge pull request #2426 from citusdata/failure_pull_push_insert_select Add failure tests for insert/select via coordinator	2018-10-08 19:32:29 +03:00
Hadi Moshayedi	7509c6c8fb	Add tests which check we disallow writes to local tables.	2018-10-06 10:54:44 +02:00
Marco Slot	d56baefe3d	Allow simple DML commands from hot standby	2018-10-06 10:54:44 +02:00
Murat Tuncer	71a910d2fa	Add failure tests for insert/select via coordinator	2018-10-04 18:01:19 +03:00
Murat Tuncer	c8151818e7	Merge pull request #2318 from citusdata/mt_failure_test Add new failure tests for multi-shard/CTE modify and cte coordinator pull	2018-10-03 17:07:03 +03:00
Murat Tuncer	0a987e9c0e	Fix cte subquery failure test	2018-10-03 15:43:48 +03:00
Murat Tuncer	d26b312cad	Add failure test for coordinator pull/push for cte	2018-10-03 15:43:48 +03:00
Murat Tuncer	6c66033455	Add failure tests for multi-shard update/delete Failure tests for update/delete on hash distributed tables using 1PC and 2PC	2018-10-03 15:43:48 +03:00
Burak Velioglu	322dd54eee	Merge pull request #2412 from citusdata/add_all_transactions_to_views Show router modify,select and real-time queries on MX views	2018-10-02 22:23:47 +03:00
velioglu	512d23934f	Show router modify,select and real-time queries on MX views	2018-10-02 13:59:38 +03:00
Murat Tuncer	43a4ef939a	Merge pull request #2410 from citusdata/mx_partition_foreign_key Do not create inherited constraints at worker tables	2018-09-28 16:53:13 +03:00
Murat Tuncer	9bdef67bab	Do not create inherited constraints on worker shards PG now allows foreign keys on partitioned tables. Each foreign key constraint on partitioned table is propagated down to partitions. We used to create all constraints on shards when we are creating a new shard, or when just simply moving a shard from one worker to another. We also used the same logic when creating a copy of coordinator table in mx node. With this change we create the constraint on worker node only if it is not an inherited constraint.	2018-09-28 14:14:51 +03:00
Murat Tuncer	0aa9988ae9	Merge pull request #2413 from citusdata/fix_memory_leak_minimal Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 13:54:07 +03:00
Murat Tuncer	653c7e4ae0	Fix memory leak in FinishRemoteTransactionPrepare	2018-09-28 11:13:21 +03:00
Önder Kalacı	6c389497ab	Merge pull request #2404 from citusdata/fix_truncate Make sure to use correct execution mode for TRUNCATE	2018-09-25 16:31:45 +03:00
Onder Kalaci	cdc0d1491c	Make sure to use correct execution mode for TRUNCATE We used to set the execution mode in the truncate trigger. However, when multiple tables are truncated with a single command, we could set the execution mode very late. Instead, now set the execution mode on the utility hook.	2018-09-25 15:35:27 +03:00
Marco Slot	b2c3fd891b	Merge pull request #2396 from citusdata/insert_select_parameters Do not allow unresolved parameters in INSERT...SELECT	2018-09-24 15:05:55 +02:00
Marco Slot	1ca9a5b867	Do not allow unresolved parameters in INSERT...SELECT	2018-09-24 14:12:04 +02:00
Murat Tuncer	535c535010	Merge pull request #2356 from citusdata/enable_jit Re-enable JIT in PostgreSQL 11 tests	2018-09-24 09:55:11 +03:00
Jason Petersen	d7f10b0896	Rewrite parallel ID test to avoid costly JITting By setting the CPU tuple cost so high, we were triggering JIT. Instead, we should use parallel_tuple_cost. See: rhaas.blogspot.com/2018/06/using-forceparallelmode-correctly.html	2018-09-24 09:29:53 +03:00
Jason Petersen	e62a1ab43d	Revert "Disable JIT during PostgreSQL 11 test runs" This reverts commit `a2fb5a84f1`. JIT wasn't actually interfering with the operation of Citus, a test was just written in a way which caused JIT to run for a function on every row in a 150k-row table.	2018-09-24 09:29:53 +03:00
Jason Petersen	26178f72d3	Merge pull request #2358 from citusdata/fix_func_eval Evaluate functions and parameters anywhere in query cr: @jasonmp85	2018-09-21 13:58:53 -06:00
Marco Slot	877d703ac5	Evaluate functions (and when applicable, parameters) anywhere in query	2018-09-21 12:57:50 -06:00
Metin Döşlü	22173ae272	Merge pull request #2389 from citusdata/partitioned_tables_with_replication Support partitioned tables with replication factor > 1	2018-09-21 18:06:41 +03:00
Onder Kalaci	abc443d7fa	Make sure that shard repair considers replication factor	2018-09-21 15:24:49 +03:00
Onder Kalaci	8520a5b432	worker_append_table_to_shard becomes aware of partitioned tables	2018-09-21 14:40:42 +03:00
Onder Kalaci	c1b5a04f6e	Allow partitioned tables with replication factor > 1 With this commit, we allow partitioned distributed tables with replication factor > 1. However, we also have many restrictions. In summary, we disallow all kinds of modifications (including DDLs) on the partition tables. Instead, the user is allowed to run the modifications over the parent table. The necessity for such a restriction has two aspects: - We need to acquire shard resource locks appropriately - We need to handle marking partitions INVALID in case of any failures. Note that, in theory, the parent table should also become INVALID, which is too aggressive.	2018-09-21 14:40:41 +03:00
Murat Tuncer	22f5af1bc3	Merge pull request #2391 from citusdata/truncate_utility Add distributed locking to truncated mx tables	2018-09-21 14:38:00 +03:00
Murat Tuncer	b6930e3db9	Add distributed locking to truncated mx tables We acquire distributed lock on all mx nodes for truncated tables before actually doing truncate operation. This is needed for distributed serialization of the truncate command without causing a deadlock.	2018-09-21 14:23:19 +03:00
Burak Velioglu	5b1dc0ac8d	Merge pull request #2381 from citusdata/add_citus_lock_waits Add citus_lock_waits to show locked distributed queries	2018-09-20 14:59:28 +03:00
velioglu	d7f75e5b48	Add citus_lock_waits to show locked distributed queries	2018-09-20 14:13:51 +03:00
Murat Tuncer	14d514d1df	Merge pull request #2383 from citusdata/pg11_drop_index Fix drop index bug on PG11 partitioned table	2018-09-20 12:24:56 +03:00
Murat Tuncer	0f6e514bfb	Fixes a bug on not being able to drop index on a partitioned table. Reason for the failure is that PG11 introduced a new relation kind RELKIND_PARTITIONED_INDEX to be used for partitioned indices. We expanded our check to cover that case.	2018-09-19 13:15:05 +03:00
Marco Slot	9215c00ee2	Merge pull request #2379 from citusdata/fix_procedure_rollback Fixes a bug preventing rollback in stored procedure	2018-09-19 11:35:30 +02:00
Önder Kalacı	513e753492	Merge pull request #2386 from citusdata/improve_walker Use tree walker instead of mutator in relation visibility	2018-09-18 10:22:51 +03:00
Onder Kalaci	41d606b575	Use tree walker instead of mutator in relation visibility This commit uses _walker instead of _mutator for performance reasons. Given that we're only updating a functionId in the tree, the approach seems fine.	2018-09-18 09:33:01 +03:00
Önder Kalacı	f16ae31ef7	Merge pull request #2376 from citusdata/fix_crash Relax assertion on transaction abort on PREPARE step	2018-09-17 22:57:06 +03:00
Onder Kalaci	4cae856846	Relax assertion on transaction abort on PREPARE step In case a failure happens when a transaction is failed on PREPARE, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if exists.	2018-09-17 18:09:16 +03:00
Önder Kalacı	8762af4473	Merge pull request #2384 from citusdata/fix_stuck_spinlock_max_backend Prevent overflow of memory accesses during deadlock detection	2018-09-17 18:08:31 +03:00
Onder Kalaci	a94184fff8	Prevent overflow of memory accesses during deadlock detection In the distributed deadlock detection design, we concluded that prepared transactions cannot be part of a distributed deadlock. The idea is that (a) when the transaction is prepared it already acquires all the locks, so cannot be part of a deadlock (b) even if some other processes are blocked on the prepared transaction, prepared transactions would eventually be committed (or rolled back) and the system will continue operating. With the above in mind, we probably made a mistake in terms of memory allocations. For each backend initialized, we keep a `BackendData` struct. The bug we've introduced is that we assumed there would only be `MaxBackends` number of backends. However, `MaxBackends` doesn't include the prepared transactions and auxiliary processes. When you check Postgres' `InitProcGlobal` you'd see that `TotalProcs = MaxBackends + NUM_AUXILIARY_PROCS + max_prepared_xacts;` This commit aligns the total number of procs processed with that.	2018-09-17 16:23:29 +03:00
Marco Slot	e0942d9df5	Merge pull request #2332 from citusdata/tablesample Support TABLESAMPLE in router queries	2018-09-17 15:02:58 +02:00
Jason Petersen	aa16512a81	Merge pull request #2325 from citusdata/failure-packet-dumps Attempt to stabilize packet dumps and add them back in cr: @jasonmp85	2018-09-13 10:10:44 -06:00
Brian Cloutier	2fae06056a	Attempt to stabilize packet dumps and add them back in	2018-09-12 22:10:39 -06:00
Jason Petersen	1f64bdfc59	Merge pull request #2322 from citusdata/update-mitmproxy-version Update mitmproxy version to remove vulnerability warnings cr: @jasonmp85	2018-09-12 18:25:42 -06:00
Brian Cloutier	5bde8626c5	Travis uses Pipfile instead of re-specifying deps	2018-09-12 17:37:14 -06:00
Brian Cloutier	e61e5d4980	Update mitmproxy version to remove vulnerability warnings	2018-09-12 17:17:22 -06:00
Murat Tuncer	c3dad4bcfd	Merge pull request #2375 from citusdata/pg11_procedure Add regression tests for procedure	2018-09-12 15:53:03 +03:00
Murat Tuncer	ae0032dff8	Add regression tests for procedure calls PG11 introduced the PROCEDURE concept, similar to FUNCTION. Procedures allow commit/rollback behavior. This commit adds regression tests for procedure calls.	2018-09-12 10:28:50 +03:00
Burak Velioglu	38dbc11c9a	Merge pull request #2363 from citusdata/get_whole_all_transactions Adds UDFs for testing MX functionalities with isolation tests	2018-09-12 07:43:53 +03:00
velioglu	d1f005daac	Adds UDFs for testing MX functionalities with isolation tests	2018-09-12 07:04:16 +03:00
Murat Tuncer	470ee0b4d9	Revert multi_partition test back to being required Test was marked as optional (ignore) by previous commit. Reverting that change to make test required	2018-09-11 12:39:44 -06:00
Önder Kalacı	8e608a0e6c	Merge pull request #2360 from citusdata/dist_stat_statements Views to provide some insight about the distributed transactions on Citus MX	2018-09-11 08:46:57 +03:00
Onder Kalaci	d657759c97	Views to Provide some insight about the distributed transactions on Citus MX With this commit, we implement two views that are very similar to pg_stat_activity, but showing queries that are involved in distributed queries: - citus_dist_stat_activity: Shows all the distributed queries - citus_worker_stat_activity: Shows all the queries on the shards that are initiated by distributed queries. Both views have the same columns in the outputs. In very basic terms, both of the views are meant to provide some useful insights about the distributed transactions within the cluster. As the names reveal, both views are similar to pg_stat_activity. Also note that these views can be pretty useful on Citus MX clusters. Note that when the views are queried from the worker nodes, they'd not show the distributed transactions that are initiated from the coordinator node. The reason is that the worker nodes do not know the host/port of the coordinator. Thus, it is advisable to query the views from the coordinator. If we bucket the columns that the views return, we'd end up with the following: - Hostnames and ports: - query_hostname, query_hostport: The node that the query is running on - master_query_host_name, master_query_host_port: The node in the cluster that initiated the query. Note that for the citus_dist_stat_activity view, the query_hostname-query_hostport is always the same as master_query_host_name-master_query_host_port. The distinction is mostly relevant for citus_worker_stat_activity. For example, on Citus MX, a user starts a transaction on Node-A, which starts worker transactions on Node-B and Node-C. In that case, the query hostnames would be Node-B and Node-C whereas the master_query_host_name would be Node-A. - Distributed transaction related things: This is mostly the process_id, distributed transactionId and distributed transaction number. - pg_stat_activity columns: These two views get all the columns from pg_stat_activity.
We're basically joining pg_stat_activity with get_all_active_transactions on process_id.	2018-09-10 21:33:27 +03:00
Önder Kalacı	df0ca4617f	Merge pull request #2370 from citusdata/fix_truncate_from_workers Fix the bug introduced via allowing Truncate from MX worker nodes	2018-09-10 18:03:48 +03:00
Onder Kalaci	7de5e30432	Change flaky explain test to non-explain This test's output changes depending on which worker is picked for explain (e.g., worker port in the output changes). Given that the test is only aiming to ensure that CTEs inside CTEs work fine in DML queries, it should be fine to get rid of the EXPLAIN. The output is verified to be correct as well.	2018-09-10 16:01:30 +03:00
Onder Kalaci	76aa6951c2	Properly send commands to other nodes We previously implemented OTHER_WORKERS_WITH_METADATA tag. However, that was wrong. See the related discussion: https://github.com/citusdata/citus/issues/2320 Instead, we switched to using OTHER_WORKER_NODES and made the command that we're running optional such that even if the node is not a metadata node, we won't be in trouble.	2018-09-10 16:01:30 +03:00
Onder Kalaci	5cf8fbe7b6	Add infrastructure to relation if exists	2018-09-07 14:49:36 +03:00
Önder Kalacı	5ddba6a7cd	Merge pull request #2367 from citusdata/fix_wrong_escaping Do not recover wrong distributed transactions in MX	2018-09-07 11:44:11 +03:00
Onder Kalaci	bf28dd0cff	Do not recover wrong distributed transactions in MX	2018-09-07 09:52:46 +03:00
Murat Tuncer	e5bba08595	Merge pull request #2366 from citusdata/fix_pg11_build_failure Reflect changed index for constraint scans in PG11	2018-09-07 08:23:04 +03:00
Murat Tuncer	65276311f7	Reflect changed index for constraint scans in PG11	2018-09-07 08:07:01 +03:00
Murat Tuncer	7b63b23808	Merge pull request #2364 from citusdata/pg11_feature_index_include Add support for INCLUDE option in index creation	2018-09-06 20:46:33 +03:00
Murat Tuncer	d8279569b8	Add support for INCLUDE option in index creation INCLUDE is a new feature in index creation in PG11. Included column/expression parameters are now forwarded to shards	2018-09-06 19:41:06 +03:00
Murat Tuncer	0c1bb26448	Merge pull request #2359 from citusdata/pg11_features Add regression tests related to new PG11 partitioning features	2018-09-06 19:39:06 +03:00
Murat Tuncer	7d3f7c2bf4	Add regression tests related to new PG11 partitioning features	2018-09-06 19:06:28 +03:00
Murat Tuncer	ee27637c11	Merge pull request #2368 from citusdata/allow_pg11_failures Temporarily allow PG11 failures	2018-09-06 18:48:27 +03:00
Murat Tuncer	c7094c083b	Temporarily allow PG11 failures This is a temporary commit to unblock PG11 failures at travis until end of milestone.	2018-09-06 18:33:43 +03:00
Murat Tuncer	49645d1aee	Merge pull request #2361 from citusdata/pg11_features_part_2 Add regression tests for new PG11 window functions	2018-09-06 08:55:55 +03:00
Murat Tuncer	55cf3e321c	Add regression tests for new PG11 window functions - <offset> preceding/following - exclude	2018-09-04 10:48:04 +03:00
Önder Kalacı	c64f669755	Merge pull request #2343 from citusdata/fix_drop_table_deadlock Make sure that table (and metadata) is dropped before shards are dropped on Citus MX	2018-09-04 09:44:57 +03:00
Onder Kalaci	1b3257816e	Make sure that table is dropped before shards are dropped This commit fixes a bug where a concurrent DROP TABLE deadlocks with SELECT (or DML) when the SELECT is executed from the workers. The problem was that Citus used to remove the metadata before dropping the table on the workers. That creates a time window where the SELECT starts running on some of the nodes and DROP TABLE on some of the other nodes.	2018-09-04 08:57:20 +03:00
Önder Kalacı	3ace0ad5eb	Merge pull request #2345 from citusdata/truncate_on_workers Support TRUNCATE from the MX worker nodes	2018-09-03 15:29:14 +03:00
Onder Kalaci	2ab0e63b30	Fix flaky test	2018-09-03 14:06:32 +03:00
Onder Kalaci	26e308bf2a	Support TRUNCATE from the MX worker nodes This commit enables support for TRUNCATE on both distributed tables and reference tables. The basic idea is to acquire a lock on the relation by sending the TRUNCATE command to all metadata worker nodes. We only skip sending the TRUNCATE command to the node that actually executes the command to prevent a self-distributed-deadlock.	2018-09-03 14:06:31 +03:00
Onder Kalaci	97ba7bf2eb	Add the option to skip the node that is executing the command	2018-09-03 14:01:24 +03:00
Marco Slot	f34ab55389	Fix bug preventing rollback in stored procedure	2018-08-31 20:49:20 +02:00
Marco Slot	55f46acedf	Support TABLESAMPLE in router queries	2018-08-31 13:22:38 +02:00
Burak Velioglu	a4c6cefb17	Merge pull request #2351 from citusdata/master-update-changelog-28082018 Add changelog entry for 7.5.1	2018-08-28 14:09:22 +03:00
velioglu	b6cee4bb96	Add changelog entry for 7.5.1	2018-08-28 13:34:37 +03:00
Burak Velioglu	eb2318f413	Merge pull request #2333 from citusdata/dml_on_ref_mx Adds support for writing to reference tables from MX nodes.	2018-08-28 09:06:43 +03:00
velioglu	bd30e3e908	Add support for writing to reference tables from MX nodes	2018-08-27 18:15:04 +03:00
velioglu	2639149bd8	Enterprise functions about metadata/resource locks	2018-08-27 16:32:20 +03:00
Önder Kalacı	6cea01620a	Merge pull request #2339 from citusdata/fix_modifying_cte Make sure that modifying CTEs always use the correct execution mode	2018-08-23 15:44:38 +03:00
Onder Kalaci	b8af8c359b	Make sure that modifying CTEs always use the correct execution mode	2018-08-23 14:53:55 +03:00
Önder Kalacı	1abfce9969	Merge pull request #2346 from citusdata/fix_rte_walker Improve query pushdown planning for very large number of shards and queries with many tables	2018-08-22 20:37:27 +03:00
Onder Kalaci	910ea392f5	Prevent multiple placements of a single shard from leading to huge memory allocations	2018-08-22 19:25:01 +03:00
Onder Kalaci	cb481f55cf	Prevent an excessive number of unnecessary range table traversals	2018-08-22 11:45:00 +03:00
Mehmet Furkan ŞAHİN	235542fe6b	Merge pull request #2342 from citusdata/applyLogRedactionNoop ApplyLogRedaction noop func is added	2018-08-17 15:11:47 -07:00
mehmet furkan şahin	ef9f38b68d	ApplyLogRedaction noop func is added	2018-08-17 14:48:54 -07:00
Jason Petersen	c3c0d62ca6	Add test showing poolinfo validation works In other words, that it errors out.	2018-08-16 20:14:18 -06:00
Jason Petersen	900e88057c	Merge pull request #2341 from citusdata/disable_jit Disable jit for now, make PG11 required cr: @jasonmp85	2018-08-16 20:11:45 -06:00
Jason Petersen	fd32c3590b	Mark PostgreSQL 11 builds as required We will no longer accept regressions in our support for PostgreSQL 11.	2018-08-16 19:38:55 -06:00
Jason Petersen	c4e2349b80	Mark failing PostgreSQL 11 test as ignored This commit should be reverted once a new PostgreSQL 11 beta is available: it's due to a bug in the partitioning code which has been fixed in REL_11_STABLE but not (yet) in a released tag.	2018-08-16 19:37:37 -06:00
Jason Petersen	a2fb5a84f1	Disable JIT during PostgreSQL 11 test runs It's causing problems with one of our tests.	2018-08-16 19:37:14 -06:00
Jason Petersen	54789d6896	Merge pull request #2324 from citusdata/test/pg11-compatibility make tests pass on pg 11 again cr: @jasonmp85	2018-08-15 23:52:47 -06:00
Nils Dijk	6cf4516fdb	fix \d change for indexes in pg11	2018-08-15 23:27:31 -06:00
Nils Dijk	2a9d47e1a6	fix pg11 tests	2018-08-15 23:27:31 -06:00
Mehmet Furkan ŞAHİN	3f0317dfb8	Merge pull request #2327 from citusdata/master_disable-activate_node_with_superuser Make master_disable/activate_node runnable when superuser	2018-08-15 01:09:09 -07:00
mehmet furkan şahin	1a3b9f731e	Make master_disable/activate_node runnable when superuser	2018-08-15 00:43:35 -07:00
Önder Kalacı	904e1781fa	Merge pull request #2335 from citusdata/fix_mx_schema Fix DDL execution bug on MX when search_path is used	2018-08-13 19:52:21 +03:00
Onder Kalaci	85d418412d	Fix DDL execution problem on MX when search_path is used Make sure that the coordinator sends the commands when the search path is synchronised with the coordinator's search_path. This is only important when Citus sends the commands that are directly relayed to the worker nodes. For example, the deparsed DDL commands or queries always add schema qualifications to the queries. So, they do not require this change.	2018-08-13 16:34:50 +03:00
Burak Velioglu	537d80abdb	Merge pull request #2279 from citusdata/add_create_dist_table_failure Add create_distributed_table (without data) failure tests	2018-08-13 10:25:55 +03:00
velioglu	44fc9f46fc	Add create_distributed_table (without data) failure tests	2018-08-13 09:31:15 +03:00
Önder Kalacı	3fa04d8f2c	Merge pull request #2323 from citusdata/shards_go_behind_schema Alternative approach for hiding shards on the MX workers for better UX	2018-08-07 16:24:36 +03:00
Onder Kalaci	974cbf11a5	Hide shard names on MX worker nodes This commit by default enables hiding shard names on MX workers by simply replacing `pg_table_is_visible()` calls with `citus_table_is_visible()` calls on the MX worker nodes. The latter function filters out tables that are known to be shards. The main motivation of this change is a better UX. The functionality can be opted out via a GUC. We also added two views, namely citus_shards_on_worker and citus_shard_indexes_on_worker such that users can query them to see the shards and their corresponding indexes. We also added debug messages such that the filtered tables can be interactively seen by setting the level to DEBUG1.	2018-08-07 14:21:45 +03:00
Onder Kalaci	e13da6a343	Add infrastructure to hide shards on MX worker nodes Add ability to understand whether a table is a known shard on MX workers. Note that this is only useful and applicable for hiding shards on MX worker nodes given that we can have metadata only there.	2018-08-04 09:03:37 +03:00
Mehmet Furkan ŞAHİN	28fd63bee2	Merge pull request #2280 from citusdata/failure-create_distributed_table-non-empty Add create_distributed_table (with data) failure tests	2018-08-03 12:55:04 -07:00
mehmet furkan şahin	c1f7631f98	failure tests on create_distributed_table nonempty	2018-08-03 12:41:25 -07:00
Burak Velioglu	dc55498f80	Merge pull request #2281 from citusdata/add_create_reference_table Add create_reference_table failure tests	2018-08-03 18:22:08 +03:00
velioglu	b21bd2d1a0	Add create_reference_table failure tests	2018-08-03 17:49:57 +03:00
Burak Velioglu	74ee7492ee	Merge pull request #2273 from citusdata/add_hash_copy_failure Add failure test for copy on hash distributed table	2018-08-03 17:41:29 +03:00
velioglu	bc27651dd9	Add failure test for copy on hash distributed table	2018-08-03 17:11:09 +03:00
Brian Cloutier	82fa85fa5b	Add tests for 1PC COPY on append and hash-distributed tables	2018-07-31 15:17:59 -07:00
Brian Cloutier	f0f7a691a3	Prevent failure tests from hanging by using a port outside the ephemeral port range - mitmdump now listens on port 9060 - Add some logging to fluent.py, making issues like this easier to debug in the future - Fail the tests if something is already running on the port mitmProxy tries to use - check-failure now works with VPATH builds	2018-07-31 14:30:56 -07:00
Mehmet Furkan ŞAHİN	6ac0434cf3	Merge pull request #2278 from citusdata/failure-copy-reference Adds failure tests for COPY to reference table	2018-07-30 14:40:22 +03:00
mehmet furkan şahin	dde86cb731	Copy to reference table failure tests are added	2018-07-30 11:48:12 +03:00
Mehmet Furkan ŞAHİN	39cc54b4b5	Merge pull request #2316 from citusdata/citus-7.4.2-changelog-1532690856 Add changelog entry for 7.4.2	2018-07-27 15:36:51 +03:00
mehmet furkan şahin	2350eaa4c1	Add changelog entry for 7.4.2	2018-07-27 15:06:55 +03:00
Mehmet Furkan ŞAHİN	50cb434377	Merge pull request #2315 from citusdata/version-8.0-bump-fix Citus versioning fix to 8.0devel	2018-07-26 17:29:00 +03:00
mehmet furkan şahin	bc757845eb	Citus versioning fix	2018-07-26 10:56:34 +03:00
Brian Cloutier	ace248d13c	Remove unnecessary calls to 'conn.allow()'	2018-07-25 17:45:00 -07:00
Mehmet Furkan ŞAHİN	28d572cc00	Merge pull request #2311 from citusdata/master-update-version-1532507443 Bump Citus to 8.0devel	2018-07-25 12:27:14 +03:00
mehmet furkan şahin	887aa8150d	Bump citus version to 8.0devel	2018-07-25 12:03:47 +03:00
Mehmet Furkan ŞAHİN	1e95f5dc93	Merge pull request #2310 from citusdata/citus-7.5.0-changelog-1532467475 Bump citus to 7.5.0	2018-07-25 11:25:53 +03:00
mehmet furkan şahin	c7203c3c9a	Add changelog entry for 7.5.0	2018-07-25 11:06:40 +03:00
Mehmet Furkan ŞAHİN	854e49101c	Merge pull request #2296 from citusdata/add_column_fkey_fix ALTER TABLE %s ADD COLUMN %s [constraint] constraint checks are implemented	2018-07-24 16:06:29 +03:00
velioglu	e23625bf5e	Use contype to check for FK constraint instead of reading catalog table	2018-07-24 15:53:05 +03:00
mehmet furkan şahin	6d0fbbace7	ALTER TABLE %s ADD COLUMN constraint check is added	2018-07-24 15:53:05 +03:00
Jason Petersen	21114620a3	Merge pull request #2299 from citusdata/fix_real_time Don't try to check unopened connection in EXEC_TASK_FAILED state cr: @jasonmp85	2018-07-23 12:12:49 -06:00
Marco Slot	625816242a	Don't try to check unopened connection in EXEC_TASK_FAILED state	2018-07-23 11:41:02 -06:00
Nils Dijk	6ae5fcd6c9	Merge pull request #2297 from citusdata/fix/insert-select-onconflict Fix insert ... select on conflict that allowed updates of distribution column	2018-07-23 16:19:46 +02:00
Nils Dijk	2d13900230	error on unsupported changing of distribution column in ON CONFLICT for INSERT ... SELECT	2018-07-23 15:18:21 +02:00
Nils Dijk	6a15e1c9fc	extract ErrorIfOnConflictNotSupported function for reuse	2018-07-23 12:20:10 +02:00
Nils Dijk	5b02e139e7	Merge pull request #2298 from citusdata/fix/error-msg-tablein fix missing space for tablein in error	2018-07-23 12:18:40 +02:00
Nils Dijk	df98900f80	fix missing space for tablein in error	2018-07-20 15:05:13 +02:00
Marco Slot	29edbae152	Merge pull request #2275 from citusdata/fix_user Ensure we create shards as the shard owner	2018-07-19 19:38:11 +02:00
Marco Slot	69a3ebea5f	Ensure StartPlacementListConnection connects with username supplied by the caller	2018-07-19 20:10:11 +02:00
Marco Slot	1485945f27	Delete pull_request_template.md	2018-07-19 15:38:43 +02:00
Murat Tuncer	2e62dafd3c	Merge pull request #2277 from citusdata/mt_failure_add_node Added failure test add/disable/inactive nodes	2018-07-13 18:22:02 +03:00
Murat Tuncer	a837dde1a0	Add failure tests for master add/remove/disable/active node	2018-07-13 18:06:24 +03:00
Mehmet Furkan ŞAHİN	a05817e2ec	Merge pull request #2272 from citusdata/failure-truncate Truncate failure tests are added	2018-07-13 14:29:38 +03:00
mehmet furkan şahin	f854420079	truncate failure tests are added	2018-07-13 13:20:50 +03:00
Murat Tuncer	b51d252fcc	Merge pull request #2210 from citusdata/mt_failure_test Added failure test for create index concurrently	2018-07-13 13:18:10 +03:00
Murat Tuncer	2795494758	Added failure test for create index concurrently	2018-07-13 11:53:49 +03:00
Önder Kalacı	54332957fa	Merge pull request #2212 from citusdata/ddl_failure_testing DDL failure testing	2018-07-13 10:55:49 +03:00
Onder Kalaci	a446e71ee7	Add failure testing for DDL commands This commit adds extensive failure testing, which covers quite a few things and their combinations: - 1PC vs 2PC - Replication factor 1 and Replication factor 2 - Network failures and query cancellations - Sequential vs Parallel query execution mode	2018-07-12 13:05:29 +03:00
Jason Petersen	07ac909410	Merge pull request #2268 from citusdata/add_poolinfo Add pg_dist_poolinfo table cr: @marcocitus	2018-07-10 10:09:41 -07:00
Jason Petersen	318119910b	Add pg_dist_poolinfo table For storing nodes' pool host/port overrides.	2018-07-10 09:30:22 -07:00
Mehmet Furkan ŞAHİN	1c24380877	Merge pull request #2193 from citusdata/topn_agg_support Topn aggregate support	2018-07-10 15:36:33 +03:00
mehmet furkan şahin	93e2d26226	.travis.yml change to install TopN on travis	2018-07-10 14:33:42 +03:00
mehmet furkan şahin	3afa7f425d	Topn aggregates are supported	2018-07-10 14:33:42 +03:00
Marco Slot	54f5fb3b26	Merge pull request #2264 from citusdata/marcocitus-patch-1 Create pull_request_template.md with DESCRIPTION line	2018-07-10 13:16:49 +02:00
Marco Slot	a70b06f194	Add DESCRIPTION to PR template	2018-07-10 13:35:13 +02:00
Murat Tuncer	05e182128d	Merge pull request #2266 from citusdata/landlord_base Make citus_stat_statements_reset() super user function	2018-07-10 13:23:34 +03:00
Murat Tuncer	a7277526fd	Make citus_stat_statements_reset() super user function	2018-07-10 11:21:20 +03:00
Marco Slot	74a5b1e14a	Merge pull request #2257 from citusdata/select_opens_transaction_block Add a GUC to disable opening transaction blocks on workers for SELECT-only xacts	2018-07-09 22:34:49 +02:00
Marco Slot	89870e76ce	Add a select_opens_transaction_block GUC	2018-07-08 03:50:39 +02:00
Brian Cloutier	a54f9a6d2c	network proxy-based failure testing - Lots of detail is in src/test/regress/mitmscripts/README - Create a new target, make check-failure, which runs tests - Tells travis how to install everything and run the tests	2018-07-06 12:38:53 -07:00
Burak Velioglu	c6cf40e9c7	Merge pull request #2261 from citusdata/changelog_6-2-6 Add changelog entry for 6.2.6	2018-07-06 15:37:22 +03:00
velioglu	a7200d557d	Add changelog entry for 6.2.6	2018-07-06 15:24:01 +03:00
velioglu	76477fd44e	Revert "Add changelog entry for 6.2.5" This reverts commit `50807bca98`.	2018-07-06 15:23:03 +03:00
Burak Velioglu	30fa6e57a4	Merge pull request #2260 from citusdata/changelog_6-2-5 Add changelog entry for 6.2.5	2018-07-06 15:14:43 +03:00
velioglu	50807bca98	Add changelog entry for 6.2.5	2018-07-06 14:58:38 +03:00
Murat Tuncer	cdbe752e2d	Merge pull request #2255 from citusdata/relax_count_distinct Expand count distinct support	2018-07-06 11:44:38 +03:00
Murat Tuncer	f20258ef10	Expand count distinct support We can now support more complex count distinct operations by pulling necessary columns to coordinator and evaluating the aggregate at coordinator. It supports a broad range of expressions with the restriction that the expression must contain a column.	2018-07-06 09:44:20 +03:00
Önder Kalacı	80132b5481	Merge pull request #2253 from citusdata/improve_foreign_keys_code Some stylistic improvements in the foreign keys to reference table changes	2018-07-06 00:51:15 +03:00
Onder Kalaci	7fb529aab9	Some stylistic improvements in the foreign keys to reference table changes.	2018-07-05 23:23:34 +03:00
Marco Slot	6656592b8e	Merge pull request #2243 from citusdata/better-guc-description Remove sslmode structs and add more helpful description	2018-07-05 14:33:05 +02:00
Brian Cloutier	735218ee5d	Remove sslmode structs and add more helpful description	2018-07-05 14:12:36 +02:00
Nils Dijk	1bff1c2d3d	Merge pull request #2254 from citusdata/feature/policy-stub create placeholder for policy ddl	2018-07-05 12:45:35 +02:00
Nils Dijk	c1c8c38dc9	create placeholder for policy ddl	2018-07-05 11:07:01 +02:00
Mehmet Furkan ŞAHİN	19e92ee369	Merge pull request #2186 from citusdata/hll_agg_support Hll aggregate support	2018-07-05 10:42:00 +03:00
mehmet furkan şahin	df11dda750	hll aggregates are tested	2018-07-05 08:19:01 +03:00
mehmet furkan şahin	06217be326	hll aggregate functions are supported natively	2018-07-04 16:41:09 +03:00
Marco Slot	0c33d7ea9a	Merge pull request #2251 from citusdata/landlord_base Move partition key logging related code from enterprise	2018-07-04 12:34:28 +02:00
Murat Tuncer	901066a421	Move partition key logging related code from enterprise	2018-07-04 13:11:34 +03:00
Mehmet Furkan ŞAHİN	b480bf8d0e	Merge pull request #2240 from citusdata/foreign_master Foreign Key support from distributed to reference tables	2018-07-03 17:55:59 +03:00
mehmet furkan şahin	f7b901e3fd	CopyShardForeignConstraintCommandList API change for grouped constraints	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	35eac2318d	lock referenced reference table metadata is added For certain operations in enterprise, we need to lock the referenced reference table shard distribution metadata	2018-07-03 17:05:55 +03:00
Onder Kalaci	d83be3a33f	Enforce foreign key restrictions inside transaction blocks When a hash distributed table has a foreign key to a reference table, there are a few restrictions we have to apply in order to prevent distributed deadlocks or reading wrong results. The necessity to apply the restrictions arises from the cascading nature of foreign keys. When a foreign key on a reference table cascades to a distributed table, a single operation over a single connection can acquire locks on multiple shards of the distributed table. Thus, any parallel operation on that distributed table, in the same transaction should not open parallel connections to the shards. Otherwise, we'd either end up with a self-distributed deadlock or read wrong results. As briefly described above, the restrictions are applied by tracking the distributed/reference relation accesses inside transaction blocks, and acting accordingly when necessary. The two main rules are as follows: - Whenever a parallel distributed relation access conflicts with a consecutive reference relation access, Citus errors out - Whenever a reference relation access is followed by a conflicting parallel relation access, the execution mode is switched to sequential mode. There are also some other notes to mention: - If the user does SET LOCAL citus.multi_shard_modify_mode TO 'sequential';, all the queries should simply work using one connection per worker and sequentially executing the commands. That's obviously a slower approach than Citus' usual parallel execution. However, we at least have a way to run all commands successfully. - If an unrelated parallel query is executed on any distributed table, we cannot switch to sequential mode, because the essence of sequential mode is using one connection per worker. However, in the presence of a parallel connection, the connection manager picks those connections to execute the commands. That contradicts our purpose, thus we error out. 
- COPY to a distributed table cannot be executed in sequential mode. Thus, if we switch to sequential mode and COPY is executed, the operation fails and there is currently no way of implementing that. Note that, when the local table is not empty and create_distributed_table is used, Citus uses COPY internally. Thus, in those cases, create_distributed_table() will also fail. - There is a GUC called citus.enforce_foreign_key_restrictions to disable all the checks. We added that GUC since the restrictions we apply are sometimes a bit more restrictive than necessary. The user might want to relax those. Similarly, if you don't have CASCADEing reference tables, you might consider disabling all the checks.	2018-07-03 17:05:55 +03:00
velioglu	6be6911ed9	Create foreign key relation graph and functions to query on it	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	89a8d6ab95	FK from dist to ref is tested for partitioning, MX	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	4db72c99f6	Specific DDLs are sequentialized when there is FK -[x] drop constraint -[x] drop column -[x] alter column type -[x] truncate are sequentialized if there is a foreign constraint from a distributed table to a reference table on the affected relations by the above commands.	2018-07-03 17:05:55 +03:00
mehmet furkan şahin	e37f76c276	tests are added	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	2c5d59f3a8	create_distributed_table in transaction is fixed	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	45f8017f42	create_distributed_table with fk to ref table is implemented	2018-07-03 17:05:01 +03:00
mehmet furkan şahin	2fa4e38841	FK from dist to ref can be added with alter table	2018-07-03 17:05:01 +03:00
Murat Tuncer	58486fd1b9	Merge pull request #2247 from citusdata/enable_preloading_libraries Add pg_stat_statements to shared_preload_libraries if installed	2018-07-03 16:59:34 +03:00
Murat Tuncer	3fc98e8225	Add pg_stat_statements to shared_preload_libraries if installed	2018-07-03 16:33:15 +03:00
Murat Tuncer	54aa865c3c	Merge pull request #2249 from citusdata/landlord_base Update citus_stat_statements view and regression tests	2018-07-03 16:32:47 +03:00
Murat Tuncer	23800f50f1	Update citus_stat_statements view and regression tests	2018-07-03 16:14:13 +03:00
Murat Tuncer	493e3b1b9e	Merge pull request #2248 from citusdata/partition_key_extraction_fix Strip implicit coercions when determining partition key value of an INSERT	2018-07-02 18:21:16 +03:00
Murat Tuncer	e532755a6e	Fix bug in partition column extraction added strip_implicit_coercion prior to checking if the expression is Const. This is important to find values for types like bigint.	2018-07-02 18:08:16 +03:00
Murat Tuncer	a3f1350bfe	Merge pull request #2246 from citusdata/bump-tools-version Bump tools version in .travis.yml	2018-07-02 15:44:36 +03:00
Burak Yucesoy	06589131d7	Bump tools version in .travis.yml To be able to test landlord in travis, we need pg_stat_statements from contrib packages. New tools version, 0.7.8, installs pg_stat_statements too, so we are switching to version 0.7.8 in our travis tests.	2018-07-02 14:55:23 +03:00
Murat Tuncer	1c95d5d497	Merge pull request #2242 from citusdata/master_stage_protocol_refactoring Apply master_stage_protocol refactoring changes	2018-06-28 14:23:48 +03:00
Murat Tuncer	3fc7cdfe6d	Apply master_stage_protocol refactoring changes	2018-06-28 11:24:57 +03:00
Murat Tuncer	86a3dd5a90	Merge pull request #2235 from citusdata/landlord_base Add groundwork for citus_stat_statements api	2018-06-27 14:47:18 +03:00
Murat Tuncer	4d35b92016	Add groundwork for citus_stat_statements api	2018-06-27 14:20:03 +03:00
Brian Cloutier	5ce18327a7	Don't spinloop when trying to cleanup a failed connection	2018-06-26 13:13:34 -07:00
Önder Kalacı	d63cbf3822	Merge pull request #2220 from citusdata/relation_access_via_placement_access Track relation accesses using the connection management infrastructure	2018-06-25 22:52:57 +03:00
Onder Kalaci	4ccabf9544	Increase timeout to keep appveyor happy	2018-06-25 18:40:40 +03:00
Onder Kalaci	7d0f7835e7	Improve relation accesses association to do less job	2018-06-25 18:40:40 +03:00
Onder Kalaci	8ccb8b679e	Real-time executor marks multi shard relation accesses before opening connections	2018-06-25 18:40:31 +03:00
Onder Kalaci	2890154420	Make sure that TRUNCATE always opens a DDL access	2018-06-25 18:40:31 +03:00
Onder Kalaci	21038f0d0e	Make sure that inter-shard DDL commands always cover both tables	2018-06-25 18:40:30 +03:00
Onder Kalaci	2f01894589	Track relation accesses using the connection management infrastructure	2018-06-25 18:40:30 +03:00
Önder Kalacı	8520ecc460	Merge pull request #2232 from citusdata/start_non_data_access Use non-data connection for intermediate results	2018-06-21 15:33:49 +03:00
Onder Kalaci	d5472614df	Use non-data connection for intermediate results Make sure that intermediate results use a connection that is not associated with any placement. That is useful in two ways: - More complex queries can be executed with CTEs - Safely use the same connections when there is a foreign key to reference table from a distributed table, which needs to use the same connection for modifications since the reference table might cascade to the distributed table.	2018-06-21 13:26:13 +03:00
Önder Kalacı	460eb6f295	Merge pull request #2229 from citusdata/improve_tests Move test UDF under test folder	2018-06-21 09:29:54 +03:00
Onder Kalaci	7762d81cba	Move test UDF under test folder	2018-06-21 08:42:44 +03:00
Jason Petersen	7a75c2ed31	Add connparam invalidation trigger creation logic This needs to live in Community, since we haven't yet added the complication of having divergent upgrade scripts in Enterprise.	2018-06-20 14:13:18 -06:00
Burak Velioglu	19cadf52ca	Merge pull request #2230 from citusdata/changelog-7.4.1 Add changelog entry for 7.4.1	2018-06-20 12:24:56 +03:00
velioglu	0ce613405e	Add changelog entry for 7.4.1	2018-06-20 11:26:15 +03:00
Mehmet Furkan ŞAHİN	42be04551c	Merge pull request #2227 from citusdata/seq_create_distributed_table create_distributed_table honors sequential mode	2018-06-19 22:03:14 +03:00
mehmet furkan şahin	2b2ce036eb	create_distributed_table honors sequential mode	2018-06-19 17:33:45 +03:00
Önder Kalacı	0c47d16e8e	Merge pull request #2224 from citusdata/set_local_via_c Implement C interface for setting GUC	2018-06-19 12:14:57 +03:00
Onder Kalaci	8f5821493a	Implement C interface for setting GUC We need the ability to switch to sequential mode (e.g., SET LOCAL citus.multi_shard_modify_mode = 'sequential'). This commit enables that.	2018-06-19 10:23:43 +03:00
Jason Petersen	bdc44f0d29	Merge pull request #2222 from citusdata/fix_insert_select Fix use-after-free that may occur for INSERT..SELECT in prepared statements cr: @jasonmp85	2018-06-19 00:15:09 -06:00
Marco Slot	f3f2805978	Fix use-after-free that may occur for INSERT..SELECT in prepared statements	2018-06-18 22:55:06 -06:00
Burak Velioglu	82829dfdc9	Merge pull request #2197 from citusdata/select_update_hash Adds SELECT ... FOR UPDATE support for router plannable queries	2018-06-18 18:20:55 +03:00
velioglu	53b2e81d01	Adds SELECT ... FOR UPDATE support for router plannable queries	2018-06-18 13:55:17 +03:00
Marco Slot	28860b2469	Remove volatile explain plan from regression tests	2018-06-15 00:21:52 +02:00
Marco Slot	04da0cf9b1	Remove costs from explain plans in window_functions tests	2018-06-14 23:51:46 +02:00
Marco Slot	4686d49b14	Merge pull request #2215 from citusdata/fix_ref_table_failure Always throw errors on failure on critical connection in router executor	2018-06-14 22:48:14 +02:00
Marco Slot	0bbe778760	Rename failOnError to alwaysThrowErrorOnFailure	2018-06-14 23:37:47 +02:00
Marco Slot	0feb1f2eb1	Do not call CheckRemoteTransactionsHealth from commit handler	2018-06-14 23:33:07 +02:00
Marco Slot	bc1cc419e1	Fix could not receive query results error in regression test output	2018-06-14 23:33:07 +02:00
Marco Slot	4ab8e87090	Always throw errors on failure on critical connection in router executor	2018-06-14 23:33:07 +02:00
Nils Dijk	1c36ade64d	Merge pull request #2218 from citusdata/fix/session-user refactor grantee serialization for reuse	2018-06-14 12:49:54 +02:00
Nils Dijk	73efcb22c4	Extract RoleSpecString and resolve role references	2018-06-14 11:38:42 +02:00
Jason Petersen	95e546ba5f	Merge pull request #2190 from citusdata/conninfo_guc Add node_conninfo GUC and related logic cr: @marcocitus	2018-06-13 12:26:36 -06:00
Jason Petersen	5bf7bc64ba	Add pg_dist_authinfo schema and validation This table will be used by Citus Enterprise to populate authentication- related fields in outbound connections; Citus Community lacks support for this functionality.	2018-06-13 11:16:26 -06:00
Jason Petersen	57b3f253c5	Add node_conninfo GUC and related logic To support more flexible (i.e. not at compile-time) specification of libpq connection parameters, this change adds a new GUC, node_conninfo, which must be a space-separated string of key-value pairs suitable for parsing by libpq's connection establishment methods. To avoid rebuilding and parsing these values at connection time, this change also adds a cache in front of the configuration params to permit immediate use of any previously-calculated parameters.	2018-06-12 20:23:47 -06:00
Mehmet Furkan ŞAHİN	a0651df574	Merge pull request #2205 from citusdata/foreign_refactoring Foreign key code refactoring into ddl/foreign_constraint.c	2018-06-07 20:10:11 +03:00
mehmet furkan şahin	d1a3b20115	foreign_constraint_utils is created	2018-06-07 18:19:24 +03:00
Önder Kalacı	39e681a0f5	Merge pull request #2194 from citusdata/seq_m_m_m_shards master_modify_multiple_shards() and TRUNCATE honour `citus.multi_shard_modify_mode`	2018-06-07 17:39:46 +03:00
Önder Kalacı	aeaa28c005	Merge pull request #2199 from citusdata/seq_realtime_select Realtime executor honours multi_shard_modify_mode	2018-06-07 16:14:10 +03:00
Önder Kalacı	e45036e3f2	Merge pull request #2198 from citusdata/seq_insert_select INSERT .. SELECT pushdown honors multi_shard_modification_mode	2018-06-07 16:13:18 +03:00
Onder Kalaci	a5370f5bb0	Realtime executor honours multi_shard_modify_mode We're relying on multi_shard_modify_mode GUC for real-time SELECTs. The name of the GUC is unfortunate, but, adding one more GUC (or renaming the GUC) would make the UX even worse. Given that this mode is mostly important for transaction blocks that involve modification /DDL queries along with real-time SELECTs, we can live with the confusion.	2018-06-06 14:59:54 +03:00
Onder Kalaci	d918556dca	INSERT .. SELECT pushdown honors multi_shard_modification_mode	2018-06-06 12:42:23 +03:00
Onder Kalaci	336044f2a8	master_modify_multiple_shards() and TRUNCATE honors multi_shard_modification_mode	2018-06-06 12:29:05 +03:00
Önder Kalacı	87270911d3	Merge pull request #2189 from citusdata/seq_ddl_via_seq_router DDL commands honour `citus.multi_shard_modify_mode`	2018-06-06 09:25:26 +03:00
Onder Kalaci	51cb24b39c	Increase timeout to make the appveyor tests happy	2018-06-05 17:52:18 +03:00
Onder Kalaci	df44956dc3	Make sure that sequential DDL opens a single connection to each node After this commit DDL commands honour `citus.multi_shard_modify_mode`. We preferred using the code-path that executes single task router queries (e.g., ExecuteSingleModifyTask()) in order not to invent a new executor that is only applicable for DDL commands that require sequential execution.	2018-06-05 17:52:17 +03:00
Murat Tuncer	98b99634f3	Merge pull request #2167 from citusdata/serf Add a debug message with distribution column value	2018-06-05 17:38:54 +03:00
Marco Slot	fd4ff29f2f	Add a debug message with distribution column value	2018-06-05 15:09:17 +03:00
Murat Tuncer	37bd4cd80c	Merge pull request #2195 from citusdata/push_grant Add handling for grant/revoke all tables in schema	2018-05-31 14:26:52 +03:00
Murat Tuncer	ba50e3f33e	Add handling for grant/revoke all tables in schema	2018-05-31 13:47:02 +03:00
Jason Petersen	b90c54abc0	Merge pull request #2170 from citusdata/master-update-version-1526383730 Bump Citus to 7.5devel cr: @jasonmp85	2018-05-28 18:11:09 -06:00
Jason Petersen	49c85c2522	Fix windows citus_version	2018-05-28 17:43:39 -06:00
velioglu	28e3003311	Update version in the root control	2018-05-28 17:25:21 -06:00
velioglu	20acee2cd4	Bump citus version to 7.5devel	2018-05-28 17:25:21 -06:00
Burak Velioglu	f069e7efb6	Merge pull request #2178 from citusdata/master-update-72 Add changelog entry for 7.2.2	2018-05-17 11:07:39 +03:00
Marco Slot	673184b8d3	Add changelog entry for 7.2.2	2018-05-17 10:35:15 +03:00
Önder Kalacı	cb634bf113	Merge pull request #2176 from citusdata/fix-appveyor-real-time-flakyness Increase deadlock timeout so we get fewer signals	2018-05-17 09:49:11 +03:00
Brian Cloutier	a7e09d777b	Increase deadlock timeout so we get fewer signals	2018-05-16 17:07:24 -07:00
Brian Cloutier	9667ee5ac9	Alleviate OOM failures in COMMIT callback Previously those failures caused us to crash, postgres abort()s when it notices a failure in the COMMIT callback.	2018-05-15 16:39:33 -07:00
Burak Velioglu	f163932a1b	Merge pull request #2169 from citusdata/citus-7.4.0-changelog-1526370810 Add changelog entry for 7.4.0	2018-05-15 14:18:07 +03:00
velioglu	c248c8ac38	Add changelog entry for 7.4.0	2018-05-15 11:44:30 +03:00
Marco Slot	cce658ad8c	Merge pull request #2166 from citusdata/version_bump_windows Fix regression tests in AppVeyor	2018-05-11 22:27:48 +02:00
Marco Slot	9323db2e05	Merge pull request #2161 from citusdata/move-intermediate-results-directory Move call to RemoveIntermediateResultsDirectory	2018-05-11 11:44:53 +02:00
Marco Slot	61d2c0f618	Stabilise output of multi_shard_update_delete test	2018-05-11 08:33:23 +02:00
Onder Kalaci	ed47e4e6b9	Remove placementId from the ORDER BY to make results consistent	2018-05-11 17:04:50 +03:00
Onder Kalaci	12e50d96dc	Fix tests where hll is not installed	2018-05-11 16:01:47 +03:00
Onder Kalaci	b1619e182d	Make sure Windows is built with the latest version	2018-05-11 15:44:19 +03:00
Brian Cloutier	4c2bf5d2d6	Move call to RemoveIntermediateResultsDirectory Errors thrown in the COMMIT handler will cause Postgres to segfault, there's nothing it can do to abort the transaction by the time that handler is called! RemoveIntermediateResultsDirectory is problematic for two reasons: - It has calls to ereport(ERROR which have been known to trigger - It makes memory allocations which raise ERRORs when they fail Once the COMMIT process has begun we don't use the intermediate results, so it's safe to remove them a little earlier in the process. A failure here will abort the transaction. That's pretty unnecessary, it's not that important that we remove the results, but it's still better than a crash.	2018-05-10 19:28:41 -07:00
Brian Cloutier	b3e85e4f71	Fix the huge appveyor diffs	2018-05-10 18:18:43 -07:00
Mehmet Furkan ŞAHİN	bdfe1ed702	Merge pull request #2165 from citusdata/enterprise_test_fix_drop_table_add enterprise test fixes	2018-05-10 14:22:59 +03:00
mehmet furkan şahin	b8c3197399	enterprise test fixes	2018-05-10 13:06:54 +03:00
Önder Kalacı	0e1a01921d	Merge pull request #2164 from citusdata/fix_concurrent_reg_test Run concurrent modification queries in tests sequentially	2018-05-10 12:17:41 +03:00
Onder Kalaci	04d9e886fe	Run concurrent modification queries in tests sequentially	2018-05-10 11:59:18 +03:00
Mehmet Furkan ŞAHİN	42b9690552	Merge pull request #2143 from citusdata/create_distributed_table_test_update_2 Create distributed table test update	2018-05-10 11:39:13 +03:00
mehmet furkan şahin	785a86ed0a	Tests are updated to use create_distributed_table	2018-05-10 11:18:59 +03:00
Mehmet Furkan ŞAHİN	ae97df43be	Merge pull request #2155 from citusdata/valgrind_fix valgrind tests fixed	2018-05-10 10:46:02 +03:00
mehmet furkan şahin	d35f2725bf	valgrind tests fix	2018-05-10 10:20:14 +03:00
Marco Slot	a63e628120	Merge pull request #2142 from citusdata/master_update_node_locking Make master_update_node block writes to the node	2018-05-09 14:27:03 +02:00
Dimitri Fontaine	8b258cbdb0	Lock reads and writes only to the node being updated in master_update_node Rather than locking out all the writes in the cluster, the function now only locks out writes that target shards hosted by the node we're updating.	2018-05-09 15:14:20 +02:00
Hadi Moshayedi	4198ad7618	Merge pull request #2157 from citusdata/fix_router_select Throw an error if placements cannot be found in router executor	2018-05-08 23:04:45 -04:00
Marco Slot	5f5f7b4fe0	Throw an error if placements cannot be found in router executor	2018-05-08 22:39:18 -04:00
Marco Slot	b4cfa2f283	Merge pull request #2158 from citusdata/fix_mx_test Run recursive_dml_queries_mx test on its own	2018-05-08 17:57:21 +02:00
Marco Slot	b86d6eb544	Merge pull request #2151 from citusdata/fix_cte_xact Ensure single-shard modifying CTEs are part of distributed transaction	2018-05-08 14:04:40 +02:00
Burak Velioglu	5b079c125b	Merge pull request #2150 from citusdata/modify_volatility_check Check volatile functions in modify queries	2018-05-08 11:45:36 +03:00
velioglu	caa27161ca	Check volatile functions in modify queries	2018-05-08 11:16:40 +03:00
Marco Slot	a7e6689890	Run recursive_dml_queries_mx test on its own	2018-05-06 17:12:53 +02:00
Marco Slot	9438e5bde9	Ensure single-shard modifying CTEs are part of distributed transaction	2018-05-06 12:49:40 +02:00
Hadi Moshayedi	86b12bc2d0	Always prefix operators with their namespace. (#2147 ) Previously we checked if an operator is in pg_catalog, and if it wasn't we prefixed it with namespace in worker queries. This can have a huge impact on performance of physical planner when using custom data types. This happened regardless of current search_path config, because Citus overrides the search path in get_query_def_extended(). When we do so, the check for existence of the operator in current search path in generate_operator_name() fails for any operators outside pg_catalog. This means that nothing gets cached, and in the following calls we will again recheck the system tables for existence of the operators, which took an additional 40-50ms for some of the usecases we were seeing. In this change we skip the pg_catalog check, and always prefix the operator with its namespace.	2018-05-05 13:27:26 -04:00
Marco Slot	0f98e4dd2f	Merge pull request #2137 from citusdata/marco_updel_subquery Implement recursive planning for DML statements	2018-05-03 22:22:14 +02:00
Murat Tuncer	42a8082721	PG11 compatibility refresh adds a shim for a changed function api	2018-05-03 13:21:15 -06:00
Marco Slot	2f9c8c6af0	Allow DML commands with unreferenced SELECT CTEs	2018-05-03 14:53:26 +02:00
Marco Slot	f8cfe07fd1	Support intermediate results in distributed INSERT..SELECT	2018-05-03 14:42:28 +02:00
Marco Slot	90cdfff602	Implement recursive planning for DML statements	2018-05-03 14:42:28 +02:00
Mehmet Furkan ŞAHİN	711128671a	Merge pull request #2141 from citusdata/regression_tests_large_shard_count2 shard count for some of the tests are increased	2018-05-03 11:05:55 +03:00
mehmet furkan şahin	ef90122cd3	shard count for some of the tests are increased	2018-05-03 10:44:43 +03:00
Önder Kalacı	7fd4383886	Merge pull request #2075 from citusdata/hash_single_repartition Implement hash-repartitioning for single repartition joins (e.g., enable single repartition joins among hash distributed tables)	2018-05-02 22:07:31 +03:00
Onder Kalaci	317dd02a2f	Implement single repartitioning on hash distributed tables * Change worker_hash_partition_table() such that Citus planner's hashing and worker_hash_partition_table() no longer diverge. * Rename single partitioning to single range partitioning. * Add single hash repartitioning. Basically, logical planner treats single hash and range partitioning almost equally. Physical planner, on the other hand, treats single hash and dual hash repartitioning almost equally (except for JoinPruning). * Add a new GUC to enable this feature	2018-05-02 18:50:55 +03:00
Burak Velioglu	7850f93127	Merge pull request #2124 from citusdata/ot_msud_subquery Support UPDATE/DELETE with joins and subqueries	2018-05-02 17:25:07 +03:00
velioglu	32bcd610c1	Support modify queries with multiple tables With this commit we begin to support modify queries with multiple tables if these queries are pushdownable.	2018-05-02 16:22:26 +03:00
Burak Velioglu	38bdd51dea	Merge pull request #2104 from citusdata/refactor_subquery Refactor query pushdown related logic	2018-05-02 15:27:31 +03:00
velioglu	d9fa69c031	Refactor query pushdown related logic	2018-05-02 15:03:09 +03:00
Brian Cloutier	f8fb7a27fb	Don't copyObject into the wrong memory context utilityStmt sometimes (such as when it's inside of a plpgsql function) comes from a cached plan, which is kept in a child of the CacheMemoryContext. When we naively call copyObject we're copying it into a statement-local context, which corrupts the cached plan when it's thrown away.	2018-05-01 15:34:32 -07:00
Marco Slot	663c9b2a39	Merge pull request #2138 from citusdata/drop_shards_users Make DROP TABLE connect as current user instead of superuser	2018-05-01 11:22:40 +02:00
Marco Slot	2559b84049	Drop shards as current user instead of super user	2018-05-01 09:57:20 +02:00
Marco Slot	64280a2329	Merge pull request #1991 from citusdata/remove_broadcast_pr4 Remove usages of large_table_shard_count GUC	2018-04-30 18:22:31 +02:00
Önder Kalacı	b1e6636398	Merge pull request #2107 from citusdata/simplify_optimizer_phase_3 Simplify optimizer [Phase 3] - Move processing each part of the query into its own functions	2018-04-29 12:56:17 +03:00
velioglu	121ff39b26	Removes large_table_shard_count GUC	2018-04-29 10:34:50 +02:00
Onder Kalaci	832c91e28c	Move processing each part of the query into its own functions This commit doesn't change any of the logic at all. Instead, the goal is to: * Get rid of any code duplication * Incremental changes to the optimizer made it slightly hard to follow the code, improve that and make it easier to implement new features * Simplify the code by moving each part of query processing (e.g., DISTINCT, LIMIT etc) into its own function * Make the interaction between each part of the query more obvious (e.g., How DISTINCT affects LIMIT etc)	2018-04-27 17:32:38 +03:00
Mehmet Furkan ŞAHİN	ca2818b569	Merge pull request #2121 from citusdata/vacuum_analyze_verbose_support Add support for (VACUUM \| ANALYZE) VERBOSE	2018-04-27 16:04:35 +03:00
mehmet furkan şahin	f2555317b6	ProcessVacuumStmt update on names	2018-04-27 14:37:01 +03:00
mehmet furkan şahin	a4153c6ab1	notice handler is implemented	2018-04-27 14:37:01 +03:00
Marco Slot	304b3a41ba	Cache the partition column Var	2018-04-26 14:58:16 -06:00
Jason Petersen	643059860a	Merge pull request #2123 from citusdata/improve_connection_errors Improve connection error reporting cr: @jasonmp85	2018-04-26 13:59:59 -06:00
Marco Slot	3d3c19a717	Improve messages for essential connection failures	2018-04-26 12:58:47 -06:00
Marco Slot	88f64d22db	Prevent connection pointer is NULL details	2018-04-26 12:49:57 -06:00
Marco Slot	394732b6be	Add a connection failure error code	2018-04-26 12:49:57 -06:00
Önder Kalacı	ebb8f902c8	Relax assertion on transaction rollback failure (#2052 ) In case a failure happens when a transaction is rolled back, we used to hit an assertion for ensuring there is no pending activity on the connection. However, that's not true after the changes in #2031. Thus, we've replaced the assertion with a more generic function call to consume any pending activity, if it exists.	2018-04-26 13:39:03 -04:00
Hadi Moshayedi	24659a97dc	Fail task in real-time executor if no placements found. (#2133 )	2018-04-26 12:05:24 -04:00
Murat Tuncer	9610fe70f8	Merge pull request #2118 from citusdata/pg11_compat PG 11 compatibility refresh	2018-04-26 11:54:00 +03:00
Murat Tuncer	a6fe5ca183	PG11 compatibility update - changes in ruleutils_11.c are reflected - vacuum statement api change is handled. We now allow multi-table vacuum commands. - some other function header changes are reflected - api conflicts between PG11 and earlier versions are handled by adding shims in version_compat.h - various regression tests are fixed due to output and functionality changes in PG11 - no change is made to support new features in PG11; they need to be handled by a new commit	2018-04-26 11:29:43 +03:00
Brian Cloutier	49255213d4	Configure appveyor to run regression tests - Add install.pl to install .sql files on Windows - Remove a hack to PGDLLIMPORT some variables - Add citus_version.o to the Makefile - Fix pg_regress_multi's PATH generation on Windows - Output regression.diffs when the tests fail - Fix permissions in data directory, make sure postgres can play with it	2018-04-25 18:02:07 -07:00
Önder Kalacı	f0b50f2f99	Merge pull request #2090 from citusdata/simplify_optimizer_phase_2 Refactor logical optimizer [Phase 2] - Eliminate code duplication in `WorkerExtendedOpNode()`	2018-04-25 09:49:05 +03:00
Onder Kalaci	ac8f2f1e6d	Eliminate code duplication in WorkerExtendedOpNode() Before this commit, we had code duplication in the WorkerExtendedOpNode(). The duplication was noticeable and any change is prone to bugs. The PR consists of 4 commits. Each commit incrementally fixes the problem by moving certain parts of the duplicated code into smaller, better-documented functions.	2018-04-25 08:54:59 +03:00
Brian Cloutier	8d4c4d5c58	Close all files before trying to remove them	2018-04-24 14:35:20 -07:00
Brian Cloutier	c5f1235090	Turn the crashes on Windows into WARNINGs	2018-04-24 14:35:20 -07:00
Önder Kalacı	18cb93c107	Merge pull request #2070 from citusdata/simplify_optimizer Refactor logical optimizer [Phase 1] - Unify `extendedOpNode` creation	2018-04-24 14:14:16 +03:00
Onder Kalaci	ee748d9140	Unify extendedOpNode Processing Before this commit, we had a divergence among the creation of master/worker extended op nodes. This commit moves the related parts into a single place and allows the creation of master/extended op nodes to share a common data structure.	2018-04-24 11:56:38 +03:00
Hadi Moshayedi	966f01fad3	Fix write and copy functions for TaskExecution. (#2120 ) We were missing criticalErrorOccurred from CopyNodeTaskExecution() and OutTaskExecution(). This PR fixes it.	2018-04-23 09:07:52 -04:00
Önder Kalacı	6c6cddff04	Merge pull request #2105 from citusdata/fix_null_access Citus should not consider subqueries that are removed by PostgreSQL planner	2018-04-21 04:23:47 +03:00
Onder Kalaci	814f0e3acc	Ensure Citus never try to access a not planned subquery PostgreSQL might remove some of the subqueries when they do not contribute to the query result at all. Citus should not try to access such subqueries during planning.	2018-04-20 13:52:00 +03:00
Brian Cloutier	b0b130f064	Fix Windows crash in multi_copy test Without this change we crash on Windows when COPYing into a table with 62 shards, and we ERROR when COPYing into a table with >62 shards: ERROR: WaitForMultipleObjects() failed: error code 87	2018-04-17 15:48:02 -07:00
Brian Cloutier	d02f761d8e	Change intermediate_results test to not crash	2018-04-17 15:14:02 -07:00
Brian Cloutier	0104790385	Fix hard-coded temp directory in multi_copy /tmp does not exist on Windows, use :temp_dir instead	2018-04-17 15:01:22 -07:00
Brian Cloutier	a59c1c634e	Fix cancellation of real time queries Without this change multi_real_time_transaction blocks forever (on Windows) in the block where it repeatedly calls pg_advisory_lock(15). This happens because the deadlock detector tries to cancel the backend but the backend never processes that signal.	2018-04-17 14:26:22 -07:00
Mehmet Furkan ŞAHİN	cd43d48608	Merge pull request #2109 from citusdata/capital_schema_support Quotation needy schema name support	2018-04-17 22:21:55 +03:00
mehmet furkan şahin	00e786af00	Capital named schema support is added	2018-04-17 17:17:42 +03:00
Mehmet Furkan ŞAHİN	de6d3f2d33	Merge pull request #2102 from citusdata/push_down_multiple_having This is a fairly simple PR that changes the AND clauses in having explicit for worker queries for pushdown planner. Since, they are going to be switched back to be implicit in worker itself, we should provide them in explicit form. Otherwise, the worker errors-out saying the query syntax is wrong.	2018-04-16 15:30:05 +03:00
mehmet furkan şahin	e5a5502b16	Adds support for multiple ANDs in Having This PR adds support for multiple AND expressions in Having for pushdown planner. We simply make a call to make_ands_explicit from MultiLogicalPlanOptimize for the having qual in workerExtendedOpNode.	2018-04-16 14:14:48 +03:00
Brian Cloutier	42ddfa176d	Fix crash on Windows where there is no detail	2018-04-13 12:54:22 -07:00
Burak Velioglu	f54ed5d3b0	Merge pull request #1980 from citusdata/remove_boradcast_pr3_v2 Convert broadcast join to reference join	2018-04-13 16:44:55 +03:00
velioglu	82b2d21b0c	Convert broadcast join to reference join After this commit large_table_shard_count won't be used to check whether broadcast join, which is renamed as reference join, can be applied. Reference join can only be applied over reference tables.	2018-04-13 12:58:14 +03:00
Burak Velioglu	561d9b217c	Merge pull request #2103 from citusdata/fix_copartition_check Add co-placement check to CoPartitionedTable	2018-04-13 12:28:56 +03:00
velioglu	1b92812be2	Add co-placement check to CoPartition function	2018-04-13 12:13:08 +03:00
Marco Slot	b4c9e2c1ea	Merge pull request #2089 from citusdata/fix_size_query Fix issue preventing multiple size function calls per query	2018-04-12 14:44:34 +02:00
Marco Slot	9318aeee6b	Allow multiple size function calls per query	2018-04-12 14:16:17 +02:00
Marco Slot	6df6d841c9	Merge pull request #2013 from citusdata/subquery_pruning Prune shards once per relation in subquery pushdown	2018-04-10 20:19:09 +02:00
Marco Slot	ee132c5ead	Prune shards once per relation in subquery pushdown	2018-04-10 20:33:07 +02:00
Burak Yücesoy	3873d6858d	Merge pull request #2088 from citusdata/fix-drop-partitioning-table-from-worker Prevent DROPping partitioned tables from workers	2018-04-09 14:37:03 +03:00
Burak Yucesoy	b33b282030	Fix bug while DROPping partitioned table from worker We recently added partitioning support to Citus MX. We should not execute DROP table commands from MX workers but at the moment we try to execute such commands for partitioned tables. This PR fixes that problem by adding a check.	2018-04-09 13:50:21 +03:00
Burak Yücesoy	0699fbe281	Merge pull request #2083 from citusdata/add_partitioning_support_to_mx Add partitioning support to MX tables	2018-04-06 13:46:46 +03:00
Burak Yucesoy	0c283fa8a3	Add partitioning support to MX tables Previously, we prevented creation of partitioned tables on Citus MX. We decided to not focus on this feature until there is a need. Since now there are requests for this feature, we are implementing support for partitioned tables on Citus MX.	2018-04-06 12:47:06 +03:00
Burak Velioglu	86b733d14c	Merge pull request #2060 from citusdata/master-update-version-1521121896 Bump citus version to 7.4devel	2018-04-05 21:01:23 +03:00
velioglu	f01daa0c83	Bump citus version to 7.4devel	2018-04-05 20:38:47 +03:00
Burak Velioglu	aa0aea9840	Merge pull request #1974 from citusdata/remove_broadcast_pr2_v2 Adds colocation check to local join	2018-04-04 23:19:13 +03:00
velioglu	72dfe4a289	Adds colocation check to local join	2018-04-04 22:49:27 +03:00
Burak Velioglu	236298823c	Merge pull request #1915 from citusdata/remove_broadcast_pr1 Removes data fetch related logic	2018-03-30 14:33:11 +03:00
velioglu	82a864308a	Remove SHARD_STORAGE_RELAY type	2018-03-30 11:45:19 +03:00
velioglu	698d585fb5	Remove broadcast join logic After this change all the logic related to shard data fetch logic will be removed. Planner won't plan any ShardFetchTask anymore. Shard fetch related steps in real time executor and task-tracker executor have been removed.	2018-03-30 11:45:19 +03:00
Matthew Wozniczka	4582a4b398	Fixed a typo	2018-03-27 22:51:36 -06:00
Murat Tuncer	1cb8e5b4bf	Fix isolation tests for windows echo command	2018-03-27 14:18:48 -07:00
Brian Cloutier	9aff4384a1	Make tests platform independent - Force all platforms to use the same collation - Force all platforms to use the same locale - Use /dev/null or NUL, depending on platform - Use /tmp or %TEMP%, depending on platform	2018-03-27 14:18:48 -07:00
Brian Cloutier	2140b5d82d	Make pg_regress_multi.pl platform independent - don't hardcode path names - replace system calls for rm/mkdir/rm -rf with perl equivalents - force utf-8 encoding - the Windows shell uses different quoting and escape rules	2018-03-27 14:18:48 -07:00
Brian Cloutier	adb4669d34	Add appveyor.yml, support builds on Windows	2018-03-23 16:54:33 -07:00
Brian Cloutier	f8f0d4aedc	Add Windows replacement for uname	2018-03-21 20:35:56 -07:00
Brian Cloutier	98ffafe16e	Fix error handling in connection_management	2018-03-21 20:05:00 -07:00
Murat Tuncer	224b0a8c14	Replace poll with select/poll Windows does not have poll(), so fall back to select()	2018-03-21 20:05:00 -07:00
Burak Velioglu	997e718b26	Merge pull request #2062 from citusdata/add_missing_changelog Adds missing changelog items for the 7.3	2018-03-16 09:59:05 +03:00
velioglu	cd2a167e48	Adds missing changelog items for the 7.3	2018-03-16 09:47:20 +03:00
Burak Velioglu	991000efc9	Merge pull request #2055 from citusdata/citus-7.3.0-changelog-1521010978 Bump citus to 7.3.0	2018-03-15 13:49:43 +03:00
velioglu	cbe7799863	Add changelog entry for 7.3.0	2018-03-15 10:58:36 +03:00
Metin Döşlü	81cbb7c223	Merge pull request #2051 from citusdata/remove_skip_jsonb_validation_in_copy Remove skip_jsonb_validation_in_copy GUC	2018-03-13 18:38:43 +03:00
Metin Doslu	3b7b64a8b6	Remove skip_jsonb_validation_in_copy GUC	2018-03-13 10:33:27 +02:00
Murat Tuncer	1440caeef2	Fix incorrect limit pushdown when distinct clause is not superset of group by (#2035 ) Pushing down limit and order by into workers may produce wrong output when distinct on() clause has expressions, aggregates, or window functions. This checking allows pushing down of limits only if distinct clause is a superset of group by clause. i.e. it contains all clauses in group by.	2018-03-07 13:24:56 +03:00
Metin Döşlü	27d159d6f6	Merge pull request #2041 from citusdata/make_skip_jsonb_validation_in_copy_off Change default to false for citus.skip_jsonb_validation_in_copy	2018-03-06 14:43:58 +03:00
Metin Doslu	e86d34256c	Change default to false for citus.skip_jsonb_validation_in_copy	2018-03-06 13:19:47 +02:00
Önder Kalacı	1d2a2d13cb	Merge pull request #2038 from citusdata/fix_modify_subquery Improve error messages for INSERT queries that have subqueries	2018-03-05 16:49:53 +03:00
Onder Kalaci	40b898b59f	Improve error messages for INSERT queries that have subqueries	2018-03-05 14:46:47 +02:00
Önder Kalacı	e7b28dd469	Merge pull request #2031 from citusdata/fix_immediate_shut_down_issue Improve error handling on failures	2018-03-02 10:04:12 +03:00
Onder Kalaci	7dc9589b56	Handle failures during I/O This commit checks the connection status right after any IO happens on the socket. This is necessary since before this commit we didn't pass any information to the higher level functions whether we're done with the connection (e.g., no IO required anymore) or an error happened during the IO.	2018-03-02 08:33:53 +02:00
Onder Kalaci	da0048e0b7	ForgetResults() becomes a wrapper for ClearResults() ClearResults() is able to handle failures properly by checking the result status. So, relying on it makes error handling more generic in Citus.	2018-03-02 08:33:53 +02:00
Murat Tuncer	76f6883d5d	Add support for window functions that can be pushed down to worker (#2008 ) This is the first of series of window function work. We can now support window functions that can be pushed down to workers. Window function must have distribution column in the partition clause to be pushed down.	2018-03-01 19:07:07 +03:00
Marco Slot	8e2c72c054	Merge pull request #2030 from citusdata/bool_agg Add support for bool and bit aggregates	2018-02-28 13:08:35 +01:00
Murat Tuncer	e13c5beced	Fix worker query when order by avg aggregate is used (#2024 ) We push down order by to worker query when limit is specified (with some other additional checks). If the query has an expression on an aggregate or avg aggregate by itself, and there is an order by on this particular target we may send wrong order by to worker query with potential to affect query result. The fix creates an auxiliary target entry in the worker query and uses that target entry for sorting.	2018-02-28 12:12:54 +03:00
Marco Slot	dc7213a11c	Use expressions in the ORDER BY in bool_agg	2018-02-27 23:52:44 +01:00
Marco Slot	e79db17b91	Update comment in WorkerAggregateExpressionList	2018-02-27 23:48:25 +01:00
Marco Slot	ef5ff7eb12	Add bit_ and bool_ aggregates to AggregateType	2018-02-27 23:48:25 +01:00
Marco Slot	c723a1fa32	Add support for bool and bit aggregates	2018-02-27 23:48:25 +01:00
Metin Döşlü	8516d8631e	Merge pull request #2006 from citusdata/modifiying_cte Add support for modifying CTEs	2018-02-27 16:26:58 +03:00
Metin Doslu	bcf660475a	Add support for modifying CTEs	2018-02-27 15:08:32 +02:00
Marco Slot	3098a15164	Merge pull request #2028 from citusdata/fix_regressions Fix regression for changes on REL_10_STABLE	2018-02-27 14:01:55 +01:00
Metin Doslu	91165c4140	Add a temporary file to pass Travis tests	2018-02-27 13:50:36 +02:00
Metin Doslu	53bb0b6aee	Fix regression for changes on REL_10_STABLE	2018-02-27 12:52:56 +02:00
Burak Velioglu	ee67ce892f	Merge pull request #2018 from citusdata/distinct_without_groupby_column Add distinct plan after aggregation plan on master planner	2018-02-26 15:51:19 +03:00
velioglu	78e6d990a2	Fix master plan of the query with distinct, aggregate and group by clauses. Before this PR, we were relying on the group by columns to guarantee the uniqueness of the results. However, this assumption is correct only if the columns in the group by are a subset of the columns in the distinct clause. It can be wrong if we have part of the group by columns and some aggregation columns in the distinct clause. With this PR, we add a distinct plan on top of the aggregate plan when necessary.	2018-02-26 15:30:15 +03:00
Önder Kalacı	059644c1ab	Merge pull request #2016 from citusdata/non_colocated_subqueries Support non-co-located joins between subqueries	2018-02-26 15:25:04 +03:00
Onder Kalaci	1c930c96a3	Support non-co-located joins between subqueries With #1804 (and related PRs), Citus gained the ability to plan subqueries that are not safe to pushdown. There are two high-level requirements for pushing down subqueries: * Individual subqueries that require a merge step (i.e., GROUP BY on non-distribution key, or LIMIT in the subquery etc). We've handled such subqueries via #1876. * Combination of subqueries that are not joined on distribution keys. This commit aims to recursively plan some of such subqueries to make the whole query safe to pushdown. The main logic behind non colocated subquery joins is that we pick an anchor range table entry and check for distribution key equality of any other subqueries in the given query. If for a given subquery, we cannot find distribution key equality with the anchor rte, we recursively plan that subquery. We also used a hacky solution for picking relations as the anchor range table entries. The hack is that we wrap them into a subquery. This is only necessary since some of the attribute equivalence checks are based on queries rather than range table entries.	2018-02-26 13:50:37 +02:00
Onder Kalaci	7b57e0562a	Add infrastructure for detecting non-colocated subqueries	2018-02-26 13:28:25 +02:00
Onder Kalaci	cdb8d429a7	Add regression tests for non-colocated leaf subqueries	2018-02-26 13:28:24 +02:00
Onder Kalaci	4d4648aabd	Change single shard mx test tables to reference tables	2018-02-26 13:28:24 +02:00
Onder Kalaci	4d70c86645	Leaf level recursive planning for non colocated subqueries With this commit, we enable recursive planning for the subqueries that are not joined on the distribution keys.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e998703ff8	Enable restriction eq. checks for top level set operations We used to only support pushdownable set operations inside a subquery, however, we could easily expand the restriction checks to cover top level set operations as well.	2018-02-26 13:28:24 +02:00
Onder Kalaci	e8aa532a90	Refactor checks for distribution key equality Change some function names, ensure we stick to Citus' function order rules etc.	2018-02-26 13:28:24 +02:00
Marco Slot	846b8b1536	Merge pull request #2023 from citusdata/fix_table_size Do not use new connection in table size functions	2018-02-26 11:04:54 +01:00
Marco Slot	1e9186a3b5	Do not use new connection in table size functions	2018-02-23 07:07:55 +01:00
Marco Slot	e2001a332f	Merge pull request #2015 from MarkusSintonen/jsonb-aggregation Add support for json(b) aggregation	2018-02-21 14:44:07 +01:00
Markus Sintonen	6202e80d06	Implemented jsonb_agg, json_agg, jsonb_object_agg, json_object_agg	2018-02-18 00:19:18 +02:00
Önder Kalacı	62237c40a7	Merge pull request #2007 from citusdata/ref_where_sublinks_v2 Recursively plan subqueries in WHERE clause when FROM recurs	2018-02-14 10:32:20 +03:00
velioglu	195ac948d2	Recursively plan subqueries in WHERE clause when FROM recurs	2018-02-13 19:52:12 +03:00
Marco Slot	6ce4795f1c	Merge pull request #1996 from citusdata/cache_worker_node_array Cache worker node array for faster iteration	2018-02-12 15:26:48 -08:00
Marco Slot	0cba4ab588	Refactor worker node hash initialisation	2018-02-12 23:36:43 +01:00
Marco Slot	40d715d494	Cache worker node array for faster iteration	2018-02-12 23:36:43 +01:00
Marco Slot	65fca44f4f	Merge pull request #1979 from citusdata/fix_abort_errors Handle errors that are discovered during abort	2018-02-12 10:04:00 -08:00
Marco Slot	d9c5c4a8f1	Merge pull request #2003 from citusdata/no_plan_copy Only copy distributed plan when modifying it	2018-02-12 09:34:30 -08:00
Önder Kalacı	bf1e492011	Merge pull request #1989 from citusdata/refactor_restriction_logic Some code refactoring and performance improvements for restriction equivalences	2018-02-12 20:01:35 +03:00
Onder Kalaci	94c5ac6ebb	Remove duplicate join restrictions We use PostgreSQL hooks to accumulate the join restrictions and PostgreSQL gives us all the join paths it tries while deciding on the join order. Thus, for queries that have many joins, this function is likely to remove lots of duplicate join restrictions. This becomes relevant for Citus on query pushdown check performance.	2018-02-12 18:35:05 +02:00
Onder Kalaci	c228d8ff3d	Refactor equivalence generation related code This commit changes the APIs for restriction generation to make future changes simpler.	2018-02-12 18:35:04 +02:00
Onder Kalaci	2f2d350924	Refactor relation restriction related code This commit moves some of the functions to a more relevant source file.	2018-02-12 18:35:04 +02:00
Marco Slot	6e79a34c97	Do not check for cancellation in ClearResultsIfReady	2018-02-12 16:45:02 +01:00
Marco Slot	6051aae56e	Handle errors that are discovered during abort	2018-02-12 16:45:02 +01:00
Marco Slot	ee6a751798	Only copy distributed plan when modifying it	2018-02-12 16:30:55 +01:00
Jason Petersen	e75eb17130	Try new Debian URL	2018-02-07 15:06:37 -07:00
Metin Döşlü	7332244c8c	Merge pull request #1999 from citusdata/citus-7.2.1-changelog-1517919981 Bump citus to 7.2.1	2018-02-06 15:41:44 +03:00
Metin Doslu	238defaee0	Add changelog entry for 7.2.1	2018-02-06 14:27:00 +02:00
Burak Yücesoy	cf5d258043	Merge pull request #1993 from citusdata/subquery_pushdown_count_distinct Fix count distinct using field select on top level query	2018-02-06 15:06:54 +03:00
Murat Tuncer	678223224b	Update regression test output expectation based on recent PG10 change	2018-02-06 14:44:55 +03:00
Murat Tuncer	901b543e20	Fix count distinct using field select on top level query We were allowing count distinct queries even if they were not directly on columns if the query is grouped on distribution column. When performing these checks we were skipping subqueries because they also perform this check in a more concise manner. We relied on oid SUBQUERY_RELATION_ID (10000) to decide if a given RTE relation id denotes a subquery, however, we also use SUBQUERY_PUSHDOWN_RELATION_ID (10001) for some subqueries. We skip both types of subqueries with this change.	2018-02-06 13:16:10 +03:00
Metin Döşlü	aba2f47cdf	Merge pull request #1988 from citusdata/respect_enable_hashagg Respect enable_hashagg in the master planner	2018-02-05 16:27:05 +03:00
metdos	35f864bcaf	Respect enable_hashagg in the master planner	2018-02-05 15:06:00 +02:00
metdos	3d540d961c	Fix typo in grouping_is_sortable()	2018-02-05 12:10:19 +02:00
Marco Slot	00f9082cd4	Merge pull request #1965 from citusdata/fast_jsonb_copy Skip JSON validation on coordinator during COPY	2018-02-04 14:56:56 +01:00
Marco Slot	6f7c3bd73b	Skip JSON validation on coordinator during COPY	2018-02-02 15:33:27 +01:00
Brian Cloutier	15511f6ba1	Dynamically allocate connection metadata in WaitForAllConnections	2018-02-01 10:30:41 -08:00
Brian Cloutier	e6ebfc1f53	Remove VLA from UpdateNodeLocation	2018-02-01 10:30:41 -08:00
Brian Cloutier	a2ed45e206	Remove variable length arrays VLAs aren't supported by Visual Studio. - Remove all existing instances of VLAs. - Add a flag, -Werror=vla, which makes gcc refuse to compile if we add VLAs in the future.	2018-02-01 10:30:41 -08:00
Brian Cloutier	2efe80ce55	CheckForDistributedDeadlocks no longer uses a VLA - variable length arrays (VLAs) do not work with Visual Studio - fix an off-by-one error. We incorrectly assumed there would always be at least as many edges as there were nodes. - refactor: reduce scope of transactionNodeStack by moving it into the function which uses it. - refactor: break up the distinct uses of currentStackDepth into separate variables.	2018-02-01 10:30:41 -08:00
Brian Cloutier	097fd15a89	small refactor, CheckDeadlockForTransactionNode builds its own array	2018-02-01 10:30:41 -08:00
Brian Cloutier	457f570b77	Small refactor, we were using incompatible types	2018-01-31 11:05:59 -08:00
Brian Cloutier	b864d014ab	GetNextNodeId() incorrectly called PG_RETURN_DATUM - Also stabilize the output of a multi_router_planner test	2018-01-29 15:32:36 -08:00
Brian Cloutier	61a6b846b9	Refactor: use a temporary timestamp variable It's against our coding convention to call functions inside parameter lists; when single-stepping with a debugger it's difficult to determine what the function returned. That wouldn't be a good enough reason to change this code, but while porting Citus to Windows I ran into this line of code. assign_distributed_transaction_id was called with a weird timestamp and I wasn't able to find the problem without first making this change.	2018-01-29 11:20:13 -08:00
Marco Slot	0303dfc463	Merge pull request #1981 from citusdata/faster_execute_subplans Skip call to ActiveReadableNodeList when there are no subplans	2018-01-29 17:20:44 +01:00
Marco Slot	bd0ebac865	Skip call to ActiveReadableNodeList when there are no subplans	2018-01-29 16:05:10 +01:00
Hadi Moshayedi	ff26bcd5a5	Include sys/stat.h for S_IRUSR and S_IWUSR. (#1977)	2018-01-26 16:21:48 -05:00
Marco Slot	ddbcb9fc25	Merge pull request #1944 from citusdata/base_schedule Add base schedule for only running specific regression tests	2018-01-25 22:12:34 +01:00
Marco Slot	4762503c34	Add base schedule for only running specific regression tests	2018-01-25 18:51:22 +01:00
Burak Velioglu	d43e18f398	Merge pull request #1975 from citusdata/add_remaining_changelog Adds missing item to 7.2 changelog	2018-01-25 17:18:01 +03:00
velioglu	414778d360	Adds missing item to 7.2 changelog	2018-01-25 16:58:09 +03:00
Brian Cloutier	76d1edc3fd	Don't rely on gcc-specific features (#1963) * Don't use expressions inside compound statements * Don't depend on __builtin_constant_p * Remove reliance on S_ISLNK * Replace use of __func__: older MSVC doesn't support this builtin	2018-01-23 17:03:29 -08:00
Önder Kalacı	59a03d809d	Merge pull request #1961 from citusdata/fix_wrong_deadlock Prevent canceling backends that are active but not involved in the distributed deadlock	2018-01-22 10:09:53 +03:00
Onder Kalaci	fbde87d2d0	Allocate enough space for transaction nodes This fix prevents any potential memory access that might occur while forming the deadlock path.	2018-01-22 08:45:48 +02:00
Onder Kalaci	9a89c0b425	Fix bug while traversing the distributed deadlock graph With this fix, we traverse the graph with DFS, as was originally intended. Note that, before the fix, we traversed the graph with BFS, which might lead to killing some unrelated backend that is not involved in the distributed deadlock.	2018-01-22 08:45:48 +02:00
Dimitri Fontaine	bff44394fb	Merge pull request #1948 from citusdata/feature/alter-index Add support for Alter Index Commands, and Storage Parameters on tables and indexes.	2018-01-18 11:27:33 +01:00
Dimitri Fontaine	c9760fbb64	Fix CREATE INDEX with storage options on distributed tables. By sharing the implementation of the function AppendOptionListToString on three call sites, we would expand an extra OPTIONS keyword in a create index statement, and omit other bits of the specific syntax here. This patch introduces an AppendStorageParametersToString() function that is very similar to AppendOptionListToString() but handles WITH(a="foo",...) syntax that is used in reloptions (aka Storage Parameters). Fixes #1747.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	952da72c55	Implement ALTER TABLE\|INDEX ... SET\|RESET (). PostgreSQL implements support for several relation kinds in a single statement, such as in the AlterTableStmt case, which supports both tables and indexes and more (see ATExecSetRelOptions in PostgreSQL source code file src/backend/commands/tablecmds.c for an example of that). As a consequence, this patch implements support for setting and resetting storage parameters on both relation kinds.	2018-01-17 21:56:40 +01:00
Dimitri Fontaine	17266e3301	Implement ALTER INDEX ... RENAME TO ... The command is now distributed among the shards when the table is distributed. To that effect, we fill in the DDLJob's targetRelationId with the OID of the table for which the index is defined, rather than the OID of the index itself.	2018-01-17 21:56:40 +01:00
Burak Velioglu	aee0be881b	Merge pull request #1955 from citusdata/master-update-version-1516091373 Bump Citus to 7.3devel	2018-01-16 13:57:51 +03:00
velioglu	d357d2fccd	Bump citus version to 7.3devel	2018-01-16 11:50:28 +03:00
Burak Velioglu	3a82f4e6d2	Merge pull request #1954 from citusdata/citus-7.2.0-changelog-1516086689 Add changelog entry for 7.2.0	2018-01-16 11:23:14 +03:00
velioglu	1d53a71397	Add changelog entry for 7.2.0	2018-01-16 11:12:12 +03:00
Önder Kalacı	7d3cb721de	Merge pull request #1946 from citusdata/fix_missing_test_tx_id Add test that checks whether distributed transaction ID survives pg_dist_partition invalidation	2018-01-11 17:37:15 +03:00
Marco Slot	6fb8cfc104	Add test that checks whether distributed transaction ID survives pg_dist_partition invalidation	2018-01-11 16:14:39 +02:00
Dimitri Fontaine	6dd1793da9	Merge pull request #1939 from citusdata/feature/rename-table Add support for renaming Distributed Tables	2018-01-11 13:38:18 +01:00
Dimitri Fontaine	1f088791bd	Add DDL tests with non-public schema. Citus sometimes has regressions around non-default schema support, meaning not public and not in the search_path, per @marcocitus. This patch changes some regression tests to use a non-default schema in order to cover more cases.	2018-01-11 13:21:24 +01:00
Dimitri Fontaine	e010238280	Implement ALTER TABLE ... RENAME TO ... The implementation was already mostly in place, but the code was protected by a principled check against the operation. Turns out there's a nasty concurrency bug though with long identifier names, much as in #1664. To prevent deadlocks from happening, we could either review the DDL transaction management in shards and placements, or we can simply reject names with (NAMEDATALEN - 1) chars or more — that's because of the PostgreSQL array types being created with a one-char prefix: '_'.	2018-01-11 13:21:24 +01:00
Burak Velioglu	ef3517a5dc	Merge pull request #1943 from citusdata/release-6.2-changelog Add changelog entry for 6.2.5	2018-01-11 12:31:53 +03:00
velioglu	a7eccd5d9d	Add changelog entry for 6.2.5	2018-01-11 11:28:02 +03:00
Burak Velioglu	e328ff973f	Merge pull request #1929 from citusdata/citus-7.1.2-changelog Bump citus to 7.1.2	2018-01-04 14:23:22 +03:00
velioglu	70fade547b	Add changelog entry for 7.1.2	2018-01-04 13:39:27 +03:00
Hadi Moshayedi	5d7c52ffa6	Don't return in PG_TRY() block when cancellations happen in WaitForConnections(). (#1923) We shouldn't return in the middle of a PG_TRY() block because if we do, we won't reset PG_exception_stack, and later when a re-throw tries to jump to the jump-point which was active in this PG_TRY() block, it seg-faults. We used to return in the middle of a PG_TRY() block in WaitForConnections() where we checked for cancellations. Whenever cancellations were caught here, Citus crashed. An example was reported by @onderkalaci at #1903.	2018-01-03 09:54:03 -05:00
Marco Slot	8f69973411	Fix cancellation issues in the real-time executor (#1905)	2018-01-01 23:10:29 -05:00
Marco Slot	3fd65cb91b	Do not raise errors in the real-time executor (#1903)	2018-01-01 22:26:31 -05:00
Önder Kalacı	6e34a8fbf4	Merge pull request #1918 from citusdata/fix_outer_join_pushdown Outer joins should also use/try subquery pushdown planner if join clause is not supported	2017-12-29 17:56:13 +03:00
Onder Kalaci	a1bbdf2d44	Outer joins should also use subquery pushdown planner if join clause is not supported This change allows unsupported clauses to go through query pushdown planner instead of erroring out as we already do for non-outer joins.	2017-12-29 16:40:47 +02:00
Önder Kalacı	4e9d4c1bd3	Merge pull request #1894 from citusdata/unions Support set operations	2017-12-26 15:19:23 +03:00
Marco Slot	09c09f650f	Recursively plan set operations when leaf nodes recur	2017-12-26 13:46:55 +02:00
Önder Kalacı	4650418f58	Merge pull request #1907 from citusdata/add_some_tests Add some regression tests for recursively planned subqueries	2017-12-25 16:17:36 +03:00
Onder Kalaci	eb929e9001	Add some more basic regression tests, mostly for documentation purposes	2017-12-25 15:03:45 +02:00
Mehmet Furkan ŞAHİN	e92aca6fe7	Merge pull request #1901 from citusdata/errors_fix Error messages are updated after recursive planner	2017-12-25 15:27:25 +03:00
mehmet furkan şahin	446893234a	unsupported subquery error messages are fixed	2017-12-25 15:10:59 +03:00
Mehmet Furkan ŞAHİN	dcafd1368b	Merge pull request #1897 from citusdata/subquery_debug_output new debug output for subplans	2017-12-25 10:19:04 +03:00
mehmet furkan şahin	57bc86e23d	new debug output for subplans	2017-12-25 09:50:51 +03:00
Marco Slot	a2e2419ad1	Merge pull request #1878 from citusdata/log_remote_command Log remote commands sent via MultiClientSendQuery	2017-12-22 17:30:18 +01:00
Marco Slot	fa7fa2734b	Log remote commands sent via MultiClientSendQuery	2017-12-22 16:18:40 +01:00
Murat Tuncer	87c6f306f1	Fix join clause eq restrictions (#1884) We used to error out if the join clause included filters like t1.a < t2.a even if another filter like t1.key = t2.key existed. Recently we lifted that restriction in subquery planning by focusing on the equivalence classes provided by Postgres. This checkin forwards real-time queries that previously errored out due to join clauses to the subquery planner and lets it handle the join even if the query does not have a subquery. We are now pushing down queries that do not have any subqueries in them. The error message looked misleading, so it was changed to a more descriptive one.	2017-12-22 12:16:14 +03:00
Metin Döşlü	3aee73674b	Merge pull request #1895 from citusdata/no_shard_locks_for_ddls Get shard resource locks for only DMLs	2017-12-22 10:57:20 +03:00
metdos	32b7e152a3	Get shard resource locks for only DMLs	2017-12-22 10:30:41 +02:00
Murat Tuncer	a9cf0c3e66	Fix CTE column alias issue (#1893) We were creating the intermediate query result's target names from the subquery target list. Now we also check if the CTE re-defines its column name aliases, and create the intermediate result query accordingly.	2017-12-22 09:39:40 +03:00
Marco Slot	fa134984c2	Merge pull request #1806 from citusdata/deadlock-spam Don't spam the log with deadlock messages	2017-12-21 17:10:32 +03:00
Brian Cloutier	377b31dcf7	Remove enable_deadlock_prevention prevention warning	2017-12-21 14:47:52 +01:00
Marco Slot	6862cd066e	Merge pull request #1818 from citusdata/remove-strtoull-ifdef Remove an ifdef surrounding strtoull	2017-12-21 16:47:34 +03:00
Brian Cloutier	fb7b86fa14	Replace strtoull with pg_strtouint64 The macro we were using to detect strtoull isn't set on Windows, and just in case there are differences use a portable function from PG instead of calling strtoull directly.	2017-12-21 14:28:51 +01:00
Mehmet Furkan ŞAHİN	3f7e0d780e	Merge pull request #1883 from citusdata/intermediate_result_size_limitation Intermediate result size limitation	2017-12-21 14:42:54 +03:00
mehmet furkan şahin	fd546cf322	Intermediate result size limitation This commit introduces a new GUC to limit the intermediate result size which we handle when we use read_intermediate_result function for CTEs and complex subqueries.	2017-12-21 14:26:56 +03:00
Önder Kalacı	54ccfb24be	Merge pull request #1876 from citusdata/subqueries Recursively plan subqueries that are not safe to pushdown (i.e., requires merge step)	2017-12-21 09:58:35 +03:00
Onder Kalaci	e2a5124830	Add regression tests for recursive subquery planning	2017-12-21 08:37:40 +02:00
Onder Kalaci	0d5a4b9c72	Recursively plan subqueries that are not safe to pushdown With this commit, Citus recursively plans subqueries that are not safe to pushdown, in other words, subqueries that require a merge step. The algorithm is simple: Recursively traverse the query from the bottom up (i.e., starting from the leaf queries). On each level, check whether the query is safe to pushdown (or is a single repartition subquery). If the answer is yes, do not touch that subquery. If the answer is no, plan the subquery separately (i.e., create a subPlan for it) and replace the subquery with a call to `read_intermediate_results(planId, subPlanId)`. During execution, run the subPlans first, and make them available to the next query executions. Some of the queries that this change allows: * Subqueries with LIMIT * Subqueries with GROUP BY/DISTINCT on non-partition keys * Subqueries involving re-partition joins, router queries * Mixed usage of subqueries and CTEs (i.e., use CTEs in subqueries as well). Nested subqueries as long as we support the subquery inside the nested subquery. * Subqueries with local tables (i.e., those subqueries have the limitation that they have to be leaf subqueries) * VIEWs on the distributed tables just work (i.e., the limitations mentioned below still apply to views) Some of the queries that are still NOT supported: * Correlated subqueries that are not safe to pushdown * Window functions on non-partition keys * Recursively planned subqueries or CTEs on the outer side of an outer join * Only recursively planned subqueries and CTEs in the FROM (i.e., not any distributed tables in the FROM) and subqueries in WHERE clause * Subquery joins that are not on the partition columns (i.e., each subquery is individually joined on partition keys but not the upper level subquery.) * Any limitation that logical planner applies such as aggregate distincts (except for count) when GROUP BY is on non-partition key, or array_agg with ORDER BY	2017-12-21 08:37:40 +02:00
Onder Kalaci	e12ea914b9	Refactor ErrorIfQueryNotSupported to defer errors	2017-12-20 09:03:49 +02:00
Onder Kalaci	71ce42b936	Refactor RecursivelyPlanSubqueriesAndCTEs() to make it ready to work with subqueries	2017-12-20 09:03:47 +02:00
Marco Slot	393e625cb2	Merge pull request #1888 from citusdata/subplan_explain Show distributed subplan ID in EXPLAIN output	2017-12-19 22:13:11 +03:00
Marco Slot	6a6e986c2b	Add EXPLAIN regression test with subplans	2017-12-19 16:34:56 +01:00
Marco Slot	5e0539efa3	Plan CTEs when subquery pushdown is on	2017-12-19 16:34:56 +01:00
Marco Slot	44a1ea631a	Show distributed subplan ID in EXPLAIN output	2017-12-19 16:34:56 +01:00
Marco Slot	7a25ebe257	Merge pull request #1889 from citusdata/fix_my_backend_data Do not reinitialise MyBackendData on cache invalidations	2017-12-19 18:09:38 +03:00
Marco Slot	35dbacdb69	Do not reinitialise MyBackendData	2017-12-19 15:56:26 +01:00
Marco Slot	5c5bd80afc	Merge pull request #1879 from citusdata/result_in_parallel_worker Allow intermediate results to be used in parallel workers	2017-12-19 11:45:48 +03:00
Marco Slot	9b520ae194	Add test for using transaction ID in parallel worker	2017-12-19 09:30:29 +01:00
Marco Slot	af201a2f6d	Allow intermediate results to be used in parallel workers	2017-12-18 19:05:08 +01:00
Marco Slot	704828b237	Merge pull request #1869 from citusdata/result_cost Set cost estimates for read_intermediate_result	2017-12-18 16:55:28 +01:00
Marco Slot	7dab078e67	Set cost estimates for read_intermediate_result	2017-12-18 16:23:44 +01:00
Marco Slot	e49254f876	Revert "Add EXPLAIN regression test with subplans" This reverts commit `8b6d641227`.	2017-12-17 22:34:31 +01:00
Marco Slot	74bd33d0cc	Revert "Plan CTEs when subquery pushdown is on" This reverts commit `e3b953b8e3`.	2017-12-17 22:34:20 +01:00
Marco Slot	aca5f35ab9	Revert "Show distributed subplan ID in EXPLAIN output" This reverts commit `686b079272`.	2017-12-17 22:34:04 +01:00
Marco Slot	8b6d641227	Add EXPLAIN regression test with subplans	2017-12-17 22:00:25 +01:00
Marco Slot	e3b953b8e3	Plan CTEs when subquery pushdown is on	2017-12-17 21:49:36 +01:00
Marco Slot	686b079272	Show distributed subplan ID in EXPLAIN output	2017-12-16 11:32:01 +01:00
Marco Slot	36f049bdc5	Merge pull request #1866 from citusdata/count_distinct_subquery Allow count(distinct) in queries with a subquery	2017-12-15 16:05:25 +01:00
Marco Slot	ea6b98fda4	Allow count(distinct) in queries with a subquery	2017-12-15 15:24:26 +01:00
Marco Slot	fbb7d9c894	Merge pull request #1873 from citusdata/fix_partition_lock Do not take extra access exclusive lock on partitioned tables	2017-12-15 13:32:42 +01:00
Marco Slot	9ee0e68882	Do not take extra access exclusive lock on partitioned tables	2017-12-15 13:02:31 +01:00
Marco Slot	cf7dda3892	Merge pull request #1871 from citusdata/relax_from_sublink_checks Relax checks on recurring tuples in FROM with sublinks	2017-12-15 12:13:19 +01:00
Marco Slot	5a69fc1b17	Relax checks on recurring tuples in FROM with sublinks	2017-12-15 11:56:06 +01:00
Marco Slot	a64f0060ba	Reduce the frequency of FinishConnectionIO calls during COPY (#1864)	2017-12-14 13:21:59 -05:00
Marco Slot	a811aad264	Deparallelise multi_modifying_xacts tests	2017-12-14 10:27:17 +01:00
Marco Slot	c19c3ef4a1	Merge pull request #1853 from citusdata/ctes Add support for CTEs in distributed queries	2017-12-14 10:26:47 +01:00
mehmet furkan şahin	5851f71bfb	Add CTE regression tests	2017-12-14 09:32:55 +01:00
Marco Slot	fa73abe6d4	Regression test output changes after CTE support	2017-12-14 09:32:55 +01:00
Marco Slot	2e2b4e81fa	Add support for CTEs in distributed queries	2017-12-14 09:32:55 +01:00
Marco Slot	d0335ec818	Send BEGIN for SELECTs in the router executor	2017-12-14 09:32:55 +01:00
Marco Slot	cbbd418af2	Add citus.copy_format OIDs to metadata cache	2017-12-14 09:32:55 +01:00
Marco Slot	66f9f1d6cd	Make some intermediate results functions public	2017-12-14 09:32:55 +01:00
Marco Slot	36ee21c323	Make CanUseBinaryCopyFormatForType public	2017-12-14 09:32:55 +01:00
Marco Slot	7d1191954d	Add DistributedSubPlan node	2017-12-14 09:32:55 +01:00
Önder Kalacı	b5784ca03a	Merge pull request #1852 from citusdata/group_by_on_function Treat recurring tuples as reference tables for GROUP BY checks	2017-12-13 16:37:31 +03:00
Onder Kalaci	86b2d9420c	Treat recurring tuples as reference table for GROUP BY checks read_intermediate_results() and immutable functions are implemented. Empty join trees seem not applicable here.	2017-12-13 14:55:42 +02:00
Marco Slot	f0851257fa	Merge pull request #1867 from citusdata/fix_analyze_block Fix issue with multiple ANALYZE in transaction block	2017-12-12 10:54:27 +01:00
Marco Slot	d1a470a52e	Fix issue with multiple ANALYZE in transaction block	2017-12-12 10:28:48 +01:00
Mehmet Furkan ŞAHİN	84957fe6e7	Merge pull request #1861 from citusdata/new_guc_to_allow_task_executor_swap New guc to allow automated task executor swap	2017-12-11 09:48:42 +03:00
mehmet furkan şahin	3c941aedf1	adds citus.enable_repartition_joins GUC The new GUC allows Citus to switch between task executors when necessary	2017-12-11 09:36:37 +03:00
Marco Slot	7544e91c87	Merge pull request #1860 from citusdata/needs_distributed_planning Allow queries with local tables in NeedsDistributedPlanning	2017-12-08 10:11:07 +01:00
Marco Slot	5895c88552	Add materialized view regression tests	2017-12-07 16:20:23 +01:00
Marco Slot	60a1e31671	Allow queries with local tables in NeedsDistributedPlanning	2017-12-07 16:20:23 +01:00
Marco Slot	d71d519672	Merge pull request #1857 from citusdata/fix_intermediate_result Use proper schema in read_intermediate_result signature	2017-12-07 14:20:10 +01:00
Marco Slot	f8550b8c85	Fix issues with read_intermediate_result signature	2017-12-07 13:47:56 +01:00
Marco Slot	d8fea4efb8	Revert "Allow queries with local tables in NeedsDistributedPlanning" This reverts commit `d2bac081e8`.	2017-12-07 11:19:11 +01:00
Marco Slot	d2bac081e8	Allow queries with local tables in NeedsDistributedPlanning	2017-12-07 11:02:16 +01:00
Önder Kalacı	3ceb15ccdf	Merge pull request #1851 from citusdata/fix_annoying_bug Fix bug related to not incrementing an index properly	2017-12-07 10:33:37 +03:00
Onder Kalaci	c42a92afd2	Fix bug related to not incrementing an index properly	2017-12-07 08:50:57 +02:00
Marco Slot	d336167313	Merge pull request #1856 from citusdata/create_drop_deadlock Avoid deadlock with DROP TABLE in ColocatedTableId	2017-12-06 12:03:52 +01:00
Marco Slot	eab15aa035	Avoid deadlock in ColocatedTableId	2017-12-06 11:49:34 +01:00
Metin Döşlü	75eff340e1	Merge pull request #1854 from citusdata/fix_valgrind_tests Increase sleep time in a regression test to give Valgrind tests enough time	2017-12-05 16:36:34 +03:00
metdos	12d5974d97	Increase sleep time in a regression test to give Valgrind tests enough time	2017-12-05 14:59:37 +02:00
Marco Slot	98522d8d7f	Merge pull request #1829 from citusdata/intermediate_result Add infrastructure for moving around intermediate results	2017-12-04 15:02:53 +01:00
Marco Slot	7279d42849	Treat read_intermediate_result as recurring tuples	2017-12-04 14:50:11 +01:00
Marco Slot	716448ddef	Add regression tests for intermediate results	2017-12-04 14:50:11 +01:00
Marco Slot	4cdadfcab6	Add intermediate results infrastructure	2017-12-04 14:50:11 +01:00
Marco Slot	bfcc76df69	Make several COPY-related functions public	2017-12-04 13:12:03 +01:00
Marco Slot	73989b07eb	Refactor query execution functions	2017-12-04 13:12:03 +01:00
Murat Tuncer	2d66bf5f16	Fix hard coded formatting strings for 64 bit numbers (#1831) Postgres provides OS-agnostic formatting macros for formatting 64 bit numbers. Replaced %ld %lu with INT64_FORMAT and UINT64_FORMAT respectively. Also found some incorrect usages of formatting flags and fixed them.	2017-12-04 14:11:06 +03:00
Burak Velioglu	f77f8c30dc	Merge pull request #1845 from citusdata/test_release_71 Add CHANGELOG entry for 7.1.1	2017-12-01 13:53:47 +03:00
Burak	01a3e7414f	Add CHANGELOG entry for 7.1.1	2017-12-01 12:01:06 +03:00
Hadi Moshayedi	ff706cf556	Test that COPY blocks UPDATE/DELETE/INSERT...SELECT when rep factor 2.	2017-11-30 14:52:29 -05:00
Marco Slot	acbc0fe0de	Use RowExclusiveLock shard resource lock in COPY	2017-11-30 09:15:45 -05:00
Önder Kalacı	b685dfa99f	Merge pull request #1838 from citusdata/fix_common_eq_class The common attribute equivalence class should always include the input relations	2017-11-30 17:14:04 +03:00
Onder Kalaci	a273711500	The common attribute equivalence class always includes the input relations We added the ability to filter out the planner restriction information for specific parts of the query. This might lead to situations where the common restriction includes some other relations that we're searching for. The reason is that while filtering for join restrictions, we add the restriction as soon as we find the relation. With this commit we make sure that the common attribute equivalence class always includes the input relations.	2017-11-30 16:00:26 +02:00
Marco Slot	8cb5734481	Merge pull request #1841 from citusdata/send_begin Send begin in real-time executor when in a coordinated transaction	2017-11-30 13:20:32 +01:00
Marco Slot	0d6a7f5884	Add real-time BEGIN regression tests	2017-11-30 12:59:09 +01:00
Marco Slot	d6dd0b3a81	Send BEGIN in the real-time executor when in a transaction	2017-11-30 12:59:09 +01:00
Marco Slot	581c8c02cc	Merge pull request #1840 from citusdata/remove_filter_checks Remove filter checks on leaf queries	2017-11-30 12:52:11 +01:00
Marco Slot	3a4d5f8182	Remove filter checks on leaf queries	2017-11-30 12:25:14 +01:00
Marco Slot	7b8f13cf35	Merge pull request #1839 from citusdata/union_joins Support UNION with joins in the subqueries	2017-11-30 10:53:54 +01:00
Marco Slot	3f03cb6a6a	Support UNION with joins in the subqueries	2017-11-30 10:37:56 +01:00
Burak Velioglu	906dadddb7	Merge pull request #1785 from citusdata/real_time_xact Make real-time executor work in transactions (and fix pg_partman)	2017-11-30 10:21:29 +03:00
Marco Slot	a9933deac6	Make real time executor work in transactions	2017-11-30 09:59:32 +03:00
Jason Petersen	73cadbecd6	Merge pull request #1836 from citusdata/fix_vacuum_analyze_propagation Ensure VACUUM/ANALYZE stays local when unsupported or DDL prop disabled cr: @pykello	2017-11-29 16:36:46 -08:00
mehmet furkan şahin	6041f85b70	Add tests for non-propagated VACUUM/ANALYZE	2017-11-29 16:06:50 -07:00
Jason Petersen	0eacf6bd95	Refactor VacuumStmt checker to be single-return Decided this would be safer for the future (defaults to unsupported).	2017-11-29 16:06:50 -07:00
Jason Petersen	b12e77ab0e	Ensure unsupported VACUUMs don't go to workers Apparently these two blocks have been incorrect for nearly a year…	2017-11-29 16:06:50 -07:00
Marco Slot	878d8192c4	Merge pull request #1835 from citusdata/zero_shard Round-robin over worker nodes for 0-shard router queries	2017-11-29 18:49:46 +01:00
Marco Slot	7ea718fd8d	Round-robin over worker nodes for 0-shard router queries	2017-11-29 15:52:22 +01:00
Marco Slot	ae67fa0e52	Do not run multi_mx_modifications in parallel with multi_mx_transaction_recovery	2017-11-29 15:35:21 +01:00
Mehmet Furkan ŞAHİN	198438978e	Merge pull request #1826 from citusdata/regression_data_ax Regression data is reduced from 10K to 100 for events_table and users_table	2017-11-28 15:16:03 +03:00
mehmet furkan şahin	b6eb0c2823	multi_subquery_behavioral_analytics.sql query fix by adding proper order by	2017-11-28 14:15:46 +03:00
mehmet furkan şahin	1b06b2b306	The data used in regression tests is reduced This commit reduces the size of the data in users_table.data and events_table.data from 10K rows to 100 rows.	2017-11-28 14:15:46 +03:00
Önder Kalacı	74b9bc409c	Merge pull request #1833 from citusdata/granular_subquery_pushdown Refactor relation restriction equivalence checks to be more granular for subqueries	2017-11-28 11:56:26 +03:00
Onder Kalaci	05fb0dd020	Add infrastructure for filtering restriction contexts based on the input query In subquery pushdown, we first ensure that each relation is joined with at least one other relation on the partition keys. That's fine given that the decision is binary: pushdown the query at all or not. With recursive planning, we'd want to check whether any specific part of the query can be pushed down or not. Thus, we need the ability to understand which part(s) of the subquery are safe to pushdown. This commit adds the infrastructure for doing that.	2017-11-28 09:58:21 +02:00
Onder Kalaci	26d9b58e9e	Make sure that ExtractRangeTableRelationWalker never misses RTE_RELATION	2017-11-28 09:27:34 +02:00
Onder Kalaci	32def06ebd	Split assigning RTE identities and partitioning related query modifications Note that we used to iterate over the RTEs once for performance reasons. However, keeping an extra copy of original query seems more costly and hard to maintain/explain.	2017-11-28 09:27:34 +02:00
Marco Slot	271b9392e2	Merge pull request #1834 from citusdata/function_pushdown Subqueries containing functions go through subquery pushdown	2017-11-27 22:24:48 +01:00
Marco Slot	feffe86440	Subqueries containing functions go through subquery pushdown	2017-11-27 22:13:02 +01:00
Önder Kalacı	8877a68a1f	Merge pull request #1827 from citusdata/allow_non_equi_joins Enable non equi joins in subquery pushdown	2017-11-23 17:51:02 +03:00
Onder Kalaci	48f96bf3e5	Enable non equi joins in subquery pushdown Subquery pushdown planning is based on relation restriction equivalence. This gives us the opportunity to allow any other joins as long as there is already an equi join between the distributed tables. We already allow that for joins with reference tables and this commit allows that for joins among distributed tables.	2017-11-23 16:13:46 +02:00
Mehmet Furkan ŞAHİN	ae2c86dbdd	Merge pull request #1823 from citusdata/regression_parallelization Regression parallelization - PART 2	2017-11-23 14:37:33 +03:00
mehmet furkan şahin	032b34ea52	some more parallelization	2017-11-23 14:10:42 +03:00
Önder Kalacı	309ba9f0d6	Merge pull request #1825 from citusdata/register_custom_scans Register custom scans	2017-11-23 13:00:58 +03:00
Onder Kalaci	16421f089f	Register citus custom scan nodes	2017-11-23 11:38:33 +02:00
Onder Kalaci	83c1143505	Refactor custom scan related code In this commit, we don't change any code, only create a new file and move the related functions and types there.	2017-11-23 11:38:12 +02:00
Marco Slot	0799527d14	Merge pull request #1574 from citusdata/auto_2pc_recovery Auto-recover 2PCs, enable 2PC by default	2017-11-23 09:35:38 +01:00
Marco Slot	20a526d5c4	Fix memory leak in ListToHashSet	2017-11-22 11:26:58 +01:00
Marco Slot	f4ceea5a3d	Enable 2PC by default	2017-11-22 11:26:58 +01:00
Marco Slot	8486f76e15	Auto-recover 2PC transactions	2017-11-22 11:26:58 +01:00
Marco Slot	64a5d5da22	Merge pull request #1814 from citusdata/rename_multiplan Rename MultiPlan to DistributedPlan	2017-11-22 09:57:55 +01:00
Marco Slot	6ba3f42d23	Rename MultiPlan to DistributedPlan	2017-11-22 09:36:24 +01:00
Marco Slot	e3bd34727f	Merge pull request #1805 from citusdata/immutable_functions Support immutable table functions as reference tables	2017-11-21 14:48:30 +01:00
Marco Slot	0ad39b36fe	Treat immutable table functions and constant subqueries as reference tables	2017-11-21 14:15:22 +01:00
Önder Kalacı	46c9922def	Merge pull request #1816 from citusdata/fix_reference_regression Relax the checks for ensuring distribution columns in the target list	2017-11-21 16:07:14 +03:00
Onder Kalaci	d558ebb923	Relax the checks on ensuring distribution columns for target entries With this commit, we allow pushing down subqueries with only reference tables where GROUP BY or DISTINCT clause or Window functions include only columns from reference tables.	2017-11-21 12:28:14 +02:00
Andres Freund	d063658d6d	Protect some initializations from being called during backend startup. On EXEC_BACKEND builds these functions shouldn't be called at every backend start.	2017-11-20 15:29:51 -08:00
Brian Cloutier	d267e0f9fa	EXEC_BACKEND: don't put pointers to shared hashes into shared memory Store pointers to shared hashes in process-local variables. Previously pointers to shared hashes were put into shared memory. This causes problems on EXEC_BACKEND because everybody calls execve and receives a brand new address space; the shared hash will be in a different place for every backend. (normally we call fork, which gives you a copy of the address space, so these pointers remain constant)	2017-11-20 15:29:51 -08:00
Brian Cloutier	30a2365d81	Rename CreateDirectory to CitusCreateDirectory	2017-11-20 14:38:26 -08:00
Brian Cloutier	aa2ab023a2	Rename RemoveDirectory -> CitusRemoveDirectory	2017-11-20 14:21:52 -08:00
Brian Cloutier	06f756b0a1	Rename DeleteFile -> CitusDeleteFile	2017-11-20 13:30:11 -08:00
Mehmet Furkan ŞAHİN	4f3f30f939	Merge pull request #1817 from citusdata/regression_parallelization Regression parallelization - Part 1	2017-11-20 19:14:48 +03:00
mehmet furkan şahin	34709c2a16	Regression tests parallelization PART-1	2017-11-20 18:03:37 +03:00
Marco Slot	7b3b59c278	Merge pull request #1696 from citusdata/fast_recovery Rewrite recover_prepared_transactions to be faster, non-blocking	2017-11-20 13:50:32 +00:00
Marco Slot	9793218122	Do not commit already-committed prepared transactions in recovery	2017-11-20 13:18:48 +01:00
Marco Slot	fe798cf0f9	Add recovery vs. recovery isolation test	2017-11-20 12:26:25 +01:00
Marco Slot	ae47df01ea	Observe prepared xacts twice in RecoverWorkerTransactions to avoid race condition	2017-11-20 11:44:08 +01:00
Marco Slot	2410c2e450	Rewrite recover_prepared_transactions to be fast, non-blocking	2017-11-20 11:27:40 +01:00
Mehmet Furkan ŞAHİN	785d94e828	Merge pull request #1810 from citusdata/regression_speedup Reduces default shard count in regression tests from 32 to 4	2017-11-20 13:05:12 +03:00
mehmet furkan şahin	314fc09d90	regression test shard_count is changed from 32 to 4	2017-11-20 12:47:49 +03:00
Mehmet Furkan ŞAHİN	59242383be	Merge pull request #1798 from citusdata/isolation_tests_improve Increases the coverage of the isolation tests by adding some of the concurrency tests	2017-11-20 12:44:08 +03:00
mehmet furkan şahin	8d55754b4d	the tests are separated and some more added	2017-11-20 11:45:48 +03:00
mehmet furkan şahin	636faadc47	create_distributed_table vs create_distributed_table, master_append_table_to_shard vs master_apply_delete_command, master_apply_delete_command vs master_apply_delete_command are added	2017-11-20 11:45:48 +03:00
mehmet furkan şahin	0722334e50	concurrent master_append_table test is added	2017-11-20 11:45:48 +03:00
mehmet furkan şahin	f45988962f	multi-shard update affecting the same/different rows	2017-11-20 11:45:48 +03:00
Önder Kalacı	666e37273a	Merge pull request #1809 from citusdata/get_rid_of_false_positives Get rid of some of the false positive distributed deadlocks	2017-11-15 16:13:57 +03:00
Onder Kalaci	5bea95009b	Skip autovacuum processes for distributed deadlock detection An autovacuum process cancels itself if any modification starts on the table in order to avoid blocking your regular Postgres sessions. That's normal and expected. Thus, any locks held by an autovacuum process cannot be involved in a distributed deadlock since they'll be released if needed.	2017-11-15 14:32:16 +02:00
Onder Kalaci	c65c153a46	Skip speculative locks for distributed deadlock detection These locks are held for a very short duration and cannot contribute to a deadlock. Speculative locks are used by Postgres as an internal notification mechanism among transactions.	2017-11-15 12:43:45 +02:00
Marco Slot	86a70515c5	Merge pull request #1808 from citusdata/bump_72 Bump Citus version to 7.2devel	2017-11-15 11:27:56 +01:00
Marco Slot	bbbadd6d1b	Bump Citus version to 7.2devel	2017-11-15 10:32:49 +01:00
Marco Slot	f1a05fdc14	Merge pull request #1750 from citusdata/next_shard_id Set shard and placement IDs in regression tests using a GUC	2017-11-15 10:30:35 +01:00
Marco Slot	ea306c6cfe	Use citus.next_placement_id where practical in regression tests	2017-11-15 10:12:06 +01:00
Marco Slot	d3b634b301	Allow generating placement IDs without using the sequence	2017-11-15 10:12:06 +01:00
Marco Slot	89eb833375	Use citus.next_shard_id where practical in regression tests	2017-11-15 10:12:05 +01:00
Marco Slot	c24a0875a5	Allow generating shard IDs without using the sequence	2017-11-15 10:12:05 +01:00
Brian Cloutier	0f3230170f	Pull in INT32_MAXINT and INT32_MININT	2017-11-14 14:03:46 -08:00
Brian Cloutier	0db8277266	remove unused errno import	2017-11-14 13:09:34 -08:00
Brian Cloutier	5d9f3ae7fd	Remove unused poll import from multi_real_time_executor	2017-11-14 13:09:34 -08:00
Marco Slot	ee9d24f77e	Merge pull request #1746 from citusdata/drop_sequence_fix Only drop sequences on workers with metadata	2017-11-14 16:33:50 +01:00
Marco Slot	533a533565	Only drop sequences on workers with metadata	2017-11-14 16:01:56 +01:00
Burak Velioglu	c7c7a33901	Merge pull request #1802 from citusdata/changelog_71 Add 7.1 changelog	2017-11-14 15:49:32 +03:00
velioglu	7b9009fe36	Add 7.1 changelog	2017-11-14 15:37:14 +03:00
Metin Döşlü	02ee714c10	Merge pull request #1790 from citusdata/add_stub_udf Add stub UDFs to run pg_upgrade flawlessly	2017-11-13 15:30:45 +02:00
velioglu	be28ba8e70	Add stub UDF to run pg_upgrade flawlessly	2017-11-13 16:14:45 +02:00
Metin Döşlü	a4d3002b04	Merge pull request #1782 from citusdata/warn_on_cluster_command Warn on CLUSTER command for distributed tables	2017-11-10 11:31:08 +02:00
metdos	111c04c2bd	Warn on CLUSTER command for distributed tables	2017-11-10 12:14:45 +02:00
Burak Yücesoy	cf7a4ae608	Merge pull request #1774 from citusdata/fix_partitioning_in_schema Fix attaching partition to a distributed table in schema	2017-11-09 12:49:55 +02:00
Burak Yücesoy	863df0b874	Merge branch 'master' into fix_partitioning_in_schema	2017-11-09 12:49:35 +02:00
Burak Yucesoy	17229ed7bd	Fix attaching partition to a distributed table in schema While attaching a partition to a distributed table in schema, we mistakenly used unqualified name to find partitioned table's oid. This caused problems while using partitioned tables with schemas. We are fixing this issue in this PR.	2017-11-09 13:20:29 +03:00
Önder Kalacı	14ff31704c	Merge pull request #1777 from citusdata/skip_page_locks Skip page-level locks for distributed deadlock detection	2017-11-09 13:01:21 +03:00
Onder Kalaci	94921a2be1	Skip page-level locks on distributed deadlock detection Short-term share/exclusive page-level locks are used for read/write access. Locks are released immediately after each index row is fetched or inserted. Since those locks may not lead to any deadlocks, it's safe to ignore them in the distributed deadlock detection.	2017-11-09 10:37:23 +02:00
Marco Slot	a85a973c3e	Merge pull request #1768 from citusdata/sslmode_guc Add GUC for specifying sslmode in connections to workers	2017-11-08 14:42:26 +01:00
Marco Slot	f71728f634	Add GUC for specifying sslmode in connections to workers	2017-11-08 14:15:58 +01:00
Murat Tuncer	4e3d633ebf	Add check for connection failures during multishard update (#1765 )	2017-11-07 12:33:25 +02:00
Metin Döşlü	731b1254f9	Merge pull request #1764 from citusdata/relcache_leak Fix a relcache reference leak in stats collection.	2017-11-07 11:59:00 +02:00
Hadi Moshayedi	6d79d25101	Fix a relcache reference leak in stats collection. In DistributedTablesSize() we didn't close the relations that had replication factor > 2. This caused relcache reference leaks, and warning messages like the following in logs: WARNING: relcache reference leak: relation "researchers" not closed	2017-11-06 23:16:43 -05:00
Metin Döşlü	5984e7f009	Merge pull request #1762 from citusdata/check_connection_status Check connection status before using it	2017-11-06 15:50:37 +02:00
metdos	c83edc36b5	Check connection status before using it	2017-11-06 14:53:35 +02:00
Murat Tuncer	2332678793	Fix intermittent test failures on count distinct tests (#1761 ) Added analyze to test which seem to force planner to use index.	2017-11-06 10:58:46 +02:00
Brian Cloutier	7be1545843	Support implicit casts during INSERT/SELECT It's possible to build INSERT SELECT queries which include implicit casts. Currently we attempt to support these by adding explicit casts to the SELECT query, but this sometimes crashes because we don't update all nodes with the new types. (SortClauses, for instance) This commit removes those explicit casts and passes an unmodified SELECT query to the COPY executor (how we implement INSERT SELECT behind the scenes). In lieu of those casts, COPY has been given some extra logic to inspect queries, notice that the types don't line up with the table it's supposed to be inserting into, and "manually" cast every tuple before sending them to workers.	2017-11-03 22:27:15 -07:00
Metin Döşlü	074ae766de	Merge pull request #1751 from citusdata/build_custom_pg_at_push_builds Use custom compiled PostgreSQL on push builds	2017-11-03 11:34:07 +02:00
Marco Slot	55baf0e754	Merge pull request #1759 from citusdata/partitioned_tables Allow distributed partitioned table creation in Cloud	2017-11-03 10:22:22 +01:00
Marco Slot	6883a09cdd	Allow distributed partitioned table creation in Cloud	2017-11-03 10:09:18 +01:00
Marco Slot	6219186683	Allow distributed INSERT...SELECT via worker nodes in MX	2017-11-02 14:38:39 +01:00
Burak Yucesoy	3cf1ef06c0	Use custom compiled PostgreSQL on push builds Previously we compiled PostgreSQL for PR builds and used already compiled PostgreSQL packages for push builds. Compiling PostgreSQL allows us to run isolation/vanilla tests. Thus we only ran those tests for PR builds. We did not use compiled PostgreSQL for both builds because: - We wanted to run our tests against packages - Compiling takes too much time. However, there are some benefits in using custom compiled PostgreSQL in push builds instead of PR builds, such as: - At the moment, we do not run isolation/vanilla tests until we open a PR, which does not make any sense. - After merging a PR (i.e. after push to the master) push builds run and we do not run isolation tests. - While merging community to enterprise, we have to open a temporary PR to make travis run isolation/vanilla tests. - With this change master branch will have its own cache, which means every branch originating from master branch can re-use master branch's cache. (Currently we cannot use cache for the first PR build) With this PR we switch from using custom compiled PostgreSQL for PR builds to using it for push builds.	2017-11-01 06:29:59 -07:00
Hadi Moshayedi	7280774cf4	Use list_length() != 1 in SingleReplicatedTable(). ShardPlacementList's implementation can return NIL. In previous implementation we got a segmentation fault in this case. The relation can be dropped after getting distributed table list but before calling SingleReplicatedTable().	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	7691991cb5	Do PG_TRY() inside a subtransaction block. If we don't propagate the errors we are catching in PG_CATCH(), database's internal state might not be clean. So we do PG_TRY() inside a subtransaction so we can rollback to it after catching errors.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	9bfbbf8a04	Make reports hostname configurable and enable stats collection in tests. This patch adds --with-reports-host configure option, which sets the REPORTS_BASE_URL constant. The default is reports.citusdata.com. It also enables stats collection in tests.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	acaf085a80	Add callback function for request by CollectBasicUsageStatistics(). Curl writes the received response to stdout if we don't specify a response callback or an output file. This can pollute the PostgreSQL log. In this change we add a callback function so the response messages aren't added to the log file.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	747e439601	Limit number of stats collection retries to once a day.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	78a2cd9052	Check for Citus updates. Sends a request to /v1/releases/latest?flavor=$CITUS_EDITION once a day, which returns a response similar to {"version": "7.1.0", "major": 7, "minor": 1, "patch": 0}. Then compares it with current Citus version, and if the latest release is newer, logs a LOG message.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	34f3ec0961	Call FlushDistTableCache() before stats collection.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	c18c6625d9	Lock relations before calling citus_table_size(). This is to make sure they don't get dropped.	2017-10-31 21:51:43 -04:00
Hadi Moshayedi	97d544b75c	Follow the patterns used in Deadlock Detection in Stats Collection. This includes: (1) Wrap everything inside a StartTransactionCommand()/CommitTransactionCommand(). This is so we can access the database. This also switches to a new memory context and releases it, so we don't have to do our own memory management. (2) LockCitusExtension() so the extension cannot be dropped or created concurrently. (3) Check CitusHasBeenLoaded() && CheckCitusVersion() before doing any work. (4) Do not PG_TRY() inside a loop.	2017-10-31 21:51:43 -04:00
Marco Slot	100aaeb3f5	Fix typo in distributed deadlock error message	2017-10-31 19:39:32 +01:00
Metin Döşlü	d7171838d7	Merge pull request #1749 from citusdata/fix_reference_table_insert_into_select Don't try to add restrictions for reference tables in insert into select	2017-10-31 19:01:36 +02:00
metdos	8c356b2bc8	Don't try to add restrictions for reference tables in insert into select	2017-10-31 19:44:10 +02:00
Mehmet Furkan ŞAHİN	ca24713b1a	Merge pull request #1739 from citusdata/add_const_prim_key_using_ind Add constraint %s primary key using index %s support	2017-10-31 16:16:49 +03:00
mehmet furkan şahin	32fb19911c	Add Constraint %s Add Primary Key Using index %s support This commit makes a change in relay_event_utility.c to check if the Alter Table command adds a constraint using index. If this is the case, it appends the shard id to the index name.	2017-10-31 16:03:56 +03:00
Marco Slot	c69d8bf4a6	Merge pull request #1743 from citusdata/add_shard_move_mode_param Add shard transfer mode parameter to shard copy functions	2017-10-31 13:54:37 +01:00
Marco Slot	7e34348334	Add shard transfer mode parameter to shard copy functions	2017-10-31 13:30:48 +01:00
Marco Slot	bc3bdeaac8	Merge pull request #1737 from citusdata/fix_writeability_checks Fix issues in WaitForAllConnections	2017-10-31 13:16:06 +01:00
Marco Slot	2bb46bb5ee	Reset connectionReady flag after moving a connection in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	e6e6897499	Defer initial PQflush to main loop in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Marco Slot	d6dadb1b25	Use correct index for ModifyWaitEvent in WaitForAllConnections	2017-10-31 12:06:53 +01:00
Mehmet Furkan ŞAHİN	d85a3952f5	Merge pull request #1734 from citusdata/create_table_replica_identity create_distributed_table minds the replica identity	2017-10-31 13:31:28 +03:00
Furkan Sahin	2b39c52f0b	Replica identity on create_distributed_table By this commit, citus minds the replica identity of the table when we distribute the table. So the shards of the distributed table have the same replica identity with the local table.	2017-10-31 13:08:36 +03:00
Marco Slot	5661062a69	Merge pull request #1687 from citusdata/omit_public_shard_name Omit public schema from shard_name output	2017-10-31 09:33:48 +01:00
Marco Slot	7f68f78ee9	Omit public schema from shard_name output	2017-10-31 00:22:07 +01:00
Murat Tuncer	e16805215d	Support count(distinct) for non-partition columns (#1692 ) Expands count distinct coverage by allowing more cases. We used to support count distinct only if we could push down the distinct aggregate to the worker query, i.e. the count distinct clause was on the partition column of the table, or there was a grouping on the partition column. Now we can support - non-partition columns, with or without grouping on partition column - partition, and non partition column in the same query - having clause - single table subqueries - insert into select queries - join queries where count distinct is on partition, or non-partition column - filters on count distinct clauses (extends existing support) We first try to push down the aggregate to the worker query (original case); if we can't, then we modify the worker query to return distinct columns to the coordinator node. We do that by adding distinct column targets to group by clauses. Then we perform the count distinct operation on the coordinator node. This work should reduce the cases where HLL is used as it can address anything that HLL can. However, if we start having performance issues due to a very large number of rows, then we can recommend hll use.	2017-10-30 13:12:24 +02:00
Marco Slot	6bb4cf998e	Merge pull request #1725 from citusdata/restore_point_2pc Block only 2PCs instead of all writes in citus_create_restore_point	2017-10-27 00:19:54 +02:00
Marco Slot	be46661bf7	Block only 2PCs instead of all writes in citus_create_restore_point	2017-10-27 00:07:32 +02:00
Mehmet Furkan ŞAHİN	58a511a791	Merge pull request #1731 from citusdata/alter_table_replication_identity ALTER TABLE .. REPLICA IDENTITY support is implemented	2017-10-26 14:18:18 +03:00
mehmet furkan şahin	83ac84d594	order by and unnest are added to multi_colocation_utils tests	2017-10-26 13:44:28 +03:00
mehmet furkan şahin	61ae33dc7f	ALTER TABLE .. REPLICA IDENTITY support is implemented	2017-10-26 13:44:28 +03:00
Brian Cloutier	4a17d12d74	Replace uint with uint32	2017-10-25 19:32:12 -07:00
Burak Velioglu	fade7c1667	Merge pull request #1690 from citusdata/update_delete_multiple_shard Support multi shard update/delete queries	2017-10-25 16:06:12 +03:00
velioglu	0b5db5d826	Support multi shard update/delete queries	2017-10-25 15:52:38 +03:00
Marco Slot	8dd85f5c5a	Merge pull request #1724 from citusdata/relay_dml_error Relay error message if DML fails on worker	2017-10-25 14:38:18 +02:00
Marco Slot	4bde83e1d2	Relay error message if DML fails on worker	2017-10-25 14:23:21 +02:00
Hadi Moshayedi	9a04b78980	Send server_id for statistics reports. (#1698 ) This change introduces the `pg_dist_node_metadata` which has a single jsonb value. When creating the extension, a random server id is generated and stored in there. Everything in the metadata table is added as a nested object to the json payload that is sent to the reports server.	2017-10-18 21:20:32 -04:00
Hadi Moshayedi	86bcd93a4a	Don't collect stats when there is a version mismatch. (#1712 ) The following scenario can cause an Assert() crash if we don't do this: - Install Citus v7.0-15 - Restart server & run a query to start maintenanced. - Install Citus v7.1 - Restart server & run a query. This will tell user to upgrade. - Type "UPDATE EXTENSION c" & press tab. maintenanced will start and crash with Assert(CitusHasBeenLoaded() && CheckCitusVersion(WARNING)); This change checks Citus version before calling metadata functions so the crash doesn't happen.	2017-10-17 14:01:14 -04:00
Jason Petersen	e4072a3dbb	Merge pull request #1700 from citusdata/improve_build_metadata Improve build metadata generated by configure cr: @pykello	2017-10-16 18:29:05 -06:00
Jason Petersen	f2c593b25c	Add CITUS_NAME and CITUS_EDITION Unambiguous places to check whether we're running simply Citus or Citus Enterprise, or to check for 'community' or 'enterprise'.	2017-10-16 18:09:29 -06:00
Jason Petersen	8544878c4b	Add citus_version(), analogous to PG's version() This will provide the full project name (i.e. Citus/Citus Enterprise), and the host system, compiler, and architecture word size. I wanted to limit the number of copied files in 'config', so I added only config.guess and call it manually, rather than using the macro AC_CANONICAL_HOST, which requires several other files.	2017-10-16 18:09:29 -06:00
Jason Petersen	339d0d6dc7	Add with-extra-version support to configure PostgreSQL has it, now we do too! Example: `./configure --with-extra-version=+git.20171011.a1387f4` would currently result in: `` reporting the more useful: SHOW citus.version; citus.version ------------------------------- 7.1devel+git.20171011.a1387f4 Nice to have for packaging, one-off customer builds, etc. This stuff is generally already in the package metadata, but it will be nice to have it directly within a psql session.	2017-10-16 18:09:29 -06:00
Brian Cloutier	91ff8cd2d5	{*,}create_distributed_table doesn't emit OID (#1710 )	2017-10-16 18:08:51 -06:00
Brian Cloutier	ebcb2b65e9	Add master_move_node function	2017-10-16 10:51:28 -07:00
Burak Yücesoy	4a155bfccf	Merge pull request #1707 from citusdata/add_changelog_entry_for_703 Add CHANGELOG entry for 7.0.3	2017-10-16 00:49:18 -08:00
Burak Yucesoy	0cf63039b4	Add CHANGELOG entry for 7.0.3	2017-10-16 11:40:51 +03:00
Brian Cloutier	58cf15ceca	DistributedTableSize doesn't emit oid when erring out	2017-10-14 02:42:57 +03:00
Hadi Moshayedi	2aec6eda49	Properly use #ifdef HAVE_LIBCURL.	2017-10-13 12:04:36 -06:00
Jason Petersen	01353cb7cb	Use header define rather than -D flag Eclipse apparently doesn't scan build output looking for -D flags, so having the value actually appear in a header is nicer for those of us using IDEs.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	946659aebe	Delete StatsCollection memory context after we are done with stats reporting. Previously we left the memory context untouched, which leaked memory over time.	2017-10-13 11:00:09 -04:00
Hadi Moshayedi	873fd1e7ff	Fix compiling --without-libcurl. Previously <curl/curl.h> was included even if compiled --without-libcurl. This can fail when libcurl headers are not there. This commit guards this include by checks for HAVE_LIBCURL.	2017-10-13 11:00:09 -04:00
Murat Tuncer	4832abc7cb	Make multi_master_planner.c coding convention compliant Changed order of function definitions and added declarations in the beginning of the file	2017-10-13 14:59:48 +03:00
Murat Tuncer	f7ab901766	Add select distinct, and distinct on support Distinct, and distinct on() clauses are supported in simple selects, joins, subqueries, and insert into select queries.	2017-10-13 14:59:48 +03:00
Hadi Moshayedi	6879f92e23	Fix out-of-bounds memory access when getting HTTP response code. (#1699 )	2017-10-12 12:51:42 -04:00
Hadi Moshayedi	a1387f4aa8	Basic usage statistics collection. (#1656 ) Adds ```citus.enable_statistics_collection``` GUC variable, which is ```true``` by default, unless built without libcurl. If statistics collection is enabled, sends basic usage data to Citus servers every 24 hours. The data that is collected consists of: - Citus version - OS name & release - Hardware Id - Number of tables, rounded to next power of 2 - Size of data, rounded to next power of 2 - Number of workers	2017-10-11 09:55:15 -04:00
Mehmet Furkan ŞAHİN	e202c51fec	Merge pull request #1668 from citusdata/window_function_preliminary_implementation Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT	2017-10-04 17:18:26 +03:00
Onder Kalaci	498ac80d8b	Add window function support for SUBQUERY PUSHDOWN and INSERT INTO SELECT This commit provides the support for window functions in subquery and insert into select queries. Note that our support for window functions is still limited because it must have a partition by clause on the distribution key. This commit makes changes in the files insert_select_planner and multi_logical_planner. The required tests are also added with files multi_subquery_window_functions.out and multi_insert_select_window.out.	2017-10-04 15:33:07 +03:00
Marco Slot	f6b43d81ec	Merge pull request #1684 from citusdata/remove_binary_copy_tests Remove separate citus.binary_worker_copy_format regression tests	2017-10-04 13:33:56 +02:00
Marco Slot	24915779d1	Remove separate citus.binary_worker_copy_format regression tests	2017-10-03 17:44:50 +02:00
Marco Slot	f58c695d0f	Merge pull request #1682 from citusdata/fix_2pc_id Use local group ID when querying for prepared transactions	2017-10-03 17:44:34 +02:00
Marco Slot	9e516513fc	Use local group ID when querying for prepared transactions	2017-10-03 16:36:53 +02:00
Hadi Moshayedi	11adb9b034	Push down LIMIT and HAVING when grouped by partition key. (#1641 ) We can do this because all rows belonging to a group are in the same shard when grouping by distribution column on a range/hash distributed table.	2017-10-02 20:17:51 -04:00
Marco Slot	03bddcbfab	Merge pull request #1681 from citusdata/fix_metadata_cache Invalidate worker and group ID cache in maintenance daemon	2017-10-02 18:29:30 +02:00
Marco Slot	394918f9d0	Invalidate worker and group ID cache in maintenance daemon	2017-10-02 18:14:29 +02:00
Burak Yücesoy	020c21df64	Merge pull request #1678 from citusdata/add_changelog_entry_for_702 Add CHANGELOG entry for 7.0.2	2017-09-28 10:27:36 -07:00
Burak Yucesoy	29a4f88b55	Add CHANGELOG entry for 7.0.2	2017-09-28 10:26:28 -07:00
Burak Yücesoy	45a1b9e590	Merge pull request #1677 from citusdata/add_changelog_entry_for_624 Add CHANGELOG entry for 6.2.4	2017-09-28 10:25:37 -07:00
Burak Yucesoy	05678c3a3b	Add CHANGELOG entry for 6.2.4	2017-09-28 10:24:48 -07:00
Burak Yücesoy	76d05050bd	Merge pull request #1676 from citusdata/add_changelog_entry_for_613 Add CHANGELOG entry for 6.1.3	2017-09-28 10:16:03 -07:00
Burak Yucesoy	ba31e35791	Add CHANGELOG entry for 6.1.3	2017-09-28 10:07:31 -07:00
Marco Slot	632d0c675a	Merge pull request #1672 from citusdata/task_tracker_superuser Execute transmit commands as extension owner during task-tracker queries	2017-09-28 06:43:26 -07:00
Marco Slot	bb50fc9cb5	Add multi-user re-partitioning regression tests	2017-09-28 15:27:26 +02:00
Marco Slot	43d5e79eaa	Execute transmit commands as superuser during task-tracker queries	2017-09-28 15:27:25 +02:00
Marco Slot	306c58d59b	Check for absolute paths in COPY with format transmit	2017-09-28 15:27:11 +02:00
Marco Slot	cb6b0e820c	Allow read-only users to run task-tracker queries	2017-09-28 13:52:36 +02:00
Marco Slot	8483d6213b	Merge pull request #1667 from citusdata/recovery_index_fix Use unique constraint index for transaction record deletion	2017-09-28 03:20:18 -07:00
Marco Slot	da6b42a3e2	Use unique constraint index for transaction record deletion	2017-09-28 12:04:56 +02:00
Önder Kalacı	b2d42a0595	Merge pull request #1670 from citusdata/remove_unnecessary_locks_in_graph Skip relation extension locks on distributed deadlock detection	2017-09-28 10:30:49 +03:00
Onder Kalaci	68ca8cb7f0	Skip relation extension locks We should skip a process that is blocked on a relation extension lock, since those locks are held for a short duration while the relation is actually extended on the disk and are released as soon as the extension is done. Thus, recording such waits on our lock graphs could lead to detecting false distributed deadlocks.	2017-09-28 10:09:09 +03:00
Murat Tuncer	4676c4f7a5	Prevent crash when remote transaction start fails (#1662 ) We send multiple commands to the worker when starting a transaction. Previously we only checked the result of the first command, the transaction 'BEGIN', which always succeeds. Failures in the following commands were not checked. With this commit, we make sure all command results are checked. If there is any error we report the first error found.	2017-09-26 17:25:46 -07:00
Jason Petersen	a8428dff01	Merge pull request #1633 from citusdata/fix_pg_11 Get PostgreSQL 11 build passing cr: @pykello	2017-09-26 11:44:17 -07:00
Jason Petersen	b4d53423fa	Add adapter functions for OpenFile changes	2017-09-25 17:20:24 -07:00
Jason Petersen	d686123dae	Omit now-public Explain methods from PG11 build This copy-pasted code is no longer needed in PG11.	2017-09-25 17:20:24 -07:00
Jason Petersen	89d02c6115	Add ruleutils file for PostgreSQL 11	2017-09-25 17:20:24 -07:00
Jason Petersen	bbc15e0598	Handle HASHPROC changes PostgreSQL 11 now has "standard" and "extended" (64-bit) versions of hash functions.	2017-09-25 17:20:24 -07:00
Jason Petersen	b4474fc0b0	Modify version-output tests for PostgreSQL 11 Basically we just care whether the running version is before or after PostgreSQL 10, so testing the major version against 9 and printing a boolean is sufficient.	2017-09-25 17:20:24 -07:00
Jason Petersen	6c9b19a954	Add version-compat header For polyfill macros, etc.	2017-09-25 17:20:23 -07:00
Jason Petersen	fbeaa2f9d0	Remove direct access to tupleDesc->attrs A level of indirection was removed from this field for PostgreSQL 11. By using the handy provided macro, we can be version agnostic.	2017-09-25 17:20:23 -07:00
Jason Petersen	6a020b5adc	Update CopyGetAttnums with latest from PostgreSQL This function was recently modified to use the TupleDescAttr wrapper, which abstracts away recent changes to TupleDesc.	2017-09-25 17:20:23 -07:00
Marco Slot	91d9b41822	Merge pull request #1591 from citusdata/fix-shard-cache-inval Fix possible shard cache incoherency.	2017-09-25 17:01:03 -07:00
Andres Freund	78716e5546	Fix possible shard cache incoherency. When a table and its shards are dropped, and afterwards the same shard identifiers are reused, e.g. due to a DROP & CREATE EXTENSION, the old entry in the shard cache and the required entry in the shard cache might be for different tables. Force invalidation for both old and new table to fix.	2017-09-25 13:05:09 -07:00
Jason Petersen	41350fdd54	Remove obsolete lines	2017-09-25 11:18:25 -07:00
Burak Velioglu	24dc95ee98	Merge pull request #1580 from citusdata/change_error_message_of_local_dist Add error detail if query contains both local and distributed tables	2017-09-25 10:05:37 -07:00
velioglu	0a56ed910b	Change error message of queries with distributed and local table Citus can handle INSERT INTO ... SELECT queries if the query inserts into local table by reading data from distributed table. The opposite way is not correct. With this commit we warn the user if the latter option is used.	2017-09-22 13:46:19 -07:00
Önder Kalacı	fb70400b61	Merge pull request #1661 from citusdata/fix_crash Trim trailing characters & copy the message returned by `PQerrorMessage()`	2017-09-22 23:10:56 +03:00
Onder Kalaci	867224bdd7	Make the tests produce more consistent outputs	2017-09-22 20:38:56 +03:00
Onder Kalaci	4782f9f98a	Properly copy and trim the error messages that come from pg_conn When a NULL connection is provided to PQerrorMessage(), the returned error message is a static text. Modifying that static text, which is not necessarily in writable memory, is dangerous and might cause a segfault.	2017-09-22 19:43:09 +03:00
Onder Kalaci	6736fd1682	Remove two obsolete functions Namely GetConnectionFromPGconn() and CloseConnectionByPGconn()	2017-09-21 00:36:23 -06:00
Önder Kalacı	9901017f3f	Merge pull request #1652 from citusdata/fix_create_schema Ensure schema exists on reference table creation	2017-09-19 00:12:21 +03:00
Onder Kalaci	33ec33c5b3	Ensure schema exists on reference table creation If the schema doesn't exist on the workers, create it.	2017-09-18 23:50:47 +03:00
Önder Kalacı	c6ec49312c	Merge pull request #1653 from citusdata/fix_group_by Allow pushing down GROUP BYs when at least there is one distribution	2017-09-15 19:38:15 +03:00
Onder Kalaci	6116c8e93d	Allow pushing down GROUP BYs when at least there is one distribution column in the target list	2017-09-15 19:15:06 +03:00
Önder Kalacı	6c9dffccbf	Merge pull request #1628 from citusdata/expand_subquery_reference_table Expand subquery pushdown for reference tables	2017-09-15 00:04:11 +03:00
Onder Kalaci	a5b66912d4	Expand reference table support in subquery pushdown With this commit, we relax the restrictions put on the reference tables with subquery pushdown. We made three notable improvements: 1) Relax equi-join restrictions Previously, we always expected that the non-reference tables are equi joined with reference tables on the partition key of the non-reference table. With this commit, we allow any column of non-reference tables joined using non-equi joins as well. 2) Relax OUTER JOIN restrictions Previously Citus errored out if any reference table exists at any point of the outer part of an outer join. For instance, see the below sketch where (h) denotes a hash distributed relation, (r) denotes a reference table, (L) denotes LEFT JOIN and (I) denotes INNER JOIN. (L) / \ (I) h / \ r h Before this commit Citus would error out since a reference table appears on the leftmost part of a left join. However, that was too restrictive, so we now only error out if the reference table is directly below and in the outer part of an outer join. 3) Bug fixes We've done some minor bugfixes in the existing implementation.	2017-09-14 20:59:22 +03:00
Burak Yucesoy	18b9be3dfa	Add CHANGELOG entry for 7.0.1 release	2017-09-12 17:52:09 -06:00
Marco Slot	c2f4eaa281	Merge pull request #1648 from citusdata/fix_put_copy_data Wait for I/O to finish after PQputCopyData	2017-09-12 16:35:53 -07:00
Marco Slot	d1befa4df9	Wait for I/O to finish after PQputCopyData	2017-09-12 16:18:42 -07:00
Marco Slot	f5361d52e7	Merge pull request #1643 from citusdata/fix_insert_select_memory Free per-tuple COPY memory in INSERT...SELECT via coordinator	2017-09-12 15:49:04 -07:00
Marco Slot	cbe16169b4	Free per-tuple COPY memory in INSERT...SELECT	2017-09-12 15:35:53 -07:00
Marco Slot	b0df3a6746	Merge pull request #1645 from citusdata/fix_prepared_memory Copy MultiPlan to avoid reuse across prepared statements	2017-09-12 14:48:05 -07:00
Marco Slot	27da0a29d7	Add volatile function in prepared statement regression test	2017-09-12 13:09:31 -07:00
Marco Slot	5fe0845d7e	Always copy MultiPlan in GetMultiPlan	2017-09-12 11:38:52 -07:00
Jason Petersen	e29ebe57fd	Merge pull request #1632 from citusdata/update_copied_code Add latest PostgreSQL changes to copy-pasted code cr: @mtuncer	2017-09-08 14:59:14 -06:00
Jason Petersen	8b2c3fcc15	Add clarifying comment to RngVarCallbackForDropIdx We don't need the PARTITION-related logic recently added in PostgreSQL.	2017-09-01 15:57:30 -06:00
Jason Petersen	ec30ad38ba	Update ruleutils_10 with latest PostgreSQL changes See: postgres/postgres@21d304dfed postgres/postgres@bb5d6e80b1 postgres/postgres@d363d42bb9 postgres/postgres@eb145fdfea postgres/postgres@decb08ebdf postgres/postgres@a3ca72ae9a postgres/postgres@bc2d716ad0 postgres/postgres@382ceffdf7 postgres/postgres@c7b8998ebb postgres/postgres@e3860ffa4d postgres/postgres@76a3df6e5e	2017-09-01 14:26:59 -06:00
Jason Petersen	ebecde8f6e	Update ruleutils_96 with latest PostgreSQL changes See: postgres/postgres@41ada83774 postgres/postgres@3b0c2dbed0 postgres/postgres@ff2d537223	2017-09-01 14:26:53 -06:00
Jason Petersen	0e134a9178	Add PG11/master build, bump tools (#1588 ) This build is allowed to fail and finish-fast is enabled, so there is no negative impact on developers, yet we can now stay better abreast of upcoming PostgreSQL changes. The latest citus tools version also adds enable-depend to the flags in our "custom PG" source-based builds which will result in fewer false failures due to build caching behavior.	2017-08-30 18:17:28 -06:00
Burak Yücesoy	56f98c7300	Merge pull request #1630 from citusdata/bump_citus_version Bump Citus version	2017-08-29 15:13:48 +03:00
Burak Yucesoy	273b034720	Bump Citus version	2017-08-28 17:56:39 +03:00
Burak Yücesoy	b485da9fb1	Merge pull request #1629 from citusdata/add_7.0_changelog_entry Add CHANGELOG entry for 7.0 release	2017-08-28 16:33:12 +03:00
Burak Yucesoy	9e89eaa57e	Add CHANGELOG entry for 7.0 release	2017-08-28 16:19:24 +03:00
Marco Slot	c68bd7efa7	Merge pull request #1621 from citusdata/multi_row_insert_defaults Allow default columns in multi-row INSERTs	2017-08-25 11:32:43 +02:00
Marco Slot	0aadbb1760	Convert multi-row INSERT target list to Vars	2017-08-25 10:55:56 +02:00
Marco Slot	1920390688	Multi-row INSERTs no longer throw errors in isolation tests	2017-08-25 10:55:56 +02:00
Marco Slot	ae00795dab	Allow default columns in multi-row INSERTs	2017-08-25 10:55:56 +02:00
Joe Nelson	a658f5ecda	Two more libs I needed to build citus	2017-08-24 13:04:35 -06:00
Marco Slot	b4cc8939fc	Merge pull request #1613 from citusdata/fix_ref_table_multi_row_returning Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:56:44 +02:00
Marco Slot	c97692f382	Fix multi-row INSERT with RETURNING on reference tables	2017-08-24 10:42:12 +02:00
Marco Slot	7ce2308dc1	Merge pull request #1616 from citusdata/deadlock_detection_warning Don't error out if deadlock detection fails to connect to worker	2017-08-24 10:31:15 +02:00
Marco Slot	dbf18df995	Don't error out if BuildGlobalWaitGraph fails to connect	2017-08-23 19:08:03 +02:00
Burak Yücesoy	7e59c0b019	Merge pull request #1602 from citusdata/add_isolation_tests Increase coverage of isolation tests - Part 2	2017-08-23 19:44:23 +03:00
Burak Yucesoy	5be6eb9ef6	Increase coverage of isolation tests - Part 2 With this PR we add isolation tests for: COPY to reference table vs. other operations; COPY to partitioned table vs. other operations; multi-row INSERTs vs. other operations; INSERT/SELECT vs. other operations; UPSERT vs. other operations; DELETE vs. other operations; TRUNCATE vs. other operations; DROP vs. other operations; DDL vs. other operations. Other operations consist of basic SQL operations (like SELECT, INSERT, DELETE, UPSERT, COPY, TRUNCATE, CREATE INDEX) as well as some Citus functionalities (like master_modify_multiple_shards, master_apply_delete_command, citus_total_relation_size etc.)	2017-08-23 18:23:36 +03:00
Önder Kalacı	75491b9262	Merge pull request #1612 from citusdata/fix_dead_process Prevent maintenance daemon crashes due to dead processes	2017-08-23 15:56:24 +03:00
Onder Kalaci	c7bb29b69e	Prevent maintenance daemon crashes due to dead processes If, after the distributed deadlock detection decides to cancel a backend, the backend has been terminated/killed/cancelled externally, we might access a NULL pointer. This commit prevents that case by ignoring the current distributed deadlock.	2017-08-23 15:44:09 +03:00
Marco Slot	46f81d5531	Merge pull request #1607 from citusdata/remove_source_dump_local Remove source node argument from dump_local_wait_edges	2017-08-23 13:26:06 +02:00
Marco Slot	641420d79f	Remove source node argument from dump_local_wait_edges	2017-08-23 13:14:00 +02:00
Marco Slot	a67d10957f	Merge pull request #1600 from citusdata/fix_multi_row_returning Add alias for target in multi-row INSERTs	2017-08-23 11:00:27 +02:00
Jason Petersen	8cb69e3a14	Add alias for target in multi-row INSERTs This is necessary for multi-row INSERTs for the same reasons we use it in e.g. UPSERTs: if the range table list has more than one entry, then PostgreSQL's deparse logic requires that vars be prefixed by the name of their corresponding range table entry. This of course doesn't affect single-row INSERTs, but since multi-row INSERTs have a VALUE RTE, they were affected. The piece of ruleutils which builds range table names wasn't modified to handle shard extension; instead UPSERT/INSERT INTO ... SELECT added an alias to the RTE. When present, this alias is favored. Doing the same in the multi-row INSERT case fixes RETURNING for such commands.	2017-08-23 10:24:00 +02:00
Marco Slot	ad1fbbe186	Merge pull request #1608 from citusdata/sequential_multi_row_insert Execute multi-row INSERTs sequentially	2017-08-23 10:17:30 +02:00
Marco Slot	4d7927b672	Execute multi-row INSERTs sequentially	2017-08-23 10:04:57 +02:00
Marco Slot	df6d56c1ed	Merge pull request #1606 from citusdata/fix_copy_dropped_columns Consider dropped columns that precede the partition column in COPY	2017-08-22 13:13:09 +02:00
Marco Slot	cf375d6a66	Consider dropped columns that precede the partition column in COPY	2017-08-22 13:02:35 +02:00
Marco Slot	15af3c5445	Merge pull request #1603 from citusdata/fix_lock_graph_allocs Avoid overflowing PROCStack in BuildWaitGraphForSourceNode	2017-08-22 09:20:48 +02:00
Marco Slot	bd6bf29983	Don't add procs multiple times in BuildWaitGraphForSourceNode	2017-08-21 16:48:30 +02:00
Önder Kalacı	734aeebc47	Merge pull request #1592 from citusdata/improve_maintanince_deamon Terminate bg worker on drop database	2017-08-18 16:38:25 +03:00
Onder Kalaci	6532b69873	Kill the maintenance daemon on DROP DATABASE	2017-08-18 16:03:08 +03:00
Metin Döşlü	b5109028bc	Merge pull request #1598 from citusdata/fix_no_shards_bug Fix a crash on zero-shard tables	2017-08-18 15:15:04 +03:00
Metin Doslu	0d052e9864	Fix a crash on zero-shard tables	2017-08-18 13:53:59 +03:00
Önder Kalacı	96391bea15	Merge pull request #1595 from citusdata/improve_deadlock_detection Improve deadlock detection	2017-08-18 13:28:21 +03:00
Önder Kalacı	b82f886ad3	Merge branch 'master' into improve_deadlock_detection	2017-08-18 13:07:18 +03:00
Marco Slot	2cc46f3a0c	Merge pull request #1584 from citusdata/fix_drop_extension Maintenance daemon ensures that the extension is valid	2017-08-18 11:32:44 +02:00
Marco Slot	7523753a73	Clear metadata OID cache prior to deadlock detection	2017-08-18 11:20:24 +02:00
Andres Freund	b936bde936	Take AccessShareLock on the extension prior to running deadlock detection	2017-08-18 11:20:24 +02:00
Onder Kalaci	20679c9e8b	Relax assertion on deadlock detection considering self deadlocks.	2017-08-18 11:16:38 +03:00
Onder Kalaci	550a5578d8	Skip deadlock detection on the workers Do not run distributed deadlock detection on the worker nodes to prevent erroneous decisions to kill the deadlocks.	2017-08-17 19:43:38 +03:00
Burak Yucesoy	0ddcd726c9	Merge pull request #1464 from citusdata/copy_copy_isolation_test	2017-08-17 17:47:42 +03:00
Burak Yucesoy	ae32d786cf	Add new isolation tests	2017-08-17 17:46:03 +03:00
Marco Slot	131baeda3d	Merge pull request #1585 from citusdata/exit_maintenanced Maintenance daemon dies peacefully when it gets lost finding itself	2017-08-17 09:11:27 +02:00
Marco Slot	1eca53ad40	Exit maintenanced on database crash	2017-08-16 18:29:44 +02:00
Marco Slot	a5d54382ef	Merge pull request #1577 from citusdata/follower_get_active_worker_nodes Return readable nodes in master_get_active_worker_nodes	2017-08-16 14:19:59 +02:00
Marco Slot	9e7b1fb858	Return readable nodes in master_get_active_worker_nodes	2017-08-16 11:28:47 +02:00
Hadi Moshayedi	e5fbcf37dd	Add Savepoint Support (#1539 ) This change adds support for SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT. When transaction connections are not established yet, savepoints are kept in a stack and sent to the worker when the connection is later established. After establishing connections, savepoint commands are sent as they arrive. This change fixes #1493 .	2017-08-15 13:02:28 -04:00
Önder Kalacı	dcabbc4a8e	Merge pull request #1559 from citusdata/fix_deamon_upgrade Add version check to the maintenance daemon	2017-08-15 19:24:18 +03:00
Onder Kalaci	205501532a	Add version check to the maintenance daemon We should prevent running the deadlock detection if there is a major version change. Otherwise, the daemon may access obsolete metadata catalog tables.	2017-08-15 18:47:13 +03:00
Marco Slot	0d71fcd8af	Merge pull request #1567 from citusdata/fix_2pc_issues Fix 2pc issues	2017-08-15 14:22:47 +02:00
Marco Slot	3ff46245b3	Make sure we don't use 2PC in copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	4614814de1	Enable 2PC for INSERT...SELECT via coordinator	2017-08-15 13:44:20 +02:00
Marco Slot	fa70089766	Enable 2PC during distributed table creation	2017-08-15 13:44:20 +02:00
Marco Slot	9232823070	Abort on failure on master connection during copy from worker	2017-08-15 13:44:20 +02:00
Marco Slot	df7723cde5	Should not commit on aborted non-critical connections	2017-08-15 13:44:20 +02:00
Burak Yücesoy	e14e5f0d25	Merge pull request #1566 from citusdata/switch_postgres_branch Switch to Postgres REL_STABLE_10 branch	2017-08-15 14:39:52 +03:00
Burak Yucesoy	cdcc5fdf65	Switch to Postgres REL_STABLE_10 branch PostgreSQL master branch is now stamped with 11devel and we should only use master branch if we want to test against PostgreSQL 11. For PostgreSQL 9.6 tests we should use REL9_6_STABLE and for PostgreSQL 10 we should use REL_10_STABLE. v0.6.4 tag in our tools repo addresses this problem. Apart from that we may want to add PostgreSQL 11 to our test matrix soon. v0.6.4 handles that too. We just need to add PostgreSQL 11 to our test matrix and stop erroring out if we are compiling Citus against PostgreSQL 11.	2017-08-15 14:25:29 +03:00
Eren Başak	f1b51d7bbe	Merge pull request #1551 from citusdata/fix_pg_worker_list_bug Fix pg_worker_list use-after-free bug	2017-08-14 19:28:50 +03:00
Eren Başak	77626c4238	Fix NULL nodeClusterString crash on pg_worker_list.conf migrations	2017-08-14 18:13:53 +03:00
Eren Başak	b3d2f9ba71	Fix pg_worker_list use-after-free bug This change fixes a use-after-free bug while renaming obsolete `pg_worker_list.conf` file, which causes Citus to crash during upgrade (or even extension creation) if `pg_worker_list.conf` exists.	2017-08-14 18:13:53 +03:00
Burak Yücesoy	b7e55e0c81	Merge pull request #1544 from citusdata/acquire_locks_for_partitioned_table_ops Acquire proper locks for partitioned table operations	2017-08-14 15:09:03 +03:00
Burak Yucesoy	45b273321f	Add tests for locking operations on partitioned tables	2017-08-14 14:55:45 +03:00
Burak Yucesoy	dfdfb44ebf	Acquire shard resource locks on parent tables while operating on partitions	2017-08-14 14:44:30 +03:00
Burak Yucesoy	a321e750c0	Acquire relation locks on partitions while operating on parent table	2017-08-14 14:44:30 +03:00
Burak Yucesoy	52b9e35d50	Add relationIdList field to the Job struct	2017-08-14 14:06:22 +03:00
Önder Kalacı	45957e5688	Merge pull request #1529 from citusdata/deadlock_detection_main Distributed Deadlock detection	2017-08-12 14:02:27 +03:00
Onder Kalaci	4f668ad38b	Make the test outputs consistent by using VACUUM ANALYZE on the tables.	2017-08-12 13:29:25 +03:00
Onder Kalaci	0ba2f9e4e4	Add regression tests for distributed deadlock detection	2017-08-12 13:29:25 +03:00
Onder Kalaci	5b48de7430	Improve deadlock detection for MX We added a new field to the transaction id that is set to true only for the transactions initialized on the coordinator. This is only useful for MX in order to distinguish the transaction that started the distributed transaction on the coordinator where we could have the same transactions' worker queries on the same node.	2017-08-12 13:28:37 +03:00
Onder Kalaci	59133415b0	Add logging infrastructure for distributed deadlock detection We added a new GUC citus.log_distributed_deadlock_detection which is off by default. When set to on, we log some debug messages related to the distributed deadlock to the server logs.	2017-08-12 13:28:37 +03:00
Onder Kalaci	e5d5bdff51	Enable distributed deadlock detection on the maintenance daemon With this commit, the maintenance daemon starts to check for distributed deadlocks. We also introduced a GUC variable (distributed_deadlock_detection_factor) whose value is multiplied with Postgres' deadlock_timeout. Setting it to -1 disables the distributed deadlock detection.	2017-08-12 13:28:37 +03:00
Onder Kalaci	66936053a0	Improve error messages when a backend is cancelled by deadlock detection We send SIGINT to a backend that is cancelled due to a deadlock. That approach results in a very confusing error message. With this commit we intercept the error messages and show a more meaningful error message to the user.	2017-08-12 13:28:37 +03:00
Onder Kalaci	be4fc45c03	Deprecate enable_deadlock_prevention flag We now have the necessary infrastructure for detecting distributed deadlocks. Thus, we don't need enable_deadlock_prevention, which is purely intended for preventing some forms of distributed deadlocks.	2017-08-12 13:28:37 +03:00
Onder Kalaci	a333c9f16c	Add infrastructure for distributed deadlock detection This commit adds all the necessary pieces to do the distributed deadlock detection. Each distributed transaction is already assigned a distributed transaction id, introduced with `3369f3486f`. The dependencies among the distributed transactions are gathered with `80ea233ec1`. With this commit, we implement a DFS (depth-first search) on the dependency graph and search for cycles. Finding a cycle reveals a distributed deadlock. Once we find the deadlock, we examine the path on which the cycle exists and cancel the youngest distributed transaction. Note that we're not yet enabling the deadlock detection by default with this commit.	2017-08-12 13:28:37 +03:00
Marco Slot	d19818de21	Merge pull request #1543 from citusdata/test-follower-cluster Add make target for testing follower clusters	2017-08-12 12:18:56 +02:00
Marco Slot	59e626d158	Add regression tests for follower clusters	2017-08-12 12:05:56 +02:00
Marco Slot	55992d4bc0	Disallow task-tracker queries on follower clusters	2017-08-12 11:47:31 +02:00
Marco Slot	c097bc9a01	Merge pull request #1503 from citusdata/fix_drop_create_deadlock Fix drop table - create_distributed_table deadlock	2017-08-11 12:46:51 +02:00
velioglu	100739f62a	Change citus subversion	2017-08-11 11:57:57 +03:00
Marco Slot	53584affa8	Fix locking in create_distributed_table	2017-08-11 11:34:33 +03:00
velioglu	7c65001e23	Do not delete row from colocation table within drop table	2017-08-11 11:34:33 +03:00
Burak Velioglu	c4eb6c5153	Merge pull request #1532 from citusdata/subquery_pushdown_on_reference_tables Subquery pushdown on reference tables	2017-08-11 10:32:22 +03:00
velioglu	b0efffae1c	Correct planner and add more tests	2017-08-11 10:16:13 +03:00
velioglu	7550b8ad52	Fix anchor shard id selection when reference table exists	2017-08-11 10:09:47 +03:00
velioglu	ceba81ce35	Move physical planner checks to logical planner	2017-08-11 10:09:47 +03:00
velioglu	0359d03530	Add set operation check for reference tables	2017-08-11 10:09:47 +03:00
velioglu	c4e3b8b5e1	Add planner changes and tests for subquery on reference tables	2017-08-11 10:09:47 +03:00
velioglu	45717dd013	Check equivalence on reference tables for subquery pushdown	2017-08-11 10:09:47 +03:00
Marco Slot	a6d40b8bc5	Merge pull request #1452 from citusdata/citus_create_restore_point Add citus_create_restore_point for taking distributed snapshots	2017-08-11 08:22:22 +02:00
Marco Slot	0ae265c436	Add citus_create_restore_point for distributed snapshots	2017-08-11 07:36:20 +02:00
Marco Slot	c4cd3a6e06	Merge pull request #1481 from citusdata/async_commit Wait for commit/abort/prepare results asynchronously	2017-08-11 00:17:14 +02:00
Marco Slot	fdff210ef7	Wait for commit/abort/prepare results asynchronously	2017-08-11 00:03:06 +02:00
Marco Slot	fca986f214	Add API for waiting for multiple connections	2017-08-11 00:03:06 +02:00
Brian Cloutier	9d93fb5551	Create citus.use_secondary_nodes GUC This GUC has two settings, 'always' and 'never'. When it's set to 'never' all behavior stays exactly as it was prior to this commit. When it's set to 'always' only SELECT queries are allowed to run, and only secondary nodes are used when processing those queries. Add some helper functions: - WorkerNodeIsSecondary(), checks the noderole of the worker node - WorkerNodeIsReadable(), returns whether we're currently allowed to read from this node - ActiveReadableNodeList(), some functions (namely, the ones on the SELECT path) don't require working with Primary Nodes. They should call this function instead of ActivePrimaryNodeList(), because the latter will error out in contexts where we're not allowed to write to nodes. - ActiveReadableNodeCount(), like the above, replaces ActivePrimaryNodeCount(). - EnsureModificationsCanRun(), error out if we're not currently allowed to run queries which modify data. (Either we're in read-only mode or use_secondary_nodes is set) Some parts of the code were switched over to use readable nodes instead of primary nodes: - Deadlock detection - DistributedTableSize, - the router, real-time, and task tracker executors - ShardPlacement resolution	2017-08-10 17:37:17 +03:00
Brian Cloutier	c854d51cd8	make multi_reference_table test more stable	2017-08-10 17:37:17 +03:00
Brian Cloutier	3fc87a7a29	Metadata sync also syncs nodes in other clusters	2017-08-10 16:55:55 +03:00
Brian Cloutier	0dee4f8418	Metadata sync syncs all nodes, not just primaries	2017-08-10 16:55:55 +03:00
Eren Başak	353d2db913	Merge pull request #1545 from citusdata/create_table_utility_udfs Define Utility Functions	2017-08-10 15:20:36 +03:00
Eren Başak	deb89cb9ce	Delete test_helper_functions.h	2017-08-10 14:00:44 +03:00
Eren Başak	f9470329e5	Remove test_helper_functions.h inclusions	2017-08-10 12:42:46 +03:00
Eren Başak	3061737712	Define Some Utility Functions This change declares two new functions: `master_update_table_statistics` updates the statistics of shards belonging to the given table as well as its colocated tables. `get_colocated_shard_array` returns the ids of colocated shards of a given shard.	2017-08-10 12:42:46 +03:00
Brian Cloutier	1961add6f9	Improve error message when there are no nodes for a placement	2017-08-10 12:38:51 +03:00
Jason Petersen	4e8d07c672	Merge pull request #1517 from citusdata/feature/multi_row_insert Enable multi-row INSERTs cr: @marcocitus	2017-08-10 01:25:13 -07:00
Jason Petersen	dee66e3959	Final review feedback	2017-08-10 01:10:09 -07:00
Jason Petersen	a578506718	Add multi-row isolation tests	2017-08-10 01:10:09 -07:00
Jason Petersen	addde54464	Add some tests	2017-08-10 00:32:46 -07:00
Jason Petersen	6a35c2937c	Enable multi-row INSERTs This is a pretty substantial refactoring of the existing modify path within the router executor and planner. In particular, we now hunt for all VALUES range table entries in INSERT statements and group the rows contained therein by shard identifier. These rows are stashed away for later in "ModifyRoute" elements. During deparse, the appropriate RTE is extracted from the Query and its values list is replaced by these rows before any SQL is generated. In this way, we can create multiple Tasks, but only one per shard, to piecemeal execute a multi-row INSERT. The execution of jobs containing such tasks now exclusively goes through the "multi-router executor" which was previously used for e.g. INSERT INTO ... SELECT. By piggybacking onto that executor, we participate in ongoing transactions, get rollback-ability, etc. In short order, the only remaining use of the "single modify" router executor will be for bare single-row INSERT statements (i.e. those not in a transaction). This change appropriately handles deferred pruning as well as master-evaluated functions.	2017-08-10 00:32:46 -07:00
Burak Velioglu	c66c62cd14	Merge pull request #1475 from citusdata/in-any-pruning-v2 Support for IN/=ANY pruning	2017-08-10 09:08:59 +03:00
velioglu	7e436c0277	Add bool expression to pruning instance with a function	2017-08-10 08:56:36 +03:00
Andres Freund	e8b793c454	Support for IN (const, list) and = ANY(const, b, c) pruning.	2017-08-10 08:56:36 +03:00
Önder Kalacı	4a09bb4948	Merge pull request #1516 from citusdata/improve_lw_locks Improve locking semantics for the backend management	2017-08-09 17:33:15 +03:00
Onder Kalaci	b5ea3ab6a3	Improve locking semantics for backend management We use the backend shared memory lock to prevent new backends from becoming part of a new distributed transaction, or an existing backend from leaving a distributed transaction, while we're reading all backends' data. The primary goal is to provide a consistent view of the current distributed transactions while doing the deadlock detection.	2017-08-09 17:17:12 +03:00
Brian Cloutier	2e0916e15a	Add master_add_secondary_node() UDF	2017-08-09 17:10:48 +03:00
Marco Slot	3f338a3fc6	Merge pull request #1407 from citusdata/fix_node_locking Fix pg_dist_node locking	2017-08-09 16:32:28 +04:00
Marco Slot	08ed6d8269	Prevent pg_dist_node changes during master_create_empty_shard	2017-08-09 14:22:09 +02:00
Murat Tuncer	5cb9466255	Rebase node metadata isolation tests	2017-08-09 14:22:09 +02:00
Marco Slot	3a0571e69b	Remove LockMetadataSnapshot	2017-08-09 14:09:54 +02:00
Marco Slot	ad0fdf57ca	Add add/remove node rollback isolation tests	2017-08-09 14:09:54 +02:00
Marco Slot	c2f8bafa05	Fix shard creation vs. pg_dist_node change locking	2017-08-09 14:09:54 +02:00
Marco Slot	868ee6be83	Fix and simplify pg_dist_node locking	2017-08-09 14:09:54 +02:00
Burak Yücesoy	01d8926228	Merge pull request #1509 from citusdata/create_distributed_partitions Add support for distributed partitioned tables	2017-08-09 14:27:10 +03:00
Burak Yucesoy	ab5f97861b	Add regression tests for distributed partitioned tables	2017-08-09 10:01:35 +03:00
Burak Yucesoy	8455d1a4ef	Ensure we are allowing partitioned tables at all appropriate places	2017-08-09 10:01:35 +03:00
Burak Yucesoy	2eee556738	Add distributed partitioned table support for COPY For partitioned tables, PostgreSQL opens the partitioned table and its partitions in BeginCopyFrom and it expects its caller to close those relations. However, we do not have quick access to opened relations and performing special operations for partitioned tables isn't necessary in coordinator node. Therefore before calling BeginCopyFrom, we change relkind of those partitioned tables to RELKIND_RELATION. This prevents PostgreSQL from opening their partitions as well.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	31f3221342	Add distributed partitioned table support to router plannable queries In standard_planner, PostgreSQL expands partitioned tables to their partitions and calls our restriction hook for each partition. It also, for some queries, skips the partitioned table itself completely. This behaviour makes it difficult to prune shards and decide whether a query is router plannable or not. To prevent this behaviour, we change the inh flag of partitioned tables to false in the query tree. In this case, PostgreSQL treats those partitioned tables as regular relations and does not expand them. This behaviour is in line with our expectations, because we do not want to treat partitioned tables differently on the coordinator. Although we are not entirely comfortable with modifying the query tree, other solutions to this problem are overly complicated.	2017-08-09 10:01:35 +03:00
Burak Yucesoy	fddf9b3fcc	Add distributed partitioned table support to distributed table creation With this PR, Citus starts to support all possible ways to create distributed partitioned tables. These are: - Distributing already created partitioning hierarchy - CREATE TABLE ... PARTITION OF a distributed_table - ALTER TABLE distributed_table ATTACH PARTITION non_distributed_table - ALTER TABLE distributed_table ATTACH PARTITION distributed_table We also support DETACHing partitions from partitioned tables and propagating TRUNCATE and DDL commands to distributed partitioned tables. This PR also refactors some parts of distributed table creation logic.	2017-08-09 10:01:35 +03:00
Metin Döşlü	a650aaa631	Merge pull request #1483 from citusdata/update_subquery_on_where_false Add support for router UPDATEs and DELETEs with subqueries and joins	2017-08-08 22:24:41 +03:00
Metin Doslu	b8a9e7c1bf	Add support for UPDATE/DELETE with subqueries	2017-08-08 21:35:08 +03:00
Marco Slot	518750bd51	Merge pull request #1519 from citusdata/connection_per_placement Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 20:47:16 +04:00
Marco Slot	d3e9746236	Avoid connections that accessed non-colocated placements in multi-shard commands	2017-08-08 18:32:34 +02:00
Brian Cloutier	7060ade6fe	GetNodeTuple returns NULL if node does not exist It never throws an error.	2017-08-08 13:12:06 +03:00
Brian Cloutier	a3e9bef685	All users of WorkerNodeHash take an AccessShareLock The metadata cache simulates a SELECT on pg_dist_node. Now the locks it takes also simulate that SELECT.	2017-08-08 13:12:06 +03:00
Brian Cloutier	5914c992e6	cluster management UDFs see nodes in different clusters - master_activate_node and master_disable_node correctly toggle isActive, without crashing - master_add_node rejects duplicate nodes, even if they're in different clusters - master_remove_node allows removing nodes in different clusters	2017-08-08 13:12:06 +03:00
Brian Cloutier	3151b52a0b	Add citus.cluster_name GUC - Nodes with a nodecluster which does not match citus.cluster_name are excluded from the metadata cache and never seen by another part of Citus.	2017-08-08 13:12:06 +03:00
Brian Cloutier	94947c0d54	Refactor: ReplicateShardToAllWorkers more explicitly locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	f87fefa323	Refactor: DistributedTableSize more explicitly only locks pg_dist_node	2017-08-08 13:12:06 +03:00
Brian Cloutier	3769381366	Fix inaccurate comment on SetNodeState	2017-08-08 13:12:06 +03:00
Brian Cloutier	bf197e9f0c	Add test for super-long cluster names	2017-08-08 11:18:31 +03:00
Brian Cloutier	fbecf48a03	Disallow adding primary nodes to non-default clusters	2017-08-08 11:18:31 +03:00
Brian Cloutier	5618e69386	Add pg_dist_node.nodecluster	2017-08-08 11:18:31 +03:00
Brian Cloutier	74ce4faab5	Make multi_cluster_management test more stable	2017-08-08 11:18:31 +03:00
Brian Cloutier	e7846ba7d1	Allow metadata sync functions on secondaries {start,stop}_metadata_sync_to_node now toggle the hasMetadata flag when run on secondaries but don't attempt to actually sync any metadata.	2017-08-07 18:46:51 +03:00
Marco Slot	7e4b2c1595	Merge pull request #1474 from citusdata/update_where_false Execute UPDATE/DELETE statements with 0 shards	2017-08-07 17:48:52 +04:00
Marco Slot	4cc7c36596	Simplify metadata lock acquisition for DML	2017-08-07 15:36:58 +02:00
Marco Slot	aa7ca81548	Execute UPDATE/DELETE statements with 0 shards	2017-08-07 15:36:58 +02:00
Marco Slot	3248f1a2b7	Merge pull request #1506 from citusdata/descend_function_evaluation Function evaluation descends into expression trees	2017-08-06 22:24:22 +04:00
Marco Slot	bac60bb64f	Function evaluation descends into expression trees	2017-08-06 19:53:05 +02:00
Brian Cloutier	37985de85e	master_disable_node no longer crashes when given a nonexistent node	2017-08-04 11:14:54 +03:00
Hadi Moshayedi	8229a64fe8	Remove distributed tables' dependency on distribution key columns. (#1527 ) This change removes distributed tables' dependency on distribution key columns. We already check that we cannot drop distribution key columns in ErrorIfUnsupportedAlterTableStmt() at multi_utility.c, so we don't need to have distributed table to distribution key column dependency to avoid dropping of distribution key column. Furthermore, having this dependency causes some warnings in pg_dump --schema-only (See #866), which are not desirable. This change also adds check to disallow drop of distribution keys when citus.enable_ddl_propagation is set to false. Regression tests are updated accordingly.	2017-08-03 10:07:04 -04:00
Murat Tuncer	fa18899cf9	Remove serialization/deserialization of multiplan node (#1477 ) Introduces copy functions for Citus MultiPlan nodes. Uses ExtensibleNode mechanism to store MultiPlan data. Drops serialization of MultiPlans.	2017-08-02 08:24:00 +03:00
Burak Yücesoy	f0275fe4ae	Merge pull request #1525 from citusdata/refactor_create_distributed_table Refactor distributed table creation logic	2017-07-31 12:08:18 +03:00
Burak Yucesoy	37b200a52e	Fix broken isolation tests We try to run our isolation tests in parallel as much as possible. In some of those isolation tests we used the same table name, which causes problems while running them in parallel. This commit changes table names in those tests to ensure tests can run in parallel.	2017-07-31 11:11:49 +03:00
Burak Yucesoy	7769f1d012	Refactor distributed table creation logic This commit is preparation for introducing distributed partitioned table support. We want to clean and refactor some code in distributed table creation logic so that we can handle partitioned tables in a more robust way.	2017-07-31 11:11:23 +03:00
Murat Tuncer	520d74b96d	Add a regression test for citus.max_task_string_size (#1524 )	2017-07-28 10:49:09 -07:00
Brian Cloutier	7d8bcb6a88	These tests sometimes deadlock on travis	2017-07-28 16:02:43 +03:00
Brian Cloutier	b20a086a8f	master_activate_node UDF also returns noderole	2017-07-28 16:02:43 +03:00
Murat Tuncer	26f020dc6e	Make maxTaskStringSize configurable (#1501 ) maxTaskStringSize determines the size of the worker query string. It was originally hard coded to a specific value. This has caused issues for some users. Since it determines the initial shared memory allocation, we did not want to set it to an arbitrary higher number. Instead, we made it configurable. This commit introduces a new GUC variable, max_task_string_size. Changes to this variable require a restart to take effect.	2017-07-27 11:39:12 -07:00
Önder Kalacı	6698ca8d9e	Merge pull request #1523 from citusdata/deadlock_detection_main Convert the global wait edges to adjacency lists	2017-07-27 20:14:49 +03:00
Onder Kalaci	6132d17481	Convert global wait edges to adjacency list In this commit, we add ability to convert global wait edges into adjacency list with the following format: [transactionId] = [transactionNode->waitsFor {list of waiting transaction nodes}]	2017-07-27 19:53:51 +03:00
Murat Tuncer	8729b7d55a	Use cstore_table_size function to determine cstore table size (#1521 ) pg_table_size/pg_relation_size variants always return 0 for cstore tables. We should be using cstore_table_size function for cstore_tables.	2017-07-27 09:02:07 -07:00
Brian Cloutier	32e16ffe02	Give isolation tester ability to see locks on workers	2017-07-26 18:43:04 +03:00
Eren Başak	db5206846e	Merge pull request #1508 from citusdata/progress_tracking Add Progress Tracking Infrastructure	2017-07-26 15:08:43 +03:00
Eren Başak	a12f1980de	Add Progress Tracking Infrastructure This change adds a general purpose infrastructure to log and monitor the progress of long running processes. It uses the `pg_stat_get_progress_info` infrastructure, introduced with PostgreSQL 9.6 and used for tracking `VACUUM` commands. This patch only handles the creation of a memory space in dynamic shared memory, putting its info in `pg_stat_get_progress_info`, fetching the progress monitors on demand and finalizing the progress tracking.	2017-07-26 14:12:15 +03:00
Marco Slot	d33cb7d832	Merge pull request #1495 from citusdata/dump_local_wait_edges Add functions to dump local and global wait edges	2017-07-25 17:05:27 +02:00
Marco Slot	80ea233ec1	Add function for dumping global wait edges	2017-07-25 16:52:32 +02:00
Marco Slot	81198a1d02	Add function for dumping local wait edges	2017-07-25 16:52:32 +02:00
Önder Kalacı	b04aa9bf85	Merge pull request #1514 from citusdata/fix_assigned_tx_bug Fix bug on error check for assigning distributed transaction id	2017-07-25 15:41:34 +03:00
Onder Kalaci	58faffa42b	Fix bug on error check for assigning distributed transaction id to a backend that has already been assigned a transaction.	2017-07-25 14:58:07 +03:00
Marco Slot	5923334114	Add transaction recovery regression tests	2017-07-24 20:44:38 +02:00
Marco Slot	3d7f79127d	Do not release locks in LogTransactionRecord	2017-07-24 20:44:38 +02:00
Brian Cloutier	88702ca58a	node_metadata takes out more sane locks - Never release locks - AddNodeMetadata takes ShareRowExclusiveLock so it'll conflict with the trigger which prevents multiple primary nodes. - ActivateNode and SetNodeState used to take AccessShareLock, but they modify the table so they should take RowExclusiveLock. - DeleteNodeRow and InsertNodeRow used to take AccessExclusiveLock but only need RowExclusiveLock.	2017-07-24 11:57:46 +03:00
Brian Cloutier	ec99f8f983	Add nodeRole column - master_add_node enforces that there is only one primary per group - there's also a trigger on pg_dist_node to prevent multiple primaries per group - functions in metadata cache only return primary nodes - Rename ActiveWorkerNodeList -> ActivePrimaryNodeList - Rename WorkerGetLive{Node->Group}Count() - Refactor WorkerGetRandomCandidateNode - master_remove_node only complains about active shard placements if the node being removed is a primary. - master_remove_node only deletes all reference table placements in the group if the node being removed is the primary. - Rename {Node->NodeGroup}HasShardPlacements, this reflects the behavior it already had. - Rename DeleteAllReferenceTablePlacementsFrom{Node->NodeGroup}. This also reflects the behavior it already had, but the new signature forces the caller to pass in a groupId - Rename {WorkerGetLiveGroup->ActivePrimaryNode}Count	2017-07-24 11:57:46 +03:00
Brian Cloutier	e6c375eb81	Tiny refactor to master_create_empty_shard	2017-07-24 11:57:46 +03:00
Brian Cloutier	ee270b65d7	make WorkerGetNodeWithName a static function	2017-07-24 11:57:46 +03:00
Marco Slot	fd3c007beb	Merge pull request #1504 from citusdata/recovery_record_entropy Add transaction number to 2PC identifiers	2017-07-21 18:29:52 +02:00
Marco Slot	601b17d544	Use distributed transaction number in 2PC identifiers	2017-07-21 17:36:33 +02:00
Marco Slot	18a6e478af	Fix typo in GetCurrentDistributedTransctionId	2017-07-21 17:36:33 +02:00
Önder Kalacı	165a4eb7cb	Merge pull request #1507 from citusdata/fix-10-build-partitioning Fix PG 10 build, UNBOUNDED partitions now have different syntax	2017-07-21 18:10:46 +03:00
Brian Cloutier	7f1343103e	Fix PG 10 build, UNBOUNDED partitions now have different syntax Update code and tests to match the changes made in pg's d363d42	2017-07-21 14:30:11 +03:00
Brian Cloutier	74dd5bb281	Fix crash when removing an inactive node	2017-07-20 18:55:40 +03:00
Hadi Moshayedi	953df34d22	Explicit switch/case fall-throughs to avoid compiler warnings. GCC 7 added `-Wimplicit-fallthrough` to warn for not explicitly specified switch/case fall-throughs. According to https://gcc.gnu.org/gcc-7/changes.html, to suppress that warning we could either use `__attribute__(fallthrough)`, which didn't seem to work for earlier GCC versions, or a `/* fallthrough */` comment just before the following `case`. Previously Citus code had the fall-through comments inside the brackets, which didn't seem to suppress the warning. Putting a `/* fallthrough */` comment outside the brackets and right before the `case` fixes the problem.	2017-07-19 11:41:59 -04:00
Önder Kalacı	b873e6fcc8	Merge pull request #1489 from citusdata/add_distributed_transaction_id Introduce distributed transaction ids	2017-07-18 15:14:17 +03:00
Onder Kalaci	3369f3486f	Introduce distributed transaction ids This commit adds distributed transaction id infrastructure in the scope of distributed deadlock detection. In general, the distributed transaction id consists of a tuple in the form of: `(databaseId, initiatorNodeIdentifier, transactionId, timestamp)`. Briefly, we add a shared memory block on each node, which holds some information per backend (i.e., an array `BackendData backends[MaxBackends]`). Later, on each coordinated transaction, Citus sends `SELECT assign_distributed_transaction_id()` right after `BEGIN`. For that backend on the worker, the distributed transaction id is set to the values assigned via the function call. The aim of the above is to correlate the transactions on the coordinator to the transactions on the worker nodes.	2017-07-18 15:01:42 +03:00
Burak Velioglu	0f4fa854cc	Merge pull request #1436 from citusdata/convert_create_distributed_table_to_new_api_with_commits Convert create distributed table to new api	2017-07-18 12:47:29 +03:00
velioglu	6ea15fbb25	Make create_distributed_table transactional	2017-07-18 12:35:40 +03:00
Marco Slot	fd72cca6c8	Use predictable placement IDs in regression test output	2017-07-17 13:44:29 +03:00
Önder Kalacı	d85fca6b57	Merge pull request #1494 from citusdata/fix_reg_test Fix regression test outputs due to recent Postgres 10 changes	2017-07-14 13:44:57 +03:00
Onder Kalaci	ce8edd88f7	Apply regression test changes that are due to PostgreSQL 10 changes that have recently changed	2017-07-14 13:22:12 +03:00
Burak Yucesoy	79c14f73fa	Add CHANGELOG entry for 6.2.3	2017-07-12 23:50:54 -06:00
Marco Slot	d164b4cd10	Merge pull request #1453 from citusdata/shard-placement-uses-groups Shard placements don't hardcode nodename and nodeport	2017-07-12 14:37:16 +02:00
Brian Cloutier	72d8d2429b	Add a test for upgrading shard placements	2017-07-12 14:18:27 +02:00
Brian Cloutier	ee4edc498f	Don't release locks early in metadata functions	2017-07-12 14:18:27 +02:00
Brian Cloutier	f40f03270a	Fix locking in ReadWorkerNodes()	2017-07-12 14:18:27 +02:00
Brian Cloutier	7ad95b53d2	Rename pg_dist_shard_placement -> pg_dist_placement Comes with a few changes: - Change the signature of some functions to accept groupid - InsertShardPlacementRow - DeleteShardPlacementRow - UpdateShardPlacementState - NodeHasActiveShardPlacements returns true if the group the node is a part of has any active shard placements - TupleToShardPlacement now returns ShardPlacements which have NULL nodeName and nodePort. - Populate (nodeName, nodePort) when creating ShardPlacements - Disallow removing a node if it contains any shard placements - DeleteAllReferenceTablePlacementsFromNode matches based on group. This doesn't change behavior for now (while there is only one node per group), but means in the future callers should be careful about calling it on a secondary node, it'll delete placements on the primary. - Create concept of a GroupShardPlacement, which represents an actual tuple in pg_dist_placement and is distinct from a ShardPlacement, which has been resolved to a specific node. In the future ShardPlacement should be renamed to NodeShardPlacement. - Create some triggers which allow existing code to continue to insert into and update pg_dist_shard_placement as if it still existed.	2017-07-12 14:17:31 +02:00
Brian Cloutier	fe53fd4a8e	Remove functions created just for unit testing These functions are holdovers from pg_shard and were created for unit testing c-level functions (like InsertShardPlacementRow) which our regression tests already test quite effectively. Removing because it makes refactoring the signatures of those c-level functions unnecessarily difficult. - create_healthy_local_shard_placement_row - update_shard_placement_row_state - delete_shard_placement_row	2017-07-12 14:16:24 +02:00
Brian Cloutier	0b64bb1092	Fix typo in comment in CachedRelationLookup	2017-07-12 14:16:24 +02:00
Brian Cloutier	385d9cbbb7	Ignore generated multi_behavioral_analytics_create_table test files	2017-07-12 14:16:24 +02:00
Brian Cloutier	7c8a6c9cee	Add vim swap files to .gitignore	2017-07-12 14:16:23 +02:00
Marco Slot	bf8377082c	Use consistent placement IDs in multi_modifying_xacts test	2017-07-12 14:16:23 +02:00
Marco Slot	0437fcd1b2	Merge pull request #1462 from citusdata/clean-regress-multi Remove unused line, @arguments was set but never used	2017-07-12 14:14:21 +02:00
Brian Cloutier	fd8c142530	Remove unused line, @arguments was set but never used	2017-07-12 13:46:27 +02:00
Marco Slot	04e2d64764	Merge pull request #1455 from citusdata/join_xact_modification_levels Rework connection API to remove transaction restrictions	2017-07-12 12:49:54 +02:00
Marco Slot	9f7e4769e2	Clarify placement connection error messages	2017-07-12 11:59:19 +02:00
Marco Slot	d3785b97c0	Remove XactModificationLevel distinction between DML and multi-shard	2017-07-12 11:59:19 +02:00
Marco Slot	710fe8666b	Use GetPlacementListConnection for router DML	2017-07-12 11:26:23 +02:00
Marco Slot	29f21fea59	Use GetPlacementListConnection for multi-shard commands	2017-07-12 11:26:22 +02:00
Marco Slot	01c9b1f921	Use GetPlacementListConnection for router SELECTs	2017-07-12 11:26:22 +02:00
Marco Slot	63676f5d65	Allow choosing a connection for multiple placements with GetPlacementListConnection	2017-07-12 11:26:22 +02:00
Jason Petersen	0d569b722e	Bump to the latest Citus Tools Gets the new uncrustify configuration.	2017-07-11 16:30:03 -06:00
Jason Petersen	9018e698ec	Indentation cleanup Uncrustify 0.65 appears to have changed some defaults, resulting in breakages for those of us who have already upgraded; Travis still uses Uncrustify 0.64, but these changes work with both versions (assuming appropriately updated config), so this should permit use of either version for the time being.	2017-07-11 15:59:28 -06:00
Jason Petersen	d896fe7995	Add some test outputs to gitignore These were bothering me.	2017-07-11 15:37:32 -06:00
Burak Yücesoy	d6d88efc2d	Merge pull request #1488 from citusdata/fix_conflicting_vacuum_insert Fix conflicting locks in VACUUM and INSERT	2017-07-10 16:00:59 +03:00
Burak Yucesoy	a15b3c6df2	Add tests for concurrent INSERT and VACUUM behaviour	2017-07-10 15:46:48 +03:00
Burak Yucesoy	c8b9e4011b	Remove LockRelationDistributionMetadata function	2017-07-10 15:46:37 +03:00
Burak Yucesoy	cb6070c720	Use ShareUpdateExclusiveLock instead of ShareLock in VACUUM Before this change, we used ShareLock to acquire a lock on distributed tables while running VACUUM. This makes VACUUM and INSERT block each other. With this change we changed the lock mode from ShareLock to ShareUpdateExclusiveLock, which does not conflict with the locks INSERT acquires.	2017-07-10 15:46:19 +03:00
Murat Tuncer	2a4eada150	Replace duplicate code and call check_functions_in_node (#1478 ) MasterIrreducibleExpressionWalker has code copied from the function check_functions_in_node(), which is available in PG 9.6+. Now that PG 9.5 support is dropped, we can remove the duplicate code and directly call check_functions_in_node().	2017-07-07 10:19:33 +03:00
Marco Slot	ea1fd9dd67	Merge pull request #1487 from citusdata/prepared_implicit_casts Evaluate implicit casts in prepared statements	2017-07-06 21:49:03 +02:00
Marco Slot	31debc96e3	Handle implicit casts in prepared INSERTs	2017-07-06 16:17:35 +02:00
Andres Freund	74a6bac8cc	Merge pull request #1473 from citusdata/fix/cancellation Fix issues around statement cancellation and connection closure.	2017-07-04 14:58:23 -07:00
Andres Freund	d76b093185	Add tests for statement cancellation.	2017-07-04 14:46:03 -07:00
Andres Freund	3461244539	Don't wait for statement completion when aborting coordinated transaction. Previously we used ForgetResults() in StartRemoteTransactionAbort() - that's problematic because there might still be an ongoing statement, and this causes us to wait for its completion. That e.g. happens when a statement running on the coordinator is cancelled.	2017-07-04 14:46:03 -07:00
Andres Freund	0d791f6740	Cancel statements when closing connection at transaction end. That's important because the currently running statement on a worker might continue to hold locks and consume resources, even after the connection is closed. Unfortunately postgres will only notice closed connections when reading from / writing to the network. That might only happen much later.	2017-07-04 14:46:03 -07:00
Andres Freund	be8677f926	Add NonblockingForgetResults(). This is very similar to ForgetResults() except that no network IO is performed. Primarily useful in error handling cases.	2017-07-04 14:46:03 -07:00
Andres Freund	24153fae5d	Add ShutdownConnection() which cancels statement before closing connection. That's primarily useful in error cases, where we want to make sure locks etc held by commands running on workers are released promptly.	2017-07-04 14:46:03 -07:00
Andres Freund	75a7ddea0d	Always use connections in non-blocking mode. Now that there's no blocking libpq callers left, default to using non-blocking mode in connection_management.c. This has two advantages: 1) Blockiness doesn't have to frequently be reset, simplifying code 2) Prevents accidental use of blocking libpq functions, since they'll frequently return 'need IO'	2017-07-04 14:46:03 -07:00
Andres Freund	90a2d13a64	Move multi_copy.c to interrupt aware libpq wrappers.	2017-07-04 14:46:03 -07:00
Andres Freund	21c25abbb1	Move multi_client_executor to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	ddb0651967	Move citus tools to interrupt aware libpq wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	c674bc8640	Add interrupt aware PQputCopy{End,Data} wrappers.	2017-07-04 12:38:52 -07:00
Andres Freund	b7f9679ccc	Move interrupt handling code from GetRemoteCommandResult to FinishConnectionIO. Nearby commits will add additional interrupt handling functions, this way we don't have to duplicate the code.	2017-07-04 12:38:52 -07:00
Andres Freund	ec0ed677e3	Fix copy & pasto in WARNING message.	2017-07-04 12:38:52 -07:00
Andres Freund	3dedeadb5e	Fix memory leak in RemoteFinalizedShardPlacementList().	2017-07-04 12:38:52 -07:00
Andres Freund	c161c2fbe3	Fix some trailing whitespace.	2017-07-04 12:38:52 -07:00
Burak Velioglu	cd8c547165	Merge pull request #1266 from citusdata/fix_shard_name Change shard_name UDF to include schema	2017-07-04 11:01:52 +03:00
Marco Slot	04fe3f03f6	Change implementation of shard_name UDF to get schema-qualified shard name	2017-07-04 10:49:40 +03:00
Marco Slot	58947e0dcf	Merge pull request #1469 from citusdata/move_insert_select Move INSERT ... SELECT planning logic into one place	2017-06-29 16:50:01 +02:00
Marco Slot	da47a03b18	Move INSERT ... SELECT planning logic into one place	2017-06-29 15:03:14 +02:00
Önder Kalacı	907274a58a	Merge pull request #1463 from citusdata/partition_distributed_table_utils Add some utility functions for partitioned tables	2017-06-28 09:50:30 +03:00
Onder Kalaci	5f3f1d75a3	Add some utility functions for partitioned tables This commit is intended to be a base for supporting declarative partitioning on distributed tables. Here we add the following utility functions and their unit tests: * Very basic functions including differentiating partitioned tables and partitions, listing the partitions * Generating the PARTITION BY (expr) and adding this to the DDL events of partitioned tables * Ability to generate text representations of the ranges for partitions * Ability to generate the `ALTER TABLE parent_table ATTACH PARTITION partition_table FOR VALUES value_range` * Ability to add shard ids to the above command using `worker_apply_inter_shard_ddl_command()` * Ability to generate `ALTER TABLE parent_table DETACH PARTITION`	2017-06-28 09:39:55 +03:00
Andres Freund	0dfc8e6693	Merge pull request #1470 from citusdata/remove_95 Remove 9.5 support.	2017-06-27 14:00:29 -07:00
Andres Freund	d25ccf9e00	Update CONTRIBUTING.md to default to 9.6. 9.5 was not supported anymore, and 10 is not released yet. So 9.6 seems appropriate for now.	2017-06-26 18:09:23 -07:00
Andres Freund	535416384c	Remove version check from pg_regress_multi.pl The check is not necessary anymore after `f59cf2b818`.	2017-06-26 18:07:43 -07:00
Andres Freund	9d7f33be2a	Remove 9.5 references from comments in schedule files. Replace with version-less reference, no point in repeating this for every release.	2017-06-26 18:04:32 -07:00
Andres Freund	bbef9f8650	fixup! Remove 9.5 specific C files.	2017-06-26 17:36:58 -07:00
Andres Freund	2dfd55070c	Remove 9.5 regression test output files.	2017-06-26 12:17:46 -07:00
Andres Freund	dc3997c3b8	Remove 9.5 related node wrappers. Now that all branches support the extensible node infrastructure, we don't need our wrappers anymore.	2017-06-26 08:46:32 -07:00
Andres Freund	b96ba9b490	Fix code only enabled for 9.5. There's still supporting wrappers used, a subsequent commit will remove those. This also removes the already unused tuplecount_t define.	2017-06-26 08:46:32 -07:00
Andres Freund	60c28ce7a6	Remove 9.5 specific C files.	2017-06-26 08:46:32 -07:00
Andres Freund	0ebdd26b58	Remove 9.5 support from configure and travis. This effectively disables 9.5 support, but all the supporting code is still there. Subsequent commits will remove that.	2017-06-26 08:46:32 -07:00
Jason Petersen	2204da19f0	Support PostgreSQL 10 (#1379 ) Adds support for PostgreSQL 10 by copying in the requisite ruleutils and updating all API usages to conform with changes in PostgreSQL 10. Most changes are fairly minor but they are numerous. One particular obstacle was the change in \d behavior in PostgreSQL 10's psql; I had to add SQL implementations (views, mostly) to mimic the pre-10 output.	2017-06-26 02:35:46 -06:00
Andres Freund	71a4e90e82	Merge pull request #1461 from citusdata/feature/maintenanced Add automatically started per-database Maintenance Worker	2017-06-23 12:12:31 -07:00
Andres Freund	4a3b2de4c5	Add some tests checking that maintenance daemon gets started. The 2nd database one is a bit slow, but also shows something important, so we might want to keep it?	2017-06-23 11:53:39 -07:00
Andres Freund	c3b7c5dc33	Introduce per-database maintenance process. This will be used for deadlock detection, prepared transaction recovery amongst others, but currently is just idling around.	2017-06-23 11:53:39 -07:00
Andres Freund	3483bb99eb	Minimal infrastructure for per-backend citus initialization.	2017-06-23 11:20:10 -07:00
Andres Freund	1691f780fd	Force cache invalidation machinery to be initialized earlier. Previously it was not guaranteed that invalidations were registered after creating the extension, only if the extension was used afterwards.	2017-06-23 11:20:10 -07:00
Andres Freund	f645dca593	Centralized metadata_cache cache variables into one struct, to avoid missing resets. E.g. extensionOwner was already missed.	2017-06-23 11:20:10 -07:00
Marco Slot	04e4b7d82a	Fix spuriously failing regression test	2017-06-23 10:06:15 +02:00
Marco Slot	dc799c6e5e	Merge pull request #1468 from citusdata/test_column_name Add weird column name test for create_distributed_table	2017-06-22 17:14:17 +02:00
Marco Slot	6cafbf9b66	Add weird column name to create_distributed_table test	2017-06-22 16:27:39 +02:00
Marco Slot	ce28b6af0d	Merge pull request #1402 from citusdata/local_insert_select Perform INSERT..SELECT via coordinator if it cannot be pushed down	2017-06-22 16:23:34 +02:00
Marco Slot	a6f42e4948	Clarify error message when copying NULL value into table	2017-06-22 15:48:24 +02:00
Marco Slot	2f8ac82660	Execute INSERT..SELECT via coordinator if it cannot be pushed down Add a second implementation of INSERT INTO distributed_table SELECT ... that is used if the query cannot be pushed down. The basic idea is to execute the SELECT query separately and pass the results into the distributed table using a CopyDestReceiver, which is also used for COPY and create_distributed_table. When planning the SELECT, we go through planner hooks again, which means the SELECT can also be a distributed query. EXPLAIN is supported, but EXPLAIN ANALYZE is not because preventing double execution was a lot more complicated in this case.	2017-06-22 15:46:30 +02:00
Marco Slot	2cd358ad1a	Support quoted column-names in COPY logic	2017-06-22 15:45:57 +02:00
Marco Slot	155db4d913	Simplify router planner call path	2017-06-22 15:45:57 +02:00
Murat Tuncer	0c4bf2d943	Remove fall back to select if poll is not available (#1466 ) poll is supported on all relevant systems, there is no need to have a fall back mechanism to use select()	2017-06-22 12:11:18 +03:00
Jason Petersen	294aeff2ed	Don't call PostProcessUtility for local commands It is intended only to aid in processing of distributed DDL commands, but as written could execute during local CREATE INDEX CONCURRENTLY commands.	2017-06-19 15:56:03 -06:00
Burak Yücesoy	0f284c9adf	Merge pull request #1416 from citusdata/test_cache Use cached PostgreSQL build to reduce testing time	2017-06-19 17:37:01 +03:00
Burak Yucesoy	f3109d9380	Use cached PostgreSQL build to reduce testing time With this PR, we started to cache custom compiled PostgreSQL builds. If there are no new commits to the related PostgreSQL branches, we will use already compiled binaries to reduce testing time.	2017-06-19 11:12:41 +03:00
Burak Velioglu	bfde7fcd5a	Merge pull request #1454 from citusdata/phase_out_execute_remote_query Phase out execute remote query and command	2017-06-16 11:48:12 +03:00
velioglu	a17ab6408a	Delete ExecuteRemoteCommand function	2017-06-15 17:11:19 +03:00
velioglu	173fe137af	Convert DropShardsFromWorker to the new connection API	2017-06-15 15:24:06 +03:00
velioglu	d7b68e5647	Convert TableDDLCommandList function to the new connection API	2017-06-14 17:29:58 +03:00
velioglu	0aa9572e18	Convert RemoteTableOwner function to the new connection API	2017-06-14 17:29:58 +03:00
velioglu	7fe29aad4c	Convert worker_fetch_foreign_file to new connection API	2017-06-14 17:29:58 +03:00
velioglu	43d2cdbd35	Convert DistributedTableSizeOnWorker function to new connection API	2017-06-14 17:29:58 +03:00
Marco Slot	802ff0db2f	Merge pull request #1435 from citusdata/unlogged_tables Support unlogged tables	2017-06-14 14:42:42 +02:00
Marco Slot	56876596d5	Add support for unlogged distributed tables	2017-06-14 13:50:00 +02:00
Burak Velioglu	296f41dfd0	Merge pull request #1448 from citusdata/fix_drop_shard_connection Use placement connection to drop shards instead of node connection	2017-06-14 14:34:10 +03:00
velioglu	a1ea29ec2b	Use placement connection to drop shards instead of node connection	2017-06-14 14:14:59 +03:00
Marco Slot	d3e0742b8d	Merge pull request #1441 from citusdata/remove_copy_xact_check Allow COPY after a multi-shard command	2017-06-09 14:19:06 +02:00
Marco Slot	70abfd29d2	Allow COPY after a multi-shard command This change removes the XactModificationLevel check at the start of COPY that was made redundant by consistently using GetPlacementConnection.	2017-06-09 13:54:58 +02:00
Jason Petersen	50501227e9	Add ORDER clause to subquery test missing it	2017-06-08 18:30:14 -06:00
Jason Petersen	cc190a4af9	Remove tracked files from gitignore Causes very hard-to-debug test failures.	2017-06-08 17:39:31 -06:00
Brian Cloutier	38fea7fe66	Add instructions for using citus_indent (#1434 )	2017-06-05 13:21:47 +03:00
Metin Döşlü	a7c94cb36d	Merge pull request #1431 from jmunsch/jmunsch_errmsg Update to errmsg for mixed location insert into	2017-06-02 10:48:49 +03:00
jmunsch	1647d17a14	Clarify error message for local and distributed query plans.	2017-06-01 11:52:49 -07:00
Jason Petersen	a1e44328b2	Add 6.2.2 CHANGELOG entry	2017-05-31 17:02:41 -06:00
Jason Petersen	eaa9dabad1	Add 6.1.2 CHANGELOG entry	2017-05-31 17:02:24 -06:00
Andres Freund	5d0db7a9dd	Merge pull request #1438 from citusdata/fix_shard_move_lock Don't take a table lock in ForeignConstraintGetReferencedTableId	2017-05-31 15:26:19 -07:00
Marco Slot	f1d804180b	Don't take a table lock in ForeignConstraintGetReferencedTableId	2017-05-31 11:15:21 +02:00
Önder Kalacı	0e3369a863	Merge pull request #1389 from citusdata/improve_subquery_reg_tests Improve subquery pushdown regression tests	2017-05-30 14:18:37 +03:00
Onder Kalaci	df494c0403	Improve subquery pushdown regression tests - Use native postgres function for composite key btree functions - Move explain tests to multi_explain.sql (get rid of .out _0.out files) - Get rid of input/output files for multi_subquery.sql by moving table creations - Update some comments	2017-05-30 14:05:15 +03:00
Jason Petersen	b072708802	Bump CHANGELOG for 6.2.1	2017-05-24 13:30:36 -06:00
Marco Slot	1910a93b9a	Merge pull request #1427 from citusdata/fix_version_checks Fix version checks	2017-05-24 19:04:26 +02:00
Burak Yucesoy	aff6a3dcc4	Add tests for version check	2017-05-24 17:39:25 +03:00
Burak Yucesoy	8c1bbf1417	Register cache invalidation callback before version checks With this commit we start to register InvalidateDistRelationCacheCallback function as cache invalidation callback function before version checks because during version checks we use cache to look up relation ids of some relations like pg_dist_relation or pg_dist_partition_logical_relid_index and we want to know about cache invalidation before accessing them.	2017-05-24 17:39:25 +03:00
Burak Yucesoy	c7bfa06cb9	Fix incorrect call to CheckInstalledVersion During version update, we indirectly called CheckInstalledVersion via CheckCitusVersions. This obviously fails because during version update a mismatch between the installed version and the binary version is expected. Thus, we remove that CheckCitusVersions call. We now only call CheckAvailableVersion.	2017-05-24 17:39:25 +03:00
Önder Kalacı	1ea96b626b	Merge pull request #1426 from citusdata/better_comment_for_tests Add comment to subquery regression test file	2017-05-22 11:12:20 +03:00
Önder Kalacı	757f5be858	Merge branch 'master' into better_comment_for_tests	2017-05-22 10:58:21 +03:00
Onder Kalaci	a5c12b968b	Add comment to the regression test file to prevent any misunderstandings about the usage of enable_router_execution GUC variable.	2017-05-22 10:39:32 +03:00
Burak Yücesoy	7d28423891	Merge pull request #1425 from citusdata/fix_aggressive_error_out Fix aggressive error out behavior	2017-05-21 23:35:34 -08:00
Burak Yucesoy	7a7c74cc87	Add tests for version checks	2017-05-22 09:53:29 +03:00
Burak Yucesoy	9fb15c439c	Add version checks to necessary UDFs	2017-05-22 09:53:29 +03:00
Burak Yucesoy	eea8c51e1f	Only error out on distributed queries when there is version mismatch Before this commit, we were erroring out on almost all queries if there was a version mismatch. With this commit, we error out only when the requested operation touches distributed tables. Normally we would need to use the distributed cache to understand whether a table is distributed or not. However, it is not safe to read our metadata tables when there is a version mismatch, thus it is not safe to create the distributed cache. Therefore, for this specific occasion, we read directly from the pg_dist_partition table. However, reading from the catalog is costly and we should avoid using this method in other places as much as possible.	2017-05-22 09:53:29 +03:00
Burak Yucesoy	acb0d23717	Fix crash during upgrade from 5.2 to 6.2 This commit fixes the problem where we incorrectly try to reach distributed table cache when the extension is not loaded completely. We tried to reach the cache because we wanted to get reference table information to activate the node. However it is actually not necessary to explicitly activate the nodes which come from master_initialize_node_metadata. Because it only runs during extension creation and at that time there are no reference tables and all nodes are considered as active.	2017-05-19 00:01:36 +03:00
Jason Petersen	cc45712144	Bump extension and configure PACKAGE versions Actually getting this done before the next dev cycle begins.	2017-05-17 15:25:30 -06:00
Burak Yucesoy	b94285c575	Update CHANGELOG for v6.2.0 CHANGELOG changes for 6.2 release	2017-05-16 22:28:53 -06:00
Jason Petersen	209e2c48ec	Merge pull request #1377 from citusdata/pg10_prep Make minor tweaks to prepare for PostgreSQL 10 cr: @anarazel	2017-05-16 11:44:39 -06:00
Jason Petersen	791cdd7648	Limit sequence SELECT to last_value Unbounded column output differs by version.	2017-05-16 11:05:34 -06:00
Jason Petersen	51137184d9	Suppress hash index warning Irrelevant to the test.	2017-05-16 11:05:34 -06:00
Jason Petersen	489aa73257	Add missing CCI call in metadata seq sync Be explicit about the fact that we've made a modification: we need subsequent commands to see this sequence.	2017-05-16 11:05:34 -06:00
Jason Petersen	97f8302c9c	Change version-sensitive tests to handle '10' Previously assumed period in version; this makes tests future-proof.	2017-05-16 11:05:34 -06:00
Jason Petersen	d6cccee5bc	Remove ALTER SEQUENCE from parallel groups Removing these has no side effect, and in the (current) PostgreSQL 10, an ERROR is printed during concurrent sequence modification.	2017-05-16 11:05:34 -06:00
Jason Petersen	db11324ac7	Add unambiguous ORDER BY clauses to many tests Queries which do not specify an order may arbitrarily change output across PostgreSQL versions.	2017-05-16 11:05:34 -06:00
Jason Petersen	b9bc3fdada	Update header comments to match new test names Just keeping these in sync with the actual file name.	2017-05-16 11:05:33 -06:00
Jason Petersen	9f4a33eee1	Rename very long test files In addition to not actually providing much information, these names can cause problems in PostgreSQL 10.	2017-05-16 11:05:33 -06:00
Jason Petersen	c9fa11b445	Use library and symbol name for bgw entry PostgreSQL 10 takes away the ability to directly assign a function pointer; the other approach (library and symbol name) is supported by all versions.	2017-05-16 11:05:33 -06:00
Jason Petersen	f86920f9d6	Add includes for missing standard headers We use symbols from each of these and were relying on them being included by other headers.	2017-05-16 11:05:33 -06:00
Jason Petersen	82b03d5cb6	Add explicit cast for argument to copyObject PostgreSQL 10 adds a call to typeof, if supported.	2017-05-16 11:05:33 -06:00
Burak Yücesoy	40ebd93be3	Merge pull request #1412 from citusdata/fix_schema_owner_name Send correct and quoted owner name while propagating schema creation	2017-05-15 06:11:08 -08:00
Burak Yucesoy	577ffb2bf2	Add tests for non-default schema owner	2017-05-15 16:49:37 +03:00
Burak Yucesoy	5a3a32d6df	Quote schema's owner name When we propagate the schema creation command to data nodes we add schema's owner name too. Before this patch, we did not quote the owner's name which causes problems with the names containing characters like '-'.	2017-05-15 16:26:32 +03:00
Burak Yucesoy	1b5560b2f7	Fix OwnerName function to work with schemas We incorrectly try to use relation cache to find particular schema's owner and when we cannot find the schema in the relation cache (i.e., always), we automatically used current user as the schema's owner. This means we always created schemas in the data nodes with current user. With this patch we started to use namespace cache to find schemas.	2017-05-15 16:26:32 +03:00
Önder Kalacı	a7c65a3ed8	Add 9.5 output file for isolation test (#1413) With this commit we add one additional regression test output file which has some output syntax differences from its 9.6 equivalent.	2017-05-15 15:27:37 +03:00
Jason Petersen	fb836ee7cc	Merge pull request #1343 from citusdata/test_custom_compiled_postgres Use custom compiled PostgreSQL in Travis for merge commits cr: @jasonmp85	2017-05-12 15:15:55 -06:00
Jason Petersen	05d42b01d3	Mark test failing in 9.5 as 'ignore' This test was added this morning, but is failing in 9.5.	2017-05-12 15:00:42 -06:00
Burak Yucesoy	75d58cbf94	Travis merge jobs use custom-compiled PostgreSQL With this commit, we start to use custom compiled PostgreSQL builds in Travis for merge commits. This allows us to run isolation tests and PostgreSQL's own regression tests along with our regression tests in Travis. Since manually compiling PostgreSQL takes more time and we also add new tests, we only enable running these tests on merge commits.	2017-05-12 15:00:42 -06:00
Önder Kalacı	3adbbdcdcb	Fix typo in the regression test (#1410)	2017-05-12 15:46:38 +03:00
Önder Kalacı	e0257aecd9	Accept invalidation messages before accessing the metadata cache (#1406) * Accept invalidation messages before accessing the metadata cache This commit is crucial to prevent stale metadata reads from the cache. Without this commit, some of the operations may use stale metadata which could end up with various bugs such as crashes, inconsistent/lost data etc. As an example, consider that a COPY operation is blocked on shard metadata lock. Another concurrent session updates the metadata and invalidates the cache. However, since Citus doesn't accept invalidations, COPY continues with the stale metadata once it acquires the lock. With this commit, we make sure that invalidation messages are accepted just before accessing the metadata cache, preventing any operation from using stale metadata. * Add isolation tests for placement changes and concurrent operations - add node with reference table vs COPY/insert/update/DDL - repair shard vs COPY/insert/update/DDL - repair shard vs repair shard	2017-05-12 12:32:35 +03:00
Marco Slot	94151c9aef	Merge pull request #1405 from citusdata/fix_copy_create_distributed_table Ensure all preceding writes are visible in data migration	2017-05-11 10:05:15 +02:00
Marco Slot	6f9e18de24	Ensure all preceding writes are visible in data migration	2017-05-11 09:42:12 +02:00
Önder Kalacı	3ec502b286	Add support for parametrized execution for subquery pushdown (#1356) Distributed query planning for subquery pushdown is done on the original query. This prevents the usage of external parameters on the execution. To overcome this, we manually replace the parameters on the original query.	2017-05-10 09:38:48 +03:00
Marco Slot	97334a9123	Merge pull request #1382 from citusdata/fix_drop_locking Fix locking in master_drop_all_shards / master_apply_delete_command	2017-05-08 18:07:03 +02:00
Metin Doslu	37026dc351	Add truncate first isolation tests	2017-05-08 17:26:55 +02:00
Marco Slot	a8f368fced	Fix locking in master_drop_all_shards / master_apply_delete_command	2017-05-08 17:26:55 +02:00
Jason Petersen	8e2dd3e1f3	Update CHANGELOG for v6.1.1	2017-05-04 17:49:57 -07:00
Jason Petersen	61fbcaeca7	Merge pull request #1360 from citusdata/fix_prepared_ddl Don't change parse tree of DDL commands cr: @anarazel, @jasonmp85	2017-05-04 13:44:46 -07:00
Marco Slot	853f07dd33	Don't change query tree of DDL commands	2017-05-04 21:34:28 +02:00
Jason Petersen	f0c6c47c4e	Fix CREATE SEQUENCE generation bug Apparently we've had a typo all this time causing us to pass the cache value for the start value.	2017-05-03 21:47:06 -07:00
Önder Kalacı	ef6d3587b6	Skip exhaustive test in CoPartitionedTables() if declared colocated (#1376) That's considerably cheaper.	2017-05-02 03:33:21 +03:00
Önder Kalacı	b74ed3c8e1	Subqueries in where -- updated (#1372) * Support for subqueries in WHERE clause This commit enables subqueries in WHERE clause to be pushed down by the subquery pushdown logic. The support covers: - Correlated subqueries with IN, NOT IN, EXISTS, NOT EXISTS, operator expressions such as (>, <, =, ALL, ANY etc.) - Non-correlated subqueries with (partition_key) IN (SELECT partition_key ..) (partition_key) =ANY (SELECT partition_key ...) Note that this commit heavily utilizes the attribute equivalence logic introduced in `1cb6a34ba8`. In general, this commit mostly adjusts the logical planner not to error out on the subqueries in WHERE clause. * Improve error checks for subquery pushdown and INSERT ... SELECT Since we allow subqueries in WHERE clause with the previous commit, we should apply the same limitations to those subqueries. With this commit, we do not iterate on each subquery one by one. Instead, we extract all the subqueries and apply the checks directly on those subqueries. The aim of this change is to (i) Simplify the code (ii) Make it close to the checks on INSERT .. SELECT code base. * Extend checks for unresolved parameters to include SubLinks With the presence of subqueries in where clause (i.e., SubPlans on the query) the existing way for checking unresolved parameters fails. The reason is that the parameters for SubPlans are kept on the parent plan not on the query itself (see primnodes.h for the details). With this commit, instead of checking SubPlans on the modified plans we start to use originalQuery, where SubLinks represent the subqueries in where clause. The unresolved parameters can be found on the SubLinks. * Apply code-review feedback * Remove unnecessary copying of shard interval list This commit removes unnecessary copying of shard interval list. Note that there is no copyObject function implemented for shard intervals.	2017-05-01 17:20:21 +03:00
Marco Slot	8dab40da69	Merge pull request #1357 from citusdata/fix_gitignore Add missing regression test output files to .gitignore	2017-04-28 19:13:26 -07:00
Marco Slot	dee34c24fd	Add missing regression test output files to .gitignore	2017-04-29 03:56:14 +02:00
Marco Slot	5ad8bd37f1	Merge pull request #1265 from citusdata/truncate_propagation Honour enable_ddl_propagation in truncate trigger	2017-04-28 18:47:52 -07:00
Marco Slot	8edba5f309	Honour enable_ddl_propagation in truncate trigger	2017-04-29 03:32:52 +02:00
Brian Cloutier	22e7aa9a4f	Fix crash in isolation tests - There was a crash when the table a shardid belonged to changed during a session. Instead of crashing (a failed assert) we now throw an error - Update the isolation test which was crashing to no longer exercise that code path - Add a regression test to check that the error is thrown	2017-04-29 04:25:26 +03:00
Önder Kalacı	ad5cd326a4	Subquery pushdown - main branch (#1323) * Enabling physical planner for subquery pushdown changes This commit applies the logic that exists in INSERT .. SELECT planning to the subquery pushdown changes. The main algorithm is followed as : - pick an anchor relation (i.e., target relation) - per each target shard interval - add the target shard interval's shard range as a restriction to the relations (if all relations joined on the partition keys) - Check whether the query is router plannable per target shard interval. - If router plannable, create a task * Add union support within the JOINS This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query JOIN (QN) ... In other words, we currently do NOT support the queries that are in the following form where union query is not JOINed with other relations/subqueries : .... (Q1 UNION Q2 UNION ...) as union_query .... * Subquery pushdown planner uses original query With this commit, we change the input to the logical planner for subquery pushdown. Before this commit, the planner was relying on the query tree that is transformed by the postgresql planner. After this commit, the planner uses the original query. The main motivation behind this change is to simplify deparsing of subqueries. * Enable top level subquery join queries This work enables - Top level subquery joins - Joins between subqueries and relations - Joins involving more than 2 range table entries A new regression test file is added to reflect enabled test cases * Add top level union support This commit adds support for UNION/UNION ALL subqueries that are in the following form: .... (Q1 UNION Q2 UNION ...) as union_query .... In other words, Citus allows top level unions to be wrapped into aggregation queries and/or simple projection queries that only select some fields from the lower level queries. 
* Disallow subqueries without a relation in the range table list for subquery pushdown This commit disallows subqueries without relation in the range table list. This commit is only applied for subquery pushdown. In other words, we do not add this limitation for single table re-partition subqueries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Disallow subqueries without a relation in the range table list for INSERT .. SELECT This commit disallows subqueries without relation in the range table list. This commit is only applied for INSERT.. SELECT queries. The reasoning behind this limitation is that if we allow pushing down such queries, the result would include (shardCount * expectedResults) where in a non distributed world the result would be (expectedResult) only. * Change behaviour of subquery pushdown flag (#1315) This commit changes the behaviour of the citus.subquery_pushdown flag. Before this commit, the flag is used to enable subquery pushdown logic. But, with this commit, that behaviour is enabled by default. In other words, the flag is now useless. We prefer to keep the flag since we don't want to break the backward compatibility. Also, we may consider using that flag for other purposes in the next commits. * Require subquery_pushdown when limit is used in subquery Using limit in subqueries may cause returning incorrect results. Therefore we allow limits in subqueries only if user explicitly set subquery_pushdown flag. * Evaluate expressions on the LIMIT clause (#1333) Since subquery pushdown uses the original query, the LIMIT and OFFSET clauses are not evaluated. However, logical optimizer expects these expressions to be already evaluated by the standard planner. This commit manually evaluates the functions on the logical planner for subquery pushdown. 
* Better format subquery regression tests (#1340) * Style fix for subquery pushdown regression tests With this commit we intended a more consistent style for the regression tests we've added in the - multi_subquery_union.sql - multi_subquery_complex_queries.sql - multi_subquery_behavioral_analytics.sql * Enable the tests that are temporarily commented This commit enables some of the regression tests that were commented out until all the development is done. * Fix merge conflicts (#1347) - Update regression tests to meet the changes in the regression test output. - Replace Ifs with Asserts given that the check is already done - Update shard pruning outputs * Add view regression tests for increased subquery coverage (#1348) - joins between views and tables - joins between views - union/union all queries involving views - views with limit - explain queries with view * Improve btree operators for the subquery tests This commit adds the missing comparison for subquery composite key btree comparator.	2017-04-29 04:09:48 +03:00
Andres Freund	0cc9171984	Merge pull request #1369 from citusdata/featurefix/better-range-pruning Improve / Fix range pruning	2017-04-28 17:45:58 -07:00
Andres Freund	90b211267d	Perform range based pruning if equality pruning has survivor. We previously dismissed this as unimportant, but it turns out to be very useful for the upcoming subquery pushdown, where a user might specify an equality constraint in a subquery, and the subquery pushdown machinery adds >= and <= restrictions on the shard boundary. Previously the latter restriction was ignored.	2017-04-28 17:35:18 -07:00
Andres Freund	6c08fe72f9	Use stricter qual for pruning if both >/< and >=/<= are present. Previously, if both <= and < (>= and > respectively) were specified, we always used the latter restriction. Instead use the stricter one.	2017-04-28 17:35:18 -07:00
Marco Slot	e029724227	Merge pull request #1368 from citusdata/fix_get_live_node_count Fix list length lookup in WorkerGetLiveNodeCount	2017-04-28 17:26:25 -07:00
Marco Slot	6e58067962	Fix list length lookup in WorkerGetLiveNodeCount	2017-04-29 02:13:20 +02:00
Marco Slot	022fc7bbcb	Merge pull request #1349 from citusdata/fix_check_vanilla Fix check-vanilla tests	2017-04-28 17:10:48 -07:00
Burak Yucesoy	6599677902	Fix check-vanilla tests It seems that GEQO optimizations, when it is set to on, create their own memory context and free it when it is no longer necessary. In join multi_join_restriction_hook we allocate our variables in the CurrentMemoryContext, which is GEQO's memory context if it is active. To prevent deallocation of our variables when GEQO's memory context is freed, we started to allocate memory for these variables in separate MemoryContext.	2017-04-29 01:55:18 +02:00
Marco Slot	b0fdd5963d	Merge pull request #1365 from citusdata/fix_size Check whether relation ID exists in DistributedTableSize	2017-04-28 16:51:41 -07:00
Marco Slot	0b579d027a	Check whether relation ID exists in citus_relation_size	2017-04-29 01:39:39 +02:00
Andres Freund	4094b45ba9	Merge pull request #1331 from citusdata/feature/faster-pruning Faster Shard Pruning Implementation	2017-04-28 15:01:41 -07:00
Andres Freund	d399f395f7	Faster shard pruning. So far citus used postgres' predicate proofing logic for shard pruning, except for INSERT and COPY which were already optimized for speed. That turns out to be too slow: * Shard pruning for SELECTs is currently O(#shards), because PruneShardList calls predicate_refuted_by() for every shard. Obviously using an O(N) type algorithm for general pruning isn't good. * predicate_refuted_by() is quite expensive in its own right. That's primarily because it's optimized for doing a single refutation proof, rather than performing the same proof over and over. * predicate_refuted_by() does not keep persistent state (see 2.) for function calls, which means that a lot of syscache lookups will be performed. That's particularly bad if the partitioning key is a composite key, because without a persistent FunctionCallInfo record_cmp() has to repeatedly look up the type definition of the composite key. That's quite expensive. Thus replace this with custom-code that works in two phases: 1) Search restrictions for constraints that can be pruned upon 2) Use those restrictions to search for matching shards in the most efficient manner available: a) Binary search / Hash Lookup in case of hash partitioned tables b) Binary search for equal clauses in case of range or append tables without overlapping shards. c) Binary search for inequality clauses, searching for both lower and upper boundaries, again in case of range or append tables without overlapping shards. d) exhaustive search testing each ShardInterval My measurements suggest that we are considerably, often orders of magnitude, faster than the previous solution, even if we have to fall back to exhaustive pruning.	2017-04-28 14:40:41 -07:00
Andres Freund	6bd2e3ed30	Add DistTableCacheEntry->hasOverlappingShardInterval. This determines whether it's possible to perform binary search on sortedShardIntervalArray or not. If e.g. two shards have overlapping ranges, that'd be prohibitive. That'll be useful in later commit introducing faster shard pruning.	2017-04-28 14:40:38 -07:00
Andres Freund	105483ec56	Add DistTableCacheEntry->shardValueCompareFunction. That's useful when comparing values a hash-partitioned table is filtered by. The existing shardIntervalCompareFunction is about comparing hashed values, not unhashed ones. The added btree opclass function is so we can get a comparator back. This should be changed much more widely, but is not necessary so far.	2017-04-28 14:40:38 -07:00
Andres Freund	52571c00ad	Build DistTableCacheEntry->shardIntervalCompareFunction even for 0 shards. Previously we, unnecessarily, used the first shard's type information to look up the comparison function. But that information is already available, so use it. That's helpful because we sometimes want to access the comparator function even if there are no shards.	2017-04-28 14:40:38 -07:00
Andres Freund	ba93d32c8a	Fix: Make FindShardIntervalIndex robust against 0 shards.	2017-04-28 14:40:38 -07:00
Metin Döşlü	59ecf9faa0	Merge pull request #1361 from citusdata/explain_with_savepoint Send explain queries with savepoints	2017-04-28 13:43:27 -07:00
Metin Doslu	b6659bec22	Send explain queries with savepoints With this commit, we started to send explain queries within a savepoint. After running explain query, we rollback to savepoint. This saves us from side effects of EXPLAIN ANALYZE on DML queries.	2017-04-28 12:13:48 -07:00
Jason Petersen	905ca98a9b	Merge pull request #1353 from citusdata/fix_copy_crasher Refactor COPY to not directly use cache entry cr: @marcocitus	2017-04-27 16:06:11 -06:00
Jason Petersen	93e3afc25c	Remove FastShardPruning method With the other simplifications, it doesn't make sense to keep around.	2017-04-27 13:32:36 -06:00
Jason Petersen	42ee7c05f5	Refactor FindShardInterval to use cacheEntry All callers fetch a cache entry and extract/compute arguments for the eventual FindShardInterval call, so it makes more sense to refactor into that function itself; this solves the use-after-free bug, too.	2017-04-27 13:32:36 -06:00
Andres Freund	d70312ddc1	Merge pull request #1351 from citusdata/feature/remove_pruning_debug Remove Pruning Debug Output	2017-04-26 11:58:52 -07:00
Andres Freund	1f93c325fa	Some cleanup in multi_subquery test. Remove trailing whitespace and use of EXPLAIN instead of EXPLAIN (COSTS OFF).	2017-04-26 11:33:56 -07:00
Andres Freund	b0585c7df6	Add back pruning coverage lost in last commit. Because we can't rely on the debugging message anymore, add a bunch of explain statements that roughly fulfill the same purpose.	2017-04-26 11:33:56 -07:00
Andres Freund	b7dfeb0bec	Boring regression test output adjustments. Soon shard pruning will be optimized not to generally work linearly anymore. Thus we can't print the pruned shard intervals as currently done anymore. The current printing of shard ids also prevents us from running tests in parallel, as otherwise shard ids aren't linearly numbered.	2017-04-26 11:33:56 -07:00
Andres Freund	e637fd802d	Merge pull request #1354 from citusdata/feature/faster-copartitioned-check Skip exhaustive test in CoPartitionedTables() if declared colocated.	2017-04-26 11:33:31 -07:00
Andres Freund	71a7f39b05	Skip exhaustive test in CoPartitionedTables() if declared colocated. That's considerably cheaper.	2017-04-26 11:19:17 -07:00
Andres Freund	1798d4648d	Merge pull request #1350 from citusdata/fix/vpath-builds Fix VPATH builds broken in `087d8427e3`.	2017-04-25 16:25:54 -07:00
Andres Freund	3c17746786	Fix VPATH builds broken in `087d8427e3`. 1) Generated files reside in the build directory, not the source directory. 2) As a generated file is now included in the build, add it to the include path (-I)	2017-04-25 16:04:42 -07:00
Marco Slot	7f9e80db10	Only process error if not NULL in StoreErrorMessage	2017-04-21 17:01:01 +02:00
Marco Slot	7faf4657b7	Use right sizeof in UpdateRelationColocationGroup	2017-04-21 16:37:09 +02:00
Burak Yücesoy	5fafde441d	Merge pull request #1294 from citusdata/fix_test_outputs_for_valgrind Prepare for valgrind automation	2017-04-21 05:51:14 -08:00
Burak Yucesoy	5de61ebf78	Configure valgrind command line arguments	2017-04-21 16:30:12 +03:00
Burak Yucesoy	d6cb88a73a	Stabilize test outputs	2017-04-21 16:08:52 +03:00
Eren Basak	abc84e6b2b	Add support for proper valgrind tests This change allows valgrind tests (`make check-multi-vg`) to be run seamlessly without test output errors and timeout problems.	2017-04-21 16:08:52 +03:00
Marco Slot	c8fec3be1b	Merge pull request #1302 from citusdata/serial_partition_column Support expressions in the partition column in INSERTs	2017-04-21 14:18:13 +02:00
Marco Slot	4ed093970a	Support expressions in the partition column in INSERTs	2017-04-21 14:05:52 +02:00
Burak Velioglu	701aaccd9c	Merge pull request #1292 from citusdata/alter_add_constraint_m Alter Table Add Constraint	2017-04-20 15:33:02 +03:00
velioglu	24d24db25c	Implement ALTER TABLE ADD CONSTRAINT command	2017-04-20 15:02:33 +03:00
Burak Velioglu	fbb6a47adf	Merge pull request #1316 from citusdata/add_guc_for_cross_shard Log cross-shard queries	2017-04-20 14:08:21 +03:00
velioglu	8cbef819be	Log message of across shard queries according to the log level	2017-04-20 12:24:46 +03:00
Burak Velioglu	0d987636a3	Merge pull request #1324 from citusdata/insert_into_select_wo_native Replace native hash function with worker_hash	2017-04-19 22:30:52 +03:00
velioglu	2327b63291	Change native hash function with worker_hash	2017-04-19 22:16:55 +03:00
Jason Petersen	eef4ed31cb	Merge pull request #1312 from citusdata/rename_support Enable distributed ALTER TABLE ... RENAME COLUMN cr: @byucesoy	2017-04-18 22:57:12 -06:00
Jason Petersen	5272c2c44b	Enable distributed ALTER TABLE ... RENAME COLUMN Pretty straightforward. Had some concerns about locking, but due to the fact that all distributed operations use either some level of deparsing or need to enumerate column names, they all block during any concurrent column renames (due to the AccessExclusive lock). In addition, I had some misgivings about permitting renames of the distribution column, but nothing bad comes from just allowing them. Finally, I tried to trigger any sort of error using prepared statements and could not trigger any errors not also exhibited by plain PostgreSQL tables.	2017-04-18 22:47:48 -06:00
Marco Slot	ecddb78815	Merge pull request #1208 from citusdata/remove_job_id_seq Stop using a sequence to generate job IDs	2017-04-18 12:02:07 +02:00
Marco Slot	3d99cdfcc7	Add basic read-only transaction tests	2017-04-18 11:42:33 +02:00
Marco Slot	f838c83809	Remove redundant pg_dist_jobid_seq restarts in tests	2017-04-18 11:42:32 +02:00
Marco Slot	40829c2ba9	Set citus.enable_unique_job_ids in tests with job ID in output	2017-04-18 11:42:32 +02:00
Marco Slot	dfd7d86948	Stop using a sequence to generate unique job IDs	2017-04-18 11:31:51 +02:00
Burak Yücesoy	be6dfaa596	Merge pull request #1332 from citusdata/set_isactive_to_true Set default value of isactive to true	2017-04-17 23:38:45 -08:00
Burak Yucesoy	00747dc8c9	Set default value of isactive to true With this change, we set the default value of the isactive column to true so that, for upgrading users, all nodes are marked as active and their environment does not break.	2017-04-18 09:40:44 +03:00
Burak Yücesoy	f5a406a23e	Merge pull request #1326 from citusdata/fix_node_copy_error Fix node copy error	2017-04-17 09:19:55 -08:00
Burak Yucesoy	1a56b99f13	Fix node copy error Instead of directly returning heap tuple obtained from heap scan we return copied version of it.	2017-04-17 19:38:18 +03:00
Marco Slot	acb84d9ca3	Merge pull request #1320 from citusdata/prepared_update_delete Support UPDATE/DELETE with parameterised partition column qual	2017-04-17 16:32:37 +02:00
Metin Doslu	4615100da5	Fix table name in prepared statement regression tests	2017-04-17 16:17:30 +02:00
Marco Slot	af0e462409	Support UPDATE/DELETE with parameterised partition column qual	2017-04-17 16:17:30 +02:00
Marco Slot	87426b95be	Merge pull request #1321 from citusdata/prepared_function_evaluation Support query parameters in combination with function evaluation	2017-04-17 16:16:27 +02:00
Marco Slot	5e58804d44	Support query parameters in combination with function evaluation	2017-04-17 15:40:55 +02:00
Marco Slot	dd75c5308f	Merge pull request #1232 from citusdata/fetch_faster Create indexes after worker_append_table_to_shard when copying a shard	2017-04-17 15:30:44 +02:00
Marco Slot	0bcc227a62	Create indexes after worker_append_table_to_shard during shard repair	2017-04-17 15:17:21 +02:00
Burak Yücesoy	24ee65a054	Merge pull request #1283 from citusdata/decouple_activate Decouple reference table replication	2017-04-17 02:48:15 -08:00
Burak Yucesoy	e9095e62ec	Decouple reference table replication With this change we add an option to add a node without replicating all reference tables to that node. If a node is added with this option, we mark the node as inactive and no queries will be sent to that node. We also added two new UDFs; - master_activate_node(host, port): - marks node as active and replicates all reference tables to that node - master_add_inactive_node(host, port): - only adds node to pg_dist_node	2017-04-17 13:33:31 +03:00
Burak Yücesoy	7097336972	Merge pull request #1309 from citusdata/fix_sql_function_returns_wrong_result Error out on parameterized SQL functions	2017-04-13 06:01:51 -08:00
Burak Yucesoy	7cfcb7d2f8	Error out on parameterized SQL functions Before this commit, we were erroring out for queries containing parameterized SQL functions like 'SELECT parameterized_sql_query(value)' as we should, however we were returning wrong results for queries like 'SELECT * FROM parameterized_sql_query(value)'. With this commit we started to error out on such queries too.	2017-04-13 16:36:24 +03:00
Önder Kalacı	b1363aa1d3	Merge pull request #1268 from citusdata/remove_adding_qual_prototype Replace adding the qual logic with attribute equivalence for INSERT ... SELECT	2017-04-13 13:48:14 +03:00
Onder Kalaci	1cb6a34ba8	Remove uninstantiated qual logic, use attribute equivalences In this PR, we aim to deduce whether each of the RTE_RELATION is joined with at least one other RTE_RELATION on their partition keys. If each RTE_RELATION follows the above rule, we can conclude that all RTE_RELATIONs are joined on their partition keys. In order to do that, we invented a new equivalence class namely: AttributeEquivalenceClass. In very simple words, an AttributeEquivalenceClass is identified by a unique id and consists of a list of AttributeEquivalenceMembers. Each AttributeEquivalenceMember is designed to identify attributes uniquely within the whole query. The necessity of this arises since varno attributes are defined within a single level of a query. Instead, here we want to identify each RTE_RELATION uniquely and try to find equality among each RTE_RELATION's partition key. Whenever we find an equality clause A = B, where both A and B originate from relation attributes (i.e., not random expressions), we create an AttributeEquivalenceClass to record this knowledge. If we later find another equivalence B = C, we create another AttributeEquivalenceClass. Finally, we can apply transitivity rules and generate a new AttributeEquivalenceClass which includes A, B and C. Note that equality among the members is identified by the varattno and rteIdentity. Each equality among RTE_RELATION is saved using an AttributeEquivalenceClass where each member attribute is identified by an AttributeEquivalenceMember. In the final step, we try to generate a common attribute equivalence class that holds as many AttributeEquivalenceMembers as possible whose attributes are partition keys.	2017-04-13 11:51:26 +03:00
Burak Velioglu	12860b1316	Merge pull request #1318 from citusdata/ltree_copy_branch Change checks with built-in type (omit ltree)	2017-04-11 14:50:19 +03:00
velioglu	19d0c66fa5	Change checks with built-in type	2017-04-11 14:41:37 +03:00
Burak Velioglu	cfc0992137	Merge pull request #1300 from citusdata/ltree_copy_branch Change copy format of ltree	2017-04-11 08:41:44 +03:00
velioglu	1fb11c738f	Check binary output function of type.	2017-04-10 16:28:09 +03:00
Jason Petersen	8c5d0f686b	Merge pull request #1277 from citusdata/error_out_incomplete_installation Error out if binary citus version does not match installed extension cr: @jasonmp85	2017-04-04 16:49:03 -06:00
Jason Petersen	8b4620ef16	Use RESET for GUC test, not reconnect More limited in what it does, better test.	2017-04-04 16:40:17 -06:00
Jason Petersen	7e46f41c12	Add comments, use strncmp, clean up GUC desc. Good to go!	2017-04-04 16:16:49 -06:00
Jason Petersen	033fda9183	Clean up remaining error messages Added details and hints, based off of similar PostgreSQL scenarios.	2017-04-04 16:11:59 -06:00
Jason Petersen	ef81b21a49	Clean up ErrorIfUnstableCreateOrAlterExtensionStmt Swaps an Assert in for an ereport, and adds details and hints to the error message to help users with a possibly confusing scenario.	2017-04-04 15:58:57 -06:00
Jason Petersen	ad3fbd9689	Refactor utility-skip/extn-check code This was getting pretty long and complex in the context of the main utility hook. Moved out the checks for what should skip Citus processing and what should have version checks performed.	2017-04-04 15:07:22 -06:00
Burak Yucesoy	a09614553f	Add enable_version_checks GUC and address feedback	2017-04-04 19:11:13 +03:00
Jason Petersen	1c2056ec74	Self-implemented review feedback The use of a bare src/ rather than $srcdir caused configure to fail during VPATH builds. With our additional dependency upon AWK, we need to call AC_PROG_AWK, otherwise environments may not have $AWK set. Finally, citus_version.h should be in .gitignore.	2017-04-03 22:55:12 -06:00
Burak Yucesoy	087d8427e3	Error out if binary citus version does not match installed extension With this change, we start to error out if loaded citus binaries do not match the available major version or installed citus extension version. In this case we force user to restart the server or run ALTER EXTENSION depending on the situation	2017-04-03 17:36:13 -06:00
Jason Petersen	bb5ae5eca4	Merge pull request #1287 from citusdata/support_concurrently Support (CREATE\|DROP) INDEX CONCURRENTLY cr: @metdos	2017-04-03 12:06:11 -06:00
Jason Petersen	4cdfc3a10f	Address review feedback Should just about do it.	2017-04-03 11:44:57 -06:00
Jason Petersen	cf775c4773	Improve CONCURRENTLY-related error messages Thought this looked slightly nicer than the default behavior. Changed preventTransaction to concurrent to be clearer that this code path presently affects CONCURRENTLY code only.	2017-04-03 11:19:15 -06:00
Jason Petersen	dd9365433e	Update documentation Ensure all functions have comments, etc.	2017-04-03 11:19:15 -06:00
Jason Petersen	d904e96c59	Address MX CONCURRENTLY problems Adds a non-transactional multi-command method to propagate DDLs to all MX/metadata-synced nodes.	2017-04-03 11:19:15 -06:00
Jason Petersen	32886e97a3	Add code to set index validity on failure Coordinator code marks the index as invalid as a base, sets it as valid in a transactional layer atop that base, then proceeds with worker commands. If a worker command has problems, the rollback results in an index with isvalid = false. If everything succeeds, the user sees a valid index.	2017-04-03 11:19:15 -06:00
Jason Petersen	dea6c44f75	Remove CONCURRENTLY checks, fix tests Still pending failure testing, which broke with my recent changes.	2017-04-03 11:19:15 -06:00
Jason Petersen	0b6c4e756e	Change DropStmt to generate worker DDL on master Because we can't execute DROP INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:15 -06:00
Jason Petersen	95d8d27c4f	Change IndexStmt to generate worker DDL on master Because we can't execute CREATE INDEX CONCURRENTLY during transactions, worker_apply_shard_ddl_command is insufficient.	2017-04-03 11:19:14 -06:00
Marco Slot	de034d2cab	Merge pull request #851 from citusdata/task_tracker_batching Batch task_tracker_status calls to reduce task-tracker query times	2017-04-03 12:17:13 +02:00
Marco Slot	0f355a4a48	Batch task_tracker_status calls to reduce task-tracker query times	2017-03-31 11:54:11 +02:00
Metin Döşlü	e25df3509c	Merge pull request #1299 from citusdata/support_trigger_all Add disable/enable trigger all support	2017-03-30 18:36:14 +02:00
Metin Doslu	54a277ff01	Add disable/enable trigger all support	2017-03-29 22:00:14 +03:00
Önder Kalacı	95e43eb256	Merge pull request #1261 from citusdata/fix_wrong_pushdown_properly Fix pushing down wrong queries for INSERT ... SELECT queries	2017-03-24 12:52:31 +02:00
Onder Kalaci	11665dbe3c	Fix pushing down wrong queries for INSERT ... SELECT queries Before this commit, in certain cases the router planner allowed pushing down JOINs that are not on the partition keys. With @anarazel's suggestion, we change the logic to use an uninstantiated parameter. Previously, the planner traversed the restriction information and, once it found the parameter, replaced it with the shard range. With this commit, instead of traversing the restrict infos, the planner explicitly checks for the equivalence of the relation partition key with the uninstantiated parameter. If it finds an equivalence, it adds the restrictions. In this way, we have more control over the queries that are pushed down.	2017-03-24 11:37:35 +02:00
Ozgun Erdogan	6fbd8e27c3	Merge pull request #1295 from citusdata/ozgune-readme-step-numbering Minor update README.md to fix the numbers in Docker installation steps.	2017-03-23 14:52:26 -07:00
Ozgun Erdogan	1285cabf28	Update README.md Updated the Local Cluster steps to include the right numbering.	2017-03-23 11:00:32 -07:00
Jason Petersen	dc8c12f8b0	Merge pull request #1278 from citusdata/master_ddl_first Execute DDL on coordinator before workers cr: @metdos, @anarazel	2017-03-22 17:45:06 -06:00
Jason Petersen	34a62abb7d	Address code review comments	2017-03-22 17:29:17 -06:00
Jason Petersen	d95b5bbad3	Rework ReplicateGrantStmt to use new flow This was the impetus for the previous commit that changed from using a DDLJob * to a List * of them.	2017-03-22 17:29:16 -06:00
Jason Petersen	23f5e4282d	Change DDLJob usage to be wrapped in lists To prepare for GRANT fixes.	2017-03-22 17:29:16 -06:00
Jason Petersen	42c799faee	Fix MX tests Missed some of these. One had a bad DDL statement to begin with (mixed up column type and column name) and other was just master/worker order.	2017-03-22 17:21:49 -06:00
Jason Petersen	f181b24859	Move worker execution to after master, fix tests Some tests relied on worker errors though local commands were invalid. Fixed those by ensuring preconditions were met to have command work correctly. Otherwise most test changes are related to slight changes in local/remote error ordering.	2017-03-22 17:21:49 -06:00
Jason Petersen	419a4c3745	Remove execution from stmt-specific util functions Now have a single Execute call in the main body.	2017-03-22 17:21:49 -06:00
Jason Petersen	a64165767d	Rename ProcessStmt functions to PlanStmt To reflect their new purpose planning a DDLJob rather than fully processing a distributed DDL statement.	2017-03-22 17:21:49 -06:00
Jason Petersen	a02a2a90c7	Refactor ExecuteDistDDLCommand to expect struct Will let us separate out the determination of what to execute from its actual execution.	2017-03-22 17:21:49 -06:00
Jason Petersen	2cb34406d1	Minor permissions test fix When running under Enterprise, some of the GRANT commands and whatnot are propagated. Guarding that section with a call to disable DDL prop. fixes everything.	2017-03-22 17:07:05 -06:00
Jason Petersen	823cd0dc98	Merge pull request #1281 from citusdata/fix_permission_check Fix access permission checks for distributed relations cr: @jasonmp85	2017-03-22 15:35:59 -06:00
Metin Doslu	404e32cdb4	Add basic permission checking tests	2017-03-22 15:25:00 -06:00
Metin Doslu	bcff6aa96c	Update regression tests for changing explain output	2017-03-22 15:25:00 -06:00
Metin Doslu	b1ee7ec93e	Fix access permission checks for distributed relations With this commit, we add the range table list of the original query to our custom plan. Therefore, PostgreSQL can check relations in the original query for access permissions and error out if the proper access is not granted.	2017-03-22 15:25:00 -06:00
Jason Petersen	1fa2a25695	Set tab size for GitHub display Hooray!	2017-03-22 13:03:39 -06:00
Murat Tuncer	4e5736a09d	Merge pull request #1142 from citusdata/bugfix/750_router_modify_errors Rephrase router modify errors	2017-03-16 14:52:25 +02:00
Murat Tuncer	c4734d7d94	Rephrase router modify errors The generic "distributed modifications must target exactly one shard" message is replaced by more context-aware error messages.	2017-03-16 15:09:10 +03:00
Burak Velioglu	729272aa53	Merge pull request #1267 from citusdata/add_udf_to_get_size_of_table Add udfs to get size of table	2017-03-16 15:02:11 +03:00
velioglu	e32aff1a26	Size UDFs implemented citus_table_size, citus_relation_size and citus_total_relation_size UDFs are implemented.	2017-03-16 13:50:30 +03:00
Metin Döşlü	d9c08c10f4	Merge pull request #1185 from citusdata/custom_plan Use CustomScan API for query execution	2017-03-14 11:39:16 +02:00
Metin Doslu	1f838199f8	Use CustomScan API for query execution Custom Scan is a node in the planned statement which helps external providers to abstract data scan not just for foreign data wrappers but also for regular relations, so you can benefit from your own version of caching or hardware optimizations. This sounds like only an abstraction on the data scan layer, but we can use it as an abstraction for our distributed queries. The only thing we need to do is to find distributable parts of the query, plan for them and replace them with a Citus Custom Scan. Then, whenever PostgreSQL hits this custom scan node in its Volcano-style execution, it will call our callback functions, which run the distributed plan and provide tuples to the upper node as if it were scanning a regular relation. This means fewer code changes, fewer bugs and more supported features for us! First, in the distributed query planner phase, we create a Custom Scan which wraps the distributed plan. For real-time and task-tracker executors, we add this custom plan under the master query plan. For the router executor, we directly pass the custom plan because there is no master query. Then, we simply let the PostgreSQL executor run this plan. When it hits the custom scan node, we call the related executor parts for the distributed plan, fill the tuple store in the custom scan and return results to the PostgreSQL executor in Volcano style, a tuple per XXX_ExecScan() call. * Modify planner to utilize Custom Scan node. * Create different scan methods for different executors. * Use native PostgreSQL Explain for master part of queries.	2017-03-14 12:17:51 +02:00
Andres Freund	52358fe891	Initial temp table removal implementation	2017-03-14 12:09:49 +02:00
Jason Petersen	6f4886cd11	Revert "Remove unused SendCommandToWorker" This reverts commit `c8c308c109`.	2017-03-13 15:48:51 -06:00
Murat Tuncer	1599e943e7	Merge pull request #1272 from citusdata/router_planner_range_partitioned Enable router planner for queries on range partitioned table	2017-03-09 16:18:12 +02:00
Murat Tuncer	f657a744d5	Enable router planner for queries on range partitioned tables Router planner now supports queries using range partitioned tables. Queries on append partitioned tables are still not supported.	2017-03-09 16:39:15 +03:00
Jason Petersen	e060252431	Merge pull request #1263 from citusdata/docker-engine Newer docker installation instructions cr: @jasonmp85	2017-03-08 16:18:49 -07:00
Joe Nelson	b2dd568dba	Use curl everywhere and prevent nested shell session	2017-03-08 15:43:26 -07:00
Joe Nelson	0aec20c8b3	More consistent references to Docker products	2017-03-08 15:43:26 -07:00
Joe Nelson	f5ce760ad3	Link to (for now) newest docker-compose binary	2017-03-08 15:43:26 -07:00
Joe Nelson	a86398fbe3	Make outermost list ordered to emphasize the sequence	2017-03-08 15:43:25 -07:00
Joe Nelson	e8ef19ed30	Docker download url has changed	2017-03-08 15:43:25 -07:00
Joe Nelson	02129b1860	Link to the new tutorial as well	2017-03-08 15:43:25 -07:00
Joe Nelson	8e876bd9c6	Native docker on mac, plus smoother instructions	2017-03-08 15:43:24 -07:00
Brian Cloutier	c8c308c109	Remove unused SendCommandToWorker	2017-03-08 16:30:23 +03:00
Jason Petersen	f5fbf1e621	Merge pull request #1255 from citusdata/remove-unused-func Remove unused metadata functions. cr: @jasonmp85	2017-03-07 16:37:54 -07:00
Brian Cloutier	a2ba565a9e	Remove unused master_stage_shard_{placement_,}row	2017-03-07 11:59:26 +03:00
Brian Cloutier	95936ff481	Remove unused master_get_round_robin_candidate_nodes	2017-03-07 11:51:24 +03:00
Brian Cloutier	807beb7bc0	Remove master_get_local_first_candidate_nodes	2017-03-07 11:50:59 +03:00
Andres Freund	fa5b8fb39f	Fix SendRemoteCommandParams() handling of a NULL MultiConnection->pgConn. (#1271) Previously we'd segfault in PQisnonblocking() which, contrary to other libpq calls, doesn't handle a NULL PGconn (because there'd be no appropriate return value for that). cr: @jasonmp85	2017-03-03 12:02:15 -07:00
Murat Tuncer	0a58a7b2c1	Merge pull request #1264 from citusdata/bugfix/1251_remove_default_value Remove default clause from shard DDL when sequences are used	2017-03-01 21:03:40 +02:00
Murat Tuncer	72027f2eba	Remove default clause from shard DDL when sequences are used	2017-03-01 17:32:48 +03:00
Marco Slot	bab1b65491	Fix spelling in master_initialize_node_metadata comment	2017-03-01 12:27:50 +01:00
Jason Petersen	33b58f8e26	Merge pull request #1117 from citusdata/create_table_data_migration Migrate data on create_distributed_table cr: @jasonmp85	2017-02-28 22:55:06 -07:00
Jason Petersen	047825c6ca	Rename misleading allowEmpty parameter Last bit of PR feedback.	2017-02-28 22:48:00 -07:00
Marco Slot	56d4d375c2	Address review feedback in create_distributed_table data loading	2017-02-28 17:39:45 +01:00
Marco Slot	db98c28354	Address review feedback in COPY refactoring	2017-02-28 17:39:45 +01:00
Marco Slot	d74fb764b1	Use CitusCopyDestReceiver for regular COPY	2017-02-28 17:24:45 +01:00
Marco Slot	d11eca7d4a	Load data into distributed table on creation	2017-02-28 17:24:45 +01:00
Marco Slot	bf3541cb24	Add CitusCopyDestReceiver infrastructure	2017-02-28 17:24:45 +01:00
Burak Velioglu	424745a49a	Merge pull request #1246 from citusdata/disallow_master_appy_delete_on_hash Disallow master_apply_delete_command on hash distributed table	2017-02-24 10:47:49 +02:00
Burak Velioglu	e158c7ae67	Merge branch 'master' into disallow_master_appy_delete_on_hash	2017-02-24 10:40:23 +02:00
Burak Velioglu	ac16d4f15f	Merge pull request #1248 from citusdata/fix_error_message_of_start_metadata_sync Fix error message of start_metadata_sync_to_node	2017-02-22 17:13:47 +02:00
velioglu	4dbb69cfc3	Fix error message of start_metadata_sync_to_node Single quotation mark is added around nodename to make the error code consistent with master_add_node usage.	2017-02-22 18:03:58 +03:00
Metin Döşlü	e3e1436680	Merge pull request #1252 from citusdata/reproducible_costs Get reproducible costs between different PostgreSQL versions	2017-02-22 16:04:21 +02:00
Metin Doslu	ee425871ee	Get reproducible costs between different PostgreSQL versions	2017-02-22 15:40:02 +02:00
Burak Velioglu	49812ddfa0	Disallow master_apply_delete_command on hash distributed table Delete operation is blocked for any table distributed by hash using master_apply_delete_command. Suggested master_modify_multiple_shards command as a hint.	2017-02-22 11:54:46 +03:00
Metin Döşlü	b8e4763a1a	Merge pull request #1242 from citusdata/debug4_to_debug2 Use DEBUG2 instead of DEBUG4 in INSERT SELECT tests & debug message	2017-02-20 13:29:31 +02:00
Andres Freund	9721e80901	Use DEBUG2 instead of DEBUG4 in INSERT SELECT tests & debug message. During later work the transaction debug output will change (as it will in postgres 10), which makes it hard to see actual changes in the INSERT ... SELECT ... test. Reduce to DEBUG2 after changing a debug message to that log level.	2017-02-20 12:56:16 +02:00
Eren Başak	5184fc1d92	Merge pull request #1206 from citusdata/for_statement_replication_on_old_apis Enforce statement based replication on old APIs and non-hash tables	2017-02-16 10:45:09 -08:00
Eren Basak	df9cf346ee	Enforce statement based replication on old APIs and non-hash tables This change ignores the `citus.replication_model` setting and uses statement based replication in - Tables distributed via the old `master_create_distributed_table` function - Append and range partitioned tables, even if created via the `create_distributed_table` function This seems like the easiest solution to #1191, without changing the existing behavior and harming existing users with custom scripts. This change also prevents RF>1 on streaming replicated tables on `master_create_worker_shards`. Prior to this change, the `master_create_worker_shards` command was not checking the replication model of the target table, thus allowing RF>1 with streaming replicated tables. With this change, `master_create_worker_shards` errors out in that case.	2017-02-16 10:37:53 -08:00
sumedhpathak	1ba078caea	Merge pull request #1226 from citusdata/6.1-docs-changes Update links to Documentation in Readme to 6.1	2017-02-10 17:12:26 -08:00
sumedhpathak	bb1d782384	Update links to Documentation to 6.1	2017-02-10 17:02:18 -08:00
Jason Petersen	bc21da6f6f	Add 6.1.0 CHANGELOG entries (#1219) This is probably missing some stuff, but is my edit of the initial list compiled by Burak. cr: @craigkerstiens	2017-02-09 17:05:17 -07:00
Jason Petersen	ad19b668ac	Fix tests broken by new PostgreSQL patch releases (#1220) PostgreSQL 9.5.6 and 9.6.2 were released today and broke several tests by adding TABLESPACE pg_default output to some DDL commands. Fixed all occurrences. cr: @anarazel	2017-02-09 16:53:02 -07:00
Önder Kalacı	1875f86d3b	Merge pull request #1213 from citusdata/fix_fkey_crash Bugfix for creating foreign key	2017-02-07 10:26:11 +02:00
Onder Kalaci	95f8382ca2	Bugfix for creating foreign key This commit fixes a backend crash that occurred when adding a foreign key without specifying the referenced column.	2017-02-07 09:34:24 +02:00
Brian Cloutier	e6e5f63d9d	Utility hook does nothing if the extension is not loaded	2017-02-02 17:48:31 +02:00
Brian Cloutier	a30b9b93a4	Set a memory context when throwing deferred errors	2017-02-02 15:14:21 +02:00
Brian Cloutier	e3c763c3f7	Start remote transactions in master_append_table_to_shard Add a call to RemoteTransactionBeginIfNecessary so that BEGIN is actually sent to the remote connections. This means that ROLLBACK and Ctrl-C are respected and don't leave the table in a partial state.	2017-02-01 18:12:19 +02:00
Jason Petersen	1ff9d68e68	Merge pull request #1182 from citusdata/fix_random_travis_failure Fix Random Fails on Travis cr: @jasonmp85	2017-01-31 16:42:19 -07:00
Eren Basak	005c533ee8	Fix Random Fails on Travis This change fixes the random failures on Travis, which were caused by a bug introduced with citus/#1124. Before this fix, Travis was failing randomly on the `check_multi_mx` test schedule, specifically in the parallel group of the `multi_mx_metadata`, `multi_mx_modifications` and `multi_mx_modifying_xacts` tests. This change fixes this by serializing these three test cases.	2017-01-31 15:23:06 -08:00
Eren Başak	20139b9b1a	Merge pull request #1183 from citusdata/allow_drop_sequence_on_worker Allow dropping sequences on mx workers	2017-01-31 14:59:29 -08:00
Eren Basak	ae0bfb1394	Allow dropping sequences on mx workers This change allows users to drop sequences on MX workers. Previously, Citus didn't allow dropping sequences on MX workers because it could cause shards to be dropped if `DROP SEQUENCE ... CASCADE` is used. We now allow that since allowing sequence creation but not dropping hurts user experience and also may cause problems with custom Citus solutions.	2017-01-31 14:51:44 -08:00
Samay Sharma	88f4956064	Merge pull request #1178 from citusdata/ozgune-patch-1 Update README.md to include use cases	2017-01-30 13:24:52 -08:00
Ozgun Erdogan	80e241e5c8	Update README.md to include use cases Updated the README.md to reflect newer product features. * Include references to the multi-tenant use case * Minor update to tutorial link to include 6.0 * Minor update to Contributing guidelines (we haven't used #helpwanted tags over the past year) * Minor update to remove link to support & training	2017-01-27 16:36:33 -08:00
Brian Cloutier	6843ad8e91	Fix bug where router executor sends query to failed connections	2017-01-27 09:40:30 +02:00
Brian Cloutier	1173f3f225	Refactor CheckShardPlacements - Break CheckShardPlacements into multiple functions (The most important is MarkFailedShardPlacements), so that we can get rid of the global CoordinatedTransactionUses2PC. - Call MarkFailedShardPlacements in the router executor, so we mark shards as invalid and stop using them while inside transaction blocks.	2017-01-26 13:20:45 +02:00
Marco Slot	c44ae463ae	Merge pull request #1168 from citusdata/copy_inactive Set placement to inactive on connection failure in COPY	2017-01-26 13:23:13 +04:00
Murat Tuncer	0e635b69f0	Add copy failure tests inside transactions	2017-01-26 11:54:40 +03:00
Murat Tuncer	1107439ade	Fix dependent tests	2017-01-25 19:19:39 +03:00
Murat Tuncer	5194111420	Add failure case for regression tests	2017-01-25 19:19:39 +03:00
Marco Slot	f56454360c	Mark failed placements as inactive immediately after COPY	2017-01-25 19:19:39 +03:00
Marco Slot	b1626887d5	Don't mark placements inactive in COPY after successful connection	2017-01-25 19:19:38 +03:00
Marco Slot	d0c76407b8	Set placement to inactive on connection failure in COPY	2017-01-25 19:19:38 +03:00
Marco Slot	4476a7be81	Merge pull request #1171 from citusdata/utility_schemanode_fix Remove call to SchemaNode() in multi_ProcessUtility	2017-01-25 12:23:47 +01:00
Marco Slot	85c1a87999	Short circuit in multi_ProcessUtility on ABORT/COMMIT	2017-01-25 11:57:00 +01:00
Marco Slot	2748660b1c	Always skip foreign key validation when enable_ddl_propagation is off	2017-01-25 11:56:59 +01:00
Marco Slot	4929e2b503	Merge pull request #1163 from citusdata/mx_errors Improve terminology in MX error messages	2017-01-25 11:14:21 +01:00
Marco Slot	ba940a1de9	Use coordinator instead of schema node in terminology	2017-01-25 11:07:23 +01:00
Marco Slot	72725ba30c	Use bigserial instead of BIGINT in sequence error	2017-01-25 11:07:23 +01:00
Burak Yücesoy	9f9fc09320	Merge pull request #1172 from citusdata/add_order_by_to_tests Add ORDER BY to some tests to have consistent output	2017-01-25 11:49:05 +03:00
Burak Yucesoy	00dd42dc44	Add ORDER BY to some tests to have consistent output	2017-01-25 11:43:25 +02:00
Marco Slot	a16c333dfc	Merge pull request #1165 from citusdata/updating-readme Updating citus cloud link to point to sign up	2017-01-24 09:52:58 +01:00
Craig Kerstiens	dfa547dbc1	updating citus cloud link to point to sign up	2017-01-24 09:09:06 +01:00
Eren Başak	8a83665ea0	Merge pull request #1124 from citusdata/mx_worker_tests MX regression tests for queries from workers	2017-01-24 09:53:50 +02:00
Eren Basak	88e9a429e1	Add Regression Tests For Querying MX Tables from Workers	2017-01-24 10:36:59 +03:00
Burak Yücesoy	e796d5cb2b	Merge pull request #1136 from citusdata/convert_drop_shards_to_use_new_api Convert drop shards to use new API	2017-01-23 21:16:16 +03:00
Burak Yucesoy	d80e7849a4	Convert DropShards to use new connection API With this change DropShards function started to use new connection API. DropShards function is used by DROP TABLE, master_drop_all_shards and master_apply_delete_command, therefore all of these functions now support transactional operations. In DropShards function, if we cannot reach a node, we mark shard state of related placements as FILE_TO_DELETE and continue to drop remaining shards; however if any error occurs after establishing the connection, we ROLLBACK whole operation.	2017-01-23 21:08:41 +03:00
Burak Yucesoy	2489c59c15	In case of failed transactions update shard state only if it is FILE_FINALIZED Before this change, when a transaction failed, we updated the shard states of the related placements to FILE_INACTIVE during XACT_EVENT_PRE_COMMIT. However, that meant that if another code block had changed the shard state to something else (e.g. FILE_TO_DELETE) before XACT_EVENT_PRE_COMMIT, we overwrote it. To prevent that problem, in case of failure we now change the shard state only if its current shard state is FILE_FINALIZED.	2017-01-23 21:04:57 +03:00
Burak Yucesoy	484cb12cd0	Add LoadShardPlacement UDF This UDF returns a shard placement from cache given shard id and placement id. At the moment it iterates over all shard placements of given shard by ShardPlacementList and searches given placement id in that list, which is not a good solution performance-wise. However, currently, this function will be used only when there is a failed transaction. If a need arises we can optimize this function in the future.	2017-01-23 21:04:57 +03:00
Marco Slot	c227b35ca1	Merge pull request #1158 from citusdata/convert_multi_shard Use placement connection API for multi-shard transactions	2017-01-23 18:55:16 +01:00
Marco Slot	1585c02322	Use placement connection API for multi-shard transactions	2017-01-23 18:34:50 +01:00
Andres Freund	6becd43c8d	Merge pull request #1134 from citusdata/prepare_hack_poc Extended Prepared Statement Support	2017-01-23 09:31:33 -08:00
Andres Freund	6939cb8c56	Hack up PREPARE/EXECUTE for nearly all distributed queries. All router, real-time, task-tracker plannable queries should now have full prepared statement support (and even use router when possible), unless they don't go through the custom plan interface (which basically just affects LANGUAGE SQL (not plpgsql) functions). This is achieved by forcing postgres' planner to always choose a custom plan, by assigning very low costs to plans with bound parameters (i.e. ones where the postgres planner replanned the query upon EXECUTE with all parameter values provided), instead of the generic one. This requires some trickery, because for custom plans to work the costs for a non-custom plan have to be known, which means we can't error out when planning the generic plan. Instead we have to return a "faux" plan, that'd trigger an error message if executed. But due to the custom plan logic that plan will likely (unless called by an SQL function, or because we can't support that query for some reason) not be executed; instead the custom plan will be chosen.	2017-01-23 09:23:50 -08:00
Andres Freund	c244b8ef4a	Make router planner error handling more flexible. So far router planner had encapsulated different functionality in MultiRouterPlanCreate. Modifications always go through router, selects sometimes. Modifications always error out if the query is unsupported, selects return NULL. Especially the error handling is a problem for the upcoming extension of prepared statement support. Split MultiRouterPlanCreate into CreateRouterPlan and CreateModifyPlan, and change them to not throw errors. Instead errors are now reported by setting the new MultiPlan->planningError. Callers of router planner functionality now have to throw errors themselves if desired, but also can skip doing so. This is a pre-requisite for expanding prepared statement support. While touching all those lines, improve a number of error messages by getting them closer to the postgres error message guidelines.	2017-01-23 09:23:50 -08:00
Andres Freund	7681f6ab9d	Centralize more of distributed planning into CreateDistributedPlan(). The name CreatePhysicalPlan() hasn't been accurate for a while, and the split of work between multi_planner() and CreatePhysicalPlan() doesn't seem perfect. So rename to CreateDistributedPlan() and move a bit more logic in there.	2017-01-23 09:23:50 -08:00
Andres Freund	557ccc6fda	Support for deferred error messages. It can be useful, e.g. in the upcoming prepared statement support, to be able to return an error from a function that is not raised immediately, but can later be thrown. That allows e.g. to attempt to plan a statement using different methods and to create good error messages in each planner, but to only error out after all planners have been run. To enable that create support for deferred error messages that can be created (supporting errorcode, message, detail, hint) in one function, and then thrown in a different place.	2017-01-23 09:23:50 -08:00
Andres Freund	9a82e8f06b	Make usage of static a bit more consistent in multi_planner.c.	2017-01-23 09:23:50 -08:00
Jason Petersen	849f70a409	Merge pull request #1137 from citusdata/make_rep_model_explicit Add replication_model GUC cr: @anarazel	2017-01-23 09:16:52 -07:00
Jason Petersen	56197dbdba	Add replication_model GUC This adds a replication_model GUC which is used as the replication model for any new distributed table that is not a reference table. With this change, tables with replication factor 1 are no longer implicitly MX tables. The GUC is similarly respected during empty shard creation for e.g. existing append-partitioned tables. If the model is set to streaming while replication factor is greater than one, table and shard creation routines will error until this invalid combination is corrected. Changing this parameter requires superuser permissions.	2017-01-23 09:05:14 -07:00
Brian Cloutier	fe5465aa4e	Port master_append_table_to_shard to new connection API (#1149) If any placements fail it doesn't update shard statistics on those placements. A minor enabling refactor: Make CoordinatedTransactionUses2PC public (it used to be CoordinatedTransactionUse2PC but that symbol already existed, so renamed it as well)	2017-01-23 15:57:44 +02:00
Burak Yücesoy	a3c20e4ea5	Merge pull request #1118 from citusdata/reword_outer_repartition_error_message Reword error message for outer joins requiring repartition	2017-01-23 11:02:21 +03:00
Burak Yucesoy	2e1df4c910	Reword error message for outer joins requiring repartition We changed the error message that appears when a user tries to execute an outer join command that requires repartitioning. The old error message mentioned 1-to-1 shard partitioning, which may not be clear to the user.	2017-01-23 10:42:36 +03:00
Marco Slot	5de4054443	Merge pull request #1156 from citusdata/enable_deadlock_prevention Add an enable_deadlock_prevention flag to enable transactions across nodes	2017-01-22 22:06:54 +04:00
Marco Slot	ea855ddf86	Add an enable_deadlock_prevention flag to allow router transactions to expand to multiple nodes	2017-01-22 17:31:24 +01:00
Marco Slot	1d710153d2	Merge pull request #1157 from citusdata/unique_job_id Ensure job IDs are unique across workers	2017-01-22 20:06:31 +04:00
Marco Slot	87ae26aef3	Ensure job IDs are unique across workers	2017-01-22 16:55:14 +01:00
Andres Freund	4a84ee5512	Merge pull request #1155 from citusdata/feature/connection_cleanup Remove parts of old connection / transaction infrastructure	2017-01-21 09:11:31 -08:00
Andres Freund	78b085106a	Remove connection_cache.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	6ec34bed84	Remove remnants of commit_protocol.[ch].	2017-01-21 09:01:15 -08:00
Andres Freund	fd717d6da9	Consistently use libpq forward declaration in remote_commands.h.	2017-01-21 09:01:14 -08:00
Andres Freund	52c3369f79	Minimal citus tools conversion to new connection API.	2017-01-21 09:01:14 -08:00
Önder Kalacı	92254ac6b2	Merge pull request #1148 from citusdata/fix_command_counter_increment Improve heap access methods for upgrading to reference tables	2017-01-21 09:33:13 +02:00
Önder Kalacı	594fa761e1	Merge branch 'master' into fix_command_counter_increment	2017-01-21 09:21:19 +02:00
Andres Freund	5b58338ace	Merge pull request #1133 from citusdata/copy_with_new_connection_api Convert multi_copy to new connection api	2017-01-20 22:22:50 -08:00
Murat Tuncer	d76f781ae4	Convert multi copy to use new connection api This enables proper transactional behaviour for copy and relaxes some restrictions like combining COPY with single-row modifications. It also provides the basis for relaxing restrictions further, and for optionally allowing connection caching.	2017-01-20 19:15:19 -08:00
Jason Petersen	cc594e00ca	Merge pull request #1151 from citusdata/default_rep_factor_one Change default replication factor to one cr: @anarazel	2017-01-20 19:11:02 -07:00
Jason Petersen	4e7b23472c	Change default replication factor to one Took the quick-and-dirty approach of changing it back to two during test runs. Can update tests to expect one in due time.	2017-01-20 18:56:43 -07:00
Joe Nelson	ff7ca32bae	Merge pull request #1152 from citusdata/craig-readme-update Suggest cloud in getting started section of README	2017-01-20 16:51:04 -08:00
Joe Nelson	8ccd5ca204	Suggest cloud in quickstart Plus other updates	2017-01-20 16:39:04 -08:00
Andres Freund	a296aca257	Merge pull request #1146 from citusdata/explain_duplication Don't duplicate planner logic in multi_explain.c	2017-01-20 13:58:48 -08:00
Andres Freund	3a36d32c43	Mark some now unnecessarily exposed multi_planner.c functions static.	2017-01-20 12:31:56 -08:00
Andres Freund	608bed0387	Don't duplicate planning logic in citus' explain hook. Instead use pg_plan_query() like the normal explain does, and use that to explain the query. That's important because it allows to remove the duplicated planner logic from multi_explain - and that logic is about to get more complicated.	2017-01-20 12:31:28 -08:00
Andres Freund	0f28a11970	Remove citus.explain_multi_logical/physical_plan. They make fixing explain for prepared statement harder, and they don't really fit into EXPLAIN in the first place. Additionally they're currently not exercised in any tests.	2017-01-20 12:31:19 -08:00
Onder Kalaci	bd825be340	Improve heap access methods This commit improves heap access methods for reference table upgrade and colocation group modifications.	2017-01-20 14:53:29 +02:00
Metin Döşlü	ac7235bfaa	Merge pull request #1145 from citusdata/tenant_isolation_community Tenant isolation	2017-01-20 13:49:04 +02:00
Metin Doslu	2bd8f8f12e	Add a function to delete shard metadata from MX nodes	2017-01-20 14:38:01 +02:00
Metin Doslu	93e626c896	Refactor get_shard_id_for_distribution_column() and other minor changes	2017-01-20 14:38:01 +02:00
Metin Doslu	ed77260aa1	Return a deep copy shard list from ColocatedShardIntervalList()	2017-01-20 14:38:01 +02:00
Metin Doslu	7cff8719c2	Add worker_hash() and a stub for isolate_tenant_to_new_shard()	2017-01-20 14:38:01 +02:00
Jason Petersen	46abe5b692	Merge pull request #1143 from citusdata/remove_hint_from_master_remove_node Remove hint message from master_remove_node UDF cr: @jasonmp85	2017-01-18 22:43:47 -07:00
Murat Tuncer	c12bd7b75e	Remove hint message from master_remove_node UDF Hint about master_disable_node was giving wrong impression to users. Removal is better than keeping it.	2017-01-18 22:33:00 -07:00
Eren Başak	c0f1a7609f	Merge pull request #1103 from citusdata/mx/reference_table_support MX Support for Reference Tables	2017-01-18 17:15:12 +02:00
Eren Basak	4def1ca696	Prevent COPY to reference tables from worker nodes	2017-01-18 17:38:01 +03:00
Eren Basak	e7c15ecc1f	Make `upgrade_to_reference_table` function MX-compatible	2017-01-18 16:49:50 +03:00
Eren Basak	56ca590daa	Propagate metadata changes for deleted reference table placements on master_remove_node call	2017-01-18 16:00:07 +03:00
Eren Basak	be78769ae4	Propagate new reference table placement metadata on `master_add_node`	2017-01-18 15:59:06 +03:00
Eren Basak	23b2619412	Make reference table metadata synced to workers	2017-01-18 15:59:05 +03:00
Eren Basak	e44d226221	Propagate Metadata to Workers on `create_reference_table` call.	2017-01-18 11:05:24 +03:00
Eren Başak	2668160fe8	Merge pull request #1090 from citusdata/mx_sequences Add Sequence Support For MX Tables	2017-01-18 09:18:33 +02:00
Eren Basak	b686d9a025	Add Sequence Support for MX Tables This change adds support for serial columns to be used with MX tables. Prior to this change, sequences of serial columns were created in all workers (for being able to create shards) but never used. With MX, we need to set the sequences so that sequences in each worker create unique values. This is done by setting the MINVALUE, MAXVALUE and START values of the sequence.	2017-01-18 09:43:38 +03:00
Eren Basak	b1ce8d61c0	Create Invalidation Trigger for pg_dist_local_group Table Updates	2017-01-18 09:43:38 +03:00
Andres Freund	b9c4a4b378	Merge pull request #1139 from citusdata/fix-1138 Query placementId in RemoteFinalizedShardPlacementList().	2017-01-17 13:49:53 -08:00
Andres Freund	bdef35ac14	Query placementId in RemoteFinalizedShardPlacementList(). Not having the id in the ShardPlacement struct causes issues while making copy use the placement aware connection management.	2017-01-17 13:27:26 -08:00
Brian Cloutier	67ee357d7f	Port WorkerShardStats to new connection API Part of the work in citusdata/citus#1101, this is a pretty direct port over to the new functions and shouldn't result in any behavior changes.	2017-01-17 17:04:37 +02:00
Brian Cloutier	b1b2b4fadf	Create ExecuteOptionalRemoteCommand A small refactor which pulls some code out of `RecoverWorkerTransactions` and into `remote_commands.c`. This code block currently only occurs in `RecoverWorkerTransactions` but will be useful to other functions shortly. Unfortunately we couldn't call it `ExecuteRemoteCommand`, that name was already taken.	2017-01-17 17:04:37 +02:00
Brian Cloutier	539a205462	Pass entire ShardPlacement into WorkerShardStats A small refactor so we'll be able to call the new connection API (which requires having a ShardPlacement) from within WorkerShardStats.	2017-01-17 17:04:37 +02:00
Andres Freund	3ea73b7879	Merge pull request #1120 from citusdata/feature/colocation_mapping Colocation aware placement connections	2017-01-16 13:54:59 -08:00
Andres Freund	b9385700ee	Make placement_connection.c colocation aware. Because of foreign keys and similar concerns there should only be a single modifying/DDL connection for a set of colocated placements to a node. To enforce this, placement_connection.c now has an additional hash table keeping track of the connections to a set of colocated placements. In addition to enforcing per-placement restrictions on connections, there are now very similar restrictions for sets of colocated placements.	2017-01-16 13:47:01 -08:00
Andres Freund	6972186652	Add ShardPlacement fields required for colocated placement connection mapping.	2017-01-16 13:42:54 -08:00
Andres Freund	1d79820b74	Fix use of wrong constant. This could potentially lead to spuriously shared connections if the first 63 characters of a hostname are the same.	2017-01-16 13:42:53 -08:00
Andres Freund	4b1d37b7be	Remove fields used in earlier revisions of placement_connection.c.	2017-01-16 13:42:53 -08:00
Jason Petersen	d6ed47a7e6	Merge pull request #1085 from citusdata/improve_insert_select_error_messages Improve error messages for INSERT INTO .. SELECT cr: @jasonmp85	2017-01-16 12:26:49 -07:00
Onder Kalaci	a7ed49c16e	Improve error messages for INSERT INTO .. SELECT This commit is intended to improve the error messages while planning INSERT INTO .. SELECT queries. The main motivation for this change is that we used to map multiple cases into a single message. With this change, we added explicit error messages for many cases.	2017-01-16 12:16:14 -07:00
Burak Yücesoy	e334e6bc40	Merge pull request #1105 from citusdata/update_reference_table_metadata_on_remove_node Remove placement metadata of reference tables after master_remove_node	2017-01-16 11:37:52 +03:00
Burak Yucesoy	3315ae6142	Remove placement metadata of reference tables after master_remove_node With this change, we start deleting the placements of reference tables on the given worker node after a master_remove_node UDF call. We remove the placement metadata on the master node, but we do not drop the actual shard from the worker node. There are two reasons for that decision. First, it is not critical to DROP the shards on the workers, because Citus will ignore them as long as the node is removed from the cluster, and if we add that node back to the cluster we will DROP and recreate all reference tables. Second, if the node is unreachable, it becomes complicated to cover failure cases and provide transaction support.	2017-01-16 11:24:56 +03:00
Murat Tuncer	0af5d553d1	Merge pull request #1024 from citusdata/feature/442_view_support Add view support for select queries	2017-01-13 09:02:07 +02:00
Murat Tuncer	e7935a3be4	Report error when original range table id is not found in NewTableId()	2017-01-13 09:39:43 +03:00
Murat Tuncer	77f8db6b14	Add view support Enables the use of views within distributed queries. Users can create and use a view on distributed tables/queries as they would with regular queries. After this change router queries will have full support for views; insert into select queries will support reading from views, but not writing into them. Outer joins have limited support and error out in certain cases, such as when a view is on the inner side of the outer join. Although PostgreSQL supports writing into views under certain circumstances, we disallowed that for distributed views.	2017-01-13 09:39:42 +03:00
Önder Kalacı	feef1bc70b	Merge pull request #1128 from citusdata/add_order_by_to_shardstate_tests Add ORDER BY clause to shard state tests to have consistent output	2017-01-13 03:18:47 +02:00
Burak Yucesoy	b2c61be4a2	Add ORDER BY clause to shard state tests to have consistent output In tests related to automatic reference table creation and deletion, there were some tests whose output may change order thus creating inconsistent test results. With this change we add ORDER BY clause to related tests to have consistent output.	2017-01-13 02:42:28 +03:00
Önder Kalacı	efa01be99e	Merge pull request #1122 from citusdata/onder_update_ref Refactor CheckShardPlacements() and improve support for node removal	2017-01-12 20:22:00 +02:00
Onder Kalaci	aed5f817fa	Refactor CheckShardPlacements() and improve support for node removal This commit refactors CheckShardPlacements() so that it only considers modifyingConnection. Also, it skips nodes which are removed from the cluster.	2017-01-12 20:10:10 +02:00
Murat Tuncer	dbaf7f0e7e	Merge pull request #1074 from citusdata/router_query_hint Add hint to errored real time queries	2017-01-12 10:56:28 +02:00
Murat Tuncer	cb1dfd0a17	Add hint to errored real time queries	2017-01-12 11:33:35 +03:00
Önder Kalacı	eeee0a7898	Merge pull request #1116 from citusdata/fix_reference_copy Copy on reference tables should never mark placements invalid	2017-01-12 02:55:16 +02:00
Onder Kalaci	1efa301ada	Copy on reference tables should never mark placements invalid This commit ensures that COPY does not mark the state of any reference table placement as INVALID in case of an error.	2017-01-12 02:43:41 +02:00
Eren Başak	6eb7751647	Merge pull request #1112 from citusdata/fix_worker_rack_escaping_bug Fix escaping of workerrack in NodeListInsertCommand	2017-01-11 09:25:12 +02:00
Eren Basak	859b920ba9	Fix escaping of workerrack in NodeListInsertCommand This change fixes a small bug about quoting of workerrack column in NodeListInsertCommand: Previous: `"..., '%s'", workerRack` Now: `"..., %s", quote_literal_cstr(workerRack)`	2017-01-11 10:18:48 +03:00
Andres Freund	c4b18da0dd	Merge pull request #1113 from citusdata/feature/colocation_mapping ShardInterval (improved) and ShardPlacement (new) caching.	2017-01-10 18:31:15 -08:00
Andres Freund	b813b39241	Cache ShardPlacements in metadata cache. So far we've reloaded them frequently. Besides avoiding that cost - noticeable for some workloads with large shard counts - it makes it easier to add information to ShardPlacements that help us make placement_connection.c colocation aware.	2017-01-10 18:14:18 -08:00
Andres Freund	8cb47195ba	Make LoadShardInterval() backed by the metadata cache. Doing so requires adding a mapping from shardId to the cache entries. For that metadata_cache.c now maintains an additional hashtable. That hashtable only references shard intervals in the dist table cache.	2017-01-10 17:00:19 -08:00
Andres Freund	f6e8647337	Split DistTableCacheEntry() into separate functions. Previously the function was getting too large. Thus this splits the function into separate parts for looking up the cache entry and building the cache contents.	2017-01-10 15:23:18 -08:00
Önder Kalacı	8624ef5ac4	Merge pull request #1115 from citusdata/fix_remove_node Fix CloseNodeConnections to actually close connections	2017-01-11 01:21:21 +02:00
Onder Kalaci	cd8e41bb79	Fix CloseNodeConnections to actually close connections CloseNodeConnections() is supposed to close connections to a given node. However, before this commit it did not actually call PQfinish() on the connections. Using CloseConnection() handles closing and all other necessary actions.	2017-01-11 01:13:58 +02:00
Andres Freund	2435cdb5d1	Merge pull request #1114 from citusdata/pg_regress_fix Fix diff option failure in regression test	2017-01-10 13:18:07 -08:00
Murat Tuncer	739934cdcb	Fix diff option failure in regression test	2017-01-10 22:58:47 +03:00
Murat Tuncer	54b1ebb14e	Merge pull request #1069 from citusdata/feature/citus_tools Feature/citus tools	2017-01-10 17:01:42 +02:00
Murat Tuncer	95862632de	Add citus tools to default configuration	2017-01-10 17:53:27 +03:00
Murat Tuncer	ec2ecafcf9	Merge pull request #1095 from citusdata/master_disable_node Add master_disable_node UDF	2017-01-10 10:10:15 +02:00
Murat Tuncer	b93185d800	Add master_disable_node UDF We can now remove nodes from cluster regardless of them having an active shard placement.	2017-01-10 10:54:57 +03:00
Burak Yücesoy	2a13f23176	Merge pull request #1106 from citusdata/error_out_cte_with_modify Error out on CTEs with data modifying statement	2017-01-10 09:35:46 +02:00
Burak Yucesoy	59d3d05bc4	Error out on CTEs with data modifying statement With this change we start to error out on router planner queries where a common table expression with data-modifying statement is present. We already do not support if there is a data-modifying statement using result of the CTE, now we also error out if CTE itself is data-modifying statement.	2017-01-10 10:30:09 +02:00
Andres Freund	8b3dc1f974	Merge pull request #1110 from citusdata/feature/pg_regress_diffopts Change diff output to unified in pg_regress_multi.pl.	2017-01-09 21:04:13 -08:00
Andres Freund	f2ee63d638	Change diff output to unified in pg_regress_multi.pl. Unified is better understood by a lot of people, and the default almost everywhere (including github).	2017-01-09 20:51:01 -08:00
Marco Slot	16608a3259	Merge pull request #1109 from citusdata/transaction_recovery_connection_api Use GetNodeConnection to establish a connection in transaction recovery	2017-01-09 18:31:00 -08:00
Marco Slot	ef326b202a	PQclear in ReportResultError to prevent memory leaks	2017-01-10 02:51:39 +01:00
Marco Slot	31231ce196	Use GetNodeConnection to establish a connection in transaction recovery	2017-01-10 02:44:34 +01:00
Andres Freund	4f6c2cac67	Merge pull request #1108 from citusdata/feature/interruptible-router-queries Use interrupt checking libpq wrappers in router executor.	2017-01-09 16:33:57 -08:00
Andres Freund	c390daed0f	Use interrupt checking libpq wrappers in router executor.	2017-01-09 14:02:45 -08:00
Andres Freund	d8cdb552e5	Merge pull request #1079 from citusdata/feature/shardstate Centralized placement ->connection mapping & placement health management	2017-01-09 13:28:22 -08:00
Andres Freund	7320c17f00	Convert router executor to placement connection management infrastructure. Remove the router specific transaction and shard management, and replace it with the new placement connection API. This mostly leaves behaviour alone, except that it is now, inside a transaction, legal to select from a shard to which no pre-existing connection exists. To simplify code the code handling task executions for select and modify has been split into two - the previous coding was starting to get confusing due to the amount of only conditionally applicable code. Modification connections & transactions are now always established in parallel, not just for reference tables.	2017-01-09 13:13:02 -08:00
Andres Freund	bfa742d794	Centralized shard/placement connection and state management. Currently there are several places in citus that map placements to connections and that manage placement health. Centralize this knowledge. Because of the centralized knowledge about which connection has previously been used for which shard/placement, this also provides the basis for relaxing restrictions around combining various forms of DDL/DML. Connections for a placement can now be acquired using GetPlacementConnection(). If the connection is used for DML or DDL the FOR_DDL/DML flags should be used respectively. If an individual remote transaction fails (but the transaction on the master succeeds) and FOR_DDL/DML have been specified, the placement is marked as invalid, unless that'd mark all placements for a shard as invalid.	2017-01-09 13:13:02 -08:00
Andres Freund	aca3770364	Merge pull request #1104 from citusdata/connmgr_cleanup Minor connection/transaction management related cleanups	2017-01-06 09:31:24 -08:00
Andres Freund	3286b99ff1	Remove useless changing of CurrentMemoryContext.	2017-01-06 09:16:45 -08:00
Andres Freund	6291998ae1	Use FinishConnectionListEstablishment() instead of manually iterating.	2017-01-06 09:16:01 -08:00
Andres Freund	d256f3fca9	Remove unused LogPreparedTransactions() function. This is unused since `92c7567008`.	2017-01-06 09:15:01 -08:00
Burak Yücesoy	350b1e6431	Merge pull request #1091 from citusdata/replicate_reference_table_on_add_node Replicate reference tables when new node is added	2017-01-05 13:37:45 +02:00
Burak Yucesoy	9c9f479e4b	Replicate reference tables when new node is added With this change, we start to replicate all reference tables to the new node when new node is added to the cluster with master_add_node command. We also update replication factor of reference table's colocation group.	2017-01-05 14:30:41 +03:00
Burak Yucesoy	1d18950860	Modify tests to create clean workspace Since we will now replicate reference tables each time we add node, we need to ensure that test space is clean in terms of reference tables before any add node operation. For this purpose we had to change order of multi_drop_extension test which caused change of some of the colocation ids.	2017-01-05 12:22:44 +03:00
Önder Kalacı	2e3e801768	Merge pull request #1058 from citusdata/reference_tables_use_2pc_and_add_to_task Allow reference tables to use 2PC for all modifications	2017-01-04 12:57:01 +02:00
Onder Kalaci	6d050fd677	Use 2PC for reference table modification With this commit, we ensure that the router executor always uses 2PC for reference table modifications and never marks their placements as INVALID.	2017-01-04 12:46:35 +02:00
Burak Yücesoy	43f5efecff	Merge pull request #1075 from citusdata/upgrade_reference_table Add upgrade_to_reference_table	2017-01-02 17:03:59 +02:00
Burak Yucesoy	31cd2357fe	Add upgrade_to_reference_table With this change we introduce a new UDF, upgrade_to_reference_table, which can be used to upgrade existing broadcast tables to reference tables. For upgrading, we require that the given table contains only one shard.	2017-01-02 17:54:42 +02:00
Eren Başak	7953916ae2	Merge pull request #1068 from citusdata/mx_error_on_unsupported_operations Error on Unsupported Features on Workers	2017-01-02 16:40:29 +02:00
Eren Basak	7e09bd6836	Error on Unsupported Features on Workers This change makes the metadata workers error out on unsupported commands.	2017-01-02 16:03:45 +03:00
Jason Petersen	3182287574	Merge pull request #1063 from citusdata/multi_shard_multi_connection Convert multi_shard_transaction to the new connection API cr: @jasonmp85	2016-12-30 14:56:17 -07:00
Marco Slot	59bc5972fa	Use MultiConnection in multi-shard transactions	2016-12-30 14:43:21 -07:00
Metin Döşlü	7f10d8562b	Merge pull request #1073 from citusdata/refactor_shard_index Add binary search capability to ShardIndex()	2016-12-30 18:02:22 +02:00
Metin Doslu	1ddc70ca55	Add binary search capability to ShardIndex() Renamed FindShardIntervalIndex() to ShardIndex() and added binary search capability. It used to assume that hash partition tables are always uniformly distributed which is not true if upcoming tenant isolation feature is applied. This commit also reduces code duplication.	2016-12-30 18:55:34 +02:00
Murat Tuncer	29e5e3e715	Merge pull request #1070 from citusdata/feature/where_is_null Add null clause test cases to router planner regression tests	2016-12-29 09:51:25 +02:00
Murat Tuncer	fc01a47ea4	Add null clause test cases to router planner regression tests Router planner already handles cases when all shards are pruned out. This is about missing test cases. Notice that "column is null" and "column = null" have different shard pruning behavior.	2016-12-29 10:42:31 +03:00
Eren Başak	2b90620f15	Merge pull request #1067 from citusdata/fix_mx_drop_sequence_deadlock Prevent Deadlock on Dropping MX Tables with Sequences	2016-12-28 15:52:53 +02:00
Eren Basak	e43eed0f7a	Prevent Deadlock on Dropping MX Tables with Sequences This change prevents a deadlock situation during DROP TABLE on an mx table with sequences on workers with metadata.	2016-12-28 16:32:20 +03:00
Burak Yücesoy	c8bdac699c	Merge pull request #1061 from citusdata/error_out_fk_on_reference_tables Error out on foreign keys with reference tables	2016-12-28 14:56:43 +02:00
Burak Yucesoy	88ee7802dd	Address Onder's comments	2016-12-28 12:26:16 +03:00
Burak Yucesoy	bb9e95e134	Error out on foreign keys with reference tables We have one replica of each reference table on each node. Therefore all problems with replication factor > 1 also apply to reference tables. As a solution we will not allow foreign keys on reference tables. It is not possible to define a foreign key from, to or between reference tables.	2016-12-28 10:58:26 +03:00
Murat Tuncer	19b96a17c1	Merge pull request #1031 from citusdata/fix/750_better_error Add error hint to failing modify query	2016-12-23 18:54:42 +02:00
Murat Tuncer	2f76b4be99	Add error hint to failing modify query	2016-12-23 19:43:55 +03:00
Marco Slot	25e28763b2	Merge pull request #1057 from citusdata/bugfix/add_node_failure Convert worker_transactions to new connection API	2016-12-23 16:30:40 +01:00
Marco Slot	6cbc1945f9	Enable transaction recovery in connection API	2016-12-23 16:14:29 +01:00
Marco Slot	92c7567008	Convert worker_transactions to new connection API	2016-12-23 16:14:29 +01:00
Marco Slot	00d55ad957	Add a wrapper for PQsendQuery	2016-12-23 16:14:29 +01:00
Marco Slot	87c62d598e	Connectionapify SendCommandListToWorkerInSingleTransaction	2016-12-23 16:14:29 +01:00
Burak Yücesoy	5cd21771a9	Merge pull request #1062 from citusdata/grant_public_select_access_to_metadata_tables GRANT SELECT access for metadata tables to public	2016-12-23 16:43:15 +03:00
Burak Yucesoy	0851fd2f0b	GRANT SELECT access for metadata tables to public Previously, we errored out if non-user tries to SELECT query for some metadata tables. It seems that we already GRANT SELECT access to some metadata tables but not others. With this change, we GRANT SELECT access to all existing Citus metadata tables.	2016-12-23 16:32:47 +03:00
Eren Başak	d608ef3311	Merge pull request #1045 from citusdata/propagate_mx_metadata_changes Propagate MX Metadata Changes	2016-12-23 14:58:49 +02:00
Eren Basak	31af40cc26	Handle MX tables on workers during drop table commands	2016-12-23 15:43:32 +03:00
Eren Basak	bed2e353db	Propagate `mark_tables_colocated` changes in `pg_dist_partition` table to metadata workers.	2016-12-23 15:43:32 +03:00
Eren Basak	71d73ec5ff	Propagate DDL commands to metadata workers for MX tables	2016-12-23 15:43:32 +03:00
Eren Basak	048fddf4da	Propagate MX table and shard metadata on `create_distributed_table` call	2016-12-23 15:43:32 +03:00
Eren Basak	efcb1f9dd9	Rename multi_metadata_snapshot to multi_metadata_sync to make it include future mx metadata syncing regression tests	2016-12-23 15:43:32 +03:00
Eren Basak	61a1e487d0	Mark hash distributed tables with replication factor = 1 as streaming replicated tables (repmodel=s). This works only with `create_distributed_table` call.	2016-12-23 15:43:31 +03:00
Marco Slot	6b947c4201	Merge pull request #1010 from citusdata/feature/insert_select_functions Evaluate functions in INSERT..SELECT	2016-12-23 13:24:21 +01:00
Marco Slot	11031bcf55	Enable evaluation of stable functions in INSERT..SELECT	2016-12-23 12:47:21 +01:00
Marco Slot	d745d7bf70	Add explicit RelationShards mapping to tasks	2016-12-23 10:23:43 +01:00
Marco Slot	b7d0a3237b	Merge pull request #1056 from citusdata/feature/mx_locks Add shard locking UDFs	2016-12-22 11:20:43 +01:00
Marco Slot	6852f8a951	Add shard locking UDFs	2016-12-22 11:04:34 +01:00
Burak Yücesoy	501a2ecead	Add get_distribution_value_shardid UDF (#1048) * Add get_distribution_value_shardid UDF With this UDF users can now map a given distribution value to a shard id. We mostly hide shardids from users to prevent unnecessary complexity, but some power users might need to know which entry/value is stored in which shard for maintenance purposes. The signature of this UDF is as follows: bigint get_distribution_value_shardid(table_name regclass, distribution_value anyelement)	2016-12-22 12:17:08 +03:00
Eren Başak	ce3fec00e5	Merge pull request #1055 from citusdata/ignore_multi_outer_join_reference_outputs Make git ignore multi_outer_join_reference test outputs	2016-12-21 15:11:29 +02:00
Eren Basak	cfcb1260a2	Make git ignore multi_outer_join_reference test outputs	2016-12-21 15:58:22 +03:00
Önder Kalacı	4ea4bfbf45	Merge pull request #1018 from citusdata/reference_table_base Reference table Phase-1	2016-12-20 14:15:20 +02:00
Onder Kalaci	2276e99347	Improve regression tests for multi_colocated_shard_transfer Ensure that regression tests outputs are consistent for multi_colocated_shard_transfer.	2016-12-20 14:09:35 +02:00
Onder Kalaci	9f0bd4cb36	Reference Table Support - Phase 1 With this commit, we implemented some basic features of reference tables. To start with, a reference table is * a distributed table without a distribution column defined on it * the distributed table is single sharded * and the shard is replicated to all nodes Reference tables follow the same code-path as single sharded tables. Thus, broadcast JOINs are applicable to reference tables. But, since the table is replicated to all nodes, table fetching is not required any more. Reference tables support uniqueness constraints on any column. Reference tables can be used in INSERT INTO .. SELECT queries with the following rules: * If a reference table is in the SELECT part of the query, it is safe to join it with another reference table and/or hash partitioned tables. * If a reference table is in the INSERT part of the query, all other participating tables should be reference tables. Reference tables follow the regular co-location structure. Since all reference tables are single sharded and replicated to all nodes, they are always co-located with each other. Queries involving only reference tables always follow the router planner and executor. Reference tables can have composite typed columns and there is no need to create/define the necessary support functions. All modification queries, master_* UDFs, EXPLAIN, DDLs, TRUNCATE, sequences, transactions, COPY, and schema support work on reference tables as expected. Plus, all the pre-requisites associated with distribution columns are dismissed.	2016-12-20 14:09:35 +02:00
Eren Başak	a71b79983b	Merge pull request #912 from citusdata/add_timeout_guc Add citus.node_connection_timeout GUC	2016-12-20 13:33:54 +02:00
Eren Basak	296e0bd33a	Add citus.node_connection_timeout GUC	2016-12-20 14:11:37 +03:00
Marco Slot	64c140e78e	Merge pull request #1049 from citusdata/bugfix/schema_owner Fix permissions for multi-user re-partition queries	2016-12-20 11:22:59 +01:00
Marco Slot	dd094bc372	Run copy commands in worker_merge_files_into_table as superuser	2016-12-20 10:15:42 +01:00
Marco Slot	42ff472721	Set user as pg_merge_job_* schema owner	2016-12-20 10:15:42 +01:00
Murat Tuncer	4914ccbaba	Merge pull request #1027 from citusdata/feature/930_keep_router_planner_active_at_all_times Make router planner active at all times	2016-12-20 10:56:13 +02:00
Murat Tuncer	c3a60bff70	Make router planner active at all times We used to disable router planner and executor when task executor is set to task-tracker. This change enables router planning and execution at all times regardless of task execution mode. We are introducing a hidden flag enable_router_execution to enable/disable router execution. Its default value is true. User may disable router planning by setting it to false.	2016-12-20 11:24:01 +03:00
Jason Petersen	6f95875191	Add targeted VACUUM/ANALYZE support Adds support for VACUUM and ANALYZE commands which target a specific distributed table. After grabbing the appropriate locks, this implementation sends VACUUM commands to each placement (using one connection per placement). These commands are sent in parallel, so users with large tables will benefit from sharding. Except for VERBOSE, all VACUUM and ANALYZE options are supported, including the explicit column list used by ANALYZE. As with many of our utility commands, the local command also runs. In the VACUUM/ANALYZE case, the local command is executed before any remote propagation. Because error handling is managed after local processing, this can result in a VACUUM completing locally but erroring out when distributed processing commences: a minor technicality in all cases, as there isn't really much reason to ever roll back a VACUUM (an impossibility in any case, as VACUUM cannot run within a transaction). Remote propagation of targeted VACUUM/ANALYZE is controlled by the enable_ddl_propagation setting; warnings are emitted if such a command is attempted when DDL propagation is disabled. Unqualified VACUUM or ANALYZE is not handled, but a warning message informs the user of this. Implementation note: this commit adds a "BARE" value to MultiShardCommitProtocol. When active, no BEGIN command is ever sent to remote nodes, useful for commands such as VACUUM/ANALYZE which must not run in a transaction block. This value is not user-facing and is reset at transaction end.	2016-12-16 16:59:06 -07:00
Metin Döşlü	6c333d464f	Merge pull request #1046 from citusdata/feature/colocate_with Add colocate_with option to create_distributed_table()	2016-12-16 14:33:22 +02:00
Metin Doslu	20b8f1feeb	Refactor distribution column type check for colocation	2016-12-16 15:24:45 +02:00
Metin Doslu	e2d0bd38f2	Don't allow tables with different replication models to be colocated	2016-12-16 15:23:49 +02:00
Metin Doslu	86cca54857	Add colocate_with option to create_distributed_table() With this commit, we support three versions of colocate_with: i. default, ii. none and iii. a specific table name.	2016-12-16 14:53:35 +02:00
Metin Doslu	edbedbd744	Move colocation related functions to colocation_utils.c	2016-12-16 14:52:40 +02:00
Marco Slot	b3593442c4	Merge pull request #1042 from citusdata/feature/column_name Expose the column_to_column_name UDF	2016-12-16 11:40:29 +01:00
Marco Slot	5714be0da5	Expose the column_to_column_name UDF to make partkey in pg_dist_partition human-readable	2016-12-14 10:46:33 +01:00
Eren Başak	9d4e586457	Merge pull request #997 from citusdata/add_sync_metadata_to_node Add sync_metadata_to_node UDF	2016-12-14 10:06:17 +02:00
Eren Basak	afbb5ffb31	Add stop_metadata_sync_to_node UDF	2016-12-14 10:53:12 +03:00
Eren Basak	b94647c3bc	Propagate CREATE SCHEMA commands with the correct AUTHORIZATION clause in start_metadata_sync_to_node	2016-12-14 10:53:12 +03:00
Eren Basak	fb08093b00	Make start_metadata_sync_to_node UDF to propagate foreign-key constraints	2016-12-14 10:53:12 +03:00
Eren Basak	5e96e4f60e	Make truncate triggers propagated on start_metadata_sync_to_node call	2016-12-14 10:53:10 +03:00
Eren Basak	4fd086f0af	Prevent Transactions in start_metadata_sync_to_node	2016-12-13 10:48:03 +03:00
Eren Basak	c154a91621	Add Regression Tests For start_metadata_sync_to_node	2016-12-13 10:48:03 +03:00
Eren Basak	9eff968d1f	Add start_metadata_sync_to_node UDF This change adds the `start_metadata_sync_to_node` UDF, which copies the metadata about nodes and MX tables from the master to the specified worker, sets its local group ID and marks its hasmetadata as true to allow it to receive future DDL changes.	2016-12-13 10:48:03 +03:00
Andres Freund	21effef8b5	Merge pull request #835 from citusdata/valgrind-support Allow to run regression tests under valgrind	2016-12-12 16:34:58 -08:00
Andres Freund	56aaa25cfa	Add support for running regression tests under valgrind. Note that this only provides infrastructure for running tests under valgrind - there are some spurious failures due to timeouts.	2016-12-12 15:42:11 -08:00
Andres Freund	dd149c3e24	Merge pull request #1020 from citusdata/feature/transaction-management Centralized Transaction Management Infrastructure	2016-12-12 15:27:27 -08:00
Andres Freund	80b34a5d6b	Integrate router executor into transaction management framework. One less place managing remote transactions. It also makes it fairly easy to use 2PC for certain modifications (e.g. reference tables). Just issue a CoordinatedTransactionUse2PC(). If every placement failure should cause the whole transaction to abort, additionally mark the relevant transactions as critical.	2016-12-12 15:18:12 -08:00
Andres Freund	fa5e202403	Convert multi_shard_transaction.[ch] to new framework.	2016-12-12 15:18:12 -08:00
Andres Freund	fc298ec095	Coordinated remote transaction management.	2016-12-12 15:18:12 -08:00
Andres Freund	6eeb43af15	Add PQgetResult() wrapper handling interrupts. This makes it possible to implement cancelling queries blocked on communication with remote nodes.	2016-12-12 15:18:12 -08:00
Andres Freund	5a00de6c62	Merge pull request #1017 from citusdata/vanillatests Add support for running postgres tests against a database with citus loaded	2016-12-12 14:34:20 -08:00
Andres Freund	b65814280f	Add support for running postgres tests against a database with citus loaded. Note that this is not with the citus extension loaded, just the shared library. The former runs (by adding --use-existing to the flags) but has a bunch of trivial test differences.	2016-12-12 14:25:06 -08:00
Jason Petersen	a426c50a8c	Merge pull request #1035 from citusdata/tweak_coverage Tweak coverage options cr: @jasonmp85	2016-12-09 14:19:52 -07:00
Jason Petersen	2ca9c78540	Bump target to 87.5% With the ignore rules in place, we can increase the target.	2016-12-09 14:06:35 -07:00
Jason Petersen	1e4f5e1a95	Format for readability	2016-12-09 13:33:58 -07:00
Jason Petersen	145537a5d2	Add ignore rules	2016-12-09 13:03:23 -07:00
Jason Petersen	14f4dfb242	Make patch target 75% If a patch has very bad coverage, reject it.	2016-12-09 13:03:23 -07:00
Jason Petersen	5bd07b253a	Be explicit about coverage project requirements If a change drops coverage (absolutely) beneath 80% or (relatively) reduces coverage by more than 0.5%, give it a bad status.	2016-12-09 13:03:23 -07:00
Jason Petersen	26ae8704d6	Add default Codecov configuration Will change shortly.	2016-12-09 13:03:23 -07:00
Andres Freund	4bea1f621a	Merge pull request #1022 from citusdata/feature/2PCBUMPISM Make prepared transactions available if not configured.	2016-12-08 20:04:00 -08:00
Andres Freund	7434fcc6df	Make prepared transactions available if not configured.	2016-12-08 19:57:22 -08:00
Burak Yücesoy	62b12f413c	Merge pull request #876 from citusdata/foreign_key_push_down_for_alter_table Add Foreign Key Support to ALTER TABLE commands	2016-12-08 14:11:01 +02:00
Burak Yucesoy	8d7cd4d746	Add Foreign Key Support to ALTER TABLE commands With this PR, we add foreign key support to ALTER TABLE commands. For now, we only support foreign constraint creation via an ALTER TABLE query if it is the only subcommand in the ALTER TABLE subcommand list. We also only allow foreign key creation if the replication factor is 1.	2016-12-08 15:03:25 +02:00
Andres Freund	f7d9074aa3	Merge pull request #863 from citusdata/feature/connection-lifecycle Connection Lifecycle Management Infrastructure	2016-12-07 11:51:02 -08:00
Andres Freund	2374905c89	Move multi_client_executor.[ch] on top of connection_management.[ch]. That way connections can be automatically closed after errors and such, and the connection management infrastructure gets wider testing. It also fixes a few issues around connection string building.	2016-12-07 11:44:24 -08:00
Andres Freund	a77cf36778	Use connection_management.c from within connection_cache.c. This is a temporary step towards removing connection_cache.c.	2016-12-07 11:44:24 -08:00
Andres Freund	3505d431cd	Add initial helpers to make interactions with MultiConnection et al. easier. This includes basic infrastructure for logging of commands sent to remote/worker nodes. Note that this has no effect as of yet, since no callers are converted to the new infrastructure.	2016-12-07 11:44:24 -08:00
Andres Freund	3223b3c92d	Centralized Connection Lifetime Management. Connections are tracked and released by integrating into postgres' transaction handling. That allows using connections without having to disable interrupts or use PG_TRY/CATCH blocks to avoid leaking connections. This is intended to eventually replace multi_client_executor.c and connection_cache.c, and to provide the basis of centralized transaction management. The newly introduced transaction hook should, in the future, be the only one in citus, to allow for proper ordering between operations. For now this central handler is responsible for releasing connections and resetting XactModificationLevel after a transaction.	2016-12-07 11:43:18 -08:00
Andres Freund	883af02b54	Add some basic helpers to make use of dynahash hashtables easier.	2016-12-06 14:15:36 -08:00
Jason Petersen	4347daaf4c	Merge pull request #994 from citusdata/enable-coverage-testing Add coverall support for continuous code coverage testing cr: @jasonmp85	2016-12-06 11:42:48 -07:00
Jason Petersen	ac6494a0c3	Add comment for otherwise opaque secure value	2016-12-06 11:30:22 -07:00
Brian Cloutier	29d2b0c49f	Enable instrumentation of coverage Adds an --enable-coverage configure option which provides the necessary flags for coverage instrumentation. A new tools branch uses this flag during all builds. Coverage reports are uploaded to codecov.io, where they are publicly visible.	2016-12-06 11:30:22 -07:00
Marco Slot	f2d151cfeb	Merge pull request #1006 from citusdata/bugfix/placement_id_readfunc Use READ_UINT64_FIELD for placement ID in ReadShardPlacement	2016-12-05 23:08:27 +01:00
Marco Slot	3d09a2e5c2	Use READ_UINT64_FIELD for placement ID in ReadShardPlacement	2016-12-05 17:22:23 +01:00
Murat Tuncer	51f2de276e	Merge pull request #995 from citusdata/fix-870-non-relational-filter Add new tests for non-relational filters in queries	2016-12-05 13:35:46 +02:00
Murat Tuncer	131ed8ca1f	Add new tests for non-relational filters in queries	2016-12-05 14:27:36 +03:00
Marco Slot	ab982c3ecb	Merge pull request #911 from citusdata/bugfix/take_metadata_lock Take shard metadata lock in several UDFs	2016-12-02 16:34:44 +01:00
Marco Slot	172bb457e6	Take shard metadata lock in master_append_table_to_shard	2016-12-02 15:56:30 +01:00
Eren Başak	8013437fc3	Merge pull request #996 from citusdata/sync_pg_dist_node Propagate node add/remove to the nodes with hasmetadata=true	2016-12-02 13:50:52 +02:00
Eren Basak	fb88b167a7	Propagate node add/remove to the nodes with hasmetadata=true This change propagates the changes done by `master_add_node` and `master_remove_node` to the workers that contain metadata.	2016-12-02 14:43:32 +03:00
Brian Cloutier	a4096c9f45	Remove dead code: ResponsiveWorkerNodeList	2016-12-02 13:14:11 +03:00
Andres Freund	0a4889d0af	Use system psql if available, to fix travis build errors. On some systems a newer libpq is available than the one we're compiling against, but until now we used psql in the version we're compiling against. That's a problem, because (quoting Jason): With 9.6, libpq's default handling of CONTEXT changed: it is hidden unless the level is ERROR or higher. We addressed this ourselves using the SHOW_CONTEXT variable (by setting "always" in pg_regress_multi): in 9.5, this is ignored (and unneeded), in 9.6, it ensures old behavior is preserved. For 9.6 we'd already worked around the problem by specifying that context should always be shown, but < 9.6 psql doesn't know how to do that. As there's no csql anymore, which strictly tied us to a specific version of psql/csql, we can now just use the system's psql if available. We still fall back to the psql of the installation we're compiling against, if there's no other psql in PATH.	2016-12-01 15:58:23 -08:00
Önder Kalacı	89352ae83f	Merge pull request #988 from citusdata/fix_constant_select Bugfix for deparsing INSERT..SELECT queries which involve constant va…	2016-12-01 10:48:01 +02:00
Onder Kalaci	df974e15b8	Bugfix for deparsing INSERT..SELECT queries which involve constant values This commit fixes a bug when the SELECT target list includes a constant value. Previous behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we only considered to be DEFAULTs), generate a new SELECT target entry * If the processed target entry count in SELECT target list is less than the original SELECT target list (GROUP BY elements not included in the SELECT target entry), add them in the SELECT target list and update the resnos accordingly. * However, this step was leading to add the CONST SELECT target entries twice. The reason is that when CONST target list entries appear in the SELECT target list, the INSERT target list doesn't include a Var. Instead, it includes CONST as it does for DEFAULTs. New behaviour of target list re-ordering: * Iterate over the INSERT target list * If it includes a Var, find the corresponding SELECT entry and update its resno accordingly * If it does not include a Var (which we consider to be DEFAULTs and CONSTs on the SELECT), generate a new SELECT target entry * If any target entry remains on the SELECT target list which are resjunk, (GROUP BY elements not included in the SELECT target entry), keep them in the SELECT target list by updating the resnos.	2016-12-01 10:41:56 +02:00
Murat Tuncer	86e3086ac8	Merge pull request #1002 from citusdata/fix/395_filters_do_not_work Add support for filter	2016-12-01 09:06:23 +02:00
Murat Tuncer	45762006f3	Add support for filters Ensures filter clauses are stripped from master query, and pushed down to worker queries.	2016-12-01 08:53:46 +03:00
Jason Petersen	19eaf3885f	Merge pull request #1001 from citusdata/add_changelog_entry_for_6.0.1 Add 6.0.1 CHANGELOG entry cr: @jasonmp85	2016-11-29 09:14:23 -07:00
Burak Yucesoy	5e5dc2a1cb	Add 6.0.1 CHANGELOG entry Couple fixes and improvements to existing behavior.	2016-11-29 08:05:36 -08:00
Marco Slot	4525ee937c	Merge pull request #976 from citusdata/fix-error-messa Fixup unsupported error message	2016-11-26 11:19:27 +01:00
Sumedh Pathak	0a0d4784b9	Change DDL error message to say "unsupported" instead of "supported"	2016-11-26 10:30:09 +01:00
Marco Slot	db3964bed9	Merge pull request #975 from citusdata/sumedhpathak-readme-change Update documentation link to 6.0	2016-11-26 10:28:45 +01:00
sumedhpathak	5c99b02620	Update documentation link to 6.0	2016-11-26 10:01:34 +01:00
Murat Tuncer	a1b7de0eb0	Merge pull request #973 from citusdata/fix/964_pg_upgrade_catalog_tables Fix failures during pg_upgrade	2016-11-11 17:33:43 -08:00
Murat Tuncer	b5c1ecb684	Fix failures during pg_upgrade - fix error in CitusHasBeenLoaded() - allow creation of pg_catalog tables during upgrade	2016-11-11 17:22:45 -08:00
Marco Slot	5ee6a0ee3f	Merge pull request #979 from citusdata/bugfix/null_parameter Pass down the correct type for null parameters	2016-11-11 16:57:06 -08:00
Marco Slot	b566c4815c	Pass down the correct type for null parameters	2016-11-11 07:14:08 +01:00
Metin Döşlü	ce3019d39e	Merge pull request #974 from citusdata/fix/use_access_share_lock Use AccessShareLock on the source table while creating a colocated table	2016-11-10 11:30:55 -08:00
Metin Doslu	a0c92b38cb	Use AccessShareLock on the source table while creating a colocated table While creating a colocated table, we don't want the source table to be dropped. However, using a ShareLock blocks DML statements on the source table, and using AccessShareLock is enough to prevent DROP. Therefore, we just loosened the lock to AccessShareLock.	2016-11-10 09:17:05 -08:00
Eren Başak	6810835e1a	Merge pull request #955 from citusdata/master_add_node_column_defs Add Column Definition List for Output Columns for master_add_node	2016-11-07 15:02:36 -08:00
Eren Basak	444f14d546	Add Column Definition List for Output Columns for master_add_node This change allows seeing the names of columns of `master_add_node`, using `SELECT * FROM master_add_node(...)` by specifying output columns in UDF definition.	2016-11-07 14:08:58 -08:00
Jason Petersen	217f899816	Bump CHANGELOG release dates Oops.	2016-11-07 09:51:18 -07:00
Jason Petersen	666909030f	Merge pull request #950 from citusdata/changelog-6.0 CHANGELOG changes cr: @sumedhpathak	2016-11-07 09:49:14 -07:00
Jason Petersen	e7aeb9865e	Add 6.0.0 CHANGELOG entry For the upcoming release.	2016-11-07 09:41:08 -07:00
Jason Petersen	24033a6d0b	Add 5.2.2 CHANGELOG entry Many fixes and improvements to existing behavior.	2016-11-07 09:41:08 -07:00
Marco Slot	cf49638287	Merge pull request #941 from citusdata/bugfix/repair_deadlock Avoid master_copy_shard_placement after modification deadlock	2016-11-03 10:57:32 +01:00
Marco Slot	c157c3b419	Disallow SendCommandListToWorkerInSingleTransaction when modifications have occurred	2016-11-02 12:26:56 +01:00
Marco Slot	d19c4869d6	Merge pull request #939 from citusdata/bugfix/colocated_multi_shards Use co-located shard ID in multi-shard transactions	2016-11-02 11:18:46 +01:00
Marco Slot	f6b3af7a49	Use co-located shard ID in multi_shard_transaction	2016-11-02 11:01:19 +01:00
Jason Petersen	74d8e4f640	Merge pull request #929 from citusdata/fix_create_index_if_not_exists Avoid error during CREATE INDEX IF NOT EXISTS cr: @jasonmp85	2016-11-01 17:11:39 -06:00
Samay Sharma	82e5faa190	Avoid error during CREATE INDEX IF NOT EXISTS Previously, we threw an error when we ran CREATE INDEX IF NOT EXISTS with an already existing index. This change enables expected behavior by checking if the statement has IF NOT EXISTS before throwing the error. We also ensure that we don't execute the command on the workers, if an index already exists on the master.	2016-11-01 14:51:19 -07:00
Burak Yücesoy	99658c626e	Merge pull request #937 from citusdata/fix_typo_in_error_message Fix typo in error message	2016-11-01 17:04:53 +02:00
Burak Yucesoy	b30b339f91	Fix typo in error message	2016-11-01 16:58:27 +02:00
Burak Yücesoy	2cb2e7a352	Merge pull request #936 from citusdata/fix_foreign_constraint_replication_factor_message Change error message we displayed for foreign constraints if RF > 1	2016-11-01 15:57:46 +02:00
Burak Yucesoy	6246702a4c	Change error message we displayed for foreign constraints if RF > 1 At the moment, we do not support foreign constraints if replication factor is greater than 1. However foreign constraints can be used in cloud with high availability option. Therefore we do not want to create an impression such that foreign constraints with high availability is not supported at all. We call users to action with this error message.	2016-11-01 15:47:19 +02:00
Marco Slot	a4d5da4132	Merge pull request #928 from citusdata/fix_drop_shards Always CASCADE while dropping a shard	2016-11-01 10:35:35 +01:00
Önder Kalacı	83e1719541	Always CASCADE while dropping a shard	2016-11-01 10:16:34 +01:00
Brian Cloutier	50805f1e5c	Copy raw_parse_tree before using it Address citusdata/citus#922. Fixes a segfault in PG's installcheck caused by our reuse of raw_parse_tree when handling EXPLAIN EXECUTE.	2016-10-27 18:25:49 +03:00
Önder Kalacı	f3aedb9289	Merge pull request #921 from citusdata/fix_execution_of_insert_select_bug Improve error semantics for INSERT..SELECT	2016-10-27 14:17:13 +03:00
Onder Kalaci	a43e3bad56	Improve error semantics for INSERT..SELECT With this commit, we error out if a worker query cannot be executed on all placements of a target insert shard interval.	2016-10-27 14:09:05 +03:00
Andres Freund	7ae3b31328	Merge pull request #917 from citusdata/isolationtester Basic Isolationtester infrastructure including some basic tests.	2016-10-27 04:03:37 -07:00
Andres Freund	dfe7b357c5	Simple isolationtester dml vs. repair tests.	2016-10-27 00:31:41 -07:00
Andres Freund	121b868da5	Add very basic isolationtester infrastructure including a trivial test.	2016-10-27 00:31:41 -07:00
Andres Freund	ce73ffdf2e	Identify build and source directory of postgres we're compiling against. That's useful when trying to rely on files only present in source and/or build directories, not in the normal installation. E.g. the isolationtester binary, or the valgrind suppression files.	2016-10-27 00:31:41 -07:00
Andres Freund	c3e1d49e34	Don't try to shutdown servers that have not been started in regression tests. This avoids spurious output from failing shutdowns and uninitialized variable warnings if pg_regress_multi.pl fails before starting servers.	2016-10-27 00:31:41 -07:00
Metin Döşlü	03c06a3b68	Merge pull request #920 from citusdata/fix/error_on_different_shard_placement_count Error on different shard placement counts	2016-10-26 18:54:08 +03:00
Metin Doslu	c6f5cabbe3	Error on different shard placement count In ErrorIfShardPlacementsNotColocated(), while checking if shards are colocated, error out if matching shard intervals have different number of shard placements.	2016-10-26 18:46:05 +03:00
Önder Kalacı	7f74d82835	Merge pull request #919 from citusdata/add_stub_for_repair_shards Add stub for Copy shard placement	2016-10-26 18:05:54 +03:00
Onder Kalaci	9cd549f21f	Add stub for Copy shard placement This commit does not change the current behaviour, but, helps to implement enterprise feature without any version changes.	2016-10-26 17:57:55 +03:00
Metin Döşlü	9969594e10	Merge pull request #915 from citusdata/add_mark_tables_colocated Add mark_tables_colocated() to update colocation groups	2016-10-26 17:37:29 +03:00
Metin Doslu	4e555880b7	Add mark_tables_colocated() to update colocation groups Added a new UDF, mark_tables_colocated(), to colocate tables with the same configuration (shard count, shard replication count and distribution column type).	2016-10-26 17:29:03 +03:00
Andres Freund	422bb51ff1	Merge pull request #916 from citusdata/bugfix/prepstate Re-acquire metadata locks in RouterExecutorStart	2016-10-26 06:54:13 -07:00
Marco Slot	275378aa45	Re-acquire metadata locks in RouterExecutorStart	2016-10-26 14:34:59 +02:00
Brian Cloutier	1e6d1ef67e	Fix segfault during EXPLAIN EXECUTE Fix citusdata/citus#886 The way postgres' explain hook is designed means that our hook is never called during EXPLAIN EXECUTE. So, we special-case EXPLAIN EXECUTE by catching it in the utility hook. We then replace the EXECUTE with the original query and pass it back to Citus.	2016-10-26 15:18:42 +03:00
Burak Yücesoy	61f6baf9e3	Merge pull request #908 from citusdata/only_repair_given_shard Only repair given shard	2016-10-26 14:44:48 +03:00
Burak Yucesoy	fc2fea839b	Only repair given shard Previously, when a repair is requested on a shard, we also repair all co-located shards of given shard, which may cause repairing already healthy shards. With this change, we only repair given shard.	2016-10-26 14:36:37 +03:00
Brian Cloutier	80c8cfeabe	Don't add a raw 32-bit int to tuples in create_distributed_table	2016-10-26 14:02:42 +03:00
Andres Freund	837ec67c80	Merge pull request #914 from citusdata/bugfix/minimal-127 Invalidate relcache after pg_dist_shard_placement changes.	2016-10-26 03:50:48 -07:00
Andres Freund	fcd150c7c8	Invalidate relcache after pg_dist_shard_placement changes. This forces prepared statements to be re-planned after changes of the placement metadata. There are some locking issues remaining, but that's a separate task. Also add regression tests verifying that invalidations take effect on prepared statements.	2016-10-26 03:36:35 -07:00
Önder Kalacı	fa8d39ec91	Merge pull request #859 from citusdata/feature/insert_select Feature/insert select	2016-10-26 11:27:16 +03:00
Onder Kalaci	1673ea937c	Feature: INSERT INTO ... SELECT This commit adds INSERT INTO ... SELECT feature for distributed tables. We implement INSERT INTO ... SELECT by pushing down the SELECT to each shard. To compute that we use the router planner, by adding an "uninstantiated" constraint that the partition column be equal to a certain value. standard_planner() distributes that constraint to all the tables where it knows how to push the restriction safely. One example is tables that are connected via equi joins. The router planner then iterates over the target table's shards, for each we replace the "uninstantiated" restriction, with one that PruneShardList() handles. Do so by replacing the partitioning qual parameter added in multi_planner() with the current shard's actual boundary values. Also, add the current shard's boundary values to the top level subquery to ensure that even if the partitioning qual is not distributed to all the tables, we never run the queries on the shards that don't match with the current shard boundaries. Finally, perform the normal shard pruning to decide on whether to push the query to the current shard or not. We do not support certain SQLs on the subquery, which are described/commented on ErrorIfInsertSelectQueryNotSupported(). We also added some locking on the router executor. When an INSERT/SELECT command runs on a distributed table with replication factor >1, we need to ensure that it sees the same result on each placement of a shard. So we added the ability such that router executor takes exclusive locks on shards from which the SELECT in an INSERT/SELECT reads in order to prevent concurrent changes. This is not a very optimal solution, but it's simple and correct. The citus.all_modifications_commutative can be used to avoid aggressive locking. An INSERT/SELECT whose filters are known to exclude any ongoing writes can be marked as commutative. See RequiresConsistentSnapshot() for the details. We also moved the decision of whether the multiPlan should be executed on the router executor or not to the planning phase. This allowed us to integrate multi task router executor tasks to the router executor smoothly.	2016-10-26 10:01:00 +03:00
Onder Kalaci	e0d83d65af	Add ability to reorder target list for INSERT/SELECT queries The necessity for this functionality comes from the fact that ruleutils.c is not supposed to be used on "rewritten" queries (i.e. ones that have been passed through QueryRewrite()). Query rewriting is the process in which views and such are expanded, and, INSERT/UPDATE targetlists are reordered to match the physical order, defaults etc. For the details of reordering, see transformInsertRow().	2016-10-26 10:00:03 +03:00
Jason Petersen	f900a1a107	Merge pull request #901 from citusdata/fix_udf_schemas Fix function schemas cr: @mtuncer @anarazel	2016-10-25 12:54:04 -06:00
Jason Petersen	73f5b8b05f	Move all funcs to pg_catalog, add test to verify We'd been relying on a single SET search_path command in an earlier script, but a subsequent script RESET search_path, causing any further bare functions to be created in the first schema on the search path. However, starting with an older extension version and executing ALTER scripts one at a time DOES avoid putting any functions in the public namespace, so I wrote an upgrade script resilient to that, especially because PostgreSQL 9.5 will error out if a function is already in the schema it's being moved to.	2016-10-25 12:45:53 -06:00
Brian Cloutier	750855bcc0	Merge pull request #910 from citusdata/int8-nodeport Treat nodePort as the 8byte number it is	2016-10-25 17:12:44 +03:00
Brian Cloutier	c6b74b023f	Treat nodePort as the 8byte number it is	2016-10-25 16:31:48 +03:00
Brian Cloutier	2e96f6ab27	Fix crash when upgrading to Citus 6 Between restart (running the new code) and ALTER EXTENSION citus UPGRADE there was an inconsistency where we assumed that pg_dist_partition had the repmodel column set. Now we give it a default value if the column doesn't exist yet.	2016-10-24 15:18:29 +03:00
Marco Slot	f5ae4330be	Merge pull request #885 from citusdata/feature/parallel_ddl Parallelise DDL commands	2016-10-24 13:29:51 +02:00
Marco Slot	271b20a23e	Parallelise DDL commands	2016-10-24 12:39:08 +02:00
Burak Yücesoy	18f6c9c1a7	Merge pull request #888 from citusdata/foreign_key_support_for_create_table Foreign key support for create table	2016-10-21 16:48:07 +03:00
Burak Yucesoy	5a03acf2bf	Foreign Constraint Support for create_distributed_table and shard move With this change, we now push down foreign key constraints created during CREATE TABLE statements. We also start to send foreign constraints during shard move along with other DDL statements	2016-10-21 15:38:55 +03:00
Marco Slot	84babaa58e	Merge pull request #895 from citusdata/bugfix/evaluation Re-disable master evaluation for SELECT	2016-10-21 11:38:11 +02:00
Marco Slot	02d2b86e68	Re-disable master evaluation for SELECT	2016-10-21 10:51:47 +02:00
Metin Döşlü	2dcca0939b	Merge pull request #892 from citusdata/add_create_reference_table_udf Add create_reference_table()	2016-10-20 15:41:38 +03:00
Metin Doslu	405335fcee	Add create_reference_table() create_reference_table() creates a hash distributed table with shard count equal to 1 and replication factor equal to the shard_replication_factor configuration value.	2016-10-20 15:29:30 +03:00
Metin Döşlü	b6a9b61d32	Merge pull request #867 from citusdata/add_create_distributed_table Add create_distributed_table() udf	2016-10-20 11:43:25 +03:00
Metin Doslu	d3e7d9dc8d	Final refactoring	2016-10-20 11:29:11 +03:00
Metin Doslu	58ac477ffb	Change return type of BuildDistributionKeyFromColumnName() to Var * BuildDistributionKeyFromColumnName() always returns a Var pointer, so there is no reason to return a Node pointer instead of a Var pointer.	2016-10-20 10:59:31 +03:00
Metin Doslu	161093908e	Convert colocationid to uint32	2016-10-20 10:59:31 +03:00
Metin Doslu	8334d853c0	Add local function GetNextShardId()	2016-10-20 10:59:31 +03:00
Metin Doslu	40bdafa8d1	Add create_distributed_table() create_distributed_table() creates a hash distributed table with default values of shard count and shard replication factor.	2016-10-20 10:58:25 +03:00
Metin Doslu	d04f4f5935	Add guc variable for shard count	2016-10-19 10:44:50 +03:00
Marco Slot	98e0648d40	Merge pull request #855 from citusdata/feature/parallel_modify Parallelise master_modify_multiple_shards and other things	2016-10-19 09:46:28 +03:00
Marco Slot	65f6d7c02a	Follow consistent execution order in parallel commands	2016-10-19 08:33:08 +02:00
Marco Slot	a497e7178c	Parallelise master_modify_multiple_shards	2016-10-19 08:33:08 +02:00
Marco Slot	9d98acfb6d	Move requiresMasterEvaluation from Task to Job	2016-10-19 08:23:06 +02:00
Marco Slot	213d8419c6	Refactor and redocument executor shard lock code	2016-10-19 08:13:35 +02:00
Jason Petersen	bd9a433709	Merge pull request #850 from citusdata/add_9.6_support Support PostgreSQL 9.6 cr: @anarazel	2016-10-18 16:30:30 -06:00
Andres Freund	ac14b2edbc	Support PostgreSQL 9.6 Adds support for PostgreSQL 9.6 by copying in the requisite ruleutils file and refactoring the out/readfuncs code to flexibly support the old-style copy/pasted out/readfuncs (prior to 9.6) or use extensible node APIs (in 9.6 and higher). Most version-specific code within this change is only needed to set new fields in the AggRef nodes we build for aggregations. Version-specific test output files were added in certain cases, though in most they were not necessary. Each such file begins by e.g. printing the major version in order to clarify its purpose. The comment atop citus_nodes.h details how to add support for new nodes for when that becomes necessary.	2016-10-18 16:23:55 -06:00
Murat Tuncer	a2f6b29a6d	Merge pull request #843 from citusdata/custom_udf_run_all Add Citus tools UDFs	2016-10-18 21:26:56 +03:00
Murat Tuncer	b453f6c7ab	Add master_run_on_worker UDF	2016-10-18 17:59:54 +03:00
Eren Başak	0bce20dd74	Merge pull request #864 from citusdata/migrate_worker_transactions Add worker transaction and transaction recovery infrastructure	2016-10-18 14:25:12 +03:00
Eren Basak	cee7b54e7c	Add worker transaction and transaction recovery infrastructure	2016-10-18 14:18:14 +03:00
Metin Döşlü	a317683046	Merge pull request #820 from citusdata/parameterized_queries_regression_tests Add regression tests for parameterized queries	2016-10-18 14:08:59 +03:00
Metin Doslu	27616cca52	Add regression tests for parameterized queries	2016-10-18 14:02:50 +03:00
Eren Başak	64c8972c19	Merge pull request #865 from citusdata/add_pg_dist_local_group_table Add pg_dist_local_group Metadata Table	2016-10-17 12:02:54 +03:00
Eren Basak	f3ede37c9f	Add hasmetadata column to pg_dist_node	2016-10-17 11:52:18 +03:00
Eren Basak	c7bf2021fa	Add metadata infrastructure for pg_dist_local_group table	2016-10-17 11:52:18 +03:00
Eren Basak	8f477d18f1	Add pg_dist_local_group Metadata Table This change adds the pg_dist_local_group metadata table, which indicates the group id of the current node. It is expected that this table contains one and only one row, which only contains the group id of the node as an integer.	2016-10-14 11:41:14 +03:00
Eren Başak	50569be29c	Merge pull request #874 from citusdata/fix_metadata_snapshot_test_placement_id_change Fix changing placement ids in metadata snapshot test	2016-10-14 11:35:46 +03:00
Eren Basak	630f199d3c	Fix changing placement ids in metadata snapshot test	2016-10-14 11:13:16 +03:00
Brian Cloutier	2965331d80	Merge pull request #868 from citusdata/746-drop-shardalias Drop shardalias	2016-10-14 11:10:54 +03:00
Brian Cloutier	6c3d79b4e7	Drop shardalias	2016-10-14 11:03:26 +03:00
Önder Kalacı	d4950ea510	Merge pull request #840 from citusdata/colocated_shard_copy_and_shard_move Colocation support for master_copy_shard_placement	2016-10-13 18:26:25 +03:00
Burak Yucesoy	6668d19a3b	Make shard transfer functions co-location aware With this change, master_copy_shard_placement and master_move_shard_placement functions start to copy/move given shard along with its co-located shards.	2016-10-13 18:16:40 +03:00
Metin Döşlü	0fbd19550d	Merge pull request #811 from citusdata/having_support Add HAVING support	2016-10-13 16:05:18 +03:00
Metin Doslu	d03a2af778	Add HAVING support This commit completes having support in Citus by adding having support for real-time and task-tracker executors. Multiple tests are added to regression tests to cover new supported queries with having support.	2016-10-13 15:47:53 +03:00
Eren Başak	736c73d008	Merge pull request #799 from citusdata/metadata_sync Add Metadata Snapshot Infrastructure	2016-10-13 10:47:48 +03:00
Eren Basak	ed3af403fd	Add Metadata Snapshot Infrastructure This change adds the required infrastructure about metadata snapshot from MX codebase into Citus, mainly metadata_sync.c file and master_metadata_snapshot UDF.	2016-10-13 10:40:14 +03:00
Jason Petersen	14315f05a5	Merge pull request #862 from citusdata/vars_for_job_and_task_ids Use single-quote interpolation in partition test cr: @marcocitus	2016-10-10 13:16:37 -06:00
Jason Petersen	d140d1c934	Use single-quote interpolation in partition test Noticed an old issue and this outdated comment. Figured I'd fix it.	2016-10-10 13:03:43 -06:00
Jason Petersen	76d86e1ac9	Merge pull request #860 from citusdata/fix_and_run_all_tests Fix tests and tell Travis to run them all cr: @marcocitus	2016-10-07 17:38:19 -06:00
Jason Petersen	bcfc58a7c7	Fix tests and tell Travis to run them all Two sets of tests are fixed by this change: * multi_agg_approximate_distinct * those in multi_task_tracker_extra_schedule The first broke when we renamed stage to load in many files and was never being run because the HyperLogLog extension wasn't easily available in Debian. Now it's in our repo, so we install it and run the test. I removed the distinct HLL target in favor of just always running it and providing an output variant to handle when the extension is absent. Basically, if PostgreSQL thinks HLL is available, the test installs it and runs normally, otherwise the absent variant is used. The second broke when I removed a test variant, erroneously believing it to be related to an older Citus version. I've added a line in that test to clarify why the variant is necessary (a practice we should widely adopt).	2016-10-07 17:32:54 -06:00
Andres Freund	85075b7c28	Merge pull request #857 from citusdata/feature/placementid Introduce placement IDs.	2016-10-07 12:54:12 -07:00
Marco Slot	33b7723530	Use UpdateShardPlacementState where appropriate	2016-10-07 11:59:20 -07:00
Andres Freund	982ad66753	Introduce placement IDs. So far placements were assigned an Oid, but that was just used to track insertion order. It also did so incompletely, as it was not preserved across changes of the shard state. The behaviour around oid wraparound was also not entirely as intended. The newly introduced, explicitly assigned, IDs are preserved across shard-state changes. The prime goal of this change is not to improve ordering of task assignment policies, but to make it easier to reference shards. The newly introduced UpdateShardPlacementState() makes use of that, and so will the in-progress connection and transaction management changes.	2016-10-07 11:59:20 -07:00
Metin Döşlü	7e8efbe540	Merge pull request #841 from citusdata/reduce_min_task_tracker_delay Reduce minimum value of task_tracker_delay to 1ms	2016-10-07 10:10:59 +03:00
Metin Doslu	d94a65e0e9	Reduce minimum value of task_tracker_delay to 1ms	2016-10-07 09:55:56 +03:00
Marco Slot	770d09b48e	Merge pull request #854 from citusdata/marcocitus-patch-1 Update docs links to v5.2 docs	2016-10-05 16:40:15 -07:00
Marco Slot	2843fdf43d	Update docs links to v5.2 docs	2016-10-06 01:27:30 +02:00
Eren Başak	efde4d67f3	Merge pull request #798 from citusdata/786-add_pg_dist_node Replace pg_worker_list.conf with a pg_dist_node table	2016-10-05 13:12:01 +03:00
Brian Cloutier	9d6699b07c	Switch from pg_worker_list.conf file to pg_dist_node metadata table. Related to #786 This change adds the `pg_dist_node` table that contains the information about the workers in the cluster, replacing the previously used `pg_worker_list.conf` file (or the one specified with `citus.worker_list_file`). Upon update, `pg_worker_list.conf` file is read and `pg_dist_node` table is populated with the file's content. After that, `pg_worker_list.conf` file is renamed to `pg_worker_list.conf.obsolete` For adding and removing nodes, the change also includes two new UDFs: `master_add_node` and `master_remove_node`, which require superuser permissions. 'citus.worker_list_file' guc is kept for update purposes but not used after the update is finished.	2016-10-05 13:01:35 +03:00
Marco Slot	4fae2133f1	Merge pull request #816 from citusdata/mx/add_partition_column Add replication model column to pg_dist_partition	2016-10-05 02:38:16 -07:00
Marco Slot	32b2bd4ed8	Add replication model column to pg_dist_partition	2016-10-05 01:14:28 +02:00
Önder Kalacı	40d99d9845	Merge pull request #838 from citusdata/update_function_name Update ColocatedShardPlacementList() function name to	2016-10-04 11:31:47 +03:00
Onder Kalaci	0993f2fb2c	Update ColocatedShardPlacementList() function name to ColocatedShardIntervalList(), which was intended.	2016-10-04 09:51:42 +03:00
Marco Slot	09e3d5fd47	Merge pull request #837 from citusdata/bugfix/pnstrdup Avoid use of pnstrdup	2016-10-04 07:09:15 +02:00
Marco Slot	fe3ffdb013	Avoid use of pnstrdup	2016-10-04 00:31:53 +02:00
Marco Slot	6c0fc0c970	Merge pull request #783 from robin900/new-extend-names Provides safe, backwards-compatible shard-extended names to any object name	2016-10-03 23:17:44 +02:00
Robin Thomas	f677fadbe6	Provides safe, idempotent shard-extended names to any object name related to a table that might be distributed, allowing any name that is within regular PostgreSQL length limits to be extended with a shard ID for use in shards on workers. Handles multi-byte character boundaries in identifiers when making prefixes for shard-extended names. Includes tests. Uses hash_any from PostgreSQL's access/hashfunc.c. Removes AppendShardIdToStringInfo() as it's used only once and arguably is best replaced there with a call to AppendShardIdToName(). Adds UDF shard_name(object_name, shard_id) to expose the shard-extended name logic to other PL/PGSQL, UDFs and scripts. Bumps version to 6.0-2 to allow for UDF to be created in migration script. Fixes citusdata/citus#781 and citusdata/citus#179.	2016-10-03 17:02:34 -04:00
Andres Freund	7e18ec59b9	Merge pull request #834 from citusdata/valgrind-clean Fix issues making valgrind fail	2016-10-03 14:01:27 -07:00
Andres Freund	de32b7bbad	Don't create hash-table of zero size in TaskHashCreate(). hash_create(), called by TaskHashCreate(), doesn't work correctly for a zero sized hash table. This triggers valgrind errors, and could potentially cause crashes even without valgrind. This currently happens for Jobs with 0 tasks. These probably should be optimized away before reaching TaskHashCreate(), but that's a bigger change.	2016-10-03 13:07:43 -07:00
Andres Freund	6d050bc9f8	Initialize count_agg_clauses argument to 0. count_agg_clause adds the cost of the aggregates to the state variable, it doesn't reinitialize it. That is intentional, as it is used to incrementally add costs in some places.	2016-10-03 13:07:43 -07:00
Andres Freund	a6150c2916	Lower "waiting for activity on tasks took longer than" log level. It's perfectly normal to wait longer in several circumstances, and the output can lead to spurious regression output changes.	2016-10-03 13:07:43 -07:00
Marco Slot	e6ecbc2063	Merge pull request #831 from citusdata/ultimate_citus_improvement Change logicalrelid type to regclass	2016-10-03 20:34:57 +02:00
Marco Slot	a4efb60b54	Change logicalrelid type in pg_dist_partition and pg_dist_shard to regclass	2016-10-03 20:27:16 +02:00
Marco Slot	fa2f5087ad	Merge pull request #832 from citusdata/bugfix/remove_eventinvoke_trigger Remove EventInvokeTrigger from regression test output	2016-10-03 20:26:19 +02:00
Marco Slot	fc93974238	Remove EventInvokeTrigger from regression test output	2016-10-03 20:21:15 +02:00
Marco Slot	3d1e2c1d3a	Merge pull request #819 from robin900/handle-repartitions-by-typname During repartitions use partitionColumnType as ::regtype so that UDTs work	2016-10-03 19:50:45 +02:00
Robin Thomas	c507a0df1c	During repartitions, the partitionColumnType argument sent to workers is now a `::regtype` using the qualified name of the column type, not the column type OID which may differ between master/worker nodes. Test coverage of a hash repartition using a UDT as the join column. Note that the UDFs `worker_hash_partition_table` and `worker_range_partition_table` are unchanged, and rightly expect an OID for the column type; but the planner code building the commands now allows for `::regtype` casting to do its magic. Fixes citusdata/citus#111.	2016-10-03 13:41:20 -04:00
Marco Slot	4d60aa2d53	Merge pull request #808 from robin900/partial-index-tests Added test coverage for partial unique indexes, exclusion constraints	2016-10-03 17:32:16 +02:00
Robin Thomas	b1493e299e	Added test coverage for partial unique indexes and exclude constraints.	2016-10-03 10:47:30 -04:00
Eren Başak	83ef3d0820	Merge pull request #825 from citusdata/fix_command_counter_increment_wrong_place Fix command counter increment bug	2016-10-03 17:16:58 +03:00
Eren Basak	ac3a4eee21	Fix command counter increment bug Fixes citusdata/citus#714 On `InsertShardRow`, we previously called `CommandCounterIncrement()` before `CitusInvalidateRelcacheByRelid(relationId);`. This could cause invalidation of the distributed table to be skipped on the next access within the same session.	2016-10-03 17:00:27 +03:00
Eren Başak	7e7b0f3491	Merge pull request #742 from citusdata/feature/task_tracker_folders Differentiate worker and master job temporary folders - MX Backport	2016-10-03 14:29:54 +03:00
Onder Kalaci	a533b8e7c1	Differentiate worker and master job temporary folders This commit makes it possible to create separate worker and master temporary folders. This change is important for citus-mx on task-tracker execution. In simple terms, on citus-mx, the worker could actually be responsible for the master tasks as well. Prior to this change, both master and worker logic in the task-tracker executor accessed and used the same files for different purposes, which was dangerous in certain cases (e.g., when task_tracker_delay is low).	2016-10-03 14:24:08 +03:00
Jason Petersen	a8701841b5	Merge pull request #824 from citusdata/use_lock_tranches Move task tracker lwlocks into their own tranche cr: @anarazel	2016-09-30 16:11:11 -06:00
Andres Freund	77efe7fcd4	Move task tracker lwlocks into their own tranche. RequestAddinLWLocks()/LWLockAssign() are gone in 9.6. Luckily all citus supported postgres versions support tranches, so use those.	2016-09-30 16:06:49 -06:00
Jason Petersen	d52ebffce5	Merge pull request #823 from citusdata/update_postgresql_files Update PostgreSQL-sourced files with latest changes cr: @anarazel	2016-09-30 16:06:28 -06:00
Jason Petersen	f59cf2b818	Remove references to 9.4 Some still lingered.	2016-09-29 17:35:19 -06:00
Jason Petersen	37631cd132	Remove alternate multi_hash test file This was made irrelevant by Citus v5.1.0.	2016-09-29 16:43:19 -06:00
Jason Petersen	3046c2b62c	Remove references to PostgreSQL 9.4 support files No longer extant.	2016-09-29 15:54:38 -06:00
Jason Petersen	5634b027b5	Remove gitattributes for csql files This was missed before.	2016-09-29 15:54:38 -06:00
Jason Petersen	6671cf5171	Remove unused dumputils.h header Believe this was used by csql, which is now gone.	2016-09-29 15:54:38 -06:00
Jason Petersen	1c560dfa9c	Update ruleutils_95 with latest PostgreSQL changes Hand-applied changes from a diff I generated between 9.5.0 and 9.5.4.	2016-09-29 15:54:38 -06:00
Marco Slot	9a5b844a81	Merge pull request #815 from citusdata/bugfix/count_null Make count return 0 if all shards are pruned away	2016-09-29 20:32:46 +02:00
Marco Slot	c4bc0742a7	Make count return 0 if all shards are pruned away Before this change, count on a distributed table returned NULL if all shards were pruned away, because on the master we replace the count(..) call with a sum(..) call to sum the counts from the shards. However, sum returns NULL when there are no rows, whereas count is expected to return 0.	2016-09-29 20:27:26 +02:00
Jason Petersen	9c19ad8b78	Merge pull request #818 from citusdata/fix_xact_callbacks Directly register transaction callbacks in PG_init cr: @anarazel	2016-09-29 11:52:03 -06:00
Jason Petersen	5b80d4e8dd	Directly register multi-shard callbacks in PG_init I had changed these callbacks to use the same method I chose for the router executor (for consistency), but as that method is flawed, we now want to ensure we directly register them from PG_init as well.	2016-09-29 11:43:19 -06:00
Jason Petersen	5f6264105d	Directly register router xact callbacks in PG_init Not entirely sure why we went with the shared memory hook approach, but it causes problems (multiple registration) during crashes. Changing to a simple direct registration call from PG_init.	2016-09-29 11:43:18 -06:00
Burak Yücesoy	e2f720dbe7	Merge pull request #785 from citusdata/colocation_features Internal co-location API	2016-09-29 12:43:02 +03:00
Burak Yucesoy	1ee39eb098	Internal co-location API With this commit we introduce internal API for co-location related operations.	2016-09-29 11:56:53 +03:00
Jason Petersen	127c0e513f	Merge pull request #814 from citusdata/copy_to_distributed_table_final_goodbye Remove copy_to_distributed_table cr: @jasonmp85	2016-09-28 11:32:08 -06:00
Marco Slot	5cdbe2b86c	Remove copy_to_distributed_table	2016-09-28 11:27:54 -06:00
Murat Tuncer	854ed613fb	Merge pull request #806 from citusdata/fix_805_where_false Make join queries with where false clauses router plannable	2016-09-28 18:54:19 +03:00
Murat Tuncer	5b42318ac4	Make where false queries router plannable	2016-09-28 18:49:26 +03:00
Murat Tuncer	116fcdbcc8	Merge pull request #784 from citusdata/fix_594_shard_refresh Add UDF master_expire_table_cache	2016-09-28 12:13:01 +03:00
Murat Tuncer	c16dec88c3	Add UDF master_expire_table_cache	2016-09-28 12:08:37 +03:00
Jason Petersen	e8a942485c	Merge pull request #812 from citusdata/fix_uniq_constraint_segfault Fix unique-violation-in-xact segfault cr: @anarazel	2016-09-27 16:51:48 -06:00
Jason Petersen	0caf0d95f1	Fix unique-violation-in-xact segfault An interaction between ReraiseRemoteError and DML transaction support causes segfaults: * ReraiseRemoteError calls PurgeConnection, freeing a connection... * That connection is still in the xactParticipantHash At transaction end, the memory in the freed connection might happen to pass the "is this connection OK?" check, causing us to try to send an ABORT over that connection. By removing it from the transaction hash before calling ReraiseRemoteError, we avoid this possibility.	2016-09-27 16:44:03 -06:00
Metin Döşlü	1a04cd123d	Merge pull request #770 from citusdata/fix/insert_query_inside_plpgsql Pass text oid instead of invalid oid for null values	2016-09-27 08:46:04 +03:00
Metin Doslu	c9dcad9b05	Pass text oid instead of invalid oid for null values Passing invalid oids even for null values in PQsendQueryParams() causes worker nodes to fail. Therefore, we pass text oid for null values.	2016-09-27 08:15:46 +03:00
Jason Petersen	dbc58c60ca	Merge pull request #776 from citusdata/feature/no-movement Support NoMovement direction in router executor cr: @jasonmp85	2016-09-26 18:32:33 -06:00
Andres Freund	776b3868b9	Support NoMovement direction in router executor This is mainly interesting because it allows to use RETURN QUERY/RETURN QUERY EXECUTE and FOR ... IN .. LOOPs in plpgsql.	2016-09-26 18:28:36 -06:00
Jason Petersen	33552c2f13	Merge pull request #721 from citusdata/feature_truncate Add truncate support for distributed tables cr: @jasonmp85	2016-09-26 18:27:52 -06:00
Murat Tuncer	32003c4aa1	Add tests with spaces in table names	2016-09-26 18:23:43 -06:00
Murat Tuncer	2f78fb8f1b	Remove extra space	2016-09-26 18:23:43 -06:00
Murat Tuncer	902e68c9ef	Refactor SendQueryToPlacements api	2016-09-26 18:23:43 -06:00
Murat Tuncer	6317bbe9a8	Address feedback	2016-09-26 18:23:42 -06:00
Murat Tuncer	877694296f	Fix regression test failures after rebase	2016-09-26 18:23:42 -06:00
Murat Tuncer	2eec0167be	Add support for truncate statement	2016-09-26 18:23:42 -06:00
Marco Slot	ad9d0bb69c	Merge pull request #804 from citusdata/bugfix/join_filter_crash Fix segmentation fault in case of joins with WHERE false	2016-09-26 15:23:57 +02:00
Marco Slot	3318288d75	Fix segmentation fault in case of joins with WHERE 1=0	2016-09-26 15:12:29 +02:00
Eren Başak	70fd42c41c	Merge pull request #749 from robin900/forbid-exclusion-constraints Handle EXCLUDE constraints properly on distributed tables	2016-09-22 11:37:25 +03:00
Robin Thomas	614c858375	Forbid EXCLUDE constraints on distributed tables just as we forbid UNIQUE or PRIMARY KEY constraints. Also, properly propagate valid EXCLUDE constraints to worker shard tables. If an EXCLUDE constraint includes the distribution column, the operator must be an equality operator. Tests in regression suite for exclusion constraints that include the partition column, omit it, and include it but with non-equality operator. Regression tests also verify that valid exclusion constraints are propagated to the shard tables. And the tests work in different timezones now. Fixes citusdata/citus#748 and citusdata/citus#778.	2016-09-21 14:02:42 -04:00
Metin Döşlü	a0fa7bf130	Merge pull request #777 from citusdata/remove_pg_toast_from_regression_tests Remove pg_toast_* references from regression tests	2016-09-10 10:32:02 +03:00
Metin Doslu	35eceb6cca	Remove pg_toast_* references from regression tests pg_toast_* oids are constantly changing, and this causes regression tests to fail from time to time. With this commit, we remove all of the pg_toast_* references from regression test outputs.	2016-09-09 11:31:51 +03:00
Jason Petersen	9fd6dafe33	Merge pull request #764 from citusdata/feature/allow_multi_ddl_xact_block Permit multiple DDL commands in a transaction cr: @marcocitus	2016-09-08 22:50:07 -05:00
Jason Petersen	74f4e0003b	Permit multiple DDL commands in a transaction Three changes here to get to true multi-statement, multi-relation DDL transactions (same functionality pre-5.2, with benefits of atomicity): 1. Changed the multi-shard utility hook to always run (consistency with router executor hook, removes ad-hoc "installed" boolean) 2. Change the global connection list in multi_shard_transaction to instead be a hash; update related functions to operate on global hash instead of local hash/global list 3. Remove check within DDL code to prevent subsequent DDL commands; place unset/reset guard around call to ConnectToNode to permit connecting to additional nodes after DDL transaction has begun In addition, code has been added to raise an error if a ROLLBACK TO SAVEPOINT is attempted (similar to router executor), and comprehensive tests execute all multi-DDL scenarios (full success, user ROLLBACK, any actual errors (say, duplicate index), partial failure (duplicate index on one node but not others), partial COMMIT (one node fails), and 2PC partial PREPARE (one node fails)). Interleavings with other commands (DML, \copy) are similarly all covered.	2016-09-08 22:35:55 -05:00
Jason Petersen	8b3286b1f5	Merge pull request #773 from citusdata/zombo_final Add syscols in queries; extend relnames in indexes cr: @jasonmp85	2016-09-07 11:59:38 -05:00
Eric B. Ridge	e80f1612a6	Add syscols in queries; extend relnames in indexes To permit use with ZomboDB (https://github.com/zombodb/zombodb), two changes were necessary: 1. Permit use of `tableoid` system column in queries 2. Extend relation names appearing in index expressions The first is accomplished by simply changing the deparse logic to allow system columns in queries destined for distributed tables. The latter was slightly more complex, given that DDL extension currently occurs on workers. But since indexes cannot reference tables other than the one being indexed, it is safe to look for any relation reference ending in a '*' character and extend their penultimate segments with a shard id. This change also adds an error to prevent users from distributing any relations using the WITH (OIDS) feature, which is unsupported.	2016-09-07 11:54:55 -05:00
Marco Slot	1f15d6b162	Merge pull request #769 from citusdata/feature/partcol Allow noop updates of the partition column	2016-09-07 17:15:32 +02:00
Marco Slot	6f6cb1a0d6	Allow noop updates of the partition column	2016-09-07 14:22:41 +02:00
Jason Petersen	be113e99cc	Add 5.2.1 CHANGELOG entry Outer join and memory leak fixes	2016-09-06 11:45:37 -05:00
Jason Petersen	ed027f060e	Add sort call to shard placement test The comparator is kind of broken, but I think this is better than the current state of random failures.	2016-09-06 11:07:27 -05:00
Jason Petersen	0bc3638855	Merge pull request #768 from citusdata/fix/xact_hash_mem_leak Fix CreateShardConnectionHash memory leak cr: @jasonmp85	2016-09-06 11:54:54 -04:00
Jason Petersen	b3684074f3	Fix CreateShardConnectionHash memory leak The call to hash_create specified HASH_CONTEXT without actually setting one using the provided HASHCTL. The hashes returned by this function are used locally, so simply using CurrentMemoryContext is sufficient.	2016-09-06 10:17:18 -05:00
Metin Döşlü	6333f9ba6f	Merge pull request #755 from citusdata/fix_754_add_outer_join_clause_list_check Add outer join clause list extraction for subquery pushdown logic	2016-09-02 15:01:49 +03:00
Metin Doslu	5b50f2c333	Add complex subquery pushdown regression tests	2016-09-02 14:21:51 +03:00
Metin Doslu	7d212b847f	Add outer join clause list extraction for subquery pushdown logic In subquery pushdown, we allow outer joins if the join condition is on the partition columns. WhereClauseList() used to return all join conditions including outer joins. However, this has been changed with a commit related to outer join support on regular queries. With this commit, we refactored ExtractFromExpressionWalker() to return two lists of qualifiers. The first list is for inner join and filter clauses and the second list is for outer join clauses. Therefore, we can also use outer join clauses to check subquery pushdown prerequisites.	2016-09-02 11:54:44 +03:00
Burak Yücesoy	6d2567f1a2	Merge pull request #762 from citusdata/fix/fix_510_error_out_if_table_to_distribute_has_data Error out at master_create_distributed_table if the table has any rows	2016-09-01 17:47:29 +03:00
Burak Yucesoy	12d1aba1fc	Error out at master_create_distributed_table if the table has any rows Before this change, master_create_distributed_table did not check whether the given table already contained data. If the table contains data, distributing it hides that data from the user. With this change, we now raise an error if the table contains data.	2016-09-01 17:42:47 +03:00
Jason Petersen	7168fdf62e	Merge pull request #707 from citusdata/feature/allow_single_ddl_xact_block Permit single DDL commands in transaction blocks cr: @marcocitus	2016-08-31 10:44:45 -06:00
Jason Petersen	850c51947a	Re-permit DDL in transactions, selectively Recent changes to DDL and transaction logic resulted in a "regression" from the viewpoint of users. Previously, DDL commands were allowed in multi-command transaction blocks, though they were not processed in any actual transactional manner. We improved the atomicity of our DDL code, but added a restriction that DDL commands themselves must not occur in any BEGIN/END transaction block. To give users back the original functionality (and improved atomicity) we now keep track of whether a multi-command transaction has modified data (DML) or schema (DDL). Interleaving the two modification types in a single transaction is disallowed. This first step simply permits a single DDL command in such a block, admittedly an incomplete solution, but one which will permit us to add full multi-DDL command support in a subsequent commit.	2016-08-30 20:37:19 -06:00
Metin Döşlü	adf992324a	Merge pull request #757 from citusdata/fix_MultiClientQueryResult Return false in MultiClientQueryResult() on failing query	2016-08-29 17:44:58 +03:00
Metin Doslu	75618fc3fb	Return false in MultiClientQueryResult() on failing query	2016-08-29 17:05:35 +03:00
Brian Cloutier	33973d9f20	Merge pull request #737 from citusdata/622-remove-csql Remove csql, remove multi-check-fdw tests, and bump citusdata/tools version that travis uses to v0.4.1	2016-08-26 11:34:19 +03:00
Brian Cloutier	4ecd6b58fb	Remove csql, \stage is no longer needed	2016-08-26 10:41:59 +03:00
Brian Cloutier	640bb8863b	Remove check-multi-fdw tests, nobody uses Citus with fdws	2016-08-26 10:41:33 +03:00
Brian Cloutier	2758af8f83	Bump tools version, to make list of tests travis runs explicit & configurable	2016-08-26 10:38:12 +03:00
Jason Petersen	2c87244ed4	Merge pull request #630 from citusdata/replace_stage_with_copy_in_tests Replace \stage With \copy in Regression Tests cr: @jasonmp85	2016-08-22 13:41:10 -06:00
Jason Petersen	e54d3f6d32	Rename test files with 'stage' in name Ignored FDW files as those tests are being removed entirely, I believe.	2016-08-22 13:32:53 -06:00
Jason Petersen	b391abda3d	Replace verb 'stage' with 'load' in test comments "Staging table" will be the only valid use of 'stage' from now on, we will now say "load" when talking about data ingestion. If creation of shards is its own step, we'll just say "shard creation".	2016-08-22 13:24:18 -06:00
Jason Petersen	35e9f51348	Replace verb 'stage' with 'load' in schedules "Staging table" will be the only valid use of 'stage' from now on.	2016-08-22 11:48:41 -06:00
Eren Başak	0322916700	Lowercase \copy to match PostgreSQL's style for local/psql-level functions	2016-08-22 11:31:26 -06:00
Eren Basak	b513f1c911	Replace \stage With \copy on Regression Tests Fixes #547 This change removes all references to \stage in the regression tests and puts \COPY instead. Doing so changed shard counts, min/max values on some test tables (lineitem, orders, etc.).	2016-08-22 11:31:26 -06:00
Robin Thomas	010cbf16fc	Remove all usage of pg_dist_shard.shardalias in extension code. (#739) Remove regression test of non-null shardalias.	2016-08-19 17:06:22 +03:00
Jason Petersen	b4e6dc16d3	Merge pull request #701 from citusdata/fix_inttypes_warnings Remove HAVE_INTTYPES_H ifdefs cr: @anarazel	2016-08-18 15:35:11 -06:00
Jason Petersen	91578ff149	Remove HAVE_INTTYPES_H ifdefs I've been seeing warnings on OS X/clang for a while about these lines and finally got tired of it. The main problem is that PRIu64 expects a uint64_t but we were passing a uint64 (a PostgreSQL-defined type). In PostgreSQL 9.5, we now have INT64_MODIFIER, so can build our own zero-padded unsigned 64-bit int format modifier that expects a PostgreSQL-provided uint64 type. This simplifies the code slightly (no more ifdefs) and gets rid of the warning that's been annoying me since April (my TODO creation time).	2016-08-18 15:19:53 -06:00
Jason Petersen	b59ab75e2b	Add 5.2.0 CHANGELOG entry Our longest yet!	2016-08-15 12:55:12 -06:00
Jason Petersen	900f7590ab	Fix Travis local_first_candidate_nodes failures A recent change to the image used in Travis causes some problems for the code we use here to ensure the local replica is first. Since this code is essentially dead in a post-stage world anyhow, we're OK with ripping out the tests to placate Travis.	2016-08-14 23:12:10 -06:00
Murat Tuncer	3a49cf830e	Remove a router planner test for materialized view PostgreSQL 9.5.4 stopped calling planner for materialized view create command when NO DATA option is provided. This causes our test to behave differently between pre-9.5.4 and 9.5.4.	2016-08-14 22:57:09 -06:00
Andres Freund	3ea352e5f9	Merge pull request #717 from citusdata/fix-700 Skip over unreferenced parameters when router executing prepared statement.	2016-08-05 14:22:24 -07:00
Andres Freund	7fdb5fbe29	Skip over unreferenced parameters when router executing prepared statement. When an unreferenced prepared statement parameter does not explicitly have a type assigned, we cannot deserialize it, to send to the remote side. That commonly happens inside plpgsql functions, where local variables are passed in as unused prepared statement parameters.	2016-08-05 14:12:06 -07:00
Jason Petersen	eba8396501	Avoid attempting to lock invalid shard identifier A recent change generates a "dummy" shard placement with its identifier set to INVALID_SHARD_ID for SELECT queries against distributed tables with no shards. Normally, no lock is acquired for SELECT statements, but if all_modifications_commutative is set to true, we will acquire a shared lock, triggering an assertion failure within LockShardResource in the above case. The "dummy" shard placement is actually necessary to ensure such empty queries have somewhere to execute, and INVALID_SHARD_ID seems the most appropriate value for the dummy's shard identifier field, so the most straightforward fix is to just avoid locking invalid shard identifiers.	2016-08-04 13:49:51 -07:00
Jason Petersen	ccc32f9da8	Pass -Werror during configure/compile/test step This will fail any Travis builds that introduce warnings.	2016-08-03 14:59:14 -07:00
Jason Petersen	3a8534eb21	Check style during Travis CI builds This bumps the Citus tools to 0.4.0, which include support for adding a recent (0.6.3) uncrustify to Travis in addition to support for fully installing Travis scripts to system locations. For brevity, suffixes have been removed from Travis shell scripts. The main additional logic here is just ensuring Travis CI gets a newer uncrustify install and that `citus_indent` is called with the `--check` flag, which exits with a nonzero status if any files don't comply.	2016-08-02 23:16:30 -07:00
Jason Petersen	6be5217872	Lock tools version	2016-08-02 21:23:02 -07:00

3155 changed files with 894831 additions and 89052 deletions

									
7

.codeclimate.yml Normal file

 @ -0,0 +1,7 @@
 exclude_patterns:
   - "src/backend/distributed/utils/citus_outfuncs.c"
   - "src/backend/distributed/deparser/ruleutils_*.c"
   - "src/include/distributed/citus_nodes.h"
   - "src/backend/distributed/safeclib"
   - "src/backend/columnar/safeclib"
   - "**/vendor/"

									
40

.codecov.yml Normal file

 @ -0,0 +1,40 @@
 codecov:
   notify:
     require_ci_to_pass: yes
 coverage:
   precision: 2
   round: down
   range: "70...100"
   ignore:
     - "src/backend/distributed/utils/citus_outfuncs.c"
     - "src/backend/distributed/deparser/ruleutils_*.c"
     - "src/include/distributed/citus_nodes.h"
     - "src/backend/distributed/safeclib"
     - "vendor"
   status:
     project:
       default:
         target: 87.5
         threshold: 0.5
     patch:
       default:
         target: 75
     changes: no
 parsers:
   gcov:
     branch_detection:
       conditional: yes
       loop: yes
       method: no
       macro: no
 comment:
   layout: "header, diff"
   behavior: default
   require_changes: no

33

.devcontainer/.gdbinit Normal file

 @ -0,0 +1,33 @@
 # gdbpg.py contains scripts to nicely print the postgres datastructures
 # while in a gdb session. Since the vscode debugger is based on gdb this
 # actually also works when debugging with vscode. Providing nice tools
 # to understand the internal datastructures we are working with.
 source /root/gdbpg.py
 # when debugging postgres it is convenient to _always_ have a breakpoint
 # trigger when an error is logged. Because .gdbinit is sourced before gdb
 # is fully attached and has the sources loaded, we temporarily set breakpoint
 # pending to on to make sure the breakpoint is added when the library is
 # loaded. After we have added our breakpoint we revert back to the default
 # configuration for breakpoint pending.
 # The breakpoint is hard to read, but at entry of the function we don't have
 # the level loaded in elevel. Instead we hardcode the location where the
 # level of the current error is stored. Also gdb doesn't understand the
 # ERROR symbol so we hardcode this to the value of ERROR. It is very unlikely
 # this value will ever change in postgres, but if it does we might need to
 # find a way to conditionally load the correct breakpoint.
 set breakpoint pending on
 break elog.c:errfinish if errordata[errordata_stack_depth].elevel == 21
 set breakpoint pending auto
 echo \n
 echo ----------------------------------------------------------------------------------\n
 echo when attaching to a postgres backend a breakpoint will be set on elog.c:errfinish \n
 echo it will only break on errors being raised in postgres \n
 echo \n
 echo to disable this breakpoint from vscode run `-exec disable 1` in the debug console \n
 echo this assumes it's the first breakpoint loaded as it is loaded from .gdbinit \n
 echo this can be verified with `-exec info break`, enabling can be done with \n
 echo `-exec enable 1` \n
 echo ----------------------------------------------------------------------------------\n
 echo \n

1

.devcontainer/.gitignore vendored Normal file

 @ -0,0 +1 @@
 postgresql-*.tar.bz2

7

.devcontainer/.psqlrc Normal file

 @ -0,0 +1,7 @@
 \timing on
 \pset linestyle unicode
 \pset border 2
 \setenv PAGER 'pspg --no-mouse -bX --no-commandbar --no-topbar'
 \set HISTSIZE 100000
 \set PROMPT1 '\n%[%033[1m%]%M %n@%/:%> (PID: %p)%R%[%033[0m%]%# '
 \set PROMPT2 '  '

12

.devcontainer/.vscode/Pipfile vendored Normal file

 @ -0,0 +1,12 @@
 [[source]]
 url = "https://pypi.org/simple"
 verify_ssl = true
 name = "pypi"
 [packages]
 docopt = "*"
 [dev-packages]
 [requires]
 python_version = "3.9"

28

.devcontainer/.vscode/Pipfile.lock generated vendored Normal file

 @ -0,0 +1,28 @@
 {
     "_meta": {
         "hash": {
             "sha256": "6956a6700ead5804aa56bd597c93bb4a13f208d2d49d3b5399365fd240ca0797"
         },
         "pipfile-spec": 6,
         "requires": {
             "python_version": "3.9"
         },
         "sources": [
             {
                 "name": "pypi",
                 "url": "https://pypi.org/simple",
                 "verify_ssl": true
             }
         ]
     },
     "default": {
         "docopt": {
             "hashes": [
                 "sha256:49b3a825280bd66b3aa83585ef59c4a8c82f2c8a522dbe754a8bc8d08c85c491"
             ],
             "index": "pypi",
             "version": "==0.6.2"
         }
     },
     "develop": {}
 }

									
										84

.devcontainer/.vscode/generate_c_cpp_properties-json.pyvendoredExecutable file

										View File
									
				@ -0,0 +1,84 @@

				#! /usr/bin/env pipenv-shebang

				"""Generate C/C++ properties file for VSCode.

				Uses pgenv to iterate postgres versions and generate

				a C/C++ properties file for VSCode containing the

				include paths for the postgres headers.

				Usage:

				  generate_c_cpp_properties-json.py <target_path>

				  generate_c_cpp_properties-json.py (-h | --help)

				  generate_c_cpp_properties-json.py --version

				Options:

				  -h --help     Show this screen.

				  --version     Show version.

				"""

				import json

				import subprocess

				from docopt import docopt

				def main(args):

				    target_path = args['<target_path>']

				    output = subprocess.check_output(['pgenv', 'versions'])

				    # typical output is:

				    #      14.8      pgsql-14.8

				    #  *   15.3      pgsql-15.3

				    #      16beta2    pgsql-16beta2

				    # where the line marked with a * is the currently active version

				    #

				    # we are only interested in the first word of each line, which is the version number

				    # thus we strip the whitespace and the * from the line and split it into words

				    # and take the first word

				    versions = [line.strip('* ').split()[0] for line in output.decode('utf-8').splitlines()]

				    # create the list of configurations per version

				    configurations = []

				    for version in versions:

				        configurations.append(generate_configuration(version))

				    # create the json file

				    c_cpp_properties = {

				        "configurations": configurations,

				        "version": 4

				    }

				    # write the c_cpp_properties.json file

				    with open(target_path, 'w') as f:

				        json.dump(c_cpp_properties, f, indent=4)

				def generate_configuration(version):

				    """Returns a configuration for the given postgres version.

				    >>> generate_configuration('14.8')

				    {

				        "name": "Citus Development Configuration - Postgres 14.8",

				        "includePath": [

				            "/usr/local/include",

				            "/home/citus/.pgenv/src/postgresql-14.8/src/**",

				            "${workspaceFolder}/**",

				            "${workspaceFolder}/src/include/",

				        ],

				        "configurationProvider": "ms-vscode.makefile-tools"

				    }

				    """

				    return {

				        "name": f"Citus Development Configuration - Postgres {version}",

				        "includePath": [

				            "/usr/local/include",

				            f"/home/citus/.pgenv/src/postgresql-{version}/src/**",

				            "${workspaceFolder}/**",

				            "${workspaceFolder}/src/include/",

				        ],

				        "configurationProvider": "ms-vscode.makefile-tools"

				    }

				if __name__ == '__main__':

				    arguments = docopt(__doc__, version='0.1.0')

				    main(arguments)
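
The version parsing in `main` above can be tried in isolation. The following is a minimal sketch, not part of the commit: `parse_pgenv_versions` is a hypothetical helper name, and skipping blank lines is an added assumption (the original list comprehension would raise `IndexError` on an empty line).

```python
def parse_pgenv_versions(output: str) -> list[str]:
    """Extract version numbers from `pgenv versions`-style output."""
    versions = []
    for line in output.splitlines():
        # Drop surrounding whitespace and the '*' marking the active version.
        words = line.strip("* ").split()
        if words:  # assumption: silently skip blank lines
            versions.append(words[0])
    return versions


sample = (
    "     14.8      pgsql-14.8\n"
    " *   15.3      pgsql-15.3\n"
    "     16beta2    pgsql-16beta2\n"
)
print(parse_pgenv_versions(sample))
```

Each returned string is what `generate_configuration` would receive as its `version` argument.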

									
										40

.devcontainer/.vscode/launch.json vendored Normal file
									
				@ -0,0 +1,40 @@

				{

				    "version": "0.2.0",

				    "configurations": [

				        {

				            "name": "Attach Citus (devcontainer)",

				            "type": "cppdbg",

				            "request": "attach",

				            "processId": "${command:pickProcess}",

				            "program": "/home/citus/.pgenv/pgsql/bin/postgres",

				            "additionalSOLibSearchPath": "/home/citus/.pgenv/pgsql/lib",

				            "setupCommands": [

				                {

				                    "text": "handle SIGUSR1 noprint nostop pass",

				                    "description": "let gdb not stop when SIGUSR1 is sent to process",

				                    "ignoreFailures": true

				                }

				            ],

				        },

				        {

				            "name": "Open core file",

				            "type": "cppdbg",

				            "request": "launch",

				            "program": "/home/citus/.pgenv/pgsql/bin/postgres",

				            "coreDumpPath": "${input:corefile}",

				            "cwd": "${workspaceFolder}",

				            "MIMode": "gdb",

				        }

				    ],

				    "inputs": [

				        {

				            "id": "corefile",

				            "type": "command",

				            "command": "extension.commandvariable.file.pickFile",

				            "args": {

				                "dialogTitle": "Select core file",

				                "include": "**/core*",

				            },

				        },

				    ],

				}

									
										222

.devcontainer/Dockerfile Normal file
									
				@ -0,0 +1,222 @@

				FROM ubuntu:22.04 AS base

				# setting the timezone lets the python install get past an interactive prompt; UTC is probably not the best timezone given a wide variety of colleagues

				ENV TZ=UTC

				RUN ln -snf /usr/share/zoneinfo/$TZ /etc/localtime && echo $TZ > /etc/timezone

				# install build tools

				RUN apt update && apt install -y \

				    bison \

				    bzip2 \

				    cpanminus \

				    curl \

				    docbook-xml \

				    docbook-xsl \

				    flex \

				    gcc \

				    git \

				    libcurl4-gnutls-dev \

				    libicu-dev \

				    libkrb5-dev \

				    liblz4-dev \

				    libpam0g-dev \

				    libreadline-dev \

				    libselinux1-dev \

				    libssl-dev \

				    libxml2-utils \

				    libxslt-dev \

				    libzstd-dev \

				    locales \

				    make \

				    perl \

				    pkg-config \

				    python3 \

				    python3-pip \

				    software-properties-common \

				    sudo \

				    uuid-dev \

				    valgrind \

				    xsltproc \

				    zlib1g-dev \

				 && add-apt-repository ppa:deadsnakes/ppa -y \

				 && apt install -y \

				    python3.9-full \

				 # software properties pulls in pkexec, which makes the debugger unusable in vscode

				 && apt purge -y \

				    software-properties-common \

				 && apt autoremove -y \

				 && apt clean

				RUN sudo pip3 install pipenv pipenv-shebang

				RUN cpanm install IPC::Run

				RUN locale-gen en_US.UTF-8

				# add the citus user to sudoers and allow all sudoers to login without a password prompt

				RUN useradd -ms /bin/bash citus \

				 && usermod -aG sudo citus \

				 && echo '%sudo ALL=(ALL) NOPASSWD:ALL' >> /etc/sudoers

				WORKDIR /home/citus

				USER citus

				# run all make commands with the number of cores available

				RUN echo "export MAKEFLAGS=\"-j \$(nproc)\"" >> "/home/citus/.bashrc"

				RUN git clone --branch v1.3.2 --depth 1 https://github.com/theory/pgenv.git .pgenv

				COPY --chown=citus:citus pgenv/config/ .pgenv/config/

				ENV PATH="/home/citus/.pgenv/bin:${PATH}"

				ENV PATH="/home/citus/.pgenv/pgsql/bin:${PATH}"

				USER citus

				# build postgres versions separately for effective parallelism and caching of already built versions when changing only certain versions

				FROM base AS pg15

				RUN MAKEFLAGS="-j $(nproc)" pgenv build 15.13

				RUN rm .pgenv/src/*.tar*

				RUN make -C .pgenv/src/postgresql-*/ clean

				RUN make -C .pgenv/src/postgresql-*/src/include install

				# create a staging directory with all files we want to copy from our pgenv build

				# we will copy the contents of the staged folder into the final image at once

				RUN mkdir .pgenv-staging/

				RUN cp -r .pgenv/src .pgenv/pgsql-* .pgenv/config .pgenv-staging/

				RUN rm .pgenv-staging/config/default.conf

				FROM base AS pg16

				RUN MAKEFLAGS="-j $(nproc)" pgenv build 16.9

				RUN rm .pgenv/src/*.tar*

				RUN make -C .pgenv/src/postgresql-*/ clean

				RUN make -C .pgenv/src/postgresql-*/src/include install

				# create a staging directory with all files we want to copy from our pgenv build

				# we will copy the contents of the staged folder into the final image at once

				RUN mkdir .pgenv-staging/

				RUN cp -r .pgenv/src .pgenv/pgsql-* .pgenv/config .pgenv-staging/

				RUN rm .pgenv-staging/config/default.conf

				FROM base AS pg17

				RUN MAKEFLAGS="-j $(nproc)" pgenv build 17.5

				RUN rm .pgenv/src/*.tar*

				RUN make -C .pgenv/src/postgresql-*/ clean

				RUN make -C .pgenv/src/postgresql-*/src/include install

				# create a staging directory with all files we want to copy from our pgenv build

				# we will copy the contents of the staged folder into the final image at once

				RUN mkdir .pgenv-staging/

				RUN cp -r .pgenv/src .pgenv/pgsql-* .pgenv/config .pgenv-staging/

				RUN rm .pgenv-staging/config/default.conf

				FROM base AS uncrustify-builder

				RUN sudo apt update && sudo apt install -y cmake tree

				WORKDIR /uncrustify

				RUN curl -L https://github.com/uncrustify/uncrustify/archive/uncrustify-0.68.1.tar.gz | tar xz

				WORKDIR /uncrustify/uncrustify-uncrustify-0.68.1/

				RUN mkdir build

				WORKDIR /uncrustify/uncrustify-uncrustify-0.68.1/build/

				RUN cmake ..

				RUN MAKEFLAGS="-j $(nproc)" make -s

				RUN make install DESTDIR=/uncrustify

				# builder for all pipenv's to get them contained in a single layer

				FROM base AS pipenv

				WORKDIR /workspaces/citus/

				# tools to sync pgenv with vscode

				COPY --chown=citus:citus .vscode/Pipfile .vscode/Pipfile.lock .devcontainer/.vscode/

				RUN ( cd .devcontainer/.vscode && pipenv install )

				# environment to run our failure tests

				COPY --chown=citus:citus src/ src/

				RUN ( cd src/test/regress && pipenv install )

				# assemble the final container by copying over the artifacts from separately built containers

				FROM base AS devcontainer

				LABEL org.opencontainers.image.source=https://github.com/citusdata/citus

				LABEL org.opencontainers.image.description="Development container for the Citus project"

				LABEL org.opencontainers.image.licenses=AGPL-3.0-only

				RUN yes | sudo unminimize

				# install developer productivity tools

				RUN sudo apt update \

				 && sudo apt install -y \

				    autoconf2.69 \

				    bash-completion \

				    fswatch \

				    gdb \

				    htop \

				    libdbd-pg-perl \

				    libdbi-perl \

				    lsof \

				    man \

				    net-tools \

				    psmisc \

				    pspg \

				    tree \

				    vim \

				 && sudo apt clean

				# Since gdb will run in the context of the root user when debugging citus we will need to both

				# download the gdbpg.py script as the root user, into their home directory, as well as add .gdbinit

				# as a file owned by root

				# This ensures that as soon as the debugger attaches to a postgres backend (or frankly any other process)

				# the gdbpg.py script is sourced and the developer can use it directly.

				RUN sudo curl -o /root/gdbpg.py https://raw.githubusercontent.com/tvesely/gdbpg/6065eee7872457785f830925eac665aa535caf62/gdbpg.py

				COPY --chown=root:root .gdbinit /root/

				# install developer dependencies in the global environment

				RUN --mount=type=bind,source=requirements.txt,target=requirements.txt pip install -r requirements.txt

				# for persistent bash history across devcontainers we need to have

				# a) a directory to store the history in

				# b) a prompt command to append the history to the file

				# c) specify the history file to store the history in

				# b and c are done in the .bashrc to make it persistent across shells only

				RUN sudo install -d -o citus -g citus /commandhistory \

				 && echo "export PROMPT_COMMAND='history -a' && export HISTFILE=/commandhistory/.bash_history" >> "/home/citus/.bashrc"

				# install citus-dev

				RUN git clone --branch develop https://github.com/citusdata/tools.git citus-tools \

				 && ( cd citus-tools/citus_dev && pipenv install ) \

				 && mkdir -p ~/.local/bin \

				 && ln -s /home/citus/citus-tools/citus_dev/citus_dev-pipenv .local/bin/citus_dev \

				 && sudo make -C citus-tools/uncrustify install bindir=/usr/local/bin pkgsysconfdir=/usr/local/etc/ \

				 && mkdir -p ~/.local/share/bash-completion/completions/ \

				 && ln -s ~/citus-tools/citus_dev/bash_completion ~/.local/share/bash-completion/completions/citus_dev

				# TODO some LC_ALL errors, possibly solved by locale-gen

				RUN git clone https://github.com/so-fancy/diff-so-fancy.git \

				 && mkdir -p ~/.local/bin \

				 && ln -s /home/citus/diff-so-fancy/diff-so-fancy .local/bin/

				COPY --link --from=uncrustify-builder /uncrustify/usr/ /usr/

				COPY --link --from=pg15 /home/citus/.pgenv-staging/ /home/citus/.pgenv/

				COPY --link --from=pg16 /home/citus/.pgenv-staging/ /home/citus/.pgenv/

				COPY --link --from=pg17 /home/citus/.pgenv-staging/ /home/citus/.pgenv/

				COPY --link --from=pipenv /home/citus/.local/share/virtualenvs/ /home/citus/.local/share/virtualenvs/

				# place to run your cluster with citus_dev

				VOLUME /data

				RUN sudo mkdir /data \

				 && sudo chown citus:citus /data

				COPY --chown=citus:citus .psqlrc .

				# with the copy linking of layers github actions seem to misbehave with the ownership of the

				# directories leading up to the link, hence a small patch layer to have the right ownership set

				RUN sudo chown --from=root:root citus:citus -R ~

				# sets default pg version

				RUN pgenv switch 17.5

				# make connecting to the coordinator easy

				ENV PGPORT=9700

									
										11

.devcontainer/Makefile Normal file
									
				@ -0,0 +1,11 @@

				init: ../.vscode/c_cpp_properties.json ../.vscode/launch.json

				../.vscode:

					mkdir -p ../.vscode

				../.vscode/launch.json: ../.vscode .vscode/launch.json

					cp .vscode/launch.json ../.vscode/launch.json

				../.vscode/c_cpp_properties.json: ../.vscode

					./.vscode/generate_c_cpp_properties-json.py ../.vscode/c_cpp_properties.json

									
										37

.devcontainer/devcontainer.json Normal file
									
				@ -0,0 +1,37 @@

				{

				    "image": "ghcr.io/citusdata/citus-devcontainer:main",

				    "runArgs": [

				        "--cap-add=SYS_PTRACE",

				        "--ulimit=core=-1",

				    ],

				    "forwardPorts": [

				        9700

				    ],

				    "customizations": {

				        "vscode": {

				            "extensions": [

				                "eamodio.gitlens",

				                "GitHub.copilot-chat",

				                "GitHub.copilot",

				                "github.vscode-github-actions",

				                "github.vscode-pull-request-github",

				                "ms-vscode.cpptools-extension-pack",

				                "ms-vsliveshare.vsliveshare",

				                "rioj7.command-variable",

				            ],

				            "settings": {

				                "files.exclude": {

				                    "**/*.o": true,

				                    "**/.deps/": true,

				                }

				            },

				        }

				    },

				    "mounts": [

				        "type=volume,target=/data",

				        "source=citus-bashhistory,target=/commandhistory,type=volume",

				    ],

				    "updateContentCommand": "./configure",

				    "postCreateCommand": "make -C .devcontainer/",

				}

15

.devcontainer/pgenv/config/default.conf Normal file


 @ -0,0 +1,15 @@
 PGENV_MAKE_OPTIONS=(-s)
 PGENV_CONFIGURE_OPTIONS=(
     --enable-debug
     --enable-depend
     --enable-cassert
     --enable-tap-tests
     'CFLAGS=-ggdb -Og -g3 -fno-omit-frame-pointer -DUSE_VALGRIND'
     --with-openssl
     --with-libxml
     --with-libxslt
     --with-uuid=e2fs
     --with-icu
     --with-lz4
 )

9

.devcontainer/requirements.txt Normal file


 @ -0,0 +1,9 @@
 black==23.11.0
 click==8.1.7
 isort==5.12.0
 mypy-extensions==1.0.0
 packaging==23.2
 pathspec==0.11.2
 platformdirs==4.0.0
 tomli==2.0.1
 typing_extensions==4.8.0

28

.devcontainer/src/test/regress/Pipfile Normal file


 @ -0,0 +1,28 @@
 [[source]]
 name = "pypi"
 url = "https://pypi.python.org/simple"
 verify_ssl = true
 [packages]
 mitmproxy = {editable = true, ref = "main", git = "https://github.com/citusdata/mitmproxy.git"}
 construct = "*"
 docopt = "==0.6.2"
 cryptography = ">=41.0.4"
 pytest = "*"
 psycopg = "*"
 filelock = "*"
 pytest-asyncio = "*"
 pytest-timeout = "*"
 pytest-xdist = "*"
 pytest-repeat = "*"
 pyyaml = "*"
 werkzeug = "==2.3.7"
 [dev-packages]
 black = "*"
 isort = "*"
 flake8 = "*"
 flake8-bugbear = "*"
 [requires]
 python_version = "3.9"

1041

.devcontainer/src/test/regress/Pipfile.lock generated Normal file


File diff suppressed because it is too large

									
										28

.editorconfig Normal file
									
				@ -0,0 +1,28 @@

				# top-most EditorConfig file

				root = true

				# rules for all files

				# we use tabs with indent size 4

				[*]

				indent_style = tab

				indent_size = 4

				tab_width = 4

				end_of_line = lf

				insert_final_newline = true

				charset = utf-8

				trim_trailing_whitespace = true

				# Don't change test output files, pngs or test data files

				[*.{out,png,data}]

				insert_final_newline = unset

				trim_trailing_whitespace = unset

				[*.{sql,sh,py,toml}]

				indent_style = space

				indent_size = 4

				tab_width = 4

				[*.yml]

				indent_style = space

				indent_size = 2

				tab_width = 2

7

.flake8 Normal file


 @ -0,0 +1,7 @@
 [flake8]
 # E203 is ignored for black
 extend-ignore = E203
 # black will truncate to 88 characters usually, but long string literals it
 # might keep. That's fine in most cases unless it gets really excessive.
 max-line-length = 150
 exclude = .git,__pycache__,vendor,tmp_*

21

.gitattributes vendored


 @ -16,7 +16,6 @@ README.*	conflict-marker-size=32
 # Test output files that contain extra whitespace
 *.out					-whitespace
 src/test/regress/output/*.source	-whitespace
 # These files are maintained or generated elsewhere.  We take them as is.
 configure				-whitespace
 @ -26,17 +25,13 @@ configure				-whitespace
 # except these exceptions...
 src/backend/distributed/utils/citus_outfuncs.c -citus-style
 src/backend/distributed/utils/citus_read.c -citus-style
 src/backend/distributed/utils/citus_readfuncs_94.c -citus-style
 src/backend/distributed/utils/citus_readfuncs_95.c -citus-style
 src/backend/distributed/utils/ruleutils_94.c -citus-style
 src/backend/distributed/utils/ruleutils_95.c -citus-style
 src/backend/distributed/deparser/ruleutils_15.c -citus-style
 src/backend/distributed/deparser/ruleutils_16.c -citus-style
 src/backend/distributed/deparser/ruleutils_17.c -citus-style
 src/backend/distributed/commands/index_pg_source.c -citus-style
 src/include/distributed/citus_nodes.h -citus-style
 src/include/dumputils.h -citus-style
 /vendor/** -citus-style
 # all csql files use PostgreSQL style...
 src/bin/csql/*.[ch] -citus-style
 # except these exceptions
 src/bin/csql/copy_options.c citus-style
 src/bin/csql/stage.[ch] citus-style
 # Hide diff on github by default for copied udfs
 src/backend/distributed/sql/udfs/*/[123456789]*.sql linguist-generated=true

									
										23

.github/actions/parallelization/action.yml vendored Normal file
									
				@ -0,0 +1,23 @@

				name: 'Parallelization matrix'

				inputs:

				  count:

				    required: false

				    default: 32

				outputs:

				  json:

				    value: ${{ steps.generate_matrix.outputs.json }}

				runs:

				  using: "composite"

				  steps:

				    - name: Generate parallelization matrix

				      id: generate_matrix

				      shell: bash

				      run: |-

				        json_array="{\"include\": ["

				        for ((i = 1; i <= ${{ inputs.count }}; i++)); do

				            json_array+="{\"id\":\"$i\"},"

				        done

				        json_array=${json_array%,}

				        json_array+=" ]}"

				        echo "json=$json_array" >> "$GITHUB_OUTPUT"

				        echo "json=$json_array"
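
To see what the shell loop above emits, here is a rough Python equivalent, offered as a sketch rather than as part of the action. `build_matrix_json` is a hypothetical name; it mirrors the string concatenation step by step, and the result is then parsed to confirm it is valid JSON.

```python
import json


def build_matrix_json(count: int) -> str:
    """Mirror the bash loop: one {"id": "<i>"} entry per slot, ids starting at 1."""
    json_array = '{"include": ['
    for i in range(1, count + 1):
        json_array += '{"id":"%d"},' % i
    # Same effect as ${json_array%,}: drop one trailing comma if present.
    if json_array.endswith(","):
        json_array = json_array[:-1]
    json_array += " ]}"
    return json_array


matrix = json.loads(build_matrix_json(3))
print(matrix)
```

The ids come out as JSON strings rather than numbers, since the loop wraps `$i` in escaped quotes.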

									
										38

.github/actions/save_logs_and_results/action.yml vendored Normal file
									
				@ -0,0 +1,38 @@

				name: save_logs_and_results

				inputs:

				  folder:

				    required: false

				    default: "log"

				runs:

				  using: composite

				  steps:

				  - uses: actions/upload-artifact@v4.6.0

				    name: Upload logs

				    with:

				      name: ${{ inputs.folder }}

				      if-no-files-found: ignore

				      path: |

				        src/test/**/proxy.output

				        src/test/**/results/

				        src/test/**/tmp_check/master/log

				        src/test/**/tmp_check/worker.57638/log

				        src/test/**/tmp_check/worker.57637/log

				        src/test/**/*.diffs

				        src/test/**/out/ddls.sql

				        src/test/**/out/queries.sql

				        src/test/**/logfile_*

				        /tmp/pg_upgrade_newData_logs

				  - name: Publish regression.diffs

				    run: |-

				      diffs="$(find src/test/regress -name "*.diffs" -exec cat {} \;)"

				      if ! [ -z "$diffs" ]; then

				        echo '```diff' >> $GITHUB_STEP_SUMMARY

				        echo -E "$diffs" >> $GITHUB_STEP_SUMMARY

				        echo '```' >> $GITHUB_STEP_SUMMARY

				        echo -E $diffs

				      fi

				    shell: bash

				  - name: Print stack traces

				    run: "./ci/print_stack_trace.sh"

				    if: failure()

				    shell: bash

									
										35

.github/actions/setup_extension/action.yml vendored Normal file
									
				@ -0,0 +1,35 @@

				name: setup_extension

				inputs:

				  pg_major:

				    required: false

				  skip_installation:

				    required: false

				    default: false

				    type: boolean

				runs:

				  using: composite

				  steps:

				  - name: Expose $PG_MAJOR to Github Env

				    run: |-

				        if [ -z "${{ inputs.pg_major }}" ]; then

				          echo "PG_MAJOR=${PG_MAJOR}" >> $GITHUB_ENV

				        else

				          echo "PG_MAJOR=${{ inputs.pg_major }}" >> $GITHUB_ENV

				        fi

				    shell: bash

				  - uses: actions/download-artifact@v4.1.8

				    with:

				      name: build-${{ env.PG_MAJOR }}

				  - name: Install Extension

				    if: ${{ inputs.skip_installation == 'false' }}

				    run: tar xfv "install-$PG_MAJOR.tar" --directory /

				    shell: bash

				  - name: Configure

				    run: |-

				      chown -R circleci .

				      git config --global --add safe.directory ${GITHUB_WORKSPACE}

				      gosu circleci ./configure --without-pg-version-check

				    shell: bash

				  - name: Enable core dumps

				    run: ulimit -c unlimited

				    shell: bash

									
										27

.github/actions/upload_coverage/action.yml vendored Normal file
									
				@ -0,0 +1,27 @@

				name: coverage

				inputs:

				  flags:

				    required: false

				  codecov_token:

				    required: true

				runs:

				  using: composite

				  steps:

				  - uses: codecov/codecov-action@v3

				    with:

				      flags: ${{ inputs.flags }}

				      token: ${{ inputs.codecov_token }}

				      verbose: true

				      gcov: true

				  - name: Create codeclimate coverage

				    run: |-

				      lcov --directory . --capture --output-file lcov.info

				      lcov --remove lcov.info -o lcov.info '/usr/*'

				      sed "s=^SF:$PWD/=SF:=g" -i lcov.info # relative paths are required by codeclimate

				      mkdir -p /tmp/codeclimate

				      cc-test-reporter format-coverage -t lcov -o /tmp/codeclimate/${{ inputs.flags }}.json lcov.info

				    shell: bash

				  - uses: actions/upload-artifact@v4.6.0

				    with:

				      path: "/tmp/codeclimate/*.json"

				      name: codeclimate-${{ inputs.flags }}
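
The `sed` line above rewrites absolute `SF:` (source file) records in `lcov.info` to paths relative to the working directory. A small Python sketch of the same substitution follows; `relativize_sf_records` and the sample paths are made up for illustration.

```python
import re


def relativize_sf_records(lcov_text: str, cwd: str) -> str:
    """Strip the `cwd/` prefix from every line starting with `SF:`."""
    # re.escape treats the directory literally; sed would interpret regex
    # metacharacters in $PWD, which this sketch deliberately avoids.
    pattern = re.compile(r"^SF:" + re.escape(cwd) + "/", flags=re.MULTILINE)
    return pattern.sub("SF:", lcov_text)


sample = (
    "TN:\n"
    "SF:/work/citus/src/backend/foo.c\n"
    "DA:1,1\n"
    "SF:/usr/include/stdio.h\n"
)
print(relativize_sf_records(sample, "/work/citus"))
```

Records outside the working directory are left untouched by the substitution; in the action itself the preceding `lcov --remove` call targets the `/usr/*` entries.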

									
										3

.github/packaging/packaging_ignore.yml vendored Normal file
									
				@ -0,0 +1,3 @@

				base:

				  - ".* warning: ignoring old recipe for target [`']check'"

				  - ".* warning: overriding recipe for target [`']check'"

									
										51

.github/packaging/validate_build_output.sh vendored Executable file
									
				@ -0,0 +1,51 @@

				#!/bin/bash

				set -ex

				# Function to get the OS version

				get_rpm_os_version() {

				    if [[ -f /etc/centos-release ]]; then

				        cat /etc/centos-release | awk '{print $4}'

				    elif [[ -f /etc/oracle-release ]]; then

				        cat /etc/oracle-release | awk '{print $5}'

				    else

				        echo "Unknown"

				    fi

				}

				package_type=${1}

				# Since $HOME is set in GH_Actions as /github/home, pyenv fails to create virtualenvs.

				# For this script, we set $HOME to /root and then set it back to /github/home.

				GITHUB_HOME="${HOME}"

				export HOME="/root"

				eval "$(pyenv init -)"

				pyenv versions

				pyenv virtualenv ${PACKAGING_PYTHON_VERSION} packaging_env

				pyenv activate packaging_env

				git clone -b v0.8.27 --depth=1  https://github.com/citusdata/tools.git tools

				python3 -m pip install -r tools/packaging_automation/requirements.txt

				echo "Package type: ${package_type}"

				echo "OS version: $(get_rpm_os_version)"

				 # For RHEL 7, we need to install urllib3<2 due to below execution error

				 # ImportError: urllib3 v2.0 only supports OpenSSL 1.1.1+, currently the 'ssl'

				 # module is compiled with 'OpenSSL 1.0.2k-fips  26 Jan 2017'.

				 # See: https://github.com/urllib3/urllib3/issues/2168

				if [[ ${package_type} == "rpm" && $(get_rpm_os_version) == 7* ]]; then

				    python3 -m pip uninstall -y urllib3

				    python3 -m pip install 'urllib3<2'

				fi

				python3 -m tools.packaging_automation.validate_build_output --output_file output.log \

				                                                            --ignore_file .github/packaging/packaging_ignore.yml \

				                                                            --package_type ${package_type}

				pyenv deactivate

				# Set $HOME back to /github/home

				export HOME=${GITHUB_HOME}

				# Print the output to the console
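
`get_rpm_os_version` above picks a whitespace-separated field out of the release file: the fourth for CentOS, the fifth for Oracle Linux. Below is a Python sketch of that field selection, assuming release strings of the shape shown in the sample. The strings and the name `rpm_os_version` are illustrative and not taken from the commit.

```python
def rpm_os_version(release_line: str, field: int) -> str:
    """Return the 1-based whitespace-separated field, like awk '{print $N}'."""
    words = release_line.split()
    # awk prints an empty string when the field does not exist.
    return words[field - 1] if len(words) >= field else ""


centos = "CentOS Linux release 7.9.2009 (Core)"
oracle = "Oracle Linux Server release 8.6"
print(rpm_os_version(centos, 4))  # field used for /etc/centos-release
print(rpm_os_version(oracle, 5))  # field used for /etc/oracle-release
```

A value beginning with `7` is what the `== 7*` glob in the script matches before it pins `urllib3<2`.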

1

.github/pull_request_template.md vendored Normal file


				@ -0,0 +1 @@
				DESCRIPTION: PR description that will go into the change log, up to 78 characters

									
										546

.github/workflows/build_and_test.yml vendored Normal file
									
				@ -0,0 +1,546 @@

				name: Build & Test

				run-name: Build & Test - ${{ github.event.pull_request.title || github.ref_name }}

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				on:

				  workflow_dispatch:

				    inputs:

				      skip_test_flakyness:

				        required: false

				        default: false

				        type: boolean

				  push:

				    branches:

				      - "main"

				      - "release-*"

				  pull_request:

				    types: [opened, reopened, synchronize]

				  merge_group:

				jobs:

				  # Since GHA does not interpolate env variables in matrix context, we need to

				  # define them in a separate job and use them in other jobs.

				  params:

				    runs-on: ubuntu-latest

				    name: Initialize parameters

				    outputs:

				      build_image_name: "ghcr.io/citusdata/extbuilder"

				      test_image_name: "ghcr.io/citusdata/exttester"

				      citusupgrade_image_name: "ghcr.io/citusdata/citusupgradetester"

				      fail_test_image_name: "ghcr.io/citusdata/failtester"

				      pgupgrade_image_name: "ghcr.io/citusdata/pgupgradetester"

				      style_checker_image_name: "ghcr.io/citusdata/stylechecker"

				      style_checker_tools_version: "0.8.18"

				      sql_snapshot_pg_version: "17.5"

				      image_suffix: "-dev-d28f316"

				      pg15_version: '{ "major": "15", "full": "15.13" }'

				      pg16_version: '{ "major": "16", "full": "16.9" }'

				      pg17_version: '{ "major": "17", "full": "17.5" }'

				      upgrade_pg_versions: "15.13-16.9-17.5"

				    steps:

				      # Since GHA jobs need at least one step we use a noop step here.

				      - name: Set up parameters

				        run: echo 'noop'

				  check-sql-snapshots:

				    needs: params

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ needs.params.outputs.build_image_name }}:${{ needs.params.outputs.sql_snapshot_pg_version }}${{ needs.params.outputs.image_suffix }}

				      options: --user root

				    steps:

				    - uses: actions/checkout@v4

				    - name: Check Snapshots

				      run: |

				        git config --global --add safe.directory ${GITHUB_WORKSPACE}

				        ci/check_sql_snapshots.sh

				  check-style:

				    needs: params

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ needs.params.outputs.style_checker_image_name }}:${{ needs.params.outputs.style_checker_tools_version }}${{ needs.params.outputs.image_suffix }}

				    steps:

				    - name: Configure git safe directory

				      run: |

				        git config --global --add safe.directory ${GITHUB_WORKSPACE}

				    - uses: actions/checkout@v4

				      with:

				        fetch-depth: 0

				    - name: Check C Style

				      run: citus_indent --check

				    - name: Check Python style

				      run: black --check .

				    - name: Check Python import order

				      run: isort --check .

				    - name: Check Python lints

				      run: flake8 .

				    - name: Fix whitespace

				      run: ci/editorconfig.sh && git diff --exit-code

				    - name: Remove useless declarations

				      run: ci/remove_useless_declarations.sh && git diff --cached --exit-code

				    - name: Sort and group includes

				      run: ci/sort_and_group_includes.sh && git diff --exit-code

				    - name: Normalize test output

				      run: ci/normalize_expected.sh && git diff --exit-code

				    - name: Check for C-style comments in migration files

				      run: ci/disallow_c_comments_in_migrations.sh && git diff --exit-code

				    - name: 'Check for comments that start with # character in spec files'

				      run: ci/disallow_hash_comments_in_spec_files.sh && git diff --exit-code

				    - name: Check for gitignore entries for source files

				      run: ci/fix_gitignore.sh && git diff --exit-code

				    - name: Check for lengths of changelog entries

				      run: ci/disallow_long_changelog_entries.sh

				    - name: Check for banned C API usage

				      run: ci/banned.h.sh

				    - name: Check for tests missing in schedules

				      run: ci/check_all_tests_are_run.sh

				    - name: Check if all CI scripts are actually run

				      run: ci/check_all_ci_scripts_are_run.sh

				    - name: Check if all GUCs are sorted alphabetically

				      run: ci/check_gucs_are_alphabetically_sorted.sh

				    - name: Check for missing downgrade scripts

				      run: ci/check_migration_files.sh

				  build:

				    needs: params

				    name: Build for PG${{ fromJson(matrix.pg_version).major }}

				    strategy:

				      fail-fast: false

				      matrix:

				        image_name:

				          - ${{ needs.params.outputs.build_image_name }}

				        image_suffix:

				          - ${{ needs.params.outputs.image_suffix}}

				        pg_version:

				          - ${{ needs.params.outputs.pg15_version }}

				          - ${{ needs.params.outputs.pg16_version }}

				          - ${{ needs.params.outputs.pg17_version }}

				    runs-on: ubuntu-latest

				    container:

				      image: "${{ matrix.image_name }}:${{ fromJson(matrix.pg_version).full }}${{ matrix.image_suffix }}"

				      options: --user root

				    steps:

				    - uses: actions/checkout@v4

				    - name: Expose $PG_MAJOR to Github Env

				      run: echo "PG_MAJOR=${PG_MAJOR}" >> $GITHUB_ENV

				      shell: bash

				    - name: Build

				      run: "./ci/build-citus.sh"

				      shell: bash

				    - uses: actions/upload-artifact@v4.6.0

				      with:

				        name: build-${{ env.PG_MAJOR }}

				        path: |-

				          ./build-${{ env.PG_MAJOR }}/*

				          ./install-${{ env.PG_MAJOR }}.tar

				  test-citus:

				    name: PG${{ fromJson(matrix.pg_version).major }} - ${{ matrix.make }}

				    strategy:

				      fail-fast: false

				      matrix:

				        suite:

				          - regress

				        image_name:

				          - ${{ needs.params.outputs.test_image_name }}

				        pg_version:

				          - ${{ needs.params.outputs.pg15_version }}

				          - ${{ needs.params.outputs.pg16_version }}

				          - ${{ needs.params.outputs.pg17_version }}

				        make:

				          - check-split

				          - check-multi

				          - check-multi-1

				          - check-multi-mx

				          - check-vanilla

				          - check-isolation

				          - check-operations

				          - check-follower-cluster

				          - check-columnar

				          - check-columnar-isolation

				          - check-enterprise

				          - check-enterprise-isolation

				          - check-enterprise-isolation-logicalrep-1

				          - check-enterprise-isolation-logicalrep-2

				          - check-enterprise-isolation-logicalrep-3

				        include:

				          - make: check-failure

				            pg_version: ${{ needs.params.outputs.pg15_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-failure

				            pg_version: ${{ needs.params.outputs.pg16_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-failure

				            pg_version: ${{ needs.params.outputs.pg17_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-enterprise-failure

				            pg_version: ${{ needs.params.outputs.pg15_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-enterprise-failure

				            pg_version: ${{ needs.params.outputs.pg16_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-enterprise-failure

				            pg_version: ${{ needs.params.outputs.pg17_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-pytest

				            pg_version: ${{ needs.params.outputs.pg15_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-pytest

				            pg_version: ${{ needs.params.outputs.pg16_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-pytest

				            pg_version: ${{ needs.params.outputs.pg17_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: installcheck

				            suite: cdc

				            image_name: ${{ needs.params.outputs.test_image_name }}

				            pg_version: ${{ needs.params.outputs.pg15_version }}

				          - make: installcheck

				            suite: cdc

				            image_name: ${{ needs.params.outputs.test_image_name }}

				            pg_version: ${{ needs.params.outputs.pg16_version }}

				          - make: installcheck

				            suite: cdc

				            image_name: ${{ needs.params.outputs.test_image_name }}

				            pg_version: ${{ needs.params.outputs.pg17_version }}

				          - make: check-query-generator

				            pg_version: ${{ needs.params.outputs.pg15_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-query-generator

				            pg_version: ${{ needs.params.outputs.pg16_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				          - make: check-query-generator

				            pg_version: ${{ needs.params.outputs.pg17_version }}

				            suite: regress

				            image_name: ${{ needs.params.outputs.fail_test_image_name }}

				    runs-on: ubuntu-latest

				    container:

				      image: "${{ matrix.image_name }}:${{ fromJson(matrix.pg_version).full }}${{ needs.params.outputs.image_suffix }}"

				      options: --user root --dns=8.8.8.8

				      # Because GitHub creates a default network for each job, we need to use

				      # --dns= to have similar DNS settings as our other CI systems or local

				      # machines. Otherwise, we may see different results.

				    needs:

				    - params

				    - build

				    steps:

				    - uses: actions/checkout@v4

				    - uses: "./.github/actions/setup_extension"

				    - name: Run Test

				      run: gosu circleci make -C src/test/${{ matrix.suite }} ${{ matrix.make }}

				      timeout-minutes: 20

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				        folder: ${{ fromJson(matrix.pg_version).major }}_${{ matrix.make }}

				    - uses: "./.github/actions/upload_coverage"

				      if: always()

				      with:

				        flags: ${{ env.PG_MAJOR }}_${{ matrix.suite }}_${{ matrix.make }}

				        codecov_token: ${{ secrets.CODECOV_TOKEN }}

				  test-arbitrary-configs:

				    name: PG${{ fromJson(matrix.pg_version).major }} - check-arbitrary-configs-${{ matrix.parallel }}

				    runs-on: ["self-hosted", "1ES.Pool=1es-gha-citusdata-pool"]

				    container:

				      image: "${{ matrix.image_name }}:${{ fromJson(matrix.pg_version).full }}${{ needs.params.outputs.image_suffix }}"

				      options: --user root

				    needs:

				      - params

				      - build

				    strategy:

				      fail-fast: false

				      matrix:

				        image_name:

				          - ${{ needs.params.outputs.fail_test_image_name }}

				        pg_version:

				          - ${{ needs.params.outputs.pg15_version }}

				          - ${{ needs.params.outputs.pg16_version }}

				          - ${{ needs.params.outputs.pg17_version }}

				        parallel: [0,1,2,3,4,5] # workaround for running 6 parallel jobs

				    steps:

				    - uses: actions/checkout@v4

				    - uses: "./.github/actions/setup_extension"

				    - name: Test arbitrary configs

				      run: |-

				        # we use parallel jobs to split the tests into 6 parts and run them in parallel

				        # the script below extracts the tests for the current job

				        N=6  # Total number of jobs (see matrix.parallel)

				        X=${{ matrix.parallel }}  # Current job number

				        TESTS=$(src/test/regress/citus_tests/print_test_names.py |

				          tr '\n' ',' | awk -v N="$N" -v X="$X" -F, '{

				            split("", parts)

				            for (i = 1; i <= NF; i++) {

				                parts[i % N] = parts[i % N] $i ","

				            }

				            print substr(parts[X], 1, length(parts[X])-1)

				        }')

				        echo $TESTS

				        gosu circleci \

				          make -C src/test/regress \

				            check-arbitrary-configs parallel=4 CONFIGS=$TESTS

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				        folder: ${{ env.PG_MAJOR }}_arbitrary_configs_${{ matrix.parallel }}

				    - uses: "./.github/actions/upload_coverage"

				      if: always()

				      with:

				        flags: ${{ env.PG_MAJOR }}_arbitrary_configs_${{ matrix.parallel }}

				        codecov_token: ${{ secrets.CODECOV_TOKEN }}

				  test-pg-upgrade:

				    name: PG${{ matrix.old_pg_major }}-PG${{ matrix.new_pg_major }} - check-pg-upgrade

				    runs-on: ubuntu-latest

				    container:

				      image: "${{ needs.params.outputs.pgupgrade_image_name }}:${{ needs.params.outputs.upgrade_pg_versions }}${{ needs.params.outputs.image_suffix }}"

				      options: --user root

				    needs:

				    - params

				    - build

				    strategy:

				      fail-fast: false

				      matrix:

				        include:

				          - old_pg_major: 15

				            new_pg_major: 16

				          - old_pg_major: 16

				            new_pg_major: 17

				          - old_pg_major: 15

				            new_pg_major: 17

				    env:

				      old_pg_major: ${{ matrix.old_pg_major }}

				      new_pg_major: ${{ matrix.new_pg_major }}

				    steps:

				    - uses: actions/checkout@v4

				    - uses: "./.github/actions/setup_extension"

				      with:

				        pg_major: "${{ env.old_pg_major }}"

				    - uses: "./.github/actions/setup_extension"

				      with:

				        pg_major: "${{ env.new_pg_major }}"

				    - name: Install and test postgres upgrade

				      run: |-

				        gosu circleci \

				          make -C src/test/regress \

				            check-pg-upgrade \

				            old-bindir=/usr/lib/postgresql/${{ env.old_pg_major }}/bin \

				            new-bindir=/usr/lib/postgresql/${{ env.new_pg_major }}/bin

				    - name: Copy pg_upgrade logs for newData dir

				      run: |-

				        mkdir -p /tmp/pg_upgrade_newData_logs

				        if ls src/test/regress/tmp_upgrade/newData/*.log 1> /dev/null 2>&1; then

				            cp src/test/regress/tmp_upgrade/newData/*.log /tmp/pg_upgrade_newData_logs

				        fi

				      if: failure()

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				        folder: ${{ env.old_pg_major }}_${{ env.new_pg_major }}_upgrade

				    - uses: "./.github/actions/upload_coverage"

				      if: always()

				      with:

				        flags: ${{ env.old_pg_major }}_${{ env.new_pg_major }}_upgrade

				        codecov_token: ${{ secrets.CODECOV_TOKEN }}

				  test-citus-upgrade:

				    name: PG${{ fromJson(needs.params.outputs.pg15_version).major }} - check-citus-upgrade

				    runs-on: ubuntu-latest

				    container:

				      image: "${{ needs.params.outputs.citusupgrade_image_name }}:${{ fromJson(needs.params.outputs.pg15_version).full }}${{ needs.params.outputs.image_suffix }}"

				      options: --user root

				    needs:

				    - params

				    - build

				    steps:

				    - uses: actions/checkout@v4

				    - uses: "./.github/actions/setup_extension"

				      with:

				        skip_installation: true

				    - name: Install and test citus upgrade

				      run: |-

				        # run make check-citus-upgrade for all citus versions

				        # the image sets ${CITUS_VERSIONS} to all versions whose binaries it contains

				        for citus_version in ${CITUS_VERSIONS}; do \

				          gosu circleci \

				            make -C src/test/regress \

				              check-citus-upgrade \

				              bindir=/usr/lib/postgresql/${PG_MAJOR}/bin \

				              citus-old-version=${citus_version} \

				              citus-pre-tar=/install-pg${PG_MAJOR}-citus${citus_version}.tar \

				              citus-post-tar=${GITHUB_WORKSPACE}/install-$PG_MAJOR.tar; \

				        done;

				        # run make check-citus-upgrade-mixed for all citus versions

				        # the image sets ${CITUS_VERSIONS} to all versions whose binaries it contains

				        for citus_version in ${CITUS_VERSIONS}; do \

				          gosu circleci \

				            make -C src/test/regress \

				              check-citus-upgrade-mixed \

				              citus-old-version=${citus_version} \

				              bindir=/usr/lib/postgresql/${PG_MAJOR}/bin \

				              citus-pre-tar=/install-pg${PG_MAJOR}-citus${citus_version}.tar \

				              citus-post-tar=${GITHUB_WORKSPACE}/install-$PG_MAJOR.tar; \

				        done;

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				        folder: ${{ env.PG_MAJOR }}_citus_upgrade

				    - uses: "./.github/actions/upload_coverage"

				      if: always()

				      with:

				        flags: ${{ env.PG_MAJOR }}_citus_upgrade

				        codecov_token: ${{ secrets.CODECOV_TOKEN }}

				  upload-coverage:

				    # secret below is not available for forks so disabling upload action for them

				    if: ${{ github.event.pull_request.head.repo.full_name == github.repository || github.event_name != 'pull_request' }}

				    env:

				      CC_TEST_REPORTER_ID: ${{ secrets.CC_TEST_REPORTER_ID }}

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ needs.params.outputs.test_image_name }}:${{ fromJson(needs.params.outputs.pg17_version).full }}${{ needs.params.outputs.image_suffix }}

				    needs:

				      - params

				      - test-citus

				      - test-arbitrary-configs

				      - test-citus-upgrade

				      - test-pg-upgrade

				    steps:

				      - uses: actions/download-artifact@v4.1.8

				        with:

				          pattern: codeclimate*

				          path: codeclimate

				          merge-multiple: true

				      - name: Upload coverage results to Code Climate

				        run: |-

				          cc-test-reporter sum-coverage codeclimate/*.json -o total.json

				          cc-test-reporter upload-coverage -i total.json

				  ch_benchmark:

				    name: CH Benchmark

				    if: startsWith(github.ref, 'refs/heads/ch_benchmark/')

				    runs-on: ubuntu-latest

				    needs:

				    - build

				    steps:

				    - uses: actions/checkout@v4

				    - uses: azure/login@v1

				      with:

				        creds: ${{ secrets.AZURE_CREDENTIALS }}

				    - name: install dependencies and run ch_benchmark tests

				      uses: azure/CLI@v1

				      with:

				        inlineScript: |

				          cd ./src/test/hammerdb

				          chmod +x run_hammerdb.sh

				          ./run_hammerdb.sh citusbot_ch_benchmark_rg

				  tpcc_benchmark:

				    name: TPCC Benchmark

				    if: startsWith(github.ref, 'refs/heads/tpcc_benchmark/')

				    runs-on: ubuntu-latest

				    needs:

				    - build

				    steps:

				    - uses: actions/checkout@v4

				    - uses: azure/login@v1

				      with:

				        creds: ${{ secrets.AZURE_CREDENTIALS }}

				    - name: install dependencies and run tpcc_benchmark tests

				      uses: azure/CLI@v1

				      with:

				        inlineScript: |

				          cd ./src/test/hammerdb

				          chmod +x run_hammerdb.sh

				          ./run_hammerdb.sh citusbot_tpcc_benchmark_rg

				  prepare_parallelization_matrix_32:

				    name: Prepare parallelization matrix

				    if: ${{ needs.test-flakyness-pre.outputs.tests != ''}}

				    needs: test-flakyness-pre

				    runs-on: ubuntu-latest

				    outputs:

				      json: ${{ steps.parallelization.outputs.json }}

				    steps:

				      - uses: actions/checkout@v4

				      - uses: "./.github/actions/parallelization"

				        id: parallelization

				        with:

				          count: 32

				  test-flakyness-pre:

				    name: Detect regression tests that need to be run

				    if: ${{ !inputs.skip_test_flakyness }}

				    runs-on: ubuntu-latest

				    needs: build

				    outputs:

				      tests: ${{ steps.detect-regression-tests.outputs.tests }}

				    steps:

				    - uses: actions/checkout@v4

				      with:

				        fetch-depth: 0

				    - name: Detect regression tests that need to be run

				      id: detect-regression-tests

				      run: |-

				        detected_changes=$(git diff origin/main... --name-only --diff-filter=AM | (grep 'src/test/regress/sql/.*\.sql\|src/test/regress/spec/.*\.spec\|src/test/regress/citus_tests/test/test_.*\.py' || true))

				        tests=${detected_changes}

				        # split the tests to be skipped --today we only skip upgrade tests

				        skipped_tests=""

				        not_skipped_tests=""

				        for test in $tests; do

				            if [[ $test =~ ^src/test/regress/sql/upgrade_ ]]; then

				                skipped_tests="$skipped_tests $test"

				            else

				                not_skipped_tests="$not_skipped_tests $test"

				            fi

				        done

				        if [ ! -z "$skipped_tests" ]; then

				            echo "Skipped tests " $skipped_tests

				        fi

				        if [ -z "$not_skipped_tests" ]; then

				            echo "No tests detected for flaky test detection to run"

				        else

				            echo "Detected tests " $not_skipped_tests

				        fi

				        echo 'tests<<EOF' >> $GITHUB_OUTPUT

				        echo "$not_skipped_tests" >> "$GITHUB_OUTPUT"

				        echo 'EOF' >> $GITHUB_OUTPUT

				  test-flakyness:

				    if: ${{ needs.test-flakyness-pre.outputs.tests != ''}}

				    name: Test flakyness

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ needs.params.outputs.fail_test_image_name }}:${{ fromJson(needs.params.outputs.pg17_version).full }}${{ needs.params.outputs.image_suffix }}

				      options: --user root

				    env:

				      runs: 8

				    needs:

				    - params

				    - build

				    - test-flakyness-pre

				    - prepare_parallelization_matrix_32

				    strategy:

				      fail-fast: false

				      matrix: ${{ fromJson(needs.prepare_parallelization_matrix_32.outputs.json) }}

				    steps:

				    - uses: actions/checkout@v4

				    - uses: actions/download-artifact@v4.1.8

				    - uses: "./.github/actions/setup_extension"

				    - name: Run minimal tests

				      run: |-

				        tests="${{ needs.test-flakyness-pre.outputs.tests }}"

				        tests_array=($tests)

				        for test in "${tests_array[@]}"

				        do

				            test_name=$(echo "$test" | sed -r "s/.+\/(.+)\..+/\1/")

				            gosu circleci src/test/regress/citus_tests/run_test.py $test_name --repeat ${{ env.runs }} --use-whole-schedule-line

				        done

				      shell: bash

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				        folder: test_flakyness_parallel_${{ matrix.id }}
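
Two shell snippets in the workflow above carry most of its logic: the awk round-robin split in `test-arbitrary-configs` and the changed-file filtering plus `sed` name extraction in `test-flakyness-pre` / `test-flakyness`. The following is a rough Python sketch of what those snippets appear to do, not code from the repository. The function names and the sample inputs are hypothetical and only serve as illustration.

```python
import re

# Same three alternatives as the grep pattern in test-flakyness-pre.
TEST_FILE = re.compile(
    r"src/test/regress/sql/.*\.sql"
    r"|src/test/regress/spec/.*\.spec"
    r"|src/test/regress/citus_tests/test/test_.*\.py"
)

# Prefix that the bash loop treats as "skipped" (upgrade tests).
UPGRADE_PREFIX = "src/test/regress/sql/upgrade_"


def split_round_robin(names, total_jobs, job):
    """Return the names assigned to `job`.

    awk numbers fields from 1 and stores field i in parts[i % N],
    so the 1-based index is kept here on purpose.
    """
    return [n for i, n in enumerate(names, start=1) if i % total_jobs == job]


def select_flaky_candidates(changed_files):
    """Keep changed regression test files, dropping upgrade tests."""
    matched = [f for f in changed_files if TEST_FILE.search(f)]
    return [f for f in matched if not f.startswith(UPGRADE_PREFIX)]


def extract_test_name(path):
    """Strip the directory and the extension, like the sed expression."""
    return re.sub(r".+/(.+)\..+", r"\1", path)


if __name__ == "__main__":
    # Hypothetical config names; the real ones come from print_test_names.py.
    configs = ["ConfigA", "ConfigB", "ConfigC", "ConfigD",
               "ConfigE", "ConfigF", "ConfigG"]
    for job in range(6):
        print(job, split_round_robin(configs, 6, job))

    # Hypothetical changed files, as `git diff --name-only` might list them.
    changed = [
        "src/test/regress/sql/multi_insert.sql",
        "src/test/regress/sql/upgrade_basic_after.sql",
        "src/backend/distributed/planner/example.c",
    ]
    for path in select_flaky_candidates(changed):
        print(extract_test_name(path))
```

Because the bucket index is `i % N` with a 1-based `i`, job 0 receives fields 6, 12, and so on, rather than the first field. Every field still lands in exactly one of the six jobs.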

									
.github/workflows/codeql.yml
									
				@ -0,0 +1,79 @@

				name: "CodeQL"

				on:

				  schedule:

				    - cron: '59 23 * * 6'

				  workflow_dispatch:

				jobs:

				  analyze:

				    name: Analyze

				    runs-on: ubuntu-22.04

				    permissions:

				      actions: read

				      contents: read

				      security-events: write

				    strategy:

				      fail-fast: false

				      matrix:

				        language: [ 'cpp', 'python']

				    steps:

				    - name: Checkout repository

				      uses: actions/checkout@v4

				    - name: Initialize CodeQL

				      uses: github/codeql-action/init@v3

				      with:

				        languages: ${{ matrix.language }}

				    - name: Install package dependencies

				      run: |

				        # Create the file repository configuration:

				        sudo sh -c 'echo "deb http://apt.postgresql.org/pub/repos/apt $(lsb_release -cs)-pgdg main 15" > /etc/apt/sources.list.d/pgdg.list'

				        # Import the repository signing key:

				        wget --quiet -O - https://www.postgresql.org/media/keys/ACCC4CF8.asc | sudo apt-key add -

				        sudo apt-get update

				        sudo apt-get install -y --no-install-recommends \

				          autotools-dev \

				          build-essential \

				          ca-certificates \

				          curl \

				          debhelper \

				          devscripts \

				          fakeroot \

				          flex \

				          libcurl4-openssl-dev \

				          libdistro-info-perl \

				          libedit-dev \

				          libfile-fcntllock-perl \

				          libicu-dev \

				          libkrb5-dev \

				          liblz4-1 \

				          liblz4-dev \

				          libpam0g-dev \

				          libreadline-dev \

				          libselinux1-dev \

				          libssl-dev \

				          libxslt-dev \

				          libzstd-dev \

				          libzstd1 \

				          lintian \

				          postgresql-server-dev-15 \

				          postgresql-server-dev-all \

				          python3-pip \

				          python3-setuptools \

				          wget \

				          zlib1g-dev

				    - name: Configure, Build and Install Citus

				      if: matrix.language == 'cpp'

				      run: |

				        ./configure

				        make -sj8

				        sudo make install-all

				    - name: Perform CodeQL Analysis

				      uses: github/codeql-action/analyze@v3

									
.github/workflows/devcontainer.yml
									
				@ -0,0 +1,54 @@

				name: "Build devcontainer"

				# Since building of containers can be quite time consuming, and take up some storage,

				# there is no need to finish a build for a tag if new changes are concurrently being made.

				# This cancels any previous builds for the same tag, and only the latest one will be kept.

				concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}

				  cancel-in-progress: true

				on:

				  push:

				    paths:

				      - ".devcontainer/**"

				  workflow_dispatch:

				jobs:

				  docker:

				    runs-on: ubuntu-latest

				    permissions:

				      contents: read

				      packages: write

				      attestations: write

				      id-token: write

				    steps:

				      -

				        name: Docker meta

				        id: meta

				        uses: docker/metadata-action@v5

				        with:

				          images: |

				            ghcr.io/citusdata/citus-devcontainer

				          tags: |

				            type=ref,event=branch

				            type=sha

				      -

				        name: Set up Docker Buildx

				        uses: docker/setup-buildx-action@v2

				      -

				        name: 'Login to GitHub Container Registry'

				        uses: docker/login-action@v3

				        with:

				          registry: ghcr.io

				          username: ${{github.actor}}

				          password: ${{secrets.GITHUB_TOKEN}}

				      -

				        name: Build and push

				        uses: docker/build-push-action@v5

				        with:

				          context: "{{defaultContext}}:.devcontainer"

				          push: true

				          tags: ${{ steps.meta.outputs.tags }}

				          labels: ${{ steps.meta.outputs.labels }}

				          cache-from: type=gha

				          cache-to: type=gha,mode=max

									
.github/workflows/flaky_test_debugging.yml
									
				@ -0,0 +1,79 @@

				name: Flaky test debugging

				run-name: Flaky test debugging - ${{ inputs.flaky_test }} (${{ inputs.flaky_test_runs_per_job }}x${{ inputs.flaky_test_parallel_jobs }})

				concurrency:

				  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}

				  cancel-in-progress: true

				on:

				  workflow_dispatch:

				    inputs:

				      flaky_test:

				        required: true

				        type: string

				        description: Test to run

				      flaky_test_runs_per_job:

				        required: false

				        default: 8

				        type: number

				        description: Number of times to run the test

				      flaky_test_parallel_jobs:

				        required: false

				        default: 32

				        type: number

				        description: Number of parallel jobs to run

				jobs:

				  build:

				    name: Build Citus

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ vars.build_image_name }}:${{ vars.pg15_version  }}${{ vars.image_suffix }}

				      options: --user root

				    steps:

				    - uses: actions/checkout@v4

				    - name: Configure, Build, and Install

				      run: |

				        echo "PG_MAJOR=${PG_MAJOR}" >> $GITHUB_ENV

				        ./ci/build-citus.sh

				      shell: bash

				    - uses: actions/upload-artifact@v4.6.0

				      with:

				        name: build-${{ env.PG_MAJOR }}

				        path: |-

				          ./build-${{ env.PG_MAJOR }}/*

				          ./install-${{ env.PG_MAJOR }}.tar

				  prepare_parallelization_matrix:

				    name: Prepare parallelization matrix

				    runs-on: ubuntu-latest

				    outputs:

				      json: ${{ steps.parallelization.outputs.json }}

				    steps:

				      - uses: actions/checkout@v4

				      - uses: "./.github/actions/parallelization"

				        id: parallelization

				        with:

				          count: ${{ inputs.flaky_test_parallel_jobs }}

				  test_flakyness:

				    name: Test flakyness

				    runs-on: ubuntu-latest

				    container:

				      image: ${{ vars.fail_test_image_name }}:${{ vars.pg15_version  }}${{ vars.image_suffix }}

				      options: --user root

				    needs:

				      [build, prepare_parallelization_matrix]

				    env:

				      test: "${{ inputs.flaky_test }}"

				      runs: "${{ inputs.flaky_test_runs_per_job }}"

				      skip: false

				    strategy:

				      fail-fast: false

				      matrix: ${{ fromJson(needs.prepare_parallelization_matrix.outputs.json) }}

				    steps:

				    - uses: actions/checkout@v4

				    - uses: "./.github/actions/setup_extension"

				    - name: Run minimal tests

				      run: |-

				          gosu circleci src/test/regress/citus_tests/run_test.py ${{ env.test }} --repeat ${{ env.runs }} --use-whole-schedule-line

				      shell: bash

				    - uses: "./.github/actions/save_logs_and_results"

				      if: always()

				      with:

				          folder: check_flakyness_parallel_${{ matrix.id }}

									
.github/workflows/packaging-test-pipelines.yml
									
				@ -0,0 +1,177 @@

				name: Build tests in packaging images

				on:

				  pull_request:

				    types: [opened, reopened,synchronize]

				  merge_group:

				  workflow_dispatch:

				concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}

				  cancel-in-progress: true

				jobs:

				  get_postgres_versions_from_file:

				    runs-on: ubuntu-latest

				    outputs:

				      pg_versions: ${{ steps.get-postgres-versions.outputs.pg_versions }}

				    steps:

				      - name: Checkout

				        uses: actions/checkout@v4

				        with:

				          fetch-depth: 2

				      - name: Get Postgres Versions

				        id: get-postgres-versions

				        run: |

				          set -euxo pipefail

				          # Postgres versions are stored in .github/workflows/build_and_test.yml

				          # file in json strings with major and full keys.

				          # The command below extracts the versions and gets the unique values.

				          pg_versions=$(cat .github/workflows/build_and_test.yml | grep -oE '"major": "[0-9]+", "full": "[0-9.]+"' | sed -E 's/"major": "([0-9]+)", "full": "([0-9.]+)"/\1/g' | sort | uniq | tr '\n', ',')

				          pg_versions_array="[ ${pg_versions} ]"

				          echo "Supported PG Versions: ${pg_versions_array}"

				          # The line below sets the output variable that the next job uses

				          echo "pg_versions=${pg_versions_array}" >> $GITHUB_OUTPUT

				        shell: bash

				  rpm_build_tests:

				    name: rpm_build_tests

				    needs: get_postgres_versions_from_file

				    runs-on: ubuntu-latest

				    strategy:

				      fail-fast: false

				      matrix:

				        # We use separate images for different Postgres versions in rpm

				        # based distros.

				        # For this reason, we need to use a "matrix" to generate names of

				        # rpm images, e.g. citus/packaging:centos-7-pg12

				        packaging_docker_image:

				          - oraclelinux-8

				          - almalinux-8

				          - almalinux-9

				        POSTGRES_VERSION: ${{ fromJson(needs.get_postgres_versions_from_file.outputs.pg_versions) }}

				    container:

				      image: citus/packaging:${{ matrix.packaging_docker_image }}-pg${{ matrix.POSTGRES_VERSION }}

				      options: --user root

				    steps:

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Set Postgres and python parameters for rpm based distros

				        run: |

				          echo "/usr/pgsql-${{ matrix.POSTGRES_VERSION }}/bin" >> $GITHUB_PATH

				          echo "/root/.pyenv/bin:$PATH" >> $GITHUB_PATH

				          echo "PACKAGING_PYTHON_VERSION=3.8.16" >> $GITHUB_ENV

				      - name: Configure

				        run: |

				          echo "Current Shell:$0"

				          echo "GCC Version: $(gcc --version)"

				          ./configure 2>&1 | tee output.log

				      - name: Make clean

				        run: |

				          make clean

				      - name: Make

				        run: |

				          git config --global --add safe.directory ${GITHUB_WORKSPACE}

				          make CFLAGS="-Wno-missing-braces" -sj$(cat /proc/cpuinfo | grep "core id" | wc -l) 2>&1 | tee -a output.log

				          # Check the exit code of the make command

				          make_exit_code=${PIPESTATUS[0]}

				          # If the make command returned a non-zero exit code, exit with the same code

				          if [[ $make_exit_code -ne 0 ]]; then

				              echo "make command failed with exit code $make_exit_code"

				              exit $make_exit_code

				          fi

				      - name: Make install

				        run: |

				          make CFLAGS="-Wno-missing-braces" install 2>&1 | tee -a output.log

				      - name: Validate output

				        env:

				          POSTGRES_VERSION: ${{ matrix.POSTGRES_VERSION }}

				          PACKAGING_DOCKER_IMAGE: ${{ matrix.packaging_docker_image }}

				        run: |

				          echo "Postgres version: ${POSTGRES_VERSION}"

				          ./.github/packaging/validate_build_output.sh "rpm"

				  deb_build_tests:

				    name: deb_build_tests

				    needs: get_postgres_versions_from_file

				    runs-on: ubuntu-latest

				    strategy:

				      fail-fast: false

				      matrix:

				        # On deb based distros, we use the same docker image for

				        # builds based on different Postgres versions because deb

				        # based images include all postgres installations.

				        # For this reason, we have multiple runs --which is 3 today--

				        # for each deb based image and we use POSTGRES_VERSION to set

				        # PG_CONFIG variable in each of those runs.

				        packaging_docker_image:

				          - debian-bookworm-all

				          - debian-bullseye-all

				          - ubuntu-focal-all

				          - ubuntu-jammy-all

				        POSTGRES_VERSION: ${{ fromJson(needs.get_postgres_versions_from_file.outputs.pg_versions) }}

				    container:

				      image: citus/packaging:${{ matrix.packaging_docker_image }}

				      options: --user root

				    steps:

				      - name: Checkout repository

				        uses: actions/checkout@v4

				      - name: Set pg_config path and python parameters for deb based distros

				        run: |

				          echo "PG_CONFIG=/usr/lib/postgresql/${{ matrix.POSTGRES_VERSION }}/bin/pg_config" >> $GITHUB_ENV

				          echo "/root/.pyenv/bin:$PATH" >> $GITHUB_PATH

				          echo "PACKAGING_PYTHON_VERSION=3.8.16" >> $GITHUB_ENV

				      - name: Configure

				        run: |

				          echo "Current Shell:$0"

				          echo "GCC Version: $(gcc --version)"

				          ./configure 2>&1 | tee output.log

				      - name: Make clean

				        run: |

				          make clean

				      - name: Make

				        shell: bash

				        run: |

				          set -e

				          git config --global --add safe.directory ${GITHUB_WORKSPACE}

				          make -sj$(cat /proc/cpuinfo | grep "core id" | wc -l) 2>&1 | tee -a output.log

				          # Check the exit code of the make command

				          make_exit_code=${PIPESTATUS[0]}

				          # If the make command returned a non-zero exit code, exit with the same code

				          if [[ $make_exit_code -ne 0 ]]; then

				              echo "make command failed with exit code $make_exit_code"

				              exit $make_exit_code

				          fi

				      - name: Make install

				        run: |

				          make install 2>&1 | tee -a output.log

				      - name: Validate output

				        env:

				          POSTGRES_VERSION: ${{ matrix.POSTGRES_VERSION }}

				          PACKAGING_DOCKER_IMAGE: ${{ matrix.packaging_docker_image }}

				        run: |

				          echo "Postgres version: ${POSTGRES_VERSION}"

				          ./.github/packaging/validate_build_output.sh "deb"
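
The `get_postgres_versions_from_file` job above derives the build matrix by scanning `build_and_test.yml` for JSON strings with `major` and `full` keys. As a rough illustration of that grep/sed/sort/uniq pipeline, here is a Python sketch. The function names and the sample excerpt (including its version numbers) are hypothetical; the real entries live in a part of the workflow that is not shown here.

```python
import json
import re

# Same shape as the pattern passed to `grep -oE`; the group captures the major version.
VERSION_ENTRY = re.compile(r'"major": "([0-9]+)", "full": "[0-9.]+"')


def supported_pg_majors(workflow_text):
    """Collect the unique major versions, sorted as strings (like sort | uniq)."""
    return sorted(set(VERSION_ENTRY.findall(workflow_text)))


def as_matrix_json(majors):
    """Render the majors as a JSON array usable as a matrix value."""
    return json.dumps([int(m) for m in majors])


# Hypothetical excerpt; the keys and full versions are made up for the example.
SAMPLE = """
pg15_version: '{ "major": "15", "full": "15.10" }'
pg16_version: '{ "major": "16", "full": "16.6" }'
pg17_version: '{ "major": "17", "full": "17.2" }'
other_version: '{ "major": "16", "full": "16.6" }'
"""

if __name__ == "__main__":
    majors = supported_pg_majors(SAMPLE)
    print(as_matrix_json(majors))
```

Unlike the shell version, which joins the values with `tr` and wraps them in brackets by hand, this sketch lets `json.dumps` build the array, so the result has no trailing separator.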

.gitignore

 @ -25,6 +25,7 @@ win32ver.rc
 *.exe
 lib*dll.def
 lib*.pc
 *.bc
 # Local excludes in root directory
 /config.log
 @ -36,3 +37,24 @@ lib*.pc
 /autom4te.cache
 /Makefile.global
 /src/Makefile.custom
 /compile_commands.json
 /src/backend/distributed/cdc/build-cdc-*/*
 /src/test/cdc/tmp_check/*
 # temporary files vim creates
 *.swp
 # vscode
 .vscode/*
 # output from diff normalization that shouldn't be committed
 *.unmodified
 *.modified
 # style related temporary outputs
 *.uncrustify
 .venv
 # added output when modifying check_gucs_are_alphabetically_sorted.sh
 guc.out

1

.ignore Normal file


				`@ -0,0 +1 @@`
				`/vendor`

									
										19

.travis.yml

									
				@ -1,19 +0,0 @@

				sudo: required

				dist: trusty

				language: c

				cache: apt

				branches:

				  except: [ /^open-.*$/ ]

				env:

				  global:

				    secure: degV+qb2xHiea7E2dGk/WLvmYjq4ZsBn6ZPko+YhRcNm2GRXRaU3FqMBIecPtsEEFYaL5GwCQq/CgBf9aQxgDQ+t2CrmtGTtI9AGAbVBl//amNeJOoLe6QvrDpSQX5pUxwDLCng8cvoQK7ZxGlNCzDKiu4Ep4DUWgQVpauJkQ9nHjtSMZvUqCoI9h1lBy9Mxh7YFfHPW2PAXCqpV4VlNiIYF84UKdX3MXKLy9Yt0JBSNTWLZFp/fFw2qNwzFvN94rF3ZvFSD7Wp6CIhT6R5/6k6Zx8YQIrjWhgm6OVy1osUA8X7W79h2ISPqKqMNVJkjJ+N8S4xuQU0kfejnQ74Ie/uJiHCmbW5W2TjpL1aU3FQpPsGwR8h0rSeHhJAJzd8Ma+z8vvnnQHDyvetPBB0WgA/VMQCu8uEutyfYw2hDmB2+l2dDwkViaI7R95bReAGrpd5uNqklAXuR7yOeArz0ZZpHV0aZHGcNBxznMaZExSVZ5DVPW38UPn7Kgse8BnOWeLgnA1hJVp6CmBCtu+hKYt+atBPgRbM8IUINnKKZf/Sk6HeJIJZs662jD8/X93vFi0ZtyV2jEKJpouWw8j4vrGGsaDzTEUcyJgDqZj7tPJptM2L5B3BcFJmkGj2HO3N+LGDarJrVBBSiEjhTgx4NnLiKZnUbMx547mCRg2akk2w=

				  matrix:

				    - PGVERSION=9.5

				before_install:

				  - git clone --depth 1 https://github.com/citusdata/tools.git

				  - tools/travis/setup_apt.sh

				  - tools/travis/nuke_pg.sh

				install:

				  - tools/travis/install_pg.sh

				script: tools/travis/pg_travis_multi_test.sh

				after_success: tools/travis/sync_to_enterprise

3648

CHANGELOG.md


File diff suppressed because it is too large

									
										9

CODE_OF_CONDUCT.md Normal file

									
				@ -0,0 +1,9 @@

				# Microsoft Open Source Code of Conduct

				This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).

				Resources:

				- [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/)

				- [Microsoft Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/)

				- Contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with questions or concerns

									
										215

CONTRIBUTING.md

									
				@ -6,13 +6,70 @@ We're happy you want to contribute! You can help us in different ways:

				  suggestions for improvements

				* Fork this repository and submit a pull request

				Before accepting any code contributions we ask that Citus contributors

				Before accepting any code contributions we ask that contributors

				sign a Contributor License Agreement (CLA). For an explanation of

				why we ask this as well as instructions for how to proceed, see the

				[Citus CLA](https://cla.citusdata.com).

				[Microsoft CLA](https://cla.opensource.microsoft.com/).

				### Devcontainer / Github Codespaces

				The easiest way to start contributing is via our devcontainer. This container works both locally in Visual Studio Code with docker-desktop/docker-for-mac and in [GitHub Codespaces](https://github.com/features/codespaces). To open the project in VS Code you will need the [Dev Containers extension](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-containers). For Codespaces you will need to [create a new codespace](https://codespace.new/citusdata/citus).

				With the extension installed you can run the following from the command palette to get started:

				```

				> Dev Containers: Clone Repository in Container Volume...

				```

				In the subsequent popup paste the url to the repo and hit enter.

				```

				https://github.com/citusdata/citus

				```

				This will create an isolated Workspace in vscode, complete with all tools required to build, test and run the Citus extension. We keep this container up to date with the supported postgres versions as well as the exact versions of tooling we use.

				To get started quickly, we suggest splitting your terminal so that you have two shells: the left one in `/workspaces/citus`, the right one changed to `/data`. The left terminal will be used to interact with the project, the right one with a testing cluster.

				To get citus installed from source we run `make install -s` in the first terminal. Once installed you can start a Citus cluster in the second terminal via `citus_dev make citus`. The cluster will run in the background, and can be interacted with via `citus_dev`. To get an overview of the available commands.

				With the Citus cluster running you can connect to the coordinator in the first terminal via `psql -p9700`. Because the coordinator is the most common entrypoint, the `PGPORT` environment variable is set accordingly, so a plain `psql` will connect directly to the coordinator.
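
The port selection described above can be sketched in a few lines of shell. This is only an illustration: `resolve_pg_port` is a hypothetical helper, not part of `citus_dev` or the devcontainer, and it assumes the usual libpq behaviour where an explicit port wins over `PGPORT`, which in turn wins over the compiled-in default of 5432.

```shell
# Hypothetical helper (not shipped with Citus): print the port a client
# such as psql would target.
# Precedence assumed here: explicit argument > PGPORT > default 5432.
resolve_pg_port() {
    local explicit_port="${1:-}"
    if [ -n "$explicit_port" ]; then
        echo "$explicit_port"
    else
        echo "${PGPORT:-5432}"
    fi
}
```

Inside the devcontainer, where `PGPORT` is already set for the coordinator, calling `resolve_pg_port` without arguments would print that port, which is why a bare `psql` reaches the coordinator.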

				### Debugging in VS Code

				1. Start Debugging: Press F5 in VS Code to start debugging. When prompted, you'll need to attach the debugger to the appropriate PostgreSQL process.

				2. Identify the Process: If you're running a psql command, take note of the PID that appears in your psql prompt. For example:

				```

				[local] citus@citus:9700 (PID: 5436)=#

				```

				This PID (5436 in this case) indicates the process that you should attach the debugger to.

				3. If you are uncertain about which process to attach to, you can list all running PostgreSQL processes using the following command:

				```

				ps aux | grep postgres

				```

				Look for the process associated with the PID you noted. For example:

				```

				citus      5436  0.0  0.0  0  0 ?        S    14:00   0:00 postgres: citus citus

				```

				4. Attach the Debugger: Once you've identified the correct PID, select that process when prompted in VS Code to attach the debugger. You should now be able to debug the PostgreSQL session tied to the psql command.

				5. Set Breakpoints and Debug: With the debugger attached, you can set breakpoints within the code. This allows you to step through the code execution, inspect variables, and fully debug the PostgreSQL instance running in your container.

				### Getting and building

				[PostgreSQL documentation](https://www.postgresql.org/support/versioning/) has a

				section on upgrade policy.

					We always recommend that all users run the latest available minor release [for PostgreSQL] for whatever major version is in use.

				We expect Citus users to honor this recommendation and use latest available

				PostgreSQL minor release. Failure to do so may result in failures in our test

				suite. There are some known improvements in PG test architecture such as

				[this commit](https://github.com/postgres/postgres/commit/3f323956128ff8589ce4d3a14e8b950837831803)

				that are missing in earlier minor versions.

				#### Mac

				1. Install Xcode

				@ -20,7 +77,7 @@ why we ask this as well as instructions for how to proceed, see the

				  ```bash

				  brew update

				  brew install git postgresql

				  brew install git postgresql python

				  ```

				3. Get, build, and test the code

				@ -30,9 +87,19 @@ why we ask this as well as instructions for how to proceed, see the

				  cd citus

				  ./configure

				  # If you have already installed the project, you need to clean it first

				  make clean

				  make

				  make install

				  # Optionally, you might instead want to use `make install-all`

				  # since `multi_extension` regression test would fail due to missing downgrade scripts.

				  cd src/test/regress

				  pip install pipenv

				  pipenv --rm

				  pipenv install

				  pipenv shell

				  make check

				  ```

				@ -47,9 +114,11 @@ why we ask this as well as instructions for how to proceed, see the

				       sudo apt-key add -

				  sudo apt-get update

				  sudo apt-get install -y postgresql-server-dev-9.5 postgresql-9.5 \

				                          libedit-dev libselinux1-dev libxslt-dev  \

				                          libpam0g-dev git flex make

				  sudo apt-get install -y postgresql-server-dev-14 postgresql-14 \

				                          autoconf flex git libcurl4-gnutls-dev libicu-dev \

				                          libkrb5-dev liblz4-dev libpam0g-dev libreadline-dev \

				                          libselinux1-dev libssl-dev libxslt1-dev libzstd-dev \

				                          make uuid-dev

				  ```

				2. Get, build, and test the code

				@ -58,35 +127,157 @@ why we ask this as well as instructions for how to proceed, see the

				  git clone https://github.com/citusdata/citus.git

				  cd citus

				  ./configure

				  # If you have already installed the project previously, you need to clean it first

				  make clean

				  make

				  sudo make install

				  # Optionally, you might instead want to use `sudo make install-all`

				  # since `multi_extension` regression test would fail due to missing downgrade scripts.

				  cd src/test/regress

				  pip install pipenv

				  pipenv --rm

				  pipenv install

				  pipenv shell

				  make check

				  ```

				#### Red Hat-based Linux (RHEL, CentOS, Fedora)

				1. Find the PostgreSQL 9.5 RPM URL for your repo at [yum.postgresql.org](http://yum.postgresql.org/repopackages.php#pg95)

				1. Find the RPM URL for your repo at [yum.postgresql.org](http://yum.postgresql.org/repopackages.php)

				2. Register its contents with Yum:

				  ```bash

				  sudo yum install -y <url>

				  ```

				3. Install build dependencies

				3. Register EPEL and SCL repositories for your distro.

				  On CentOS:

				  ```bash

				  yum install -y centos-release-scl-rh epel-release

				  ```

				  On RHEL, see [this RedHat blog post](https://developers.redhat.com/blog/2018/07/07/yum-install-gcc7-clang/) to set up SCL first. Then run:

				  ```bash

				  yum install -y epel-release

				  ```

				4. Install build dependencies

				  ```bash

				  sudo yum update -y

				  sudo yum groupinstall -y 'Development Tools'

				  sudo yum install -y postgresql95-devel postgresql95-server    \

				                      libxml2-devel libxslt-devel openssl-devel \

				                      pam-devel readline-devel git

				  sudo yum install -y postgresql14-devel postgresql14-server     \

				                      git libcurl-devel libxml2-devel libxslt-devel \

				                      libzstd-devel llvm-toolset-7-clang llvm5.0 lz4-devel \

				                      openssl-devel pam-devel readline-devel

				  git clone https://github.com/citusdata/citus.git

				  cd citus

				  PG_CONFIG=/usr/pgsql-9.5/bin/pg_config ./configure

				  PG_CONFIG=/usr/pgsql-14/bin/pg_config ./configure

				  # If you have already installed the project previously, you need to clean it first

				  make clean

				  make

				  sudo make install

				  # Optionally, you might instead want to use `sudo make install-all`

				  # since `multi_extension` regression test would fail due to missing downgrade scripts.

				  cd src/test/regress

				  pip install pipenv

				  pipenv --rm

				  pipenv install

				  pipenv shell

				  make check

				  ```

				### Following our coding conventions

				Our coding conventions are documented in [STYLEGUIDE.md](STYLEGUIDE.md).

				### Making SQL changes

				Sometimes you need to change the SQL that the Citus extension runs upon

				creation. The way this is done is by changing the last file in

				`src/backend/distributed/sql`, or creating it if the last file is from a

				published release. If you needed to create a new file, also change the

				`default_version` field in `src/backend/distributed/citus.control` to match your

				new version. All the files in this directory are run in order based on

				their name. See [this page in the Postgres

				docs](https://www.postgresql.org/docs/current/extend-extensions.html) for more

				information on how Postgres runs these files.

				#### Changing or creating functions

				If you need to change any functions defined by Citus, you should check inside

				`src/backend/distributed/sql/udfs` to see if there is already a directory for

				this function; if not, create one. Then change or create the file called

				`latest.sql` in that directory to match how it should create the function. This

				should include any DROP (IF EXISTS), COMMENT and REVOKE statements for this

				function.

				Then copy the `latest.sql` file to `{version}.sql`, where `{version}` is the

				version this SQL change is for, e.g. `9.0-1.sql`. Now that you've

				created this stable snapshot of the function definition for your version you

				should use it in your actual sql file, e.g.

				`src/backend/distributed/sql/citus--8.3-1--9.0-1.sql`. You do this by using C

				style `#include` statements like this:

				```

				#include "udfs/myudf/9.0-1.sql"

				```
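
The snapshot-and-include workflow above can be sketched as a small shell helper. This is a minimal sketch under stated assumptions: `snapshot_udf` is a hypothetical function (it does not exist in the Citus repository), and it assumes `latest.sql` has already been edited to the desired definition.

```shell
# Hypothetical helper: snapshot a UDF's latest.sql for a given version and
# reference the snapshot from the migration script.
# Arguments: <sql dir> <udf name> <version> <migration file name>
snapshot_udf() {
    local sql_dir="$1" udf="$2" version="$3" migration="$4"
    local udf_dir="$sql_dir/udfs/$udf"

    # latest.sql must already describe the desired function definition.
    if [ ! -f "$udf_dir/latest.sql" ]; then
        echo "missing $udf_dir/latest.sql" >&2
        return 1
    fi

    # Freeze the current definition as the stable snapshot for this version.
    cp "$udf_dir/latest.sql" "$udf_dir/$version.sql"

    # Append the include line only if it is not already present, so that
    # rerunning the helper does not duplicate it.
    local include_line="#include \"udfs/$udf/$version.sql\""
    touch "$sql_dir/$migration"
    grep -qxF -- "$include_line" "$sql_dir/$migration" \
        || echo "$include_line" >> "$sql_dir/$migration"
}
```

With the example names used in this section, the call would look like `snapshot_udf src/backend/distributed/sql myudf 9.0-1 citus--8.3-1--9.0-1.sql`.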

				#### Other SQL

				Any other SQL you can put directly in the main sql file, e.g.

				`src/backend/distributed/sql/citus--8.3-1--9.0-1.sql`.

				### Backporting a commit to a release branch

				1. Check out the release branch that you want to backport to `git checkout release-11.3`

				2. Make sure you have the latest changes `git pull`

				3. Create a new release branch with a unique name `git checkout -b release-11.3-<yourname>`

				4. Cherry-pick the commit that you want to backport `git cherry-pick -x <sha>` (the `-x` is important)

				5. Push the branch `git push`

				6. Wait for tests to pass

				7. If the cherry-pick required resolving non-trivial merge conflicts, create a PR and ask

				   for a review.

				8. After the tests pass on CI, fast-forward the release branch `git push origin release-11.3-<yourname>:release-11.3`
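
The git commands from the steps above can be collected into a dry-run helper that prints them for review instead of executing them. This is a sketch only: `print_backport_plan` is a hypothetical function, and the waiting and review steps (6 and 7) still have to happen by hand before the last command is run.

```shell
# Hypothetical dry-run helper: print the backport commands in order.
# Nothing is executed; copy the lines you want to run.
# Arguments: <release branch> <your name> <commit sha>
print_backport_plan() {
    local release="$1" name="$2" sha="$3"
    local work_branch="$release-$name"

    # One command per line, in the same order as the numbered steps.
    # The final fast-forward push belongs after CI has passed.
    printf '%s\n' \
        "git checkout $release" \
        "git pull" \
        "git checkout -b $work_branch" \
        "git cherry-pick -x $sha" \
        "git push" \
        "git push origin $work_branch:$release"
}
```

For example, `print_backport_plan release-11.3 <yourname> <sha>` lists the six commands with the branch names filled in.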

				### Running tests

				See [`src/test/regress/README.md`](https://github.com/citusdata/citus/blob/master/src/test/regress/README.md)

				### Documentation

				User-facing documentation is published on [docs.citusdata.com](https://docs.citusdata.com/). When adding a new feature, function, or setting, you can open a pull request or issue against the [Citus docs repo](https://github.com/citusdata/citus_docs/).

				Detailed descriptions of the implementation for Citus developers are provided in the [Citus Technical Documentation](src/backend/distributed/README.md). It is currently a single file for ease of searching. Please update the documentation if you make any changes that affect the design or add major new features.

				# Making a pull request ready for reviews

				Asking for help and asking for reviews are two different things. When you're asking for help, you're asking for someone to help you with something that you're not expected to know.

				But when you're asking for a review, you're asking for someone to review your work and provide feedback. So, when you're asking for a review, you're expected to make sure that:

				* Your changes don't perform **unnecessary line addition / deletions / style changes on unrelated files / lines**.

				* All CI jobs are **passing**, including **style checks** and **flaky test detection jobs**. Note that if you're an external contributor, you don't have to wait for CI jobs to run (and finish) because they don't get automatically triggered for external contributors.

				* Your PR has the necessary amount of **tests** and they're passing.

				* You separated the work into **separate PRs** as much as possible, e.g., a prerequisite bugfix, a refactoring, etc.

				* Your PR doesn't introduce a typo or something that you can easily fix yourself.

				* After all CI jobs pass, the code-coverage measurement job (CodeCov as of today) kicks in. That's why it's important to get the **tests passing** first. At that point, you're expected to check the **CodeCov annotations** in the **Files Changed** tab and make sure that they don't flag any lines that are not covered. For example, it's ok if CodeCov complains about an `ereport()` call that you put for an "unexpected-but-better-than-crashing" case, but it's not ok if it complains about an uncovered `if` branch that you added.

				* And finally, perform a **self-review** to make sure that:

				  * Code and code comments reflect the idea **without requiring an extra explanation** via a chat message / email / PR comment.

				    This is important because we don't expect developers to reach out to the author or read the whole discussion in the PR to understand the idea behind a commit merged into the `main` branch.

				  * PR description is clear enough.

				  * If-and-only-if you're **introducing a user facing change / bugfix**, your PR has a line that starts with `DESCRIPTION: <Present simple tense word that starts with a capital letter, e.g., Adds support for / Fixes / Disallows>`.

				  * **Commit messages** are clear enough if the commits are doing logically different things.

									
										43

DEVCONTAINER.md Normal file

									
				@ -0,0 +1,43 @@

				# Devcontainer

				## Coredumps

				When postgres/citus crashes, there is the option to create a coredump. This is useful for debugging the issue. Coredumps are enabled in the devcontainer by default. However, not all environments are configured correctly out of the box. The most important configuration that is not standardized is the `core_pattern`. The configuration can be verified from the container, however, you cannot change this setting from inside the container as the filesystem containing this setting is in read only mode while inside the container.

				To verify if corefiles are written run the following command in a terminal. This shows the filename pattern with which the corefile will be written.

				```bash

				cat /proc/sys/kernel/core_pattern

				```

				This should be configured with a relative path or just a plain filename, such as `core`. When your environment shows an absolute path you will need to change this setting. How to change this setting depends highly on the underlying system, as the setting needs to be changed on the kernel of the host running the container.

				You can put any pattern in `/proc/sys/kernel/core_pattern` as you see fit. For example, you can add the PID to the core pattern in one of two ways:

				- You either include `%p` in the core_pattern. This gets substituted with the PID of the crashing process.

				- Alternatively you could set `/proc/sys/kernel/core_uses_pid` to `1` in the same way as you set `core_pattern`. This will append the PID to the corefile if `%p` is not explicitly contained in the core_pattern.
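
The checks described above can be sketched as two small shell helpers. Both function names are hypothetical and not part of the devcontainer. The `pipe` case is not discussed in the text; it is included on the assumption that the kernel treats a leading `|` in `core_pattern` as a handler program rather than a filename.

```shell
# Hypothetical helper: classify a core_pattern value.
# Prints "pipe" when cores are handed to a program, "absolute" for an
# absolute path, and "relative" for a relative path or plain filename.
classify_core_pattern() {
    case "$1" in
        '|'*) echo "pipe" ;;
        /*)   echo "absolute" ;;
        *)    echo "relative" ;;
    esac
}

# Hypothetical helper: succeed when the corefile name will carry the PID,
# either through %p in the pattern or through a non-zero core_uses_pid.
# Arguments: <core_pattern value> <core_uses_pid value, default 0>
core_name_has_pid() {
    local pattern="$1" uses_pid="${2:-0}"
    case "$pattern" in
        *%p*) return 0 ;;
    esac
    [ "$uses_pid" != "0" ]
}
```

Inside the container, `classify_core_pattern "$(cat /proc/sys/kernel/core_pattern)"` would report whether the current setting matches the recommended `relative` form.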

				When a coredump is written you can use the debug/launch configuration `Open core file`, which is preconfigured in the devcontainer. This will open a file prompt that lists all coredumps found in your workspace. When you want to debug coredumps from `citus_dev` that are run in your `/data` directory, you can add the data directory to your workspace. In the command palette of VS Code you can run `>Workspace: Add Folder to Workspace...` and select the `/data` directory. This will allow you to open the coredumps from the `/data` directory in the `Open core file` debug configuration.

				### Windows (docker desktop)

				When running in Docker Desktop on Windows you will most likely need to change this setting. The Linux guest in WSL2 that runs your container is the `docker-desktop` environment. The easiest way to get onto the host, where you can change this setting, is to open a PowerShell window and verify you have the `docker-desktop` environment listed.

				```powershell

				wsl --list

				```

				Among others this should list both `docker-desktop` and `docker-desktop-data`. You can then open a shell in the `docker-desktop` environment.

				```powershell

				wsl -d docker-desktop

				```

				Inside this shell you can verify that you have the right environment by running

				```bash

				cat /proc/sys/kernel/core_pattern

				```

				This should show the same configuration as the one you see inside the devcontainer. You can then change the setting by running the following command.

				This will change the setting for the current session. If you want to make the change permanent you will need to add this to a startup script.

				```bash

				echo "core" > /proc/sys/kernel/core_pattern

				```

2

LICENSE


 @ -658,4 +658,4 @@ specific requirements.
   You should also get your employer (if you work as a programmer) or school,
 if any, to sign a "copyright disclaimer" for the program, if necessary.
 For more information on this, and how to apply and follow the GNU AGPL, see
 <http://www.gnu.org/licenses/>.
 <http://www.gnu.org/licenses/>.

									
										55

Makefile

									
				@ -2,6 +2,7 @@

				citus_subdir = .

				citus_top_builddir = .

				extension_dir = $(shell $(PG_CONFIG) --sharedir)/extension

				# Hint that configure should be run first

				ifeq (,$(wildcard Makefile.global))

				@ -10,47 +11,57 @@ endif

				include Makefile.global

				all: extension csql

				all: extension

				# build columnar only

				columnar:

					$(MAKE) -C src/backend/columnar all

				# build extension

				extension:

				extension: $(citus_top_builddir)/src/include/citus_version.h columnar

					$(MAKE) -C src/backend/distributed/ all

				install-extension: extension

				install-columnar: columnar

					$(MAKE) -C src/backend/columnar install

				install-extension: extension install-columnar

					$(MAKE) -C src/backend/distributed/ install

				install-headers: extension

					$(MKDIR_P) '$(DESTDIR)$(includedir_server)/distributed/'

				# generated headers are located in the build directory

					$(INSTALL_DATA) src/include/citus_config.h '$(DESTDIR)$(includedir_server)/'

					$(INSTALL_DATA) $(citus_top_builddir)/src/include/citus_version.h '$(DESTDIR)$(includedir_server)/'

				# the rest in the source tree

					$(INSTALL_DATA) $(citus_abs_srcdir)/src/include/distributed/*.h '$(DESTDIR)$(includedir_server)/distributed/'

				clean-extension:

					$(MAKE) -C src/backend/distributed/ clean

				.PHONY: extension install-extension clean-extension

					$(MAKE) -C src/backend/columnar/ clean

				clean-full:

					$(MAKE) -C src/backend/distributed/ clean-full

				.PHONY: extension install-extension clean-extension clean-full

				install-downgrades:

					$(MAKE) -C src/backend/distributed/ install-downgrades

				install-all: install-headers

					$(MAKE) -C src/backend/columnar/ install-all

					$(MAKE) -C src/backend/distributed/ install-all

				# Add to generic targets

				install: install-extension install-headers

				clean: clean-extension

				# build csql binary

				csql:

					$(MAKE) -C src/bin/csql/ all

				install-csql: csql

					$(MAKE) -C src/bin/csql/ install

				clean-csql:

					$(MAKE) -C src/bin/csql/ clean

				.PHONY: csql install-csql clean-csql

				# Add to generic targets

				install: install-csql

				clean: clean-csql

				# apply or check style

				reindent:

					cd ${citus_abs_top_srcdir} && citus_indent --quiet

					${citus_abs_top_srcdir}/ci/fix_style.sh

				check-style:

					black . --check --quiet

					isort . --check --quiet

					flake8

					cd ${citus_abs_top_srcdir} && citus_indent --quiet --check

				.PHONY: reindent check-style

				# depend on install for now

				check: all install

					$(MAKE) -C src/test/regress check-full

				# depend on install-all so that downgrade scripts are installed as well

				check: all install-all

					# explicitly does not use $(MAKE) to avoid parallelism

					make -C src/test/regress check

				.PHONY: all check install clean

				.PHONY: all check clean install install-downgrades install-all

									
										36

Makefile.global.in

									
				@ -11,9 +11,29 @@

				citus_abs_srcdir:=@abs_top_srcdir@/${citus_subdir}

				citus_abs_top_srcdir:=@abs_top_srcdir@

				postgres_abs_srcdir:=@POSTGRES_SRCDIR@

				postgres_abs_builddir:=@POSTGRES_BUILDDIR@

				PG_CONFIG:=@PG_CONFIG@

				PGXS:=$(shell $(PG_CONFIG) --pgxs)

				# if both, git is installed and there is a .git directory in the working dir we set the

				# GIT_VERSION to a human readable gitref that resembles the version from which citus is

				# built. During releases it will show the tagname which by convention is the version of the

				# release

				ifneq (@GIT_BIN@,)

				ifneq (@HAS_DOTGIT@,)

					# try to find a tag that exactly matches the current branch, swallow the error if cannot find such a tag

					GIT_VERSION := "$(shell @GIT_BIN@ describe --exact-match --dirty --always --tags 2>/dev/null)"

					# if there is not a tag that exactly matches the branch, then GIT_VERSION would still be empty

					# in that case, set GIT_VERSION with current branch's name and the short sha of the HEAD

				ifeq ($(GIT_VERSION),"")

					GIT_VERSION := "$(shell @GIT_BIN@ rev-parse --abbrev-ref HEAD)(sha: $(shell @GIT_BIN@ rev-parse --short HEAD))"

				endif

				endif

				endif

				# Support for VPATH builds (i.e. builds from outside the source tree)

				vpath_build=@vpath_build@

				ifeq ($(vpath_build),yes)

				@ -41,11 +61,11 @@ $(citus_top_builddir)/Makefile.global: $(citus_abs_top_srcdir)/configure $(citus

				# Ensure configuration is generated by the most recent configure,

				# useful for longer existing build directories.

				$(citus_top_builddir)/config.status: $(citus_abs_top_srcdir)/configure

					cd @abs_top_builddir@ && ./config.status --recheck

				$(citus_top_builddir)/config.status: $(citus_abs_top_srcdir)/configure $(citus_abs_top_srcdir)/src/backend/distributed/citus.control

					cd @abs_top_builddir@ && ./config.status --recheck && ./config.status

				# Regenerate configure if configure.in changed

				$(citus_abs_top_srcdir)/configure: $(citus_abs_top_srcdir)/configure.in

				# Regenerate configure if configure.ac changed

				$(citus_abs_top_srcdir)/configure: $(citus_abs_top_srcdir)/configure.ac

					cd ${citus_abs_top_srcdir} && ./autogen.sh

				# If specified via configure, replace the default compiler. Normally

				@ -66,8 +86,12 @@ endif

				# Add options passed to configure or computed therein, to CFLAGS/CPPFLAGS/...

				override CFLAGS += @CFLAGS@ @CITUS_CFLAGS@

				override CPPFLAGS := @CPPFLAGS@ -I '${citus_abs_top_srcdir}/src/include' $(CPPFLAGS)

				override LDFLAGS += @LDFLAGS@

				override BITCODE_CFLAGS := $(BITCODE_CFLAGS) @CITUS_BITCODE_CFLAGS@

				ifneq ($(GIT_VERSION),)

				    override CFLAGS += -DGIT_VERSION=\"$(GIT_VERSION)\"

				endif

				override CPPFLAGS := @CPPFLAGS@ @CITUS_CPPFLAGS@ -I '${citus_abs_top_srcdir}/src/include' -I'${citus_top_builddir}/src/include' $(CPPFLAGS)

				override LDFLAGS += @LDFLAGS@ @CITUS_LDFLAGS@

				# optional file with user defined, additional, rules

				-include ${citus_abs_srcdir}/src/Makefile.custom

99

NOTICE Normal file


 @ -0,0 +1,99 @@
 NOTICES AND INFORMATION
 Do Not Translate or Localize
 This software incorporates material from third parties.
 Microsoft makes certain open source code available at https://3rdpartysource.microsoft.com,
 or you may send a check or money order for US $5.00, including the product name,
 the open source component name, platform, and version number, to:
 Source Code Compliance Team
 Microsoft Corporation
 One Microsoft Way
 Redmond, WA 98052
 USA
 Notwithstanding any other terms, you may reverse engineer this software to the extent
 required to debug changes to any libraries licensed under the GNU Lesser General Public License.
 ---------------------------------------------------------
 ---------------------------------------------------------
 intel/safestringlib 245c4b8cff1d2e7338b7f3a82828fc8e72b29549 - MIT
 Copyright (c) 2014-2018 Intel Corporation
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
 in the Software without restriction, including without limitation the rights
 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:
 The above copyright notice and this permission notice shall be included in all
 copies or substantial portions of the Software.
 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
 SOFTWARE.
 ================================================================================
 Copyright (C) 2012, 2013 Cisco Systems
 All rights reserved.
 Permission is hereby granted, free of charge, to any person
 obtaining a copy of this software and associated documentation
 files (the "Software"), to deal in the Software without
 restriction, including without limitation the rights to use,
 copy, modify, merge, publish, distribute, sublicense, and/or
 sell copies of the Software, and to permit persons to whom the
 Software is furnished to do so, subject to the following
 conditions:
 The above copyright notice and this permission notice shall be
 included in all copies or substantial portions of the Software.
 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
 EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES
 OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
 NONINFRINGEMENT.  IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT
 HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
 WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING
 FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
 OTHER DEALINGS IN THE SOFTWARE.
 ---------------------------------------------------------
 postgres/postgres 29be9983a64c011eac0b9ee29895cce71e15ea77
 PostgreSQL Database Management System
 (formerly known as Postgres, then as Postgres95)
 Portions Copyright (c) 1996-2020, PostgreSQL Global Development Group
 Portions Copyright (c) 1994, The Regents of the University of California
 Permission to use, copy, modify, and distribute this software and its
 documentation for any purpose, without fee, and without a written agreement
 is hereby granted, provided that the above copyright notice and this
 paragraph and the following two paragraphs appear in all copies.
 IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR
 DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING
 LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS
 DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE
 POSSIBILITY OF SUCH DAMAGE.
 THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES,
 INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
 AND FITNESS FOR A PARTICULAR PURPOSE.  THE SOFTWARE PROVIDED HEREUNDER IS
 ON AN "AS IS" BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATIONS TO
 PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.
 ---------------------------------------------------------

									
										596

README.md

									
				@ -1,148 +1,496 @@

				![Citus Banner](/github-banner.png)

				| **<br/>The Citus database is 100% open source.<br/><img width=1000/><br/>Learn what's new in the [Citus 13.0 release blog](https://www.citusdata.com/blog/2025/02/06/distribute-postgresql-17-with-citus-13/) and the [Citus Updates page](https://www.citusdata.com/updates/).<br/><br/>**|

				|---|

				<br/>

				[![Build Status](https://travis-ci.org/citusdata/citus.svg?branch=master)](https://travis-ci.org/citusdata/citus)

				[![Slack Status](http://slack.citusdata.com/badge.svg)](https://slack.citusdata.com)

				[![Latest Docs](https://img.shields.io/badge/docs-latest-brightgreen.svg)](http://docs.citusdata.com/en/v5.1/index.html)

				### What is Citus?

				* **Open-source** PostgreSQL extension (not a fork)

				* **Scalable** across multiple hosts through sharding and replication

				* **Distributed** engine for query parallelization

				* **Highly available** in the face of host failures

				![Citus Banner](images/citus-readme-banner.png)

				Citus horizontally scales PostgreSQL across commodity servers using

				sharding and replication. Its query engine parallelizes incoming

				SQL queries across these servers to enable real-time responses on

				large datasets.

				[![Latest Docs](https://img.shields.io/badge/docs-latest-brightgreen.svg)](https://docs.citusdata.com/)

				[![Stack Overflow](https://img.shields.io/badge/Stack%20Overflow-%20-545353?logo=Stack%20Overflow)](https://stackoverflow.com/questions/tagged/citus)

				[![Slack](https://cituscdn.azureedge.net/images/social/slack-badge.svg)](https://slack.citusdata.com/)

				[![Code Coverage](https://codecov.io/gh/citusdata/citus/branch/master/graph/badge.svg)](https://app.codecov.io/gh/citusdata/citus)

				[![Twitter](https://img.shields.io/twitter/follow/citusdata.svg?label=Follow%20@citusdata)](https://twitter.com/intent/follow?screen_name=citusdata)

				Citus extends the underlying database rather than forking it, which

				gives developers and enterprises the power and familiarity of a

				traditional relational database. As an extension, Citus supports

				new PostgreSQL releases, allowing users to benefit from new features

				while maintaining compatibility with existing PostgreSQL tools.

				Note that Citus supports many (but not all) SQL commands; see the

				[FAQ][faq] for more details.

				[![Citus Deb Packages](https://img.shields.io/badge/deb-packagecloud.io-844fec.svg)](https://packagecloud.io/app/citusdata/community/search?q=&filter=debs)

				[![Citus Rpm Packages](https://img.shields.io/badge/rpm-packagecloud.io-844fec.svg)](https://packagecloud.io/app/citusdata/community/search?q=&filter=rpms)

				Common Use-Cases:

				* Powering real-time analytic dashboards

				* Exploratory queries on events as they happen

				* Large dataset archival and reporting

				* Session analytics (funnels, segmentation, and cohorts)

				## What is Citus?

				To learn more, visit [citusdata.com](https://www.citusdata.com) and join

				the [mailing list](https://groups.google.com/forum/#!forum/citus-users) to

				stay on top of the latest developments.

				Citus is a [PostgreSQL extension](https://www.citusdata.com/blog/2017/10/25/what-it-means-to-be-a-postgresql-extension/) that transforms Postgres into a distributed database—so you can achieve high performance at any scale.

				### Quickstart

				With Citus, you extend your PostgreSQL database with new superpowers:

				#### Local Citus Cluster

				- **Distributed tables** are sharded across a cluster of PostgreSQL nodes to combine their CPU, memory, storage and I/O capacity.

				- **Reference tables** are replicated to all nodes for joins and foreign keys from distributed tables and maximum read performance.

				- **Distributed query engine** routes and parallelizes SELECT, DML, and other operations on distributed tables across the cluster.

				- **Columnar storage** compresses data, speeds up scans, and supports fast projections, both on regular and distributed tables.

				- **Query from any node** enables you to utilize the full capacity of your cluster for distributed queries.

				* Install docker-compose: [Mac][mac_install] | [Linux][linux_install]

				* (Mac only) connect to Docker VM

				  ```bash

				  eval $(docker-machine env default)

				  ```

				You can use these Citus superpowers to make your Postgres database scale-out ready on a single Citus node. Or you can build a large cluster capable of handling **high transaction throughputs**, especially in **multi-tenant apps**, run **fast analytical queries**, and process large amounts of **time series** or **IoT data** for **real-time analytics**. When your data size and volume grow, you can easily add more worker nodes to the cluster and rebalance the shards.

				* Pull and start the docker images

				  ```bash

				  wget https://raw.githubusercontent.com/citusdata/docker/master/docker-compose.yml

				  docker-compose -p citus up -d

				  ```

				Our [SIGMOD '21](https://2021.sigmod.org/) paper [Citus: Distributed PostgreSQL for Data-Intensive Applications](https://doi.org/10.1145/3448016.3457551) gives a more detailed look into what Citus is, how it works, and why it works that way.

				* Connect to the master database

				  ```bash

				  docker exec -it citus_master psql -U postgres -d postgres

				  ```

				![Citus scales out from a single node](images/citus-scale-out.png)

				* Follow the [first tutorial][tutorial] instructions

				* To shut the cluster down, run

				Since Citus is an extension to Postgres, you can use Citus with the latest Postgres versions. And Citus works seamlessly with the PostgreSQL tools and extensions you are already familiar with.

				  ```bash

				  docker-compose -p citus down

				  ```

				- [Why Citus?](#why-citus)

				- [Getting Started](#getting-started)

				- [Using Citus](#using-citus)

				- [Schema-based sharding](#schema-based-sharding)

				- [Setting up with High Availability](#setting-up-with-high-availability)

				- [Documentation](#documentation)

				- [Architecture](#architecture)

				- [When to Use Citus](#when-to-use-citus)

				- [Need Help?](#need-help)

				- [Contributing](#contributing)

				- [Stay Connected](#stay-connected)

				### Talk to Contributors and Learn More

				## Why Citus?

				<table class="tg">

				<col width="45%">

				<col width="65%">

				<tr>

				  <td>Documentation</td>

				  <td>Try the <a

				  href="http://docs.citusdata.com/en/v5.1/tutorials/tut-cluster.html">Citus

				  tutorials</a> for a hands-on introduction or <br/>the <a

				  href="http://docs.citusdata.com/en/v5.1/index.html">documentation</a> for

				  a more comprehensive reference.</td>

				</tr>

				<tr>

				  <td>Google Groups</td>

				  <td>The <a

				  href="https://groups.google.com/forum/#!forum/citus-users">Citus Google

				  Group</a> is our place for detailed questions and discussions.</td>

				</tr>

				<tr>

				  <td>Slack</td>

				  <td>Chat with us in our community <a

				  href="https://slack.citusdata.com">Slack channel</a>.</td>

				</tr>

				<tr>

				  <td>Github Issues</td>

				  <td>We track specific bug reports and feature requests on our <a

				  href="https://github.com/citusdata/citus/issues">project

				  issues</a>.</td>

				</tr>

				<tr>

				  <td>Twitter</td>

				  <td>Follow <a href="https://twitter.com/citusdata">@citusdata</a>

				  for general updates and PostgreSQL scaling tips.</td>

				</tr>

				<tr>

				  <td>Training and Support</td>

				  <td>See our <a

				  href="https://www.citusdata.com/citus-products/citus-data-pricing">support

				  page</a> for training and dedicated support options.</td>

				</tr>

				</table>

				Developers choose Citus for two reasons:

				### Contributing

				1. Your application is outgrowing a single PostgreSQL node

				Citus is built on and of open source. We welcome your contributions,

				and have added a

				[helpwanted](https://github.com/citusdata/citus/labels/helpwanted) label

				to issues which are accessible to new contributors. The

				[CONTRIBUTING.md](CONTRIBUTING.md) file explains how to get started

				developing the Citus extension itself and our code quality guidelines.

					If the size and volume of your data increase over time, you may start seeing any number of performance and scalability problems on a single PostgreSQL node. For example: high CPU utilization and I/O wait times slow down your queries, SQL queries return out-of-memory errors, autovacuum cannot keep up and increases table bloat, etc.

				### Who is Using Citus?

					With Citus you can distribute and optionally compress your tables to always have enough memory, CPU, and I/O capacity to achieve high performance at scale. The distributed query engine can efficiently route transactions across the cluster, while parallelizing analytical queries and batch operations across all cores. Moreover, you can still use the PostgreSQL features and tools you know and love.

				Citus is deployed in production by many customers, ranging from

				technology start-ups to large enterprises. Here are some examples:

				2. PostgreSQL can do things other systems can’t

				* [CloudFlare](https://www.cloudflare.com/) uses Citus to provide

				real-time analytics on 100 TBs of data from over 4 million customer

				websites. [Case

				Study](https://blog.cloudflare.com/scaling-out-postgresql-for-cloudflare-analytics-using-citusdb/)

				* [MixRank](https://mixrank.com/) uses Citus to efficiently collect

				and analyze vast amounts of data to allow inside B2B sales teams

				to find new customers. [Case

				Study](https://www.citusdata.com/solutions/case-studies/mixrank-case-study)

				* [Neustar](https://www.neustar.biz/) builds and maintains scalable

				ad-tech infrastructure that counts billions of events per day using

				Citus and HyperLogLog.

				* [Agari](https://www.agari.com/) uses Citus to secure more than

				85 percent of U.S. consumer emails on two 6-8 TB clusters. [Case

				Study](https://www.citusdata.com/solutions/case-studies/agari-case-study)

				* [Heap](https://heapanalytics.com/) uses Citus to run dynamic

				funnel, segmentation, and cohort queries across billions of users

				and tens of billions of events. [Watch

				Video](https://www.youtube.com/watch?v=NVl9_6J1G60&list=PLixnExCn6lRpP10ZlpJwx6AuU3XIgNWpL)

					There are many data processing systems that are built to scale out, but few have as many powerful capabilities as PostgreSQL, including: Advanced joins and subqueries, user-defined functions, update/delete/upsert, constraints and foreign keys, powerful extensions (e.g. PostGIS, HyperLogLog), many types of indexes, time-partitioning, and sophisticated JSON support.

					Citus makes PostgreSQL’s most powerful capabilities work at any scale, allowing you to handle complex data-intensive workloads on a single database system.

				## Getting Started

				The quickest way to get started with Citus is to use the [Azure Cosmos DB for PostgreSQL](https://learn.microsoft.com/azure/cosmos-db/postgresql/quickstart-create-portal) managed service in the cloud—or [set up Citus locally](https://docs.citusdata.com/en/stable/installation/single_node.html).

				### Citus Managed Service on Azure

				You can get a fully-managed Citus cluster in minutes through the [Azure Cosmos DB for PostgreSQL portal](https://azure.microsoft.com/products/cosmos-db/). Azure will manage your backups, high availability through auto-failover, software updates, monitoring, and more for all of your servers. To get started with Citus on Azure, use the [Azure Cosmos DB for PostgreSQL Quickstart](https://learn.microsoft.com/azure/cosmos-db/postgresql/quickstart-create-portal).

				### Running Citus using Docker

				The smallest possible Citus cluster is a single PostgreSQL node with the Citus extension, which means you can try out Citus by running a single Docker container.

				```bash

				# run PostgreSQL with Citus on port 5500

				docker run -d --name citus -p 5500:5432 -e POSTGRES_PASSWORD=mypassword citusdata/citus

				# connect using psql within the Docker container

				docker exec -it citus psql -U postgres

				# or, connect using local psql

				psql -U postgres -d postgres -h localhost -p 5500

				```

				### Install Citus locally

				If you already have a local PostgreSQL installation, the easiest way to install Citus is to use our packaging repo.

				Install packages on Ubuntu / Debian:

				```bash

				curl https://install.citusdata.com/community/deb.sh > add-citus-repo.sh

				sudo bash add-citus-repo.sh

				sudo apt-get -y install postgresql-17-citus-13.0

				```

				Install packages on Red Hat:

				```bash

				curl https://install.citusdata.com/community/rpm.sh > add-citus-repo.sh

				sudo bash add-citus-repo.sh

				sudo yum install -y citus130_17

				```

				To add Citus to your local PostgreSQL database, add the following to `postgresql.conf`:

				```

				shared_preload_libraries = 'citus'

				```

				After restarting PostgreSQL, connect using `psql` and run:

				```sql

				CREATE EXTENSION citus;

				```

				You’re now ready to get started and use Citus tables on a single node.
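				To confirm that the extension is loaded, a quick check along these lines may help. This is a minimal sketch, assuming you are connected with `psql` to the database in which `CREATE EXTENSION citus` was run; the output depends on your installation.

				```sql
				-- show the version string of the installed Citus extension
				SELECT citus_version();

				-- confirm the extension is registered in the current database
				SELECT extname, extversion FROM pg_extension WHERE extname = 'citus';
				```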

				### Install Citus on multiple nodes

				If you want to set up a multi-node cluster, you can also set up additional PostgreSQL nodes with the Citus extension and add them to form a Citus cluster:

				```sql

				-- before adding the first worker node, tell future worker nodes how to reach the coordinator

				SELECT citus_set_coordinator_host('10.0.0.1', 5432);

				-- add worker nodes

				SELECT citus_add_node('10.0.0.2', 5432);

				SELECT citus_add_node('10.0.0.3', 5432);

				-- rebalance the shards over the new worker nodes

				SELECT rebalance_table_shards();

				```

				For more details, see our [documentation on how to set up a multi-node Citus cluster](https://docs.citusdata.com/en/stable/installation/multi_node.html) on various operating systems.
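				After adding nodes, you may want to check what the coordinator knows about the cluster. The following is a small sketch, assuming it is run on the coordinator; the addresses returned will be whatever you passed to `citus_add_node`.

				```sql
				-- list the worker nodes that are currently active
				SELECT * FROM citus_get_active_worker_nodes();

				-- inspect the node metadata stored in the Citus catalog
				SELECT nodeid, nodename, nodeport, noderole, isactive
				FROM pg_dist_node
				ORDER BY nodeid;
				```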

				## Using Citus

				Once you have your Citus cluster, you can start creating distributed tables, reference tables and use columnar storage.

				### Creating Distributed Tables

				The `create_distributed_table` UDF will transparently shard your table locally or across the worker nodes:

				```sql

				CREATE TABLE events (

				  device_id bigint,

				  event_id bigserial,

				  event_time timestamptz default now(),

				  data jsonb not null,

				  PRIMARY KEY (device_id, event_id)

				);

				-- distribute the events table across shards placed locally or on the worker nodes

				SELECT create_distributed_table('events', 'device_id');

				```

				After this operation, queries for a specific device ID will be efficiently routed to a single worker node, while queries across device IDs will be parallelized across the cluster.

				```sql

				-- insert some events

				INSERT INTO events (device_id, data)

				SELECT s % 100, ('{"measurement":'||random()||'}')::jsonb FROM generate_series(1,1000000) s;

				-- get the last 3 events for device 1, routed to a single node

				SELECT * FROM events WHERE device_id = 1 ORDER BY event_time DESC, event_id DESC LIMIT 3;

				┌───────────┬──────────┬───────────────────────────────┬───────────────────────────────────────┐

				│ device_id │ event_id │          event_time           │                 data                  │

				├───────────┼──────────┼───────────────────────────────┼───────────────────────────────────────┤

				│         1 │  1999901 │ 2021-03-04 16:00:31.189963+00 │ {"measurement": 0.88722643925054}     │

				│         1 │  1999801 │ 2021-03-04 16:00:31.189963+00 │ {"measurement": 0.6512231304621992}   │

				│         1 │  1999701 │ 2021-03-04 16:00:31.189963+00 │ {"measurement": 0.019368766051897524} │

				└───────────┴──────────┴───────────────────────────────┴───────────────────────────────────────┘

				(3 rows)

				Time: 4.588 ms

				-- explain plan for a query that is parallelized across shards, which shows the plan for

				-- a query on one of the shards and how the aggregation across shards is done

				EXPLAIN (VERBOSE ON) SELECT count(*) FROM events;

				┌────────────────────────────────────────────────────────────────────────────────────┐

				│                                     QUERY PLAN                                     │

				├────────────────────────────────────────────────────────────────────────────────────┤

				│ Aggregate                                                                          │

				│   Output: COALESCE((pg_catalog.sum(remote_scan.count))::bigint, '0'::bigint)       │

				│   ->  Custom Scan (Citus Adaptive)                                                 │

				│         ...                                                                        │

				│         ->  Task                                                                   │

				│               Query: SELECT count(*) AS count FROM events_102008 events WHERE true │

				│               Node: host=localhost port=5432 dbname=postgres                       │

				│               ->  Aggregate                                                        │

				│                     ->  Seq Scan on public.events_102008 events                    │

				└────────────────────────────────────────────────────────────────────────────────────┘

				```
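				To see how the `events` table was sharded, you can query the Citus metadata. This is an illustrative sketch that assumes the `events` table from the example above; shard IDs, node names, and sizes will differ on your cluster.

				```sql
				-- find the shard that holds the rows for device 1
				SELECT get_shard_id_for_distribution_column('events', 1);

				-- list where each shard of the events table is placed and how large it is
				SELECT shardid, shard_name, nodename, nodeport, pg_size_pretty(shard_size) AS size
				FROM citus_shards
				WHERE table_name = 'events'::regclass
				ORDER BY shardid;
				```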

				### Creating Distributed Tables with Co-location

				Distributed tables that have the same distribution column can be co-located to enable high performance distributed joins and foreign keys between distributed tables.

				By default, distributed tables will be co-located based on the type of the distribution column, but you can define co-location explicitly with the `colocate_with` argument in `create_distributed_table`.

				```sql

				CREATE TABLE devices (

				  device_id bigint primary key,

				  device_name text,

				  device_type_id int

				);

				CREATE INDEX ON devices (device_type_id);

				-- co-locate the devices table with the events table

				SELECT create_distributed_table('devices', 'device_id', colocate_with := 'events');

				-- insert device metadata

				INSERT INTO devices (device_id, device_name, device_type_id)

				SELECT s, 'device-'||s, 55 FROM generate_series(0, 99) s;

				-- optionally: make sure the application can only insert events for a known device

				ALTER TABLE events ADD CONSTRAINT device_id_fk

				FOREIGN KEY (device_id) REFERENCES devices (device_id);

				-- get the average measurement across all devices of type 55, parallelized across shards

				SELECT avg((data->>'measurement')::double precision)

				FROM events JOIN devices USING (device_id)

				WHERE device_type_id = 55;

				┌────────────────────┐

				│        avg         │

				├────────────────────┤

				│ 0.5000191877513974 │

				└────────────────────┘

				(1 row)

				Time: 209.961 ms

				```

				Co-location also helps you scale [INSERT..SELECT](https://docs.citusdata.com/en/stable/articles/aggregation.html), [stored procedures](https://www.citusdata.com/blog/2020/11/21/making-postgres-stored-procedures-9x-faster-in-citus/), and [distributed transactions](https://www.citusdata.com/blog/2017/06/02/scaling-complex-sql-transactions/).
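				One way to check that two tables are actually co-located is to compare their `colocation_id` in the `citus_tables` view. A minimal sketch, assuming the `events` and `devices` tables from the examples above:

				```sql
				-- tables that share a colocation_id are co-located
				SELECT table_name, citus_table_type, distribution_column, colocation_id, shard_count
				FROM citus_tables
				WHERE table_name IN ('events'::regclass, 'devices'::regclass);
				```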

				### Distributing Tables without interrupting the application

				If you started with plain Postgres and decide to distribute tables later, while your application is still using them, you will want to avoid downtime for both reads and writes. The `create_distributed_table` command blocks writes (e.g., DML commands) on the table until the command finishes. With the `create_distributed_table_concurrently` command, your application can continue to read and write data while the command runs.

				```sql

				CREATE TABLE device_logs (

				  device_id bigint primary key,

				  log text

				);

				-- insert device logs

				INSERT INTO device_logs (device_id, log)

				SELECT s, 'device log:'||s FROM generate_series(0, 99) s;

				-- convert device_logs into a distributed table without interrupting the application

				SELECT create_distributed_table_concurrently('device_logs', 'device_id', colocate_with := 'devices');

				-- get the count of the logs, parallelized across shards

				SELECT count(*) FROM device_logs;

				┌───────┐

				│ count │

				├───────┤

				│   100 │

				└───────┘

				(1 row)

				Time: 48.734 ms

				```

				### Creating Reference Tables

				When you need fast joins or foreign keys that do not include the distribution column, you can use `create_reference_table` to replicate a table across all nodes in the cluster.

				```sql

				CREATE TABLE device_types (

				  device_type_id int primary key,

				  device_type_name text not null unique

				);

				-- replicate the table across all nodes to enable foreign keys and joins on any column

				SELECT create_reference_table('device_types');

				-- insert a device type

				INSERT INTO device_types (device_type_id, device_type_name) VALUES (55, 'laptop');

				-- optionally: make sure the application can only insert devices with known types

				ALTER TABLE devices ADD CONSTRAINT device_type_fk

				FOREIGN KEY (device_type_id) REFERENCES device_types (device_type_id);

				-- get the last 3 events for devices whose type name starts with laptop, parallelized across shards

				SELECT device_id, event_time, data->>'measurement' AS value, device_name, device_type_name

				FROM events JOIN devices USING (device_id) JOIN device_types USING (device_type_id)

				WHERE device_type_name LIKE 'laptop%' ORDER BY event_time DESC LIMIT 3;

				┌───────────┬───────────────────────────────┬─────────────────────┬─────────────┬──────────────────┐

				│ device_id │          event_time           │        value        │ device_name │ device_type_name │

				├───────────┼───────────────────────────────┼─────────────────────┼─────────────┼──────────────────┤

				│        60 │ 2021-03-04 16:00:31.189963+00 │ 0.28902084163415864 │ device-60   │ laptop           │

				│         8 │ 2021-03-04 16:00:31.189963+00 │ 0.8723803076285073  │ device-8    │ laptop           │

				│        20 │ 2021-03-04 16:00:31.189963+00 │ 0.8177634801548557  │ device-20   │ laptop           │

				└───────────┴───────────────────────────────┴─────────────────────┴─────────────┴──────────────────┘

				(3 rows)

				Time: 146.063 ms

				```

				Reference tables enable you to scale out complex data models and take full advantage of relational database features.
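				To see the replication at work, you can list the placements of the reference table. This is a sketch that assumes the `device_types` table from the example above; on a multi-node cluster it should return one row per node that holds a copy.

				```sql
				-- a reference table has a single shard with a placement on each node
				SELECT shard_name, nodename, nodeport
				FROM citus_shards
				WHERE table_name = 'device_types'::regclass
				ORDER BY nodename, nodeport;
				```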

				### Creating Tables with Columnar Storage

				To use columnar storage in your PostgreSQL database, all you need to do is add `USING columnar` to your `CREATE TABLE` statements and your data will be automatically compressed using the columnar access method.

				```sql

				CREATE TABLE events_columnar (

				  device_id bigint,

				  event_id bigserial,

				  event_time timestamptz default now(),

				  data jsonb not null

				)

				USING columnar;

				-- insert some data

				INSERT INTO events_columnar (device_id, data)

				SELECT d, '{"hello":"columnar"}' FROM generate_series(1,10000000) d;

				-- create a row-based table to compare

				CREATE TABLE events_row AS SELECT * FROM events_columnar;

				-- see the huge size difference!

				\d+

				                                          List of relations

				┌────────┬──────────────────────────────┬──────────┬───────┬─────────────┬────────────┬─────────────┐

				│ Schema │             Name             │   Type   │ Owner │ Persistence │    Size    │ Description │

				├────────┼──────────────────────────────┼──────────┼───────┼─────────────┼────────────┼─────────────┤

				│ public │ events_columnar              │ table    │ marco │ permanent   │ 25 MB      │             │

				│ public │ events_row                   │ table    │ marco │ permanent   │ 651 MB     │             │

				└────────┴──────────────────────────────┴──────────┴───────┴─────────────┴────────────┴─────────────┘

				(2 rows)

				```

				You can use columnar storage by itself, or in a distributed table to combine the benefits of compression and the distributed query engine.

				When using columnar storage, you should only load data in batch using `COPY` or `INSERT..SELECT` to achieve good compression. Update, delete, and foreign keys are currently unsupported on columnar tables. However, you can use partitioned tables in which newer partitions use row-based storage, and older partitions are compressed using columnar storage.
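				The mixed row-based and columnar layout described above could look roughly like this. The table and partition names are hypothetical, and the sketch assumes the older partition no longer receives updates or deletes before it is converted with `alter_table_set_access_method`.

				```sql
				-- hypothetical time-partitioned table; partitions are row-based by default
				CREATE TABLE events_partitioned (
				  device_id bigint,
				  event_time timestamptz not null,
				  data jsonb not null
				) PARTITION BY RANGE (event_time);

				CREATE TABLE events_2021_01 PARTITION OF events_partitioned
				  FOR VALUES FROM ('2021-01-01') TO ('2021-02-01');

				CREATE TABLE events_2021_02 PARTITION OF events_partitioned
				  FOR VALUES FROM ('2021-02-01') TO ('2021-03-01');

				-- compress the older partition by converting it to columnar storage
				SELECT alter_table_set_access_method('events_2021_01', 'columnar');
				```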

				To learn more about columnar storage, check out the [columnar storage README](https://github.com/citusdata/citus/blob/master/src/backend/columnar/README.md).

				## Schema-based sharding

				Available since Citus 12.0, [schema-based sharding](https://docs.citusdata.com/en/stable/get_started/concepts.html#schema-based-sharding) follows the shared database, separate schema model: the schema becomes the logical shard within the database. Multi-tenant apps can use a schema per tenant to easily shard along the tenant dimension. Query changes are not required, and the application usually only needs a small modification to set the proper `search_path` when switching tenants. Schema-based sharding is an ideal solution for microservices, and for ISVs deploying applications that cannot undergo the changes required to onboard row-based sharding.

				### Creating distributed schemas

				You can turn an existing schema into a distributed schema by calling `citus_schema_distribute`:

				```sql

				SELECT citus_schema_distribute('user_service');

				```

				Alternatively, you can set `citus.enable_schema_based_sharding` to have all newly created schemas be automatically converted into distributed schemas:

				```sql

				SET citus.enable_schema_based_sharding TO ON;

				CREATE SCHEMA AUTHORIZATION user_service;

				CREATE SCHEMA AUTHORIZATION time_service;

				CREATE SCHEMA AUTHORIZATION ping_service;

				```
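				To review which schemas are distributed, or to revert one, something like the following may be useful. This is a sketch that assumes Citus 12.0 or later and the `user_service` schema from the example above.

				```sql
				-- list distributed schemas along with their size and owner
				SELECT * FROM citus_schemas;

				-- turn a distributed schema back into a regular schema
				SELECT citus_schema_undistribute('user_service');
				```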

				### Running queries

				Queries will be properly routed to schemas based on `search_path` or by explicitly using the schema name in the query.

				For [microservices](https://docs.citusdata.com/en/stable/get_started/tutorial_microservices.html), you would create a USER per service matching the schema name, so the default `search_path` would contain the schema name. Once connected, the user’s queries would be routed automatically, and no changes to the microservice would be required.

				```sql

				CREATE USER user_service;

				CREATE SCHEMA AUTHORIZATION user_service;

				```

				For typical multi-tenant applications, you would set the search path to the tenant schema name in your application:

				```sql

				SET search_path = tenant_name, public;

				```
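				Put together, a tenant-per-schema flow might look like the sketch below. The `tenant_a` schema and `orders` table are hypothetical names used only for illustration, and the example assumes `citus.enable_schema_based_sharding` is turned on before the schema is created.

				```sql
				-- new schemas become distributed schemas while this setting is on
				SET citus.enable_schema_based_sharding TO ON;

				-- hypothetical tenant schema and table
				CREATE SCHEMA tenant_a;
				CREATE TABLE tenant_a.orders (
				  order_id bigint primary key,
				  total numeric not null
				);

				-- the application selects the tenant by setting the search path
				SET search_path = tenant_a, public;

				-- unqualified table names now resolve to the tenant's schema
				INSERT INTO orders (order_id, total) VALUES (1, 42.50);
				SELECT count(*) FROM orders;
				```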

				## Setting up with High Availability

				One of the most popular high availability solutions for PostgreSQL, [Patroni 3.0](https://github.com/zalando/patroni), has [first class support for Citus 10.0 and above](https://patroni.readthedocs.io/en/latest/citus.html#citus). Additionally, Citus 11.2 ships with improvements for smoother node switchover in Patroni.

				An example of `patronictl list` output for the Citus cluster:

				```bash

				postgres@coord1:~$ patronictl list demo

				```

				```text

				+ Citus cluster: demo ----------+--------------+---------+----+-----------+

				| Group | Member  | Host        | Role         | State   | TL | Lag in MB |

				+-------+---------+-------------+--------------+---------+----+-----------+

				|     0 | coord1  | 172.27.0.10 | Replica      | running |  1 |         0 |

				|     0 | coord2  | 172.27.0.6  | Sync Standby | running |  1 |         0 |

				|     0 | coord3  | 172.27.0.4  | Leader       | running |  1 |           |

				|     1 | work1-1 | 172.27.0.8  | Sync Standby | running |  1 |         0 |

				|     1 | work1-2 | 172.27.0.2  | Leader       | running |  1 |           |

				|     2 | work2-1 | 172.27.0.5  | Sync Standby | running |  1 |         0 |

				|     2 | work2-2 | 172.27.0.7  | Leader       | running |  1 |           |

				+-------+---------+-------------+--------------+---------+----+-----------+

				```

				## Documentation

				If you’re ready to get started with Citus or want to know more, we recommend reading the [Citus open source documentation](https://docs.citusdata.com/en/stable/). Or, if you are using Citus on Azure, then the [Azure Cosmos DB for PostgreSQL](https://learn.microsoft.com/azure/cosmos-db/postgresql/introduction) documentation is the place to start.

				Our Citus docs contain comprehensive use case guides on how to build a [multi-tenant SaaS application](https://docs.citusdata.com/en/stable/use_cases/multi_tenant.html), [real-time analytics dashboard](https://docs.citusdata.com/en/stable/use_cases/realtime_analytics.html), or work with [time series data](https://docs.citusdata.com/en/stable/use_cases/timeseries.html).

				## Architecture

				A Citus database cluster grows from a single PostgreSQL node into a cluster by adding worker nodes. In a Citus cluster, the original node to which the application connects is referred to as the coordinator node. The Citus coordinator contains both the metadata of distributed tables and reference tables, as well as regular (local) tables, sequences, and other database objects (e.g. foreign tables).

				Data in distributed tables is stored in “shards”, which are actually just regular PostgreSQL tables on the worker nodes. When querying a distributed table on the coordinator node, Citus will send regular SQL queries to the worker nodes. That way, all the usual PostgreSQL optimizations and extensions can automatically be used with Citus.

				![Citus architecture](images/citus-architecture.png)

				When you send a query in which all (co-located) distributed tables have the same filter on the distribution column, Citus will automatically detect that and send the whole query to the worker node that stores the data. That way, arbitrarily complex queries are supported with minimal routing overhead, which is especially useful for scaling transactional workloads. If queries do not have a specific filter, each shard is queried in parallel, which is especially useful in analytical workloads. The Citus distributed executor is adaptive and is designed to handle both query types at the same time on the same system under high concurrency, which enables large-scale mixed workloads.
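				The two query types can be compared with `EXPLAIN`. This is a sketch that assumes the `events` table from the "Using Citus" section; in the resulting plans, the `Task Count` reported under `Custom Scan (Citus Adaptive)` is expected to be 1 for the routed query and equal to the number of shards for the parallel one, though the exact output depends on your Citus version and setup.

				```sql
				-- filter on the distribution column: the whole query can be routed to one node
				EXPLAIN SELECT count(*) FROM events WHERE device_id = 1;

				-- no filter on the distribution column: one task per shard, run in parallel
				EXPLAIN SELECT count(*) FROM events;
				```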

				The schema and metadata of distributed tables and reference tables are automatically synchronized to all the nodes in the cluster. That way, you can connect to any node to run distributed queries. Schema changes and cluster administration still need to go through the coordinator.

				Detailed descriptions of the implementation for Citus developers are provided in the [Citus Technical Documentation](src/backend/distributed/README.md).

				## When to use Citus

				Citus is uniquely capable of scaling both analytical and transactional workloads with up to petabytes of data. Use cases in which Citus is commonly used:

				- **[Customer-facing analytics dashboards](http://docs.citusdata.com/en/stable/use_cases/realtime_analytics.html)**:

				  Citus enables you to build analytics dashboards that simultaneously ingest and process large amounts of data in the database and give sub-second response times even with a large number of concurrent users.

				  The advanced parallel, distributed query engine in Citus combined with PostgreSQL features such as [array types](https://www.postgresql.org/docs/current/arrays.html), [JSONB](https://www.postgresql.org/docs/current/datatype-json.html), [lateral joins](https://heap.io/blog/engineering/postgresqls-powerful-new-join-type-lateral), and extensions like [HyperLogLog](https://github.com/citusdata/postgresql-hll) and [TopN](https://github.com/citusdata/postgresql-topn) allow you to build responsive analytics dashboards no matter how many customers or how much data you have.

				  Example real-time analytics users: [Algolia](https://www.citusdata.com/customers/algolia)

				- **[Time series data](http://docs.citusdata.com/en/stable/use_cases/timeseries.html)**:

				  Citus enables you to process and analyze very large amounts of time series data. The biggest Citus clusters store well over a petabyte of time series data and ingest terabytes per day.

				  Citus integrates seamlessly with [Postgres table partitioning](https://www.postgresql.org/docs/current/ddl-partitioning.html) and has [built-in functions for partitioning by time](https://www.citusdata.com/blog/2021/10/22/how-to-scale-postgres-for-time-series-data-with-citus/), which can speed up queries and writes on time series tables. You can take advantage of Citus’s parallel, distributed query engine for fast analytical queries, and use the built-in *columnar storage* to compress old partitions.

				  Example users: [MixRank](https://www.citusdata.com/customers/mixrank)

				- **[Software-as-a-service (SaaS) applications](http://docs.citusdata.com/en/stable/use_cases/multi_tenant.html)**:

				  SaaS and other multi-tenant applications need to be able to scale their database as the number of tenants/customers grows. Citus enables you to transparently shard a complex data model by the tenant dimension, so your database can grow along with your business.

				  By distributing tables along a tenant ID column and co-locating data for the same tenant, Citus can horizontally scale complex (tenant-scoped) queries, transactions, and foreign key graphs. Reference tables and distributed DDL commands make database management a breeze compared to manual sharding. On top of that, you have a built-in distributed query engine for doing cross-tenant analytics inside the database.

				  Example multi-tenant SaaS users: [Salesloft](https://fivetran.com/case-studies/replicating-sharded-databases-a-case-study-of-salesloft-citus-data-and-fivetran), [ConvertFlow](https://www.citusdata.com/customers/convertflow)

				- **[Microservices](https://docs.citusdata.com/en/stable/get_started/tutorial_microservices.html)**: Citus supports schema-based sharding, which distributes regular database schemas across many machines. This sharding method fits a typical microservices architecture, where each service fully owns its storage and therefore cannot share a schema definition with other tenants. Citus allows distributing horizontally scalable state across services, solving one of the [main problems](https://stackoverflow.blog/2020/11/23/the-macro-problem-with-microservices/) of microservices.

				- **Geospatial**:

				  Because of the powerful [PostGIS](https://postgis.net/) extension to Postgres that adds support for geographic objects into Postgres, many people run spatial/GIS applications on top of Postgres. And since spatial location information has become part of our daily life, well, there are more geospatial applications than ever. When your Postgres database needs to scale out to handle an increased workload, Citus is a good fit.

				  Example geospatial users: [Helsinki Regional Transportation Authority (HSL)](https://customers.microsoft.com/story/845146-transit-authority-improves-traffic-monitoring-with-azure-database-for-postgresql-hyperscale), [MobilityDB](https://www.citusdata.com/blog/2020/11/09/analyzing-gps-trajectories-at-scale-with-postgres-mobilitydb/).

				## Need Help?

				- **Slack**: Ask questions in our Citus community [Slack channel](https://slack.citusdata.com).

				- **GitHub issues**: Please submit issues via [GitHub issues](https://github.com/citusdata/citus/issues).

				- **Documentation**: Our [Citus docs](https://docs.citusdata.com) have a wealth of resources, including sections on [query performance tuning](https://docs.citusdata.com/en/stable/performance/performance_tuning.html), [useful diagnostic queries](https://docs.citusdata.com/en/stable/admin_guide/diagnostic_queries.html), and [common error messages](https://docs.citusdata.com/en/stable/reference/common_errors.html).

				- **Docs issues**: You can also submit documentation issues via [GitHub issues for our Citus docs](https://github.com/citusdata/citus_docs/issues).

				- **Updates & Release Notes**: Learn about what's new in each Citus version on the [Citus Updates page](https://www.citusdata.com/updates/).

				## Contributing

				Citus is built on and of open source, and we welcome your contributions. The [CONTRIBUTING.md](CONTRIBUTING.md) file explains how to get started developing the Citus extension itself and our code quality guidelines.

				## Code of Conduct

				This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).

				For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or

				contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.

				## Stay Connected

				- **Twitter**: Follow us [@citusdata](https://twitter.com/citusdata) to track the latest posts & updates on what’s happening.

				- **Citus Blog**: Read our popular [Citus Open Source Blog](https://www.citusdata.com/blog/) for posts about PostgreSQL and Citus.

				- **Citus Newsletter**: Subscribe to our monthly technical [Citus Newsletter](https://www.citusdata.com/join-newsletter) to get a curated collection of our favorite posts, videos, docs, talks, & other Postgres goodies.

				- **Slack**: Our [Citus Public slack](https://slack.citusdata.com/) is a good way to stay connected, not just with us but with other Citus users.

				- **Sister Blog**: Read the PostgreSQL posts on the [Azure Cosmos DB for PostgreSQL blog](https://devblogs.microsoft.com/cosmosdb/category/postgresql/) about our managed service on Azure.

				- **Videos**: Check out this [YouTube playlist](https://www.youtube.com/playlist?list=PLixnExCn6lRq261O0iwo4ClYxHpM9qfVy) of some of our favorite Citus videos and demos. If you want to deep dive into how Citus extends PostgreSQL, you might want to check out Marco Slot’s talk at Carnegie Mellon titled [Citus: Distributed PostgreSQL as an Extension](https://youtu.be/X-aAgXJZRqM) that was part of Andy Pavlo’s Vaccination Database Talks series at CMUDB.

				- **Our other Postgres projects**: Our team also works on other awesome PostgreSQL open source extensions & projects, including: [pg_cron](https://github.com/citusdata/pg_cron), [HyperLogLog](https://github.com/citusdata/postgresql-hll), [TopN](https://github.com/citusdata/postgresql-topn), [pg_auto_failover](https://github.com/citusdata/pg_auto_failover), [activerecord-multi-tenant](https://github.com/citusdata/activerecord-multi-tenant), and [django-multitenant](https://github.com/citusdata/django-multitenant).

				___

				Copyright © 2012–2016 Citus Data, Inc.

				[faq]: https://www.citusdata.com/frequently-asked-questions

				[linux_install]: https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-compose-on-ubuntu-14-04

				[mac_install]: https://www.docker.com/products/docker-toolbox

				[tutorial]: http://docs.citusdata.com/en/v5.1/tutorials/tut-hash-distribution.html

				Copyright © Citus Data, Inc.

									
SECURITY.md Normal file
									
				@ -0,0 +1,41 @@

				<!-- BEGIN MICROSOFT SECURITY.MD V0.0.8 BLOCK -->

				## Security

				Microsoft takes the security of our software products and services seriously, which includes all source code repositories managed through our GitHub organizations, which include [Microsoft](https://github.com/microsoft), [Azure](https://github.com/Azure), [DotNet](https://github.com/dotnet), [AspNet](https://github.com/aspnet), [Xamarin](https://github.com/xamarin), and [our GitHub organizations](https://opensource.microsoft.com/).

				If you believe you have found a security vulnerability in any Microsoft-owned repository that meets [Microsoft's definition of a security vulnerability](https://aka.ms/opensource/security/definition), please report it to us as described below.

				## Reporting Security Issues

				**Please do not report security vulnerabilities through public GitHub issues.**

				Instead, please report them to the Microsoft Security Response Center (MSRC) at [https://msrc.microsoft.com/create-report](https://aka.ms/opensource/security/create-report).

				If you prefer to submit without logging in, send email to [secure@microsoft.com](mailto:secure@microsoft.com).  If possible, encrypt your message with our PGP key; please download it from the [Microsoft Security Response Center PGP Key page](https://aka.ms/opensource/security/pgpkey).

				You should receive a response within 24 hours. If for some reason you do not, please follow up via email to ensure we received your original message. Additional information can be found at [microsoft.com/msrc](https://aka.ms/opensource/security/msrc).

				Please include the requested information listed below (as much as you can provide) to help us better understand the nature and scope of the possible issue:

				  * Type of issue (e.g. buffer overflow, SQL injection, cross-site scripting, etc.)

				  * Full paths of source file(s) related to the manifestation of the issue

				  * The location of the affected source code (tag/branch/commit or direct URL)

				  * Any special configuration required to reproduce the issue

				  * Step-by-step instructions to reproduce the issue

				  * Proof-of-concept or exploit code (if possible)

				  * Impact of the issue, including how an attacker might exploit the issue

				This information will help us triage your report more quickly.

				If you are reporting for a bug bounty, more complete reports can contribute to a higher bounty award. Please visit our [Microsoft Bug Bounty Program](https://aka.ms/opensource/security/bounty) page for more details about our active programs.

				## Preferred Languages

				We prefer all communications to be in English.

				## Policy

				Microsoft follows the principle of [Coordinated Vulnerability Disclosure](https://aka.ms/opensource/security/cvd).

				<!-- END MICROSOFT SECURITY.MD BLOCK -->

									
STYLEGUIDE.md Normal file
									
				@ -0,0 +1,160 @@

				# Coding style

				The existing code style in our code base is not very consistent, for two main reasons. First, our code base is relatively old and our standards have changed over time. Second, our style guide differs from the Postgres style guide, and some code is copied from the Postgres source code and only slightly modified. The rules below are for new code. If you're changing existing code that uses a different style, use your best judgement to decide whether to follow the rules here or to match the existing style.

				## Using citus_indent

				The CI pipeline will automatically reject any PRs which do not follow our coding

				conventions. The easiest way to ensure your PR adheres to those conventions is

				to use the [citus_indent](https://github.com/citusdata/tools/tree/develop/uncrustify)

				tool. This tool uses `uncrustify` under the hood.

				```bash

				# Uncrustify changes the way it formats code slightly with every release. To make

				# sure everyone formats consistently we use version 0.68.1:

				curl -L https://github.com/uncrustify/uncrustify/archive/uncrustify-0.68.1.tar.gz | tar xz

				cd uncrustify-uncrustify-0.68.1/

				mkdir build

				cd build

				cmake ..

				make -j5

				sudo make install

				cd ../..

				git clone https://github.com/citusdata/tools.git

				cd tools

				make uncrustify/.install

				```

				Once you've done that, you can run the `make reindent` command from the top

				directory to recursively check and correct the style of any source files in the

				current directory. Under the hood, `make reindent` will run `citus_indent` and

				some other style corrections for you.

				You can also run the following in the directory of this repository to

				automatically format all the files that you have changed before committing:

				```bash

				cat > .git/hooks/pre-commit << __EOF__

				#!/bin/bash

				citus_indent --check --diff || { citus_indent --diff; exit 1; }

				__EOF__

				chmod +x .git/hooks/pre-commit

				```

				## Other rules we follow that citus_indent does not enforce

				* We almost always use **CamelCase** when naming functions, variables, etc., **not snake_case**.

				* We also habitually use **lowerCamelCase** for some variables named after their type or after their function, as shown in the examples:

				  ```c

				  bool IsCitusExtensionLoaded = false;

				  bool

				  IsAlterTableRenameStmt(RenameStmt *renameStmt)

				  {

				    AlterTableCmd *alterTableCommand = NULL;

				    ..

				    ..

				    bool isAlterTableRenameStmt = false;

				    ..

				  }

				  ```

				* We **start functions with a comment**:

				  ```c

				  /*

				   * MyNiceFunction <something in present simple tense, e.g., processes / returns / checks / takes X as input / does Y> ..

				   * <some more nice words> ..

				   * <some more nice words> ..

				   */

				  <static?> <return type>

				  MyNiceFunction(..)

				  {

				    ..

				    ..

				  }

				  ```

				* `#include`s need to be sorted based on the ordering below and then alphabetically, and we should not include what we don't need in a file:

				  * System includes (e.g. #include<...>)

				  * Postgres.h (e.g. #include "postgres.h")

				  * Toplevel imports from postgres, not contained in a directory (e.g. #include "miscadmin.h")

				  * General postgres includes (e.g. #include "nodes/...")

				  * Toplevel citus includes, not contained in a directory (e.g. #include "citus_version.h")

				  * Columnar includes (e.g. #include "columnar/...")

				  * Distributed includes (e.g. #include "distributed/...")

				* Comments:

				  ```c

				  /* single line comments start with a lower-case letter */

				  /*

				   * We start multi-line comments with a capital letter

				   * and keep adding a star to the beginning of each line

				   * until we close the comment with a star and a slash.

				   */

				  ```

				* Order of function implementations and their declarations in a file:

				  We define static functions after the functions that call them. For example:

				  ```c

				  #include<..>

				  #include<..>

				  ..

				  ..

				  typedef struct

				  {

				    ..

				    ..

				  } MyNiceStruct;

				  ..

				  ..

				  PG_FUNCTION_INFO_V1(my_nice_udf1);

				  PG_FUNCTION_INFO_V1(my_nice_udf2);

				  ..

				  ..

				  // ..  somewhere on top of the file …

				  static void MyNiceStaticlyDeclaredFunction1(…);

				  static void MyNiceStaticlyDeclaredFunction2(…);

				  ..

				  ..

				  void

				  MyNiceFunctionExternedViaHeaderFile(..)

				  {

				    ..

				    ..

				    MyNiceStaticlyDeclaredFunction1(..);

				    ..

				    ..

				    MyNiceStaticlyDeclaredFunction2(..);

				    ..

				  }

				  ..

				  ..

				  // we define this first because it's called by MyNiceFunctionExternedViaHeaderFile()

				  // before MyNiceStaticlyDeclaredFunction2()

				  static void

				  MyNiceStaticlyDeclaredFunction1(…)

				  {

				  }

				  ..

				  ..

				  // then we define this

				  static void

				  MyNiceStaticlyDeclaredFunction2(…)

				  {

				  }

				  ```

aclocal.m4 vendored Normal file

 @ -0,0 +1,2 @@
 dnl aclocal.m4
 m4_include([config/general.m4])

									
autogen.sh
									
				@ -1,6 +1,6 @@

				#!/bin/bash

				#

				# autogen.sh converts configure.in to configure and creates

				# autogen.sh converts configure.ac to configure and creates

				# citus_config.h.in. The resulting files are checked into

				# the SCM, to avoid everyone needing autoconf installed.

									
cgmanifest.json Normal file
									
				@ -0,0 +1,47 @@

				{

				    "Registrations": [

				        {

				            "Component": {

				                "Type": "git",

				                "git": {

				                    "RepositoryUrl": "https://github.com/intel/safestringlib",

				                    "CommitHash": "245c4b8cff1d2e7338b7f3a82828fc8e72b29549"

				                }

				            },

				            "DevelopmentDependency": false

				        },

				        {

				            "Component": {

				                "Type": "git",

				                "git": {

				                    "RepositoryUrl": "https://github.com/postgres/postgres",

				                    "CommitHash": "29be9983a64c011eac0b9ee29895cce71e15ea77"

				                }

				            },

				            "license": "PostgreSQL",

				            "licenseDetail": [

								"Portions Copyright (c) 1996-2010, The PostgreSQL Global Development Group",

								"",

								"Portions Copyright (c) 1994, The Regents of the University of California",

				                "",

				                "Permission to use, copy, modify, and distribute this software and its documentation for ",

				                "any purpose, without fee, and without a written agreement is hereby granted, provided ",

				                "that the above copyright notice and this paragraph and the following two paragraphs appear ",

				                "in all copies.",

				                "",

				                "IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, ",

				                "INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS ",

				                "SOFTWARE AND ITS DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE ",

				                "POSSIBILITY OF SUCH DAMAGE.",

				                "",

				                "THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES, INCLUDING, BUT NOT LIMITED TO, ",

				                "THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE PROVIDED ",

				                "HEREUNDER IS ON AN \"AS IS\" BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE ",

				                "MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS."

				            ],

				            "version": "0.0.1",

				            "DevelopmentDependency": false

				        }

				    ]

				}

									
ci/README.md Normal file
									
				@ -0,0 +1,402 @@

				# CI scripts

				We have a few scripts that we run in CI to confirm that code conforms to our

				standards. Be sure you have followed the setup in the [Following our coding

				conventions](https://github.com/citusdata/citus/blob/master/CONTRIBUTING.md#following-our-coding-conventions)

				section of `CONTRIBUTING.md`. Once you've done that, most issues should be

				fixed automatically when you run:

				```

				make reindent

				```

				See the sections below for details on what a specific failing script means.

				## `citus_indent`

				We format all our code using the coding conventions in the

				[citus_indent](https://github.com/citusdata/tools/tree/develop/uncrustify)

				tool. This tool uses `uncrustify` under the hood. See [Following our coding

				conventions](https://github.com/citusdata/citus/blob/master/CONTRIBUTING.md#following-our-coding-conventions) on how to install this.

				## `editorconfig.sh`

				You should install the EditorConfig plugin for your editor/IDE:

				https://editorconfig.org/

				## `banned.h.sh`

				You're using a C library function that is banned by Microsoft, mostly because of

				the risk of buffer overflows. This page lists the Microsoft suggested replacements:

				https://liquid.microsoft.com/Web/Object/Read/ms.security/Requirements/Microsoft.Security.SystemsADM.10082#guide

				These replacements are only available on Windows normally. Since we build for

				Linux we make most of them available with this header file:

				```c

				#include "distributed/citus_safe_lib.h"

				```

				This uses https://github.com/intel/safestringlib to provide them.

				However, some of them are still not available. For those cases we provide

				some extra functions in `citus_safe_lib.h`, with similar functionality.

				If none of those replacements match your requirements you have to do one of the

				following:

				1. Add a replacement to `citus_safe_lib.{c,h}` that handles the same error cases

				   as the `{func_name}_s` function that Microsoft suggests.

				2. Add a `/* IGNORE-BANNED */` comment to the line that complains. Doing this

				   also requires adding a comment before it explaining why this specific use of the

				   function is safe.

				## `build-citus.sh`

				This is the script used during the build phase of the extension. Historically this script

				was embedded in the docker images. This made maintenance a hassle. Now it lives in tree

				with the rest of the source code.

				When this script fails you most likely have a build error on the postgres version it was

				building at the time of the failure. Fix the compile error and push a new version of your

				code to fix.

				## `check_enterprise_merge.sh`

				This check exists to make sure that we can always merge the `master` branch of

				`community` into the `enterprise-master` branch of the `enterprise` repo.

				There are two conditions in which this check passes:

				1. There are no merge conflicts between your PR branch and `enterprise-master` and after this merge the code compiles.

				2. There are merge conflicts, but there is a branch with the same name in the

				   enterprise repo that:

				   1. Contains the last commit of the community branch with the same name.

				   2. Merges cleanly into `enterprise-master`.

				   3. Compiles after the merge.

				If the job already passes, you are done, nothing further required! Otherwise

				follow the below steps.

				### Prerequisites

				Before continuing with the real steps make sure you have done the following

				(this only needs to be done once):

				1. You have enabled `git rerere` globally or in your enterprise repo

				   ([docs](https://git-scm.com/docs/git-rerere), [very useful blog](https://medium.com/@porteneuve/fix-conflicts-only-once-with-git-rerere-7d116b2cec67#.3vui844dt)):

				   ```bash

				   # Enables it globally for all repos

				   git config --global rerere.enabled true

				   # Enables it only for the enterprise repo

				   cd <enterprise-repo>

				   git config rerere.enabled true

				   ```

				2. You have set up the `community` remote on your enterprise repo as

				   [described in CONTRIBUTING.md](https://github.com/citusdata/citus-enterprise/blob/enterprise-master/CONTRIBUTING.md#merging-community-changes-onto-enterprise).

				#### Important notes on `git rerere`

				`git rerere` is very useful because it makes git automatically redo merges that

				you have done before. However, this has a downside too: it will also redo merges

				that you did, but that were incorrect. To work around this you can use these

				commands:

				1. Make `git rerere` forget a merge:

				   ```bash

				   git rerere forget <badly_merged_file>

				   ```

				2. During conflict resolution where `git rerere` already applied the bad merge,

				   simply forgetting it is not enough, since it is already applied. In that case

				   you also have to undo the application using:

				   ```bash

				   git checkout --conflict=merge <badly_merged_file>

				   ```

				### Actual steps

				After the prerequisites are met we continue on to the real steps. Say your

				branch name is `$PR_BRANCH`; we will refer to `$PR_BRANCH` on community as

				`community/$PR_BRANCH` and on enterprise as `enterprise/$PR_BRANCH`. First make

				sure these two things are the case:

				1. Get approval from your reviewer for `community/$PR_BRANCH`. Only follow the

				   next steps when you are about to merge the branch to community master.

				2. Make sure your commits are in a nice state, since you should not do

				   "squash and merge" on GitHub later. Otherwise you will certainly get

				   duplicate commits and possibly get merge conflicts with enterprise again.

				Once that's done, you need to create a merged version of your PR branch on the

				enterprise repo. For example if `community` is added as a remote in

				your enterprise repo, you can do the following:

				```bash

				export PR_BRANCH=<YOUR BRANCHNAME OF THE PR HERE>

				git checkout enterprise-master

				git pull # Make sure your local enterprise-master is up to date

				git fetch community # Fetch your up to date branch name

				git checkout -b "$PR_BRANCH" enterprise-master

				```

				Now you have a new branch in your enterprise repo, which we refer to as

				`enterprise/$PR_BRANCH` (even though in git commands you would reference it as

				`origin/$PR_BRANCH`). This branch is currently the same as `enterprise-master`.

				First to make review easier, you should merge community master into it. This

				should apply without any merge conflicts:

				```bash

				git merge community/master

				```

				Now you need to merge `community/$PR_BRANCH` to `enterprise/$PR_BRANCH`. Solve

				any conflicts, and also remove any parts that should not be in enterprise even

				if they did not cause a conflict. In the enterprise repository, run:

				```bash

				git merge "community/$PR_BRANCH"

				```

				1. You should push this branch to the enterprise repo. This is so that the job

				   on community will see this branch.

				2. Wait until tests on `enterprise/$PR_BRANCH` pass.

				3. Create a PR on the enterprise repo for your `enterprise/$PR_BRANCH` branch.

				4. You should get approval for the merge conflict changes on

				   `enterprise/$PR_BRANCH`, preferably from the same reviewer as they are

				   familiar with the change.

				5. You should rerun the `check-merge-to-enterprise` check on

				   `community/$PR_BRANCH`. You can use the re-run from failed option in CircleCI.

				6. You can now merge the PR on community. Be sure to NOT use "squash and merge",

				   but instead use the regular "merge commit" mode.

				7. You can now merge the PR on enterprise. Be sure to NOT use "squash and merge",

				   but instead use the regular "merge commit" mode.

				The subsequent PRs on community will be able to pass the

				`check-merge-to-enterprise` check as long as they don't have a conflict with

				`enterprise-master`.

				### What to do when your branch got outdated?

				So there's one issue that can occur. Your branch will become outdated with

				master and you have to make it up to date. There are two ways to do this: using

				`git merge` or `git rebase`. As usual, `git merge` is a bit easier than `git

				rebase`, but clutters git history. This section will explain both. If you don't

				know which one makes the most sense, start with `git rebase`. It's possible that

				for whatever reason this doesn't work or becomes very complex, for instance when

				new merge conflicts appear. Feel free to fall back to `git merge` in that case,

				by using `git rebase --abort`.

				#### Updating both branches with `git rebase`

				In the community repo, first update the outdated branch using `rebase`:

				```bash

				git checkout $PR_BRANCH

				# Keep a backup in case you want to fall back to the merge approach

				git checkout -b ${PR_BRANCH}-backup

				git checkout $PR_BRANCH

				# Actually update the branch

				git fetch origin

				git rebase origin/master

				git push origin $PR_BRANCH --force-with-lease

				```

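				In the enterprise repo, rebase onto the new community branch with

				`--preserve-merges` (this option was deprecated in Git 2.22 and removed in Git 2.34;

				newer Git versions provide `--rebase-merges` instead):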

				```bash

				git checkout $PR_BRANCH

				git fetch community

				git rebase community/$PR_BRANCH --preserve-merges

				```

				Automatic merge might have failed with the above command. However, because of

				`git rerere` it should have re-applied your original merge resolution. If this

				is indeed the case it should show something like this in the output of the

				previous command (note the `Resolved ...` line):

				```

				CONFLICT (content): Merge conflict in <file_path>

				Resolved '<file_path>' using previous resolution.

				Automatic merge failed; fix conflicts and then commit the result.

				Error redoing merge <merge_sha>

				```

				Confirm that the merge conflict is indeed resolved correctly. In that case you

				can do the following:

				```bash

				# Add files that were conflicting

				git diff --name-only --diff-filter=U -z | xargs -0 git add

				git rebase --continue

				```

				Before pushing you should do a final check that the commit hash of your final

				non-merge commit matches the commit hash that's on the community repo. If that's

				not the case, reset the branch and fall back to the `git merge` approach:

				```bash

				git reset origin/$PR_BRANCH --hard

				```

				If the commit hashes were as expected, push the branch:

				```bash

				git push origin $PR_BRANCH --force-with-lease

				```

				#### Updating both branches with `git merge`

				If you are falling back to the `git merge` approach after trying the

				`git rebase` approach, you should first restore the original branch on the

				community repo.

				```bash

				git checkout $PR_BRANCH

				git reset ${PR_BRANCH}-backup --hard

				git push origin $PR_BRANCH --force-with-lease

				```

				In the community repo, first update the outdated branch using `merge`:

				```bash

				git checkout $PR_BRANCH

				git fetch origin

				git merge origin/master

				git push origin $PR_BRANCH

				```

				In the enterprise repo, merge with the updated `community/$PR_BRANCH`:

				```bash

				git checkout $PR_BRANCH

				git fetch community

				git merge community/$PR_BRANCH

				git push origin $PR_BRANCH

				```

				## `check_sql_snapshots.sh`

				To allow for better diffs during review we have snapshots of SQL UDFs. If this

				check fails, `latest.sql` is not up to date with the SQL file of the highest

				version number in the directory. The output of the script shows you what is

				different.
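
As a rough illustration of what this check compares, the sketch below finds the highest-versioned SQL file in one directory and diffs it against `latest.sql`. It is not the actual `check_sql_snapshots.sh`: the function name `check_snapshot_dir`, the one-directory-per-UDF layout, and the use of `sort -V` (version sort, as provided by GNU coreutils) are assumptions made for the example.

```shell
# Hypothetical helper (not the real CI script): succeeds when latest.sql in
# the given directory is identical to the highest-versioned .sql file there.
check_snapshot_dir() {
    local dir="$1" newest
    # List every versioned snapshot except latest.sql and keep the one that
    # sorts last in version order (so 10.0-1.sql sorts after 9.0-1.sql).
    newest=$(find "$dir" -maxdepth 1 -name '*.sql' ! -name 'latest.sql' | sort -V | tail -n 1)
    # No versioned file to compare against, so there is nothing to report.
    [ -n "$newest" ] || return 0
    # diff prints the differences and returns non-zero when the files differ.
    diff -u "$newest" "$dir/latest.sql"
}
```

Calling `check_snapshot_dir <directory>` prints a unified diff when the snapshot is stale, which mirrors the kind of output the README says the script shows.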

				## `check_all_tests_are_run.sh`

				A test should always be included in a schedule file, otherwise it will not be

				run in CI. This is most commonly forgotten for newly added tests. In that case

				the dev ran it locally without running a full schedule with something like:

				```bash

				make -C src/test/regress/ check-minimal EXTRA_TESTS='multi_create_table_new_features'

				```
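
The idea behind this check can be sketched as follows. This is an illustration under stated assumptions rather than the real `check_all_tests_are_run.sh`: the function name `find_unscheduled_tests` is hypothetical, tests are assumed to be `*.sql` files in a single directory, and a test counts as scheduled if its name appears as a whole word in any of the given schedule files.

```shell
# Hypothetical helper (not the real CI script): prints the name of every
# test in <test_dir>/*.sql that none of the given schedule files mentions.
find_unscheduled_tests() {
    local test_dir="$1" f name
    shift
    # Without schedule files grep would wait on stdin, so bail out early.
    [ "$#" -gt 0 ] || return 2
    for f in "$test_dir"/*.sql; do
        [ -e "$f" ] || continue   # the glob matched nothing
        name=$(basename "$f" .sql)
        # -w matches the name only as a whole word, -F treats it literally,
        # -q suppresses output.
        grep -qwF -- "$name" "$@" || echo "$name"
    done
}
```

Matching whole words matters here: without `-w`, a test named `foo` would wrongly count as scheduled whenever a longer name such as `foo_extra` appears in a schedule.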

				## `check_all_ci_scripts_are_run.sh`

				This is the meta CI script. This checks that all existing CI scripts are

				actually run in CI. This is most commonly forgotten for newly added CI tests

				that the developer only ran locally. It also checks that all CI scripts have a

				section in this `README.md` file and that they include `ci/ci_helpers.sh`.

				## `check_migration_files.sh`

				A branch that touches a set of upgrade scripts is also expected to touch

				corresponding downgrade scripts as well. If this script fails, read the output

				and make sure you update the downgrade scripts in the printed list. If you

				really don't need a downgrade to run any SQL, you can write a comment in the

				file explaining why a downgrade step is not necessary.

				## `disallow_c_comments_in_migrations.sh`

				We do not use C-style comments in migration files as the stripped

				zero-length migration files cause warnings during packaging.

				Instead, use SQL-style comments, i.e.:

				```

				-- this is a comment

				```

				See [#3115](https://github.com/citusdata/citus/pull/3115) for more info.
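
A minimal sketch of such a check is shown below. It is not the real `disallow_c_comments_in_migrations.sh`; the function name `has_no_c_comments` is hypothetical, and the check simply looks for the literal opening sequence `/*` in the files it is given.

```shell
# Hypothetical helper (not the real CI script): fails if any of the given
# files contains the start of a C-style comment, and prints those files.
has_no_c_comments() {
    # With no files grep would read stdin; treat that case as a pass.
    [ "$#" -gt 0 ] || return 0
    # -l prints only the names of matching files, -F matches "/*" literally.
    if grep -lF -- '/*' "$@"; then
        return 1
    fi
    return 0
}
```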

				## `disallow_hash_comments_in_spec_files.sh`

				We do not use comments starting with # in spec files because they cause errors

				from the C preprocessor, which expects directives after this character.

				Instead, use C-style comments, i.e.:

				```

				// this is a single line comment

				/*

				 * this is a multi line comment

				 */

				```

				## `disallow_long_changelog_entries.sh`

				Changelog entries with lines longer than 80 characters are

				forbidden. It's allowed to split up the entry over multiple lines, as long as

				each line of the entry is 80 characters or less.
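
A check of this kind can be sketched in a few lines of shell. The snippet below is only an illustration, not the actual `disallow_long_changelog_entries.sh`: the function name `find_long_lines` is hypothetical, and it simply reports any line of a file that exceeds 80 characters.

```shell
# Hypothetical helper (not the real CI script): prints "<line number>: <line>"
# for every line longer than 80 characters and fails if it found any.
find_long_lines() {
    # Note: some awk implementations count bytes rather than characters.
    awk 'length($0) > 80 { printf "%d: %s\n", NR, $0; found = 1 }
         END { exit (found ? 1 : 0) }' "$1"
}
```

Printing the line number alongside the offending line makes it easy to find the entry that needs to be split.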

				## `normalize_expected.sh`

				All files in `src/test/expected` should be committed in normalized form.

				This error mostly happens if someone added a new normalization rule and you have

				not rerun tests that you have added.

				We normalize the test output files using a `sed` script called

				[`normalize.sed`](https://github.com/citusdata/citus/blob/master/src/test/regress/bin/normalize.sed).

				The reason for this is that some output changes randomly in ways we don't care

				about. An example of this is when an error happens on a different port number,

				or a different worker shard, or a different placement, etc., either randomly or

				because we are running the tests in a slightly different configuration.
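To make the idea concrete, here is a minimal Python sketch of one normalization rule. The rule itself is a made-up example and is not copied from `normalize.sed`; the names `PORT_RE` and `normalize` are likewise hypothetical.

```python
import re

# Hypothetical rule: mask the port in "localhost:<port>" so that the
# expected output does not depend on which port a worker listens on.
PORT_RE = re.compile(r"localhost:[0-9]+")


def normalize(line):
    """Replace run-specific port numbers with a fixed placeholder."""
    return PORT_RE.sub("localhost:xxxxx", line)


print(normalize("ERROR:  connection to localhost:57638 failed"))
```

A useful property of rules like this is that applying them twice gives the same result as applying them once, so already normalized files are left unchanged.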

				## `remove_useless_declarations.sh`

				This script tries to make sure that we don't add useless declarations to our

				code. What it effectively does is replace this:

				```c

				int a = 0;

				int b = 2;

				Assert(b == 2);

				a = b + b;

				```

				With this equivalent, but shorter version:

				```c

				int b = 2;

				Assert(b == 2);

				int a = b + b;

				```

				It relies on the fact that `citus_indent` formats our code in certain ways. So

				before running this script, make sure that you've done that.

				This replacement is all done using a [regex replace](https://xkcd.com/1171), so it's

				definitely possible there's a bug in there. So far no bad ones have been found.

				A known issue is that it does not replace code in a block after an `#ifdef` like

				this:

				```c

				int foo = 0;

				#ifdef SOMETHING

				foo = 1;

				#else

				foo = 2;

				#endif

				```

				Supporting this case was deemed to be error prone and not worth the effort.

				## `fix_gitignore.sh`

				This script checks and fixes issues with `.gitignore` rules:

				1. Makes sure we do not commit any generated files that should be ignored. If there is an

				   ignored file in the git tree, the user is expected to review the files that are removed

				   from the git tree and commit them.

				## `check_gucs_are_alphabetically_sorted.sh`

				This script checks the order of the GUCs defined in `shared_library_init.c`.

				To solve this failure, please check `shared_library_init.c` and make sure that the GUC

				definitions are in alphabetical order.
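The ordering check can be sketched in Python as below. The helper name `first_out_of_order` and the GUC names in the usage line are assumptions for illustration; the byte-wise comparison is meant to mirror the `LC_COLLATE=C sort -c` call in the script.

```python
def first_out_of_order(names):
    """Return the first name that sorts before its predecessor, or None."""
    for previous, current in zip(names, names[1:]):
        # Compare raw bytes, like sorting in the C locale does.
        if current.encode() < previous.encode():
            return current
    return None


# Hypothetical GUC names, for illustration only.
print(first_out_of_order(["citus.alpha", "citus.gamma", "citus.beta"]))
```

When the function returns a name, that GUC definition should be moved up so that it appears before its predecessor.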

				## `print_stack_trace.sh`

				This script prints stack traces for failed tests, if they left core files.

				## `sort_and_group_includes.sh`

				This script checks and fixes issues with include grouping and sorting in C files.

				Includes are grouped in the following groups:

				 - System includes (eg. `#include <math.h>`)

				 - Postgres.h include (eg. `#include "postgres.h"`)

				 - Toplevel postgres includes (includes not in a directory eg. `#include "miscadmin.h"`)

				 - Postgres includes in a directory (eg. `#include "catalog/pg_type.h"`)

				 - Toplevel citus includes (includes not in a directory eg. `#include "pg_version_constants.h"`)

				 - Columnar includes (eg. `#include "columnar/columnar.h"`)

				 - Distributed includes (eg. `#include "distributed/maintenanced.h"`)

				Within every group the include lines are sorted alphabetically.
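The grouping rules above can be illustrated with a small Python sketch. It is a simplification of `ci/include_grouping.py`, not a replacement for it: the names `group_key` and `sort_includes` are hypothetical, and the sketch handles a single block of `#include` lines without the blank-line and `#ifdef` handling of the real script.

```python
TOPLEVEL_CITUS = (
    '"citus_version.h"',
    '"pg_version_compat.h"',
    '"pg_version_constants.h"',
)


def group_key(target):
    """Map an include target such as '"postgres.h"' to its group number."""
    if target == '"postgres.h"':
        return 1
    if target.startswith("<"):
        return 0  # system includes
    if target in TOPLEVEL_CITUS:
        return 4
    if target.startswith('"columnar/'):
        return 5
    if target.startswith('"distributed/'):
        return 6
    if "/" not in target:
        return 2  # toplevel postgres includes
    return 3  # postgres includes in a directory


def sort_includes(lines):
    """Group the include lines, sort each group, separate groups by a blank line."""
    groups = {}
    for line in lines:
        target = line.split(" ", 1)[1].strip()
        groups.setdefault(group_key(target), set()).add(line.strip())
    return "\n\n".join("\n".join(sorted(groups[key])) for key in sorted(groups))


print(sort_includes([
    '#include "distributed/maintenanced.h"',
    '#include "miscadmin.h"',
    '#include <math.h>',
    '#include "postgres.h"',
    '#include "catalog/pg_type.h"',
]))
```

Using a set per group also drops duplicate include lines, which matches the behaviour of the real script.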

									
ci/banned.h.sh (executable file, 56 lines changed)
									
				@ -0,0 +1,56 @@

				#!/bin/bash

				# Checks for the APIs that are banned by Microsoft. Since we compile for Linux

				# we use the replacements from https://github.com/intel/safestringlib

				# Not all replacement functions are available in safestringlib. If it doesn't

				# exist and you cannot rewrite the code to not use the banned API, then you can

				# add a comment containing "IGNORE-BANNED" to the line where the error is and

				# this check will ignore that match.

				#

				# The replacement functions that you should use are listed here:

				# https://liquid.microsoft.com/Web/Object/Read/ms.security/Requirements/Microsoft.Security.SystemsADM.10082#guide

				set -eu

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				files=$(find src -iname '*.[ch]' | git check-attr --stdin citus-style | grep -v ': unset$' | sed 's/: citus-style: set$//')

				# grep is allowed to fail, that means no banned matches are found

				set +e

				# Required banned from banned.h. These functions are not allowed to be used at

				# all.

				# shellcheck disable=SC2086

				grep -E '\b(strcpy|strcpyA|strcpyW|wcscpy|_tcscpy|_mbscpy|StrCpy|StrCpyA|StrCpyW|lstrcpy|lstrcpyA|lstrcpyW|_tccpy|_mbccpy|_ftcscpy|strcat|strcatA|strcatW|wcscat|_tcscat|_mbscat|StrCat|StrCatA|StrCatW|lstrcat|lstrcatA|lstrcatW|StrCatBuff|StrCatBuffA|StrCatBuffW|StrCatChainW|_tccat|_mbccat|_ftcscat|sprintfW|sprintfA|wsprintf|wsprintfW|wsprintfA|sprintf|swprintf|_stprintf|wvsprintf|wvsprintfA|wvsprintfW|vsprintf|_vstprintf|vswprintf|strncpy|wcsncpy|_tcsncpy|_mbsncpy|_mbsnbcpy|StrCpyN|StrCpyNA|StrCpyNW|StrNCpy|strcpynA|StrNCpyA|StrNCpyW|lstrcpyn|lstrcpynA|lstrcpynW|strncat|wcsncat|_tcsncat|_mbsncat|_mbsnbcat|StrCatN|StrCatNA|StrCatNW|StrNCat|StrNCatA|StrNCatW|lstrncat|lstrcatnA|lstrcatnW|lstrcatn|gets|_getts|_gettws|IsBadWritePtr|IsBadHugeWritePtr|IsBadReadPtr|IsBadHugeReadPtr|IsBadCodePtr|IsBadStringPtr|memcpy|RtlCopyMemory|CopyMemory|wmemcpy|lstrlen)\(' $files \

				    | grep -v "IGNORE-BANNED" \

				    && echo "ERROR: Required banned API usage detected" && exit 1

				# Required banned from table on liquid. These functions are not allowed to be

				# used at all.

				# shellcheck disable=SC2086

				grep -E  '\b(strcat|strcpy|strerror|strncat|strncpy|strtok|wcscat|wcscpy|wcsncat|wcsncpy|wcstok|fprintf|fwprintf|printf|snprintf|sprintf|swprintf|vfprintf|vprintf|vsnprintf|vsprintf|vswprintf|vwprintf|wprintf|fscanf|fwscanf|gets|scanf|sscanf|swscanf|vfscanf|vfwscanf|vscanf|vsscanf|vswscanf|vwscanf|wscanf|asctime|atof|atoi|atol|atoll|bsearch|ctime|fopen|freopen|getenv|gmtime|localtime|mbsrtowcs|mbstowcs|memcpy|memmove|qsort|rewind|setbuf|wmemcpy|wmemmove)\(' $files \

				    | grep -v "IGNORE-BANNED" \

				    && echo "ERROR: Required banned API usage from table detected" && exit 1

				# Recommended banned from banned.h. If you can change the code not to use these

				# that would be great. You can use IGNORE-BANNED if you need to use it anyway.

				# You can also remove it from the regex, if you want to mark the API as allowed

				# throughout the codebase (to not have to add IGNORE-BANNED everywhere). In

				# that case note it in this comment that you did so.

				# shellcheck disable=SC2086

				grep -E '\b(wnsprintf|wnsprintfA|wnsprintfW|_snwprintf|_snprintf|_sntprintf|_vsnprintf|vsnprintf|_vsnwprintf|_vsntprintf|wvnsprintf|wvnsprintfA|wvnsprintfW|strtok|_tcstok|wcstok|_mbstok|makepath|_tmakepath|_makepath|_wmakepath|_splitpath|_tsplitpath|_wsplitpath|scanf|wscanf|_tscanf|sscanf|swscanf|_stscanf|snscanf|snwscanf|_sntscanf|_itoa|_itow|_i64toa|_i64tow|_ui64toa|_ui64tot|_ui64tow|_ultoa|_ultot|_ultow|CharToOem|CharToOemA|CharToOemW|OemToChar|OemToCharA|OemToCharW|CharToOemBuffA|CharToOemBuffW|alloca|_alloca|ChangeWindowMessageFilter)\(' $files  \

				    | grep -v "IGNORE-BANNED" \

				    && echo "ERROR: Recommended banned API usage detected" && exit 1

				# Recommended banned from table on liquid. If you can change the code not to use these

				# that would be great. You can use IGNORE-BANNED if you need to use it anyway.

				# You can also remove it from the regex, if you want to mark the API as allowed

				# throughout the codebase (to not have to add IGNORE-BANNED everywhere). In

				# that case note it in this comment that you did so.

				# Banned APIs ignored throughout the codebase:

				# - strlen

				# shellcheck disable=SC2086

				grep -E '\b(alloca|getwd|mktemp|tmpnam|wcrtomb|wcrtombs|wcslen|wcsrtombs|wcstombs|wctomb|class_addMethod|class_replaceMethod)\(' $files  \

				    | grep -v "IGNORE-BANNED" \

				    && echo "ERROR: Recommended banned API usage detected" && exit 1

				exit 0

									
ci/build-citus.sh (executable file, 44 lines changed)
									
				@ -0,0 +1,44 @@

				#!/bin/bash

				# make bash behave

				set -euo pipefail

				IFS=$'\n\t'

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# read pg major version, error if not provided

				PG_MAJOR=${PG_MAJOR:?please provide the postgres major version}

				# get codename from release file

				. /etc/os-release

				codename=${VERSION#*(}

				codename=${codename%)*}

				# we'll do everything with absolute paths

				basedir="$(pwd)"

				# get the project and clear out the git repo (reduce workspace size)

				rm -rf "${basedir}/.git"

				build_ext() {

				  pg_major="$1"

				  builddir="${basedir}/build-${pg_major}"

				  echo "Beginning build for PostgreSQL ${pg_major}..." >&2

				  # do everything in a subdirectory to avoid clutter in current directory

				  mkdir -p "${builddir}" && cd "${builddir}"

				  CFLAGS=-Werror "${basedir}/configure" PG_CONFIG="/usr/lib/postgresql/${pg_major}/bin/pg_config" --enable-coverage --with-security-flags

				  installdir="${builddir}/install"

				  make -j$(nproc) && mkdir -p "${installdir}" && { make DESTDIR="${installdir}" install-all || make DESTDIR="${installdir}" install ; }

				  cd "${installdir}" && find . -type f -print > "${builddir}/files.lst"

				  tar cvf "${basedir}/install-${pg_major}.tar" `cat ${builddir}/files.lst`

				  cd "${builddir}" && rm -rf install files.lst && make clean

				}

				build_ext "${PG_MAJOR}"

									
ci/check_all_ci_scripts_are_run.sh (executable file, 29 lines changed)
									
				@ -0,0 +1,29 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# 1. Find all *.sh files in the ci directory

				# 2. Strip the directory

				# 3. Exclude some scripts that we should not run in CI directly

				ci_scripts=$(

				    find ci/ -iname "*.sh" |

				    sed -E 's#^ci/##g' |

				    grep -v -E '^(ci_helpers.sh|fix_style.sh)$'

				)

				for script in $ci_scripts; do

				    if ! grep "\\bci/$script\\b" -r .github > /dev/null; then

				        echo "ERROR: CI script with name \"$script\" is not actually used in .github folder"

				        exit 1

				    fi

				    if ! grep "^## \`$script\`\$" ci/README.md > /dev/null; then

				        echo "ERROR: CI script with name \"$script\" does not have a section in ci/README.md"

				        exit 1

				    fi

				    if ! grep "source ci/ci_helpers.sh" "ci/$script" > /dev/null; then

				        echo "ERROR: CI script with name \"$script\" does not include ci/ci_helpers.sh"

				        exit 1

				    fi

				done

									
ci/check_all_tests_are_run.sh (executable file, 24 lines changed)
									
				@ -0,0 +1,24 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				cd src/test/regress

				# 1. Find all *.sql and *.spec files in the sql, and spec directories

				# 2. Strip the extension and the directory

				# 3. Ignore names that end with .include, those files are meant to be in a C

				#    preprocessor #include statement. They should not be in schedules.

				test_names=$(

				    find sql spec -iname "*.sql" -o -iname "*.spec" |

				    sed -E 's#^\w+/([^/]+)\.[^.]+$#\1#g' |

				    grep -v '\.include$'

				)

				for name in $test_names; do

				    if ! grep "\\b$name\\b" ./*_schedule > /dev/null; then

				        echo "ERROR: Test with name \"$name\" is not used in any of the schedule files"

				        exit 1

				    fi

				done

									
ci/check_gucs_are_alphabetically_sorted.sh (executable file, 25 lines changed)
									
				@ -0,0 +1,25 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# Find the line that exactly matches "RegisterCitusConfigVariables(void)" in

				# shared_library_init.c. grep command returns something like

				# "934:RegisterCitusConfigVariables(void)" and we extract the line number

				# with cut.

				RegisterCitusConfigVariables_begin_linenumber=$(grep -n "^RegisterCitusConfigVariables(void)$" src/backend/distributed/shared_library_init.c | cut -d: -f1)

				# Consider the lines starting from $RegisterCitusConfigVariables_begin_linenumber,

				# grep the first line that starts with "}" and extract the line number with cut

				# as in the previous step.

				RegisterCitusConfigVariables_length=$(tail -n +$RegisterCitusConfigVariables_begin_linenumber src/backend/distributed/shared_library_init.c | grep -n -m 1 "^}$" | cut -d: -f1)

				# extract the function definition of RegisterCitusConfigVariables into a temp file

				tail -n +$RegisterCitusConfigVariables_begin_linenumber src/backend/distributed/shared_library_init.c | head -n $(($RegisterCitusConfigVariables_length)) > RegisterCitusConfigVariables_func_def.out

				# extract citus gucs in the form of <tab><tab>"citus.X"

				grep -P "^[\t][\t]\"citus\.[a-zA-Z_0-9]+\"" RegisterCitusConfigVariables_func_def.out > gucs.out

				LC_COLLATE=C sort -c gucs.out

				rm gucs.out

				rm RegisterCitusConfigVariables_func_def.out

									
ci/check_migration_files.sh (executable file, 33 lines changed)
									
				@ -0,0 +1,33 @@

				#! /bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# This file checks for the existence of downgrade scripts for every upgrade script that is changed in the branch.

				# create list of migration files for upgrades

				upgrade_files=$(git diff --name-only origin/main | { grep "src/backend/distributed/sql/citus--.*sql" || exit 0 ; })

				downgrade_files=$(git diff --name-only origin/main | { grep "src/backend/distributed/sql/downgrades/citus--.*sql" || exit 0 ; })

				ret_value=0

				for file in $upgrade_files

				do

				    # There should always be 2 matches, and no need to avoid splitting here

				    # shellcheck disable=SC2207

				    versions=($(grep --only-matching --extended-regexp "[0-9]+\.[0-9]+[-.][0-9]+" <<< "$file"))

				    from_version=${versions[0]};

				    to_version=${versions[1]};

				    downgrade_migration_file="src/backend/distributed/sql/downgrades/citus--$to_version--$from_version.sql"

				    # check for the existence of migration scripts

				    if [[ $(grep --line-regexp --count "$downgrade_migration_file" <<< "$downgrade_files") == 0 ]]

				    then

				        echo "$file is updated, but $downgrade_migration_file is not updated in branch"

				        ret_value=1

				    fi

				done

				exit $ret_value;

									
ci/check_sql_snapshots.sh (executable file, 20 lines changed)
									
				@ -0,0 +1,20 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				for udf_dir in src/backend/distributed/sql/udfs/* src/backend/columnar/sql/udfs/*; do

				    # We want to find the last snapshotted sql file, to make sure it's the same

				    # as "latest.sql". This is done by:

				    # 1. Getting the filenames in the UDF directory (using find instead of ls, to keep shellcheck happy)

				    # 2. Filter out latest.sql

				    # 3. Sort using "version sort"

				    # 4. Get the last one using tail

				    latest_snapshot=$(\

				        find "$udf_dir" -iname "*.sql" -exec basename {} \; \

				        | { grep --invert-match latest.sql || true; } \

				        | sort --version-sort \

				        | tail --lines 1);

				    diff --unified --color=auto "$udf_dir/latest.sql" "$udf_dir/$latest_snapshot"; \

				done

									
ci/ci_helpers.sh (normal file, 32 lines changed)
									
				@ -0,0 +1,32 @@

				#!/bin/bash

				# For echo commands "set -x" would show the message effectively twice. Once as

				# part of the echo command shown by "set -x" and once because of the output of

				# the echo command. We do not want "set -x" to show the echo command. We only

				# want to see the actual message in the output of echo itself. This function is

				# a trick to do so. Read the Super User post below to understand why this

				# works and what this works around.

				# Source: https://superuser.com/a/1141026/242593

				shopt -s expand_aliases

				alias echo='{ save_flags="$-"; set +x;} 2> /dev/null && echo_and_restore'

				echo_and_restore() {

				        builtin echo "$*"

				        #shellcheck disable=SC2154

				        case "$save_flags" in

				         (*x*)  set -x

				        esac

				}

				# Make sure that on a failing exit we show a useful message

				hint_on_fail() {

				    exit_code=$?

				    # Get filename of the currently running script

				    # Source: https://stackoverflow.com/a/192337/2570866

				    filename=$(basename "$0")

				    if [ $exit_code != 0 ]; then

				        echo "HINT: To solve this failure look here: https://github.com/citusdata/citus/blob/master/ci/README.md#$filename"

				    fi

				    exit $exit_code

				}

				trap hint_on_fail EXIT

									
ci/disallow_c_comments_in_migrations.sh (executable file, 32 lines changed)
									
				@ -0,0 +1,32 @@

				#! /bin/bash

				set -euo pipefail

				# make ** match all directories and subdirectories

				shopt -s globstar

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# We do not use c-style comments in migration files as the stripped

				# zero-length migration files cause warning during packaging

				# See #3115 for more info

				# In this file, we aim to keep the indentation intact by capturing whitespaces,

				# and reusing them if needed. GNU sed unfortunately does not support lookaround assertions.

				# /* -> --

				find src/backend/{distributed,columnar}/sql/**/*.sql -print0 | xargs -0 sed -i 's#/\*#--#g'

				# */ -> `` (empty string)

				# remove all whitespaces immediately before the match

				find src/backend/{distributed,columnar}/sql/**/*.sql -print0 | xargs -0 sed -i 's#\s*\*/\s*##g'

				# * -> --

				# keep the indentation

				# allow only whitespaces before the match

				find src/backend/{distributed,columnar}/sql/**/*.sql -print0 | xargs -0 sed -i 's#^\(\s*\) \*#\1--#g'

				# // -> --

				# do not touch http:// or similar by allowing only whitespaces before //

				find src/backend/{distributed,columnar}/sql/**/*.sql -print0 | xargs -0 sed -i 's#^\(\s*\)//#\1--#g'

									
ci/disallow_hash_comments_in_spec_files.sh (executable file, 12 lines changed)
									
				@ -0,0 +1,12 @@

				#! /bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# We do not use comments starting with # in spec files because it creates warnings from

				# preprocessor that expects directives after this character.

				# `# ` -> `// `

				find src/test/regress/spec/*.spec -print0 | xargs -0 sed -i 's!# !// !g'

									
ci/disallow_long_changelog_entries.sh (executable file, 19 lines changed)
									
				@ -0,0 +1,19 @@

				#! /bin/bash

				set -eu

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# Changelog items with entries that are longer than 80 characters are forbidden.

				# Find all lines with disallowed length, and for all such lines store

				#  - line number

				#  - length of the line

				#  - the line content

				too_long_lines=$(awk 'length() > 80 {print NR,"(",length(),"characters ) :",$0}' CHANGELOG.md)

				if [[ -n $too_long_lines ]]

				then

				    echo "We allow at most 80 characters in CHANGELOG.md."

				    echo "${too_long_lines}"

				    exit 1

				fi

									
ci/editorconfig.sh (executable file, 22 lines changed)
									
				@ -0,0 +1,22 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				for f in $(git ls-tree -r HEAD --name-only); do

				    if [ "$f" = "${f%.out}" ]  &&

				        [ "$f" = "${f%.data}" ] &&

				        [ "$f" = "${f%.png}" ] &&

				        [ -f "$f" ] &&

				        [ "$(echo "$f" | cut -d / -f1)" != "vendor" ] &&

				        [ "$(dirname "$f")" != "src/test/regress/output" ]

				    then

				        # Trim trailing whitespace

				        sed -e 's/[[:space:]]*$//' -i "./$f"

				        # Add final newline if not there

				        if [ -n "$(tail -c1 "$f")" ]; then

				            echo >> "$f"

				        fi

				    fi

				done

									
ci/fix_gitignore.sh (executable file, 19 lines changed)
									
				@ -0,0 +1,19 @@

				#! /bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# Remove all the ignored files from git tree, and error out

				# find all ignored files in git tree, and use quotation marks to prevent word splitting on filenames with spaces in them

				# NOTE: Option --cached is needed to avoid a bug in git ls-files command.

				ignored_lines_in_git_tree=$(git ls-files --ignored --cached --exclude-standard | sed 's/.*/"&"/')

				if [[ -n $ignored_lines_in_git_tree ]]

				then

				    echo "Ignored files should not be in git tree!"

				    echo "${ignored_lines_in_git_tree}"

				    echo "Removing these files from git tree, please review and commit"

				    echo "$ignored_lines_in_git_tree" | xargs git rm -r --cached

				    exit 1

				fi

									
ci/fix_style.sh (executable file, 22 lines changed)
									
				@ -0,0 +1,22 @@

				#!/bin/sh

				# fail if trying to reference a variable that is not set.

				set -u

				# exit immediately if a command fails

				set -e

				cidir="${0%/*}"

				cd ${cidir}/..

				citus_indent . --quiet

				black . --quiet

				isort . --quiet

				ci/editorconfig.sh

				ci/remove_useless_declarations.sh

				ci/disallow_c_comments_in_migrations.sh

				ci/disallow_hash_comments_in_spec_files.sh

				ci/disallow_long_changelog_entries.sh

				ci/normalize_expected.sh

				ci/fix_gitignore.sh

				ci/print_stack_trace.sh

				ci/sort_and_group_includes.sh

									
ci/include_grouping.py (executable file, 157 lines changed)
									
				@ -0,0 +1,157 @@

				#!/usr/bin/env python3

				"""

				easy command line to run against all citus-style checked files:

				$ git ls-files \

				  | git check-attr --stdin citus-style \

				  | grep 'citus-style: set' \

				  | awk '{print $1}' \

				  | cut -d':' -f1 \

				  | xargs -n1 ./ci/include_grouping.py

				"""

				import collections

				import os

				import sys

				def main(args):

				    if len(args) < 2:

				        print("Usage: include_grouping.py <file>")

				        return

				    file = args[1]

				    if not os.path.isfile(file):

				        sys.exit(f"File '{file}' does not exist")

				    with open(file, "r") as in_file:

				        with open(file + ".tmp", "w") as out_file:

				            includes = []

				            skipped_lines = []

				            # This calls print_sorted_includes on a set of consecutive #include lines.

				            # This implicitly keeps separation of any #include lines that are contained in

				            # an #ifdef, because it will order the #include lines inside and after the

				            # #ifdef completely separately.

				            for line in in_file:

				                # if a line starts with #include we don't want to print it yet, instead we

				                # want to collect all consecutive #include lines

				                if line.startswith("#include"):

				                    includes.append(line)

				                    skipped_lines = []

				                    continue

				                # if we have collected any #include lines, we want to print them sorted

				                # before printing the current line. However, if the current line is empty

				                # we want to perform a lookahead to see if the next line is an #include.

				                # To maintain any separation between #include lines and their subsequent

				                # lines we keep track of all lines we have skipped in between.

				                if len(includes) > 0:

				                    if len(line.strip()) == 0:

				                        skipped_lines.append(line)

				                        continue

				                    # we have includes that need to be grouped before printing the current

				                    # line.

				                    print_sorted_includes(includes, file=out_file)

				                    includes = []

				                    # print any skipped lines

				                    print("".join(skipped_lines), end="", file=out_file)

				                    skipped_lines = []

				                print(line, end="", file=out_file)

				    # move out_file to file

				    os.rename(file + ".tmp", file)

				def print_sorted_includes(includes, file=sys.stdout):

				    default_group_key = 1

				    groups = collections.defaultdict(set)

				    # define the groups that we separate correctly. The matchers are tested in the order

				    # of their priority field. The first matcher that matches the include is used to

				    # assign the include to a group.

				    # The groups are printed in the order of their group_key.

				    matchers = [

				        {

				            "name": "system includes",

				            "matcher": lambda x: x.startswith("<"),

				            "group_key": -2,

				            "priority": 0,

				        },

				        {

				            "name": "toplevel postgres includes",

				            "matcher": lambda x: "/" not in x,

				            "group_key": 0,

				            "priority": 9,

				        },

				        {

				            "name": "postgres.h",

				            "matcher": lambda x: x.strip() in ['"postgres.h"'],

				            "group_key": -1,

				            "priority": -1,

				        },

				        {

				            "name": "toplevel citus includes",

				            "matcher": lambda x: x.strip()

				            in [

				                '"citus_version.h"',

				                '"pg_version_compat.h"',

				                '"pg_version_constants.h"',

				            ],

				            "group_key": 3,

				            "priority": 0,

				        },

				        {

				            "name": "columnar includes",

				            "matcher": lambda x: x.startswith('"columnar/'),

				            "group_key": 4,

				            "priority": 1,

				        },

				        {

				            "name": "distributed includes",

				            "matcher": lambda x: x.startswith('"distributed/'),

				            "group_key": 5,

				            "priority": 1,

				        },

				    ]

				    matchers.sort(key=lambda x: x["priority"])

				    # throughout our codebase we have some includes where either postgres or citus

				    # includes are wrongfully included with the syntax for system includes. Before we

				    # try to match those we will change the <> to "" to make them match our system. This

				    # will also rewrite the include to the correct syntax.

				    common_system_include_error_prefixes = ["<nodes/", "<distributed/"]

				    # assign every include to a group

				    for include in includes:

				        # extract the group key from the include

				        include_content = include.split(" ")[1]

				        # fix common system includes which are secretly postgres or citus includes

				        for common_prefix in common_system_include_error_prefixes:

				            if include_content.startswith(common_prefix):

				                include_content = '"' + include_content.strip()[1:-1] + '"'

				                include = include.split(" ")[0] + " " + include_content + "\n"

				                break

				        group_key = default_group_key

				        for matcher in matchers:

				            if matcher["matcher"](include_content):

				                group_key = matcher["group_key"]

				                break

				        groups[group_key].add(include)

				    # iterate over all groups in the natural order of its keys

				    for i, group in enumerate(sorted(groups.items())):

				        if i > 0:

				            print(file=file)

				        includes = group[1]

				        print("".join(sorted(includes)), end="", file=file)

				if __name__ == "__main__":

				    main(sys.argv)

									
ci/normalize_expected.sh (executable file, 10 lines changed)
									
				@ -0,0 +1,10 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				for f in $(git ls-tree -r HEAD --name-only src/test/regress/expected/*.out); do

					sed -Ef src/test/regress/bin/normalize.sed < "$f" > "$f.modified"

					mv "$f.modified" "$f"

				done

									
ci/print_stack_trace.sh (executable file, 25 lines changed)
									
				@ -0,0 +1,25 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				# find all core files

				core_files=( $(find . -type f -regex .*core.*\d*.*postgres) )

				if [ ${#core_files[@]} -gt 0 ]; then

				    # print stack traces for the core files

				    for core_file in "${core_files[@]}"

				    do

				        # set print frame-arguments all: show all scalars + structures in the frame

				        # set print pretty on:           show structures in indented mode

				        # set print addr off:            do not show pointer address

				        # thread apply all bt full:      show stack traces for all threads

				        gdb --batch \

				            -ex "set print frame-arguments all" \

				            -ex "set print pretty on" \

				            -ex "set print addr off" \

				            -ex "thread apply all bt full" \

				            postgres "${core_file}"

				    done

				fi

									
ci/remove_useless_declarations.sh (executable file, 34 lines changed)
									
				@ -0,0 +1,34 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				files=$(find src -iname '*.c' -type f | git check-attr --stdin citus-style | grep -v ': unset$' | sed 's/: citus-style: set$//')

				while true; do

				    # A visual version of this regex can be seen here (it is MUCH clearer):

				    # https://www.debuggex.com/r/XodMNE9auT9e-bTx

				    # This visual version only contains the search bit, the replacement bit is

				    # quite simple. It looks like this when extracted from the command below:

				    # \n$+{code_between}\t$+{type}$+{variable} =

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t)?(?=\b(?P=variable)\b))(?<=\n\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t$+{type}$+{variable} =/sg' $files

				    # The following are simply the same regex, but repeated for different

				    # indentation levels, i.e. finding declarations indented using 2, 3, 4, 5

				    # and 6 tabs. More than 6 don't really occur in the wild.

				    # (this is needed because variable-length lookbehind is not supported in perl)

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t\t)?(?=\b(?P=variable)\b))(?<=\n\t\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t\t$+{type}$+{variable} =/sg' $files

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t\t\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t\t\t)?(?=\b(?P=variable)\b))(?<=\n\t\t\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t\t\t$+{type}$+{variable} =/sg' $files

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t\t\t\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t\t\t\t)?(?=\b(?P=variable)\b))(?<=\n\t\t\t\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t\t\t\t$+{type}$+{variable} =/sg' $files

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t\t\t\t\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t\t\t\t\t)?(?=\b(?P=variable)\b))(?<=\n\t\t\t\t\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t\t\t\t\t$+{type}$+{variable} =/sg' $files

				    # shellcheck disable=SC2086

				    perl -i -p0e 's/\n\t\t\t\t\t\t(?!return )(?P<type>(\w+ )+\**)(?>(?P<variable>\w+)( = *[\w>\s\n-]*?)?;\n(?P<code_between>(?>(?P<comment_or_string_or_not_preprocessor>\/\*.*?\*\/|"(?>\\"|.)*?"|[^#]))*?)(\t\t\t\t\t\t)?(?=\b(?P=variable)\b))(?<=\n\t\t\t\t\t\t)(?P=variable) =(?![^;]*?[^>_]\b(?P=variable)\b[^_])/\n$+{code_between}\t\t\t\t\t\t$+{type}$+{variable} =/sg' $files

				    # shellcheck disable=SC2086

				    git diff --quiet $files && break;

				    # shellcheck disable=SC2086

				    git add $files;

				done

									
12

ci/sort_and_group_includes.sh Executable file

				@ -0,0 +1,12 @@

				#!/bin/bash

				set -euo pipefail

				# shellcheck disable=SC1091

				source ci/ci_helpers.sh

				git ls-files \

				  | git check-attr --stdin citus-style \

				  | grep 'citus-style: set' \

				  | awk '{print $1}' \

				  | cut -d':' -f1 \

				  | xargs -n1 ./ci/include_grouping.py
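
The pipeline above selects every tracked file whose `citus-style` attribute is set and hands each one to `ci/include_grouping.py`. As a rough illustration of the filtering step only, the Python sketch below parses lines in the `path: attribute: value` shape that `git check-attr` prints. The function name `paths_with_citus_style` and the sample paths are made up for this example and are not part of the repository.

```python
def paths_with_citus_style(check_attr_lines):
    """Return paths whose citus-style attribute is reported as 'set'.

    Mirrors: grep 'citus-style: set' | awk '{print $1}' | cut -d':' -f1
    Like the shell pipeline, it assumes paths contain no spaces or colons.
    """
    selected = []
    for line in check_attr_lines:
        if "citus-style: set" not in line:
            continue  # grep: keep only files with the attribute set
        first_field = line.split()[0]  # awk '{print $1}' yields "path:"
        selected.append(first_field.split(":")[0])  # cut -d':' -f1 yields "path"
    return selected


# Hypothetical sample of `git check-attr --stdin citus-style` output.
sample = [
    "src/backend/columnar/columnar.c: citus-style: set",
    "README.md: citus-style: unspecified",
    "src/include/example.h: citus-style: set",
]
print(paths_with_citus_style(sample))
```

Because the first whitespace-separated field is taken before the colon is cut, a path containing a space would be truncated in both the sketch and the shell version.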

1460

config/config.guess vendored Normal file

File diff suppressed because it is too large.

151

config/general.m4 Normal file

 @ -0,0 +1,151 @@
 # config/general.m4
 # Portions Copyright (c) 1996-2017, PostgreSQL Global Development Group
 # Portions Copyright (c) 1994, The Regents of the University of California
 # This file defines new macros to process configure command line
 # arguments, to replace the brain-dead AC_ARG_WITH and AC_ARG_ENABLE.
 # The flaw in these is particularly that they only differentiate
 # between "given" and "not given" and do not provide enough help to
 # process arguments that only accept "yes/no", that require an
 # argument (other than "yes/no"), etc.
 #
 # The point of this implementation is to reduce code size and
 # redundancy in configure.ac and to improve robustness and consistency
 # in the option evaluation code.
 # Convert type and name to shell variable name (e.g., "enable_long_strings")
 m4_define([pgac_arg_to_variable],
           [$1[]_[]patsubst($2, -, _)])
 # PGAC_ARG(TYPE, NAME, HELP-STRING-LHS-EXTRA, HELP-STRING-RHS,
 #          [ACTION-IF-YES], [ACTION-IF-NO], [ACTION-IF-ARG],
 #          [ACTION-IF-OMITTED])
 # ------------------------------------------------------------
 # This is the base layer. TYPE is either "with" or "enable", depending
 # on what you like.  NAME is the rest of the option name.
 # HELP-STRING-LHS-EXTRA is a string to append to the option name on
 # the left-hand side of the help output, e.g., an argument name.  If
 # set to "-", append nothing, but let the option appear in the
 # negative form (disable/without).  HELP-STRING-RHS is the option
 # description, for the right-hand side of the help output.
 # ACTION-IF-YES is executed if the option is given without an argument
 # (or "yes", which is the same); similar for ACTION-IF-NO.
 AC_DEFUN([PGAC_ARG],
 [
 m4_case([$1],
 enable, [
 AC_ARG_ENABLE([$2], [AS_HELP_STRING([--]m4_if($3, -, disable, enable)[-$2]m4_if($3, -, , $3), [$4])], [
   case [$]enableval in
     yes)
       m4_default([$5], :)
       ;;
     no)
       m4_default([$6], :)
       ;;
     *)
       $7
       ;;
   esac
 ],
 [$8])[]dnl AC_ARG_ENABLE
 ],
 with, [
 AC_ARG_WITH([$2], [AS_HELP_STRING([--]m4_if($3, -, without, with)[-$2]m4_if($3, -, , $3), [$4])], [
   case [$]withval in
     yes)
       m4_default([$5], :)
       ;;
     no)
       m4_default([$6], :)
       ;;
     *)
       $7
       ;;
   esac
 ],
 [$8])[]dnl AC_ARG_WITH
 ],
 [m4_fatal([first argument of $0 must be 'enable' or 'with', not '$1'])]
 )
 ])# PGAC_ARG
 # PGAC_ARG_BOOL(TYPE, NAME, DEFAULT, HELP-STRING-RHS,
 #               [ACTION-IF-YES], [ACTION-IF-NO])
 # ---------------------------------------------------
 # Accept a boolean option, that is, one that only takes yes or no.
 # ("no" is equivalent to "disable" or "without"). DEFAULT is what
 # should be done if the option is omitted; it should be "yes" or "no".
 # (Consequently, one of ACTION-IF-YES and ACTION-IF-NO will always
 # execute.)
 AC_DEFUN([PGAC_ARG_BOOL],
 [dnl The following hack is necessary because in a few instances this
 dnl macro is called twice for the same option with different default
 dnl values.  But we only want it to appear once in the help.  We achieve
 dnl that by making the help string look the same, which is why we need to
 dnl save the default that was passed in previously.
 m4_define([_pgac_helpdefault], m4_ifdef([pgac_defined_$1_$2_bool], [m4_defn([pgac_defined_$1_$2_bool])], [$3]))dnl
 PGAC_ARG([$1], [$2], [m4_if(_pgac_helpdefault, yes, -)], [$4], [$5], [$6],
           [AC_MSG_ERROR([no argument expected for --$1-$2 option])],
           [m4_case([$3],
                    yes, [pgac_arg_to_variable([$1], [$2])=yes
 $5],
                    no,  [pgac_arg_to_variable([$1], [$2])=no
 $6],
                    [m4_fatal([third argument of $0 must be 'yes' or 'no', not '$3'])])])[]dnl
 m4_define([pgac_defined_$1_$2_bool], [$3])dnl
 ])# PGAC_ARG_BOOL
 # PGAC_ARG_REQ(TYPE, NAME, HELP-ARGNAME, HELP-STRING-RHS,
 #              [ACTION-IF-GIVEN], [ACTION-IF-NOT-GIVEN])
 # -------------------------------------------------------
 # This option will require an argument; "yes" or "no" will not be
 # accepted.  HELP-ARGNAME is a name for the argument for the help output.
 AC_DEFUN([PGAC_ARG_REQ],
 [PGAC_ARG([$1], [$2], [=$3], [$4],
           [AC_MSG_ERROR([argument required for --$1-$2 option])],
           [AC_MSG_ERROR([argument required for --$1-$2 option])],
           [$5],
           [$6])])# PGAC_ARG_REQ
 # PGAC_ARG_OPTARG(TYPE, NAME, HELP-ARGNAME, HELP-STRING-RHS,
 #                 [DEFAULT-ACTION], [ARG-ACTION],
 #                 [ACTION-ENABLED], [ACTION-DISABLED])
 # ----------------------------------------------------------
 # This will create an option that behaves as follows: If omitted, or
 # called with "no", then set the enable_variable to "no" and do
 # nothing else. If called with "yes", then execute DEFAULT-ACTION. If
 # called with argument, set enable_variable to "yes" and execute
 # ARG-ACTION. Additionally, execute ACTION-ENABLED if we ended up with
 # "yes" either way, else ACTION-DISABLED.
 #
 # The intent is to allow enabling a feature, and optionally pass an
 # additional piece of information.
 AC_DEFUN([PGAC_ARG_OPTARG],
 [PGAC_ARG([$1], [$2], [@<:@=$3@:>@], [$4], [$5], [],
           [pgac_arg_to_variable([$1], [$2])=yes
 $6],
           [pgac_arg_to_variable([$1], [$2])=no])
 dnl Add this code only if there's an ACTION-ENABLED or ACTION-DISABLED.
 m4_ifval([$7[]$8],
 [
 if test "[$]pgac_arg_to_variable([$1], [$2])" = yes; then
   m4_default([$7], :)
 m4_ifval([$8],
 [else
   $8
 ])[]dnl
 fi
 ])[]dnl
 ])# PGAC_ARG_OPTARG
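
The behaviour described in the comments for `pgac_arg_to_variable` and `PGAC_ARG_OPTARG` can be modelled outside of m4. The Python sketch below is only an illustration of those rules as the comments state them: the function names `arg_to_variable` and `resolve_optarg` are hypothetical, and the returned action labels are plain strings standing in for the macro arguments.

```python
def arg_to_variable(kind, name):
    """Model of pgac_arg_to_variable.

    ("enable", "long-strings") -> "enable_long_strings"
    The kind check mirrors the m4_fatal in PGAC_ARG, not
    pgac_arg_to_variable itself.
    """
    if kind not in ("enable", "with"):
        raise ValueError("first argument must be 'enable' or 'with', not %r" % kind)
    return kind + "_" + name.replace("-", "_")


def resolve_optarg(value):
    """Model of the PGAC_ARG_OPTARG outcome for one option.

    value is None when the option is omitted; otherwise it is the string
    seen by configure ("yes" when the option is given without an argument).
    Returns (variable_value, action_label).
    """
    if value is None or value == "no":
        return "no", None  # omitted or "no": variable is "no", nothing else runs
    if value == "yes":
        return "yes", "DEFAULT-ACTION"
    return "yes", "ARG-ACTION"  # any other argument enables the feature


print(arg_to_variable("with", "extra-version"))
print(resolve_optarg("/usr/local"))
```

In the macro itself, ACTION-ENABLED or ACTION-DISABLED would then run depending on whether the resulting variable is `yes`.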

2019

configure vendored

File diff suppressed because it is too large.

311

configure.ac Normal file

 @ -0,0 +1,311 @@
 # Citus autoconf input script.
 #
 # Converted into an actual configure script by autogen.sh. This
 # conversion only has to be done when configure.ac changes. To avoid
 # everyone needing autoconf installed, the resulting files are checked
 # into the SCM.
 AC_INIT([Citus], [13.2devel])
 AC_COPYRIGHT([Copyright (c) Citus Data, Inc.])
 # we'll need sed and awk for some of the version commands
 AC_PROG_SED
 AC_PROG_AWK
 # CITUS_NAME definition
 AC_DEFINE_UNQUOTED(CITUS_NAME, "$PACKAGE_NAME", [Citus full name as a string])
 case $PACKAGE_NAME in
   'Citus Enterprise') citus_edition=enterprise ;;
                Citus) citus_edition=community ;;
                    *) AC_MSG_ERROR([Unrecognized package name.]) ;;
 esac
 # CITUS_EDITION definition
 AC_DEFINE_UNQUOTED(CITUS_EDITION, "$citus_edition", [Citus edition as a string])
 # CITUS_MAJORVERSION definition
 [CITUS_MAJORVERSION=`expr "$PACKAGE_VERSION" : '\([0-9][0-9]*\.[0-9][0-9]*\)'`]
 AC_DEFINE_UNQUOTED(CITUS_MAJORVERSION, "$CITUS_MAJORVERSION", [Citus major version as a string])
 # CITUS_VERSION definition
 PGAC_ARG_REQ(with, extra-version, [STRING], [append STRING to version],
              [CITUS_VERSION="$PACKAGE_VERSION$withval"],
              [CITUS_VERSION="$PACKAGE_VERSION"])
 AC_DEFINE_UNQUOTED(CITUS_VERSION, "$CITUS_VERSION", [Citus version as a string])
 # CITUS_VERSION_NUM definition
 # awk -F is a regex on some platforms, and not on others, so make "." a tab
 [CITUS_VERSION_NUM="`echo "$PACKAGE_VERSION" | sed 's/[A-Za-z].*$//' |
 tr '.' '	' |
 $AWK '{printf "%d%02d%02d", $1, $2, (NF >= 3) ? $3 : 0}'`"]
 AC_DEFINE_UNQUOTED(CITUS_VERSION_NUM, $CITUS_VERSION_NUM, [Citus version as a number])
 # CITUS_EXTENSIONVERSION definition
 [CITUS_EXTENSIONVERSION="`grep '^default_version' $srcdir/src/backend/distributed/citus.control | cut -d\' -f2`"]
 AC_DEFINE_UNQUOTED([CITUS_EXTENSIONVERSION], "$CITUS_EXTENSIONVERSION", [Extension version expected by this Citus build])
 # Re-check for flex. That allows compiling Citus against a postgres
 # which was built without flex available (possible because generated
 # files are included)
 AC_PATH_PROG([FLEX], [flex])
 # Locate pg_config binary
 AC_ARG_VAR([PG_CONFIG], [Location to find pg_config for target PostgreSQL installation (default PATH)])
 AC_ARG_VAR([PATH], [PATH for target PostgreSQL install pg_config])
 if test -z "$PG_CONFIG"; then
   AC_PATH_PROG(PG_CONFIG, pg_config)
 fi
 if test -z "$PG_CONFIG"; then
    AC_MSG_ERROR([Could not find pg_config. Set PG_CONFIG or PATH.])
 fi
 # check we're building against a supported version of PostgreSQL
 citusac_pg_config_version=$($PG_CONFIG --version 2>/dev/null)
 version_num=$(echo "$citusac_pg_config_version"|
               $SED -e 's/^PostgreSQL \([[0-9]]*\)\(\.[[0-9]]*\)\{0,1\}\(.*\)$/\1\2/')
 # if PostgreSQL version starts with two digits, the major version is those digits
 version_num=$(echo "$version_num"| $SED -e 's/^\([[0-9]]\{2\}\)\(.*\)$/\1/')
 if test -z "$version_num"; then
   AC_MSG_ERROR([Could not detect PostgreSQL version from pg_config.])
 fi
 PGAC_ARG_BOOL(with, pg-version-check, yes,
               [do not check postgres version during configure])
 AC_SUBST(with_pg_version_check)
 if test "$with_pg_version_check" = no; then
     AC_MSG_NOTICE([building against PostgreSQL $version_num (skipped compatibility check)])
 elif test "$version_num" != '15' -a  "$version_num" != '16' -a  "$version_num" != '17'; then
    AC_MSG_ERROR([Citus is not compatible with the detected PostgreSQL version ${version_num}.])
 else
    AC_MSG_NOTICE([building against PostgreSQL $version_num])
 fi;
 # Check whether we're building inside the source tree, if not, prepare
 # the build directory.
 if test "$srcdir" -ef '.' ; then
   vpath_build=no
 else
   vpath_build=yes
   _AS_ECHO_N([preparing build tree... ])
   citusac_abs_top_srcdir=`cd "$srcdir" && pwd`
   $SHELL "$citusac_abs_top_srcdir/prep_buildtree" "$citusac_abs_top_srcdir" "." \
       || AC_MSG_ERROR(failed)
   AC_MSG_RESULT(done)
 fi
 AC_SUBST(vpath_build)
 # Allow overriding the C compiler; default to the one postgres was
 # compiled with. We don't want autoconf's default CFLAGS though, so save
 # those.
 SAVE_CFLAGS="$CFLAGS"
 AC_PROG_CC([$($PG_CONFIG --cc)])
 CFLAGS="$SAVE_CFLAGS"
 host_guess=`${SHELL} $srcdir/config/config.guess`
 # Create compiler version string
 if test x"$GCC" = x"yes" ; then
   cc_string=`${CC} --version | sed q`
   case $cc_string in [[A-Za-z]]*) ;; *) cc_string="GCC $cc_string";; esac
 elif test x"$SUN_STUDIO_CC" = x"yes" ; then
   cc_string=`${CC} -V 2>&1 | sed q`
 else
   cc_string=$CC
 fi
 AC_CHECK_SIZEOF([void *])
 AC_DEFINE_UNQUOTED(CITUS_VERSION_STR,
                    ["$PACKAGE_NAME $CITUS_VERSION on $host_guess, compiled by $cc_string, `expr $ac_cv_sizeof_void_p \* 8`-bit"],
                    [A string containing the version number, platform, and C compiler])
 # Locate source and build directory of the postgres we're building
 # against. Can't rely on either still being present, but e.g. optional
 # test infrastructure can rely on it.
 POSTGRES_SRCDIR=$(grep ^abs_top_srcdir $(dirname $($PG_CONFIG --pgxs))/../Makefile.global|cut -d ' ' -f3-)
 POSTGRES_BUILDDIR=$(grep ^abs_top_builddir $(dirname $($PG_CONFIG --pgxs))/../Makefile.global|cut -d ' ' -f3-)
 # check for a number of CFLAGS that make development easier
 # CITUSAC_PROG_CC_CFLAGS_OPT
 # -----------------------
 # Given a string, check if the compiler supports the string as a
 # command-line option. If it does, add the string to CFLAGS.
 AC_DEFUN([CITUSAC_PROG_CC_CFLAGS_OPT],
 [define([Ac_cachevar], [AS_TR_SH([citusac_cv_prog_cc_cflags_$1])])dnl
 AC_CACHE_CHECK([whether $CC supports $1], [Ac_cachevar],
 [citusac_save_CFLAGS=$CFLAGS
 flag=$1
 case $flag in -Wno*)
 	 flag=-W$(echo $flag | cut -c 6-)
 esac
 CFLAGS="$citusac_save_CFLAGS $flag"
 ac_save_c_werror_flag=$ac_c_werror_flag
 ac_c_werror_flag=yes
 _AC_COMPILE_IFELSE([AC_LANG_PROGRAM()],
                    [Ac_cachevar=yes],
                    [Ac_cachevar=no])
 ac_c_werror_flag=$ac_save_c_werror_flag
 CFLAGS="$citusac_save_CFLAGS"])
 if test x"$Ac_cachevar" = x"yes"; then
   CITUS_CFLAGS="$CITUS_CFLAGS $1"
 fi
 undefine([Ac_cachevar])dnl
 ])# CITUSAC_PROG_CC_CFLAGS_OPT
 CITUSAC_PROG_CC_CFLAGS_OPT([-std=gnu99])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wall])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wextra])
 # disarm options included in the above, which are too noisy for now
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-unused-parameter])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-sign-compare])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-missing-field-initializers])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-clobbered])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-gnu-variable-sized-type-not-at-end])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-declaration-after-statement])
 # And add a few extra warnings
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wendif-labels])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-format-attribute])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-declarations])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-prototypes])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wshadow])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Werror=vla])  # visual studio does not support these
 CITUSAC_PROG_CC_CFLAGS_OPT([-Werror=implicit-int])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Werror=implicit-function-declaration])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Werror=return-type])
 # Security flags
 # Flags taken from: https://liquid.microsoft.com/Web/Object/Read/ms.security/Requirements/Microsoft.Security.SystemsADM.10203#guide
 # We do not enforce the following flag because it is only available on GCC>=8
 CITUSAC_PROG_CC_CFLAGS_OPT([-fstack-clash-protection])
 #
 # --enable-coverage enables generation of code coverage metrics with gcov
 #
 AC_ARG_ENABLE([coverage], AS_HELP_STRING([--enable-coverage], [build with coverage testing instrumentation]))
 if test "$enable_coverage" = yes; then
     CITUS_CFLAGS="$CITUS_CFLAGS -O0 -g --coverage"
     CITUS_CPPFLAGS="$CITUS_CPPFLAGS -DNDEBUG"
     CITUS_LDFLAGS="$CITUS_LDFLAGS --coverage"
 fi
 #
 # libcurl
 #
 PGAC_ARG_BOOL(with, libcurl, yes,
               [do not use libcurl for anonymous statistics collection],
               [AC_DEFINE([HAVE_LIBCURL], 1, [Define to 1 to build with libcurl support. (--with-libcurl)])])
 if test "$with_libcurl" = yes; then
   AC_CHECK_LIB(curl, curl_global_init, [],
               [AC_MSG_ERROR([libcurl not found
 If you have libcurl already installed, see config.log for details on the
 failure. It is possible the compiler isn't looking in the proper directory.
 Use --without-libcurl to disable anonymous statistics collection.])])
   AC_CHECK_HEADER(curl/curl.h, [], [AC_MSG_ERROR([libcurl header not found
 If you have libcurl already installed, see config.log for details on the
 failure.  It is possible the compiler isn't looking in the proper directory.
 Use --without-libcurl to disable libcurl support.])])
 fi
 # REPORTS_BASE_URL definition
 PGAC_ARG_REQ(with, reports-hostname, [HOSTNAME],
              [Use HOSTNAME as hostname for statistics collection and update checks],
              [REPORTS_BASE_URL="https://${withval}"],
              [REPORTS_BASE_URL="https://reports.citusdata.com"])
 AC_DEFINE_UNQUOTED(REPORTS_BASE_URL, "$REPORTS_BASE_URL",
                    [Base URL for statistics collection and update checks])
 #
 # LZ4
 #
 PGAC_ARG_BOOL(with, lz4, yes,
               [do not use lz4],
               [AC_DEFINE([HAVE_CITUS_LIBLZ4], 1, [Define to 1 to build with lz4 support. (--with-lz4)])])
 AC_SUBST(with_lz4)
 if test "$with_lz4" = yes; then
   AC_CHECK_LIB(lz4, LZ4_compress_default, [],
               [AC_MSG_ERROR([lz4 library not found
 If you have lz4 installed, see config.log for details on the
 failure.  It is possible the compiler isn't looking in the proper directory.
 Use --without-lz4 to disable lz4 support.])])
   AC_CHECK_HEADER(lz4.h, [], [AC_MSG_ERROR([lz4 header not found
 If you have lz4 already installed, see config.log for details on the
 failure.  It is possible the compiler isn't looking in the proper directory.
 Use --without-lz4 to disable lz4 support.])])
 fi
 #
 # ZSTD
 #
 PGAC_ARG_BOOL(with, zstd, yes,
               [do not use zstd])
 AC_SUBST(with_zstd)
 if test "$with_zstd" = yes; then
   AC_CHECK_LIB(zstd, ZSTD_decompress, [],
               [AC_MSG_ERROR([zstd library not found
 If you have zstd installed, see config.log for details on the
 failure.  It is possible the compiler isn't looking in the proper directory.
 Use --without-zstd to disable zstd support.])])
   AC_CHECK_HEADER(zstd.h, [], [AC_MSG_ERROR([zstd header not found
 If you have zstd already installed, see config.log for details on the
 failure.  It is possible the compiler isn't looking in the proper directory.
 Use --without-zstd to disable zstd support.])])
 fi
 PGAC_ARG_BOOL(with, security-flags, no,
               [use security flags])
 AC_SUBST(with_security_flags)
 if test "$with_security_flags" = yes; then
 # Flags taken from: https://liquid.microsoft.com/Web/Object/Read/ms.security/Requirements/Microsoft.Security.SystemsADM.10203#guide
 # We always want to have some compiler flags for security concerns.
 SECURITY_CFLAGS="-fstack-protector-strong -D_FORTIFY_SOURCE=2 -O2 -z noexecstack -fpic -shared -Wl,-z,relro -Wl,-z,now -Wformat -Wformat-security -Werror=format-security"
 CITUS_CFLAGS="$CITUS_CFLAGS $SECURITY_CFLAGS"
 AC_MSG_NOTICE([Blindly added security flags for linker: $SECURITY_CFLAGS])
 # We always want to have some clang flags for security concerns.
 # This doesn't include "-Wl,-z,relro -Wl,-z,now" on purpose, because bitcode is not linked.
 # This doesn't include -fsanitize=cfi because it breaks builds on many distros including
 # Debian/Buster, Debian/Stretch, Ubuntu/Bionic, Ubuntu/Xenial and EL7.
 SECURITY_BITCODE_CFLAGS="-fsanitize=safe-stack -fstack-protector-strong -flto -fPIC -Wformat -Wformat-security -Werror=format-security"
 CITUS_BITCODE_CFLAGS="$CITUS_BITCODE_CFLAGS $SECURITY_BITCODE_CFLAGS"
 AC_MSG_NOTICE([Blindly added security flags for llvm: $SECURITY_BITCODE_CFLAGS])
 AC_MSG_WARN([If you run into issues during linking or bitcode compilation, you can use --without-security-flags.])
 fi
 # Check if git is installed; when it is, the gitref of the checkout will be baked into the application
 AC_PATH_PROG(GIT_BIN, git)
 AC_CHECK_FILE(.git,[HAS_DOTGIT=yes], [HAS_DOTGIT=])
 AC_SUBST(CITUS_CFLAGS, "$CITUS_CFLAGS")
 AC_SUBST(CITUS_BITCODE_CFLAGS, "$CITUS_BITCODE_CFLAGS")
 AC_SUBST(CITUS_CPPFLAGS, "$CITUS_CPPFLAGS")
 AC_SUBST(CITUS_LDFLAGS, "$LIBS $CITUS_LDFLAGS")
 AC_SUBST(POSTGRES_SRCDIR, "$POSTGRES_SRCDIR")
 AC_SUBST(POSTGRES_BUILDDIR, "$POSTGRES_BUILDDIR")
 AC_SUBST(HAS_DOTGIT, "$HAS_DOTGIT")
 AC_CONFIG_FILES([Makefile.global])
 AC_CONFIG_HEADERS([src/include/citus_config.h] [src/include/citus_version.h])
 AH_TOP([
 /*
  * citus_config.h.in is generated by autoconf/autoheader and
  * converted into citus_config.h by configure.  Include when code needs to
  * depend on determinations made by configure.
  *
  * Do not manually edit!
  */
 ])
 AC_OUTPUT
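
Two of the string transformations in `configure.ac` above are easy to misread in their `sed`/`awk` form: the computation of `CITUS_VERSION_NUM` and the extraction of the PostgreSQL major version from `pg_config --version`. The Python sketch below restates them for illustration only. The function names are hypothetical, the regular expressions are translations of the `sed` expressions rather than the originals, and the sketch assumes the version string has at least a major and a minor component.

```python
import re

SUPPORTED_PG_MAJORS = ("15", "16", "17")  # versions accepted by the check above


def citus_version_num(package_version):
    """Model of CITUS_VERSION_NUM, e.g. "13.2devel" -> "130200"."""
    numeric = re.sub(r"[A-Za-z].*$", "", package_version)  # sed 's/[A-Za-z].*$//'
    fields = numeric.split(".")  # tr '.' '\t', then awk field splitting
    major, minor = int(fields[0]), int(fields[1])
    patch = int(fields[2]) if len(fields) >= 3 else 0  # (NF >= 3) ? $3 : 0
    return "%d%02d%02d" % (major, minor, patch)


def pg_major_version(pg_config_output):
    """Model of the two sed steps applied to `pg_config --version` output.

    Expects the output without a trailing newline, as $(...) would give it.
    """
    short = re.sub(r"^PostgreSQL ([0-9]*)(\.[0-9]*)?(.*)$", r"\1\2", pg_config_output)
    # If the result starts with two digits, those digits are the major version.
    return re.sub(r"^([0-9]{2})(.*)$", r"\1", short)


def is_supported(pg_config_output):
    return pg_major_version(pg_config_output) in SUPPORTED_PG_MAJORS


print(citus_version_num("13.2devel"))
print(pg_major_version("PostgreSQL 16.3"), is_supported("PostgreSQL 16.3"))
```

Passing `--without-pg-version-check` to configure skips the supported-version comparison entirely, which the sketch does not model.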

117

configure.in

 @ -1,117 +0,0 @@
 # Citus autoconf input script.
 #
 # Converted into an actual configure script by autogen.sh. This
 # conversion only has to be done when configure.in changes. To avoid
 # everyone needing autoconf installed, the resulting files are checked
 # into the SCM.
 AC_INIT([Citus], [5.0], [], [citus], [])
 AC_COPYRIGHT([Copyright (c) 2012-2016, Citus Data, Inc.])
 AC_PROG_SED
 # Re-check for flex. That allows to compile citus against a postgres
 # which was built without flex available (possible because generated
 # files are included)
 AC_PATH_PROG([FLEX], [flex])
 # Locate pg_config binary
 AC_ARG_VAR([PG_CONFIG], [Location to find pg_config for target PostgreSQL installation (default PATH)])
 AC_ARG_VAR([PATH], [PATH for target PostgreSQL install pg_config])
 if test -z "$PG_CONFIG"; then
   AC_PATH_PROG(PG_CONFIG, pg_config)
 fi
 if test -z "$PG_CONFIG"; then
    AC_MSG_ERROR([Could not find pg_config. Set PG_CONFIG or PATH.])
 fi
 # check we're building against a supported version of PostgreSQL
 citusac_pg_config_version=$($PG_CONFIG --version 2>/dev/null)
 version_num=$(echo "$citusac_pg_config_version"|
               $SED -e 's/^PostgreSQL \([[0-9]]*\)\.\([[0-9]]*\)\([[a-zA-Z0-9.]]*\)$/\1.\2/')
 if test -z "$version_num"; then
   AC_MSG_ERROR([Could not detect PostgreSQL version from pg_config.])
 fi
 if test "$version_num" != '9.5'; then
    AC_MSG_ERROR([Citus is not compatible with the detected PostgreSQL version ${version_num}.])
 else
    AC_MSG_NOTICE([building against PostgreSQL $version_num])
 fi;
 # Check whether we're building inside the source tree, if not, prepare
 # the build directory.
 if test "$srcdir" -ef '.' ; then
   vpath_build=no
 else
   vpath_build=yes
   _AS_ECHO_N([preparing build tree... ])
   citusac_abs_top_srcdir=`cd "$srcdir" && pwd`
   $SHELL "$citusac_abs_top_srcdir/prep_buildtree" "$citusac_abs_top_srcdir" "." \
       || AC_MSG_ERROR(failed)
   AC_MSG_RESULT(done)
 fi
 AC_SUBST(vpath_build)
 # Allow to overwrite the C compiler, default to the one postgres was
 # compiled with. We don't want autoconf's default CFLAGS though, so save
 # those.
 SAVE_CFLAGS="$CFLAGS"
 AC_PROG_CC([$($PG_CONFIG --cc)])
 CFLAGS="$SAVE_CFLAGS"
 # check for a number of CFLAGS that make development easier
 # CITUSAC_PROG_CC_CFLAGS_OPT
 # -----------------------
 # Given a string, check if the compiler supports the string as a
 # command-line option. If it does, add the string to CFLAGS.
 AC_DEFUN([CITUSAC_PROG_CC_CFLAGS_OPT],
 [define([Ac_cachevar], [AS_TR_SH([citusac_cv_prog_cc_cflags_$1])])dnl
 AC_CACHE_CHECK([whether $CC supports $1], [Ac_cachevar],
 [citusac_save_CFLAGS=$CFLAGS
 CFLAGS="$citusac_save_CFLAGS $1"
 ac_save_c_werror_flag=$ac_c_werror_flag
 ac_c_werror_flag=yes
 _AC_COMPILE_IFELSE([AC_LANG_PROGRAM()],
                    [Ac_cachevar=yes],
                    [Ac_cachevar=no])
 ac_c_werror_flag=$ac_save_c_werror_flag
 CFLAGS="$citusac_save_CFLAGS"])
 if test x"$Ac_cachevar" = x"yes"; then
   CITUS_CFLAGS="$CITUS_CFLAGS $1"
 fi
 undefine([Ac_cachevar])dnl
 ])# CITUSAC_PROG_CC_CFLAGS_OPT
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wall])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wextra])
 # disarm options included in the above, which are too noisy for now
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-unused-parameter])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-sign-compare])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-missing-field-initializers])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wno-clobbered])
 # And add a few extra warnings
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wdeclaration-after-statement])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wendif-labels])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-format-attribute])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-declarations])
 CITUSAC_PROG_CC_CFLAGS_OPT([-Wmissing-prototypes])
 AC_SUBST(CITUS_CFLAGS, "$CITUS_CFLAGS")
 AC_CONFIG_FILES([Makefile.global])
 AC_CONFIG_HEADERS([src/include/citus_config.h])
 AH_TOP([
 /*
  * citus_config.h.in is generated by autoconf/autoheader and
  * converted into citus_config.h by configure.  Include when code needs to
  * depend on determinations made by configure.
  *
  * Do not manually edit!
  */
 ])
 AC_OUTPUT

BIN github-banner.png (binary file not shown; before: 4.0 KiB)

BIN images/2pc-recovery.png Executable file (binary file not shown; after: 95 KiB)

BIN images/citus-architecture.png Executable file (binary file not shown; after: 94 KiB)

BIN images/citus-readme-banner.png Executable file (binary file not shown; after: 22 KiB)

BIN images/citus-scale-out.png Executable file (binary file not shown; after: 18 KiB)

BIN images/coordinator_delegates_stored_procedure.png Normal file (binary file not shown; after: 22 KiB)

BIN images/deadlock-detection.png Executable file (binary file not shown; after: 102 KiB)

BIN images/executor-connections.png Executable file (binary file not shown; after: 29 KiB)

BIN images/executor-slow-start.png Executable file (binary file not shown; after: 69 KiB)

BIN images/insert-select-modes.png Executable file (binary file not shown; after: 111 KiB)

BIN images/mx-dedicated-query-nodes.png Executable file (binary file not shown; after: 12 KiB)

BIN images/single-repartition-join.png Executable file (binary file not shown; after: 168 KiB)

									
40

pyproject.toml Normal file

				@ -0,0 +1,40 @@

				[tool.isort]

				profile = 'black'

				[tool.black]

				include = '(src/test/regress/bin/diff-filter|\.pyi?|\.ipynb)$'

				[tool.pytest.ini_options]

				addopts = [

				    "--import-mode=importlib",

				    "--showlocals",

				    "--tb=short",

				]

				pythonpath = 'src/test/regress/citus_tests'

				asyncio_mode = 'auto'

				# Make test discovery quicker from the root dir of the repo

				testpaths = ['src/test/regress/citus_tests/test']

				# Make test discovery quicker from other directories than root directory

				norecursedirs = [

				    '*.egg',

				    '.*',

				    'build',

				    'venv',

				    'ci',

				    'vendor',

				    'backend',

				    'bin',

				    'include',

				    'tmp_*',

				    'results',

				    'expected',

				    'sql',

				    'spec',

				    'data',

				    '__pycache__',

				]

				# Don't find files with test at the end such as run_test.py

				python_files = ['test_*.py']
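
The comment on `python_files` says the pattern is meant to skip files that merely end in `test`, such as `run_test.py`. A small Python sketch of that glob check follows. It uses the standard `fnmatch` module as a stand-in and is only an approximation: pytest's actual collection also applies `testpaths` and `norecursedirs`, and the helper name `is_collected` is hypothetical.

```python
from fnmatch import fnmatchcase

PYTHON_FILES = ["test_*.py"]  # copied from python_files above


def is_collected(filename):
    """Return True if the file name matches one of the python_files globs."""
    return any(fnmatchcase(filename, pattern) for pattern in PYTHON_FILES)


for name in ("test_columnar.py", "run_test.py", "conftest.py"):
    print(name, is_collected(name))
```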

25

src/backend/columnar/.gitattributes vendored Normal file

 @ -0,0 +1,25 @@
 *		whitespace=space-before-tab,trailing-space
 *.[chly]	whitespace=space-before-tab,trailing-space,indent-with-non-tab,tabwidth=4
 *.dsl		whitespace=space-before-tab,trailing-space,tab-in-indent
 *.patch		-whitespace
 *.pl		whitespace=space-before-tab,trailing-space,tabwidth=4
 *.po		whitespace=space-before-tab,trailing-space,tab-in-indent,-blank-at-eof
 *.sgml		whitespace=space-before-tab,trailing-space,tab-in-indent,-blank-at-eol
 *.x[ms]l	whitespace=space-before-tab,trailing-space,tab-in-indent
 # Avoid confusing ASCII underlines with leftover merge conflict markers
 README		conflict-marker-size=32
 README.*	conflict-marker-size=32
 # Certain data files that contain special whitespace, and other special cases
 *.data						-whitespace
 # Test output files that contain extra whitespace
 *.out					-whitespace
 # These files are maintained or generated elsewhere.  We take them as is.
 configure				-whitespace
 # all C files (implementation and header) use our style...
 *.[ch] citus-style

3

src/backend/columnar/.gitignore vendored Normal file

 @ -0,0 +1,3 @@
 # The directory used to store columnar sql files after pre-processing them
 # with 'cpp' in build-time, see src/backend/columnar/Makefile.
 /build/

									
60

src/backend/columnar/Makefile Normal file

				@ -0,0 +1,60 @@

				citus_subdir = src/backend/columnar

				citus_top_builddir = ../../..

				safestringlib_srcdir = $(citus_abs_top_srcdir)/vendor/safestringlib

				SUBDIRS = . safeclib

				SUBDIRS +=

				ENSURE_SUBDIRS_EXIST := $(shell mkdir -p $(SUBDIRS))

				OBJS += \

					$(patsubst $(citus_abs_srcdir)/%.c,%.o,$(foreach dir,$(SUBDIRS), $(sort $(wildcard $(citus_abs_srcdir)/$(dir)/*.c))))

				MODULE_big = citus_columnar

				EXTENSION = citus_columnar

				template_sql_files = $(patsubst $(citus_abs_srcdir)/%,%,$(wildcard $(citus_abs_srcdir)/sql/*.sql))

				template_downgrade_sql_files = $(patsubst $(citus_abs_srcdir)/sql/downgrades/%,%,$(wildcard $(citus_abs_srcdir)/sql/downgrades/*.sql))

				generated_sql_files = $(patsubst %,$(citus_abs_srcdir)/build/%,$(template_sql_files))

				generated_downgrade_sql_files += $(patsubst %,$(citus_abs_srcdir)/build/sql/%,$(template_downgrade_sql_files))

				DATA_built = $(generated_sql_files)

				PG_CPPFLAGS += -I$(libpq_srcdir) -I$(safestringlib_srcdir)/include

				include $(citus_top_builddir)/Makefile.global

				SQL_DEPDIR=.deps/sql

				SQL_BUILDDIR=build/sql

				$(generated_sql_files): $(citus_abs_srcdir)/build/%: %

					@mkdir -p $(citus_abs_srcdir)/$(SQL_DEPDIR) $(citus_abs_srcdir)/$(SQL_BUILDDIR)

					@# -MF is used to store dependency files(.Po) in another directory for separation

					@# -MT is used to change the target of the rule emitted by dependency generation.

					@# -P is used to inhibit generation of linemarkers in the output from the preprocessor.

					@# -undef is used to not predefine any system-specific or GCC-specific macros.

					@# `man cpp` for further information

					cd $(citus_abs_srcdir) && cpp -undef -w -P -MMD -MP -MF$(SQL_DEPDIR)/$(*F).Po -MT$@ $< > $@

				$(generated_downgrade_sql_files): $(citus_abs_srcdir)/build/sql/%: sql/downgrades/%

					@mkdir -p $(citus_abs_srcdir)/$(SQL_DEPDIR) $(citus_abs_srcdir)/$(SQL_BUILDDIR)

					@# -MF is used to store dependency files(.Po) in another directory for separation

					@# -MT is used to change the target of the rule emitted by dependency generation.

					@# -P is used to inhibit generation of linemarkers in the output from the preprocessor.

					@# -undef is used to not predefine any system-specific or GCC-specific macros.

					@# `man cpp` for further information

					cd $(citus_abs_srcdir) && cpp -undef -w -P -MMD -MP -MF$(SQL_DEPDIR)/$(*F).Po -MT$@ $< > $@

				.PHONY: install install-downgrades install-all

				cleanup-before-install:

					rm -f $(DESTDIR)$(datadir)/$(datamoduledir)/citus_columnar.control

					rm -f $(DESTDIR)$(datadir)/$(datamoduledir)/columnar--*

					rm -f $(DESTDIR)$(datadir)/$(datamoduledir)/citus_columnar--*

				install: cleanup-before-install

				# install and install-downgrades should be run sequentially

				install-all: install

					$(MAKE) install-downgrades

				install-downgrades: $(generated_downgrade_sql_files)

					$(INSTALL_DATA) $(generated_downgrade_sql_files) '$(DESTDIR)$(datadir)/$(datamoduledir)/'

									
321

src/backend/columnar/README.md Normal file

				@ -0,0 +1,321 @@

				# Introduction

				Citus Columnar offers a per-table option for columnar storage to

				reduce IO requirements through compression and projection pushdown.

				# Design Trade-Offs

				Existing PostgreSQL row tables work well for OLTP:

				* Support `UPDATE`/`DELETE` efficiently

				* Efficient single-tuple lookups

				The Citus Columnar tables work best for analytic or DW workloads:

				* Compression

				* Doesn't read unnecessary columns

				* Efficient `VACUUM`

				# Next generation of cstore_fdw

				Citus Columnar is the next generation of

				[cstore_fdw](https://github.com/citusdata/cstore_fdw/).

				Benefits of Citus Columnar over cstore_fdw:

				* Citus Columnar is based on the [Table Access Method

				  API](https://www.postgresql.org/docs/current/tableam.html), which

				  allows it to behave exactly like an ordinary heap (row) table for

				  most operations.

				* Supports Write-Ahead Log (WAL).

				* Supports ``ROLLBACK``.

				* Supports physical replication.

				* Supports recovery, including Point-In-Time Restore (PITR).

				* Supports ``pg_dump`` and ``pg_upgrade`` without the need for special

				  options or extra steps.

				* Better user experience; simple ``USING`` clause.

				* Supports more features that work on ordinary heap (row) tables.

				# Limitations

				* Append-only (no ``UPDATE``/``DELETE`` support)

				* No space reclamation (e.g. rolled-back transactions may still

				  consume disk space)

				* No bitmap index scans

				* No tidscans

				* No sample scans

				* No TOAST support (large values supported inline)

				* No support for [``ON

				  CONFLICT``](https://www.postgresql.org/docs/12/sql-insert.html#SQL-ON-CONFLICT)

				  statements (except ``DO NOTHING`` actions with no target specified).

				* No support for tuple locks (``SELECT ... FOR SHARE``, ``SELECT

				  ... FOR UPDATE``)

				* No support for serializable isolation level

				* Support for PostgreSQL server versions 12+ only

				* No support for foreign keys

				* No support for logical decoding

				* No support for intra-node parallel scans

				* No support for ``AFTER ... FOR EACH ROW`` triggers

				* No `UNLOGGED` columnar tables

				Future iterations will incrementally lift the limitations listed above.

				# User Experience

				Create a Columnar table by specifying ``USING columnar`` when creating

				the table.

				```sql

				CREATE TABLE my_columnar_table

				(

				    id INT,

				    i1 INT,

				    i2 INT8,

				    n NUMERIC,

				    t TEXT

				) USING columnar;

				```

				Insert data into the table and read from it like normal (subject to

				the limitations listed above).

				To see internal statistics about the table, use ``VACUUM

				VERBOSE``. Note that ``VACUUM`` (without ``FULL``) is much faster on a

				columnar table, because it scans only the metadata, and not the actual

				data.

				## Options

				Set options using:

				```sql

				ALTER TABLE my_columnar_table SET

				  (columnar.compression = none, columnar.stripe_row_limit = 10000);

				```

				The following options are available:

				* **columnar.compression**: `[none|pglz|zstd|lz4]` - set the compression type

				  for _newly-inserted_ data. Existing data will not be

				  recompressed/decompressed. The default value is `zstd` if support

				  has been compiled in; otherwise it falls back to `lz4`, then `pglz`.

				* **columnar.compression_level**: ``<integer>`` - Sets compression level. Valid

				  settings are from 1 through 19. If the compression method does not

				  support the level chosen, the closest level will be selected

				  instead. The default value is `3`.

				* **columnar.stripe_row_limit**: ``<integer>`` - the maximum number of rows per

				  stripe for _newly-inserted_ data. Existing stripes of data will not

				  be changed and may have more rows than this maximum value. The

				  default value is `150000`.

				* **columnar.chunk_group_row_limit**: ``<integer>`` - the maximum number of rows per

				  chunk for _newly-inserted_ data. Existing chunks of data will not be

				  changed and may have more rows than this maximum value. The default

				  value is `10000`.
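The "closest level" rule for `columnar.compression_level` can be pictured as a clamp. The following is a minimal sketch in C, not the Citus implementation: `ClampCompressionLevel` and the per-method bounds passed to it are hypothetical names and values introduced only for illustration.

```c
#include <assert.h>

/*
 * ClampCompressionLevel is a hypothetical helper (not part of Citus). It
 * returns the level closest to requestedLevel that lies within
 * [minLevel, maxLevel], the range a compression method is assumed to support.
 */
int
ClampCompressionLevel(int requestedLevel, int minLevel, int maxLevel)
{
	assert(minLevel <= maxLevel);

	if (requestedLevel < minLevel)
	{
		return minLevel;
	}

	if (requestedLevel > maxLevel)
	{
		return maxLevel;
	}

	return requestedLevel;
}
```

Under an assumed supported range of 1 through 12, a requested level of 19 would be reduced to 12, while a level already inside the range is returned unchanged.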

				View options for all tables with:

				```sql

				SELECT * FROM columnar.options;

				```

				You can also adjust options with a `SET` command of one of the

				following GUCs:

				* `columnar.compression`

				* `columnar.compression_level`

				* `columnar.stripe_row_limit`

				* `columnar.chunk_group_row_limit`

				GUCs only affect newly-created *tables*, not any newly-created

				*stripes* on an existing table.

				## Partitioning

				Columnar tables can be used as partitions, and a partitioned table may

				be made up of any combination of row and columnar partitions.

				```sql

				CREATE TABLE parent(ts timestamptz, i int, n numeric, s text)

				  PARTITION BY RANGE (ts);

				-- columnar partition

				CREATE TABLE p0 PARTITION OF parent

				  FOR VALUES FROM ('2020-01-01') TO ('2020-02-01')

				  USING COLUMNAR;

				-- columnar partition

				CREATE TABLE p1 PARTITION OF parent

				  FOR VALUES FROM ('2020-02-01') TO ('2020-03-01')

				  USING COLUMNAR;

				-- row partition

				CREATE TABLE p2 PARTITION OF parent

				  FOR VALUES FROM ('2020-03-01') TO ('2020-04-01');

				INSERT INTO parent VALUES ('2020-01-15', 10, 100, 'one thousand'); -- columnar

				INSERT INTO parent VALUES ('2020-02-15', 20, 200, 'two thousand'); -- columnar

				INSERT INTO parent VALUES ('2020-03-15', 30, 300, 'three thousand'); -- row

				```

				When performing operations on a partitioned table with a mix of row

				and columnar partitions, take note of the following behaviors for

				operations that are supported on row tables but not columnar

				(e.g. ``UPDATE``, ``DELETE``, tuple locks, etc.):

				* If the operation is targeted at a specific row partition

				  (e.g. ``UPDATE p2 SET i = i + 1``), it will succeed; if targeted at

				  a specified columnar partition (e.g. ``UPDATE p1 SET i = i + 1``),

				  it will fail.

				* If the operation is targeted at the partitioned table and has a

				  ``WHERE`` clause that excludes all columnar partitions

				  (e.g. ``UPDATE parent SET i = i + 1 WHERE ts = '2020-03-15'``), it

				  will succeed.

				* If the operation is targeted at the partitioned table, but does not

				  exclude all columnar partitions, it will fail, even if the actual

				  data to be updated only affects row tables (e.g. ``UPDATE parent SET

				  i = i + 1 WHERE n = 300``).
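The three rules above reduce to one condition: an operation aimed at the partitioned table succeeds only if every columnar partition is excluded. The sketch below models that condition in C under stated assumptions; `PartitionModel` and `RowOnlyOperationSucceeds` are hypothetical names for illustration and do not correspond to planner code in PostgreSQL or Citus.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of one partition of the partitioned table. */
typedef struct PartitionModel
{
	bool isColumnar;      /* partition uses the columnar access method */
	bool excludedByWhere; /* the WHERE clause excludes this partition */
} PartitionModel;

/*
 * RowOnlyOperationSucceeds models an operation that row tables support but
 * columnar tables do not (e.g. UPDATE) when it targets the partitioned
 * table: it fails as soon as one columnar partition is not excluded.
 */
bool
RowOnlyOperationSucceeds(const PartitionModel *partitions, size_t partitionCount)
{
	for (size_t i = 0; i < partitionCount; i++)
	{
		if (partitions[i].isColumnar && !partitions[i].excludedByWhere)
		{
			return false;
		}
	}

	return true;
}
```

With `p0` and `p1` columnar and `p2` a row partition, a predicate on `ts` that excludes `p0` and `p1` gives `true`, whereas a predicate on `n` excludes nothing and gives `false`, matching the examples in the list.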

				Note that Citus Columnar supports `btree` and `hash` indexes (and

				the constraints requiring them) but does not support `gist`, `gin`,

				`spgist` and `brin` indexes.

				For this reason, if some partitions are columnar and if the index is

				not supported by Citus Columnar, then it's impossible to create indexes

				on the partitioned (parent) table directly. In that case, you need to

				create the index on the individual row partitions. The same applies to

				constraints that require indexes, e.g.:

				```sql

				CREATE INDEX p2_ts_idx ON p2 (ts);

				CREATE UNIQUE INDEX p2_i_unique ON p2 (i);

				ALTER TABLE p2 ADD UNIQUE (n);

				```

				## Converting Between Row and Columnar

				Note: ensure that you understand any advanced features that may be

				used with the table before converting it (e.g. row-level security,

				storage options, constraints, inheritance, etc.), and ensure that they

				are reproduced in the new table or partition appropriately. ``LIKE``,

				used below, is a shorthand that works only in simple cases.

				```sql

				CREATE TABLE my_table(i INT8 DEFAULT '7');

				INSERT INTO my_table VALUES(1);

				-- convert to columnar

				SELECT alter_table_set_access_method('my_table', 'columnar');

				-- back to row

				SELECT alter_table_set_access_method('my_table', 'heap');

				```

				# Performance Microbenchmark

				*Important*: This microbenchmark is not intended to represent any real

				 workload. Compression ratios, and therefore performance, will depend

				 heavily on the specific workload. This is only for the purpose of

				 illustrating a "columnar friendly" contrived workload that showcases

				 the benefits of columnar.

				## Schema

				```sql

				CREATE TABLE perf_row(

				    id INT8,

				    ts TIMESTAMPTZ,

				    customer_id INT8,

				    vendor_id INT8,

				    name TEXT,

				    description TEXT,

				    value NUMERIC,

				    quantity INT4

				);

				CREATE TABLE perf_columnar(LIKE perf_row) USING COLUMNAR;

				```

				## Data

				```sql

				CREATE OR REPLACE FUNCTION random_words(n INT4) RETURNS TEXT LANGUAGE sql AS $$

				  WITH words(w) AS (

				    SELECT ARRAY['zero','one','two','three','four','five','six','seven','eight','nine','ten']

				  ),

				  random (word) AS (

				    SELECT w[(random()*array_length(w, 1))::int] FROM generate_series(1, $1) AS i, words

				  )

				  SELECT string_agg(word, ' ') FROM random;

				$$;

				```

				```sql

				INSERT INTO perf_row

				   SELECT

				    g, -- id

				    '2020-01-01'::timestamptz + ('1 minute'::interval * g), -- ts

				    (random() * 1000000)::INT4, -- customer_id

				    (random() * 100)::INT4, -- vendor_id

				    random_words(7), -- name

				    random_words(100), -- description

				    (random() * 100000)::INT4/100.0, -- value

				    (random() * 100)::INT4 -- quantity

				   FROM generate_series(1,75000000) g;

				INSERT INTO perf_columnar SELECT * FROM perf_row;

				```

				## Compression Ratio

				```

				=> SELECT pg_total_relation_size('perf_row')::numeric/pg_total_relation_size('perf_columnar') AS compression_ratio;

				 compression_ratio

				--------------------

				 5.3958044063457513

				(1 row)

				```

				The overall compression ratio of the columnar table, versus the same data

				stored with row storage, is **5.4X**.

				```

				=> VACUUM VERBOSE perf_columnar;

				INFO:  statistics for "perf_columnar":

				storage id: 10000000000

				total file size: 8761368576, total data size: 8734266196

				compression rate: 5.01x

				total row count: 75000000, stripe count: 500, average rows per stripe: 150000

				chunk count: 60000, containing data for dropped columns: 0, zstd compressed: 60000

				```

				``VACUUM VERBOSE`` reports a smaller compression ratio, because it

				only averages the compression ratio of the individual chunks, and does

				not account for the metadata savings of the columnar format.
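The stripe and chunk counts in the `VACUUM VERBOSE` output above follow from the row limits. The sketch below reproduces that arithmetic in C. It assumes that every stripe is filled up to `columnar.stripe_row_limit`, that chunk groups never span stripes, and that one chunk is stored per column per chunk group; these assumptions are consistent with the reported numbers for the 8-column `perf_columnar` table but are not taken from the Citus source. `GroupCount` and `EstimatedChunkCount` are hypothetical helper names.

```c
#include <assert.h>
#include <stdint.h>

/* Ceiling division: groups of at most rowLimit rows needed for rowCount rows. */
uint64_t
GroupCount(uint64_t rowCount, uint64_t rowLimit)
{
	assert(rowLimit > 0);
	return (rowCount + rowLimit - 1) / rowLimit;
}

/*
 * EstimatedChunkCount assumes one chunk per column per chunk group, and
 * that a chunk group never spans two stripes.
 */
uint64_t
EstimatedChunkCount(uint64_t rowCount, uint64_t stripeRowLimit,
					uint64_t chunkGroupRowLimit, uint64_t columnCount)
{
	assert(stripeRowLimit > 0);

	uint64_t fullStripes = rowCount / stripeRowLimit;
	uint64_t remainingRows = rowCount % stripeRowLimit;

	/* chunk groups in the full stripes plus those in the last, partial stripe */
	uint64_t chunkGroups =
		fullStripes * GroupCount(stripeRowLimit, chunkGroupRowLimit) +
		GroupCount(remainingRows, chunkGroupRowLimit);

	return chunkGroups * columnCount;
}
```

For 75,000,000 rows with the default limits of 150,000 and 10,000, `GroupCount` gives 500 stripes and `EstimatedChunkCount` gives 60,000 chunks for 8 columns, which agrees with the reported `stripe count` and `chunk count`.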

				## System

				* Azure VM: Standard D2s v3 (2 vcpus, 8 GiB memory)

				* Linux (ubuntu 18.04)

				* Data Drive: Standard HDD (512GB, 500 IOPS Max, 60 MB/s Max)

				* PostgreSQL 13 (``--with-llvm``, ``--with-python``)

				* ``shared_buffers = 128MB``

				* ``max_parallel_workers_per_gather = 0``

				* ``jit = on``

				Note: because this was run on a system with enough physical memory to

				hold a substantial fraction of the table, the IO benefits of columnar

				won't be entirely realized by the query runtime unless the data size

				is substantially increased.

				## Query

				```sql

				-- OFFSET 1000 so that no rows are returned, and we collect only timings

				SELECT vendor_id, SUM(quantity) FROM perf_row GROUP BY vendor_id OFFSET 1000;

				SELECT vendor_id, SUM(quantity) FROM perf_row GROUP BY vendor_id OFFSET 1000;

				SELECT vendor_id, SUM(quantity) FROM perf_row GROUP BY vendor_id OFFSET 1000;

				SELECT vendor_id, SUM(quantity) FROM perf_columnar GROUP BY vendor_id OFFSET 1000;

				SELECT vendor_id, SUM(quantity) FROM perf_columnar GROUP BY vendor_id OFFSET 1000;

				SELECT vendor_id, SUM(quantity) FROM perf_columnar GROUP BY vendor_id OFFSET 1000;

				```

				Timing (median of three runs):

				 * row: 436s

				 * columnar: 16s

				 * speedup: **27X**

6

src/backend/columnar/citus_columnar.control Normal file


 @ -0,0 +1,6 @@
 # Columnar extension
 comment = 'Citus Columnar extension'
 default_version = '12.2-1'
 module_pathname = '$libdir/citus_columnar'
 relocatable = false
 schema = pg_catalog

									
										169

src/backend/columnar/columnar.c Normal file
									
				@ -0,0 +1,169 @@

				/*-------------------------------------------------------------------------

				 *

				 * columnar.c

				 *

				 * This file contains...

				 *

				 * Copyright (c) 2016, Citus Data, Inc.

				 *

				 * $Id$

				 *

				 *-------------------------------------------------------------------------

				 */

				#include <sys/stat.h>

				#include <unistd.h>

				#include "postgres.h"

				#include "miscadmin.h"

				#include "utils/guc.h"

				#include "utils/rel.h"

				#include "citus_version.h"

				#include "columnar/columnar.h"

				#include "columnar/columnar_tableam.h"

				/* Default values for option parameters */

				#define DEFAULT_STRIPE_ROW_COUNT 150000

				#define DEFAULT_CHUNK_ROW_COUNT 10000

				#if HAVE_LIBZSTD

				#define DEFAULT_COMPRESSION_TYPE COMPRESSION_ZSTD

				#elif HAVE_CITUS_LIBLZ4

				#define DEFAULT_COMPRESSION_TYPE COMPRESSION_LZ4

				#else

				#define DEFAULT_COMPRESSION_TYPE COMPRESSION_PG_LZ

				#endif

				int columnar_compression = DEFAULT_COMPRESSION_TYPE;

				int columnar_stripe_row_limit = DEFAULT_STRIPE_ROW_COUNT;

				int columnar_chunk_group_row_limit = DEFAULT_CHUNK_ROW_COUNT;

				int columnar_compression_level = 3;

				static const struct config_enum_entry columnar_compression_options[] =

				{

					{ "none", COMPRESSION_NONE, false },

					{ "pglz", COMPRESSION_PG_LZ, false },

				#if HAVE_CITUS_LIBLZ4

					{ "lz4", COMPRESSION_LZ4, false },

				#endif

				#if HAVE_LIBZSTD

					{ "zstd", COMPRESSION_ZSTD, false },

				#endif

					{ NULL, 0, false }

				};

				void

				columnar_init(void)

				{

					columnar_init_gucs();

					columnar_tableam_init();

				}

				void

				columnar_init_gucs()

				{

					DefineCustomEnumVariable("columnar.compression",

											 "Compression type for columnar.",

											 NULL,

											 &columnar_compression,

											 DEFAULT_COMPRESSION_TYPE,

											 columnar_compression_options,

											 PGC_USERSET,

											 0,

											 NULL,

											 NULL,

											 NULL);

					DefineCustomIntVariable("columnar.compression_level",

											"Compression level to be used with zstd.",

											NULL,

											&columnar_compression_level,

											3,

											COMPRESSION_LEVEL_MIN,

											COMPRESSION_LEVEL_MAX,

											PGC_USERSET,

											0,

											NULL,

											NULL,

											NULL);

					DefineCustomIntVariable("columnar.stripe_row_limit",

											"Maximum number of tuples per stripe.",

											NULL,

											&columnar_stripe_row_limit,

											DEFAULT_STRIPE_ROW_COUNT,

											STRIPE_ROW_COUNT_MINIMUM,

											STRIPE_ROW_COUNT_MAXIMUM,

											PGC_USERSET,

											0,

											NULL,

											NULL,

											NULL);

					DefineCustomIntVariable("columnar.chunk_group_row_limit",

											"Maximum number of rows per chunk.",

											NULL,

											&columnar_chunk_group_row_limit,

											DEFAULT_CHUNK_ROW_COUNT,

											CHUNK_ROW_COUNT_MINIMUM,

											CHUNK_ROW_COUNT_MAXIMUM,

											PGC_USERSET,

											0,

											NULL,

											NULL,

											NULL);

				}

				/*

				 * ParseCompressionType converts a string to a compression type.

				 * For compression algorithms that are invalid or not compiled, it

				 * returns COMPRESSION_TYPE_INVALID.

				 */

				CompressionType

				ParseCompressionType(const char *compressionTypeString)

				{

					Assert(compressionTypeString != NULL);

					for (int compressionIndex = 0;

						 columnar_compression_options[compressionIndex].name != NULL;

						 compressionIndex++)

					{

						const char *compressionName = columnar_compression_options[compressionIndex].name;

						if (strncmp(compressionTypeString, compressionName, NAMEDATALEN) == 0)

						{

							return columnar_compression_options[compressionIndex].val;

						}

					}

					return COMPRESSION_TYPE_INVALID;

				}

				/*

				 * CompressionTypeStr returns string representation of a compression type.

				 * For compression algorithms that are invalid or not compiled, it

				 * returns NULL.

				 */

				const char *

				CompressionTypeStr(CompressionType requestedType)

				{

					for (int compressionIndex = 0;

						 columnar_compression_options[compressionIndex].name != NULL;

						 compressionIndex++)

					{

						CompressionType compressionType =

							columnar_compression_options[compressionIndex].val;

						if (compressionType == requestedType)

						{

							return columnar_compression_options[compressionIndex].name;

						}

					}

					return NULL;

				}

									
										272

src/backend/columnar/columnar_compression.c Normal file
									
				@ -0,0 +1,272 @@

				/*-------------------------------------------------------------------------

				 *

				 * columnar_compression.c

				 *

				 * This file contains compression/decompression functions definitions

				 * used for columnar.

				 *

				 * Copyright (c) 2016, Citus Data, Inc.

				 *

				 * $Id$

				 *

				 *-------------------------------------------------------------------------

				 */

				#include "postgres.h"

				#include "common/pg_lzcompress.h"

				#include "lib/stringinfo.h"

				#include "citus_version.h"

				#include "pg_version_constants.h"

				#include "columnar/columnar_compression.h"

				#if HAVE_CITUS_LIBLZ4

				#include <lz4.h>

				#endif

				#if PG_VERSION_NUM >= PG_VERSION_16

				#include "varatt.h"

				#endif

				#if HAVE_LIBZSTD

				#include <zstd.h>

				#endif

				/*

				 *	The information at the start of the compressed data. This description is taken

				 *	from pg_lzcompress in pre-9.5 version of PostgreSQL.

				 */

				typedef struct ColumnarCompressHeader

				{

					int32 vl_len_;              /* varlena header (do not touch directly!) */

					int32 rawsize;

				} ColumnarCompressHeader;

				/*

				 * Utilities for manipulation of header information for compressed data

				 */

				#define COLUMNAR_COMPRESS_HDRSZ ((int32) sizeof(ColumnarCompressHeader))

				#define COLUMNAR_COMPRESS_RAWSIZE(ptr) (((ColumnarCompressHeader *) (ptr))->rawsize)

				#define COLUMNAR_COMPRESS_RAWDATA(ptr) (((char *) (ptr)) + COLUMNAR_COMPRESS_HDRSZ)

				#define COLUMNAR_COMPRESS_SET_RAWSIZE(ptr, \

													  len) (((ColumnarCompressHeader *) (ptr))->rawsize = \

																(len))

				/*

				 * CompressBuffer compresses the given buffer with the given compression type.

				 * outputBuffer is enlarged to contain the compressed data. The function returns

				 * true if compression is done and false otherwise.

				 * outputBuffer is valid only if the function returns true.

				 */

				bool

				CompressBuffer(StringInfo inputBuffer,

							   StringInfo outputBuffer,

							   CompressionType compressionType,

							   int compressionLevel)

				{

					switch (compressionType)

					{

				#if HAVE_CITUS_LIBLZ4

						case COMPRESSION_LZ4:

						{

							int maximumLength = LZ4_compressBound(inputBuffer->len);

							resetStringInfo(outputBuffer);

							enlargeStringInfo(outputBuffer, maximumLength);

							int compressedSize = LZ4_compress_default(inputBuffer->data,

																	  outputBuffer->data,

																	  inputBuffer->len, maximumLength);

							if (compressedSize <= 0)

							{

								elog(DEBUG1,

									 "failure in LZ4_compress_default, input size=%d, output size=%d",

									 inputBuffer->len, maximumLength);

								return false;

							}

							elog(DEBUG1, "compressed %d bytes to %d bytes", inputBuffer->len,

								 compressedSize);

							outputBuffer->len = compressedSize;

							return true;

						}

				#endif

				#if HAVE_LIBZSTD

						case COMPRESSION_ZSTD:

						{

							int maximumLength = ZSTD_compressBound(inputBuffer->len);

							resetStringInfo(outputBuffer);

							enlargeStringInfo(outputBuffer, maximumLength);

							size_t compressedSize = ZSTD_compress(outputBuffer->data,

																  outputBuffer->maxlen,

																  inputBuffer->data,

																  inputBuffer->len,

																  compressionLevel);

							if (ZSTD_isError(compressedSize))

							{

								ereport(WARNING, (errmsg("zstd compression failed"),

												  (errdetail("%s", ZSTD_getErrorName(compressedSize)))));

								return false;

							}

							outputBuffer->len = compressedSize;

							return true;

						}

				#endif

						case COMPRESSION_PG_LZ:

						{

							uint64 maximumLength = PGLZ_MAX_OUTPUT(inputBuffer->len) +

												   COLUMNAR_COMPRESS_HDRSZ;

							bool compressionResult = false;

							resetStringInfo(outputBuffer);

							enlargeStringInfo(outputBuffer, maximumLength);

							int32 compressedByteCount = pglz_compress((const char *) inputBuffer->data,

																	  inputBuffer->len,

																	  COLUMNAR_COMPRESS_RAWDATA(

																		  outputBuffer->data),

																	  PGLZ_strategy_always);

							if (compressedByteCount >= 0)

							{

								COLUMNAR_COMPRESS_SET_RAWSIZE(outputBuffer->data, inputBuffer->len);

								SET_VARSIZE_COMPRESSED(outputBuffer->data,

													   compressedByteCount + COLUMNAR_COMPRESS_HDRSZ);

								compressionResult = true;

							}

							if (compressionResult)

							{

								outputBuffer->len = VARSIZE(outputBuffer->data);

							}

							return compressionResult;

						}

						default:

						{

							return false;

						}

					}

				}

				/*

				 * DecompressBuffer decompresses the given buffer with the given compression

				 * type. This function returns the buffer as-is when no compression is applied.

				 */

				StringInfo

				DecompressBuffer(StringInfo buffer,

								 CompressionType compressionType,

								 uint64 decompressedSize)

				{

					switch (compressionType)

					{

						case COMPRESSION_NONE:

						{

							return buffer;

						}

				#if HAVE_CITUS_LIBLZ4

						case COMPRESSION_LZ4:

						{

							StringInfo decompressedBuffer = makeStringInfo();

							enlargeStringInfo(decompressedBuffer, decompressedSize);

							int lz4DecompressSize = LZ4_decompress_safe(buffer->data,

																		decompressedBuffer->data,

																		buffer->len,

																		decompressedSize);

							if (lz4DecompressSize != decompressedSize)

							{

								ereport(ERROR, (errmsg("cannot decompress the buffer"),

												errdetail("Expected " UINT64_FORMAT " bytes, but received %d bytes",

														  decompressedSize, lz4DecompressSize)));

							}

							decompressedBuffer->len = decompressedSize;

							return decompressedBuffer;

						}

				#endif

				#if HAVE_LIBZSTD

						case COMPRESSION_ZSTD:

						{

							StringInfo decompressedBuffer = makeStringInfo();

							enlargeStringInfo(decompressedBuffer, decompressedSize);

							size_t zstdDecompressSize = ZSTD_decompress(decompressedBuffer->data,

																		decompressedSize,

																		buffer->data,

																		buffer->len);

							if (ZSTD_isError(zstdDecompressSize))

							{

								ereport(ERROR, (errmsg("zstd decompression failed"),

												(errdetail("%s", ZSTD_getErrorName(

															   zstdDecompressSize)))));

							}

							if (zstdDecompressSize != decompressedSize)

							{

								ereport(ERROR, (errmsg("unexpected decompressed size"),

												errdetail("Expected " UINT64_FORMAT ", received %zu", decompressedSize,

														  zstdDecompressSize)));

							}

							decompressedBuffer->len = decompressedSize;

							return decompressedBuffer;

						}

				#endif

						case COMPRESSION_PG_LZ:

						{

							uint32 compressedDataSize = VARSIZE(buffer->data) - COLUMNAR_COMPRESS_HDRSZ;

							uint32 decompressedDataSize = COLUMNAR_COMPRESS_RAWSIZE(buffer->data);

							if (compressedDataSize + COLUMNAR_COMPRESS_HDRSZ != buffer->len)

							{

								ereport(ERROR, (errmsg("cannot decompress the buffer"),

												errdetail("Expected %u bytes, but received %u bytes",

														  compressedDataSize, buffer->len)));

							}

							char *decompressedData = palloc0(decompressedDataSize);

							int32 decompressedByteCount = pglz_decompress(COLUMNAR_COMPRESS_RAWDATA(

																			  buffer->data),

																		  compressedDataSize,

																		  decompressedData,

																		  decompressedDataSize, true);

							if (decompressedByteCount < 0)

							{

								ereport(ERROR, (errmsg("cannot decompress the buffer"),

												errdetail("compressed data is corrupted")));

							}

							StringInfo decompressedBuffer = palloc0(sizeof(StringInfoData));

							decompressedBuffer->data = decompressedData;

							decompressedBuffer->len = decompressedDataSize;

							decompressedBuffer->maxlen = decompressedDataSize;

							return decompressedBuffer;

						}

						default:

						{

							ereport(ERROR, (errmsg("unexpected compression type: %d", compressionType)));

						}

					}

				}

2130

src/backend/columnar/columnar_customscan.c Normal file

File diff suppressed because it is too large

									
										165

src/backend/columnar/columnar_debug.c Normal file
									
				@ -0,0 +1,165 @@

				/*-------------------------------------------------------------------------

				 *

				 * columnar_debug.c

				 *

				 * Helper functions to debug column store.

				 *

				 *-------------------------------------------------------------------------

				 */

				#include "postgres.h"

				#include "funcapi.h"

				#include "miscadmin.h"

				#include "access/nbtree.h"

				#include "access/table.h"

				#include "catalog/pg_am.h"

				#include "catalog/pg_type.h"

				#include "storage/fd.h"

				#include "storage/smgr.h"

				#include "utils/guc.h"

				#include "utils/memutils.h"

				#include "utils/rel.h"

				#include "utils/tuplestore.h"

				#include "pg_version_compat.h"

				#include "pg_version_constants.h"

				#include "columnar/columnar.h"

				#include "columnar/columnar_storage.h"

				#include "columnar/columnar_version_compat.h"

				static void MemoryContextTotals(MemoryContext context, MemoryContextCounters *counters);

				PG_FUNCTION_INFO_V1(columnar_store_memory_stats);

				PG_FUNCTION_INFO_V1(columnar_storage_info);

				/*

				 * columnar_store_memory_stats returns a record of 3 values: size of

				 * TopMemoryContext, TopTransactionContext, and Write State context.

				 */

				Datum

				columnar_store_memory_stats(PG_FUNCTION_ARGS)

				{

					const int resultColumnCount = 3;

					TupleDesc tupleDescriptor = CreateTemplateTupleDesc(resultColumnCount);

					TupleDescInitEntry(tupleDescriptor, (AttrNumber) 1, "TopMemoryContext",

									   INT8OID, -1, 0);

					TupleDescInitEntry(tupleDescriptor, (AttrNumber) 2, "TopTransactionContext",

									   INT8OID, -1, 0);

					TupleDescInitEntry(tupleDescriptor, (AttrNumber) 3, "WriteStateContext",

									   INT8OID, -1, 0);

					tupleDescriptor = BlessTupleDesc(tupleDescriptor);

					MemoryContextCounters transactionCounters = { 0 };

					MemoryContextCounters topCounters = { 0 };

					MemoryContextCounters writeStateCounters = { 0 };

					MemoryContextTotals(TopTransactionContext, &transactionCounters);

					MemoryContextTotals(TopMemoryContext, &topCounters);

					MemoryContextTotals(GetWriteContextForDebug(), &writeStateCounters);

					bool nulls[3] = { false };

					Datum values[3] = {

						Int64GetDatum(topCounters.totalspace),

						Int64GetDatum(transactionCounters.totalspace),

						Int64GetDatum(writeStateCounters.totalspace)

					};

					HeapTuple tuple = heap_form_tuple(tupleDescriptor, values, nulls);

					PG_RETURN_DATUM(HeapTupleGetDatum(tuple));

				}

				/*

				 * columnar_storage_info - UDF to return internal storage info for a columnar relation.

				 *

				 * DDL:

				 *  CREATE OR REPLACE FUNCTION columnar_storage_info(

				 *      rel regclass,

				 *      version_major OUT int4,

				 *      version_minor OUT int4,

				 *      storage_id OUT int8,

				 *      reserved_stripe_id OUT int8,

				 *      reserved_row_number OUT int8,

				 *      reserved_offset OUT int8)

				 *    STRICT

				 *    LANGUAGE c AS 'MODULE_PATHNAME', 'columnar_storage_info';

				 */

				Datum

				columnar_storage_info(PG_FUNCTION_ARGS)

				{

				#define STORAGE_INFO_NATTS 6

					Oid relid = PG_GETARG_OID(0);

					TupleDesc tupdesc;

					/* Build a tuple descriptor for our result type */

					if (get_call_result_type(fcinfo, NULL, &tupdesc) != TYPEFUNC_COMPOSITE)

					{

						elog(ERROR, "return type must be a row type");

					}

					if (tupdesc->natts != STORAGE_INFO_NATTS)

					{

						elog(ERROR, "return type must have %d columns", STORAGE_INFO_NATTS);

					}

					Relation rel = table_open(relid, AccessShareLock);

					if (!IsColumnarTableAmTable(relid))

					{

						ereport(ERROR, (errmsg("table \"%s\" is not a columnar table",

											   RelationGetRelationName(rel))));

					}

					Datum values[STORAGE_INFO_NATTS] = { 0 };

					bool nulls[STORAGE_INFO_NATTS] = { 0 };

					/*

					 * Pass force = true so that we can inspect metapages that are not the

					 * current version.

					 *

					 * NB: ensure the order and number of attributes correspond to DDL

					 * declaration.

					 */

					values[0] = Int32GetDatum(ColumnarStorageGetVersionMajor(rel, true));

					values[1] = Int32GetDatum(ColumnarStorageGetVersionMinor(rel, true));

					values[2] = Int64GetDatum(ColumnarStorageGetStorageId(rel, true));

					values[3] = Int64GetDatum(ColumnarStorageGetReservedStripeId(rel, true));

					values[4] = Int64GetDatum(ColumnarStorageGetReservedRowNumber(rel, true));

					values[5] = Int64GetDatum(ColumnarStorageGetReservedOffset(rel, true));

					/* release lock */

					table_close(rel, AccessShareLock);

					HeapTuple tuple = heap_form_tuple(tupdesc, values, nulls);

					PG_RETURN_DATUM(HeapTupleGetDatum(tuple));

				}

				/*

				 * MemoryContextTotals adds stats of the given memory context and its

				 * subtree to the given counters.

				 */

				static void

				MemoryContextTotals(MemoryContext context, MemoryContextCounters *counters)

				{

					if (context == NULL)

					{

						return;

					}

					MemoryContext child;

					for (child = context->firstchild; child != NULL; child = child->nextchild)

					{

						MemoryContextTotals(child, counters);

					}

					context->methods->stats(context, NULL, NULL, counters, true);

				}

2051

src/backend/columnar/columnar_metadata.c Normal file

File diff suppressed because it is too large

1694

src/backend/columnar/columnar_reader.c Normal file

File diff suppressed because it is too large

									
										866

src/backend/columnar/columnar_storage.c Normal file
									
				@ -0,0 +1,866 @@

				/*-------------------------------------------------------------------------

				 *

				 * columnar_storage.c

				 *

				 * Copyright (c) Citus Data, Inc.

				 *

				 * Low-level storage layer for columnar.

				 *   - Translates columnar read/write operations on logical offsets into operations on pages/blocks.

				 *   - Emits WAL.

				 *   - Reads/writes the columnar metapage.

				 *   - Reserves data offsets, stripe numbers, and row offsets.

				 *   - Truncation.

				 *

				 * Higher-level columnar operations deal with logical offsets and large

				 * contiguous buffers of data that need to be stored. But the buffer manager

				 * and WAL depend on formatted pages with headers, so these large buffers need

				 * to be written across many pages. This module translates the contiguous

				 * buffers into individual block reads/writes, and performs WAL when

				 * necessary.

				 *

				 * Storage layout: a metapage in block 0, followed by an empty page in block

				 * 1, followed by logical data starting at the first byte after the page

				 * header in block 2 (having logical offset ColumnarFirstLogicalOffset). (XXX:

				 * Block 1 is left empty for no particular reason. Reconsider?). A columnar

				 * table should always have at least 2 blocks.

				 *

				 * Reservation is done with a relation extension lock, and designed for

				 * concurrency, so the callers only need an ordinary lock on the

				 * relation. Initializing the metapage or truncating the relation require that

				 * the caller holds an AccessExclusiveLock. (XXX: New reservations of data are

				 * aligned onto a new page for no particular reason. Reconsider?).

				 *

				 *-------------------------------------------------------------------------

				 */

				#include "postgres.h"

				#include "miscadmin.h"

				#include "safe_lib.h"

				#include "access/generic_xlog.h"

				#include "catalog/storage.h"

				#include "storage/bufmgr.h"

				#include "storage/lmgr.h"

				#include "pg_version_compat.h"

				#include "columnar/columnar.h"

				#include "columnar/columnar_storage.h"

				/*

				 * Content of the first page in main fork, which stores metadata at file

				 * level.

				 */

				typedef struct ColumnarMetapage

				{

					/*

					 * Store version of file format used, so we can detect files from

					 * previous versions if we change file format.

					 */

					uint32 versionMajor;

					uint32 versionMinor;

					/*

					 * Each metadata table row is identified by a storageId.

					 * We store it also in the main fork so we can link metadata rows

					 * with data files.

					 */

					uint64 storageId;

					uint64 reservedStripeId; /* first unused stripe id */

					uint64 reservedRowNumber; /* first unused row number */

					uint64 reservedOffset; /* first unused byte offset */

					/*

					 * Flag set to true in the init fork. After an unlogged table reset (due

					 * to a crash), the init fork will be copied over the main fork. When

					 * trying to read an unlogged table, if this flag is set to true, we must

					 * clear the metadata for the table (because the actual data is gone,

					 * too), and clear the flag. We can cross-check that the table is

					 * UNLOGGED, and that the main fork is at the minimum size (no actual

					 * data).

					 *

					 * XXX: Not used yet; reserved field for later support for UNLOGGED.

					 */

					bool unloggedReset;

				} ColumnarMetapage;

				/* represents a "physical" block+offset address */

				typedef struct PhysicalAddr

				{

					BlockNumber blockno;

					uint32 offset;

				} PhysicalAddr;

				#define COLUMNAR_METAPAGE_BLOCKNO 0

				#define COLUMNAR_EMPTY_BLOCKNO 1

				#define COLUMNAR_INVALID_STRIPE_ID 0

				#define COLUMNAR_FIRST_STRIPE_ID 1

				#define OLD_METAPAGE_VERSION_HINT "Use \"VACUUM\" to upgrade the columnar table format " \

												  "version or run \"ALTER EXTENSION citus UPDATE\"."

				/* only for testing purposes */

				PG_FUNCTION_INFO_V1(test_columnar_storage_write_new_page);

				/*

				 * Map logical offsets to a physical page and offset where the data is kept.

				 */

				static inline PhysicalAddr

				LogicalToPhysical(uint64 logicalOffset)

				{

					PhysicalAddr addr;

					addr.blockno = logicalOffset / COLUMNAR_BYTES_PER_PAGE;

					addr.offset = SizeOfPageHeaderData + (logicalOffset % COLUMNAR_BYTES_PER_PAGE);

					return addr;

				}

				/*

				 * Map a physical page and offset address to a logical address.

				 */

				static inline uint64

				PhysicalToLogical(PhysicalAddr addr)

				{

					return COLUMNAR_BYTES_PER_PAGE * addr.blockno + addr.offset - SizeOfPageHeaderData;

				}

				static void ColumnarOverwriteMetapage(Relation relation,

													  ColumnarMetapage columnarMetapage);

				static ColumnarMetapage ColumnarMetapageRead(Relation rel, bool force);

				static void ReadFromBlock(Relation rel, BlockNumber blockno, uint32 offset,

										  char *buf, uint32 len, bool force);

				static void WriteToBlock(Relation rel, BlockNumber blockno, uint32 offset,

										 char *buf, uint32 len, bool clear);

				static uint64 AlignReservation(uint64 prevReservation);

				static bool ColumnarMetapageIsCurrent(ColumnarMetapage *metapage);

				static bool ColumnarMetapageIsOlder(ColumnarMetapage *metapage);

				static bool ColumnarMetapageIsNewer(ColumnarMetapage *metapage);

				static void ColumnarMetapageCheckVersion(Relation rel, ColumnarMetapage *metapage);

				/*

				 * ColumnarStorageInit - initialize a new metapage in an empty relation

				 * with the given storageId.

				 *

				 * Caller must hold AccessExclusiveLock on the relation.

				 */

				void

				ColumnarStorageInit(SMgrRelation srel, uint64 storageId)

				{

					BlockNumber nblocks = smgrnblocks(srel, MAIN_FORKNUM);

					if (nblocks > 0)

					{

						elog(ERROR,

							 "attempted to initialize metapage, but %u pages already exist",

							 nblocks);

					}

					/* create two pages */

				#if PG_VERSION_NUM >= PG_VERSION_16

					PGIOAlignedBlock block;

				#else

					PGAlignedBlock block;

				#endif

					Page page = block.data;

					/* write metapage */

					PageInit(page, BLCKSZ, 0);

					PageHeader phdr = (PageHeader) page;

					ColumnarMetapage metapage = { 0 };

					metapage.storageId = storageId;

					metapage.versionMajor = COLUMNAR_VERSION_MAJOR;

					metapage.versionMinor = COLUMNAR_VERSION_MINOR;

					metapage.reservedStripeId = COLUMNAR_FIRST_STRIPE_ID;

					metapage.reservedRowNumber = COLUMNAR_FIRST_ROW_NUMBER;

					metapage.reservedOffset = ColumnarFirstLogicalOffset;

					metapage.unloggedReset = false;

					memcpy_s(page + phdr->pd_lower, phdr->pd_upper - phdr->pd_lower,

							 (char *) &metapage, sizeof(ColumnarMetapage));

					phdr->pd_lower += sizeof(ColumnarMetapage);

					log_newpage(RelationPhysicalIdentifierBackend_compat(&srel), MAIN_FORKNUM,

								COLUMNAR_METAPAGE_BLOCKNO, page, true);

					PageSetChecksumInplace(page, COLUMNAR_METAPAGE_BLOCKNO);

					smgrextend(srel, MAIN_FORKNUM, COLUMNAR_METAPAGE_BLOCKNO, page, true);

					/* write empty page */

					PageInit(page, BLCKSZ, 0);

					log_newpage(RelationPhysicalIdentifierBackend_compat(&srel), MAIN_FORKNUM,

								COLUMNAR_EMPTY_BLOCKNO, page, true);

					PageSetChecksumInplace(page, COLUMNAR_EMPTY_BLOCKNO);

					smgrextend(srel, MAIN_FORKNUM, COLUMNAR_EMPTY_BLOCKNO, page, true);

					/*

					 * An immediate sync is required even if we xlog'd the page, because the

					 * write did not go through shared_buffers and therefore a concurrent

					 * checkpoint may have moved the redo pointer past our xlog record.

					 */

					smgrimmedsync(srel, MAIN_FORKNUM);

				}

				/*

				 * ColumnarStorageUpdateCurrent - update the metapage to the current

				 * version. No effect if the version already matches. If 'upgrade' is true,

				 * throw an error if metapage version is newer; if 'upgrade' is false, it's a

				 * downgrade, so throw an error if the metapage version is older.

				 *

				 * NB: caller must ensure that metapage already exists, which might not be the

				 * case on 10.0.

				 */

				void

				ColumnarStorageUpdateCurrent(Relation rel, bool upgrade, uint64 reservedStripeId,

											 uint64 reservedRowNumber, uint64 reservedOffset)

				{

					LockRelationForExtension(rel, ExclusiveLock);

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, true);

					if (ColumnarMetapageIsCurrent(&metapage))

					{

						/* nothing to do; release the extension lock before returning */

						UnlockRelationForExtension(rel, ExclusiveLock);

						return;

					}

					if (upgrade && ColumnarMetapageIsNewer(&metapage))

					{

						elog(ERROR, "found newer columnar metapage while upgrading");

					}

					if (!upgrade && ColumnarMetapageIsOlder(&metapage))

					{

						elog(ERROR, "found older columnar metapage while downgrading");

					}

					metapage.versionMajor = COLUMNAR_VERSION_MAJOR;

					metapage.versionMinor = COLUMNAR_VERSION_MINOR;

					/* storageId remains the same */

					metapage.reservedStripeId = reservedStripeId;

					metapage.reservedRowNumber = reservedRowNumber;

					metapage.reservedOffset = reservedOffset;

					ColumnarOverwriteMetapage(rel, metapage);

					UnlockRelationForExtension(rel, ExclusiveLock);

				}

				/*

				 * ColumnarStorageGetVersionMajor - return major version from the metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetVersionMajor(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.versionMajor;

				}

				/*

				 * ColumnarStorageGetVersionMinor - return minor version from the metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetVersionMinor(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.versionMinor;

				}

				/*

				 * ColumnarStorageGetStorageId - return storage ID from the metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetStorageId(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.storageId;

				}

				/*

				 * ColumnarStorageGetReservedStripeId - return reserved stripe ID from the

				 * metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetReservedStripeId(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.reservedStripeId;

				}

				/*

				 * ColumnarStorageGetReservedRowNumber - return reserved row number from the

				 * metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetReservedRowNumber(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.reservedRowNumber;

				}

				/*

				 * ColumnarStorageGetReservedOffset - return reserved offset from the metapage.

				 *

				 * Throw an error if the metapage is not the current version, unless

				 * 'force' is true.

				 */

				uint64

				ColumnarStorageGetReservedOffset(Relation rel, bool force)

				{

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, force);

					return metapage.reservedOffset;

				}

				/*

				 * ColumnarStorageIsCurrent - return true if metapage exists and is

				 * the current version.

				 */

				bool

				ColumnarStorageIsCurrent(Relation rel)

				{

					BlockNumber nblocks = smgrnblocks(RelationGetSmgr(rel), MAIN_FORKNUM);

					if (nblocks < 2)

					{

						return false;

					}

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, true);

					return ColumnarMetapageIsCurrent(&metapage);

				}

				/*

				 * ColumnarStorageReserveRowNumber returns reservedRowNumber and advances

				 * it for next row number reservation.

				 */

				uint64

				ColumnarStorageReserveRowNumber(Relation rel, uint64 nrows)

				{

					LockRelationForExtension(rel, ExclusiveLock);

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, false);

					uint64 firstRowNumber = metapage.reservedRowNumber;

					metapage.reservedRowNumber += nrows;

					ColumnarOverwriteMetapage(rel, metapage);

					UnlockRelationForExtension(rel, ExclusiveLock);

					return firstRowNumber;

				}

				/*

				 * ColumnarStorageReserveStripeId returns stripeId and advances it for next

				 * stripeId reservation.

				 * Note that this function doesn't handle row number reservation.

				 * See ColumnarStorageReserveRowNumber function.

				 */

				uint64

				ColumnarStorageReserveStripeId(Relation rel)

				{

					LockRelationForExtension(rel, ExclusiveLock);

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, false);

					uint64 stripeId = metapage.reservedStripeId;

					metapage.reservedStripeId++;

					ColumnarOverwriteMetapage(rel, metapage);

					UnlockRelationForExtension(rel, ExclusiveLock);

					return stripeId;

				}

				/*

				 * ColumnarStorageReserveData - reserve logical data offsets for writing.

				 */

				uint64

				ColumnarStorageReserveData(Relation rel, uint64 amount)

				{

					if (amount == 0)

					{

						return ColumnarInvalidLogicalOffset;

					}

					LockRelationForExtension(rel, ExclusiveLock);

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, false);

					uint64 alignedReservation = AlignReservation(metapage.reservedOffset);

					uint64 nextReservation = alignedReservation + amount;

					metapage.reservedOffset = nextReservation;

					/* write new reservation */

					ColumnarOverwriteMetapage(rel, metapage);

					/* last used PhysicalAddr of new reservation */

					PhysicalAddr final = LogicalToPhysical(nextReservation - 1);

					/* extend with new pages */

					BlockNumber nblocks = smgrnblocks(RelationGetSmgr(rel), MAIN_FORKNUM);

					while (nblocks <= final.blockno)

					{

						Buffer newBuffer = ReadBuffer(rel, P_NEW);

						Assert(BufferGetBlockNumber(newBuffer) == nblocks);

						ReleaseBuffer(newBuffer);

						nblocks++;

					}

					UnlockRelationForExtension(rel, ExclusiveLock);

					return alignedReservation;

				}

				/*

				 * ColumnarStorageRead - map the logical offset to a block and offset, then

				 * read the buffer from multiple blocks if necessary.

				 */

				void

				ColumnarStorageRead(Relation rel, uint64 logicalOffset, char *data, uint32 amount)

				{

					/* if there's no work to do, succeed even with invalid offset */

					if (amount == 0)

					{

						return;

					}

					if (!ColumnarLogicalOffsetIsValid(logicalOffset))

					{

						elog(ERROR,

							 "attempted columnar read on relation %u from invalid logical offset: "

							 UINT64_FORMAT,

							 rel->rd_id, logicalOffset);

					}

					uint64 read = 0;

					while (read < amount)

					{

						PhysicalAddr addr = LogicalToPhysical(logicalOffset + read);

						uint32 to_read = Min(amount - read, BLCKSZ - addr.offset);

						ReadFromBlock(rel, addr.blockno, addr.offset, data + read, to_read,

									  false);

						read += to_read;

					}

				}

				/*

				 * ColumnarStorageWrite - map the logical offset to a block and offset, then

				 * write the buffer across multiple blocks if necessary.

				 */

				void

				ColumnarStorageWrite(Relation rel, uint64 logicalOffset, char *data, uint32 amount)

				{

					/* if there's no work to do, succeed even with invalid offset */

					if (amount == 0)

					{

						return;

					}

					if (!ColumnarLogicalOffsetIsValid(logicalOffset))

					{

						elog(ERROR,

							 "attempted columnar write on relation %u to invalid logical offset: "

							 UINT64_FORMAT,

							 rel->rd_id, logicalOffset);

					}

					uint64 written = 0;

					while (written < amount)

					{

						PhysicalAddr addr = LogicalToPhysical(logicalOffset + written);

						uint64 to_write = Min(amount - written, BLCKSZ - addr.offset);

						WriteToBlock(rel, addr.blockno, addr.offset, data + written, to_write,

									 false);

						written += to_write;

					}

				}

				/*

				 * ColumnarStorageTruncate - truncate the columnar storage such that

				 * newDataReservation will be the first unused logical offset available. Free

				 * pages at the end of the relation.

				 *

				 * Caller must hold AccessExclusiveLock on the relation.

				 *

				 * Returns true if pages were truncated; false otherwise.

				 */

				bool

				ColumnarStorageTruncate(Relation rel, uint64 newDataReservation)

				{

					if (!ColumnarLogicalOffsetIsValid(newDataReservation))

					{

						elog(ERROR,

							 "attempted to truncate relation %u to invalid logical offset: " UINT64_FORMAT,

							 rel->rd_id, newDataReservation);

					}

					BlockNumber old_rel_pages = smgrnblocks(RelationGetSmgr(rel), MAIN_FORKNUM);

					if (old_rel_pages == 0)

					{

						/* nothing to do */

						return false;

					}

					LockRelationForExtension(rel, ExclusiveLock);

					ColumnarMetapage metapage = ColumnarMetapageRead(rel, false);

					if (metapage.reservedOffset < newDataReservation)

					{

						elog(ERROR,

							 "attempted to truncate relation %u to offset " UINT64_FORMAT \

							 " which is higher than existing offset " UINT64_FORMAT,

							 rel->rd_id, newDataReservation, metapage.reservedOffset);

					}

					if (metapage.reservedOffset == newDataReservation)

					{

						/* nothing to do */

						UnlockRelationForExtension(rel, ExclusiveLock);

						return false;

					}

					metapage.reservedOffset = newDataReservation;

					/* write new reservation */

					ColumnarOverwriteMetapage(rel, metapage);

					UnlockRelationForExtension(rel, ExclusiveLock);

					PhysicalAddr final = LogicalToPhysical(newDataReservation - 1);

					BlockNumber new_rel_pages = final.blockno + 1;

					Assert(new_rel_pages <= old_rel_pages);

					/*

					 * Truncate the storage. Note that RelationTruncate() takes care of

					 * Write Ahead Logging.

					 */

					if (new_rel_pages < old_rel_pages)

					{

						RelationTruncate(rel, new_rel_pages);

						return true;

					}

					return false;

				}

				/*

				 * ColumnarOverwriteMetapage writes given columnarMetapage back to metapage

				 * for given relation.

				 */

				static void

				ColumnarOverwriteMetapage(Relation relation, ColumnarMetapage columnarMetapage)

				{

					/* clear metapage because we are overwriting */

					bool clear = true;

					WriteToBlock(relation, COLUMNAR_METAPAGE_BLOCKNO, SizeOfPageHeaderData,

								 (char *) &columnarMetapage, sizeof(ColumnarMetapage), clear);

				}

				/*

				 * ColumnarMetapageRead - read the current contents of the metapage. Error if

				 * it does not exist. Throw an error if the metapage is not the current

				 * version, unless 'force' is true.

				 *

				 * NB: it's safe to read a different version of a metapage because we

				 * guarantee that fields will only be added and existing fields will never be

				 * changed. However, it's important that we don't depend on new fields being

				 * set properly when we read an old metapage; an old metapage should only be

				 * read for the purposes of upgrading or error checking.

				 */

				static ColumnarMetapage

				ColumnarMetapageRead(Relation rel, bool force)

				{

					BlockNumber nblocks = smgrnblocks(RelationGetSmgr(rel), MAIN_FORKNUM);

					if (nblocks == 0)

					{

						/*

						 * We only expect this to happen when upgrading citus.so. This is because,

						 * in the current version of columnar, we create the metapage immediately

						 * for columnar tables, i.e., right after creating the table.

						 * Older versions, however, created metapages lazily, i.e.,

						 * when first ingesting data into the columnar table.

						 */

						ereport(ERROR, (errmsg("columnar metapage for relation \"%s\" does not exist",

											   RelationGetRelationName(rel)),

										errhint(OLD_METAPAGE_VERSION_HINT)));

					}

					/*

					 * Regardless of "force" parameter, always force read metapage block.

					 * We will check metapage version in ColumnarMetapageCheckVersion

					 * depending on "force".

					 */

					bool forceReadBlock = true;

					ColumnarMetapage metapage;

					ReadFromBlock(rel, COLUMNAR_METAPAGE_BLOCKNO, SizeOfPageHeaderData,

								  (char *) &metapage, sizeof(ColumnarMetapage), forceReadBlock);

					if (!force)

					{

						ColumnarMetapageCheckVersion(rel, &metapage);

					}

					return metapage;

				}

				/*

				 * ReadFromBlock - read bytes from a page at the given offset. If 'force' is

				 * true, don't check pd_lower; useful when reading a metapage of unknown

				 * version.

				 */

				static void

				ReadFromBlock(Relation rel, BlockNumber blockno, uint32 offset, char *buf,

							  uint32 len, bool force)

				{

					Buffer buffer = ReadBuffer(rel, blockno);

					LockBuffer(buffer, BUFFER_LOCK_SHARE);

					Page page = BufferGetPage(buffer);

					PageHeader phdr = (PageHeader) page;

					if (BLCKSZ < offset + len || (!force && (phdr->pd_lower < offset + len)))

					{

						elog(ERROR,

							 "attempt to read columnar data of length %u from offset %u of block %u of relation %u",

							 len, offset, blockno, rel->rd_id);

					}

					memcpy_s(buf, len, page + offset, len);

					UnlockReleaseBuffer(buffer);

				}

				/*

				 * WriteToBlock - append data to a block, initializing if necessary, and emit

				 * WAL. If 'clear' is true, always clear the data on the page and reinitialize

				 * it first, and offset must be SizeOfPageHeaderData. Otherwise, offset must

				 * be equal to pd_lower and pd_lower will be set to the end of the written

				 * data.

				 */

				static void

				WriteToBlock(Relation rel, BlockNumber blockno, uint32 offset, char *buf,

							 uint32 len, bool clear)

				{

					Buffer buffer = ReadBuffer(rel, blockno);

					GenericXLogState *state = GenericXLogStart(rel);

					LockBuffer(buffer, BUFFER_LOCK_EXCLUSIVE);

					Page page = GenericXLogRegisterBuffer(state, buffer, GENERIC_XLOG_FULL_IMAGE);

					PageHeader phdr = (PageHeader) page;

					if (PageIsNew(page) || clear)

					{

						PageInit(page, BLCKSZ, 0);

					}

					if (phdr->pd_lower < offset || phdr->pd_upper - offset < len)

					{

						elog(ERROR,

							 "attempt to write columnar data of length %u to offset %u of block %u of relation %u",

							 len, offset, blockno, rel->rd_id);

					}

					/*

					 * After a transaction has been rolled back, we might be

					 * overwriting the rolled-back write, so phdr->pd_lower can be

					 * different from offset.

					 *

					 * We reset pd_lower to discard the rolled-back write.

					 *

					 * Given that we always align page reservation to the next page as of

					 * 10.2, having such a disk page is only possible if a write operation

					 * failed in an older version of columnar, and the user now attempts

					 * to write to that table in version >= 10.2.

					 */

					if (phdr->pd_lower > offset)

					{

						ereport(DEBUG4, (errmsg("overwriting page %u", blockno),

										 errdetail("This can happen after a roll-back.")));

						phdr->pd_lower = offset;

					}

					memcpy_s(page + phdr->pd_lower, phdr->pd_upper - phdr->pd_lower, buf, len);

					phdr->pd_lower += len;

					GenericXLogFinish(state);

					UnlockReleaseBuffer(buffer);

				}

				/*

				 * AlignReservation - given an unused logical byte offset, align it so that it

				 * falls at the start of a page.

				 *

				 * XXX: Reconsider whether we want/need to do this at all.

				 */

				static uint64

				AlignReservation(uint64 prevReservation)

				{

					PhysicalAddr prevAddr = LogicalToPhysical(prevReservation);

					uint64 alignedReservation = prevReservation;

					if (prevAddr.offset != SizeOfPageHeaderData)

					{

						/* not aligned; align on beginning of next page */

						PhysicalAddr initial = { 0 };

						initial.blockno = prevAddr.blockno + 1;

						initial.offset = SizeOfPageHeaderData;

						alignedReservation = PhysicalToLogical(initial);

					}

					Assert(alignedReservation >= prevReservation);

					return alignedReservation;

				}

				/*

				 * ColumnarMetapageIsCurrent - is the metapage at the latest version?

				 */

				static bool

				ColumnarMetapageIsCurrent(ColumnarMetapage *metapage)

				{

					return (metapage->versionMajor == COLUMNAR_VERSION_MAJOR &&

							metapage->versionMinor == COLUMNAR_VERSION_MINOR);

				}

				/*

				 * ColumnarMetapageIsOlder - is the metapage older than the current version?

				 */

				static bool

				ColumnarMetapageIsOlder(ColumnarMetapage *metapage)

				{

					return (metapage->versionMajor < COLUMNAR_VERSION_MAJOR ||

							(metapage->versionMajor == COLUMNAR_VERSION_MAJOR &&

							 (int) metapage->versionMinor < (int) COLUMNAR_VERSION_MINOR));

				}

				/*

				 * ColumnarMetapageIsNewer - is the metapage newer than the current version?

				 */

				static bool

				ColumnarMetapageIsNewer(ColumnarMetapage *metapage)

				{

					return (metapage->versionMajor > COLUMNAR_VERSION_MAJOR ||

							(metapage->versionMajor == COLUMNAR_VERSION_MAJOR &&

							 metapage->versionMinor > COLUMNAR_VERSION_MINOR));

				}

				/*

				 * ColumnarMetapageCheckVersion - throw an error if the metapage is

				 * not the current version.

				 */

				static void

				ColumnarMetapageCheckVersion(Relation rel, ColumnarMetapage *metapage)

				{

					if (!ColumnarMetapageIsCurrent(metapage))

					{

						ereport(ERROR, (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),

										errmsg(

											"attempted to access relation \"%s\", which uses an older columnar format",

											RelationGetRelationName(rel)),

										errdetail(

											"Columnar format version %d.%d is required, \"%s\" has version %d.%d.",

											COLUMNAR_VERSION_MAJOR, COLUMNAR_VERSION_MINOR,

											RelationGetRelationName(rel),

											metapage->versionMajor, metapage->versionMinor),

										errhint(OLD_METAPAGE_VERSION_HINT)));

					}

				}

				/*

				 * test_columnar_storage_write_new_page is a UDF only used for testing

				 * purposes. It could make more sense to define this in columnar_debug.c,

				 * but the storage layer doesn't expose ColumnarMetapage to any other files,

				 * so we define it here.

				 */

				Datum

				test_columnar_storage_write_new_page(PG_FUNCTION_ARGS)

				{

					Oid relationId = PG_GETARG_OID(0);

					Relation relation = relation_open(relationId, AccessShareLock);

					/*

					 * Allocate a new page, write some data there, and set the reserved offset

					 * to the start of that page. That way, on a subsequent write operation,

					 * the storage layer will try to overwrite the page that we allocated here.

					 */

					uint64 newPageOffset = ColumnarStorageGetReservedOffset(relation, false);

					ColumnarStorageReserveData(relation, 100);

					ColumnarStorageWrite(relation, newPageOffset, "foo_bar", 8);

					ColumnarMetapage metapage = ColumnarMetapageRead(relation, false);

					metapage.reservedOffset = newPageOffset;

					ColumnarOverwriteMetapage(relation, metapage);

					relation_close(relation, AccessShareLock);

					PG_RETURN_VOID();

				}
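
The logical-to-physical mapping, page-aligned reservation, and block-by-block write splitting in columnar_storage.c can be tried outside PostgreSQL with a small standalone sketch. The block size (8192), the page header size (24), and every `sketch_*` name below are assumptions made for illustration only; the real values come from BLCKSZ, SizeOfPageHeaderData, and COLUMNAR_BYTES_PER_PAGE in the PostgreSQL and columnar headers, and the real code goes through the buffer manager and WAL rather than plain arithmetic.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed sizes for illustration; not taken from the actual headers. */
#define SKETCH_BLCKSZ 8192u
#define SKETCH_PAGE_HEADER 24u
#define SKETCH_BYTES_PER_PAGE (SKETCH_BLCKSZ - SKETCH_PAGE_HEADER)

/* hypothetical stand-in for PhysicalAddr */
typedef struct SketchAddr
{
	uint32_t blockno;
	uint32_t offset;
} SketchAddr;

/* logical offset -> (block, offset within block, past the page header) */
static inline SketchAddr
sketch_logical_to_physical(uint64_t logical)
{
	SketchAddr addr;
	addr.blockno = (uint32_t) (logical / SKETCH_BYTES_PER_PAGE);
	addr.offset = SKETCH_PAGE_HEADER + (uint32_t) (logical % SKETCH_BYTES_PER_PAGE);
	return addr;
}

/* inverse of sketch_logical_to_physical */
static inline uint64_t
sketch_physical_to_logical(SketchAddr addr)
{
	return (uint64_t) SKETCH_BYTES_PER_PAGE * addr.blockno +
		   addr.offset - SKETCH_PAGE_HEADER;
}

/* round an unused logical offset up to the first data byte of a page */
static inline uint64_t
sketch_align(uint64_t prev)
{
	SketchAddr addr = sketch_logical_to_physical(prev);
	uint64_t aligned = prev;
	if (addr.offset != SKETCH_PAGE_HEADER)
	{
		addr.blockno += 1;
		addr.offset = SKETCH_PAGE_HEADER;
		aligned = sketch_physical_to_logical(addr);
	}
	assert(aligned >= prev);
	return aligned;
}

/* count the per-block pieces that a write of `amount` bytes is split into */
static inline uint32_t
sketch_count_pieces(uint64_t logical, uint32_t amount)
{
	uint32_t pieces = 0;
	uint64_t done = 0;
	while (done < amount)
	{
		SketchAddr addr = sketch_logical_to_physical(logical + done);
		uint64_t room = SKETCH_BLCKSZ - addr.offset; /* bytes left in this block */
		uint64_t left = amount - done;
		done += (left < room) ? left : room;
		pieces++;
	}
	return pieces;
}
```

Under these assumed sizes, logical offset 16336 maps to block 2 at offset 24 (the first data byte after the metapage and the empty page), `sketch_align(16337)` returns 24504 (the start of block 3), and a 500-byte write starting at logical offset 24336 is split into two pieces because only 168 bytes remain in block 2.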

3106

src/backend/columnar/columnar_tableam.c Normal file

File diff suppressed because it is too large

									
775

src/backend/columnar/columnar_writer.c Normal file
									
				@ -0,0 +1,775 @@

				/*-------------------------------------------------------------------------

				 *

				 * columnar_writer.c

				 *

				 * This file contains function definitions for writing columnar tables. This

				 * includes the logic for writing file level metadata, writing row stripes,

				 * and calculating chunk skip nodes.

				 *

				 * Copyright (c) 2016, Citus Data, Inc.

				 *

				 * $Id$

				 *

				 *-------------------------------------------------------------------------

				 */

				#include "postgres.h"

				#include "miscadmin.h"

				#include "safe_lib.h"

				#include "access/heapam.h"

				#include "access/nbtree.h"

				#include "catalog/pg_am.h"

				#include "storage/fd.h"

				#include "storage/smgr.h"

				#include "utils/guc.h"

				#include "utils/memutils.h"

				#include "utils/rel.h"

				#include "pg_version_compat.h"

				#include "pg_version_constants.h"

				#include "columnar/columnar.h"

				#include "columnar/columnar_storage.h"

				#include "columnar/columnar_version_compat.h"

				#if PG_VERSION_NUM >= PG_VERSION_16

				#include "storage/relfilelocator.h"

				#include "utils/relfilenumbermap.h"

				#else

				#include "utils/relfilenodemap.h"

				#endif

				struct ColumnarWriteState

				{

					TupleDesc tupleDescriptor;

					FmgrInfo **comparisonFunctionArray;

					RelFileLocator relfilelocator;

					MemoryContext stripeWriteContext;

					MemoryContext perTupleContext;

					StripeBuffers *stripeBuffers;

					StripeSkipList *stripeSkipList;

					EmptyStripeReservation *emptyStripeReservation;

					ColumnarOptions options;

					ChunkData *chunkData;

					List *chunkGroupRowCounts;

					/*

					 * compressionBuffer is used as temporary storage during data

					 * value compression. It is kept here to minimize memory

					 * allocations. It lives in stripeWriteContext and gets

					 * deallocated when that memory context is reset.

					 */

					StringInfo compressionBuffer;

				};

				static StripeBuffers * CreateEmptyStripeBuffers(uint32 stripeMaxRowCount,

																uint32 chunkRowCount,

																uint32 columnCount);

				static StripeSkipList * CreateEmptyStripeSkipList(uint32 stripeMaxRowCount,

																  uint32 chunkRowCount,

																  uint32 columnCount);

				static void FlushStripe(ColumnarWriteState *writeState);

				static StringInfo SerializeBoolArray(bool *boolArray, uint32 boolArrayLength);

				static void SerializeSingleDatum(StringInfo datumBuffer, Datum datum,

												 bool datumTypeByValue, int datumTypeLength,

												 char datumTypeAlign);

				static void SerializeChunkData(ColumnarWriteState *writeState, uint32 chunkIndex,

											   uint32 rowCount);

				static void UpdateChunkSkipNodeMinMax(ColumnChunkSkipNode *chunkSkipNode,

													  Datum columnValue, bool columnTypeByValue,

													  int columnTypeLength, Oid columnCollation,

													  FmgrInfo *comparisonFunction);

				static Datum DatumCopy(Datum datum, bool datumTypeByValue, int datumTypeLength);

				static StringInfo CopyStringInfo(StringInfo sourceString);

				/*

				 * ColumnarBeginWrite initializes a columnar data load operation and returns a table

				 * handle. This handle should be used for adding the row values and finishing the

				 * data load operation.

				 */

				ColumnarWriteState *

				ColumnarBeginWrite(RelFileLocator relfilelocator,

								   ColumnarOptions options,

								   TupleDesc tupleDescriptor)

				{

					/* get comparison function pointers for each of the columns */

					uint32 columnCount = tupleDescriptor->natts;

					FmgrInfo **comparisonFunctionArray = palloc0(columnCount * sizeof(FmgrInfo *));

					for (uint32 columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						FmgrInfo *comparisonFunction = NULL;

						FormData_pg_attribute *attributeForm = TupleDescAttr(tupleDescriptor,

																			 columnIndex);

						if (!attributeForm->attisdropped)

						{

							Oid typeId = attributeForm->atttypid;

							comparisonFunction = GetFunctionInfoOrNull(typeId, BTREE_AM_OID,

																	   BTORDER_PROC);

						}

						comparisonFunctionArray[columnIndex] = comparisonFunction;

					}

					/*

					 * We allocate all stripe specific data in the stripeWriteContext, and

					 * reset this memory context once we have flushed the stripe to the file.

					 * This is to avoid memory leaks.

					 */

					MemoryContext stripeWriteContext = AllocSetContextCreate(CurrentMemoryContext,

																			 "Stripe Write Memory Context",

																			 ALLOCSET_DEFAULT_SIZES);

					bool *columnMaskArray = palloc(columnCount * sizeof(bool));

					memset(columnMaskArray, true, columnCount * sizeof(bool));

					ChunkData *chunkData = CreateEmptyChunkData(columnCount, columnMaskArray,

																options.chunkRowCount);

					ColumnarWriteState *writeState = palloc0(sizeof(ColumnarWriteState));

					writeState->relfilelocator = relfilelocator;

					writeState->options = options;

					writeState->tupleDescriptor = CreateTupleDescCopy(tupleDescriptor);

					writeState->comparisonFunctionArray = comparisonFunctionArray;

					writeState->stripeBuffers = NULL;

					writeState->stripeSkipList = NULL;

					writeState->emptyStripeReservation = NULL;

					writeState->stripeWriteContext = stripeWriteContext;

					writeState->chunkData = chunkData;

					writeState->compressionBuffer = NULL;

					writeState->perTupleContext = AllocSetContextCreate(CurrentMemoryContext,

																		"Columnar per tuple context",

																		ALLOCSET_DEFAULT_SIZES);

					return writeState;

				}

				/*

				 * ColumnarWriteRow adds a row to the columnar table. If the stripe is not yet

				 * initialized, we create the structures that hold the stripe data and skip list.

				 * We then serialize each column value, append it to that column's serialized

				 * value buffer, and update the corresponding skip node. Every chunkRowCount

				 * insertions, the whole chunk is serialized and compressed. Once the stripe's

				 * row count reaches stripeRowCount, we flush the stripe and record its metadata.

				 *

				 * Returns the "row number" assigned to written row.

				 */

				uint64

				ColumnarWriteRow(ColumnarWriteState *writeState, Datum *columnValues, bool *columnNulls)

				{

					uint32 columnIndex = 0;

					StripeBuffers *stripeBuffers = writeState->stripeBuffers;

					StripeSkipList *stripeSkipList = writeState->stripeSkipList;

					uint32 columnCount = writeState->tupleDescriptor->natts;

					ColumnarOptions *options = &writeState->options;

					const uint32 chunkRowCount = options->chunkRowCount;

					ChunkData *chunkData = writeState->chunkData;

					MemoryContext oldContext = MemoryContextSwitchTo(writeState->stripeWriteContext);

					if (stripeBuffers == NULL)

					{

						stripeBuffers = CreateEmptyStripeBuffers(options->stripeRowCount,

																 chunkRowCount, columnCount);

						stripeSkipList = CreateEmptyStripeSkipList(options->stripeRowCount,

																   chunkRowCount, columnCount);

						writeState->stripeBuffers = stripeBuffers;

						writeState->stripeSkipList = stripeSkipList;

						writeState->compressionBuffer = makeStringInfo();

						Oid relationId = RelidByRelfilenumber(RelationTablespace_compat(

																  writeState->relfilelocator),

															  RelationPhysicalIdentifierNumber_compat(

																  writeState->relfilelocator));

						Relation relation = relation_open(relationId, NoLock);

						writeState->emptyStripeReservation =

							ReserveEmptyStripe(relation, columnCount, chunkRowCount,

											   options->stripeRowCount);

						relation_close(relation, NoLock);

						/*

						 * serializedValueBuffer lives in stripe write memory context so it needs to be

						 * initialized when the stripe is created.

						 */

						for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

						{

							chunkData->valueBufferArray[columnIndex] = makeStringInfo();

						}

					}

					uint32 chunkIndex = stripeBuffers->rowCount / chunkRowCount;

					uint32 chunkRowIndex = stripeBuffers->rowCount % chunkRowCount;

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						ColumnChunkSkipNode **chunkSkipNodeArray = stripeSkipList->chunkSkipNodeArray;

						ColumnChunkSkipNode *chunkSkipNode =

							&chunkSkipNodeArray[columnIndex][chunkIndex];

						if (columnNulls[columnIndex])

						{

							chunkData->existsArray[columnIndex][chunkRowIndex] = false;

						}

						else

						{

							FmgrInfo *comparisonFunction =

								writeState->comparisonFunctionArray[columnIndex];

							Form_pg_attribute attributeForm =

								TupleDescAttr(writeState->tupleDescriptor, columnIndex);

							bool columnTypeByValue = attributeForm->attbyval;

							int columnTypeLength = attributeForm->attlen;

							Oid columnCollation = attributeForm->attcollation;

							char columnTypeAlign = attributeForm->attalign;

							chunkData->existsArray[columnIndex][chunkRowIndex] = true;

							SerializeSingleDatum(chunkData->valueBufferArray[columnIndex],

												 columnValues[columnIndex], columnTypeByValue,

												 columnTypeLength, columnTypeAlign);

							UpdateChunkSkipNodeMinMax(chunkSkipNode, columnValues[columnIndex],

													  columnTypeByValue, columnTypeLength,

													  columnCollation, comparisonFunction);

						}

						chunkSkipNode->rowCount++;

					}

					stripeSkipList->chunkCount = chunkIndex + 1;

					/* the last row of the chunk was inserted, so serialize the chunk */

					if (chunkRowIndex == chunkRowCount - 1)

					{

						SerializeChunkData(writeState, chunkIndex, chunkRowCount);

					}

					uint64 writtenRowNumber = writeState->emptyStripeReservation->stripeFirstRowNumber +

											  stripeBuffers->rowCount;

					stripeBuffers->rowCount++;

					if (stripeBuffers->rowCount >= options->stripeRowCount)

					{

						ColumnarFlushPendingWrites(writeState);

					}

					MemoryContextSwitchTo(oldContext);

					return writtenRowNumber;

				}
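The chunk bookkeeping in ColumnarWriteRow comes down to integer division and remainder on the stripe's running row count. The sketch below isolates that arithmetic outside PostgreSQL. `RowPosition` and `LocateRow` are hypothetical names introduced only for this illustration; they are not part of the columnar writer.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Where a row lands inside the stripe that is currently being built. */
typedef struct RowPosition
{
	uint32_t chunkIndex;      /* which chunk of the stripe receives the row */
	uint32_t chunkRowIndex;   /* position of the row inside that chunk */
	bool completesChunk;      /* the row fills the chunk, so it can be serialized */
	bool completesStripe;     /* the row fills the stripe, so it can be flushed */
} RowPosition;

/*
 * LocateRow mirrors the arithmetic of ColumnarWriteRow. stripeRowsSoFar is the
 * number of rows already buffered in the stripe before this row is added.
 */
RowPosition
LocateRow(uint32_t stripeRowsSoFar, uint32_t chunkRowCount, uint32_t stripeRowCount)
{
	assert(chunkRowCount > 0);

	RowPosition position;
	position.chunkIndex = stripeRowsSoFar / chunkRowCount;
	position.chunkRowIndex = stripeRowsSoFar % chunkRowCount;
	position.completesChunk = (position.chunkRowIndex == chunkRowCount - 1);
	position.completesStripe = (stripeRowsSoFar + 1 >= stripeRowCount);

	return position;
}
```

With chunkRowCount = 4 and stripeRowCount = 10, the tenth row (stripeRowsSoFar = 9) lands in chunk 2 at index 1. It does not fill its chunk, which is why FlushStripe has to serialize a partially filled last chunk.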

				/*

				 * ColumnarEndWrite finishes a columnar data load operation. If we have an unflushed

				 * stripe, we flush it.

				 */

				void

				ColumnarEndWrite(ColumnarWriteState *writeState)

				{

					ColumnarFlushPendingWrites(writeState);

					MemoryContextDelete(writeState->stripeWriteContext);

					pfree(writeState->comparisonFunctionArray);

					FreeChunkData(writeState->chunkData);

					pfree(writeState);

				}

				void

				ColumnarFlushPendingWrites(ColumnarWriteState *writeState)

				{

					StripeBuffers *stripeBuffers = writeState->stripeBuffers;

					if (stripeBuffers != NULL)

					{

						MemoryContext oldContext = MemoryContextSwitchTo(writeState->stripeWriteContext);

						FlushStripe(writeState);

						MemoryContextReset(writeState->stripeWriteContext);

						/* set stripe data and skip list to NULL so they are recreated next time */

						writeState->stripeBuffers = NULL;

						writeState->stripeSkipList = NULL;

						MemoryContextSwitchTo(oldContext);

					}

				}

				/*

				 * ColumnarWritePerTupleContext

				 *

				 * Return per-tuple context for columnar write operation.

				 */

				MemoryContext

				ColumnarWritePerTupleContext(ColumnarWriteState *state)

				{

					return state->perTupleContext;

				}

				/*

				 * CreateEmptyStripeBuffers allocates an empty StripeBuffers structure with the given

				 * column count.

				 */

				static StripeBuffers *

				CreateEmptyStripeBuffers(uint32 stripeMaxRowCount, uint32 chunkRowCount,

										 uint32 columnCount)

				{

					uint32 columnIndex = 0;

					uint32 maxChunkCount = (stripeMaxRowCount / chunkRowCount) + 1;

					ColumnBuffers **columnBuffersArray = palloc0(columnCount * sizeof(ColumnBuffers *));

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						uint32 chunkIndex = 0;

						ColumnChunkBuffers **chunkBuffersArray =

							palloc0(maxChunkCount * sizeof(ColumnChunkBuffers *));

						for (chunkIndex = 0; chunkIndex < maxChunkCount; chunkIndex++)

						{

							chunkBuffersArray[chunkIndex] = palloc0(sizeof(ColumnChunkBuffers));

							chunkBuffersArray[chunkIndex]->existsBuffer = NULL;

							chunkBuffersArray[chunkIndex]->valueBuffer = NULL;

							chunkBuffersArray[chunkIndex]->valueCompressionType = COMPRESSION_NONE;

						}

						columnBuffersArray[columnIndex] = palloc0(sizeof(ColumnBuffers));

						columnBuffersArray[columnIndex]->chunkBuffersArray = chunkBuffersArray;

					}

					StripeBuffers *stripeBuffers = palloc0(sizeof(StripeBuffers));

					stripeBuffers->columnBuffersArray = columnBuffersArray;

					stripeBuffers->columnCount = columnCount;

					stripeBuffers->rowCount = 0;

					return stripeBuffers;

				}

				/*

				 * CreateEmptyStripeSkipList allocates an empty StripeSkipList structure with

				 * the given column count. This structure has enough chunks to hold statistics

				 * for stripeMaxRowCount rows.

				 */

				static StripeSkipList *

				CreateEmptyStripeSkipList(uint32 stripeMaxRowCount, uint32 chunkRowCount,

										  uint32 columnCount)

				{

					uint32 columnIndex = 0;

					uint32 maxChunkCount = (stripeMaxRowCount / chunkRowCount) + 1;

					ColumnChunkSkipNode **chunkSkipNodeArray =

						palloc0(columnCount * sizeof(ColumnChunkSkipNode *));

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						chunkSkipNodeArray[columnIndex] =

							palloc0(maxChunkCount * sizeof(ColumnChunkSkipNode));

					}

					StripeSkipList *stripeSkipList = palloc0(sizeof(StripeSkipList));

					stripeSkipList->columnCount = columnCount;

					stripeSkipList->chunkCount = 0;

					stripeSkipList->chunkSkipNodeArray = chunkSkipNodeArray;

					return stripeSkipList;

				}

				/*

				 * FlushStripe flushes current stripe data into the file. The function first ensures

				 * the last data chunk for each column is properly serialized and compressed. Then,

				 * the function creates the skip list and footer buffers. Finally, the function

				 * flushes the skip list, data, and footer buffers to the file.

				 */

				static void

				FlushStripe(ColumnarWriteState *writeState)

				{

					uint32 columnIndex = 0;

					uint32 chunkIndex = 0;

					StripeBuffers *stripeBuffers = writeState->stripeBuffers;

					StripeSkipList *stripeSkipList = writeState->stripeSkipList;

					ColumnChunkSkipNode **columnSkipNodeArray = stripeSkipList->chunkSkipNodeArray;

					TupleDesc tupleDescriptor = writeState->tupleDescriptor;

					uint32 columnCount = tupleDescriptor->natts;

					uint32 chunkCount = stripeSkipList->chunkCount;

					uint32 chunkRowCount = writeState->options.chunkRowCount;

					uint32 lastChunkIndex = stripeBuffers->rowCount / chunkRowCount;

					uint32 lastChunkRowCount = stripeBuffers->rowCount % chunkRowCount;

					uint64 stripeSize = 0;

					uint64 stripeRowCount = stripeBuffers->rowCount;

					elog(DEBUG1, "Flushing Stripe of size %u", stripeBuffers->rowCount);

					Oid relationId = RelidByRelfilenumber(RelationTablespace_compat(

															  writeState->relfilelocator),

														  RelationPhysicalIdentifierNumber_compat(

															  writeState->relfilelocator));

					Relation relation = relation_open(relationId, NoLock);

					/*

					 * Check whether the last chunk still needs serialization. It has not been

					 * serialized yet if it is only partially filled, i.e. (lastChunkRowCount > 0).

					 */

					if (lastChunkRowCount > 0)

					{

						SerializeChunkData(writeState, lastChunkIndex, lastChunkRowCount);

					}

					/* update buffer sizes in stripe skip list */

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						ColumnChunkSkipNode *chunkSkipNodeArray = columnSkipNodeArray[columnIndex];

						ColumnBuffers *columnBuffers = stripeBuffers->columnBuffersArray[columnIndex];

						for (chunkIndex = 0; chunkIndex < chunkCount; chunkIndex++)

						{

							ColumnChunkBuffers *chunkBuffers =

								columnBuffers->chunkBuffersArray[chunkIndex];

							uint64 existsBufferSize = chunkBuffers->existsBuffer->len;

							ColumnChunkSkipNode *chunkSkipNode = &chunkSkipNodeArray[chunkIndex];

							chunkSkipNode->existsChunkOffset = stripeSize;

							chunkSkipNode->existsLength = existsBufferSize;

							stripeSize += existsBufferSize;

						}

						for (chunkIndex = 0; chunkIndex < chunkCount; chunkIndex++)

						{

							ColumnChunkBuffers *chunkBuffers =

								columnBuffers->chunkBuffersArray[chunkIndex];

							uint64 valueBufferSize = chunkBuffers->valueBuffer->len;

							CompressionType valueCompressionType = chunkBuffers->valueCompressionType;

							ColumnChunkSkipNode *chunkSkipNode = &chunkSkipNodeArray[chunkIndex];

							chunkSkipNode->valueChunkOffset = stripeSize;

							chunkSkipNode->valueLength = valueBufferSize;

							chunkSkipNode->valueCompressionType = valueCompressionType;

							chunkSkipNode->valueCompressionLevel = writeState->options.compressionLevel;

							chunkSkipNode->decompressedValueSize = chunkBuffers->decompressedValueSize;

							stripeSize += valueBufferSize;

						}

					}

					StripeMetadata *stripeMetadata =

						CompleteStripeReservation(relation, writeState->emptyStripeReservation->stripeId,

												  stripeSize, stripeRowCount, chunkCount);

					uint64 currentFileOffset = stripeMetadata->fileOffset;

					/*

					 * Each stripe has only one section:

					 * Data section, in which we store data for each column contiguously.

					 * We store data for each column in chunks. For each chunk, we

					 * store two buffers: "exists" buffer, and "value" buffer. "exists" buffer

					 * tells which values are not NULL. "value" buffer contains values for

					 * present values. For each column, we first store all "exists" buffers,

					 * and then all "value" buffers.

					 */

					/* flush the data buffers */

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						ColumnBuffers *columnBuffers = stripeBuffers->columnBuffersArray[columnIndex];

						for (chunkIndex = 0; chunkIndex < stripeSkipList->chunkCount; chunkIndex++)

						{

							ColumnChunkBuffers *chunkBuffers =

								columnBuffers->chunkBuffersArray[chunkIndex];

							StringInfo existsBuffer = chunkBuffers->existsBuffer;

							ColumnarStorageWrite(relation, currentFileOffset,

												 existsBuffer->data, existsBuffer->len);

							currentFileOffset += existsBuffer->len;

						}

						for (chunkIndex = 0; chunkIndex < stripeSkipList->chunkCount; chunkIndex++)

						{

							ColumnChunkBuffers *chunkBuffers =

								columnBuffers->chunkBuffersArray[chunkIndex];

							StringInfo valueBuffer = chunkBuffers->valueBuffer;

							ColumnarStorageWrite(relation, currentFileOffset,

												 valueBuffer->data, valueBuffer->len);

							currentFileOffset += valueBuffer->len;

						}

					}

					SaveChunkGroups(writeState->relfilelocator,

									stripeMetadata->id,

									writeState->chunkGroupRowCounts);

					SaveStripeSkipList(writeState->relfilelocator,

									   stripeMetadata->id,

									   stripeSkipList, tupleDescriptor);

					writeState->chunkGroupRowCounts = NIL;

					relation_close(relation, NoLock);

				}
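The offset pass in FlushStripe can be read as a running sum: for each column, all "exists" buffers are laid out first, then all "value" buffers, and each chunk's offset is the stripe size accumulated so far. A minimal sketch of that pass follows, assuming a flattened [column][chunk] array. `ChunkLayout` and `AssignChunkOffsets` are hypothetical names for this illustration and stand in for the skip node fields used above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Offsets and lengths of one chunk's two buffers, relative to the stripe start. */
typedef struct ChunkLayout
{
	uint64_t existsOffset;
	uint64_t existsLength;
	uint64_t valueOffset;
	uint64_t valueLength;
} ChunkLayout;

/*
 * AssignChunkOffsets expects existsLength and valueLength to be filled in and
 * computes the offsets. layout is indexed as [columnIndex * chunkCount + chunkIndex].
 * Returns the total stripe size in bytes.
 */
uint64_t
AssignChunkOffsets(ChunkLayout *layout, uint32_t columnCount, uint32_t chunkCount)
{
	assert(layout != NULL || columnCount == 0 || chunkCount == 0);

	uint64_t stripeSize = 0;

	for (uint32_t columnIndex = 0; columnIndex < columnCount; columnIndex++)
	{
		ChunkLayout *column = layout + (size_t) columnIndex * chunkCount;

		/* first all "exists" buffers of this column */
		for (uint32_t chunkIndex = 0; chunkIndex < chunkCount; chunkIndex++)
		{
			column[chunkIndex].existsOffset = stripeSize;
			stripeSize += column[chunkIndex].existsLength;
		}

		/* then all "value" buffers of this column */
		for (uint32_t chunkIndex = 0; chunkIndex < chunkCount; chunkIndex++)
		{
			column[chunkIndex].valueOffset = stripeSize;
			stripeSize += column[chunkIndex].valueLength;
		}
	}

	return stripeSize;
}
```

Because the write loop walks the buffers in the same order, adding these offsets to the stripe's file offset gives the position of every buffer on disk.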

				/*

				 * SerializeBoolArray serializes the given boolean array and returns the result

				 * as a StringInfo. This function packs every 8 boolean values into one byte.

				 */

				static StringInfo

				SerializeBoolArray(bool *boolArray, uint32 boolArrayLength)

				{

					uint32 boolArrayIndex = 0;

					uint32 byteCount = ((boolArrayLength * sizeof(bool)) + (8 - sizeof(bool))) / 8;

					StringInfo boolArrayBuffer = makeStringInfo();

					enlargeStringInfo(boolArrayBuffer, byteCount);

					boolArrayBuffer->len = byteCount;

					memset(boolArrayBuffer->data, 0, byteCount);

					for (boolArrayIndex = 0; boolArrayIndex < boolArrayLength; boolArrayIndex++)

					{

						if (boolArray[boolArrayIndex])

						{

							uint32 byteIndex = boolArrayIndex / 8;

							uint32 bitIndex = boolArrayIndex % 8;

							boolArrayBuffer->data[byteIndex] |= (1 << bitIndex);

						}

					}

					return boolArrayBuffer;

				}
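The packing scheme of SerializeBoolArray can be tried in isolation. The sketch below uses plain C arrays instead of StringInfo and assumes the same bit order as above (element i goes to bit i % 8 of byte i / 8). `PackedByteCount`, `PackBools`, and `UnpackBool` are hypothetical names for this illustration. When sizeof(bool) is 1, the byteCount expression above reduces to the (n + 7) / 8 rounding used here.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Number of bytes needed to hold bitCount bits, rounded up. */
size_t
PackedByteCount(size_t bitCount)
{
	return (bitCount + 7) / 8;
}

/*
 * PackBools writes one bit per array element, least significant bit first.
 * out must have room for PackedByteCount(boolCount) bytes.
 */
void
PackBools(const bool *boolArray, size_t boolCount, uint8_t *out)
{
	assert(boolArray != NULL && out != NULL);

	memset(out, 0, PackedByteCount(boolCount));

	for (size_t i = 0; i < boolCount; i++)
	{
		if (boolArray[i])
		{
			out[i / 8] |= (uint8_t) (1u << (i % 8));
		}
	}
}

/* UnpackBool reads element i back from a packed buffer. */
bool
UnpackBool(const uint8_t *packed, size_t i)
{
	return ((packed[i / 8] >> (i % 8)) & 1u) != 0;
}
```

Ten flags therefore occupy two bytes, and the unused high bits of the last byte stay zero.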

				/*

				 * SerializeSingleDatum serializes the given datum value and appends it to the

				 * provided string info buffer.

				 *

				 * Since we don't want to limit datum buffer size to RSIZE_MAX unnecessarily,

				 * we use memcpy instead of memcpy_s in several places in this function.

				 */

				static void

				SerializeSingleDatum(StringInfo datumBuffer, Datum datum, bool datumTypeByValue,

									 int datumTypeLength, char datumTypeAlign)

				{

					uint32 datumLength = att_addlength_datum(0, datumTypeLength, datum);

					uint32 datumLengthAligned = att_align_nominal(datumLength, datumTypeAlign);

					enlargeStringInfo(datumBuffer, datumLengthAligned);

					char *currentDatumDataPointer = datumBuffer->data + datumBuffer->len;

					memset(currentDatumDataPointer, 0, datumLengthAligned);

					if (datumTypeLength > 0)

					{

						if (datumTypeByValue)

						{

							store_att_byval(currentDatumDataPointer, datum, datumTypeLength);

						}

						else

						{

							memcpy(currentDatumDataPointer, DatumGetPointer(datum), datumTypeLength); /* IGNORE-BANNED */

						}

					}

					else

					{

						Assert(!datumTypeByValue);

						memcpy(currentDatumDataPointer, DatumGetPointer(datum), datumLength); /* IGNORE-BANNED */

					}

					datumBuffer->len += datumLengthAligned;

				}
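SerializeSingleDatum pads every value up to its type alignment and zero-fills the padding before copying the payload. The following sketch shows that idea with a numeric power-of-two alignment. This is an assumption made for illustration: PostgreSQL encodes alignment as a type code that att_align_nominal interprets. `AlignUp` and `AppendAligned` are hypothetical helpers, not PostgreSQL functions.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Round length up to the next multiple of alignment (a power of two). */
size_t
AlignUp(size_t length, size_t alignment)
{
	assert(alignment > 0 && (alignment & (alignment - 1)) == 0);

	return (length + alignment - 1) & ~(alignment - 1);
}

/*
 * AppendAligned copies valueLength bytes to buffer + used and zero-fills the
 * padding up to the aligned length. The caller must ensure the buffer has
 * enough capacity. Returns the new number of used bytes.
 */
size_t
AppendAligned(uint8_t *buffer, size_t used, const void *value,
			  size_t valueLength, size_t alignment)
{
	assert(buffer != NULL && value != NULL);

	size_t paddedLength = AlignUp(valueLength, alignment);

	memset(buffer + used, 0, paddedLength);
	memcpy(buffer + used, value, valueLength);

	return used + paddedLength;
}
```

Zeroing the padding keeps the serialized bytes deterministic, so identical inputs produce identical buffers.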

				/*

				 * SerializeChunkData serializes and compresses chunk data at given chunk index with given

				 * compression type for every column.

				 */

				static void

				SerializeChunkData(ColumnarWriteState *writeState, uint32 chunkIndex, uint32 rowCount)

				{

					uint32 columnIndex = 0;

					StripeBuffers *stripeBuffers = writeState->stripeBuffers;

					ChunkData *chunkData = writeState->chunkData;

					CompressionType requestedCompressionType = writeState->options.compressionType;

					int compressionLevel = writeState->options.compressionLevel;

					const uint32 columnCount = stripeBuffers->columnCount;

					StringInfo compressionBuffer = writeState->compressionBuffer;

					writeState->chunkGroupRowCounts =

						lappend_int(writeState->chunkGroupRowCounts, rowCount);

					/* serialize the exists arrays; the data values are already serialized */

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						ColumnBuffers *columnBuffers = stripeBuffers->columnBuffersArray[columnIndex];

						ColumnChunkBuffers *chunkBuffers = columnBuffers->chunkBuffersArray[chunkIndex];

						chunkBuffers->existsBuffer =

							SerializeBoolArray(chunkData->existsArray[columnIndex], rowCount);

					}

					/*

					 * Try to compress each value buffer. If a value buffer is not compressible,

					 * keep it uncompressed. Either way, record the compression type actually used.

					 */

					for (columnIndex = 0; columnIndex < columnCount; columnIndex++)

					{

						ColumnBuffers *columnBuffers = stripeBuffers->columnBuffersArray[columnIndex];

						ColumnChunkBuffers *chunkBuffers = columnBuffers->chunkBuffersArray[chunkIndex];

						CompressionType actualCompressionType = COMPRESSION_NONE;

						StringInfo serializedValueBuffer = chunkData->valueBufferArray[columnIndex];

						Assert(requestedCompressionType >= 0 &&

							   requestedCompressionType < COMPRESSION_COUNT);

						chunkBuffers->decompressedValueSize =

							chunkData->valueBufferArray[columnIndex]->len;

						/*

						 * If serializedValueBuffer can be compressed, point it at the compressed

						 * data and record the compression type.

						 */

						bool compressed = CompressBuffer(serializedValueBuffer, compressionBuffer,

														 requestedCompressionType,

														 compressionLevel);

						if (compressed)

						{

							serializedValueBuffer = compressionBuffer;

							actualCompressionType = requestedCompressionType;

						}

						/* store (compressed) value buffer */

						chunkBuffers->valueCompressionType = actualCompressionType;

						chunkBuffers->valueBuffer = CopyStringInfo(serializedValueBuffer);

						/* valueBuffer needs to be reset for next chunk's data */

						resetStringInfo(chunkData->valueBufferArray[columnIndex]);

					}

				}
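The compress-or-keep decision in SerializeChunkData follows a common pattern: attempt compression into a scratch buffer, and fall back to the raw bytes when the attempt does not pay off. The sketch below illustrates the pattern with a toy run-length encoder. The encoder, the `StoredEncoding` enum, and both function names are assumptions for this example only; the behavior of the real CompressBuffer is not shown in this file.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

typedef enum StoredEncoding
{
	STORED_RAW = 0,
	STORED_RLE = 1
} StoredEncoding;

/*
 * TryRunLengthEncode writes (runLength, byte) pairs to output, which must have
 * room for inputLength bytes. Returns false, leaving *outputLength untouched,
 * if the encoded form would not be smaller than the input.
 */
bool
TryRunLengthEncode(const uint8_t *input, size_t inputLength,
				   uint8_t *output, size_t *outputLength)
{
	assert(input != NULL && output != NULL && outputLength != NULL);

	if (inputLength == 0)
	{
		return false;
	}

	size_t written = 0;
	size_t i = 0;
	while (i < inputLength)
	{
		uint8_t byte = input[i];
		size_t run = 1;
		while (i + run < inputLength && input[i + run] == byte && run < 255)
		{
			run++;
		}

		/* give up as soon as the output can no longer be smaller */
		if (written + 2 >= inputLength)
		{
			return false;
		}

		output[written++] = (uint8_t) run;
		output[written++] = byte;
		i += run;
	}

	*outputLength = written;
	return true;
}

/* EncodeChunk picks the buffer to store and reports which encoding was used. */
StoredEncoding
EncodeChunk(const uint8_t *raw, size_t rawLength, uint8_t *scratch,
			const uint8_t **stored, size_t *storedLength)
{
	if (TryRunLengthEncode(raw, rawLength, scratch, storedLength))
	{
		*stored = scratch;
		return STORED_RLE;
	}

	*stored = raw;
	*storedLength = rawLength;
	return STORED_RAW;
}
```

Recording the encoding next to each buffer is what lets a reader decide per chunk whether decompression is needed, which matches the per-chunk valueCompressionType stored above.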

				/*

				 * UpdateChunkSkipNodeMinMax takes the given column value, and checks if this

				 * value falls outside the range of minimum/maximum values of the given column

				 * chunk skip node. If it does, the function updates the column chunk skip node

				 * accordingly.

				 */

				static void

				UpdateChunkSkipNodeMinMax(ColumnChunkSkipNode *chunkSkipNode, Datum columnValue,

										  bool columnTypeByValue, int columnTypeLength,

										  Oid columnCollation, FmgrInfo *comparisonFunction)

				{

					bool hasMinMax = chunkSkipNode->hasMinMax;

					Datum previousMinimum = chunkSkipNode->minimumValue;

					Datum previousMaximum = chunkSkipNode->maximumValue;

					Datum currentMinimum = 0;

					Datum currentMaximum = 0;

					/* if type doesn't have a comparison function, skip min/max values */

					if (comparisonFunction == NULL)

					{

						return;

					}

					if (!hasMinMax)

					{

						currentMinimum = DatumCopy(columnValue, columnTypeByValue, columnTypeLength);

						currentMaximum = DatumCopy(columnValue, columnTypeByValue, columnTypeLength);

					}

					else

					{

						Datum minimumComparisonDatum = FunctionCall2Coll(comparisonFunction,

																		 columnCollation, columnValue,

																		 previousMinimum);

						Datum maximumComparisonDatum = FunctionCall2Coll(comparisonFunction,

																		 columnCollation, columnValue,

																		 previousMaximum);

						int minimumComparison = DatumGetInt32(minimumComparisonDatum);

						int maximumComparison = DatumGetInt32(maximumComparisonDatum);

						if (minimumComparison < 0)

						{

							currentMinimum = DatumCopy(columnValue, columnTypeByValue, columnTypeLength);

						}

						else

						{

							currentMinimum = previousMinimum;

						}

						if (maximumComparison > 0)

						{

							currentMaximum = DatumCopy(columnValue, columnTypeByValue, columnTypeLength);

						}

						else

						{

							currentMaximum = previousMaximum;

						}

					}

					chunkSkipNode->hasMinMax = true;

					chunkSkipNode->minimumValue = currentMinimum;

					chunkSkipNode->maximumValue = currentMaximum;

				}
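The min/max maintenance above can be sketched without Datum and FmgrInfo by using int64_t values and a plain comparator callback. `MinMaxStats`, `CompareInt64`, `UpdateMinMax`, and `ChunkMayContain` are hypothetical names. The last function is only an assumption about how a reader could use such statistics to skip chunks; it is not taken from this file.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Three-way comparator: negative, zero, or positive, like a btree ordering proc. */
typedef int (*CompareFn)(int64_t left, int64_t right);

typedef struct MinMaxStats
{
	bool hasMinMax;
	int64_t minimum;
	int64_t maximum;
} MinMaxStats;

int
CompareInt64(int64_t left, int64_t right)
{
	return (left > right) - (left < right);
}

/* UpdateMinMax widens the tracked range so that it includes value. */
void
UpdateMinMax(MinMaxStats *stats, int64_t value, CompareFn compare)
{
	assert(stats != NULL);

	/* without a comparison function, no statistics are kept */
	if (compare == NULL)
	{
		return;
	}

	if (!stats->hasMinMax)
	{
		stats->minimum = value;
		stats->maximum = value;
		stats->hasMinMax = true;
		return;
	}

	if (compare(value, stats->minimum) < 0)
	{
		stats->minimum = value;
	}
	if (compare(value, stats->maximum) > 0)
	{
		stats->maximum = value;
	}
}

/* Returns false only when the statistics prove that value cannot be present. */
bool
ChunkMayContain(const MinMaxStats *stats, int64_t value, CompareFn compare)
{
	assert(stats != NULL);

	if (!stats->hasMinMax || compare == NULL)
	{
		return true;
	}

	return compare(value, stats->minimum) >= 0 &&
		   compare(value, stats->maximum) <= 0;
}
```

The real function additionally copies by-reference values with DatumCopy, because the incoming datum may not outlive the call. The int64_t stand-in avoids that concern.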

				/* Creates a copy of the given datum. */

				static Datum

				DatumCopy(Datum datum, bool datumTypeByValue, int datumTypeLength)

				{

					Datum datumCopy = 0;

					if (datumTypeByValue)

					{

						datumCopy = datum;

					}

					else

					{

						uint32 datumLength = att_addlength_datum(0, datumTypeLength, datum);

						char *datumData = palloc0(datumLength);

						/*

						 * We use IGNORE-BANNED here since we don't want to limit datum size to

						 * RSIZE_MAX unnecessarily.

						 */

						memcpy(datumData, DatumGetPointer(datum), datumLength); /* IGNORE-BANNED */

						datumCopy = PointerGetDatum(datumData);

					}

					return datumCopy;

				}

				/*

				 * CopyStringInfo creates a deep copy of given source string allocating only needed

				 * amount of memory.

				 */

				static StringInfo

				CopyStringInfo(StringInfo sourceString)

				{

					StringInfo targetString = palloc0(sizeof(StringInfoData));

					if (sourceString->len > 0)

					{

						targetString->data = palloc0(sourceString->len);

						targetString->len = sourceString->len;

						targetString->maxlen = sourceString->len;

						/*

						 * We use IGNORE-BANNED here since we don't want to limit string

						 * buffer size to RSIZE_MAX unnecessarily.

						 */

						memcpy(targetString->data, sourceString->data, sourceString->len); /* IGNORE-BANNED */

					}

					return targetString;

				}

				bool

				ContainsPendingWrites(ColumnarWriteState *state)

				{

					return state->stripeBuffers != NULL && state->stripeBuffers->rowCount != 0;

				}

Compare commits

6698 Commits v5.2.0 ... main

7 .codeclimate.yml Normal file
40 .codecov.yml Normal file
33 .devcontainer/.gdbinit Normal file
1 .devcontainer/.gitignore vendored Normal file
7 .devcontainer/.psqlrc Normal file
12 .devcontainer/.vscode/Pipfile vendored Normal file
28 .devcontainer/.vscode/Pipfile.lock generated vendored Normal file
84 .devcontainer/.vscode/generate_c_cpp_properties-json.py vendored Executable file
40 .devcontainer/.vscode/launch.json vendored Normal file
222 .devcontainer/Dockerfile Normal file
11 .devcontainer/Makefile Normal file
37 .devcontainer/devcontainer.json Normal file
15 .devcontainer/pgenv/config/default.conf Normal file
9 .devcontainer/requirements.txt Normal file
28 .devcontainer/src/test/regress/Pipfile Normal file
1041 .devcontainer/src/test/regress/Pipfile.lock generated Normal file
28 .editorconfig Normal file
7 .flake8 Normal file
21 .gitattributes vendored
23 .github/actions/parallelization/action.yml vendored Normal file
38 .github/actions/save_logs_and_results/action.yml vendored Normal file
35 .github/actions/setup_extension/action.yml vendored Normal file
27 .github/actions/upload_coverage/action.yml vendored Normal file
3 .github/packaging/packaging_ignore.yml vendored Normal file
51 .github/packaging/validate_build_output.sh vendored Executable file
1 .github/pull_request_template.md vendored Normal file
546 .github/workflows/build_and_test.yml vendored Normal file
79 .github/workflows/codeql.yml vendored Normal file
54 .github/workflows/devcontainer.yml vendored Normal file
79 .github/workflows/flaky_test_debugging.yml vendored Normal file
177 .github/workflows/packaging-test-pipelines.yml vendored Normal file
22 .gitignore vendored
1 .ignore Normal file
19 .travis.yml
3648 CHANGELOG.md
9 CODE_OF_CONDUCT.md Normal file
215 CONTRIBUTING.md
43 DEVCONTAINER.md Normal file
2 LICENSE
55 Makefile
36 Makefile.global.in
99 NOTICE Normal file
596 README.md
41 SECURITY.md Normal file
160 STYLEGUIDE.md Normal file
2 aclocal.m4 vendored Normal file
2 autogen.sh
47 cgmanifest.json Normal file
402 ci/README.md Normal file
56 ci/banned.h.sh Executable file
44 ci/build-citus.sh Executable file
29 ci/check_all_ci_scripts_are_run.sh Executable file
24 ci/check_all_tests_are_run.sh Executable file
25 ci/check_gucs_are_alphabetically_sorted.sh Executable file
33 ci/check_migration_files.sh Executable file
20 ci/check_sql_snapshots.sh Executable file
32 ci/ci_helpers.sh Normal file
32 ci/disallow_c_comments_in_migrations.sh Executable file
12 ci/disallow_hash_comments_in_spec_files.sh Executable file
19 ci/disallow_long_changelog_entries.sh Executable file
22 ci/editorconfig.sh Executable file
19 ci/fix_gitignore.sh Executable file
22 ci/fix_style.sh Executable file
157 ci/include_grouping.py Executable file
10 ci/normalize_expected.sh Executable file
25 ci/print_stack_trace.sh Executable file
34 ci/remove_useless_declarations.sh Executable file
12 ci/sort_and_group_includes.sh Executable file
1460 config/config.guess vendored Normal file
151 config/general.m4 Normal file
2019 configure vendored
311 configure.ac Normal file
117 configure.in
BIN github-banner.png
BIN images/2pc-recovery.png Executable file
BIN images/citus-architecture.png Executable file
BIN images/citus-readme-banner.png Executable file
BIN images/citus-scale-out.png Executable file


BIN images/coordinator_delegates_stored_procedure.png Normal file