Add a section on PR descriptions to flaky test docs (#6446)

Good PR descriptions for flaky tests are quite helpful when reviewing. Although obviously no PR description is the same, there's a few common pieces of information that are useful for all PRs that fix flaky tests.
2022-10-21 16:52:31 +02:00 · 2022-10-21 16:52:31 +02:00 · 7f05ad033a
parent 162c8a5160
commit 7f05ad033a
1 changed files with 34 additions and 0 deletions
--- a/src/test/regress/flaky_tests.md
+++ b/src/test/regress/flaky_tests.md
@ -321,3 +321,37 @@ https://github.com/citusdata/citus/blob/main/src/test/regress/bin/normalize.sed
 Sometimes removing the test is the only way to make our test suite less flaky.
 Of course this is a last resort, but sometimes it's what we want. If running the
 test does more bad than good, removing will be a net positive.
 ## PR descriptions of flaky tests
 Even if a fix for a flaky test is very simple without a clear description it can
 be hard for a reviewer (or a future git spelunker) to understand its purpose.
 A good PR description of a flaky test includes the following things:
 1. Name of the test that was flaky
 2. The part of the regression.diffs file that caused the test to fail randomly
 3. A link to a CI run that failed because of this flaky test
 4. Explanation of why this output was non-deterministic (was it a bug in Citus?)
 5. Explanation of how this change makes the test deterministic
 An example of such a PR description is this one from [#6272][6272]:
 [6272]: https://github.com/citusdata/citus/pull/6272
 > Sometimes in CI our multi_utilities test fails like this:
 > ```diff
 >  VACUUM (INDEX_CLEANUP ON, PARALLEL 1) local_vacuum_table;
 >  SELECT CASE WHEN s BETWEEN 20000000 AND 25000000 THEN 22500000 ELSE s END size
 >  FROM pg_total_relation_size('local_vacuum_table') s ;
 >     size
 >  ----------
 > - 22500000
 > + 39518208
 >  (1 row)
 > ```
 > Source: https://app.circleci.com/pipelines/github/citusdata/citus/26641/workflows/5caea99c-9f58-4baa-839a-805aea714628/jobs/762870
 >
 > Apparently VACUUM is not as reliable in cleaning up as we thought. This
 > PR increases the range of allowed values to make the test reliable. Important
 > to note is that the range is still completely outside of the allowed range of
 > the initial size. So we know for sure that some data was cleaned up.