[red-knot] Property test improvements #15358

sharkdp · 2025-01-08T20:17:05Z

Summary

- Add a workflow to run property tests on a daily basis (based on `daily_fuzz.yaml`)
- Mark `assignable_to_is_reflexive` as flaky (related to #14899, "[red-knot] (Gradual) intersection types are not handled in assignability")
- Add new (failing) `intersection_assignable_to_both` test (also related to #14899)

Test Plan

export QUICKCHECK_TESTS=100000
while cargo test --release -p red_knot_python_semantic -- \
  --ignored types::property_tests::stable; do :; done

sharkdp · 2025-01-08T20:19:03Z

.github/workflows/daily_property_tests.yaml

+            cargo test --locked --release --package red_knot_python_semantic -- --ignored types::property_tests::stable
+          done
+
+  create-issue-on-failure:


Like the rest of this workflow, this is copied over from daily_fuzz.yaml. It seems like a reasonable idea (how would we notice otherwise?), but I haven't actually seen any "Daily parser fuzz failed on …" in the issue tracker. Did the daily_fuzz workflow never run, or is this issue-creation disabled somehow?

carljm

Looks good to me!

I have no idea how to verify that the cron thing actually works, other than to land this and wait and see? (I don't even know where to go look tomorrow to see if it ran, assuming it doesn't fail and notify us.)

.github/workflows/daily_property_tests.yaml

sharkdp · 2025-01-08T20:26:12Z

> I have no idea how to verify that the cron thing actually works, other than to land this and wait and see?

The fuzzer tests run daily at 00:00 (UTC?), so I decided to run the property tests daily at 12:00, to distribute a bit among timezones, and so I will be awake when they run 😄

https://crontab.guru/#0_12___*
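
For reference, GitHub Actions evaluates `schedule` cron expressions in UTC, so a `0 12 * * *` entry fires once a day at 12:00 UTC (actual start times can lag when runners are busy). A minimal sketch of that arithmetic, where the helper name `nextDailyRun` is hypothetical and not part of the workflow:

```javascript
// Hypothetical helper: next firing time of a daily cron "0 <hourUtc> * * *",
// evaluated in UTC. Not part of the workflow; for illustration only.
function nextDailyRun(now, hourUtc = 12) {
  // Candidate: today at <hourUtc>:00:00.000 UTC.
  const next = new Date(Date.UTC(
    now.getUTCFullYear(),
    now.getUTCMonth(),
    now.getUTCDate(),
    hourUtc, 0, 0, 0,
  ));
  // If that moment is not strictly in the future, move to tomorrow.
  if (next.getTime() <= now.getTime()) {
    next.setUTCDate(next.getUTCDate() + 1);
  }
  return next;
}
```

Calling `nextDailyRun(new Date())` gives the next expected trigger, which can then be compared against the runs shown on the workflow's Actions page.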

> I don't even know where to go look tomorrow to see if it ran

Here: https://github.com/astral-sh/ruff/actions/workflows/daily_property_tests.yaml

Sample run is here: https://github.com/astral-sh/ruff/actions/runs/12678401447/job/35335911931

github-actions · 2025-01-08T20:35:01Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

AlexWaygood · 2025-01-08T21:19:26Z

@sharkdp

> Like the rest of this workflow, this is copied over from daily_fuzz.yaml. It seems like a reasonable idea (how would we notice otherwise?), but I haven't actually seen any "Daily parser fuzz failed on …" in the issue tracker. Did the daily_fuzz workflow never run, or is this issue-creation disabled somehow?

@carljm

> I have no idea how to verify that the cron thing actually works, other than to land this and wait and see? (I don't even know where to go look tomorrow to see if it ran, assuming it doesn't fail and notify us.)

The py-fuzzer script found a number of bugs in our parser immediately after the big parser rewrite in early 2024, but to my knowledge, the daily cron job that we set up to fuzz the parser shortly after the parser rewrite has never failed. @dhruvmanila just wrote a really good parser! 😃 (And also, we haven't made any major updates since the big rewrite.)

However, I'm pretty confident that the issue-creation logic works, because I've used it to great success in multiple other repos. For examples, see:

- Typeshed:
  - Example issue: "Daily tests failed on Sun Dec 29 2024" (python/typeshed#13330)
  - Workflow: https://github.com/python/typeshed/blob/main/.github/workflows/daily.yml
- typing_extensions:
  - Example issue: "Third-party tests failed on Thu Dec 05 2024" (python/typing_extensions#513)
  - Workflow: https://github.com/python/typing_extensions/blob/main/.github/workflows/third_party.yml
- typeshed-stats:
  - Example issue: "Daily test failed on Sat Sep 14 2024" (AlexWaygood/typeshed-stats#255)
  - Workflow: https://github.com/AlexWaygood/typeshed-stats/blob/main/.github/workflows/test.yml

AlexWaygood · 2025-01-08T21:26:36Z

.github/workflows/daily_property_tests.yaml

+              repo: "ruff",
+              title: `Daily property test run failed on ${new Date().toDateString()}`,
+              body: "Runs listed here: https://github.com/astral-sh/ruff/actions/workflows/daily_property_tests.yaml",
+              labels: ["bug", "red_knot", "testing"],


This should be

Suggested change

-              labels: ["bug", "red_knot", "testing"],
+              labels: ["bug", "red-knot", "testing"],

(https://github.com/astral-sh/ruff/issues?q=is%3Aissue%20state%3Aopen%20label%3Ared-knot)

Thank you: #15361

AlexWaygood · 2025-01-08T21:28:36Z

.github/workflows/daily_property_tests.yaml

+              owner: "astral-sh",
+              repo: "ruff",
+              title: `Daily property test run failed on ${new Date().toDateString()}`,
+              body: "Runs listed here: https://github.com/astral-sh/ruff/actions/workflows/daily_property_tests.yaml",


A typeshed contributor recently made a great improvement to the typeshed version of this workflow so that the issue text links directly to the exact run that failed rather than a list of all the runs that have ever happened: https://github.com/python/typeshed/pull/13210/files

We could probably make the same improvement to this workflow and daily_fuzz.yaml!
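
A rough sketch of what that could look like for this workflow, not necessarily what the typeshed PR does. `ctx` stands in for the `context` object that `actions/github-script` passes to the script; the field names `serverUrl`, `repo.owner`, `repo.repo` and `runId` are assumptions about that object, and `buildFailureIssue` is a hypothetical helper name:

```javascript
// Sketch: build the issue payload so the body links to the exact failing run.
// `ctx` is a stand-in for the github-script `context` object (assumed fields:
// serverUrl, repo.owner, repo.repo, runId). `buildFailureIssue` is hypothetical.
function buildFailureIssue(ctx, now = new Date()) {
  const { owner, repo } = ctx.repo;
  // Direct link to this workflow run instead of the list of all runs.
  const runUrl = `${ctx.serverUrl}/${owner}/${repo}/actions/runs/${ctx.runId}`;
  return {
    owner,
    repo,
    title: `Daily property test run failed on ${now.toDateString()}`,
    body: `Run listed here: ${runUrl}`,
    labels: ["bug", "red-knot", "testing"],
  };
}
```

Inside the workflow step, the result would presumably be passed along as `await github.rest.issues.create(buildFailureIssue(context))`. Keeping the payload in a pure function makes it possible to check the generated URL without triggering a run.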

sharkdp added the testing (Related to testing Ruff itself) and red-knot (Multi-file analysis & type inference) labels Jan 8, 2025

sharkdp requested review from carljm, MichaReiser and AlexWaygood as code owners January 8, 2025 20:17

sharkdp force-pushed the david/property-test-updates branch from 2d06d95 to a3f4526 on January 8, 2025 20:22

carljm approved these changes Jan 8, 2025

[red-knot] Property test improvements

3c1abce

- Mark `assignable_to_is_reflexive` as flaky
- Add new (failing) `intersection_assignable_to_both` test
- Add a workflow to run property tests on a daily basis

sharkdp force-pushed the david/property-test-updates branch from 063b602 to 3c1abce on January 8, 2025 21:02

sharkdp merged commit 4fd82d5 into main Jan 8, 2025
23 checks passed

sharkdp deleted the david/property-test-updates branch January 8, 2025 21:24

AlexWaygood reviewed Jan 8, 2025
