fixed-point iteration support #603

carljm · 2024-10-23T21:23:12Z

This PR removes the existing unwind-based cycle fallback support (a plus for WASM compatibility), and replaces it with support for fixpoint iteration of cycles.

To opt in to fixpoint iteration, provide two additional arguments to salsa::tracked on the definition of a tracked function: cycle_initial and cycle_fn. The former is a function which should provide a provisional starting value for fixpoint iteration on this query, and the latter is a function which has the opportunity, after each iteration that failed to converge, to decide whether to continue iterating or fallback to some fixed value. See the added tests and docs for details.

Usability points that should be covered in the documentation:

With the old cycle fallback, it was sufficient to avoid panic for at least one query in a cycle to define a cycle fallback. With fixpoint iteration, to avoid cycle panics you must define cycle_fn and cycle_initial on every query that might end up as the "head" of a cycle (that is, queried for its value while it is already executing.)
It is entirely possible to define cycle_fn and cycle_initial so as to cause iteration to diverge and never terminate; it's up to the user to avoid this. Techniques to avoid this include a) ensuring that cycles will converge, by defining the initial value and the queries themselves monotonically (for example, in a type-inference scenario, the initial value is the bottom, or empty, type, and types will only widen, never narrow, as the cycle iterates -- thus the cycle must eventually converge to the top type, if nowhere else), and/or b) with a larger hammer, by ensuring that cycle_fn respects the iteration count it is given, and always halts iteration with a fallback value if the count reaches some "too large" value.
It's also entirely possible to define cycle_fn and cycle_initial such that memoized results can vary depending only on the order in which queries occur. Avoid this by minimizing the number of tracked functions that support fixpoint iteration and ensuring initial values and fallback values are consistent among tracked functions that may occur in a cycle together.
You can call Salsa queries from within your cycle_fn and cycle_initial queries, but if the query you call re-enters the same cycle, it could lead to unexpected behavior. Take care what queries you call inside cycle recovery functions.

netlify · 2024-10-23T21:23:37Z

✅ Deploy Preview for salsa-rs canceled.

Name	Link
🔨 Latest commit	`f421b94`
🔍 Latest deploy log	https://app.netlify.com/sites/salsa-rs/deploys/67cf5e620ac0b50008ca5b5b

codspeed-hq · 2024-10-23T21:25:09Z

CodSpeed Performance Report

Merging #603 will degrade performances by 6.58%

_{Comparing carljm:fixpoint (f421b94) with master (9ebc8a3)}

Summary

❌ 6 regressions
✅ 5 untouched benchmarks
🆕 1 new benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`BASE`	`HEAD`	Change
❌	`accumulator`	3.2 ms	3.4 ms	-6.58%
❌	`amortized[Input]`	3.4 µs	3.6 µs	-5.98%
❌	`amortized[InternedInput]`	3.4 µs	3.5 µs	-4.43%
❌	`amortized[SupertypeInput]`	4.2 µs	4.3 µs	-3.76%
❌	`new[InternedInput]`	5.1 µs	5.3 µs	-4.07%
🆕	`converge_diverge`	N/A	169.7 µs	N/A
❌	`many_tracked_structs`	36.4 µs	38 µs	-3.96%

nikomatsakis · 2024-10-24T01:50:42Z

This is very cool! (Admittedly, I say this pre-review.)

MichaReiser

This looks great. I left a few comments where I struggled understanding the implementation or had smaller suggestions.

components/salsa-macro-rules/src/setup_tracked_fn.rs

src/function/execute.rs

src/zalsa_local.rs

tests/cycle_fixpoint.rs

carljm · 2024-10-29T18:15:08Z

In writing more comprehensive tests for this, I realized that it needs some changes to correctly handle multi-revision scenarios; taking it to Draft mode until I get that fixed.

carljm · 2024-10-30T00:37:43Z

Ok, multiple-revision cases are now fixed, and we now populate the initial provisional value only lazily, in case a cycle is actually encountered, which should reduce the number of memos created by quite a lot.

Also added a bunch of tests, including multiple-revision cases and one test involving durability. Still need to add cross-thread cycle tests.

tests/cycle/main.rs

src/function/execute.rs

MichaReiser

The lazy creation of the initial value is a neat improvement. Nice for taking the time to work on it !

src/function/execute.rs

MichaReiser · 2024-10-30T07:35:48Z

The benchmarks show a 4-5% regression. It seems that we're now resizing some hash maps more often. Are we reporting more tracked reads than before? Could you take a look what's causing it?

carljm · 2024-11-01T00:42:07Z

Initial experiments using this in the red-knot type checker are promising: astral-sh/ruff#14029

Not yet using it for loopy control flow in that PR, but there are cycles in the core type definitions of Python builtins and standard library, which we previously had a hacky fallback in place for using Salsa's previous cycle fallback support. Moving over to fixpoint iteration just worked, and fixed the type of a builtin impacted by the cycle.

On the downside, it is a performance regression. Need to do more work there.

src/function/maybe_changed_after.rs

src/ingredient.rs

src/cycle.rs

src/table/sync.rs

src/function/execute.rs

Veykril · 2024-12-28T16:23:06Z

src/function/maybe_changed_after.rs

+/// Result of memo validation.
+pub enum VerifyResult {
+    /// Memo has changed and needs to be recomputed.
+    Changed,
+
+    /// Memo remains valid.
+    ///
+    /// Database keys in the hashset represent cycle heads encountered in validation; don't mark
+    /// memos verified until we've iterated the full cycle to ensure no inputs changed.
+    Unchanged(FxHashSet<DatabaseKeyIndex>),
+}


This change (bool being replaced by VerifyResult) also seems like it will make non cycle queries pay a hefty perf price. That type has a destructor and is 40 bytes where as the previous return value could fit into a register, this can't.

I'll focus first on getting consistently right behavior (including across threads) and then revisit this and other possible ways to claw back perf. Off the top of my head I'm not sure how to improve this, will need to think about it.

Yes of course, I am mainly noting down what I saw, no need to tackle perf nits/reviews until things are working properly. (also happy to help out with perf work on this PR once its at that stage)

With the option boxing we've now significantly reduced the size of VerifyResult, at least.

Veykril · 2024-12-29T10:00:00Z

On a related perf note, would it make sense to move the dataflow test (or a maybe slightly more involved version) into a benchmark?

nikomatsakis · 2025-01-27T20:56:34Z

@carljm -- to confirm, not yet ready for my review, right? (or did I forget something)

carljm · 2025-01-27T21:47:22Z

@nikomatsakis I commented last week at https://salsa.zulipchat.com/#narrow/channel/333573-Contributing-to-Salsa/topic/fixed.20point.20cycles that this was ready for review.

Two things have come up since:

A test failure here in CI (on the cycle_t1_a_t2_b test) that I have never seen locally and so far have not been able to reproduce; it suggests some kind of race condition that I need to track down and fix.
The PR that landed on Friday to fix accumulators (Fix accumulated values for backdated queries #662) conflicts badly with this PR, and I haven't completed that rebase yet (thus all the conflicts shown here.)

I think a review from you of what's here could still be useful (you might have an idea where the race condition is that is causing the test failure?) but it's also fine if you want to wait until I sort the above items.

carljm · 2025-01-27T23:47:40Z

I may try and see if https://github.com/awslabs/shuttle can track down the race condition. It's a bit invasive to use it, since it requires that all our uses from std::thread and std::sync throughout all of Salsa go through a shim module that can cfg redirect them to shuttle in tests.

nikomatsakis · 2025-01-28T01:28:41Z

Ok. I will try to find time to review just in case. Im busy this week with Things but will do my best.

carljm · 2025-01-28T01:50:58Z

Ok, problem (2) is addressed; I've merged in latest main branch and resolved all conflicts. Now back to problem (1).

nikomatsakis

This looks pretty good. I think we gotta get it landed and start testing it.

src/function.rs

src/function/fetch.rs

carljm · 2025-02-20T20:30:45Z

Oops, looks like we're hitting some GitHub CI job limits.

davidbarsky · 2025-02-20T20:35:47Z

No objections to the design or approach of this, but I was wondering if we could get rust-lang/rust-analyzer#18964 landed first. For the corresponding PR to land in rust-analyzer, we'll need an Salsa 1.0 alpha-1 release out with—hopefully—the ability dump the contents of tracked functions (as described in here). Those two together should should take roughly a a week or so of work, at which point salsa 1.0 alpha-2 could contain this change?

carljm · 2025-02-20T20:53:01Z

I'm not in a big rush to merge this on my side, though the merge conflicts do get exhausting to deal with. I'll add some more tests and some documentation, and work on using it in red-knot in the mean time. We may learn some new things from that.

davidbarsky · 2025-02-20T20:59:42Z

I'm not in a big rush to merge this on my side, though the merge conflicts do get exhausting to deal with. I'll add some more tests and some documentation, and work on using it in red-knot in the mean time. We may learn some new things from that.

Yeah, I'm familiar with the rebase pains. Hopefully you won't have too many conflicts with the remaining, in-flight changes, but in the event you do, I want to set a tolerably short upper bound for my "don't land!" request.

carljm · 2025-02-22T01:53:38Z

The current code in this branch requires Rust 1.83, for Default implementations of hashset iterators. Are we OK with this, or do we need to stay compatible with older Rust versions? If so, how old?

Cargo.toml still says rust-version = 1.76.

Main branch doesn't compile with 1.76, it only compiles with 1.80 or newer.

The tests only actually pass with 1.84 or newer, because we have snapshotted compiler output in the compile_fail tests.

So we clearly don't have any CI set up to enforce our declared MSRV.

I put up #712 to fix this.

carljm · 2025-02-22T18:26:50Z

Ok, I added the necessary boilerplate here to maintain compatibility with Rust 1.80 for now.

Veykril · 2025-02-23T11:28:24Z

Stumbled upon this issue #193, might be good to check whether that is still relevant after this PR rewriting most of the cycle handling

Veykril · 2025-02-25T13:11:11Z

Another related issue #9

carljm · 2025-03-04T02:09:11Z

I added some more tests (for tracked-struct and accumulator output from queries in cycles, panics in cycle handling in a cross-thread cycle), and fixed a bug in diff_outputs for outputs in cycles.

I also updated the book sections on cycle handling. I'm afraid I didn't reproduce the detail of the previous "plumbing" section on the old cycle handling, but I did add a doc comment about the new cycle handling in cycle.rs.

I now consider this branch "done" and ready for merge, pending any further review comments (cc @nikomatsakis, @MichaReiser, @Veykril). I'll wait to merge it until @davidbarsky and @Veykril give the green light from the rust-analyzer side.

davidbarsky · 2025-03-10T21:07:43Z

Feel free to land this! We—rust-analyzer—got the changes we needed in https://github.com/salsa-rs/salsa/releases/tag/salsa-v0.19.0 and are happily giving up the "land stuff" lock :)

carljm · 2025-03-10T21:50:19Z

Squashed to a single commit in preparation for landing; the commit history here has a lot of junk and false starts, as well as many merges from main branch.

carljm requested review from nikomatsakis, MichaReiser and davidbarsky October 23, 2024 21:23

MichaReiser reviewed Oct 24, 2024

View reviewed changes

carljm force-pushed the fixpoint branch from f700c39 to f411766 Compare October 24, 2024 20:06

salsa-rs deleted a comment from MichaReiser Oct 24, 2024

carljm marked this pull request as draft October 29, 2024 18:15

carljm marked this pull request as ready for review October 30, 2024 00:37

carljm commented Oct 30, 2024

View reviewed changes

tests/cycle/main.rs Outdated Show resolved Hide resolved

MichaReiser reviewed Oct 30, 2024

View reviewed changes

src/function/execute.rs Outdated Show resolved Hide resolved

MichaReiser reviewed Oct 30, 2024

View reviewed changes

src/function/execute.rs Outdated Show resolved Hide resolved

src/function/execute.rs Outdated Show resolved Hide resolved

src/function/execute.rs Outdated Show resolved Hide resolved

carljm force-pushed the fixpoint branch from c1bbdcf to f44f2f7 Compare November 14, 2024 23:53

MichaReiser reviewed Nov 15, 2024

View reviewed changes

src/function/maybe_changed_after.rs Outdated Show resolved Hide resolved

MichaReiser reviewed Nov 15, 2024

View reviewed changes

src/function/maybe_changed_after.rs Outdated Show resolved Hide resolved

MichaReiser reviewed Nov 17, 2024

View reviewed changes

src/ingredient.rs Show resolved Hide resolved

carljm force-pushed the fixpoint branch from 8325b62 to 5202579 Compare December 18, 2024 02:44

Veykril reviewed Dec 28, 2024

View reviewed changes

nikomatsakis approved these changes Feb 20, 2025

View reviewed changes

src/function.rs Show resolved Hide resolved

src/function.rs Show resolved Hide resolved

src/function/fetch.rs Show resolved Hide resolved

nikomatsakis mentioned this pull request Feb 20, 2025

Panic with predictable value if there is no initial fn in cycle #704

Open

carljm mentioned this pull request Feb 22, 2025

update MSRV to 1.80 and test it in CI #712

Merged

This was referenced Feb 23, 2025

Do not pass ownership of the QueryStack in Runtime::block_on_or_unwind #626

Closed

Pool ActiveQuerys in the query stack #629

Draft

carljm mentioned this pull request Feb 24, 2025

explore alternative impl for cycle handling #193

Open

carljm linked an issue Feb 26, 2025 that may be closed by this pull request

Fixed-point computations and other desirable cycles #9

Closed

carljm changed the title ~~[RFC] fixpoint iteration support~~ fixed-point iteration support Feb 26, 2025

carljm force-pushed the fixpoint branch 2 times, most recently from 88c686d to 5b44fbe Compare March 4, 2025 01:00

carljm mentioned this pull request Mar 5, 2025

Add regression test for salsa-rs#455 #740

Closed

davidbarsky mentioned this pull request Mar 5, 2025

internal: port rust-analyzer to new Salsa rust-lang/rust-analyzer#18964

Merged

rewrite cycle handling to support fixed-point iteration

f421b94

carljm force-pushed the fixpoint branch from 65162bc to f421b94 Compare March 10, 2025 21:49

carljm added this pull request to the merge queue Mar 10, 2025

Merged via the queue into salsa-rs:master with commit 095d8b2 Mar 10, 2025
12 of 13 checks passed

carljm deleted the fixpoint branch March 10, 2025 22:52

github-actions bot mentioned this pull request Mar 10, 2025

chore: release v0.20.0 #753

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed-point iteration support #603

fixed-point iteration support #603

carljm commented Oct 23, 2024 •

edited

Loading

netlify bot commented Oct 23, 2024 •

edited

Loading

codspeed-hq bot commented Oct 23, 2024 •

edited

Loading

nikomatsakis commented Oct 24, 2024

MichaReiser left a comment

carljm commented Oct 29, 2024

carljm commented Oct 30, 2024

MichaReiser left a comment

MichaReiser commented Oct 30, 2024 •

edited

Loading

carljm commented Nov 1, 2024

Veykril Dec 28, 2024

carljm Dec 28, 2024

Veykril Dec 29, 2024 •

edited

Loading

carljm Feb 20, 2025

Veykril commented Dec 29, 2024

nikomatsakis commented Jan 27, 2025

carljm commented Jan 27, 2025 •

edited

Loading

carljm commented Jan 27, 2025

nikomatsakis commented Jan 28, 2025

carljm commented Jan 28, 2025 •

edited

Loading

nikomatsakis left a comment

carljm commented Feb 20, 2025

davidbarsky commented Feb 20, 2025

carljm commented Feb 20, 2025

davidbarsky commented Feb 20, 2025 •

edited

Loading

carljm commented Feb 22, 2025 •

edited

Loading

carljm commented Feb 22, 2025

Veykril commented Feb 23, 2025

Veykril commented Feb 25, 2025

carljm commented Mar 4, 2025

davidbarsky commented Mar 10, 2025 •

edited

Loading

carljm commented Mar 10, 2025

fixed-point iteration support #603

fixed-point iteration support #603

Conversation

carljm commented Oct 23, 2024 • edited Loading

netlify bot commented Oct 23, 2024 • edited Loading

✅ Deploy Preview for salsa-rs canceled.

codspeed-hq bot commented Oct 23, 2024 • edited Loading

CodSpeed Performance Report

Merging #603 will degrade performances by 6.58%

Summary

Benchmarks breakdown

nikomatsakis commented Oct 24, 2024

MichaReiser left a comment

Choose a reason for hiding this comment

carljm commented Oct 29, 2024

carljm commented Oct 30, 2024

MichaReiser left a comment

Choose a reason for hiding this comment

MichaReiser commented Oct 30, 2024 • edited Loading

carljm commented Nov 1, 2024

Veykril Dec 28, 2024

Choose a reason for hiding this comment

carljm Dec 28, 2024

Choose a reason for hiding this comment

Veykril Dec 29, 2024 • edited Loading

Choose a reason for hiding this comment

carljm Feb 20, 2025

Choose a reason for hiding this comment

Veykril commented Dec 29, 2024

nikomatsakis commented Jan 27, 2025

carljm commented Jan 27, 2025 • edited Loading

carljm commented Jan 27, 2025

nikomatsakis commented Jan 28, 2025

carljm commented Jan 28, 2025 • edited Loading

nikomatsakis left a comment

Choose a reason for hiding this comment

carljm commented Feb 20, 2025

davidbarsky commented Feb 20, 2025

carljm commented Feb 20, 2025

davidbarsky commented Feb 20, 2025 • edited Loading

carljm commented Feb 22, 2025 • edited Loading

carljm commented Feb 22, 2025

Veykril commented Feb 23, 2025

Veykril commented Feb 25, 2025

carljm commented Mar 4, 2025

davidbarsky commented Mar 10, 2025 • edited Loading

carljm commented Mar 10, 2025

carljm commented Oct 23, 2024 •

edited

Loading

netlify bot commented Oct 23, 2024 •

edited

Loading

codspeed-hq bot commented Oct 23, 2024 •

edited

Loading

MichaReiser commented Oct 30, 2024 •

edited

Loading

Veykril Dec 29, 2024 •

edited

Loading

carljm commented Jan 27, 2025 •

edited

Loading

carljm commented Jan 28, 2025 •

edited

Loading

davidbarsky commented Feb 20, 2025 •

edited

Loading

carljm commented Feb 22, 2025 •

edited

Loading

davidbarsky commented Mar 10, 2025 •

edited

Loading