perf(orchestrator): add memfile dedup density threshold#2862
perf(orchestrator): add memfile dedup density threshold#2862ValentaTomas wants to merge 4 commits into
Conversation
PR SummaryMedium Risk Overview
Reviewed by Cursor Bugbot for commit 7eed0aa. Bugbot is set up for automated code reviews on this repo. Configure here. |
❌ 5 Tests Failed:
View the full list of 5 ❄️ flaky test(s)
To view more test analytics, go to the Test Analytics Dashboard |
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 6d2a1dec8f
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
6d2a1de to
dc32af5
Compare
dc32af5 to
45404d1
Compare
levb
left a comment
There was a problem hiding this comment.
(still reviewing/understanding)
This approach scans in 1 direction and "locally" - the decision on whether to do a parent frame fetch (or store the page locally) is made solely on a single child's relationship to it. If there are parent frames with many children references, they might still be tossed out by the single-child based criteria.
For now, I am asking to move deduplication into its own file, with a more apparent hook in chunk.go for deduplication. This would make it a little easier to compare strategies if a different one appears in the future.
dobrac
left a comment
There was a problem hiding this comment.
The code flow is really complex. I would prefer simplifying (not a blocker though).
Please add more unit tests for the added functionality.
eb72ac0 to
59c2c0d
Compare
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 59c2c0d958
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
59c2c0d to
e4f45c3
Compare
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: e4f45c3798
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
Adds disabled-by-default controls for memfile dedup fetch fragmentation. When enabled, dedup can promote the cheapest parent-hit pages into the current diff to keep distinct non-empty backing fetch windows under a configured budget, without ever storing empty pages.
Config keys:
maxFetchWindowsPerBlock,maxPromotedParentPagesPerBlock, andfetchRunWindowPages(all0= disabled/default window).