gh-127958: Trace from RESUME in the JIT by Fidget-Spinner · Pull Request #145905 · python/cpython

Fidget-Spinner · 2026-03-13T09:46:30Z

This adds a CACHE entry to RESUME and a RESUME_CHECK_JIT opcode.

Performance numbers are rather promising:

1% geometric mean speedup on fastmark (Sam's fast pyperformance subset) on my system.
0.5% speedup on x86_64 Linux, 2.5% speedup on macOS AArch64 on the Meta runners.
https://github.com/facebookexperimental/free-threading-benchmarking/tree/main/results/bm-20260315-3.15.0a7%2B-2e9b980-JIT

Issue: Trace starting from RESUME in the JIT #127958

This reverts commit 6b65f76.

markshannon

The comprehensions benchmark is so much slower due to RESUME_CHECK_JIT being that much slower than RESUME_CHECK last I checked.

I think that will need to be addressed before we can merge this. I've one suggestion that might help.

Lib/test/test_capi/test_opt.py

Lib/test/test_compile.py

Modules/_testinternalcapi.c

Python/bytecodes.c

Python/ceval.c

Python/optimizer.c

Python/specialize.c

Python/bytecodes.c

bedevere-app · 2026-03-13T17:52:09Z

When you're done making the requested changes, leave the comment: I have made the requested changes; please review again.

Fidget-Spinner · 2026-03-15T09:55:00Z

I think that will need to be addressed before we can merge this. I've one suggestion that might help.

From my investigations, this is one of the strangest slowdowns I have found so far:
RESUME_CHECK_JIT without the _JIT_ (ie, exact same code as RESUME_CHECK) is still 10-20% slower on bm_comprehensions.

Leaving RESUME_CHECK_JIT in the interpreter loop but only allowing specialization to RESUME_CHECK restores the original performance.

I was bout to think this is some compiler/CPU magic, but it does show up on AArch64 and on GCC/Clang as well. Luckily, I remembered RESUME is part of generator static magic, and I finally fixed the slowdown in this commit 5ccf8e9

Fidget-Spinner · 2026-03-15T10:06:15Z

I have made the requested changes; please review again.

bedevere-app · 2026-03-15T10:06:22Z

Thanks for making the requested changes!

@markshannon: please review the changes made to this pull request.

markshannon

This looks good. I've one comment on what was _QUICKEN_RESUME, but that's all.

Do you have benchmark numbers for this PR with the generator fix added?
I assume they will be better, just curious to see what they look like.

Python/bytecodes.c

markshannon · 2026-03-16T12:40:56Z

Python/ceval.c

    { .op.code = INTERPRETER_EXIT, .op.arg = 0 },  /* reached on yield */
-    { .op.code = RESUME, .op.arg = RESUME_OPARG_DEPTH1_MASK | RESUME_AT_FUNC_START }
+    { .op.code = RESUME, .op.arg = RESUME_OPARG_DEPTH1_MASK | RESUME_AT_FUNC_START },
+    { .op.code = CACHE, .op.arg = 0 } /* RESUME's CACHE */


NOTE:
This change is fine, but the CACHE isn't strictly needed as the RESUME is only a marker not to instrument the prior instructions, it will never be executed.

Python/specialize.c

Fidget-Spinner · 2026-03-16T14:06:14Z

Do you have benchmark numbers for this PR with the generator fix added? I assume they will be better, just curious to see what they look like.

I updated the PR's original description with it. The TLDR is that bm_generators is slower (expected, because we still can't optimize recursion well in the JIT, and now we're jitting more of bm_generators), but almost everything else that uses recursion is better.

macOS sees an across-the-board improvement, probably due to the good work you folks at ARM have done :).

Python/specialize.c

Fidget-Spinner · 2026-03-16T16:00:48Z

I think we only want to be jitting at the start of normal functions.

So this produced a really nice result. The slowdown on generators is now gone on my x86_64 machine, while keeping the speedup seen in logging that's also seen in the FT runners

Mean +- std dev: [gen_no_resume] 32.3 ms +- 0.2 ms -> [gen_resume] 31.5 ms +- 0.2 ms: 1.02x faster
logging_format: Mean +- std dev: [logging_no_resume] 6.58 us +- 0.10 us -> [logging_resume] 5.93 us +- 0.05 us: 1.11x faster
logging_silent: Mean +- std dev: [logging_no_resume] 81.8 ns +- 0.3 ns -> [logging_resume] 85.6 ns +- 1.1 ns: 1.05x slower
logging_simple: Mean +- std dev: [logging_no_resume] 5.86 us +- 0.06 us -> [logging_resume] 5.35 us +- 0.04 us: 1.10x faster

markshannon

Looks good.
The code is clean and the performance impact is significant.

Fidget-Spinner · 2026-03-16T16:18:25Z

Just to make sure that the latest commit indeed fixed generators, this branch without the fix sees a 30% slowdown in bm_generators.
Mean +- std dev: [gen_resume] 31.5 ms +- 0.2 ms -> [gen_resume_nofix] 41.2 ms +- 0.4 ms: 1.31x slower

So yes, let's do this.

Fidget-Spinner added 30 commits December 29, 2025 16:03

Add counter to RESUME

6c97cb6

Fix dis, implement jit, broken for async for loops

5ce4d20

Fix all remaining bugs

159601c

Up the resume value to 7918

9f28e87

remove control flow, allow RESUME_CHECK_JIT

16205d7

fix broken tests, fix EXTENDED_ARG

86f886b

Fix another bug

2304a2d

link everythihng

6b65f76

Revert "link everythihng"

d84384d

This reverts commit 6b65f76.

add back is_control_flow

e29ced3

undo changes, turn off RESUME tracing for now

6bbd9cc

temporary

a4e3e5f

reduce diff

ff0e9f0

add back _JIT (still off)

90e76f8

restore RESUME_CHECK_JIT jitting

81aa269

only link up traces when they're backwards jumps

d52a106

stop tracing when we hit an ENTER_EXECUTOR

3233423

re-enable old opt

e4e1cc0

partially disable opt

19f390a

formatting fix

be25244

fix an off-by-one

98c21be

format

dc0c71c

up the resume initial value a little

73681af

fix RESUME slowness

55cf943

fix infinite deopt involving monitoring

2e5e9e6

edit comment

96ea93a

trace over RESUME/RESUME_CHECK_JIT

5cec130

trace over everything except backwards jumps

2373aee

make RESUME tracing a single attempt

fafd6c0

fix recursive generators slowdown

564677c

markshannon requested changes Mar 13, 2026

View reviewed changes

bedevere-app bot added awaiting changes and removed awaiting core review labels Mar 13, 2026

Fidget-Spinner added 3 commits March 15, 2026 16:56

Partially address review

4a0a530

factor out tracing decision, try making _JIT faster

747d402

restore RESUME opt for generators

5ccf8e9

Merge commit '788c3291172b55efa7cf' into resume_tracing

62ea35e

bedevere-app bot added awaiting change review and removed awaiting changes labels Mar 15, 2026

bedevere-app bot requested a review from markshannon March 15, 2026 10:06

Merge remote-tracking branch 'upstream/main' into resume_tracing

2e9b980

markshannon reviewed Mar 16, 2026

View reviewed changes

Fidget-Spinner added 2 commits March 16, 2026 22:11

Rename to _QUICKEN, remove recursion limit set

0cef066

Merge remote-tracking branch 'upstream/main' into resume_tracing

c078999

Fidget-Spinner requested a review from markshannon March 16, 2026 14:18

markshannon reviewed Mar 16, 2026

View reviewed changes

Python/specialize.c Show resolved Hide resolved

Apply Mark's suggestion

672241d

markshannon approved these changes Mar 16, 2026

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting change review labels Mar 16, 2026

Fidget-Spinner merged commit 3d0824a into python:main Mar 16, 2026
77 of 79 checks passed

Fidget-Spinner deleted the resume_tracing branch March 16, 2026 16:19

bedevere-app bot removed the awaiting merge label Mar 16, 2026

Uh oh!

Conversation

Fidget-Spinner commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bedevere-app bot commented Mar 13, 2026

Uh oh!

Fidget-Spinner commented Mar 15, 2026

Uh oh!

Fidget-Spinner commented Mar 15, 2026

Uh oh!

bedevere-app bot commented Mar 15, 2026

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

markshannon Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Fidget-Spinner commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Fidget-Spinner commented Mar 16, 2026

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Fidget-Spinner commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fidget-Spinner commented Mar 13, 2026 •

edited

Loading

Fidget-Spinner commented Mar 16, 2026 •

edited

Loading