ZJIT: Add MemBase::FrameBase that deals with native SP changes (+2) #14009

XrXr · 2025-07-24T17:43:32Z

This solves an immediate problem with `def a(n1,n2,n3,n4,n5,n6,n7,n8) = [n8]` and gen_new_array(). Before this change, it miscompiles as follows:

; call rb_ary_new_capa
0x90: mov x0, #1
0x94: mov x16, #0x1278
0x98: movk x16, #0x4bc, lsl #16
0x9c: movk x16, #1, lsl #32
0xa0: blr x16
; call rb_ary_push
0xa4: mov x1, x0
0xa8: str x1, [sp, #-0x10]! ; c_push() from alloc_reg() to preserve the array past the call below
0xac: mov x0, x1            ; arg0, the array
0xb0: ldur x1, [sp]         ; arg1, n8, but we've just moved sp and now that refers to the array!
0xb4: mov x16, #0x3968
0xb8: movk x16, #0x4bc, lsl #16
0xbc: movk x16, #1, lsl #32
0xc0: blr x16
0xc4: ldr x1, [sp], #0x10

The problem here is that n8 is assigned a fixed offset from sp in codegen.rs, but the backend can move sp. To solve this, this diff adds MemBase::FrameBase, which keeps track of pushes and pops so code can refer to the pre-modification sp (base sp, as I call it in the diff). The tracking has to be done late in the backend because the backend inserts pushes and pops, like we see here.
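
To make the tracking idea concrete, here is a minimal sketch of a late pass that rewrites base-SP operands into current-SP operands. The `Insn` and `MemBase` types below are hypothetical simplifications made up for illustration, not the actual ZJIT LIR, and the 16-byte slot size is an assumption matching the `str x1, [sp, #-0x10]!` push in the listing above.

```rust
/// Hypothetical, simplified operand base; not ZJIT's real `MemBase`.
#[derive(Debug, Clone, Copy, PartialEq)]
enum MemBase {
    /// Relative to wherever the native SP points right now.
    Sp,
    /// Relative to the SP value before any pushes in the body ("base SP").
    FrameBase,
}

/// Hypothetical, simplified instruction set for illustration only.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Insn {
    /// Moves SP down by one slot.
    Push,
    /// Moves SP up by one slot.
    Pop,
    /// Loads from `base + disp`.
    Load { base: MemBase, disp: i32 },
}

/// Assumption: each push moves SP by 16 bytes, as on arm64.
const SLOT_SIZE: i32 = 16;

/// Walk the final instruction sequence, tally how far SP has moved,
/// and turn every FrameBase operand into an SP-relative one.
fn resolve_frame_base(insns: &[Insn]) -> Vec<Insn> {
    let mut pushed_bytes = 0;
    insns
        .iter()
        .map(|insn| match *insn {
            Insn::Push => {
                pushed_bytes += SLOT_SIZE;
                *insn
            }
            Insn::Pop => {
                pushed_bytes -= SLOT_SIZE;
                *insn
            }
            // SP is `pushed_bytes` below base SP, so compensate.
            Insn::Load { base: MemBase::FrameBase, disp } => Insn::Load {
                base: MemBase::Sp,
                disp: disp + pushed_bytes,
            },
            other => other,
        })
        .collect()
}
```

With this sketch, a `Load` of `n8` at FrameBase offset 0 that sits between a `Push` and a `Pop` comes out as an SP-relative load at offset 16, instead of the bare `[sp]` that read the array back in the miscompiled code. Note that it only sees `Push` and `Pop`; any other way of moving SP goes untracked.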

This will also be useful for things like concatstrings, where we want to generate LIR that pushes onto the native stack and still refer to other on stack elements afterwards.

Now, I'm not a huge fan of this because it feels like a leaky abstraction. It can only track modifications to SP that are known at compile time, like stack pushes and pops. Dynamic stack space allocation will silently miscompile. The tracking is also sensitive to the sequence of LIR that ends up modifying SP, so modifications that are knowable at compile time might still not be tracked properly if they're too indirect.

What to do then?

Some quick possible alternatives:

- Don't ever move SP inside the body of functions. This implies tallying all the stack usages up front in the backend and reserving that amount in the prologue. For cases like this one where alloc_reg() needs to preserve values, it would store directly to the preallocated space instead of using push/pop. It also implies banning dynamic stack space allocation using true runtime numbers.
- Detect this pitfall by panicking whenever the backend sees push/pop, and also panic on attempts to hold values across calls. This makes codegen hard to write correctly. Probably too hard.
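
For the first alternative, the up-front tally could look roughly like the sketch below. `CallSite`, `spill_area_size`, and `spill_slot_offset` are hypothetical names made up for illustration, and the 8-byte slot and 16-byte alignment are assumptions. It also assumes the backend already knows how many values are live across each call, which is the hard part.

```rust
/// Hypothetical summary of one C call site: how many live values
/// must be preserved across it.
struct CallSite {
    live_across: usize,
}

/// Assumption: one preserved value takes one 8-byte slot.
const SLOT_SIZE: usize = 8;
/// Assumption: the native stack must stay 16-byte aligned.
const STACK_ALIGN: usize = 16;

/// Bytes to reserve once in the prologue so that no call site
/// needs to push or pop inside the function body.
fn spill_area_size(calls: &[CallSite]) -> usize {
    // The busiest call site decides how much space is needed.
    let max_live = calls.iter().map(|c| c.live_across).max().unwrap_or(0);
    let bytes = max_live * SLOT_SIZE;
    // Round up to the stack alignment.
    (bytes + STACK_ALIGN - 1) / STACK_ALIGN * STACK_ALIGN
}

/// SP-relative offset of the `index`-th preserved value, assuming the
/// spill area sits at the lowest addresses of the frame. Since SP
/// never moves in the body, this offset stays valid everywhere.
fn spill_slot_offset(index: usize) -> usize {
    index * SLOT_SIZE
}
```

Under these assumptions, a function whose busiest call site keeps three values alive reserves 32 bytes, and alloc_reg() would store to `[sp, #spill_slot_offset(i)]` rather than emitting a push.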

I'd like your opinion on this.

Keeping the same name makes re-exporting more concise.

This is to fix `def a(n1,n2,n3,n4,n5,n6,n7,n8) = [n8]`, and for future changes that push values onto the native stack. Before this, the backend automatically preserved the array in gen_new_array() with an inserted `asm.c_push()`, which invalidated the static memory offset based on native SP for `n8`.

tekknolagi · 2025-07-24T17:47:40Z

I like high watermark and bump SP (with RBP) approach personally

k0kubun · 2025-07-24T18:04:46Z

> Also implies banning dynamic stack space allocation using true runtime numbers

If we always allocate iseq->body->stack_max slots on the stack, would it always guarantee that there's enough space for processing gen_new_array() or concatstrings?

> I like high watermark and bump SP (with RBP) approach personally

The problem Alan is talking about is that the backend moves SP on its own for pushing live registers on C calls. To address that with a high watermark, you need to know the maximum number of live registers on C calls. It's impossible to know as of setting up a frame, so I guess we'd have to always bump SP by the number of allocatable registers.

I wonder how much stack space we can safely waste for each frame.

XrXr · 2025-07-24T18:10:32Z

> I like high watermark and bump SP (with RBP) approach personally

Ah yes, that's another approach. If we're willing to preserve RBP on x86 (we don't currently) then we can push and pop and still refer to things using a static offset based on RBP. On ARM we already always have an RBP equivalent (x29) because macOS requires it.
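
A toy model of why the frame pointer helps, as a sketch only. `Frame` is a hypothetical type made up for illustration, and the prologue it models (save the caller's frame pointer and the return address in 16 bytes, then set FP = SP) is an assumption, not a description of what FrameSetup emits.

```rust
/// Toy model of native stack addresses; hypothetical, for illustration.
struct Frame {
    /// Frame pointer: fixed once the prologue is done.
    fp: u64,
    /// Stack pointer: may move inside the function body.
    sp: u64,
}

impl Frame {
    /// Assumed prologue: save the caller's FP and the return address
    /// (16 bytes total), then set FP = SP.
    fn enter(entry_sp: u64) -> Self {
        let sp = entry_sp - 16;
        Frame { fp: sp, sp }
    }

    /// A backend-inserted push, e.g. to preserve a register across a call.
    fn push(&mut self) {
        self.sp -= 16;
    }

    /// The matching pop.
    fn pop(&mut self) {
        self.sp += 16;
    }

    /// Address computed from the current SP; shifts with every push.
    fn sp_relative(&self, disp: u64) -> u64 {
        self.sp + disp
    }

    /// Address computed from FP; unaffected by pushes and pops.
    fn fp_relative(&self, disp: u64) -> u64 {
        self.fp + disp
    }
}
```

In this model a stack-passed argument at the caller's SP is reachable as `fp_relative(16)` for the whole function body, while `sp_relative(16)` points at the same address only as long as nothing has been pushed.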

k0kubun · 2025-07-24T18:16:26Z

Sorry for misunderstanding the idea. So... I guess we'll tweak the FrameSetup for x86_64 and switch to RBP-based operands then?

launchable-app · 2025-07-24T18:44:25Z

❌ Tests Failed

✖️no tests failed ✔️62162 tests passed(1 flake)

XrXr · 2025-07-24T19:16:48Z

> I guess we'll tweak the FrameSetup for x86_64 and switch to RBP-based operands then?

Yes that's the idea. Seems like the best option for now.

XrXr added 3 commits July 23, 2025 22:42

ZJIT: DRY up underscore rexport anti-pattern

33c93f2

Keeping the same name makes re-exporting more concise.

ZJIT: A64: Add add_extended() which can add a register to sp

aad22bc

matzbot requested a review from a team July 24, 2025 17:43

XrXr mentioned this pull request Jul 26, 2025

ZJIT: Keep a frame pointer and use it for memory params #14019

Open
