[Nasm-devel] A few rip-rel things

Discussion:

Peter Johnson

2007-09-19 05:44:11 UTC

In going through the new rip relative stuff in NASM, and noticed a few
things:

1. It seems ugly to make an exception for just FS and GS to have an
exception to default-RIP-rel (and the current wording in the NASM manual
is confusing too)... wouldn't it be better to just turn off
default-RIP-rel for ALL segment registers? Particularly seeing as an es
override is ignored with a warning. I would think that if a user is using
*any* segment register, having default RIP-relative would be a surprise.

2. It appears handling of 64-bit movoffs values in default rel mode is
broken in NASM:

default rel
mov rax, [123456789abcdef0h]
mov rax, [qword 123456789abcdef0h]

both mov's generate 32-bit RIP-relative values with no warning. This
feels like a bug... but see #3 for more on what's the *real* issue here.

3. I believe that due to 64-bit movoffs forms being "special" (in allowing
a 64-bit displacement), the use of qword in the [] should force the
displacement size, not the address size (and likewise for dword, which
should force a 32-bit displacement size). Note displacement size can be
different than address size in 64-bit mode (okay, only for the movoffs
form, but consistency is good across the board)! Is there a current way
in NASM's world to force either a 32-bit or 64-bit displacement size (NOT
address size!) for mov rax, [...]?

Note that the default in 64-bit mode should be NOT to use the movoffs
form, as this is 4 bytes longer than the modrm-version. Plus a number of
object formats much prefer 32-bit relative offsets rather than the 64-bit
one (win64 for example chokes heavily on 64-bit relocs in the linker
stage, so you'll end up with broken behavior with the current NASM
output).

mov rax, [dword foo] makes the displacement 4 bytes, but generates an a32
prefix on what is still a movoffs form instruction. Not exactly what we
want.

This is why in yasm I use [dword ...] and [qword ...] to force the
displacement size, and a32 and a64 to force the address size, to separate
these two concepts.

So the question is, what should the following example output? (yasm
behavior in comments). Run it through the current NASM and think about
what makes more sense. :)

bits 64
val:

default abs

mov rax, [val] ; 48 8b ... (32-bit disp)
mov rax, [dword val] ; 48 8b ... (32-bit disp)
mov rax, [qword val] ; 48 a1 ... (64-bit disp)
a32 mov rax, [val] ; 67 48 a1 ... (32-bit disp)
a32 mov rax, [dword val] ; 67 48 a1 ... (32-bit disp)
a32 mov rax, [qword val] ; 67 48 a1 ... (32-bit disp)
; [this one is debatable on correctness,
; I chose in yasm to make a32 override]
a64 mov rax, [val] ; 48 8b ... (32-bit disp)
a64 mov rax, [dword val] ; 48 8b ... (32-bit disp)
a64 mov rax, [qword val] ; 48 a1 ... (64-bit disp)

mov rbx, [val] ; 48 8b ... (32-bit disp)
mov rbx, [dword val] ; 48 8b ... (32-bit disp)
mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a32 mov rbx, [val] ; 67 48 8b ... (32-bit disp)
a32 mov rbx, [dword val] ; 67 48 8b ... (32-bit disp)
a32 mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a64 mov rbx, [val] ; 48 8b ... (32-bit disp)
a64 mov rbx, [dword val] ; 48 8b ... (32-bit disp)
a64 mov rbx, [qword val] ; illegal (can't have 64-bit disp)

default rel

; yasm doesn't do this yet, but this is what I think makes sense

mov rax, [val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rax, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rax, [qword val] ; 48 a1 ... (64-bit disp, ABS)
a32 mov rax, [val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rax, [dword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rax, [qword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
; [this one is debatable on correctness,
; I chose in yasm to make a32 override]
a64 mov rax, [val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rax, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rax, [qword val] ; 48 a1 ... (64-bit disp, ABS)

mov rbx, [val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rbx, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a32 mov rbx, [val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rbx, [dword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a64 mov rbx, [val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rbx, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rbx, [qword val] ; illegal (can't have 64-bit disp)

Peter

Peter Johnson

2007-09-19 05:48:23 UTC

Permalink

3. I believe that due to 64-bit movoffs forms being "special" (in allowing a
64-bit displacement), the use of qword in the [] should force the
displacement size, not the address size (and likewise for dword, which
should force a 32-bit displacement size). Note displacement size can be
different than address size in 64-bit mode (okay, only for the movoffs form,
but consistency is good across the board)! Is there a current way in NASM's
world to force either a 32-bit or 64-bit displacement size (NOT address
size!) for mov rax, [...]?
Note that the default in 64-bit mode should be NOT to use the movoffs form,
as this is 4 bytes longer than the modrm-version. Plus a number of object
formats much prefer 32-bit relative offsets rather than the 64-bit one
(win64 for example chokes heavily on 64-bit relocs in the linker stage, so
you'll end up with broken behavior with the current NASM output).
mov rax, [dword foo] makes the displacement 4 bytes, but generates an a32
prefix on what is still a movoffs form instruction. Not exactly what we
want.
This is why in yasm I use [dword ...] and [qword ...] to force the
displacement size, and a32 and a64 to force the address size, to separate
these two concepts.
So the question is, what should the following example output? (yasm behavior
in comments). Run it through the current NASM and think about what makes
more sense. :)
bits 64
default abs
mov rax, [val] ; 48 8b ... (32-bit disp)
mov rax, [dword val] ; 48 8b ... (32-bit disp)
mov rax, [qword val] ; 48 a1 ... (64-bit disp)
a32 mov rax, [val] ; 67 48 a1 ... (32-bit disp)
a32 mov rax, [dword val] ; 67 48 a1 ... (32-bit disp)
a32 mov rax, [qword val] ; 67 48 a1 ... (32-bit disp)
; [this one is debatable on correctness,
; I chose in yasm to make a32 override]
a64 mov rax, [val] ; 48 8b ... (32-bit disp)
a64 mov rax, [dword val] ; 48 8b ... (32-bit disp)
a64 mov rax, [qword val] ; 48 a1 ... (64-bit disp)
mov rbx, [val] ; 48 8b ... (32-bit disp)
mov rbx, [dword val] ; 48 8b ... (32-bit disp)
mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a32 mov rbx, [val] ; 67 48 8b ... (32-bit disp)
a32 mov rbx, [dword val] ; 67 48 8b ... (32-bit disp)
a32 mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a64 mov rbx, [val] ; 48 8b ... (32-bit disp)
a64 mov rbx, [dword val] ; 48 8b ... (32-bit disp)
a64 mov rbx, [qword val] ; illegal (can't have 64-bit disp)
default rel
; yasm doesn't do this yet, but this is what I think makes sense
mov rax, [val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rax, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rax, [qword val] ; 48 a1 ... (64-bit disp, ABS)
a32 mov rax, [val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rax, [dword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rax, [qword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
; [this one is debatable on correctness,
; I chose in yasm to make a32 override]
a64 mov rax, [val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rax, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rax, [qword val] ; 48 a1 ... (64-bit disp, ABS)
mov rbx, [val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rbx, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a32 mov rbx, [val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rbx, [dword val] ; 67 48 8b ... (32-bit disp, RIP-rel)
a32 mov rbx, [qword val] ; illegal (can't have 64-bit disp)
a64 mov rbx, [val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rbx, [dword val] ; 48 8b ... (32-bit disp, RIP-rel)
a64 mov rbx, [qword val] ; illegal (can't have 64-bit disp)

And of course something similar applies to the recent 64-bit immediate
discussion as well...

; yasm's behavior in comments

mov rax, val ; 32-bit imm
mov rax, dword val ; 32-bit imm
mov rax, qword val ; 64-bit imm

mov rbx, val ; 32-bit imm
mov rbx, dword val ; 32-bit imm
mov rbx, qword val ; 64-bit imm

Peter

H. Peter Anvin

2007-09-19 18:42:19 UTC

Permalink

Post by Peter Johnson
In going through the new rip relative stuff in NASM, and noticed a few
1. It seems ugly to make an exception for just FS and GS to have an
exception to default-RIP-rel (and the current wording in the NASM manual
is confusing too)... wouldn't it be better to just turn off
default-RIP-rel for ALL segment registers? Particularly seeing as an es
override is ignored with a warning. I would think that if a user is using
*any* segment register, having default RIP-relative would be a surprise.

No, that really makes sense, as hideously non-orthogonal as it may be
(there are some ugly hacks in x86-64). nasm64developer pointed out to
me that on some CPUs, it is possible to enable limit checks for segment
registers even in 64-bit mode, but only FS and GS have bases.

Post by Peter Johnson
2. It appears handling of 64-bit movoffs values in default rel mode is
default rel
mov rax, [123456789abcdef0h]
mov rax, [qword 123456789abcdef0h]
both mov's generate 32-bit RIP-relative values with no warning. This
feels like a bug... but see #3 for more on what's the *real* issue here.

No, that's the right thing, because you don't know a priori that that
isn't a valid address in that range. [abs 123456789abcdef0h] should do
the movoffs form.

In current NASM, word, dword and qword on an addressing operand applies
to the address size whereas byte specifies the displacement size. This
is inconsistent, yes, but it is the established behaviour and would
definitely have to be maintained for non-64-bit code. Introducing new
behaviour for 64-bit mode would have to be balanced against the other
tradeoffs.

Post by Peter Johnson
3. I believe that due to 64-bit movoffs forms being "special" (in allowing
a 64-bit displacement), the use of qword in the [] should force the
displacement size, not the address size (and likewise for dword, which
should force a 32-bit displacement size). Note displacement size can be
different than address size in 64-bit mode (okay, only for the movoffs
form, but consistency is good across the board)! Is there a current way
in NASM's world to force either a 32-bit or 64-bit displacement size (NOT
address size!) for mov rax, [...]?

There isn't at the moment. See below, though.

However, I'm adamant that "mov rax,[abs 123456789abtcdef0h]" should use
the 64-bit form by default; the optimizer should be allow to reduce it
to 32 bits if the address fits.

Post by Peter Johnson
Note that the default in 64-bit mode should be NOT to use the movoffs
form, as this is 4 bytes longer than the modrm-version. Plus a number of
object formats much prefer 32-bit relative offsets rather than the 64-bit
one (win64 for example chokes heavily on 64-bit relocs in the linker
stage, so you'll end up with broken behavior with the current NASM
output).

That's bogus. That's what "default rel" is for.

The real question is how much value it is in the 32-bit displacement
forms as opposed to the a32 form. Remember, we're only talking absolute
addresses here (not relative addresses nor ); the 32-bit displacement
form can produce addresses in the range ±2 GB whereas the a32 form
produces addresses in the range 0-4 GB. So the only consumers of the
former would be something like an OS kernel which doesn't use
RIP-relative addressing... a pretty rare beast.

Post by Peter Johnson
mov rax, [dword foo] makes the displacement 4 bytes, but generates an a32
prefix on what is still a movoffs form instruction. Not exactly what we
want.

No, I think that is exactly what we want.

I have mentioned in the past that I'd like to use the syntax:

mov rax,[abs a64 dword bluttan]

... to produce the 32-bit displacement form. Yes, it's heavy on syntax,
but it is such a rare corner case.

This isn't implemented yet, though, nor am I really convinced it's the
right thing.

Post by Peter Johnson
This is why in yasm I use [dword ...] and [qword ...] to force the
displacement size, and a32 and a64 to force the address size, to separate
these two concepts.

That's fine in many ways, but it is a much bigger departure from
historical NASM syntax. Even though we could do this in 64-bit mode
(ONLY!), I'm a bit leery of doing so for the benefit of one single
instruction.

-hpa

Peter Johnson

2007-09-20 16:08:18 UTC