Add EIP: EVM64 - Little endian opcodes #9821

sorpaas · 2025-05-26T01:32:04Z

This defines BYTE64, MLOAD64, MSTORE64 and PUSH*64.

eth-bot · 2025-05-26T01:32:32Z

File `EIPS/eip-evm64-le.md`

Requires 1 more reviewers from @g11tech, @lightclient, @SamWilsn

jochem-brouwer

I think the EIP should motivate why the bytes are "reversed" in some cases. I do not know much about low-level hardware, but it seems to me that even on 64-bit processors you still need to reverse the bytes at some point (either when coming from the legacy 256-bit EVM to here, or when going from here back into the 256-bit EVM).

The gas costs are also likely too low, especially given the "extra" bitmask and reversal work.

jochem-brouwer · 2025-05-26T01:41:10Z

EIPS/eip-evm64-le.md

+
+### `BYTE64`
+
+`BYTE64` (0xc01a) is defined as `(x >> i * 8) & 0xFF`. Note that the definition is changed from big endian to little endian.


I assume this masks the stack bits above the 64-bit range? If I query byte 9, this should always push 0 to the stack, right?

Yeah, this is the same behavior as the 256-bit `BYTE`: if the index is out of range, we push 0.
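For readers following the thread, here is a minimal Python sketch of the difference between the proposed little-endian `BYTE64` and the existing big-endian `BYTE`. The function names `byte64` and `byte256` and the constant `MASK64` are hypothetical, and the sketch assumes that both operands are taken from the low 64 bits of their stack items; it is an illustration of the quoted definition, not the EIP's reference implementation.

```python
MASK64 = (1 << 64) - 1  # hypothetical helper: keeps the low 64 bits of a stack item


def byte64(i: int, x: int) -> int:
    """Little-endian BYTE64 sketch: index 0 is the least significant byte of x."""
    i &= MASK64  # assumption: only the low 64 bits of each operand are read
    x &= MASK64
    if i >= 8:  # index outside the 64-bit range: push 0
        return 0
    return (x >> (i * 8)) & 0xFF


def byte256(i: int, x: int) -> int:
    """Existing big-endian BYTE: index 0 is the most significant byte of x."""
    if i >= 32:  # index outside the 256-bit range: push 0
        return 0
    return (x >> (8 * (31 - i))) & 0xFF
```

With `x = 0x1020`, `byte64(0, x)` selects `0x20`, while the 256-bit `BYTE` needs index 31 to reach the same byte.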

jochem-brouwer · 2025-05-26T01:42:06Z

EIPS/eip-evm64-le.md

+
+`BYTE64` (0xc01a) is defined as `(x >> i * 8) & 0xFF`. Note that the definition is changed from big endian to little endian.
+
+The gas cost is `G_VERYLOW64`.


The 256-bit variant costs 3 gas, and this one likely also needs to mask the topmost bits (the "bits on the left") first (?)

It does not need to mask the topmost bits; it just discards them. `x` is 64-bit, and `i` is 64-bit. Whatever the topmost 192 bits were, we never touch them!

jochem-brouwer · 2025-05-26T01:45:00Z

EIPS/eip-evm64-le.md

+
+### `MLOAD64` and `MSTORE64`
+
+`MLOAD64` (0xc051) will load a 64-bit integer in little endian onto the stack. `MSTORE64` (0xc052) will read an 64-bit integer from the stack, and store it to memory in little endian.


This changes the memory layout from big endian to little endian. So `MSTORE(0, 0x1020)` will behave differently from `MSTORE64(0, 0x1020)`? 🤔 EDIT: yes, it will; this is explained in the text below 😄 👍

I'm assuming this works like `MSTORE8`, so it will write exactly 64 bits, no more and no less?

Yeah, they behave differently. The length is also different. `MSTORE(0, 0x1020)` would leave memory as `[0x0 .. 0x0 0x10 0x20]` (32 bytes in total), and `MSTORE64(0, 0x1020)` would leave it as `[0x20 0x10 0x0 .. 0x0]` (8 bytes in total).

`MSTORE8` writes only one byte, so endianness does not matter.
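The two layouts described above can be reproduced with a small Python sketch. The helpers `mstore`, `mstore64` and `mload64` are hypothetical names, memory is modeled as a plain `bytearray` that grows byte by byte, and gas accounting is deliberately left out.

```python
def _grow(memory: bytearray, end: int) -> None:
    """Zero-extend the modeled memory so that index end - 1 is valid."""
    if len(memory) < end:
        memory.extend(bytes(end - len(memory)))


def mstore(memory: bytearray, offset: int, value: int) -> None:
    """256-bit MSTORE: writes 32 bytes in big-endian order."""
    _grow(memory, offset + 32)
    memory[offset:offset + 32] = value.to_bytes(32, "big")


def mstore64(memory: bytearray, offset: int, value: int) -> None:
    """Proposed MSTORE64 (sketch): writes the low 64 bits as 8 little-endian bytes."""
    _grow(memory, offset + 8)
    low = value & 0xFFFFFFFFFFFFFFFF
    memory[offset:offset + 8] = low.to_bytes(8, "little")


def mload64(memory: bytearray, offset: int) -> int:
    """Proposed MLOAD64 (sketch): reads 8 little-endian bytes as an integer."""
    return int.from_bytes(memory[offset:offset + 8], "little")
```

Storing `0x1020` at offset 0 with `mstore` yields 30 zero bytes followed by `0x10 0x20`; with `mstore64` it yields `0x20 0x10` followed by six zero bytes.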

jochem-brouwer · 2025-05-26T01:45:36Z

EIPS/eip-evm64-le.md

+
+`MLOAD64` (0xc051) will load a 64-bit integer in little endian onto the stack. `MSTORE64` (0xc052) will read an 64-bit integer from the stack, and store it to memory in little endian.
+
+The gas cost for both opcodes is `G_VERYLOW64`.


What about the dynamic memory resizing costs? Also, since this needs to "reverse" the bytes before writing to the underlying memory, it seems like more work, so the cost is likely too low.

As for the resizing cost, it extends the memory by 8 bytes (rather than 32 bytes).

See the comment below about "reversing" the bytes!

jochem-brouwer · 2025-05-26T01:46:51Z

EIPS/eip-evm64-le.md

+
+`PUSH2_64` (0xc061) to `PUSH8_64` (0xc067) follows 2-byte to 8-byte literal. The literal is read little endian and pushed onto the stack.
+
+The gas cost for them is `G_VERYLOW64`.


PUSH costs 3 gas, and this has to read code plus reverse it, so this is also likely too low.
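As a rough illustration of what "read little endian" means for the push literal, here is a Python sketch. The function name `push64_literal` is hypothetical, and the sketch assumes that the two-byte opcode is laid out in code as the byte `0xc0` followed by `0x61` to `0x67`, and that the code is long enough to contain the full literal.

```python
def push64_literal(code: bytes, pc: int) -> int:
    """Decode the literal of a PUSHn_64 instruction located at code[pc]."""
    prefix, op = code[pc], code[pc + 1]
    if prefix != 0xC0 or not 0x61 <= op <= 0x67:
        raise ValueError("not a PUSHn_64 opcode")
    n = op - 0x5F  # 0x61 -> 2-byte literal, ..., 0x67 -> 8-byte literal
    literal = code[pc + 2:pc + 2 + n]
    # The first literal byte is the least significant one.
    return int.from_bytes(literal, "little")
```

For the code bytes `c0 61 20 10`, this pushes `0x1020`; a legacy big-endian `PUSH2` over the literal bytes `20 10` would push `0x2010` instead.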

sorpaas · 2025-05-26T02:17:34Z

@jochem-brouwer

> I think the EIP should motivate why the bytes are "reversed" in some cases. I do not know much about low-level hardware, but it seems to me that even on 64-bit processors you still need to reverse the bytes at some point (either when coming from the legacy 256-bit EVM to here, or when going from here back into the 256-bit EVM).
>
> The gas costs are also likely too low, especially given the "extra" bitmask and reversal work.

I want to address the points about reversal being "extra" work together.

An efficient EVM interpreter already stores each 256-bit stack value as a little-endian U256. Generally, that is a `[u64; 4]` (the least significant `u64` can be the first or the last item of the array; it doesn't matter). This is effectively the only way the interpreter can work, because it cannot do math, or practically any operation other than clone/copy, on a 256-bit value unless it is little endian, and we run on little-endian machines anyway! The interpreter then "reverses" the bytes whenever it needs to store values into memory.

So as you can see, it is not the 64-bit little-endian opcodes that need the "extra" reversal work, but the 256-bit big-endian opcodes! I think this should also address your gas cost concerns on the other points.
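To make the argument concrete, here is a hedged Python sketch of the limb representation. The names `to_limbs`, `mstore_image` and `mstore64_image` are hypothetical, the limb order (least significant first) is one of the two options mentioned above, and Python integers only model what a native interpreter would do with `u64` values.

```python
LIMB_MASK = (1 << 64) - 1


def to_limbs(value: int) -> list:
    """Split a 256-bit value into four 64-bit limbs, least significant limb first."""
    return [(value >> (64 * k)) & LIMB_MASK for k in range(4)]


def mstore_image(limbs: list) -> bytes:
    """32 bytes as written by 256-bit MSTORE.

    The limbs are emitted most significant first, and each limb is written
    big endian, which is the reverse of its layout on a little-endian host.
    """
    return b"".join(limb.to_bytes(8, "big") for limb in reversed(limbs))


def mstore64_image(limbs: list) -> bytes:
    """8 bytes as written by the proposed MSTORE64.

    Only the lowest limb is used, in little-endian order, which matches the
    in-memory layout of a u64 on a little-endian host.
    """
    return limbs[0].to_bytes(8, "little")
```

In this model the big-endian path touches all four limbs and reorders them, while the little-endian 64-bit path copies a single limb as it is.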

On the other hand, I agree this is "confusing". It took me a long time to convince even myself that these little-endian opcodes would be beneficial and efficient (they indeed are!). So I perfectly understand that all this little endian / big endian stuff sounds really confusing to any core dev. This is also why this is defined as a separate "optional EIP" rather than together with the core EVM64 EIP-7937. I believe that one is rather straightforward and easy to understand, but for this one it will probably take an awfully long time before everyone is convinced that everything ought to be defined this way.

jochem-brouwer · 2025-05-26T02:26:58Z

Aha, ok, yes, this clarifies a lot. I think my confusion also comes from writing stuff in JavaScript, which is obviously very far away from low-level code. I also realized (or rather, had to look up) that most systems are little endian and, considering that, you are right: it is indeed the big-endian values that first need to be converted.

I think it would clarify a lot if you could include your post above in the EIP 😄 👍 On the other hand, this might also be common knowledge, and I come from a different "world" where this is not obvious and thus confusing.

github-actions · 2025-05-26T02:31:45Z

The commit c17b9ab (as a parent of da5b456) contains errors.
Please inspect the Run Summary for details.

sorpaas · 2025-05-28T06:55:47Z

@Editors could you help assign a number to this EIP? Thank you!

EVM64 - Little endian opcodes

3087b24

sorpaas requested a review from eth-bot as a code owner May 26, 2025 01:32

github-actions bot added c-new Creates a brand new proposal s-draft This EIP is a Draft t-core labels May 26, 2025

eth-bot added e-consensus Waiting on editor consensus e-review Waiting on editor to review labels May 26, 2025

eth-bot changed the title ~~EVM64 - Little endian opcodes~~ Add EIP: EVM64 - Little endian opcodes May 26, 2025

github-actions bot added the w-ci Waiting on CI to pass label May 26, 2025

jochem-brouwer reviewed May 26, 2025

Update eip-evm64-le.md

c17b9ab
