
Conversation

gballet
Member

@gballet gballet commented Aug 14, 2025

πŸ—’οΈ Description

Add a test required as part of the BloatNet effort. This is the

🔗 Related Issues or PRs

Not an issue, but a test plan is described here.

✅ Checklist

  • All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
    uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
  • All: PR title adheres to the repo standard - it will be used as the squash commit message and should start with type(scope):.
  • All: Considered adding an entry to CHANGELOG.md.
  • All: Considered updating the online docs in the ./docs/ directory.
  • All: Set appropriate labels for the changes (only maintainers can apply labels).
  • Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
  • Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry to the list.
  • Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

Signed-off-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
@gballet gballet changed the title BloatNet: add first few single-opcode test for state access. feat(tests): add first few single-opcode test for state access in BloatNet Aug 14, 2025
Signed-off-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>

remove leftover single whitespace :|
@gballet gballet force-pushed the bloatnet-test-SSTORE branch from b6cd62a to 374e08a Compare August 14, 2025 19:16
@LouisTsai-Csie
Collaborator

LouisTsai-Csie commented Aug 15, 2025

Hello @gballet ! Thanks for adding this case.

This is the issue tracker for BloatNet test cases. Could you please (1) add this PR to the issue tracker's PR description (like this) and (2) link this PR to the issue? This would help us better track progress, thank you!

For benchmark tests, we now add new cases under tests/benchmark, and I think test_worst_stateful_opcodes.py is the best fit for your test.

I also added some review comments below; please feel free to let me know if you have any issues! If you want a reference for a benchmark test, take a look at this similar case and its structure.


REFERENCE_SPEC_GIT_PATH = "DUMMY/eip-DUMMY.md"
REFERENCE_SPEC_VERSION = "0.1"
GAS_LIMIT = 30_000_000 # Default gas limit seems to be >90M in this env
Collaborator

Is this for the transaction gas limit cap? If so, this value has been updated to 2**24 (reference), and we would like to use fork.transaction_gas_limit_cap() in the test; it will adjust to the right value based on the fork.

Member Author

The value I find is much higher than 2**24... but yeah, if that function gives me the info, I'll use it. Is it going to be valid in benchmarks though?

Collaborator

@LouisTsai-Csie LouisTsai-Csie Aug 21, 2025

I guess what you found is the following fixture:

BENCHMARKING_MAX_GAS = 1_000_000_000_000

@pytest.fixture
def env(request: pytest.FixtureRequest) -> Environment:  # noqa: D103
    """Return an Environment instance with appropriate gas limit based on test type."""
    if request.node.get_closest_marker("benchmark") is not None:
        return Environment(gas_limit=BENCHMARKING_MAX_GAS)
    return Environment()

BENCHMARKING_MAX_GAS is not the actual benchmarking value we are using. Currently we test under 1, 10, 30, 45, 60, 100, and 150M gas limits, and this is passed to the gas_benchmark_value parameter. In the past, we created a different genesis file for each of these configurations (so for each config, we needed a corresponding genesis file), but now we set the gas limit to an extremely high value and then restrict the block gas usage to each configuration, so that we only need one genesis file. This helps reduce the clients' overhead in running benchmarks.

Regarding fork.transaction_gas_limit_cap(): (1) if you want to test under Prague now, you can directly pass the gas_benchmark_value fixture as the gas limit for the transaction. (2) But if you want to fill the test for Osaka, you might need to use a blockchain test that fills the block with transactions, each consuming at most 2**24 gas as the tx gas limit.

But I think you can go with the first option for now; we are working on a general refactoring that automatically upgrades all the benchmark tests to be compatible with the Osaka fork.
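For illustration, here is a minimal sketch of option (1), passing gas_benchmark_value directly as the transaction gas limit. This is only a sketch under the assumptions discussed above: the imports and helpers mirror other tests in this repo, and the tiny sstore_code body is a stand-in for the real SSTORE loop, so double-check the details against the actual framework.

import pytest

from ethereum_test_tools import Alloc, Block, BlockchainTestFiller, Transaction
from ethereum_test_tools.vm.opcode import Opcodes as Op


@pytest.mark.valid_from("Prague")
def test_bloatnet_sstore_sketch(
    blockchain_test: BlockchainTestFiller,
    pre: Alloc,
    gas_benchmark_value: int,
):
    """Fill a single transaction with SSTOREs up to the benchmark gas value."""
    # Stand-in code body: store a non-zero value into a handful of fresh slots.
    sstore_code = sum(Op.SSTORE(i, 1) for i in range(100))
    contract = pre.deploy_contract(code=sstore_code)
    tx = Transaction(
        to=contract,
        gas_limit=gas_benchmark_value,  # option (1): tx gas limit == benchmark value
        sender=pre.fund_eoa(),
    )
    blockchain_test(pre=pre, post={}, blocks=[Block(txs=[tx])])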

Member Author

So what I would like is the max gas limit for the block, not the gas limit per transaction. Is there a helper for that?

Member Author

nvm, found it

Comment on lines 35 to 49
while totalgas + gas_increment < GAS_LIMIT:
totalgas += gas_increment
# print(f"increment={gas_increment} < totalgas={totalgas} i={i}")
sstore_code = sstore_code + Op.DUP1
if i < 256:
sstore_code = sstore_code + Op.PUSH1(i)
else:
sstore_code = sstore_code + Op.PUSH2(i)

sstore_code = sstore_code + Op.SSTORE(unchecked=True)

storage[storage_slot] = 0x02 << 248
storage_slot += 1
i += 1
sstore_code = sstore_code + Op.POP # Drop last value on the stack
Collaborator

My understanding is that here we want to update storage. Maybe we can try something like:

setup = Op.CALLDATALOAD(0)

available_gas = (
    fork.transaction_gas_limit_cap()
    # intrinsic gas cost
    - fork.transaction_intrinsic_cost_calculator()(
        calldata=code,
    )
    # gas cost for push0 and calldataload
    - fork.gas_costs().G_BASE * 2
)

# gas cost for each storage operation
iteration_count = available_gas // (
    fork.gas_costs().G_VERY_LOW * 2
    + fork.gas_costs().G_STORAGE_SET
    + fork.gas_costs().G_COLD_SLOAD
)

code = setup + sum(Op.SSTORE(i, Op.DUP1) for i in range(iteration_count))
assert len(code) <= fork.max_code_size()

For the SSTORE operation, Op.SSTORE will automatically pick the right PUSH variant based on your data size.

I am not sure if this applies to this case, but we use a hash function for the storage key to make the slots non-consecutive when benchmarking access-list transactions.
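A minimal sketch of the non-consecutive-slot idea, reusing setup and iteration_count from the snippet above (the hash here is plain hashlib, purely for illustration; the exact hash the benchmark suite uses may differ):

import hashlib

def hashed_slot(i: int) -> int:
    """Scatter slot i across the storage trie instead of using consecutive keys."""
    return int.from_bytes(hashlib.sha256(i.to_bytes(32, "big")).digest(), "big")

code = setup + sum(Op.SSTORE(hashed_slot(i), Op.DUP1) for i in range(iteration_count))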

Member Author

The sum() over range() won't work as-is, because we also collect the locations that are touched; although, since the slot index grows linearly, we could simply do that in a separate loop. It feels like it will be slower, but you might know better.

Question: is it really correct to write Op.SSTORE(i, Op.DUP1)? It sounds too good to be true, so I want to double-check.

Member Author

found other instances, ok, that's amazing ❤️

Signed-off-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
@gballet
Member Author

gballet commented Aug 21, 2025

Hey @LouisTsai-Csie thanks for the feedback.

This is the issue tracker for BloatNet test cases. Could you please (1) add this PR to the issue tracker's PR description (like this) and (2) link this PR to the issue? This would help us better track progress, thank you!

I tried to do this, but it looks like it's very involved. I made my best effort, but since I don't know exactly what you're expecting, and I don't have all the time in the world, I'll leave it in your court to comment on that. #2064

For benchmark tests, we now add new cases under tests/benchmark, and I think test_worst_stateful_opcodes.py is the best fit for your test.

If I do that, how do I run the test? The framework seems to ignore it after I moved it to that directory. I have pushed it to this PR for your consideration.

I also added some review comments below; please feel free to let me know if you have any issues! If you want a reference for a benchmark test, take a look at this similar case and its structure.

Thanks for the reference.

@LouisTsai-Csie
Collaborator

@gballet Apologies, I forgot to link the issue tracker for you. We've created an issue tracker based on your documentation.

I have linked this PR to "SSTORE - Fill block with SSTORE(0 → 1) to maximize new storage slot creation"; please let me know if this does not fit the category.

Also, it would be great if you could review whether anything is missing or wrong in our issue tracker!

@gballet
Member Author

gballet commented Aug 21, 2025

I'll need to have a closer look, but it seems fine as a first pass. Do you know what the problem is with moving my file to benchmarks?

@LouisTsai-Csie
Collaborator

LouisTsai-Csie commented Aug 21, 2025

Our documentation is incomplete (I will fix it ASAP). To run the test, you will need to add the flag -m benchmark to pick up the tests under the benchmark/ folder. By default, these tests are ignored to avoid overhead in the CI/release process.

This is the command in our documentation:

fill -v tests/benchmark/test_worst_blocks.py::test_block_full_of_ether_transfers --fork Osaka

But I would add a few flags to run it:

uv run fill -v tests/benchmark/test_worst_blocks.py::test_block_full_of_ether_transfers --fork Osaka -m benchmark --clean
  • uv run: we use uv as the package manager
  • -m benchmark: we need this flag or benchmark tests will be ignored by default
  • --clean: you will need this if you have already filled tests before.

Please let me know if there is anything unclear to you!

)
from ethereum_test_tools.vm.opcode import Opcodes as Op

REFERENCE_SPEC_GIT_PATH = "DUMMY/eip-DUMMY.md"
Member Author

@fselmo @LouisTsai-Csie this is not related to an EIP, do I need to keep it?

Collaborator

You don't need this; see other benchmark tests under the benchmark/ dir for inspiration. Also, if there is no EIP, let's fix the docstring at the very top of the file to give a better description of what the test is, rather than using EIP-8047.

Member Author

@gballet gballet Aug 28, 2025

Suggested change
REFERENCE_SPEC_GIT_PATH = "DUMMY/eip-DUMMY.md"
REFERENCE_SPEC_GIT_PATH = "TODO"

@gballet gballet marked this pull request as ready for review August 27, 2025 08:53
Collaborator

@fselmo fselmo left a comment

Hey @gballet. I did a first pass at this, strictly just the setup. I didn't dig into the actual test case to make sure it's doing what we want it to do. I am going to take a deeper look at the logic.

@pytest.mark.valid_from("Prague")
@pytest.mark.parametrize("final_storage_value", [0x02 << 248, 0x02])
def test_bloatnet(
blockchain_test: BlockchainTestFiller, pre: Alloc, fork: Fork, final_storage_value: int
Collaborator

@fselmo fselmo Aug 27, 2025

As @LouisTsai-Csie mentioned, I think what we want for the benchmark gas limit is the gas_benchmark_value fixture.

@pytest.mark.valid_from("Prague")
@pytest.mark.parametrize("final_storage_value", [0x02 << 248, 0x02])
def test_bloatnet(
    blockchain_test: BlockchainTestFiller,
    pre: Alloc,
    fork: Fork,
    gas_benchmark_value: int,
    final_storage_value: int,
):
    ...

@LouisTsai-Csie would know better here what's more appropriate but I do see this in other benchmark tests
cc: @marioevz, does that seem right?

Member Author

The spec says "fill the block", so this is what I'm doing. It's not clear to me what that value is.

Collaborator

Reposted from Mattermost.

There are two variables/terms that can be confusing in the context of benchmarking:

  • Environment().gas_limit
  • gas_benchmark_value

In blockchain_test benchmarking, our original idea is to fill a block up to the block gas limit using a specific opcode/precompile/operation. But the question is: what should we use as the gas limit in our framework?

You might think it's Environment().gas_limit, but that's not correct. We use a small workaround here: instead of consuming all the gas in the block, we define a value called gas_benchmark_value and only consume gas up to this amount for benchmarking purposes.

When running tests with -m benchmark, you’ll see two fixtures:

  • genesis_environment
  • env

In both, you can see the gas_limit is set to 1_000_000_000_000. This is a fixed value and not our benchmark target (e.g., 45M, 60M, 90M). The actual benchmark value is gas_benchmark_value, which we pass via the flag --gas-benchmark-values 10M,45M,60M. In this case, it will create three different tests, each with its respective gas_benchmark_value, and that value should then be used as the effective gas limit in your test.

Again, we’re not literally consuming all the gas in the block. This is a workaround for how the genesis file is generated:

  • The genesis file always sets gasLimit to 1_000_000_000_000 during benchmarking.
  • The gasUsed field is set to the chosen gas_benchmark_value.

This approach lets us use a single genesis file for the Nethermind team. To change the benchmark gas value, we only need to update gasUsed rather than regenerate a new genesis file for every configuration.

gas_increment = gas_costs.G_VERY_LOW * 2 + gas_costs.G_STORAGE_SET + gas_costs.G_COLD_SLOAD
sstore_code = Op.PUSH0 + Op.CALLDATALOAD
storage_slot: int = 0
while totalgas + gas_increment < Environment().gas_limit:
Collaborator

Suggested change
while totalgas + gas_increment < Environment().gas_limit:
while totalgas + gas_increment < gas_benchmark_value:

sstore_code = sstore_code + Op.POP # Drop last value on the stack

sender = pre.fund_eoa()
print(sender)
Collaborator

remove print

Suggested change
print(sender)

@fselmo
Collaborator

fselmo commented Aug 28, 2025

I just wanted to add a bit more context on the gas_benchmark_value. This allows us to run something like:

uv run fill --fork=Prague -m benchmark --gas-benchmark-values 1,10,30,45,100,150 --clean -k bloatnet

This allows us to test against the different gas limit values specified for the block, not the transaction gas cap (1 = 1 Mgas). I am going to look a bit deeper into the PR next, but wanted to provide some better context.

"""
A test that calls a contract with many SSTOREs.
The first block will have many SSTORES that go from 0 -> 1
Collaborator

Come to think of it, I think we want two tests, based on the doc here?

If so:

  1. A test that maximizes cold SSTORE (0 -> 1) up to the gas_benchmark_value for a single block
  2. A test that maximizes SSTORE (1 -> 2) up to the gas_benchmark_value for a single block

I think it may also be useful to mark these valid from Osaka and take fork.transaction_gas_limit_cap() into account. We can fill in as many SSTOREs as possible, splitting into multiple transactions as we hit the tx gas limit cap on each, until we blow past the block gas_benchmark_value, then subtract an SSTORE operation, calculate our gas used, and set it as the expected_benchmark_gas_used (as is done here); a rough sketch of this splitting is included below.

Let me know if this makes sense.
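For illustration, a rough sketch of that splitting, assuming the same fixtures and imports discussed above (helper names like sstores_per_tx are illustrative, not from the PR, and code-size limits are ignored here):

gas_costs = fork.gas_costs()
intrinsic_cost = fork.transaction_intrinsic_cost_calculator()()
tx_gas_cap = fork.transaction_gas_limit_cap()

# Gas burned by one cold SSTORE(0 -> 1): two pushes plus the store itself.
gas_per_sstore = gas_costs.G_VERY_LOW * 2 + gas_costs.G_STORAGE_SET + gas_costs.G_COLD_SLOAD

# How many SSTOREs fit in one transaction under the tx gas cap,
# and how many such transactions fit under the block benchmark budget.
sstores_per_tx = (tx_gas_cap - intrinsic_cost) // gas_per_sstore
tx_count = max(gas_benchmark_value // tx_gas_cap, 1)

txs = []
for tx_index in range(tx_count):
    start_slot = tx_index * sstores_per_tx
    code = sum(Op.SSTORE(start_slot + i, 1) for i in range(sstores_per_tx))
    txs.append(
        Transaction(
            to=pre.deploy_contract(code=code),
            gas_limit=tx_gas_cap,
            sender=pre.fund_eoa(),
        )
    )

expected_benchmark_gas_used = tx_count * (intrinsic_cost + sstores_per_tx * gas_per_sstore)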

Collaborator

@gballet I iterated on this quite a bit; I wanted to get familiar with the benchmark tests. If this isn't the route, no worries, but I put up a patch for what I was thinking against your branch here. Let me know; I only touched the SSTORE tests, and I also realized this wasn't very straightforward to set up with the tx gas cap.

We can run those tests with uv run fill --fork=Osaka -m benchmark --gas-benchmark-values 1,10,30,45,60,100,150 --clean -k bloatnet_sstore, for example, which I think is the preferred way to run benchmark tests, but @LouisTsai-Csie would know best and I believe he will be taking a look here soon.

Hope I helped in some way 😅.

Collaborator

Oh, also... If you do think that patch makes sense and you merge it, you can ignore quite a few of the comments I made here as I implemented those and got the CI passing as well.

# with them. Only fill the block by a factor of SPEEDUP.
SPEEDUP: int = 100


Collaborator

@jsign jsign Aug 31, 2025

We have already covered cold and warm SSTORE/SLOAD with same and different values.

Are these tests different in some way?

Collaborator

@fselmo fselmo Aug 31, 2025

Hmm yeah, good observation. The biggest question, I suppose, is whether those worst-case scenarios are the same as the ones in the bloatnet doc here. If they are, maybe creating a marker on the existing relevant tests, so we don't have to redefine them, would be a good approach. Then, instead of trying to run only the tests in this file, we could use a marker like -m bloatnet; if we mark those existing, relevant, parametrized cases with it, they get included in the set of tests you want to run. We could then mark this whole file (test_bloatnet.py) with the appropriate marker so that you get all the test cases relevant to you in one command, whether they are defined here or elsewhere.

Just a thought.

Member Author

Let's not conflate two different things; the objectives are clearly different:

  • BloatNet looks for the performance of regular execution
  • the zk benchmarks are benchmarking zkevms, a widely different environment

Adding extra coupling here would cause more work for no benefit, since these tests are maintained by different sets of people.

Regarding the worst case, the tests are doing different things, since the code itself is different. The goal of the bloatnet test is to measure the sole performance of SSTORE in the client, whereas when I read the zkVM-specific code, it is doing extra stuff like jumps. That's normal: you wouldn't be able to load as much code as in our test inside a zkVM. But we can.

Collaborator

@jsign jsign Sep 1, 2025

@gballet, the tests in benchmark are not specific to zkVMs (they are even used for PerfNet).

The tests I linked execute blocks where the full gas limit is used to do cold or warm reads, or to write to existing or non-existing storage slots. There's nothing specific to zkVMs there, which is why I ask how these tests are different -- mainly to avoid duplication, or to better explain which variant is being benchmarked.
