
Run all benchmarks on merge to main branch #15511

Open
Omega359 opened this issue Mar 31, 2025 · 11 comments
Labels
enhancement New feature or request

@Omega359
Contributor

Is your feature request related to a problem or challenge?

There have been a number of issues where benchmarks stopped working and no one noticed until someone happened to try running them. This should be resolved. I suggest we do it in the GitHub extended tests, where we run all the benchmarks whenever a PR is merged into main, just to verify they actually run.

Describe the solution you'd like

Add a step that runs the benchmarks to the DataFusion extended tests GitHub job.

Describe alternatives you've considered

No response

Additional context

No response

@alamb
Contributor

alamb commented Mar 31, 2025

I would personally suggest that if we are going to run the benchmarks, we should also be actively tracking them and making sure they are measuring something useful.

@Omega359
Contributor Author

Sure. #5504

@alamb
Contributor

alamb commented Apr 1, 2025

I see what you are saying now: #15500 (comment)

I now agree it would be good to test these somehow

@jayzhan211
Contributor

> There has been a number of issues where benchmarks stopped working and no one noticed until someone happened to try and run them

Instead of running the benchmarks, how about adding the benchmark queries as tests? I don't think we need to actually "benchmark" the code on each merge.

@Shreyaskr1409
Contributor

Shreyaskr1409 commented Apr 1, 2025

> I don't think we need to actually "benchmark" the code for each merge.

The issue #5504 would require all benchmarks to run after each merge. I think we could just add the benchmarks directly for now. What do you think?

I am willing to work on this.

@Shreyaskr1409
Contributor

take

@Omega359
Contributor Author

Omega359 commented Apr 1, 2025

> There has been a number of issues where benchmarks stopped working and no one noticed until someone happened to try and run them

> Instead of running the benchmark, how about adding those benchmark query to tests, I don't think we need to actually "benchmark" the code for each merge.

We could do that too; however, not all benchmarks are SQL queries.

@jayzhan211
Contributor

> I don't think we need to actually "benchmark" the code for each merge.
>
> The issue #5504 would require all benchmarks to run after each merge. I think we could just add benchmarks directly for now. What do you think?
>
> I am willing to work on this.

I don't think we need to run the benchmarks on CI; at the very least it should be optional and disabled by default.

> We could do that too however not all benchmarks are sql queries.

I agree that maintaining it is challenging. Adding it to the extended test suite and running it on every merge isn't a viable solution, as it's costly and often unnecessary. I don't think keeping every benchmark functional is essential; they are more like scripts that we can modify as needed, depending on what we want to measure each time.

@Omega359
Contributor Author

Omega359 commented Apr 1, 2025

I disagree. If it's in the code base it should work, or it should be marked as 'experimental', IMHO. No one wants to have to fix a benchmark whenever they happen to want to check something.

@berkaysynnada
Contributor

> Instead of running the benchmark, how about adding those benchmark query to tests, I don't think we need to actually "benchmark" the code for each merge.

Keeping all benchmark coverage duplicated in tests is challenging (as @Omega359 said, not all benchmarks are SQL queries) and very prone to going out of sync.

> Adding it to the extended test suite and running it on every merge isn't a viable solution, as it's costly and often unnecessary

You are also right about the cost, but what if we had two modes for benchmarks: one for actual benchmarking, and one just for validation? In validation mode, it would run with very low loads, e.g. a minimal sample count, minimal batch sizes, etc.

cargo test (or the extended tests) would run the validation mode, and normal benchmarking would use the standard mode.
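The two-mode idea could be sketched roughly like this (the `BENCH_MODE` env var, names, and numbers are hypothetical illustrations, not existing DataFusion knobs; real DataFusion benchmarks are Criterion-based and would wire this in differently):

```rust
use std::env;

/// Hypothetical configuration knobs a benchmark might expose.
#[derive(Debug, PartialEq)]
struct BenchConfig {
    sample_count: usize,
    batch_size: usize,
}

/// Pick very low loads in validation mode, full loads otherwise.
fn bench_config(mode: Option<&str>) -> BenchConfig {
    match mode {
        // Validation: run the benchmark body once with a tiny batch,
        // just enough to catch compile errors and runtime panics.
        Some("validate") => BenchConfig { sample_count: 1, batch_size: 64 },
        // Standard: loads sized for an actual measurement.
        _ => BenchConfig { sample_count: 100, batch_size: 8192 },
    }
}

fn main() {
    let mode = env::var("BENCH_MODE").ok();
    let cfg = bench_config(mode.as_deref());
    println!("running with {cfg:?}");
}
```

The extended-tests job would then export the validation mode before invoking the benchmarks, while local benchmarking runs untouched.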

@jayzhan211
Contributor

jayzhan211 commented Apr 4, 2025

> You are also right here about the cost, but what if we can have 2 modes for benchmarks, one for the actual benchmarking purpose, and one with just to validate. If it is in validation mode, it will work with very low loads -- e.g. min sampling count, min batch sizes etc.

This is what I suggested, so I agree with it:

> how about adding those benchmark query to tests

We mirror the benchmark code as a test, so we run the test and make sure it works, but we don't actually run the benchmark.

If it is a SQL query, we can add it to the sqllogictest crate; if it is not, we add it as a Rust test.
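A minimal sketch of the mirroring idea (the function names and the computation are hypothetical stand-ins, not actual DataFusion code): the benchmark and the test share one body, so a merge that breaks the benchmark fails `cargo test` without any timing being done.

```rust
/// Hypothetical shared body: the work a Criterion benchmark would time.
/// A stand-in computation here; a real one would execute a query.
fn benchmark_body(input: &[i64]) -> i64 {
    input.iter().copied().filter(|v| v % 2 == 0).sum()
}

/// The mirrored test runs the body exactly once and sanity-checks the
/// result, so it stays cheap enough for CI.
fn validate_benchmark() -> i64 {
    let input: Vec<i64> = (1..=10).collect();
    benchmark_body(&input)
}

fn main() {
    println!("benchmark body result: {}", validate_benchmark());
}
```

The drawback discussed above still applies: the mirror must be kept in sync with the benchmark by hand unless both literally call the same function, which is the safer design.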

5 participants