removed waits around enqueue_primitive in verbose mode #2973

lslusarczyk · 2025-03-28T15:12:49Z

Description

When running llama.cpp with SYCL Graph feature enabled by GGML_SYCL_GRAPH variable and with verbose logging on oneDNN, the wait call is executed on the sycl stream. This is illegal when recording graph, and wait cannot be called for a queue which is recording to a command graph exception is thrown by this line in llvm compiler.

I'd like to solve this wait problem in order to allow testing llama.cpp with SYCL and with oneDNN verbose logs, as active work on enabling fast graphs in llama.cpp is currently in progress.

What is the best solution

The simplest solution is just remove wait calls, as proposed in this PR, but in this case measurement of kernel execution time will be wrongly calculated.

Other solutions that I see are:

Use "before" and "after" host tasks and log time from the "after" task.
Use "before" and "after" regular tasks and print time directly from kernel code in the "after" task using SYCL stream.
Use SYCL timers with its get_profiling_info function. This measures just kernel execution time, not data transfers and kernel launch time.
Remove time measurements, as this can be observed by calculated by profiling tools

Is there a perfect solution to this problem already designed, or is there any other solution possible, that avoids waits but still computes and logs kernel execution time correctly?

karturov · 2025-03-28T17:16:59Z

This issue will be resolved under MFDNN-12088 (internal tracker), and a separate PR will be created to address it.

removed waits around enqueue_primitive in verbose mode

91fb7cc

karturov closed this Mar 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

removed waits around enqueue_primitive in verbose mode #2973

removed waits around enqueue_primitive in verbose mode #2973

lslusarczyk commented Mar 28, 2025

karturov commented Mar 28, 2025

removed waits around enqueue_primitive in verbose mode #2973

removed waits around enqueue_primitive in verbose mode #2973

Conversation

lslusarczyk commented Mar 28, 2025

Description

What is the best solution

karturov commented Mar 28, 2025