Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Unitrace] too many very long calls to zeCommandListAppendMemory fill on Lunar Lake #86

Open
s-Nick opened this issue Mar 21, 2025 · 1 comment

Comments

@s-Nick
Copy link

s-Nick commented Mar 21, 2025

Description

I am trying to profile/trace an application that uses GPU on Lunar Lake. The trace I obtain is unreadable due to very long zeCommandListAppendMemoryFill, they look like they are taking 10 minutes, which is way more than the whole application takes. This doesn't happen on Intel ARC B580

An example is:

Image

Environment

>cat /etc/issue
Ubuntu 24.10 

>dpkg -l | grep intel
ii  intel-fw-gpu                                  2024.37.5-362~22.04                      all          Firmware package for Intel integrated and discrete GPUs\
ii  intel-gpu-tools                               1.29-1                                   amd64        tools for debugging the Intel graphics driver
ii  intel-igc-cm                                  1.0.225.54083-1077~24.04                 amd64        Intel(R) C for Metal Compiler -- CM Frontend lib
ii  intel-media-va-driver-non-free:amd64          25.1.0-0ubuntu1~ppa1                     amd64        VAAPI driver for the Intel GEN8+ Graphics family
ii  intel-metrics-discovery                       1.13.179-1077~24.04                      amd64        Intel(R) Metrics Discovery Application Programming Interface --
ii  intel-metrics-library                         1.0.182-1077~24.04                       amd64        Intel(R) Metrics Library for MDAPI (Intel(R) Metrics Discovery
ii  intel-microcode                               3.20250211.0ubuntu0.24.10.1              amd64        Processor microcode firmware for Intel CPUs
ii  intel-ocloc                                   24.52.32224.14-1077~24.04                amd64        Tool for managing Intel Compute GPU device binary format
ii  intel-opencl-icd                              24.52.32224.14-1077~24.04                amd64        Intel graphics compute runtime for OpenCL
ii  libchewing3:amd64                             0.9.0-1                                  amd64        intelligent phonetic input method library
ii  libchewing3-data                              0.9.0-1                                  all          intelligent phonetic input method library - data files
ii  libdrm-intel1:amd64                           2.4.122-1                                amd64        Userspace interface to intel-specific kernel DRM services -- runtime
ii  libze-intel-gpu1                              24.52.32224.14-1077~24.04                amd64        Intel(R) Graphics Compute Runtime for oneAPI Level Zero.
ii  xserver-xorg-video-intel                      2:2.99.917+git20210115-1build1           amd64        X.Org X server -- Intel i8xx, i9xx display driver

>  sycl-ls
[level_zero:gpu][level_zero:0] Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Arc(TM) Graphics 20.4.4 [1.6.32224+14]
@zma2
Copy link
Contributor

zma2 commented Mar 25, 2025

@s-Nick Thank you for reporting this issue. We are triaging it now,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants
@zma2 @s-Nick and others