Skip to content

Dramatic increase of the C++ dlls size. Why? #12267

Answered by zsogitbe
zsogitbe asked this question in Q&A
Discussion options

You must be logged in to vote

I believe I've identified the issue. I now have a smaller ggml-cuda.dll (51MB for the Windows Release with one architecture and 157MB with two architectures). The issue seems to stem from the -arch=native option. NVCC doesn't support this option, but it appears the code requires it for some reason. I had previously removed it, but I’ve now added it back. Additionally, the architecture(s) need to be defined manually (e.g., CMAKE_CUDA_ARCHITECTURES="61;89") while ensuring that -arch=native remains present in the CMake script.

Replies: 2 comments 6 replies

Comment options

You must be logged in to vote
6 replies
@ggerganov
Comment options

@zsogitbe
Comment options

@abc-nix
Comment options

@zsogitbe
Comment options

@abc-nix
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by zsogitbe
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants