[intel-npu] Support new internal `cached_model_buffer` config for memory mapped cached blobs #27822
Open
MirceaDan99 wants to merge 28 commits into openvinotoolkit:master from MirceaDan99:POC/add_caching_with_mmap_property
+156
−56
github-actions bot added labels on Nov 29, 2024:
category: inference (OpenVINO Runtime library - Inference)
category: Core (OpenVINO Core, aka ngraph)
category: CPU (OpenVINO CPU plugin)
category: NPU (OpenVINO NPU plugin)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch from ed220a4 to 94e33c4 (November 29, 2024 14:36)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch 3 times, most recently from 539c459 to 8cfb954 (December 11, 2024 09:34)
github-actions bot removed labels on Dec 11, 2024:
category: inference (OpenVINO Runtime library - Inference)
category: Core (OpenVINO Core, aka ngraph)
category: CPU (OpenVINO CPU plugin)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch 2 times, most recently from 1f163fa to 5a29e3c (December 11, 2024 10:11)
MirceaDan99 changed the title on Dec 11, 2024
from: [intel-npu] Support new internal `import_model` API for memory mapped cached blobs
to: [intel-npu] Support new internal `cached_model_buffer` config for memory mapped cached blobs
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch from 5a29e3c to 60a36c4 (December 11, 2024 12:12)
pereanub reviewed on Dec 11, 2024:
src/plugins/intel_npu/src/compiler_adapter/include/ze_graph_ext_wrappers.hpp (outdated, resolved)
src/plugins/intel_npu/src/common/include/intel_npu/common/igraph.hpp (outdated, resolved)
src/plugins/intel_npu/src/common/include/intel_npu/common/igraph.hpp (outdated, resolved)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch 6 times, most recently from 9859b15 to 99b823b (December 12, 2024 15:47)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch from 99b823b to 122788e (January 13, 2025 13:24)
Signed-off-by: Alexandru Enache <alexandru.enache@intel.com>
Signed-off-by: Alexandru Enache <alexandru.enache@intel.com>
Signed-off-by: Alexandru Enache <alexandru.enache@intel.com>
…er`, so only `std::shared_ptr<ov::AlignedBuffer>` parameters will be used
…of OV cache header
…or `ov::AlignedBuffer`
…ffer` (no release for mmap shared object)
…unusedStream` when `model_buffer` is given
…>` and `std::shared_ptr<ov::AlignedBuffer>` blob types
…pped buffers
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch from ab9eff2 to fcd4a45 (January 15, 2025 13:55)
MirceaDan99 force-pushed the POC/add_caching_with_mmap_property branch from fcd4a45 to 49e880b (January 15, 2025 13:58)
Details:
- New plugins property to pass the mmap buffer, from PR "New plugins property to pass mmap buffer" #27981 (replacing the new `import_model` API from PR "New caching api" #27644)
- `BlobContainer` class for `IGraph` objects, which may be derived as `BlobContainerAlignedBuffer` for the new `import_model` API and as `BlobContainerVector` for the old one
- `getGraphHandle` function to allow passing `const uint8_t` and `size_t` params instead of `std::vector<uint8_t>`

Tickets:
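The `BlobContainer` split described under Details could be sketched roughly as follows. This is a minimal illustration and not the code from this PR: `MappedBufferView` is a hypothetical stand-in for `ov::AlignedBuffer`, and the method names `get_ptr`, `size`, `release_from_memory`, and the helper `byte_sum` are assumptions made for the example. The point is that a consumer which takes a pointer and a size works the same way for an owned vector and for a memory-mapped buffer, and that only the vector-backed container frees its storage (one commit message above notes "no release for mmap shared object").

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for ov::AlignedBuffer: a non-owning view over bytes,
// e.g. a memory-mapped cache file whose lifetime is managed elsewhere.
class MappedBufferView {
public:
    MappedBufferView(const std::uint8_t* data, std::size_t size) : data_(data), size_(size) {}
    const std::uint8_t* data() const { return data_; }
    std::size_t size() const { return size_; }

private:
    const std::uint8_t* data_;
    std::size_t size_;
};

// Common interface (names are assumptions): callers see only pointer and size.
class BlobContainer {
public:
    virtual ~BlobContainer() = default;
    virtual const std::uint8_t* get_ptr() const = 0;
    virtual std::size_t size() const = 0;
    // Returns true only if the container actually freed its own storage.
    virtual bool release_from_memory() = 0;
};

// Old path: the blob is copied into a vector that the container owns.
class BlobContainerVector : public BlobContainer {
public:
    explicit BlobContainerVector(std::vector<std::uint8_t> blob) : blob_(std::move(blob)) {}
    const std::uint8_t* get_ptr() const override { return blob_.data(); }
    std::size_t size() const override { return blob_.size(); }
    bool release_from_memory() override {
        blob_.clear();
        blob_.shrink_to_fit();
        return true;
    }

private:
    std::vector<std::uint8_t> blob_;
};

// New path: the blob stays in a shared buffer and is never copied.
class BlobContainerAlignedBuffer : public BlobContainer {
public:
    explicit BlobContainerAlignedBuffer(std::shared_ptr<MappedBufferView> buffer)
        : buffer_(std::move(buffer)) {}
    const std::uint8_t* get_ptr() const override { return buffer_->data(); }
    std::size_t size() const override { return buffer_->size(); }
    // The mapping is shared, so this container does not release it.
    bool release_from_memory() override { return false; }

private:
    std::shared_ptr<MappedBufferView> buffer_;
};

// Hypothetical consumer with a pointer-plus-size signature, in the spirit of
// the getGraphHandle change; it just sums the bytes.
std::uint32_t byte_sum(const std::uint8_t* data, std::size_t size) {
    std::uint32_t sum = 0;
    for (std::size_t i = 0; i < size; ++i) {
        sum += data[i];
    }
    return sum;
}
```

With this shape, a caller would pass `container.get_ptr()` and `container.size()` to the consumer without needing to know which container type backs the blob.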