[FORK][FEATURE] DQ IP: performance enhancements #272

Merged

Collaborator

@dmitry-gorokhov dmitry-gorokhov commented Jan 21, 2025

  • allocate auxiliary accumulator registers on the stack
  • precompute grouped source sums
  • optimize pointer arithmetic
  • reduce the number of auxiliary vector registers required by the microkernel
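
The PR does not spell out what "precompute grouped src sums" means in the kernel. One common reading, offered here only as an assumption, is the standard identity for weights with per-group zero points: `sum(s * (w - zp))` over a group equals `sum(s * w) - zp * sum(s)`, so the source sum of each group can be computed once and reused for every output channel. The sketch below uses hypothetical function names and plain Python integers; it illustrates the arithmetic, not the oneDNN implementation.

```python
def dot_naive(src, wei, zero_points, group_size):
    """Reference: subtract the group's zero point from every weight."""
    acc = 0
    for k, (s, w) in enumerate(zip(src, wei)):
        acc += s * (w - zero_points[k // group_size])
    return acc


def grouped_src_sums(src, group_size):
    """Sum the quantized source values of each group (done once per source row)."""
    return [sum(src[g:g + group_size]) for g in range(0, len(src), group_size)]


def dot_with_precomputed_sums(src, wei, zero_points, src_sums):
    """Same result as dot_naive, with zero points applied once per group."""
    acc = sum(s * w for s, w in zip(src, wei))
    acc -= sum(zp * ss for zp, ss in zip(zero_points, src_sums))
    return acc


# Hypothetical data: 8 quantized source values, 2 groups of 4.
src = [1, -2, 3, 4, 5, -6, 7, 8]
wei = [3, 1, 4, 1, 5, 9, 2, 6]
zero_points = [2, 7]
sums = grouped_src_sums(src, 4)
result = dot_with_precomputed_sums(src, wei, zero_points, sums)
```

Because `src_sums` depends only on the source row, its cost is amortized across all output channels that share that row, which is where a saving would come from under this reading.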

@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/dq_ip_accum_stack_allocation branch 3 times, most recently from 9960b67 to d421730 Compare January 24, 2025 08:54
@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/dq_ip_accum_stack_allocation branch from bc4e68a to b145489 Compare February 3, 2025 06:46
@dmitry-gorokhov dmitry-gorokhov changed the title from "[FORK][FIX] DQ IP: allocate aux accums via stack" to "FORK][FEATURE] DQ IP: performance enhansments" Feb 3, 2025
@dmitry-gorokhov dmitry-gorokhov changed the title from "FORK][FEATURE] DQ IP: performance enhansments" to "[FORK][FEATURE] DQ IP: performance enhansments" Feb 3, 2025
@dmitry-gorokhov dmitry-gorokhov force-pushed the feature/dq_ip_accum_stack_allocation branch from b145489 to 157fb81 Compare February 7, 2025 11:09
github-merge-queue bot pushed a commit to openvinotoolkit/openvino that referenced this pull request Feb 10, 2025
### Details:
 - Enables FC dynamic quantization on systems with the AVX2 ISA
 - Speedups for LLMs with asymmetric compression on LNL vs. master: 2x on average for prompt processing and 1.5x on average for the generation stage
 - oneDNN fork PR: openvinotoolkit/oneDNN#272
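
For readers unfamiliar with the term, dynamic quantization converts floating-point activations to low-precision integers at run time, with scales derived from the data itself. Neither the PR nor this commit message describes the scheme actually used, so the following is only a minimal sketch assuming symmetric int8 quantization with one scale per group; the function names and the `qmax=127` default are illustrative assumptions.

```python
def quantize_group(values, qmax=127):
    """Quantize one group symmetrically; the scale comes from the group's max magnitude."""
    amax = max(abs(v) for v in values)
    scale = amax / qmax if amax > 0 else 1.0
    # Round to the nearest integer and clamp into [-qmax, qmax].
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def dynamic_quantize(src, group_size):
    """Quantize a source row group by group; returns integer values and per-group scales."""
    quantized, scales = [], []
    for g in range(0, len(src), group_size):
        q, scale = quantize_group(src[g:g + group_size])
        quantized.extend(q)
        scales.append(scale)
    return quantized, scales


# Hypothetical activations: 2 groups of 4 values.
activations = [0.5, -1.0, 0.25, 0.0, 2.0, 4.0, -8.0, 1.0]
q_values, q_scales = dynamic_quantize(activations, 4)
```

Multiplying each integer by its group's scale recovers the original value to within half a quantization step.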

### Tickets:
 - [CVS-146064](https://jira.devtools.intel.com/browse/CVS-146064) (perf numbers)
 - [CVS-134037](https://jira.devtools.intel.com/browse/CVS-134037)
 - [CVS-134034](https://jira.devtools.intel.com/browse/CVS-134034)
@dmitry-gorokhov dmitry-gorokhov merged commit 157fb81 into v3.6_for_ie_master Feb 10, 2025
3 checks passed