TorchFX: GPTQ accuracy fix #26294
Where is the test to check accuracy?
A test for GPTQ would require creating a specific pattern and checking that all the related passes are applied and that execution is optimized as expected. I am sharing the link to the ticket that tracks progress on GPTQ tests and additional improvements. However, I don't think it will be part of this PR, and I don't think it will be ready in the next 2-3 weeks due to other priorities.
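For illustration only, here is a minimal sketch of the shape such a test could take: build a small pattern, run a transformation pass over it, then check both that the pass was applied and that the output still matches the reference. Everything here is hypothetical (the toy list-based graph, `run`, `fuse_mul_add`, `check_pass`, and the tolerance); it does not use the OpenVINO or TorchFX APIs and is not the test tracked in the ticket.

```python
# Toy stand-in for a graph: a list of (op_name, constant) steps applied to a scalar.
# All names below are hypothetical and are not part of OpenVINO or TorchFX.

def run(graph, x):
    """Execute the toy graph on a scalar input."""
    for op, c in graph:
        if op == "mul":
            x = x * c
        elif op == "add":
            x = x + c
        elif op == "fused_mul_add":
            scale, bias = c
            x = x * scale + bias
        else:
            raise ValueError(f"unknown op: {op}")
    return x


def fuse_mul_add(graph):
    """Toy pass: replace each adjacent (mul, add) pair with one fused op."""
    out = []
    i = 0
    while i < len(graph):
        is_pair = (
            i + 1 < len(graph)
            and graph[i][0] == "mul"
            and graph[i + 1][0] == "add"
        )
        if is_pair:
            out.append(("fused_mul_add", (graph[i][1], graph[i + 1][1])))
            i += 2
        else:
            out.append(graph[i])
            i += 1
    return out


def check_pass(graph, inputs, tol=1e-6):
    """Return (pattern_applied, outputs_match) for the toy pass."""
    optimized = fuse_mul_add(graph)
    pattern_applied = any(op == "fused_mul_add" for op, _ in optimized)
    max_err = max(abs(run(graph, x) - run(optimized, x)) for x in inputs)
    return pattern_applied, max_err <= tol


if __name__ == "__main__":
    pattern = [("mul", 0.5), ("add", 1.0), ("mul", 2.0)]
    print(check_pass(pattern, [0.0, 1.5, -3.0]))
```

The two checks are kept separate on purpose: a pass that silently stops matching the pattern would still produce correct outputs, so an accuracy comparison alone would not catch it.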
gptq_test.py.txt
@cavusmustafa Please add the test here: https://github.com/openvinotoolkit/openvino/tree/master/tests/model_hub_tests/transformation_tests
I just pushed the test into this directory. Could you please review it?
@cavusmustafa Please add the test in here: You can refer to
@mvafin, I added the test to the ".yml" file. Please let me know if anything is missing. Thanks.
Test fails
### Details:
- Fix for the accuracy issues discovered in Llama2 GPTQ with aot_autograd

### Tickets:
- [CVS-149032](https://jira.devtools.intel.com/browse/CVS-149032)

---------

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>