TorchFX: GPTQ accuracy fix #26294

cavusmustafa · 2024-08-29T04:43:25Z

Details:

Fix for the accuracy issues discovered in Llama2 GPTQ with aot_autograd

Tickets:

CVS-149032

rkazants

Where is the test to check accuracy?

cavusmustafa · 2024-09-11T06:34:23Z

Where is the test to check accuracy?

Test for GPTQ will require to create a certain pattern and check if all the related passes are applied and the execution is optimized as expected. I am sharing the link to the ticket to track the progress on GPTQ tests and additional improvements. But I don't think it will be part of this PR and I don't think it will be ready in next 2-3 weeks due to other priorities.
https://jira.devtools.intel.com/browse/CVS-152181

cavusmustafa · 2024-10-01T22:12:28Z

gptq_test.py.txt
@rkazants, could you please help deciding where to add this test? Is this a good location to implement it?: https://github.com/openvinotoolkit/openvino/tree/master/tests/model_hub_tests/pytorch
I need to skip model conversion and run it as a regular pytorch model after torch.compile. Can I do it in this structure?
Attached is the standalone script to test GPTQ patterns in a HF model.

mvafin · 2024-10-02T09:08:13Z

@cavusmustafa Add the test here https://github.com/openvinotoolkit/openvino/tree/master/tests/model_hub_tests/transformation_tests
You will not be bound to existing workflow and can create your custom test.

cavusmustafa · 2024-10-03T01:16:48Z

@cavusmustafa Add the test here https://github.com/openvinotoolkit/openvino/tree/master/tests/model_hub_tests/transformation_tests You will not be bound to existing workflow and can create your custom test.

I just pushed into this directory. Can you review please?
@mvafin @rkazants

mvafin · 2024-10-07T09:16:22Z

@cavusmustafa Please add the test in here:
https://github.com/openvinotoolkit/openvino/blob/500284d6bdcf979b4c4eda6abfa0b45cfe8f0635/.github/workflows/job_pytorch_models_tests.yml

You can refer to PagedAttention Test as example

cavusmustafa · 2024-10-08T20:34:07Z

@cavusmustafa Please add the test in here: https://github.com/openvinotoolkit/openvino/blob/500284d6bdcf979b4c4eda6abfa0b45cfe8f0635/.github/workflows/job_pytorch_models_tests.yml

You can refer to PagedAttention Test as example

@mvafin, added the test into the ".yml" file. Please let me know if anything is missing. Thanks.

.github/workflows/job_pytorch_models_tests.yml

mvafin · 2024-10-16T10:56:40Z

Test fails

### Details: - Fix for the accuracy issues discovered in Llama2 GPTQ with aot_autograd ### Tickets: - [CVS-149032](https://jira.devtools.intel.com/browse/CVS-149032) --------- Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

github-actions bot added the category: PyTorch FE OpenVINO PyTorch Frontend label Aug 29, 2024

cavusmustafa added 3 commits August 28, 2024 21:55

TorchFX: GPTQ accuracy fix

0202c6c

Code formatting for gptq pass

6f36fd6

Removed unused variables in GPTQ pass

3e78526

cavusmustafa marked this pull request as ready for review August 29, 2024 05:57

cavusmustafa requested a review from a team as a code owner August 29, 2024 05:57

cavusmustafa requested review from slyalin and mmikolajcz August 29, 2024 05:57

cavusmustafa added the Code Freeze label Aug 29, 2024

cavusmustafa added this to the 2024.4 milestone Aug 29, 2024

cavusmustafa requested a review from suryasidd August 29, 2024 05:58

rkazants reviewed Aug 29, 2024

View reviewed changes

rkazants requested a review from mvafin August 29, 2024 07:08

rkazants added the pr: needs tests PR needs tests updating label Aug 29, 2024

ilya-lavrenov assigned mvafin Aug 29, 2024

mvafin approved these changes Aug 29, 2024

View reviewed changes

Merge branch 'master' into torchfx_gptq_accuracy_fix

5ef2dbf

moslex removed the Code Freeze label Sep 4, 2024

cavusmustafa added 4 commits September 11, 2024 14:52

Merge branch 'master' into torchfx_gptq_accuracy_fix

2f91a7e

Merge branch 'master' into torchfx_gptq_accuracy_fix

a252535

Merge branch 'master' into torchfx_gptq_accuracy_fix

461380a

Merge branch 'master' into torchfx_gptq_accuracy_fix

ddfc9ab

cavusmustafa requested a review from a team as a code owner October 3, 2024 00:46

github-actions bot added category: TF FE OpenVINO TensorFlow FrontEnd category: JAX FE OpenVINO JAX FrontEnd labels Oct 3, 2024

GPTQ transformation test added for TorchFX

eeb4000

mvafin approved these changes Oct 7, 2024

View reviewed changes

cavusmustafa requested a review from a team as a code owner October 8, 2024 20:27

github-actions bot added category: CI OpenVINO public CI github_actions Pull requests that update GitHub Actions code labels Oct 8, 2024

cavusmustafa and others added 5 commits October 8, 2024 13:47

GPTQ pattern test added into model test workflows

060afc3

Changed GPTQ test model to reduce memory utilization

2598b0e

Type casting fix for windows

4d5f498

Increased timeout for GPTQ model test

c2999a1

Increase job timeout

0c4f1e7

mvafin reviewed Oct 11, 2024

View reviewed changes

.github/workflows/job_pytorch_models_tests.yml Outdated Show resolved Hide resolved

Update .github/workflows/job_pytorch_models_tests.yml

5a84b1b

mvafin reviewed Oct 11, 2024

View reviewed changes

.github/workflows/job_pytorch_models_tests.yml Outdated Show resolved Hide resolved

Update .github/workflows/job_pytorch_models_tests.yml

06cfbe2

mvafin added 3 commits October 17, 2024 09:51

Update job_pytorch_models_tests.yml

f8f6ce9

Merge branch 'master' into torchfx_gptq_accuracy_fix

7c84f61

Update job_pytorch_models_tests.yml

fa9254f

mvafin approved these changes Oct 18, 2024

View reviewed changes

mvafin added this pull request to the merge queue Oct 18, 2024

Merged via the queue into openvinotoolkit:master with commit 43df0b6 Oct 18, 2024
160 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TorchFX: GPTQ accuracy fix #26294

TorchFX: GPTQ accuracy fix #26294

cavusmustafa commented Aug 29, 2024 •

edited

Loading

rkazants left a comment

cavusmustafa commented Sep 11, 2024

cavusmustafa commented Oct 1, 2024

mvafin commented Oct 2, 2024

cavusmustafa commented Oct 3, 2024 •

edited

Loading

mvafin commented Oct 7, 2024

cavusmustafa commented Oct 8, 2024

mvafin commented Oct 16, 2024

TorchFX: GPTQ accuracy fix #26294

TorchFX: GPTQ accuracy fix #26294

Conversation

cavusmustafa commented Aug 29, 2024 • edited Loading

Details:

Tickets:

rkazants left a comment

Choose a reason for hiding this comment

cavusmustafa commented Sep 11, 2024

cavusmustafa commented Oct 1, 2024

mvafin commented Oct 2, 2024

cavusmustafa commented Oct 3, 2024 • edited Loading

mvafin commented Oct 7, 2024

cavusmustafa commented Oct 8, 2024

mvafin commented Oct 16, 2024

cavusmustafa commented Aug 29, 2024 •

edited

Loading

cavusmustafa commented Oct 3, 2024 •

edited

Loading