
[PyTorch] Refactor activation offloading of quantized tensors. #1738


Open · wants to merge 17 commits into main from quantized_tensor_offloading

Conversation

@pggPL (Collaborator) commented Apr 30, 2025

Description

The activation offloading code contains complex logic for handling Float8Tensor objects: it disassembles each object into its component data tensors, offloads those tensors separately, and then reassembles the object afterwards.

I add an empty_like(..., device=..., pin_memory=...) method to Float8Tensor, which allows a matching (pinned) CPU backup tensor to be allocated in a single call. This makes the offloading code much simpler.
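The idea can be sketched with a toy stand-in. Note this is a hypothetical illustration, not Transformer Engine's actual API: `MockFloat8Tensor`, `offload`, and `reload` are invented names, and plain Python lists stand in for the component tensors. The point is that an `empty_like`-style constructor builds a backup with the same structure on the target device, so offloading becomes an allocate-and-copy instead of a disassemble/reassemble dance.

```python
class MockFloat8Tensor:
    """Stand-in for a quantized tensor: component buffers plus device metadata."""

    def __init__(self, data, scale_inv, device="cuda", pin_memory=False):
        self.data = data              # quantized payload (stand-in: list of ints)
        self.scale_inv = scale_inv    # dequantization scale (stand-in: float)
        self.device = device
        self.pin_memory = pin_memory

    @classmethod
    def empty_like(cls, other, device, pin_memory=False):
        # Allocate buffers with the same shape/metadata as `other`, but on
        # the requested device -- the new method the PR describes, which lets
        # offloading skip disassembling the object into separate tensors.
        return cls([0] * len(other.data), 0.0, device=device, pin_memory=pin_memory)

    def copy_(self, src):
        # In-place copy of all component buffers, analogous to torch's copy_.
        self.data[:] = src.data
        self.scale_inv = src.scale_inv
        return self


def offload(t):
    """Back up `t` to pinned CPU memory with a single empty_like + copy."""
    backup = MockFloat8Tensor.empty_like(t, device="cpu", pin_memory=True)
    backup.copy_(t)
    return backup


def reload(backup, original):
    """Restore the device tensor in place from its CPU backup."""
    return original.copy_(backup)


gpu = MockFloat8Tensor([7, 8, 9], scale_inv=0.25)
cpu_backup = offload(gpu)
assert cpu_backup.device == "cpu" and cpu_backup.pin_memory
assert cpu_backup.data == [7, 8, 9] and cpu_backup.scale_inv == 0.25
```

The same allocate-and-copy pattern then works uniformly for plain tensors and quantized tensors, since each type knows how to produce an empty CPU clone of itself.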

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Changes

Please list the changes introduced in this PR:

  • Change A
  • Change B

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

pggPL added 2 commits April 30, 2025 13:47
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
@pggPL pggPL force-pushed the quantized_tensor_offloading branch from bbf24cd to 657cbbe Compare April 30, 2025 14:57
pre-commit-ci bot and others added 5 commits April 30, 2025 15:00
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
@pggPL pggPL marked this pull request as ready for review April 30, 2025 16:43
@pggPL (Collaborator, Author) commented Apr 30, 2025

/te-ci pytorch

pggPL added 2 commits May 5, 2025 09:59
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
@pggPL (Collaborator, Author) commented May 5, 2025

/te-ci pytorch

pre-commit-ci bot and others added 5 commits May 5, 2025 10:19
Signed-off-by: Pawel Gadzinski <pawelgadzinski@gmail.com>
Signed-off-by: Pawel Gadzinski <pawelgadzinski@gmail.com>
@pggPL (Collaborator, Author) commented May 8, 2025

/te-ci pytorch

@pggPL pggPL force-pushed the quantized_tensor_offloading branch from 32f982b to 4293d32 Compare May 8, 2025 16:47
@pggPL (Collaborator, Author) commented May 8, 2025

/te-ci pytorch

Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
@pggPL pggPL force-pushed the quantized_tensor_offloading branch from 7a6b62d to da1bbf9 Compare May 9, 2025 11:56
@pggPL (Collaborator, Author) commented May 9, 2025

/te-ci pytorch

1 similar comment
@pggPL (Collaborator, Author) commented May 9, 2025

/te-ci pytorch

Labels: none yet
Projects: none yet
1 participant