attention mask fixes #301

ahmadki · 2025-04-30T15:08:03Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Issues

List issues that this PR closes (syntax):

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

terrykong

@ahmadki can you rebase? I think the dtensor attention mask diff is already resolved top-of-tree. The fsdp1 makes sense. nice find

ahmadki · 2025-05-01T11:36:31Z

I rebased with main.

I don't think the issue with dtensor is resolved in main. attention_mask is created but never used in the train function. Instead, we use an all ones tensor instead here: https://github.com/NVIDIA/nemo-rl/blob/ebb46c3b936e6c31494dae3c7f0953bfeff006fb/nemo_rl/models/policy/dtensor_policy_worker.py#L316

I'm not sure if attention_mask should be removed completely, or it should be taken into account when calculating the logprobs here: https://github.com/NVIDIA/nemo-rl/blob/ebb46c3b936e6c31494dae3c7f0953bfeff006fb/nemo_rl/models/policy/dtensor_policy_worker.py#L332
Either ways, someone needs to take a look at it.

ahmadki added 2 commits April 30, 2025 18:07

removed unused attention mask code

4935b0e

another attention mask fix

cda4a4f

terrykong requested changes Apr 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

attention mask fixes #301

attention mask fixes #301

Uh oh!

ahmadki commented Apr 30, 2025

Uh oh!

terrykong left a comment

Uh oh!

ahmadki commented May 1, 2025

Uh oh!

Uh oh!

attention mask fixes #301

Are you sure you want to change the base?

attention mask fixes #301

Uh oh!

Conversation

ahmadki commented Apr 30, 2025

What does this PR do ?

Issues

Usage

Before your PR is "Ready for review"

Additional Information

Uh oh!

terrykong left a comment

Choose a reason for hiding this comment

Uh oh!

ahmadki commented May 1, 2025

Uh oh!

Uh oh!