Use llm export for refit #435

yfw · 2025-05-21T22:14:16Z

What does this PR do?

This allows us to use NeMo's llm export mapping and transforms when converting Megatron params to HF params for refit. The mapping and transforms in nemo_rl/models/megatron/converters/qwen2.py are currently copied from https://github.com/NVIDIA/NeMo/blob/c882b4885349b5a750147d242338ebcbc7058ef6/nemo/collections/llm/gpt/model/qwen2.py#L316-L358
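For readers unfamiliar with the mapping-plus-transforms idea, here is a rough sketch of how such a conversion can be structured. It is not the code in this PR or in NeMo. The parameter names in `NAME_MAPPING`, the fused gate/up layout, and the helpers `convert` and `split_fused_mlp` are assumptions chosen for illustration. Weights are plain Python lists of rows so the sketch runs without a GPU, whereas the real converter operates on tensors.

```python
import re

# Hypothetical one-to-one renames; "*" stands for the layer index.
# These names are illustrative, not a verified copy of the NeMo mapping.
NAME_MAPPING = {
    "embedding.word_embeddings.weight": "model.embed_tokens.weight",
    "decoder.layers.*.self_attention.linear_proj.weight": "model.layers.*.self_attn.o_proj.weight",
    "decoder.final_layernorm.weight": "model.norm.weight",
}

# Hypothetical one-to-many transform: a fused MLP weight assumed to hold
# the gate and up projections stacked along the first dimension.
FUSED_MLP_KEY = "decoder.layers.*.mlp.linear_fc1.weight"
FUSED_MLP_TARGETS = (
    "model.layers.*.mlp.gate_proj.weight",
    "model.layers.*.mlp.up_proj.weight",
)


def _match(pattern, name):
    """Return the wildcard captures if name matches pattern, else None."""
    regex = "^" + re.escape(pattern).replace(r"\*", r"(\d+)") + "$"
    m = re.match(regex, name)
    return m.groups() if m else None


def _fill(pattern, captures):
    """Substitute captured layer indices back into a target pattern."""
    out = pattern
    for c in captures:
        out = out.replace("*", c, 1)
    return out


def split_fused_mlp(rows):
    """Split a fused weight (list of rows) into two equal halves."""
    half = len(rows) // 2
    return rows[:half], rows[half:]


def convert(megatron_params):
    """Map a dict of Megatron-style params to HF-style names."""
    hf_params = {}
    for name, value in megatron_params.items():
        captures = _match(FUSED_MLP_KEY, name)
        if captures is not None:
            gate, up = split_fused_mlp(value)
            hf_params[_fill(FUSED_MLP_TARGETS[0], captures)] = gate
            hf_params[_fill(FUSED_MLP_TARGETS[1], captures)] = up
            continue
        for src, dst in NAME_MAPPING.items():
            captures = _match(src, name)
            if captures is not None:
                hf_params[_fill(dst, captures)] = value
                break
        else:
            raise KeyError(f"no mapping for parameter: {name}")
    return hf_params


if __name__ == "__main__":
    demo = {
        "embedding.word_embeddings.weight": [[0.0]],
        "decoder.layers.3.mlp.linear_fc1.weight": [[1], [2], [3], [4]],
    }
    for hf_name in convert(demo):
        print(hf_name)
```

Keeping the renames in a table and the shape-changing steps in small functions means support for another model family mostly amounts to supplying a different table and set of transforms.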

List issues that this PR closes (syntax):

# Add a code snippet demonstrating how to use this

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

yfw added 4 commits May 21, 2025 15:06

Use llm export for refit

462d10a

Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

Use transforms that keep tensors on gpu

1cf53b9

Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

Working for batched refit

327b840

Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

Llama (<4) support

c1ef8d1

Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>