Position embedding diff when loading model in different method #271

yangjiabupt · 2025-05-12T09:13:09Z

model = Qwen2_5OmniForConditionalGeneration.from_pretrained("Qwen/Qwen2.5-Omni-7B", torch_dtype="auto", device_map="auto")

When enable torch_dtype="auto"

the audio tower dtype is float32

disable torch_dtype="auto"

the audio tower dtype is also float32

but the value diff

I think the first got wrong

fixes QwenLM/Qwen2.5-Omni#271

BakerBunker · 2025-05-15T12:02:31Z

Will be fixed at huggingface/transformers#38151

BakerBunker added a commit to BakerBunker/transformers that referenced this issue May 15, 2025

Fix Qwen2.5 Omni SinusoidsPositionEmbedding precision

f16be1e

fixes QwenLM/Qwen2.5-Omni#271

BakerBunker linked a pull request May 15, 2025 that will close this issue

Fix Qwen2.5 Omni SinusoidsPositionEmbedding precision huggingface/transformers#38151

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Position embedding diff when loading model in different method #271

Position embedding diff when loading model in different method #271

yangjiabupt commented May 12, 2025

BakerBunker commented May 15, 2025

Position embedding diff when loading model in different method #271

Position embedding diff when loading model in different method #271

Comments

yangjiabupt commented May 12, 2025

BakerBunker commented May 15, 2025