We are using Qwen2.5-14B-Instruct with vLLM. However, we found that the following things can change the output, even when we set temperature=0, top_p=1, seed=42:

- vllm serve produces different output than vLLM offline inference, even with the same chat_template (see the sketch below)
- vllm serve with a different number of GPUs
- different vLLM versions
- running on H100 vs. H200

This is strange. Can someone explain why this happens, and how we can keep the output fixed when changing inference environments?
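For context, here is a minimal sketch of the two call paths we are comparing. The prompt, port, and max_tokens are placeholders; the server side assumes a running `vllm serve Qwen/Qwen2.5-14B-Instruct` on the default port.

```python
# Sketch only: compares offline LLM.chat() against the OpenAI-compatible server.
from openai import OpenAI
from vllm import LLM, SamplingParams

MODEL = "Qwen/Qwen2.5-14B-Instruct"
MESSAGES = [{"role": "user", "content": "Explain KV caching in one paragraph."}]  # placeholder prompt

# 1) Offline inference: greedy decoding with a fixed seed.
llm = LLM(model=MODEL)
params = SamplingParams(temperature=0, top_p=1, seed=42, max_tokens=256)
offline_out = llm.chat(MESSAGES, params)  # applies the model's chat_template
print(offline_out[0].outputs[0].text)

# 2) Online inference against `vllm serve Qwen/Qwen2.5-14B-Instruct` (assumed on localhost:8000).
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
online_out = client.chat.completions.create(
    model=MODEL,
    messages=MESSAGES,
    temperature=0,
    top_p=1,
    seed=42,
    max_tokens=256,
)
print(online_out.choices[0].message.content)
```

Even with identical sampling parameters like these, the two paths (and different GPU counts, vLLM versions, or GPU models) give different completions.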