Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

需要多大的显存训练chat-huanhuan #366

Open
OCEANOUXIN opened this issue Feb 20, 2025 · 2 comments
Open

需要多大的显存训练chat-huanhuan #366

OCEANOUXIN opened this issue Feb 20, 2025 · 2 comments

Comments

@OCEANOUXIN
Copy link

你好,rt,想知道这样微调需要多大的显存

@spldmanager
Copy link

4070ti 12g报oom

@shawn-happy
Copy link

我6g的显存,调整了参数,
model = AutoModelForCausalLM.from_pretrained(
"deepseek-ai/deepseek-r1",
torch_dtype=torch.float16,
device_map="auto",
low_cpu_mem_usage=True,
offload_folder="offload" # 指定临时卸载目录
)
使用了低内存模式才能训练,但是预估的时候还是报了oom

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants