Hi, as the title says: how much GPU memory (VRAM) does fine-tuning like this require?
I get an OOM (out of memory) error on a 4070 Ti with 12 GB.
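I have 6 GB of VRAM and adjusted the loading parameters:

    import torch
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(
        "deepseek-ai/deepseek-r1",
        torch_dtype=torch.float16,   # load weights in half precision
        device_map="auto",           # let Accelerate place layers on GPU/CPU/disk
        low_cpu_mem_usage=True,      # avoid materializing a full extra copy in RAM
        offload_folder="offload",    # temporary directory for offloaded weights
    )

Training only runs with this low-memory mode enabled, but I still get an OOM error at the prediction step.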
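For anyone trying to size a GPU for this, a rough back-of-the-envelope estimate may help. The sketch below assumes full fine-tuning with the Adam optimizer in mixed precision, which is commonly budgeted at about 16 bytes per parameter (2 for fp16 weights, 2 for fp16 gradients, 12 for the fp32 master weights and the two Adam moments). Activations, batch size, and sequence length are not included, so real usage is higher. The function names and the example parameter counts are hypothetical and are not taken from this repository.

```python
GIB = 1024 ** 3  # bytes per GiB


def estimate_training_vram_gib(num_params: float, bytes_per_param: float = 16.0) -> float:
    """Model-state memory for full fine-tuning (weights + gradients + Adam state).

    Assumes mixed-precision Adam at ~16 bytes/parameter; activations are excluded.
    """
    return num_params * bytes_per_param / GIB


def estimate_weights_only_gib(num_params: float, bytes_per_weight: float = 2.0) -> float:
    """Memory needed just to hold the weights (fp16 by default), e.g. for inference."""
    return num_params * bytes_per_weight / GIB


def fits_on_gpu(required_gib: float, gpu_gib: float) -> bool:
    """True if the estimated requirement is within the GPU's capacity."""
    return required_gib <= gpu_gib


if __name__ == "__main__":
    # Hypothetical model sizes, for illustration only.
    for name, n in [("1.5B", 1.5e9), ("7B", 7e9)]:
        train = estimate_training_vram_gib(n)
        infer = estimate_weights_only_gib(n)
        print(f"{name}: ~{train:.1f} GiB model state for full fine-tuning, "
              f"~{infer:.1f} GiB for fp16 weights alone; "
              f"fits in 12 GiB for training: {fits_on_gpu(train, 12)}")
```

Under these assumptions, even a model in the low billions of parameters exceeds 6 GB or 12 GB for full fine-tuning, which is consistent with the OOM reports above. Parameter-efficient methods such as LoRA, or quantized loading, reduce the gradient and optimizer-state share of this budget.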