llama.cpp and 7B Llama-2-Chat model: resource requirements, if possible #5172
Unanswered
christian-2 asked this question in Q&A
Replies: 0
I am new to this project and would like to try inference with llama.cpp and a 7B Llama-2-Chat model: is this combination currently supported, and what are the resource requirements? I have, for example, a VM with 8 vCPUs and 16 GB RAM, or (if need be) a bare-metal server with 56 CPUs and 500 GB RAM. The underlying hardware is fairly recent in both cases. I assume the bare-metal server would work in principle, but would the VM as well?
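
For a back-of-envelope answer, the dominant memory costs are the (possibly quantized) weights plus the KV cache. Below is a rough sketch of the arithmetic, not anything from llama.cpp itself: the effective bits-per-weight figures for the GGUF quantization types and the 10% runtime-buffer headroom are my own approximations, and actual file sizes vary slightly because of per-block quantization overhead and non-quantized tensors. llama.cpp also prints its actual model and buffer sizes when it loads a model, which is the authoritative number.

```python
# Back-of-envelope RAM estimate for Llama-2 7B Chat under llama.cpp (CPU inference).
# Bits-per-weight values are approximate averages for common GGUF quantizations.

N_PARAMS = 7_000_000_000               # nominal parameter count of Llama-2 7B
N_LAYER, N_EMBD, N_CTX = 32, 4096, 4096  # Llama-2 7B architecture, full context

# Approximate effective bits per weight (block overhead included) -- my estimates.
BITS_PER_WEIGHT = {"F16": 16.0, "Q8_0": 8.5, "Q5_K_M": 5.5, "Q4_0": 4.5}

# F16 KV cache: keys + values, per layer, per context position, 2 bytes each.
kv_gib = 2 * N_LAYER * N_CTX * N_EMBD * 2 / 2**30  # ~2.0 GiB at 4096 context

for name, bpw in BITS_PER_WEIGHT.items():
    weights_gib = N_PARAMS * bpw / 8 / 2**30
    total_gib = (weights_gib + kv_gib) * 1.10  # ~10% headroom (assumed)
    print(f"{name:7s} weights ~ {weights_gib:5.1f} GiB, total ~ {total_gib:5.1f} GiB")
```

By this estimate, a Q4_0 or Q5_K_M quantization (roughly 6-7 GiB total) fits comfortably in the 16 GB VM, while the full F16 weights (around 13 GiB plus cache and buffers) are tight there and better suited to the bare-metal server.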