Skip to content

Changing the number of experts with a Mixtral GGUF? #5114

Answered by supportend
araleza asked this question in Q&A
Discussion options

You must be logged in to vote
./main --help
[...]
--override-kv KEY=TYPE:VALUE
                        advanced option to override model metadata by key. may be specified multiple times

I tried this with
--override-kv llama.expert_used_count=int:3
and it worked:

llm_load_print_meta: n_expert = 8
llm_load_print_meta: n_expert_used = 3

Replies: 4 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by araleza
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants