Skip to content

Loading GGML converted qlora models #4018

Answered by KerfuffleV2
ragesh2000 asked this question in Q&A
Discussion options

You must be logged in to vote

You need to use the base (full) model with -m and the converted LoRA with --lora. So it should look like -m my_full_model.gguf --lora ggml-adapter-model.bin

Note: -l is not the short form of --lora, it is for setting logit bias.

Replies: 1 comment 11 replies

Comment options

You must be logged in to vote
11 replies
@ragesh2000
Comment options

@KerfuffleV2
Comment options

@ragesh2000
Comment options

@KerfuffleV2
Comment options

Answer selected by ragesh2000
@ragesh2000
Comment options

@KerfuffleV2
Comment options

@ragesh2000
Comment options

@KerfuffleV2
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants