Skip to content

Commit d2f21b3

Browse files
committed
fix: quantization
1 parent e83dd4c commit d2f21b3

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

HakaseCore/llm/llama3.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ def generate_text(self, instruction: str) -> str:
6666
prompt = self.pipe.tokenizer.apply_chat_template(
6767
self.prompt, tokenize=False, add_generation_prompt=True
6868
)
69-
outputs = self.pipe(
69+
outputs = self.pipe.model.generate(
7070
prompt,
7171
do_sample=True,
7272
temperature=0.4,

0 commit comments

Comments
 (0)