Add deepseek support #258

hello-cpaxton · 2025-02-22T01:50:22Z

Description

Adds DeepSeek as a model option under Qwen. Example usage:

python -m stretch.app.chat --llm qwen25-Math-7B-Int4
python -m stretch.app.chat --llm qwen25-0.5B-AWQ
python -m stretch.app.chat --llm qwen25-Coder-7B
python -m stretch.app.chat --llm qwen25-Deepseek-1.5B

A name such as

python -m stretch.app.chat --llm qwen25-Deepseek-3B

should raise an error, since no weights exist for that configuration.

Note that not all DeepSeek fine-tunes are available (e.g., there doesn't seem to be a 3B). So I added bitsandbytes support for quantization, and you can now run Qwen 7B or 14B on ~5 or ~10 GB of VRAM after installing the dependencies:

python -m pip install bitsandbytes qwen_vl_utils autoawq "transformers>=4.49.0"
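
As a rough sanity check on the ~5 GB and ~10 GB figures above, the weight footprint can be estimated from the parameter count and the bits stored per weight. This is a back-of-the-envelope sketch, not code from this PR: the helper name `estimate_weight_gb` is hypothetical, 4-bit quantization is assumed, and 7e9 and 14e9 are nominal round parameter counts. Activations, the KV cache, and quantization metadata come on top of these numbers, which would be consistent with the larger totals quoted above.

```python
def estimate_weight_gb(num_params: float, bits_per_weight: int = 4) -> float:
    """Approximate memory needed for the model weights alone, in GB (1e9 bytes)."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 1e9


# Nominal parameter counts (assumed round numbers, not measured values).
for name, params in [("7B", 7e9), ("14B", 14e9)]:
    print(f"{name}: ~{estimate_weight_gb(params):.1f} GB of weights at 4-bit")
```

Under these assumptions the weights alone take about 3.5 GB for 7B and 7.0 GB for 14B.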

Checklist

I have performed a self-review of my code
If it is a core feature, I have added thorough tests
I have added documentation for the changes
I have updated the README file if necessary
I have run on hardware if necessary

hello-peiqi

Based on my search, DeepSeek R1 distilled Qwen models are only available in sizes 32B, 14B, 7B, and 1.5B.

hello-peiqi

Coder: 32B, 14B, 7B, 3B, 1.5B, 0.5B (None, Instruct, Instruct-AWQ, Instruct-GGUF, Instruct-GPTQ-Int4, Instruct-GPTQ-Int8)
Math: 72B, 7B, 1.5B (None, Instruct)
VL: 72B, 7B, 3B (Instruct, Instruct-AWQ)
None: 0.5B, 1.5B, 3B, 7B, 14B, 32B, 72B (None, Instruct, Instruct-AWQ, Instruct-GGUF, Instruct-GPTQ-Int4, Instruct-GPTQ-Int8)
Deepseek: 1.5B, 7B, 14B, 32B
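
The size lists above could drive the check described in the PR, where a name like qwen25-Deepseek-3B should raise an error. The following is a minimal sketch under stated assumptions, not the code in this PR: `SUPPORTED_SIZES` and `parse_llm_name` are hypothetical names, the `qwen25-[Variant-]Size[-Suffix]` layout is inferred from the example commands in the description, and the trailing suffix (e.g. `AWQ`, `Int4`) is passed through without validation.

```python
# Sizes per variant, copied from the review comment above.
# The empty string stands for the plain model with no variant prefix.
SUPPORTED_SIZES = {
    "": ["0.5B", "1.5B", "3B", "7B", "14B", "32B", "72B"],
    "Coder": ["0.5B", "1.5B", "3B", "7B", "14B", "32B"],
    "Math": ["1.5B", "7B", "72B"],
    "VL": ["3B", "7B", "72B"],
    "Deepseek": ["1.5B", "7B", "14B", "32B"],
}
NAMED_VARIANTS = {key for key in SUPPORTED_SIZES if key}


def parse_llm_name(name: str) -> tuple[str, str, str]:
    """Split a name such as 'qwen25-Coder-7B' into (variant, size, suffix).

    Raises ValueError if the prefix is wrong or the size is not listed
    for the variant. The suffix is returned unvalidated.
    """
    parts = name.split("-")
    if parts[0] != "qwen25" or len(parts) < 2:
        raise ValueError(f"Expected a name like 'qwen25-<size>', got {name!r}")
    rest = parts[1:]
    # An optional variant token comes first; otherwise it is the plain model.
    variant = rest.pop(0) if rest[0] in NAMED_VARIANTS else ""
    if not rest:
        raise ValueError(f"Missing model size in {name!r}")
    size = rest.pop(0)
    if size not in SUPPORTED_SIZES[variant]:
        raise ValueError(
            f"No {size} weights for variant {variant or 'base'!r}; "
            f"available sizes: {SUPPORTED_SIZES[variant]}"
        )
    return variant, size, "-".join(rest)


print(parse_llm_name("qwen25-Deepseek-1.5B"))
try:
    parse_llm_name("qwen25-Deepseek-3B")
except ValueError as err:
    print(f"rejected: {err}")
```

Keeping the sizes in one table means a new variant or size only needs a dictionary entry, and the error message can list the valid alternatives.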

cpaxton added 2 commits February 21, 2025 20:47

add deepseek

6d51d32

add qwen deepseek and make support a little cleaner and more maintainable

8888ed8

hello-cpaxton self-assigned this Feb 22, 2025

hello-cpaxton requested a review from hello-peiqi February 22, 2025 01:50

cpaxton added 4 commits February 21, 2025 20:58

updates

99855dc

added qwen quantization and some cleanup

1bede03

update and add bitsandbytes

c788f2e

update

8dd3991

hello-peiqi requested changes Feb 25, 2025

hello-peiqi approved these changes Feb 27, 2025

hello-peiqi and others added 3 commits February 27, 2025 13:37

suggestions on deepseek update

bc4fcbc

Merge branch 'main' into cpaxton/deepseek

6c86d8b

Merge branch 'main' into cpaxton/deepseek

496149f
