Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 1
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 82
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Usage]: v1 engine on CPU usage How to use vllm
#16056 opened Apr 4, 2025 by harryhan618
1 task done
[Doc]: Steps to run 2 different models on Kaggle GPUs using vllm documentation Improvements or additions to documentation
#16051 opened Apr 4, 2025 by furkanbk
1 task done
[Performance]: LLM Offline Inference Slowing Down Over Time performance Performance-related issues
#16050 opened Apr 4, 2025 by uyzhang
1 task done
[Bug]: Multiple rounds of dialogue, only infering for the last round bug Something isn't working
#16046 opened Apr 4, 2025 by missTL
1 task done
Integrate PPLX-kernels
#16039 opened Apr 3, 2025 by tlrmchlsmth
[Bug]: xgrammar missing file crashes the server bug Something isn't working
#16030 opened Apr 3, 2025 by servient-ashwin
1 task done
[Feature]: Adding tool_choice: required for lm-format-enforcer feature request New feature or request
#16029 opened Apr 3, 2025 by ItzAmirreza
1 task done
[Bug]: Two beginning of sequence tokens for Lllama-3.2-3B-Instruct bug Something isn't working
#16028 opened Apr 3, 2025 by Naqu6
1 task done
[Bug]: Unable to run Phi4 with tensor-parallel-size 4 torch.compile compatiblity bug Something isn't working
#16021 opened Apr 3, 2025 by roguetech
1 task done
[New Model]: support for fashion-clip new model Requests to new models
#16019 opened Apr 3, 2025 by priyankaiiit14
1 task done
[Bug]: Null response for Mistral3.1 bug Something isn't working
#16014 opened Apr 3, 2025 by hahmad2008
1 task done
[Bug]: Tool call auto not working with Qwen models in v0.8.2 bug Something isn't working
#16008 opened Apr 3, 2025 by pivotal-marcela-campo
1 task done
[Bug]: crash during debug, works ok running cli bug Something isn't working torch.compile
#16006 opened Apr 3, 2025 by CharlesJu1
1 task done
[Bug] [Misc]: test_sharded_state_loader run failed bug Something isn't working
#16004 opened Apr 3, 2025 by Accelerator1996
1 task done
[Usage]: Is it possible to run vLLM inside a Jupyter Notebook? usage How to use vllm
#16003 opened Apr 3, 2025 by repodiac
1 task done
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.