Issues: vllm-project/aibrix
The best practices for multi-node multi-GPU deployment of DeepSeek-R1-671B
kind/support
Categorizes issue as a support question.
triage/accepted
Indicates an issue or PR is ready to be actively worked on.
#977
opened Apr 15, 2025 by
ZTurboX
Failed to build code following the README
kind/support
triage/needs-information
Indicates an issue needs more information in order to work on it.
#974
opened Apr 13, 2025 by
gaowayne
[RFC]: Support InfinityStore in AIBrix as the new KVCache backend
area/kv-cache
kind/feature
Categorizes issue or PR as related to a new feature.
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
Kickstarting Community Building: Contributors, Maintainers, and Governance Guidelines
area/community
kind/misc
#945
opened Apr 7, 2025 by
Jeffwan
Orchestrate KV Cache for decentralized distributed hashing offerings
area/kv-cache
kind/feature
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
Multi-Node Inference: 'Waiting for creating a placement group of specs for 310 seconds'
#927
opened Apr 1, 2025 by
ying2025
Failed to run RayFleet when using hostNetwork
area/distributed
priority/important-soon
#915
opened Mar 28, 2025 by
vaaandark
KV cache: deploying the model across different GPUs creates two etcd pods
#910
opened Mar 27, 2025 by
ying2025
Move the benchmark codes to aibrix python package
area/benchmark
area/performance
kind/feature
#903
opened Mar 25, 2025 by
Jeffwan
[Vineyard] vLLM engine crashes after failing to connect to Vineyard when starting the pod
area/distributed
area/kv-cache
#874
opened Mar 17, 2025 by
gangmuk
Separate CRDs from manifest installation
area/autoscaling
area/installation
priority/important-soon
#873
opened Mar 17, 2025 by
Jeffwan
[Dist KV] vLLM pods that do not have kvcache pods running on the same node crash
area/installation
area/kv-cache
kind/bug
Something isn't working
kind/enhancement
New feature or request
priority/critical-urgent
#863
opened Mar 14, 2025 by
gangmuk
Documentation does not clearly explain how to set rate limiting, how to measure token consumption, or how to enable authentication for different users
area/gateway
kind/support
#859
opened Mar 13, 2025 by
vivekrsintc
Error: Invalid character 'u' looking for beginning of value
area/gateway
kind/support
triage/needs-information
#858
opened Mar 13, 2025 by
vivekrsintc
Automate local disk management and ai runtime model management
area/runtime
kind/feature
priority/important-soon
#854
opened Mar 12, 2025 by
Jeffwan
Pod scale success, aibrix-controller-manager failed to parse metrics
area/autoscaling
kind/support
triage/needs-information
#852
opened Mar 12, 2025 by
ying2025
[RFC]: Make API Gateway interface OpenAI compatible
area/gateway
kind/enhancement
priority/critical-urgent
#846
opened Mar 11, 2025 by
Jeffwan