Skip to content

Pull requests: HabanaAI/vllm-fork

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[deepseek r1] HPU support for deepseek
#1030 opened Apr 8, 2025 by xuechendi Loading…
[aice/v1.20.1] PRC branch migration for v1.20.1
#1029 opened Apr 8, 2025 by ranzhejiang Loading…
[SW-224648] Fix test logs redirection
#1027 opened Apr 8, 2025 by bmyrcha Loading…
[SW-224648] Fix test logs redirection
#1026 opened Apr 8, 2025 by bmyrcha Loading…
Support Data Parallel MOE on HPU
#1022 opened Apr 8, 2025 by xinyu-intel Loading…
Enable alibi_slope with FusedSDPA.
#1013 opened Apr 4, 2025 by libinta Loading…
Warmup V1
#1012 opened Apr 4, 2025 by iboiko-habana Loading…
Implement Pipeline Parallelism support for HPU.
#1000 opened Apr 2, 2025 by jmaksymczuk Loading…
Use the correct fp8 range for G2
#984 opened Mar 31, 2025 by czhu15 Loading…
add torch profiler for the LLM engine
#979 opened Mar 28, 2025 by yangulei Loading…
Enable torchrun on Gaudi
#974 opened Mar 27, 2025 by czhu15 Loading…
enable fp32 softmax in flat_pa_mla
#972 opened Mar 27, 2025 by yangulei Loading…
Update linear.py
#964 opened Mar 25, 2025 by michalkuligowski Draft
Update layers.py
#957 opened Mar 25, 2025 by michalkuligowski Draft
ProTip! Add no:assignee to see everything that’s not assigned.