-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix[nvbug/5286515]: trtllm-llmapi-launch on single node single gpu
#4529
opened May 21, 2025 by
Superjomn
Loading…
test: conditional disagg and cache aware balancing for deepseek v3
#4522
opened May 21, 2025 by
zhengd-nv
Loading…
[5234029][5226211] chore: Unwaive multimodal tests for Qwen model.
#4519
opened May 21, 2025 by
hyukn
Loading…
[https://nvbugspro.nvidia.com/bug/5181262] [test] Unwaive Mistral Nemo test
#4515
opened May 21, 2025 by
syuoni
Loading…
[fix] Fix Llama4 allgather error due to None tensor
#4511
opened May 21, 2025 by
jinyangyuan-nvidia
Loading…
[TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation
#4506
opened May 21, 2025 by
rakib-hasan
Loading…
test(perf): Pt.2 Add
Llama-3_3-Nemotron-Super-49B-v1
integration-perf-tests (cpp)
#4499
opened May 20, 2025 by
venkywonka
Loading…
fix: Handle additional model outputs based on pipeline parallel rank
#4498
opened May 20, 2025 by
Funatiq
Loading…
[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H
#4494
opened May 20, 2025 by
tomeras91
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.