Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

chore: clean ucx and nixl mirror.
#4531 opened May 21, 2025 by nv-guomingz Loading…
Qwen3 supports TRTLLM FP4 MoE backend
#4530 opened May 21, 2025 by rosenrodt Loading…
chore: minor refactoring and code clean-up
#4526 opened May 21, 2025 by Superjomn Loading…
feat: better build_wheel.py venv handling
#4525 opened May 21, 2025 by tongyuantongyu Loading…
[5180961] chore: Unwaive test for Qwen model.
#4524 opened May 21, 2025 by hyukn Loading…
[nvbugs5214239] - Unwaive test
#4523 opened May 21, 2025 by yiqingy0 Loading…
fix: Constrain tornado and setuptools
#4521 opened May 21, 2025 by kaiyux Loading…
feat: Skip sampler for intermediate pp stages.
#4514 opened May 21, 2025 by yuxianq Loading…
test: rcca https://nvbugs/5223130
#4510 opened May 21, 2025 by xinhe-nv Draft
fix: TRT-LLM Gen dtype declaration
#4503 opened May 20, 2025 by nekorobov Loading…
Add UB NCCL integration
#4500 opened May 20, 2025 by Tabrizian Draft
feat: forward exceptions to Python and catch OOMs
#4497 opened May 20, 2025 by ixlmar Loading…
Add debug nvtx
#4492 opened May 20, 2025 by Shunkangz Draft
perf: [draft] Add fused q_norm/k_norm/RoPE for Qwen3.
#4482 opened May 20, 2025 by bobboli Loading…
ProTip! Filter pull requests by the default branch with base:main.