-
Notifications
You must be signed in to change notification settings - Fork 416
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[C][PyTorch]Make pytorch extensions pure cpp
2.4.0
#1754
opened May 7, 2025 by
ksivaman
Loading…
8 of 13 tasks
cache sequence chunk ids for reordering
#1751
opened May 6, 2025 by
xrennvidia
Loading…
5 of 13 tasks
[PyTorch] Refactor activation offloading of quantized tensors.
#1738
opened Apr 30, 2025 by
pggPL
Loading…
8 of 13 tasks
fix: update grad_output quant to avoid redundant work
#1736
opened Apr 30, 2025 by
kshitij12345
Loading…
correct weight quantizer for grouped_linear/layernorm_linear and layernorm_mlp
#1733
opened Apr 29, 2025 by
HuangHunag-MT
Loading…
8 of 13 tasks
Support Context Parallel for Multi Latent Attention (MLA)
#1729
opened Apr 29, 2025 by
yuzhongw-nvidia
Loading…
13 tasks
[JAX] Decouple Recipe and ScalingMode
#1728
opened Apr 29, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
Add variance calculation from FusedAdam optimizer states
#1726
opened Apr 28, 2025 by
kwyss-nvidia
Loading…
7 of 13 tasks
[PyTorch] Reduce verbosity of CI logs
testing
Improvements to tests or testing infrastructure
#1725
opened Apr 28, 2025 by
timmoon10
Loading…
8 of 14 tasks
MXFP8 support in Userbuffers
enhancement
New feature or request
#1711
opened Apr 22, 2025 by
timmoon10
Loading…
5 of 13 tasks
[JAX] Updated: unbalanced CP with THD format
#1709
opened Apr 22, 2025 by
huanghua1994
Loading…
8 of 13 tasks
[PyTorch] FP8 Subchannel Recipe With FP8 Gather And Configurable Scaling Factor Tensor Swizzling
#1707
opened Apr 21, 2025 by
zhongbozhu
Loading…
1 of 13 tasks
[JAX] Add collective GEMM without compute/communication overlap
#1675
opened Apr 11, 2025 by
philipphack
Loading…
1 of 6 tasks
[JAX] GroupedQuantizer and GroupedScaledTensor
#1666
opened Apr 10, 2025 by
phu0ngng
Loading…
7 of 13 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.