-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-6674][Breaking Change] Hopper SWA non-cyclic kernels + KV reuse + Spec Dec
#6379
opened Jul 26, 2025 by
symphonylyh
Loading…
1 task
chore: add _prepare_and_schedule_batch function in PyExecutor
#6365
opened Jul 25, 2025 by
QiJune
Loading…
[TRTLLM-6392][feat] Support turning on/off spec decoding dynamically
Community want to contribute
PRs initiated from Community
#6363
opened Jul 25, 2025 by
ziyixiong-nv
Loading…
[nvbug/5320234] fix: test_trtllm_bench_llmapi_launch
#6359
opened Jul 25, 2025 by
Superjomn
Loading…
doc: Add README for wide EP
Community want to contribute
PRs initiated from Community
Documentation
TRTLLM's textual/illustrative materials: API refs, guides, tutorials. Improvement & clarity.
#6356
opened Jul 25, 2025 by
kaiyux
Loading…
[https://nvbugs/5340941][https://nvbugs/5375785] - fix: Wrap attentio…
#6355
opened Jul 25, 2025 by
liji-nv
Loading…
chore: add warning for the default backend on serve and bench commands
#6350
opened Jul 25, 2025 by
Superjomn
Loading…
Add disable_optimistic_tuning flag and update gb_per_token calculation title
#6349
opened Jul 25, 2025 by
venkywonka
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-06-27.