-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-5563][infra] Move test_rerun.py to script folder
#6571
opened Aug 2, 2025 by
yiqingy0
Loading…
[None][feat] Add GPTQ Int8 Options for Qwen TRT path
#6568
opened Aug 1, 2025 by
JyChang012
Loading…
[TRTLLM-6090] doc: Add multimodal part to the feature section
#6567
opened Aug 1, 2025 by
chang-l
Loading…
[None][doc] Add new doc
Community want to contribute
PRs initiated from Community
#6565
opened Aug 1, 2025 by
jamieliNVIDIA
Loading…
[TRTLLM-5500][infra] Update CODEOWNERS with new ownership rules for additional paths
#6564
opened Aug 1, 2025 by
venkywonka
•
Draft
Draft: feat: Include attention dp rank info with KV cache events
#6563
opened Aug 1, 2025 by
pcastonguay
Loading…
[TRTLLM-6069]doc: add trtllm-serve usage for cli reference section
#6562
opened Aug 1, 2025 by
nv-guomingz
Loading…
[None][doc] Create deployment guide for Llama4 Scout FP8 and NVFP4
#6550
opened Aug 1, 2025 by
chenfeiz0326
Loading…
[None][feat] improve dataloading for benchmark_dataset by using batch…
#6548
opened Aug 1, 2025 by
zerollzeng
Loading…
[TRTLLM-4501][feat] AutoTuner tuning config refactor and add tuning for kernel configs.
#6545
opened Aug 1, 2025 by
hyukn
Loading…
[None][doc] Create deployment guide for Llama 3.3 70B FP8 and NVFP4
Community want to contribute
PRs initiated from Community
#6543
opened Aug 1, 2025 by
jamieliNVIDIA
Loading…
[None][fix] Fix NCCL Ops when using MoE chunking
#6541
opened Aug 1, 2025 by
jinyangyuan-nvidia
•
Draft
[TRTLLM-6263][feat] Enable fp8 SwiGLU to minimize host overhead
#6540
opened Aug 1, 2025 by
JunyiXu-nv
Loading…
[None][feat] Use Separate QKV Input Layout for Context MLA
#6538
opened Aug 1, 2025 by
zhhuang-nv
Loading…
[https://nvbugs/5394392] [Fix] Enlarge scheduler and slot manager capacity under disagg bs == 1
#6537
opened Aug 1, 2025 by
yifeizhang-c
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.