-
Notifications
You must be signed in to change notification settings - Fork 190
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[DNM][AMD] agentx-v0.4 rebased from commit chore/agentx-v0.4 commit 7f61
#1709
opened Jun 11, 2026 by
seungrokj
Collaborator
Loading…
2 tasks
[Klaud Cold] dsv4-fp4-mi355x-sglang-disagg: DeepSeek-V4-Pro SGLang disagg (8k1k conc=1 smoke test)
full-sweep-enabled
#1708
opened Jun 11, 2026 by
functionstackx
Collaborator
Loading…
5 tasks
[Klaud Cold] dsv4-fp4-mi355x-vllm-disagg: DeepSeek-V4-Pro vLLM disagg (8k1k conc=1 smoke test)
full-sweep-enabled
#1707
opened Jun 11, 2026 by
functionstackx
Collaborator
Loading…
4 tasks
[WIP] [NV] Update MiniMax B200/B300 aggregate vLLM settings
full-sweep-enabled
#1704
opened Jun 10, 2026 by
jasonlizhengjian
Collaborator
Loading…
Sync dsv4-fp4-b300-trt recipes with B300 agg frontier config
full-sweep-enabled
#1703
opened Jun 10, 2026 by
Oseltamivir
Collaborator
Loading…
dsv4-fp4-b300-sglang-mtp: add piecewise cuda graph flags
full-sweep-enabled
#1702
opened Jun 10, 2026 by
yhyang201
Collaborator
Loading…
[AMD][MI35X] 0610 DSV4
AMD
full-sweep-enabled
#1701
opened Jun 10, 2026 by
1am9trash
Collaborator
Loading…
Sync dsv4-fp4-b200-trt recipes with B200 agg frontier config
full-sweep-enabled
#1699
opened Jun 10, 2026 by
qiaoxj07
Collaborator
Loading…
[Do Not Merge] kimik2.5-fp4-b300-vllm: align server launch with B200 recipe
full-sweep-enabled
#1698
opened Jun 9, 2026 by
RohitNagraj
Collaborator
Loading…
[WIP][NV] add dsv4-fp4-gb300-dynamo-sglang-mtp-1k1k
full-sweep-enabled
#1697
opened Jun 9, 2026 by
hshrivastava-droid
Collaborator
Loading…
dsr1 disagg 8k1k mtp: nightly 20260609 + conc-64 dispatch-bug validation
non-canary-full-sweep-enabled
Run the full sweep without the canary gate (full search space, no trim)
#1696
opened Jun 9, 2026 by
Oseltamivir
Collaborator
Loading…
dsv4-fp4-b300-sglang: enable piecewise cuda graph and mixed chunk
full-sweep-enabled
#1693
opened Jun 9, 2026 by
yhyang201
Collaborator
Loading…
dsv4-fp4-gb300-dynamo-trt: STP + MTP disagg trtllm recipes on GB300
full-sweep-enabled
#1689
opened Jun 8, 2026 by
Ankur-singh
Collaborator
Loading…
dsr1-fp4-b200-dynamo-sglang-mtp: 8k1k 6-variant MTP sweep on local split recipes
full-sweep-enabled
#1688
opened Jun 8, 2026 by
Ankur-singh
Collaborator
Loading…
[NV] DeepSeek-V4-Pro trtllm disagg receipes for STP and MTP
#1687
opened Jun 8, 2026 by
richardhuo-nv
Loading…
[DO NOT MERGE] Run-only: gb300 dsr1 measured power+temp validation
sweep-enabled
#1686
opened Jun 8, 2026 by
arygupt
Collaborator
Loading…
[Maintainers still waiting for PR author to finish his PR & run the tests so that we can review it promptly afterwards][AMD] dsv4-fp4-mi355x-atom-disagg, add multi-node ATOM/mooncake disaggregation support
AMD
full-sweep-enabled
#1683
opened Jun 8, 2026 by
seungrokj
Collaborator
Loading…
5 tasks
dsv4-fp4-b300-sglang: align env vars to GB300
full-sweep-enabled
#1682
opened Jun 8, 2026 by
yhyang201
Collaborator
Loading…
[AMD][MI35X] Qwen3.5-fp4 SGLang single-node benchmark
AMD
full-sweep-enabled
#1680
opened Jun 8, 2026 by
1am9trash
Collaborator
Loading…
Bump actions/checkout from 6.0.2 to 6.0.3 in the github-actions group
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#1679
opened Jun 8, 2026 by
dependabot
Bot
Loading…
Add DSv4-Pro FP4 GB200 SGLang disagg + MTP config
full-sweep-enabled
#1676
opened Jun 5, 2026 by
Ankur-singh
Collaborator
Loading…
Add DSv4-Pro FP4 GB200 SGLang disagg config
full-sweep-enabled
#1675
opened Jun 5, 2026 by
Ankur-singh
Collaborator
Loading…
[AMD][MI355X] Bump qwen3.5-bf16 single-node SGLang image to v0.5.12.post1
#1673
opened Jun 5, 2026 by
ChangLiu0709
Collaborator
Loading…
2 of 3 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.