Skip to content

Pull requests: vllm-project/tpu-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Misc] Fix how model dtype is being configured ready ONLY add when PR is ready to merge/full CI is needed
#1286 opened Dec 11, 2025 by kyuyeunk Loading…
Support overriding logic for hybrid kv cache padding ready ONLY add when PR is ready to merge/full CI is needed
#1285 opened Dec 11, 2025 by kyuyeunk Loading…
[Bugfix][Refactor] Fix compressed tensor moe init ready ONLY add when PR is ready to merge/full CI is needed
#1283 opened Dec 11, 2025 by kyuyeunk Loading…
[Kernel][Misc] Remove jax.named_scope ready ONLY add when PR is ready to merge/full CI is needed
#1278 opened Dec 10, 2025 by kyuyeunk Loading…
[do not review][do not submit] ready ONLY add when PR is ready to merge/full CI is needed
#1277 opened Dec 10, 2025 by QiliangCui Loading…
Move the If nightly==1 check out of command.
#1276 opened Dec 10, 2025 by QiliangCui Loading…
add new kernel and quantization support matrices
#1275 opened Dec 10, 2025 by boe20211 Loading…
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla Loading…
Avoid installing CUDA related stuff
#1246 opened Dec 4, 2025 by wdhongtw Loading…
update run_in_docker script for running on local env ready ONLY add when PR is ready to merge/full CI is needed
#1243 opened Dec 4, 2025 by ernie-chang Loading…
Add workflow to build vLLM-TPU wheel using PyPI tpu-inference ready ONLY add when PR is ready to merge/full CI is needed
#1241 opened Dec 4, 2025 by ylangtsou Draft
[CI] Fix awq dtype ready ONLY add when PR is ready to merge/full CI is needed
#1220 opened Dec 2, 2025 by kyuyeunk Loading…
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle Loading…
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23 Loading…
Save size in scalar scratch for bo and bq ready ONLY add when PR is ready to merge/full CI is needed
#1201 opened Dec 1, 2025 by rupengliu-meta Loading…
[Qwix/Flax] Upgrade to Flax 0.12.0 + Qwix 0.1.4
#1170 opened Nov 25, 2025 by jrplatin Loading…
[do not merge] test status check POC ready ONLY add when PR is ready to merge/full CI is needed
#1168 opened Nov 25, 2025 by khluu Loading…
[Feat][TPU Offload] KV cache offload to local cpu buffer ready ONLY add when PR is ready to merge/full CI is needed
#1163 opened Nov 24, 2025 by juncgu-google Loading…
DP support for GPT OSS
#1096 opened Nov 13, 2025 by wenxindongwork Draft
ProTip! Adding no:label will show everything without a label.