-
Notifications
You must be signed in to change notification settings - Fork 54
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add github action for check ready label
ready
ONLY add when PR is ready to merge/full CI is needed
#1269
opened Dec 9, 2025 by
boe20211
Loading…
Fix TPU7x chip counting to account for chiplet architecture
#1266
opened Dec 8, 2025 by
burbajr
Loading…
Replacing bit_width() with itemized_bits().
ready
ONLY add when PR is ready to merge/full CI is needed
#1264
opened Dec 8, 2025 by
aman2930
Loading…
3 tasks done
Add default 'auto' MODEL_IMPL_TYPE that resolves based on architecture
ready
ONLY add when PR is ready to merge/full CI is needed
#1255
opened Dec 5, 2025 by
xingliu14
Loading…
[Kernel][FusedMoE] Fix MoE crash and hang issues
ready
ONLY add when PR is ready to merge/full CI is needed
#1252
opened Dec 5, 2025 by
bythew3i
Loading…
update run_in_docker script for running on local env
ready
ONLY add when PR is ready to merge/full CI is needed
#1243
opened Dec 4, 2025 by
ernie-chang
Loading…
Remove a branch with pl.when in fetching bkv
ready
ONLY add when PR is ready to merge/full CI is needed
#1239
opened Dec 4, 2025 by
rupengliu-meta
Loading…
[DRAFT] Optimize Dockerfile to reduce image size and build time.
#1226
opened Dec 3, 2025 by
py4
Loading…
[CI] Fix awq dtype
ready
ONLY add when PR is ready to merge/full CI is needed
#1220
opened Dec 2, 2025 by
kyuyeunk
Loading…
[RPA] Pipeline flash attention in default kernel
ready
ONLY add when PR is ready to merge/full CI is needed
#1203
opened Dec 1, 2025 by
jrplatin
Loading…
fix(rpa-v3): add sliding window mask to h64 kernel and attention_sink to h128
#1185
opened Nov 26, 2025 by
erfanzar
Loading…
[do not merge] test status check POC
ready
ONLY add when PR is ready to merge/full CI is needed
#1168
opened Nov 25, 2025 by
khluu
Loading…
[Feat][TPU Offload] KV cache offload to local cpu buffer
ready
ONLY add when PR is ready to merge/full CI is needed
#1163
opened Nov 24, 2025 by
juncgu-google
Loading…
Enable Pipeline Parallelism on Jax models
ready
ONLY add when PR is ready to merge/full CI is needed
#1077
opened Nov 12, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-09.