Fix prefix cache OOB in prepare_prefill when num_cached_tokens grows during allocate by yuanmouya-prog · Pull Request #233 · GeeeekExplorer/nano-vllm

yuanmouya-prog · 2026-05-17T09:46:13Z

Bug

When prefix caching hits during block_manager.allocate(), num_cached_tokens can jump from 0 to N*block_size. But the scheduler already computed num_scheduled_tokens based on the old num_cached_tokens=0 before calling allocate(). This causes prepare_prefill to compute end = start + seqlen_q exceeding num_tokens, leading to block_table index out of range.

Reproduce

High concurrency (512 requests) + KV cache full + prefix cache hit → preempt → re-prefill → 100% crash with IndexError: list index out of range at model_runner.py:155.

Fix

One line: clamp seqlen_q to not exceed remaining uncached tokens.

-            seqlen_q = seq.num_scheduled_tokens
+            seqlen_q = min(seq.num_scheduled_tokens, seqlen - start)

…during allocate When prefix caching hits during block_manager.allocate(), num_cached_tokens can jump from 0 to N*block_size. But the scheduler already computed num_scheduled_tokens based on the old num_cached_tokens=0 before calling allocate(). This causes prepare_prefill to compute end = start + seqlen_q exceeding num_tokens, leading to block_table index out of range. The fix clamps seqlen_q to not exceed the remaining uncached tokens. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix prefix cache OOB in prepare_prefill when num_cached_tokens grows during allocate#233

Fix prefix cache OOB in prepare_prefill when num_cached_tokens grows during allocate#233
yuanmouya-prog wants to merge 1 commit into
GeeeekExplorer:mainfrom
yuanmouya-prog:fix/prefix-cache-oob-prepare-prefill

yuanmouya-prog commented May 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yuanmouya-prog commented May 17, 2026

Bug

Reproduce

Fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant