Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 9 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -175,11 +175,18 @@ The documentation covers:

## Testing

kvcached can be enabled by setting the following environmental variables:
kvcached can be enabled by setting `ENABLE_KVCACHED` either in the shell:

```bash
export ENABLE_KVCACHED=true
export KVCACHED_AUTOPATCH=1
```

or from Python script (before `import vllm` / `import sglang`):

```python
import os
os.environ["ENABLE_KVCACHED"] = "true"
from kvcached import autopatch # required when setting from Python;
```

If you are using the engine-specific dockers, you can test kvcached by running the original engines' benchmark scripts. For example:
Expand Down
5 changes: 4 additions & 1 deletion kvcached/integration/sglang/autopatch.py
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,10 @@


def _env_enabled() -> bool:
return os.getenv("KVCACHED_AUTOPATCH", "false").lower() in ("true", "1")
return (
os.getenv("ENABLE_KVCACHED", "false").lower() in ("true", "1")
or os.getenv("KVCACHED_AUTOPATCH", "false").lower() in ("true", "1")
)


@when_imported("sglang")
Expand Down
5 changes: 4 additions & 1 deletion kvcached/integration/vllm/autopatch.py
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,10 @@


def _env_enabled() -> bool:
return os.getenv("KVCACHED_AUTOPATCH", "false").lower() in ("true", "1")
return (
os.getenv("ENABLE_KVCACHED", "false").lower() in ("true", "1")
or os.getenv("KVCACHED_AUTOPATCH", "false").lower() in ("true", "1")
)


@when_imported("vllm")
Expand Down
Loading