Repositories
- hyscale-lab/serving-khala (Public, forked from knative/serving)
  Kubernetes-based, scale-to-zero, request-driven compute
- hyscale-lab/vllm-kvcache (Public, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
- hyscale-lab/slimsc (Public)
  Official repository for the EMNLP 2025 paper "Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency".
- hyscale-lab/scholar-scout (Public)