Actions: huggingface/text-generation-inference

Showing runs from all workflows
5,612 workflow run results

fix cpu and xpu issue (#2116)
Secret Leaks #38: Commit e563983 pushed by Narsil
June 25, 2024 14:47 19s main
fix cpu and xpu issue (#2116)
CI build #79: Commit e563983 pushed by Narsil
June 25, 2024 14:47 57m 40s main
pages build and deployment
pages-build-deployment #723: by Narsil
June 25, 2024 14:47 35s main
add torch dtype
Secret Leaks #156: Commit a7909e6 pushed by mht-sharma
June 25, 2024 14:33 17s fp8_kvcache
Fix nccl regression on PyTorch 2.3 upgrade
Automatic Documentation for Launcher #93: Pull request #2099 synchronize by fxmarty
June 25, 2024 14:28 1m 16s fix-nccl-regression
Fix nccl regression on PyTorch 2.3 upgrade
Server Tests #2221: Pull request #2099 synchronize by fxmarty
June 25, 2024 14:28 15m 55s fix-nccl-regression
Fix nccl regression on PyTorch 2.3 upgrade
CI build #77: Pull request #2099 synchronize by fxmarty
June 25, 2024 14:28 1h 0m 41s fix-nccl-regression
set LD_PRELOAD
Secret Leaks #155: Commit a1695ce pushed by fxmarty
June 25, 2024 14:28 18s fix-nccl-regression
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #92: Pull request #1940 synchronize by Narsil
June 25, 2024 14:24 1m 18s flashdecoding
"ipex" -> "cpu"
Secret Leaks #37: Commit 6beb455 pushed by Narsil
June 25, 2024 14:24 22s flashdecoding
Add pytest release marker
CI build #75: Pull request #2114 synchronize by danieldk
June 25, 2024 13:32 1h 1m 31s ci/release-tests
Add pytest release marker
Automatic Documentation for Launcher #91: Pull request #2114 synchronize by danieldk
June 25, 2024 13:32 1m 16s ci/release-tests
Mark many models as release to speed up CI
Secret Leaks #36: Commit 2706cca pushed by danieldk
June 25, 2024 13:32 16s ci/release-tests
Idefics2: sync added image tokens with transformers
Automatic Documentation for Launcher #90: Pull request #2080 synchronize by danieldk
June 25, 2024 13:13 1m 26s bugfix/idefics2-no-image-splitting
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
CI build #73: Pull request #1940 synchronize by Narsil
June 25, 2024 13:10 1h 21m 48s flashdecoding
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #89: Pull request #1940 synchronize by Narsil
June 25, 2024 13:10 1m 20s flashdecoding
Update?
Secret Leaks #34: Commit 424a625 pushed by Narsil
June 25, 2024 13:10 22s flashdecoding
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
Automatic Documentation for Launcher #88: Pull request #1940 synchronize by Narsil
June 25, 2024 12:24 1m 19s flashdecoding