llama: refactor llama_decode_impl (#11381) #3814
build.yml
on: push
Matrix: windows-2019-cmake-cuda
Matrix: windows-latest-cmake-hip-release
Matrix: windows-latest-cmake
macOS-latest-cmake-arm64
12m 38s
macOS-latest-cmake-x64
6m 26s
ubuntu-cpu-cmake
2m 57s
ubuntu-latest-cmake-rpc
3m 27s
ubuntu-22-cmake-vulkan
19m 57s
ubuntu-22-cmake-hip
20m 21s
ubuntu-22-cmake-musa
12m 31s
ubuntu-22-cmake-sycl
5m 10s
ubuntu-22-cmake-sycl-fp16
6m 39s
macOS-latest-cmake-ios
1m 26s
macOS-latest-cmake-tvos
1m 17s
ubuntu-latest-cmake-cuda
11m 51s
windows-latest-cmake-sycl
11m 3s
windows-latest-cmake-hip
16m 29s
ios-xcode-build
1m 38s
android-build
7m 50s
Matrix: macOS-latest-swift
Matrix: openEuler-latest-cmake-cann
Matrix: ubuntu-latest-cmake-sanitizer
Matrix: windows-msys2
release
2m 9s
Annotations
1 error and 7 warnings
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
cudart-llama-bin-win-cu11.7-x64.zip
|
303 MB |
|
cudart-llama-bin-win-cu12.4-x64.zip
|
372 MB |
|
llama-bin-macos-arm64.zip
|
20.9 MB |
|
llama-bin-macos-x64.zip
|
22.4 MB |
|
llama-bin-ubuntu-x64.zip
|
24.2 MB |
|
llama-bin-win-avx-x64.zip
|
13.8 MB |
|
llama-bin-win-avx2-x64.zip
|
13.8 MB |
|
llama-bin-win-avx512-x64.zip
|
13.8 MB |
|
llama-bin-win-cu11.7-x64.zip
|
150 MB |
|
llama-bin-win-cu12.4-x64.zip
|
150 MB |
|
llama-bin-win-hip-x64-gfx1030.zip
|
236 MB |
|
llama-bin-win-hip-x64-gfx1100.zip
|
238 MB |
|
llama-bin-win-hip-x64-gfx1101.zip
|
238 MB |
|
llama-bin-win-kompute-x64.zip
|
14.1 MB |
|
llama-bin-win-llvm-arm64-opencl-adreno.zip
|
17.5 MB |
|
llama-bin-win-llvm-arm64.zip
|
17.5 MB |
|
llama-bin-win-msvc-arm64.zip
|
56.5 MB |
|
llama-bin-win-noavx-x64.zip
|
13.8 MB |
|
llama-bin-win-openblas-x64.zip
|
24.8 MB |
|
llama-bin-win-sycl-x64.zip
|
95.3 MB |
|
llama-bin-win-vulkan-x64.zip
|
15.9 MB |
|