Default Branch

81c7e64fc2 · dsiable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937) · Updated 2025-04-14 10:19:07 +00:00

Branches

fbddb26250 · ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1 · Updated 2025-01-12 02:06:49 +00:00

669
7

9605c5fb28 · cmake : remove explicit _XOPEN_SOURCE · Updated 2025-01-06 11:02:48 +00:00

714
2

aa014d7e89 · Use mutex instead of atomics for vk_instance counters · Updated 2024-12-30 05:14:58 +00:00

727
2

a362c74aa2 · profiler: initial support for profiling graph ops · Updated 2024-12-26 23:59:37 +00:00    mirrors

731
1

fe9235d795 · Force max subgroup size for coopmat shaders · Updated 2024-12-18 07:26:27 +00:00    mirrors

773
1

4fbb801a9d · ggml : update ggml_backend_cpu_device_supports_op · Updated 2024-12-17 16:09:02 +00:00    mirrors

783
3

3e92f4ecbe · cont [no ci] · Updated 2024-12-15 10:36:03 +00:00    mirrors

795
2

7e9208e408 · scripts : change build path to "build-bench" for compare-commits.sh · Updated 2024-12-15 09:47:30 +00:00    mirrors

795
1

fb18934a97 · gguf-py : bump version to 0.11.0 · Updated 2024-12-11 21:13:31 +00:00    mirrors

816
0
Included

4f3a7e279b · Force max subgroup size for coopmat shaders · Updated 2024-12-10 20:27:04 +00:00    mirrors

824
2

b8d1b1a5e1 · server : fix infill prompt format · Updated 2024-12-08 20:12:11 +00:00    mirrors

834
1

a6648b9df7 · server : chunked prefill support · Updated 2024-12-08 07:48:18 +00:00    mirrors

838
1

a8046c888a · use calloc instead of malloc · Updated 2024-12-04 16:24:35 +00:00    mirrors

869
3

81611bef72 · server : add tests · Updated 2024-12-04 11:11:26 +00:00    mirrors

869
3

33d7b70c88 · server : do not speculate during prompt processing · Updated 2024-12-03 08:58:43 +00:00    mirrors

882
1

3c8a2a83fe · shmem experiments · Updated 2024-11-26 13:17:38 +00:00    mirrors

945
3

dafedd33d2 · 4x4 -> 4x · Updated 2024-11-26 12:54:02 +00:00    mirrors

945
2

bf3494345e · metal : some mul_mv experiments · Updated 2024-11-26 12:48:50 +00:00    mirrors

945
1

b83cae088c · speculative : add infill mode · Updated 2024-11-26 09:14:17 +00:00    mirrors

950
1

4ff0831ce6 · metal : use F16 math in mul_mat kernels · Updated 2024-11-25 13:15:26 +00:00    mirrors

963
1