Default Branch

81c7e64fc2 · dsiable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937) · Updated 2025-04-14 10:19:07 +00:00

Branches

f7b0233eca · wip · Updated 2024-11-16 08:33:55 +00:00    mirrors

1025
1

5e6dad9322 · speculative : experimenting with Qwen2.5 · Updated 2024-11-14 09:31:31 +00:00    mirrors

1047
2

33bdee667e · speculative : fix out-of-bounds access · Updated 2024-11-14 09:23:45 +00:00    mirrors

1047
1

8c1b186cb5 · metal : minor Q4_0 optimization · Updated 2024-11-12 13:30:51 +00:00    mirrors

1057
21

3d1fe1bb4d · metal : int -> short, style · Updated 2024-11-09 08:32:16 +00:00    mirrors

1068
2

bd1198a67a · metal : fix build and some more comments · Updated 2024-11-09 08:09:50 +00:00    mirrors

1068
1

a2385da59c · make : clean-up [no ci] · Updated 2024-11-08 11:46:20 +00:00    mirrors

1075
9

94accca4c2 · vec move mask to shmem · Updated 2024-11-07 18:58:10 +00:00    mirrors

1085
19

c5d8bb5a81 · leave only basic functions for SYCL CI · Updated 2024-11-06 07:47:50 +00:00    mirrors

1150
2

4fc8673d09 · llama-bench : skip repeated values in consecutive lines · Updated 2024-11-02 14:37:33 +00:00    mirrors

1110
1

20e12112fd · llama : suggest reduce ctx size when kv init fails · Updated 2024-11-01 23:55:19 +00:00    mirrors

1113
2

afc4a7de65 · llama : enable flash attn automatically when supported · Updated 2024-10-30 22:30:06 +00:00    mirrors

1130
1

8233009d4d · Support SYCL device register · Updated 2024-10-20 02:06:51 +00:00    mirrors

1211
1

bc82fc2ed8 · llama-bench : add time-to-first-byte stat · Updated 2024-10-18 13:40:02 +00:00    mirrors

1182
1

2d3fc54ac6 · add amx kernel for gemm · Updated 2024-10-18 03:35:49 +00:00    mirrors

1192
1

630bce5a7f · ggml : fix possible buffer use after free in sched reserve · Updated 2024-10-17 22:21:54 +00:00    mirrors

1190
1

17b3a3e8cc · llama : minor llama_grammar refactoring · Updated 2024-10-17 09:23:27 +00:00    mirrors

1218
4

a34fc0dd86 · ci : reduce severity of unused Pyright ignore comments · Updated 2024-09-30 17:59:40 +00:00    mirrors

1273
1

114ab6347e · sampling : fix off-by-one in tail-free sampling · Updated 2024-09-23 08:44:55 +00:00    mirrors

1317
1

6e873e561a · llama : make llm_tokenizer more private · Updated 2024-09-20 08:41:51 +00:00    mirrors

1336
2