Georgi Gerganov
9cb317f77e
ggml : full ALiBi support (#7192)
* ggml : full ALiBi support
* ggml : update ggml_soft_max_ext() CUDA, SYCL
* ggml : ggml_flash_attn_ext() support ALiBi (CPU)
* ggml : ggml_flash_attn_ext() support ALiBi (Metal)
* ggml : fix warning
* ggml : ggml_flash_attn_ext() support ALiBi (CUDA)
ggml-ci
* ggml : fix assert message
* vulkan : add dev notes
* ggml : require mask when using ALiBi
ggml-ci
* convert : fix convert for refact models
2024-05-11 10:32:41 +03:00
..
2024-03-09 14:17:11 +02:00
2024-05-08 15:06:43 +03:00
2024-01-26 14:18:00 +02:00
2024-01-26 14:18:00 +02:00
2024-03-21 11:50:43 +00:00
2024-02-16 11:31:07 +02:00
2024-05-11 10:32:41 +03:00
2024-01-29 15:50:50 -05:00
2024-04-24 11:52:37 +03:00
2023-10-30 19:19:15 +02:00
2023-12-24 14:34:22 +01:00
2024-04-29 14:40:14 -04:00
2024-02-18 18:20:12 +02:00
2024-05-08 21:53:08 +02:00
2024-02-18 18:20:12 +02:00
2024-02-16 11:31:07 +02:00
2024-02-25 12:09:09 +02:00
2024-03-25 19:33:15 +02:00
2024-02-11 15:22:33 +02:00
2023-09-28 19:04:36 +03:00
2024-02-08 09:46:30 +01:00
2024-05-04 08:32:32 +03:00
2024-05-05 08:07:48 +03:00
2024-05-04 08:32:32 +03:00
2024-03-11 17:47:47 +02:00
2024-04-29 16:58:41 +03:00
2024-05-09 23:30:44 +10:00