llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-20 05:26:07 +00:00

History

vulkan: fix mul_mat_vec failure in backend tests (#12529 )

The OOB calculation could be wrong if the last iteration was during one of
the unrolled loops. Adjust the unrolling counts to avoid this. Add a couple
new backend tests that hit this failure on NVIDIA GPUs.

2025-03-24 07:56:17 +01:00

cmake

cmake : enable building llama.cpp using system libggml (#12321 )

2025-03-17 11:05:23 +02:00

include

llama: Add support for RWKV v7 architecture (#12412 )

2025-03-18 07:27:50 +08:00

src

vulkan: fix mul_mat_vec failure in backend tests (#12529 )

2025-03-24 07:56:17 +01:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

SYCL: using graphs is configurable by environment variable and compile option (#12371 )

2025-03-18 11:16:31 +01:00