llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-19 13:06:10 +00:00


Nikita Sarychev cae9fb4361

HIP: Only call rocblas_initialize on rocblas versions with the multiple instantiation bug (#11080)

This disables the workaround on rocblas versions where the bug is fixed (>=4.0.0), eliminating the runtime cost and unnecessary VRAM allocation of loading all tensile objects.

2025-01-28 16:42:20 +01:00

cmake

cmake: add ggml find package (#11369)

2025-01-26 12:07:48 -04:00

include

rpc : early register backend devices (#11262)

2025-01-17 10:57:09 +02:00

src

HIP: Only call rocblas_initialize on rocblas versions with the multiple instantiation bug (#11080)

2025-01-28 16:42:20 +01:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cmake: add ggml find package (#11369)

2025-01-26 12:07:48 -04:00