llama.cpp

Mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-04-20 05:26:07 +00:00

History

* ggml-backend : fix async copy from CPU

* cuda : more reliable async copy, fix stream used when the devices are the same

2024-08-07 13:29:02 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cann: update cmake (#8765)

2024-07-30 12:37:35 +02:00