llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-17 20:16:09 +00:00

History

Neo Zhang Jianyu 08d5986290

[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035 )

* opt performance by reorder for Intel GPU

* detect hw type and save opt feature, and print opt feature

* correct name

* support optimize graph once when compute graph, record the opt status in tensor->extra, make CI passed

* add env variable GGML_SYCL_DISABLE_OPT for debug

* use syclex::architecture replace the custom hw define, update the guide for GGML_SYCL_DISABLE_OPT

* add performance data

* mv getrows functions to separeted files

* fix global variables

---------

Co-authored-by: arthw <14088817+arthw@users.noreply.github.com>

2025-02-24 22:33:23 +08:00

BLIS.md

make : deprecate (#10514 )

2024-12-02 21:22:53 +02:00

CANN.md

CANN: Update cann.md to display correctly in CLion (#10538 )

2024-11-28 15:27:11 +08:00

OPENCL.md

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

SYCL.md

[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035 )

2025-02-24 22:33:23 +08:00