llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-16 03:26:08 +00:00

History

Vulkan: Add DP4A MMQ and Q8_1 quantization shader (#12135 )

* Vulkan: Add DP4A MMQ and Q8_1 quantization shader

* Add q4_0 x q8_1 matrix matrix multiplication support

* Vulkan: Add int8 coopmat MMQ support

* Vulkan: Add q4_1, q5_0 and q5_1 quants, improve integer dot code

* Add GL_EXT_integer_dot_product check

* Remove ggml changes, fix mmq pipeline picker

* Remove ggml changes, restore Intel coopmat behaviour

* Fix glsl compile attempt when integer vec dot is not supported

* Remove redundant code, use non-saturating integer dot, enable all matmul sizes for mmq

* Remove redundant comment

* Fix integer dot check

* Fix compile issue with unsupported int dot glslc

* Update Windows build Vulkan SDK version

2025-03-31 14:37:01 +02:00

ISSUE_TEMPLATE

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

workflows

Vulkan: Add DP4A MMQ and Q8_1 quantization shader (#12135 )

2025-03-31 14:37:01 +02:00

labeler.yml

ci : add ubuntu cuda build, build with one arch on windows (#10456 )

2024-11-26 13:05:07 +01:00

pull_request_template.md

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00