llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-17 20:16:09 +00:00

History

llama : add PLM GGUF Conversion & Inference Support (#12457 )

* add edgellm model arch[conversation feature doesn't work]

* remove output.weight layer for edgellm arch

* [Model] update the name of the model

* update the name of model arch in convert gguf

* [Model] Refarctor the model arch into llama-model

* [Bug] Fix the bug in create attn kv

* [Code] Fix editorconfig erros

* [Code] Remove Trailing whitespace

* [Code] Remove Trailing whitespace

* [Code] Change the order of model arch in list

* [Code] Fix flake8 Lint errors

* Remove trailing white space

* [Code] Remove  call in model arch

2025-03-27 12:49:15 +02:00

scripts

Refactor gguf scripts to improve metadata handling (#11909 )

2025-02-26 08:04:48 -05:00

__init__.py

convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499 )

2024-07-18 20:40:15 +10:00

constants.py

llama : add PLM GGUF Conversion & Inference Support (#12457 )

2025-03-27 12:49:15 +02:00

gguf_reader.py

Refactor gguf scripts to improve metadata handling (#11909 )