llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-04-23 20:46:06 +00:00

History

Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858 )

* gguf-py, llama : add constants and methods related to Llama-3.1 <|eom_id|> token

* llama : find Llama-3.1 <|eom_id|> token id during vocab loading

* llama-vocab : add Llama-3.1 <|eom_id|> token to the set of tokens stopping the generation

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

2024-08-05 09:38:01 +02:00

__init__.py

convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499 )

2024-07-18 20:40:15 +10:00

constants.py

Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858 )

2024-08-05 09:38:01 +02:00

gguf_reader.py

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

gguf_writer.py

Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858 )