Georgi Gerganov
e0dbec0bc6
llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)
* llama : refactor llama_context, llama_kv_cache, llm_build_context
ggml-ci
* graph : don't mutate the KV cache during defrag
ggml-ci
* context : reduce virtuals + remove test function
ggml-ci
* context : move interface implementation to source file + factory
ggml-ci
* graph : move KV cache build functions to llama_context impl
ggml-ci
* graph : remove model reference from build_pooling
ggml-ci
* graph : remove llama_model reference
ggml-ci
* kv_cache : provide rope factors
ggml-ci
* graph : rework inputs to use only unique_ptr, remove attn input abstraction
ggml-ci
* context : remove llama_context_i abstraction
ggml-ci
* context : clean-up
ggml-ci
* graph : clean-up
ggml-ci
* llama : remove redundant keywords (struct, enum)
ggml-ci
* model : adapt gemma3
ggml-ci
* graph : restore same attention ops as on master
ggml-ci
* llama : remove TODO + fix indent
ggml-ci
2025-03-13 12:35:44 +02:00
..
2025-01-12 11:32:42 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-01-12 11:32:42 +02:00
2025-03-13 12:35:44 +02:00
2024-12-04 23:19:20 +01:00
2025-03-13 12:35:44 +02:00
2025-01-12 11:32:42 +02:00
2025-03-10 13:34:13 +02:00
2025-01-30 19:13:58 +00:00
2024-11-29 21:54:58 +01:00
2025-01-07 18:01:58 +01:00
2025-01-07 18:01:58 +01:00
2025-01-15 18:28:35 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2024-06-13 00:41:52 +01:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-04 18:53:26 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2024-12-23 10:39:30 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-01-12 11:32:42 +02:00
2025-03-13 12:35:44 +02:00
2025-02-15 16:40:57 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-02-24 22:33:23 +08:00
2025-01-12 11:32:42 +02:00
2025-03-04 18:53:26 +02:00
2023-03-29 20:21:09 +03:00
2024-06-13 00:41:52 +01:00
2024-11-09 09:06:54 +02:00
2024-06-13 00:41:52 +01:00
2024-06-13 00:41:52 +01:00
2024-12-18 19:27:21 +02:00
2024-11-13 21:10:38 +11:00
2024-07-07 15:04:39 -04:00
2025-03-05 13:05:13 +00:00
2025-02-15 16:40:57 +02:00
2023-08-30 09:50:55 +03:00
2024-06-13 00:41:52 +01:00
2025-02-15 16:40:57 +02:00
2024-07-14 19:51:21 -04:00
2024-06-13 00:41:52 +01:00
2024-07-05 07:53:33 +03:00
2024-07-07 15:04:39 -04:00
2024-06-13 00:41:52 +01:00
2024-04-12 19:43:38 +01:00