Georgi Gerganov
0e89203b51
speculative : add tree-based sampling example (#3624)
* sampling : one sequence per sampling context
ggml-ci
* speculative : add tree-based sampling support
ggml-ci
* speculative : reuse the n_parallel CLI param
* speculative : refactor sampling
* examples : fix build after sampling refactoring
ggml-ci
* batched : fix n_seq_id
* sampling : fix malloc
ggml-ci
* swift : fix build
ggml-ci
* swift : try to fix build
ggml-ci
* prompts : add assistant.txt
* common : add llama_batch_add() and llama_batch_clear() helpers
* speculative : minor refactor
ggml-ci
* minor : comments + rename
ggml-ci
* speculative : fix off-by-one for n_drafted
* speculative : fix the n_drafted fix + p constants
2023-10-18 16:21:57 +03:00