Georgi Gerganov
d1031cf49c
sampling : refactor init to use llama_sampling_params (#3696)

* sampling : refactor init to use llama_sampling_params
* llama : combine repetition, frequency and presence penalties in 1 call
* examples : remove embd-input and gptneox-wip
* sampling : rename penalty params + reduce size of "prev" vector
* sampling : add llama_sampling_print helper
* sampling : hide prev behind API and apply #3661

ggml-ci
2023-10-20 21:07:23 +03:00
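
The commit message above mentions combining the repetition, frequency and presence penalties in 1 call. The sketch below illustrates what such a combined pass could look like. It is not the code from this commit: `Candidate`, `PenaltyParams` and `apply_penalties` are hypothetical names chosen for illustration, and the formulas are the commonly used definitions of the three penalties, assumed here rather than taken from the source.

```cpp
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical types for illustration only; not the llama.cpp API.
struct Candidate {
    int32_t id;    // token id
    float   logit; // unnormalized score
};

struct PenaltyParams {
    float repeat  = 1.0f; // 1.0 disables the repetition penalty
    float freq    = 0.0f; // 0.0 disables the frequency penalty
    float present = 0.0f; // 0.0 disables the presence penalty
};

// Apply all three penalties in a single pass over the candidates.
// `prev` holds the most recently generated token ids.
void apply_penalties(std::vector<Candidate> & candidates,
                     const std::vector<int32_t> & prev,
                     const PenaltyParams & p) {
    const bool disabled = p.repeat == 1.0f && p.freq == 0.0f && p.present == 0.0f;
    if (prev.empty() || disabled) {
        return;
    }

    // Count how often each token occurs in the recent history.
    std::unordered_map<int32_t, int> counts;
    for (int32_t tok : prev) {
        counts[tok]++;
    }

    for (Candidate & c : candidates) {
        const auto it = counts.find(c.id);
        if (it == counts.end()) {
            continue; // token not seen recently: leave the logit unchanged
        }
        const int n = it->second;

        // Repetition penalty: push the logit toward "less likely".
        // Scaling a negative logit by 1/repeat would make it more likely,
        // so negative logits are multiplied instead of divided.
        if (c.logit <= 0.0f) {
            c.logit *= p.repeat;
        } else {
            c.logit /= p.repeat;
        }

        // Frequency penalty grows with the count; presence penalty is flat.
        c.logit -= static_cast<float>(n) * p.freq + p.present;
    }
}
```

Doing the three adjustments together means the token history is counted once instead of once per penalty.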