Pierrick Hymbert
4bd0f93e4a
model: support arch DbrxForCausalLM (#6515)
* model: dbrx convert to gguf #6344
* llama: support dbrx #6344
* doc: dbrx: add the model as supported
* scripts: get-wikitext-2 add unzip
* llama: increase maximum experts allowed
* llama: factorize moe graph implementation between grok, mixtral and dbrx
---------
Co-authored-by: Megha Agarwal <16129366+megha95@users.noreply.github.com>
2024-04-13 11:33:52 +02:00