Diego Devesa
a5e47592b6
cuda : optimize argmax (#10441)
* cuda : optimize argmax
* remove unused parameter
ggml-ci
* fixup : use full warps
ggml-ci
* Apply suggestions from code review
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* fix ub
* ggml : check ne00 <= INT32_MAX in argmax and argsort
---------
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2024-11-21 18:18:50 +01:00
..
2024-03-09 14:17:11 +02:00
2024-11-17 08:30:29 +02:00
2024-01-26 14:18:00 +02:00
2024-01-26 14:18:00 +02:00
2024-11-07 17:31:10 -04:00
2024-10-10 22:57:42 +02:00
2024-02-16 11:31:07 +02:00
2024-11-21 18:18:50 +01:00
2024-11-03 19:34:08 +01:00
2024-01-29 15:50:50 -05:00
2024-10-28 18:45:33 +01:00
2024-07-12 10:46:02 +03:00
2024-09-07 15:16:19 +03:00
2024-09-07 15:16:19 +03:00
2024-10-16 19:03:24 +03:00
2024-09-07 15:16:19 +03:00
2024-10-10 22:57:42 +02:00
2024-08-23 12:58:53 +02:00
2024-02-16 11:31:07 +02:00
2024-11-17 08:30:29 +02:00
2024-11-14 18:04:35 +01:00
2024-11-17 08:30:29 +02:00
2024-11-03 19:34:08 +01:00
2024-10-29 10:42:05 +02:00
2024-10-10 22:57:42 +02:00
2024-05-05 08:07:48 +03:00
2024-05-28 15:04:09 +03:00
2024-10-10 22:57:42 +02:00
2024-10-10 22:57:42 +02:00
2024-07-13 23:35:10 -04:00