Andrew Godfrey
947f64f163
finetune : zero the loraB initial vectors (#4082)
* finetune : zero the loraB initial vectors
Without this, the first iteration starts out far from the base model instead of exactly on it.
Zeroing loraB is what the LoRA paper recommends. loralib also zeroes at least one of the init vector pairs
(though in some cases it departs from the paper by using a different distribution for the other vector).
* tabs to spaces
* Use ggml_set_zero instead of adding a new function
2023-11-17 11:23:11 +01:00
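
The reasoning in the commit message can be checked numerically. The sketch below is plain Python, not the finetune code itself (which, per the message above, uses ggml_set_zero). The helper names `matmul`, `init_lora`, `lora_effective_weight`, and `max_abs_diff`, the Gaussian initialization, and the tiny matrix sizes are all assumptions made for illustration. It assumes the usual LoRA form, where the effective weight is W + B·A, and compares a zeroed B against a random B.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def init_lora(n_out, n_in, rank, zero_b, rng):
    """Hypothetical initializer: A is always random; B is zero or random."""
    lora_a = [[rng.gauss(0.0, 1.0) for _ in range(n_in)] for _ in range(rank)]
    if zero_b:
        lora_b = [[0.0] * rank for _ in range(n_out)]
    else:
        lora_b = [[rng.gauss(0.0, 1.0) for _ in range(rank)] for _ in range(n_out)]
    return lora_a, lora_b


def lora_effective_weight(w, lora_a, lora_b):
    """Return W + B @ A, the weight used once the adapter is applied."""
    delta = matmul(lora_b, lora_a)
    return [[wij + dij for wij, dij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def max_abs_diff(x, y):
    """Largest element-wise absolute difference between two matrices."""
    return max(abs(p - q) for x_row, y_row in zip(x, y) for p, q in zip(x_row, y_row))


rng = random.Random(0)
n_out, n_in, rank = 4, 3, 2

# Stand-in for a base model weight matrix.
w = [[rng.gauss(0.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]

# Zeroed B: the adapter contributes nothing before training.
a_zero, b_zero = init_lora(n_out, n_in, rank, zero_b=True, rng=rng)
w_zero = lora_effective_weight(w, a_zero, b_zero)

# Random B: the adapter perturbs the base weights before any training step.
a_rand, b_rand = init_lora(n_out, n_in, rank, zero_b=False, rng=rng)
w_rand = lora_effective_weight(w, a_rand, b_rand)

print("zeroed B, max |W_eff - W|:", max_abs_diff(w, w_zero))
print("random B, max |W_eff - W|:", max_abs_diff(w, w_rand))
```

With B zeroed the product B·A is the zero matrix, so the first difference is exactly 0.0 and training begins on the base model. With a random B the second difference is nonzero, which is the "starting out far from the base model" case the commit fixes. A stays random in both cases so that the gradient with respect to B is not zero at the first step.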