David Koski
|
ac273a14ea
|
fix float types in Phi (use float16) (#25)
- per suggestions in #23 ensure that the values that go into the cache are float16
|
2024-03-14 13:18:40 -07:00 |
|
David Koski
|
0fb74cbfdc
|
adopt MLXFast.scaledDotProductAttention (#23)
|
2024-03-12 14:04:43 -07:00 |
|
David Koski
|
bb7bacc077
|
fix for #2 -- CodeLlama crashes
- add replacement tokenizer class for unknown tokenizers
- fix quantization for models that don't have lm_head quantized
Requires https://github.com/ml-explore/mlx-swift/pull/28
|
2024-02-26 10:38:05 -08:00 |
|
David Koski
|
b6d1e14465
|
initial commit
|
2024-02-22 10:41:02 -08:00 |
|