Commit Graph

4 Commits

Author      | SHA1       | Message | Date
------------|------------|---------|-----
David Koski | 0fb74cbfdc | adopt MLXFast.scaledDotProductAttention (#23) | 2024-03-12 14:04:43 -07:00
David Koski | c7919cf7fe | fix rmsnorm for gemma | 2024-02-26 14:09:48 -08:00
David Koski | bb7bacc077 | fix for #2 -- CodeLlama crashes: add replacement tokenizer class for unknown tokenizers; fix quantization for models that don't have lm_head quantized. Requires https://github.com/ml-explore/mlx-swift/pull/28 | 2024-02-26 10:38:05 -08:00
David Koski | b6d1e14465 | initial commit | 2024-02-22 10:41:02 -08:00