- add replacement tokenizer class for unknown tokenizers - fix quantization for models that don't have lm_head quantized Requires https://github.com/ml-explore/mlx-swift/pull/28
Llama
This is a port of several models from:
You can use this to load models from huggingface, e.g.:
Currently supported model types are:
- Llama / Mistral
- Gemma
- Phi
See Configuration.swift for more info.
See llm-tool