- handle loading models with different names for the safetensors files (gemma) - handle merge tokens that can't be split - organize code into Load/Evaluate
Llama
This is a port of several models from:
You can use this to load models from huggingface, e.g.:
Currently supported model types are:
- Llama / Mistral
- Gemma
- Phi
See Configuration.swift for more info.
See llm-tool