Files

David Koski 0af2efe8a0 switch swift-tokenizers to main, remove some workarounds (#26 )

* switch swift-tokenizers to main, remove some workarounds

- swift-tokenizers is getting a lot of updates and fixes, let's track main for now
- remove some workarounds that are no longer needed

- https://github.com/huggingface/swift-transformers/issues/63

2024-03-14 13:18:57 -07:00

Configuration.swift

feat: Support Starcoder2 (#20 )

2024-03-07 21:28:37 -08:00

Evaluate.swift

partial fix for #1

2024-02-26 13:23:21 -08:00

Gemma.swift

adopt MLXFast.scaledDotProductAttention (#23 )

2024-03-12 14:04:43 -07:00

Llama.swift

adopt MLXFast.scaledDotProductAttention (#23 )

2024-03-12 14:04:43 -07:00

LLM.h

initial commit

2024-02-22 10:41:02 -08:00

LLMModel.swift

fix for #2 -- CodeLlama crashes

2024-02-26 10:38:05 -08:00

Load.swift

swift-format!

2024-03-01 14:47:43 -08:00

Models.swift

update qwen2 chat template (#18 )

2024-03-07 07:51:54 -08:00

Phi.swift

fix float types in Phi (use float16) (#25 )

2024-03-14 13:18:40 -07:00

Qwen2.swift

adopt MLXFast.scaledDotProductAttention (#23 )

2024-03-12 14:04:43 -07:00

README.md

llm improvements

2024-03-01 14:46:32 -08:00

Starcoder2.swift

adopt MLXFast.scaledDotProductAttention (#23 )

2024-03-12 14:04:43 -07:00

Tokenizer.swift

switch swift-tokenizers to main, remove some workarounds (#26 )

2024-03-14 13:18:57 -07:00

README.md

Llama

This is a port of several models from:

https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/models/

using the Hugging Face swift transformers package to provide tokenization:

https://github.com/huggingface/swift-transformers

The Models.swift provides minor overrides and customization -- if you require overrides for the tokenizer or prompt customizations they can be added there.

This is set up to load models from Hugging Face, e.g. https://huggingface.co/mlx-community

The following models have been tried:

mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
mlx-community/CodeLlama-13b-Instruct-hf-4bit-MLX
mlx-community/phi-2-hf-4bit-mlx
mlx-community/quantized-gemma-2b-it

Currently supported model types are:

Llama / Mistral
Gemma
Phi

See Configuration.swift for more info.

See llm-tool