mlx-swift-examples

Author	SHA1	Message	Date
David Koski	9d74afd119	handle partially quantized models (#76 ) * handle partially quantized models - fix for #53 #71 #69 #74 - in order to test the models - I added a default prompt of an appropriate form - while working on the model configuration also added additional stop tokens (#74) - fixed the repetitionPenalty code (#71)	2024-05-28 16:35:11 -07:00
Anthony DePasquale	65f4968e5f	Fix download progress (#78 )	2024-05-28 14:05:37 -07:00
David Koski	6c0b66f90a	implement LoRA / QLoRA (#46 ) * implement LoRA / QLoRA - example of using MLX to fine-tune an LLM with low rank adaptation (LoRA) for a target task - see also https://arxiv.org/abs/2106.09685 - based on https://github.com/ml-explore/mlx-examples/tree/main/lora * add some command line flags I found useful during use - --quiet -- don't print decorator text, just the generated text - --prompt @/tmp/file.txt -- load prompt from file * user can specify path to model OR model identifier in huggingface * update mlx-swift reference Co-authored-by: Ashraful Islam <ashraful.meche@gmail.com> Co-authored-by: JustinMeans <46542161+JustinMeans@users.noreply.github.com>	2024-04-22 09:30:12 -07:00
David Koski	2157333905	swift-format!	2024-03-01 14:47:43 -08:00
David Koski	82f6a969d4	llm improvements - document the tokenizer used (https://github.com/huggingface/swift-transformers) - provide a hook for tokenizer configuration, prompt augmentation - this isn't as rich as the python equivalents but it helps a little	2024-03-01 14:46:32 -08:00
David Koski	4fad86d84b	split tokenizer code out into new file	2024-02-26 14:42:40 -08:00
David Koski	a2ff291608	add reference to filed issue	2024-02-26 13:31:33 -08:00
David Koski	c86d1c195e	partial fix for #1 - handle loading models with different names for the safetensors files (gemma) - handle merge tokens that can't be split - organize code into Load/Evaluate	2024-02-26 13:23:21 -08:00

8 Commits