mlx-swift-examples

Author	SHA1	Message	Date
Awni Hannun	15b38cd146	Use fast (#38 ) * update to latest mlx swift and use fast norms * gpu usage -> memory usage	2024-03-27 16:37:35 -07:00
Anchen	3314e20a24	chore: add top_p sampling example (#34 )	2024-03-26 12:44:13 -07:00
Anchen	63531bc5fa	feat: add command r model support (#35 ) * feat: add command r model support	2024-03-26 12:43:10 -07:00
Pedro Cuenca	6c270a9d12	Remove Tokenizer wrapper (#28 ) * Remove Tokenizer wrapper Depends on https://github.com/huggingface/swift-transformers/pull/70 * Update swift-transformers revision	2024-03-18 09:07:52 -07:00
David Koski	0af2efe8a0	switch swift-tokenizers to main, remove some workarounds (#26 ) * switch swift-tokenizers to main, remove some workarounds - swift-tokenizers is getting a lot of updates and fixes, let's track main for now - remove some workarounds that are no longer needed - https://github.com/huggingface/swift-transformers/issues/63	2024-03-14 13:18:57 -07:00
David Koski	ac273a14ea	fix float types in Phi (use float16) (#25 ) - per suggestions in #23 ensure that the values that go into the cache are float16	2024-03-14 13:18:40 -07:00
David Koski	0fb74cbfdc	adopt MLXFast.scaledDotProductAttention (#23 )	2024-03-12 14:04:43 -07:00
John Mai	a94bf79d7e	feat: Support Starcoder2 (#20 ) * feat: Support Starcoder2	2024-03-07 21:28:37 -08:00
Madroid Ma	e876e18605	update qwen2 chat template (#18 )	2024-03-07 07:51:54 -08:00
David Koski	dfc9f2fc01	apply swift-format	2024-03-03 18:40:49 -08:00
John Mai	66d9202360	feat: Qwen2 support	2024-03-03 22:26:28 +08:00
David Koski	7b746cb89c	allow alternate location for tokenizer	2024-03-01 23:27:03 -08:00
David Koski	ff7a615db7	improve phi prompt -- partial fix for #9	2024-03-01 22:45:01 -08:00
David Koski	c49dd73c28	swift-format, circleci setup	2024-03-01 16:10:34 -08:00
David Koski	b41f14fba7	add LLM evaluator example - runs on iOS and macOS - downloads a model / tokenizer from hugging face - evaluates the given prompt	2024-03-01 16:10:00 -08:00
David Koski	2157333905	swift-format!	2024-03-01 14:47:43 -08:00
David Koski	82f6a969d4	llm improvements - document the tokenizer used (https://github.com/huggingface/swift-transformers) - provide a hook for tokenizer configuration, prompt augmentation - this isn't as rich as the python equivalents but it helps a little	2024-03-01 14:46:32 -08:00
David Koski	3f02fcc1cb	expose eosToken	2024-02-26 14:58:51 -08:00
David Koski	4fad86d84b	split tokenizer code out into new file	2024-02-26 14:42:40 -08:00
David Koski	c7919cf7fe	fix rmsnorm for gemma	2024-02-26 14:09:48 -08:00
David Koski	a2ff291608	add reference to filed issue	2024-02-26 13:31:33 -08:00
David Koski	c86d1c195e	partial fix for #1 - handle loading models with different names for the safetensors files (gemma) - handle merge tokens that can't be split - organize code into Load/Evaluate	2024-02-26 13:23:21 -08:00
David Koski	bb7bacc077	fix for #2 -- CodeLlama crashes - add replacement tokenizer class for unknown tokenizers - fix quantization for models that don't have lm_head quantized Requires https://github.com/ml-explore/mlx-swift/pull/28	2024-02-26 10:38:05 -08:00
David Koski	5a83d7d92b	fix broken links, clarify documentation	2024-02-22 12:46:44 -08:00
David Koski	b6d1e14465	initial commit	2024-02-22 10:41:02 -08:00

25 Commits