Commit Graph

59 Commits

Author SHA1 Message Date
Anchen
c27208812d chore: add repetition_penalty example (#45) 2024-04-04 15:15:50 -07:00
Anchen
2d0fdfe3a9 chore(llm-tool): add the top_p option in the llm-tool (#41)
* chore: add top p option in llm-tool
* chore: wire up the top p with async generate
2024-04-03 07:54:54 -07:00
Ronald Mannak
b3eb428c60 Add Package.swift (#39)
* Add Package.swift for LLM and MNIST
* Make ModelType properties public
* Make ModelType method createModel public
* Add installation instructions to readme
2024-04-01 11:04:02 -07:00
Florent Morin
e48e2ce2c9 Append visionOS support to LLMEval (#43)
* Update `mlx-swift` to last revision

* Add Apple Vision Target

* Update visionOS UI
2024-03-31 20:48:46 -07:00
David Koski
0199407d93 LLMEval performance (#40)
* notes about performance and some performance improvements (don't update the display for every token)

* swift-format

* Update Applications/LLMEval/README.md

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>

* Update Applications/LLMEval/README.md

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>

---------

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2024-03-28 12:00:52 -07:00
Awni Hannun
15b38cd146 Use fast (#38)
* update to latest mlx swift and use fast norms
* gpu usage -> memory usage
2024-03-27 16:37:35 -07:00
Anchen
3314e20a24 chore: add top_p sampling example (#34) 2024-03-26 12:44:13 -07:00
Anchen
63531bc5fa feat: add command r model support (#35)
* feat: add command r model support
2024-03-26 12:43:10 -07:00
Ashraful Islam
c37018d7d2 feat: adds gpu usages stat in the toolbar (#36)
* feat: adds gpu usages stat in the toolbar
2024-03-25 10:29:54 -07:00
David Koski
452b49aef0 fixed height for the progress view, produce more tokens (#33) 2024-03-19 08:56:37 -07:00
David Koski
857a10de91 make the mnist app/library targets match each other (#32)
* make the mnist app/library targets match each other
* switch mnist tool to 14.2 as well
2024-03-19 08:56:22 -07:00
Rounak
9e18eaa479 Add MNIST Digit Prediction/Inference (#22)
* Add Prediction to MNISTTrainer
2024-03-18 19:18:41 -07:00
Loc Bui
0588abec77 fix: Tokenizer dependency (#30) 2024-03-18 12:57:04 -07:00
Ashraful Islam
a7b2b54f18 LLMEval UI Improvements (#27)
* Feat: LLMEval UI Improvements

1. adds Markdown rendering in the UI
2. Adds init time and token/second stat
3. Minor UI enhancements

* feat: adds a copy to clipboard button for llm outputs

* adds scrollviewreader to sync with main

* ran pre-format to resolve formatting issues

* updates the missing dependency in project definition

* feat: switch between plain text and markdown

adds a segemented picker to switch between plain text and markdown
2024-03-18 09:15:50 -07:00
Pedro Cuenca
6c270a9d12 Remove Tokenizer wrapper (#28)
* Remove Tokenizer wrapper

Depends on https://github.com/huggingface/swift-transformers/pull/70

* Update swift-transformers revision
2024-03-18 09:07:52 -07:00
David Koski
0af2efe8a0 switch swift-tokenizers to main, remove some workarounds (#26)
* switch swift-tokenizers to main, remove some workarounds

- swift-tokenizers is getting a lot of updates and fixes, let's track main for now
- remove some workarounds that are no longer needed

- https://github.com/huggingface/swift-transformers/issues/63
2024-03-14 13:18:57 -07:00
David Koski
ac273a14ea fix float types in Phi (use float16) (#25)
- per suggestions in #23 ensure that the values that go into the cache are float16
2024-03-14 13:18:40 -07:00
David Koski
a1431e7155 scroll to bottom when text is generated (#24)
- also restore circleci
2024-03-14 13:18:28 -07:00
David Koski
0fb74cbfdc adopt MLXFast.scaledDotProductAttention (#23) 2024-03-12 14:04:43 -07:00
John Mai
a94bf79d7e feat: Support Starcoder2 (#20)
* feat: Support Starcoder2
2024-03-07 21:28:37 -08:00
Madroid Ma
e876e18605 update qwen2 chat template (#18) 2024-03-07 07:51:54 -08:00
John Mai
c4a1e1d108 fix: typo (#16) 2024-03-06 08:42:08 -08:00
David Koski
61105bf0c4 use memory limit API (#13)
* add buffer cache limit

* swift-format

* a more reasonable size

* add memory stats to command line tool, update to final api

* add note about changing models
2024-03-05 15:22:12 -08:00
David Koski
430b464c8d pick up ios build fix (#15) 2024-03-04 20:26:37 -08:00
Awni Hannun
4ed4ec69e7 LeNet on MNIST + readme update (#12)
* LeNet on MNIST + readme update

* tanh + remove device toggle

* remove device entirely
2024-03-04 14:16:20 -08:00
David Koski
dfc9f2fc01 apply swift-format 2024-03-03 18:40:49 -08:00
David Koski
0f454999a4 Merge pull request #11 from maiqingqiang/feat-qwen2
feat: Qwen2 support
2024-03-03 18:37:29 -08:00
John Mai
66d9202360 feat: Qwen2 support 2024-03-03 22:26:28 +08:00
David Koski
7b746cb89c allow alternate location for tokenizer 2024-03-01 23:27:03 -08:00
David Koski
fe116f857d swift-format 2024-03-01 23:26:25 -08:00
David Koski
98ff923d91 use temperature 0.6 for command line examples as well (match python) 2024-03-01 23:23:32 -08:00
David Koski
33d4b6f57e make the generated output a little more interesting 2024-03-01 22:56:28 -08:00
David Koski
ff7a615db7 improve phi prompt -- partial fix for #9 2024-03-01 22:45:01 -08:00
David Koski
23fc53c43e allow selection of output 2024-03-01 22:44:33 -08:00
David Koski
9cea67f7bc add missing path to Info.plist -- fix #10 2024-03-01 22:20:29 -08:00
David Koski
19b80a6cee don't build applications -- codesign fails with no identity 2024-03-01 19:51:02 -08:00
David Koski
82b86e4484 fix https://github.com/ml-explore/mlx-swift-examples/issues/8 -- do not specify info plist location 2024-03-01 19:49:19 -08:00
David Koski
2efd5575c7 remove dev team -- this needs to be supplied 2024-03-01 16:58:48 -08:00
David Koski
0374e4b073 update documentation 2024-03-01 16:33:49 -08:00
David Koski
807c8136c0 update README 2024-03-01 16:13:29 -08:00
David Koski
c49dd73c28 swift-format, circleci setup 2024-03-01 16:10:34 -08:00
David Koski
b41f14fba7 add LLM evaluator example
- runs on iOS and macOS
- downloads a model / tokenizer from hugging face
- evaluates the given prompt
2024-03-01 16:10:00 -08:00
David Koski
79e0620891 add MNIST training example 2024-03-01 15:55:36 -08:00
David Koski
2157333905 swift-format! 2024-03-01 14:47:43 -08:00
David Koski
82f6a969d4 llm improvements
- document the tokenizer used (https://github.com/huggingface/swift-transformers)
- provide a hook for tokenizer configuration, prompt augmentation
	- this isn't as rich as the python equivalents but it helps a little
2024-03-01 14:46:32 -08:00
David Koski
599661774a Merge pull request #6 from rudro/rudro_fixmain
Fix position of @main annotation
2024-02-27 13:32:39 -08:00
Rudro Samanta
362f1c8ad8 Fix position of @main annotation
- Also update package versions otherwise things don't compile out of the box
  (you need the version where `callAsFunction` is marked `open`)
2024-02-27 13:02:13 -08:00
David Koski
3f02fcc1cb expose eosToken 2024-02-26 14:58:51 -08:00
David Koski
4fad86d84b split tokenizer code out into new file 2024-02-26 14:42:40 -08:00
David Koski
c7919cf7fe fix rmsnorm for gemma 2024-02-26 14:09:48 -08:00