handle partially quantized models (#76)

* handle partially quantized models

- fixes #53, #71, #69, #74
- in order to test the models:
	- added a default prompt of an appropriate form
	- while working on the model configuration, also added additional stop tokens (#74)
- fixed the repetitionPenalty code (#71)
David Koski
2024-05-28 16:35:11 -07:00
committed by GitHub
parent 65f4968e5f
commit 9d74afd119
12 changed files with 139 additions and 67 deletions

@@ -16,7 +16,7 @@
"location" : "https://github.com/ml-explore/mlx-swift",
"state" : {
"branch" : "main",
-"revision" : "3c802c808d281c191d5f26f37a4f93135d8ca119"
+"revision" : "d6d9472da5bf7ec2654e8914bd1d15622f45b6a9"
}
},
{
@@ -61,7 +61,7 @@
"location" : "https://github.com/gonzalezreal/swift-markdown-ui",
"state" : {
"branch" : "main",
-"revision" : "723249a1ba361042812cf785244de94f11f7c8fd"
+"revision" : "c0daf6eb79d97964180f3113868c990bd1c4a007"
}
},
{