Add Llama 3.1 (#98)
* Update Mistral 7B config * Add Mistral NeMo * Update for Llama 3.1 * Align LlamaConfiguration with Python implementation * Fix model configuration names * Refine DynamicNTKScalingRoPE * compute base only once --------- Co-authored-by: Awni Hannun <awni@apple.com>
This commit is contained in:
@@ -159,7 +159,7 @@ class LLMEvaluator {
|
||||
|
||||
/// this controls which model loads -- phi4bit is one of the smaller ones so this will fit on
|
||||
/// more devices
|
||||
let modelConfiguration = ModelConfiguration.phi34bit
|
||||
let modelConfiguration = ModelConfiguration.phi3_4bit
|
||||
|
||||
/// parameters controlling the output
|
||||
let generateParameters = GenerateParameters(temperature: 0.6)
|
||||
|
||||
Reference in New Issue
Block a user