Add Llama 3.1 (#98)

* Update Mistral 7B config
* Add Mistral NeMo
* Update for Llama 3.1
* Align LlamaConfiguration with Python implementation
* Fix model configuration names
* Refine DynamicNTKScalingRoPE
* compute base only once

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-07-26 22:05:42 +02:00
parent c4fda0e036
commit ac6bdfccec
3 changed files with 200 additions and 84 deletions
--- a/Applications/LLMEval/ContentView.swift
+++ b/Applications/LLMEval/ContentView.swift
@@ -159,7 +159,7 @@ class LLMEvaluator {

    /// this controls which model loads -- phi4bit is one of the smaller ones so this will fit on
    /// more devices
-    let modelConfiguration = ModelConfiguration.phi34bit
+    let modelConfiguration = ModelConfiguration.phi3_4bit

    /// parameters controlling the output
    let generateParameters = GenerateParameters(temperature: 0.6)