- Also update package versions otherwise things don't compile out of the box (you need the version where `callAsFunction` is marked `open`)
- handle loading models with different names for the safetensors files (gemma) - handle merge tokens that can't be split - organize code into Load/Evaluate