New
0.22.0
LlamaBarn now uses llama-server in Router Mode. The server stays running in the background and loads models automatically when they are needed. You no longer have to manually select a model before using it. This version also adds an optional "Unload when idle" setting that automatically removes models from memory after a period of inactivity.
- Migrate to llama-server Router Mode
- Introduce "Unload when idle" setting for memory management
- Add automatic download retry logic with exponential backoff
- Add error reporting for failed model downloads