What's Changed
- Refactor Gemma ctor and improve pool NUMA support by @copybara-service in https://github.com/google/gemma.cpp/pull/520
- Fix the prompt wrapping of gemma3-1b by @ufownl in https://github.com/google/gemma.cpp/pull/523
- Add note on attention length and SFP by @copybara-service in https://github.com/google/gemma.cpp/pull/521
- Add support for a secondary EOS token by @copybar...