v0.1.4
What's Changed
- Refactor Gemma ctor and improve pool NUMA support by @copybara-service in https://github.com/google/gemma.cpp/pull/520
- Fix the prompt wrapping of gemma3-1b by @ufownl in https://github.com/google/gemma.cpp/pull/523
- Add note on attention length and SFP by @copybara-service in https://github.com/google/gemma.cpp/pull/521
- Add support for a secondary EOS token by @copybara-service in https://github.com/google/gemma.cpp/pull/525
- Update app argument documentation by @copybara-service in https://github.com/google/gemma.cpp/pull/526
- Set the secondary EOS for Gemma2 by @ufownl in https://github.com/google/gemma.cpp/pull/527
Full Changelog: https://github.com/google/gemma.cpp/compare/v0.1.3...v0.1.4