New
Version 1.2.0
1. ResponsesAPI
- Unified function/endpoint combining chat simplicity with tool use and state management.
- Out-of-the-box tool support - It natively supports first-party tools like
web_search,file_search, andcomputer_use, enabling you to invoke these capabilities without additional orchestration. - Built-in multi-turn conversation chaining- Use the
previous_response_idparameter to link requests into a chain of turns, and theinstructionsparameter to inject or override system/developer messages on a per-call basis. - Multimodal input and output - Beyond text, the API accepts images and audio in the same request, letting you build fully multimodal, tool-augmented experiences in a single call.
2. Core and OpenAI Enhancements
- JSON mode handling improvements and fallback json-repair implementation (port of
json-repairby @mangiucugna) - New models:
o3,o4-mini,gpt-4.1, andgpt-4.5series - Web search support (
gpt-4o-search-preview) - Chat completion parameters expanded (
store,reasoning_effort,service_tier,parallel_tool_calls,metadata) - Streaming and non-streaming IO conversion adapters developed and enhanced
- Token counting updated (
jtokkit v1.1.0) - Usage analytics improved
3. Anthropic Platform Enhancements
- Thinking and streaming settings integration
- Claude 3.7 Sonnet (Vanilla and via Bedrock)
- Citations handling, text blocks encoding improvements
- Caching support
- Enhanced token-limit error handling and mapped Anthropic to OpenAI exceptions
- A ton of new examples (also for Vision and PDF processing)
4. Google Gemini Integration
- New Google Gemini module and models introduced (Gemini 2.5 / 2.0 Pro and Flash)
- Gemini JSON schema handling improved, including OpenAI wrapper integration
- System message caching, domain content management, and usage tracking adjustments
- Btw. Google Vertex now also supports JSON schema mode
5. Perplexity Sonar Integration
- New Perplexity Sonar module and models introduced (sonar-deep-research, reasoning-pro, sonar-pro, etc.)
- Sonar JSON and regex response support, and citations formatting/handling
- OpenAI chat completion wrappers
6. Other Providers: Deepseek, Groq, Grok, FireworksAI, and Novita
- Groq JSON handling unified and adjusted, with
deepseek-r1-distill-llama-70bintegration - JSON schema handling for Grok models
- FireworksAI improvements (document inlining), Deepseek model integrations
- Message conversions, filtering thinking tokens, reasoning effort examples
- Llama 4 family
- New Deepseek models (deepseek-r1, DeepSeek-R1 distill) across providers (FireworksAI, Groq, Together AI), plus other models such as Phi-3-vision-128k-instruct, Deepseek-v2-lite-chat, and Llama-3.3-70b
- New chat completion provider:
Novita— Welcome to the family!
7. General Project Setup and CI/CD
- Build setup adjustments (build.sbt registrations, env helpers)
- GitHub CI - upload-artifact version bump (to v4)
- Example datasets added (e.g., norway_wiki dump), imports optimized
- README extended with more examples