v3.10.0
3.10.0 (2025-06-12)
Features
- JSON Schema Grammar:
$defsand$refsupport with full inferred types (#472) (9cdbce9) inspect ggufcommand: format and print the Jinja chat template with--key .chatTemplate(#472) (9cdbce9)
Bug Fixes
JinjaTemplateChatWrapper: first function call prefix detection (#472) (9cdbce9)QwenChatWrapper: improve Qwen chat template detection (#472) (9cdbce9)- apply
maxTokenson function calling parameters (#472) (9cdbce9) - adjust default prompt completion length based on SWA size when relevant (#472) (9cdbce9)
- improve thought segmentation syntax extraction (#472) (9cdbce9)
- adapt to
llama.cppchanges (#472) (9cdbce9)
Shipped with llama.cpp release b5640
To use the latest
llama.cpprelease available, runnpx -n node-llama-cpp source download --release latest. (learn more)