v3.9.0
3.9.0 (2025-06-04)
Features
- reasoning budget (#468) (ea8d904) (documentation: Set Reasoning Budget)
- SWA (Sliding Window Attention) support - greatly reduced context memory consumption on supported models (#468) (ea8d904)
- documentation: LLMs friendly
llms.mdandllms-full.mdfiles (#468) (ea8d904)
Bug Fixes
Shipped with llama.cpp release b5590
To use the latest
llama.cpprelease available, runnpx -n node-llama-cpp source download --release latest. (learn more)