v3.14.0
3.14.0 (2025-10-02)
Features
- Qwen3 Reranker support (#506) (00305f7) (see #506 for prequantized Qwen3 Reranker models you can use)
Bug Fixes
- handle HuggingFace rate limit responses (#506) (00305f7)
- adapt to llama.cpp breaking changes (#506) (00305f7)
Shipped with llama.cpp release b6673
To use the latest llama.cpp release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)