
Conversation

@philip-essential

This adds support for Rnj-1, which is an 8B model we just released. We've been using llama.cpp to play around with the model internally, and we released a GGUF checkpoint for the instruction-tuned version.

The model architecture is similar enough to Gemma3 that in Transformers/vLLM/SGLang we can reuse the same model file. However, llama.cpp needs some small changes, so I've added a new implementation, closely based on the Gemma3 one. The changes are:

  • All layers use global attention.
  • Long-context support comes via YaRN (see the illustrative config fragment below).
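
For concreteness, a hypothetical config.json fragment with both properties might look like the following. The key names follow Hugging Face conventions, but the values are placeholders, not taken from the released Rnj-1 checkpoint:

```json
{
  "architectures": ["Gemma3ForCausalLM"],
  "sliding_window": null,
  "rope_scaling": {
    "rope_type": "yarn",
    "factor": 8.0,
    "original_max_position_embeddings": 8192
  }
}
```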

Because our Hugging Face config.json uses "Gemma3ForCausalLM" as the architecture, convert_hf_to_gguf.py cannot tell that these configs are for Rnj-1. The workaround I came up with is to manually change the architecture to Rnj1ForCausalLM before converting the checkpoint; I added a note about this in convert_hf_to_gguf.py. But perhaps there's a better solution?
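
For reference, the manual workaround amounts to a one-off edit of the checkpoint's config.json before running the converter. A minimal sketch, with a placeholder path for a local copy of the checkpoint:

```python
# Rewrite the "architectures" field so convert_hf_to_gguf.py dispatches to the
# Rnj-1 implementation instead of the Gemma3 one.
import json
from pathlib import Path

cfg_path = Path("rnj-1-8b-it/config.json")  # placeholder local checkpoint path
cfg = json.loads(cfg_path.read_text())
cfg["architectures"] = ["Rnj1ForCausalLM"]  # was ["Gemma3ForCausalLM"]
cfg_path.write_text(json.dumps(cfg, indent=2))
```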

@CISC
Collaborator

CISC commented Dec 6, 2025

> Because our Hugging Face config.json uses "Gemma3ForCausalLM" as the architecture, convert_hf_to_gguf.py cannot tell that these configs are for Rnj-1. The workaround I came up with is to manually change the architecture to Rnj1ForCausalLM before converting the checkpoint; I added a note about this in convert_hf_to_gguf.py. But perhaps there's a better solution?

Instead, change llm_build_gemma3_iswa into a templated llm_build_gemma3, like e.g. smallthinker, and add support for YaRN and non-SWA in the Gemma3Model conversion.
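
A minimal sketch of the conversion half of this suggestion, assuming the config.json spelling shown earlier. The GGUFWriter methods used here exist in gguf-py; treating a missing sliding_window as "all layers global" is an assumption about how such a config would be spelled:

```python
import gguf

def write_gemma3_variant_overrides(writer: gguf.GGUFWriter, hparams: dict) -> None:
    # Non-SWA: if the config carries no sliding_window, record 0 so the
    # runtime can treat every layer as global attention (assumed convention).
    writer.add_sliding_window(hparams.get("sliding_window") or 0)

    # YaRN long-context: forward the rope_scaling block when present.
    rope_scaling = hparams.get("rope_scaling") or {}
    if rope_scaling.get("rope_type") == "yarn":
        writer.add_rope_scaling_type(gguf.RopeScalingType.YARN)
        writer.add_rope_scaling_factor(rope_scaling["factor"])
        writer.add_rope_scaling_orig_ctx_len(
            rope_scaling["original_max_position_embeddings"])
```

In the actual converter this logic would live in Gemma3Model.set_gguf_parameters rather than a free function; it is shown standalone here only to keep the sketch self-contained.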

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

model (Model specific), python (python script changes)
