Skip to content

Commit aebab5f

Browse files
committed
fix mscale_all_dim
1 parent 249eda4 commit aebab5f

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

convert_hf_to_gguf.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9990,6 +9990,7 @@ def set_gguf_parameters(self):
99909990
MistralModel.set_mistral_config(self.gguf_writer, self.hparams)
99919991
yarn_params = self.hparams["yarn"]
99929992
self.gguf_writer.add_attn_temperature_length(yarn_params["original_max_position_embeddings"])
9993+
self.gguf_writer.add_rope_scaling_yarn_log_mul(0.1) # mscale_all_dim * 0.1
99939994

99949995
# TODO @ngxson : this should be in tensor_mapping, but I don't have time for now
99959996
# copied from https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/mistral_large_3.py

0 commit comments

Comments
 (0)