Commit bfc2d6a

Add Llama 4 Maverick
1 parent d967a90 commit bfc2d6a

1 file changed, +13 -0 lines changed

vec_inf/config/models.yaml

Lines changed: 13 additions & 0 deletions
@@ -1068,3 +1068,16 @@ models:
       --tensor-parallel-size: 4
       --pipeline-parallel-size: 2
       --max-model-len: 40960
+  Llama-4-Maverick-17B-128E-Instruct:
+    model_family: Llama-4
+    model_variant: Maverick-17B-128E-Instruct
+    model_type: VLM
+    gpus_per_node: 4
+    num_nodes: 8
+    vocab_size: 202048
+    time: 03:00:00
+    resource_type: l40s
+    vllm_args:
+      --max-model-len: 16384
+      --tensor-parallel-size: 4
+      --pipeline-parallel-size: 8
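
The new entry allocates 4 GPUs on each of 8 nodes (32 in total), and its vLLM arguments request a tensor-parallel size of 4 and a pipeline-parallel size of 8, whose product is also 32. A minimal Python sketch of that arithmetic follows. The `entry` dict and the `gpu_counts` helper are hypothetical, written only for illustration, and are not part of vec_inf; the sketch also assumes that the number of GPUs the engine needs is the product of the two parallel sizes.

```python
# Hypothetical sanity check for a models.yaml entry; not part of vec_inf.
# The dict mirrors the fields added in this commit.
entry = {
    "gpus_per_node": 4,
    "num_nodes": 8,
    "vllm_args": {
        "--max-model-len": 16384,
        "--tensor-parallel-size": 4,
        "--pipeline-parallel-size": 8,
    },
}


def gpu_counts(entry):
    """Return (GPUs allocated, GPUs the parallel layout needs)."""
    allocated = entry["gpus_per_node"] * entry["num_nodes"]
    args = entry["vllm_args"]
    # Assumption: a missing flag means a parallel degree of 1.
    tp = int(args.get("--tensor-parallel-size", 1))
    pp = int(args.get("--pipeline-parallel-size", 1))
    return allocated, tp * pp


allocated, required = gpu_counts(entry)
if allocated != required:
    print(f"mismatch: {allocated} GPUs allocated, {required} required")
else:
    print(f"ok: {allocated} GPUs")
```

Under these assumptions, the tensor-parallel size equals `gpus_per_node`, so each pipeline stage would fit on one node and the 8 stages would map onto the 8 nodes.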
