@@ -149,35 +149,52 @@ Since batch launches use heterogeneous jobs, users can request different partiti
149149
150150# ## `status` command
151151
152- You can check the inference server status by providing the Slurm job ID to the `status` command :
152+ You can check the status of all inference servers launched through `vec-inf` by running the `status` command :
153+ ` ` ` bash
154+ vec-inf status`
155+ ```
156+
157+ And you should see an output like this:
158+ ```
159+ ┏━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━┓
160+ ┃ Job ID ┃ Model Name ┃ Status ┃ Base URL ┃
161+ ┡━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━┩
162+ │ 1434429 │ Qwen3-8B │ READY │ http://gpu113:8080/v1 │
163+ │ 1434584 │ Qwen3-14B │ READY │ http://gpu053:8080/v1 │
164+ │ 1435035+0 │ Qwen3-32B │ PENDING │ UNAVAILABLE │
165+ │ 1435035+1 │ Qwen3-14B │ PENDING │ UNAVAILABLE │
166+ └───────────┴────────────┴─────────┴───────────────────────┘
167+ ```
168+
169+ If you want to check why a specific job is pending or failing, append the job ID to the status command:
153170
154171``` bash
155- vec-inf status 15373800
172+ vec-inf status 1435035+1
156173```
157174
158175If the server is pending for resources, you should see an output like this:
159176
160177```
161- ┏━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ┓
162- ┃ Job Status ┃ Value ┃
163- ┡━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ┩
164- │ Model Name │ Meta-Llama-3.1-8B-Instruct │
165- │ Model Status │ PENDING │
166- │ Pending Reason │ Resources │
167- │ Base URL │ UNAVAILABLE │
168- └────────────────┴──────────────────────────── ┘
178+ ┏━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┓
179+ ┃ Job Status ┃ Value ┃
180+ ┡━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━┩
181+ │ Model Name │ Qwen3-14B │
182+ │ Model Status │ PENDING │
183+ │ Pending Reason │ Resources │
184+ │ Base URL │ UNAVAILABLE │
185+ └────────────────┴─────────────┘
169186```
170187
171188When the server is ready, you should see an output like this:
172189
173190```
174- ┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ┓
175- ┃ Job Status ┃ Value ┃
176- ┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ┩
177- │ Model Name │ Meta-Llama-3.1-8B-Instruct │
178- │ Model Status │ READY │
179- │ Base URL │ http://gpu042 :8080/v1 │
180- └──────────────┴──────────────────────────── ┘
191+ ┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━┓
192+ ┃ Job Status ┃ Value ┃
193+ ┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━┩
194+ │ Model Name │ Qwen3-14B │
195+ │ Model Status │ READY │
196+ │ Base URL │ http://gpu105 :8080/v1 │
197+ └──────────────┴───────────────────────┘
181198```
182199
183200There are 5 possible states:
0 commit comments