You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The example provided above is for the Vector Killarney cluster, change the variables accordingly for your environment. The IP address for the compute nodes on Killarney follow `10.1.1.XX` pattern, where `XX` is the GPU number (`kn029` -> `29` in this example).
135
+
136
+
## Reference
137
+
If you found Vector Inference useful in your research or applications, please cite using the following BibTeX template:
138
+
```
139
+
@software{vector_inference,
140
+
title = {Vector Inference: Efficient LLM inference on Slurm clusters using vLLM},
0 commit comments