Commit 97108ec

x574chen and Xiaotong Chen authored
update multimodal doc and requirement (#47)
Co-authored-by: Xiaotong Chen <“cxt459847@alibaba-inc.com”>
1 parent 1b2a6ad commit 97108ec

4 files changed: +10 -5 lines changed

docs/sphinx/vlm/vlm_offline_inference_en.rst

Lines changed: 7 additions & 3 deletions
@@ -107,15 +107,19 @@ Launching with CLI
 You can also opt to install dashinfer-vlm locally and use command line to launch server.

 1. Pull dashinfer docker image (see :ref:`docker-label`)
-2. Download and extract the TensorRT GA build
+2. Install TensorRT Python package, and download TensorRT GA build from NVIDIA Developer Zone.
+
+Example: TensorRT 10.6.0.26 for CUDA 12.6, Linux x86_64

 .. code-block:: bash

+pip install tensorrt
 wget https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.6.0/tars/TensorRT-10.6.0.26.Linux.x86_64-gnu.cuda-12.6.tar.gz
 tar -xvzf TensorRT-10.6.0.26.Linux.x86_64-gnu.cuda-12.6.tar.gz
-export TRT_LIBPATH=`pwd`/TensorRT-10.6.0.26
+export LD_LIBRARY_PATH=`pwd`/TensorRT-10.6.0.26/lib

-3. Install ``dashinfer-vlm``: ``pip install dashinfer-vlm``.
+3. Install dashinfer Python Package from `release <https://github.com/modelscope/dash-infer/releases>`_
+4. Install dashinfer-vlm: ``pip install dashinfer-vlm``.

 Now you can launch server with command line:
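The four setup steps in this hunk can be sanity-checked before launching the server. The following is a minimal sketch and not part of the commit. It assumes the import names ``tensorrt``, ``dashinfer`` and ``dashinfer_vlm``, and that the ``lib`` directory of the extracted TensorRT tarball holds files whose names start with ``libnvinfer``. The helper names are hypothetical.

```python
import importlib.util
import os
from pathlib import Path


def find_missing_modules(names):
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def dirs_with_library(search_path, prefix="libnvinfer"):
    """Return directories on an os.pathsep-separated path that contain
    at least one file whose name starts with `prefix`."""
    hits = []
    for entry in search_path.split(os.pathsep):
        directory = Path(entry)
        # Skip empty entries and paths that are not directories.
        if entry and directory.is_dir():
            if any(p.name.startswith(prefix) for p in directory.iterdir()):
                hits.append(str(directory))
    return hits


if __name__ == "__main__":
    # Module names below are assumptions, not confirmed by the commit.
    missing = find_missing_modules(["tensorrt", "dashinfer", "dashinfer_vlm"])
    trt_dirs = dirs_with_library(os.environ.get("LD_LIBRARY_PATH", ""))
    print("missing modules:", missing or "none")
    print("TensorRT library dirs:", trt_dirs or "none found")
```

Note that the documented ``export LD_LIBRARY_PATH=...`` line replaces any existing value of the variable; the check above only reports what the current process sees.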

multimodal/Dockerfile

Lines changed: 1 addition & 0 deletions
@@ -6,6 +6,7 @@ RUN mkdir /root/code/
 COPY ./dashinfer_vlm /root/code/dashinfer_vlm
 COPY ./setup.py code/
 COPY ./requirements.txt /root/code/requirements.txt
+RUN python3 -m pip install https://github.com/modelscope/dash-infer/releases/download/v2.0.0-rc2/dashinfer-2.0.0rc2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
 RUN python3 -m pip install -r /root/code/requirements.txt --index-url=http://mirrors.aliyun.com/pypi/simple/ --trusted-host=mirrors.aliyun.com

 RUN python3 -m pip install -e /root/code/
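The wheel added here carries the tags ``cp310-cp310``, which under the standard wheel file name convention suggests it targets CPython 3.10 on x86_64 Linux. A minimal sketch, not part of the commit, of reading those tags from the file name and comparing them with the running interpreter; the function names are hypothetical, and the parser assumes the distribution name contains no hyphen.

```python
import sys


def parse_wheel_tags(filename):
    """Split a wheel file name into name, version, and compatibility tags.

    Expects name-version[-build]-python-abi-platform.whl.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {filename}")
    return {
        "name": parts[0],
        "version": parts[1],
        "python": parts[-3],
        "abi": parts[-2],
        "platform": parts[-1],
    }


def matches_running_cpython(tags):
    """True if the wheel's python tag names the running CPython major.minor."""
    expected = f"cp{sys.version_info.major}{sys.version_info.minor}"
    return tags["python"] == expected


if __name__ == "__main__":
    wheel = ("dashinfer-2.0.0rc2-cp310-cp310-"
             "manylinux_2_17_x86_64.manylinux2014_x86_64.whl")
    tags = parse_wheel_tags(wheel)
    print(tags["python"], "matches interpreter:", matches_running_cpython(tags))
```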

multimodal/requirements.txt

Lines changed: 2 additions & 2 deletions
@@ -1,4 +1,3 @@
-dashinfer
 av
 numpy==1.24.3
 requests==2.32.3
@@ -12,7 +11,8 @@ shortuuid
 fastapi
 pydantic_settings
 uvicorn
-cmake==3.22.6
+cmake==3.22.6
 modelscope
 aiohttp
 onnx
+torchvision
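With ``dashinfer`` dropped from this file, ``pip install -r requirements.txt`` no longer installs it, so an environment can satisfy the file and still lack the engine. A minimal sketch, not part of the commit, of comparing the file's pins with what is installed; it handles only bare names and ``name==version`` lines, and the function names are hypothetical.

```python
from importlib import metadata


def parse_requirements(text):
    """Map each requirement name to its pinned version, or None if unpinned.

    Only bare names and `name==version` lines are understood.
    """
    reqs = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        name, sep, version = line.partition("==")
        reqs[name.strip()] = version.strip() if sep else None
    return reqs


def check_installed(reqs):
    """Return {name: problem} for packages that are missing or off their pin."""
    problems = {}
    for name, pin in reqs.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems[name] = "not installed"
            continue
        if pin is not None and found != pin:
            problems[name] = f"installed {found}, pinned {pin}"
    return problems


if __name__ == "__main__":
    sample = "av\nnumpy==1.24.3\nrequests==2.32.3\ntorchvision\n"
    reqs = parse_requirements(sample)
    reqs["dashinfer"] = None  # installed separately from the release wheel
    for name, problem in check_installed(reqs).items():
        print(f"{name}: {problem}")
```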