File tree Expand file tree Collapse file tree 4 files changed +10
-5
lines changed
Expand file tree Collapse file tree 4 files changed +10
-5
lines changed Original file line number Diff line number Diff line change @@ -107,15 +107,19 @@ Launching with CLI
107107You can also opt to install dashinfer-vlm locally and use command line to launch server.
108108
1091091. Pull dashinfer docker image (see :ref: `docker-label `)
110- 2. Download and extract the TensorRT GA build
110+ 2. Install TensorRT Python package, and download TensorRT GA build from NVIDIA Developer Zone.
111+
112+ Example: TensorRT 10.6.0.26 for CUDA 12.6, Linux x86_64
111113
112114.. code-block :: bash
113115
116+ pip install tensorrt
114117 wget https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.6.0/tars/TensorRT-10.6.0.26.Linux.x86_64-gnu.cuda-12.6.tar.gz
115118 tar -xvzf TensorRT-10.6.0.26.Linux.x86_64-gnu.cuda-12.6.tar.gz
116- export TRT_LIBPATH =` pwd` /TensorRT-10.6.0.26
119+ export LD_LIBRARY_PATH =` pwd` /TensorRT-10.6.0.26/lib
117120
118- 3. Install ``dashinfer-vlm ``: ``pip install dashinfer-vlm ``.
121+ 3. Install dashinfer Python Package from `release <https://github.com/modelscope/dash-infer/releases >`_
122+ 4. Install dashinfer-vlm: ``pip install dashinfer-vlm ``.
119123
120124Now you can launch server with command line:
121125
Original file line number Diff line number Diff line change @@ -6,6 +6,7 @@ RUN mkdir /root/code/
66COPY ./dashinfer_vlm /root/code/dashinfer_vlm
77COPY ./setup.py code/
88COPY ./requirements.txt /root/code/requirements.txt
9+ RUN python3 -m pip install https://github.com/modelscope/dash-infer/releases/download/v2.0.0-rc2/dashinfer-2.0.0rc2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
910RUN python3 -m pip install -r /root/code/requirements.txt --index-url=http://mirrors.aliyun.com/pypi/simple/ --trusted-host=mirrors.aliyun.com
1011
1112RUN python3 -m pip install -e /root/code/
Original file line number Diff line number Diff line change 1- dashinfer
21av
32numpy == 1.24.3
43requests == 2.32.3
@@ -12,7 +11,8 @@ shortuuid
1211fastapi
1312pydantic_settings
1413uvicorn
15- cmake == 3.22.6
14+ cmake == 3.22.6
1615modelscope
1716aiohttp
1817onnx
18+ torchvision
You can’t perform that action at this time.
0 commit comments