Skip to content

Commit 29f4e46

Browse files
committed
Merge branch 'main' into release
2 parents 2cb480e + a96b240 commit 29f4e46

File tree

22 files changed

+573
-1826
lines changed

22 files changed

+573
-1826
lines changed

.github/workflows/unifiedcache_test.yml

Lines changed: 0 additions & 37 deletions
Original file line numberDiff line numberDiff line change
@@ -18,40 +18,3 @@ jobs:
1818

1919
call-lint:
2020
uses: ./.github/workflows/pre-commit.yml
21-
22-
unit-test:
23-
needs: call-lint
24-
name: Run Unittests
25-
runs-on: ubuntu-latest
26-
steps:
27-
- name: Free disk space
28-
run: |
29-
sudo rm -rf /usr/share/dotnet
30-
sudo rm -rf /opt/ghc
31-
sudo rm -rf "/usr/local/share/boost"
32-
sudo rm -rf "$AGENT_TOOLSDIRECTORY"
33-
docker system prune -af
34-
df -h
35-
36-
- name: Checkout unified-cache-management repo
37-
uses: actions/checkout@v4
38-
39-
- name: Run unit test inside vLLM container
40-
run: |
41-
docker run --rm \
42-
-e VLLM_USE_PRECOMPILED=1 \
43-
-e PLATFORM=cuda \
44-
-v ${{ github.workspace }}:/workspace/unified-cache-management \
45-
-w /workspace/unified-cache-management \
46-
--entrypoint /bin/bash \
47-
vllm/vllm-openai:v0.9.2 \
48-
-c "
49-
set -euo pipefail
50-
pip install -v -e . --no-build-isolation
51-
cd \$(pip show vllm | grep Location | awk '{print \$2}') &&
52-
git apply /workspace/unified-cache-management/ucm/integration/vllm/patch/0.9.2/vllm-adapt-pc.patch
53-
git apply /workspace/unified-cache-management/ucm/integration/vllm/patch/0.9.2/vllm-adapt-aggre.patch
54-
git apply /workspace/unified-cache-management/ucm/integration/vllm/patch/0.9.2/vllm-adapt-sparse.patch
55-
cd /workspace/unified-cache-management
56-
python3 -m unittest discover -s test
57-
"

CMakeLists.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ set(CMAKE_CXX_EXTENSIONS OFF)
77
set(CMAKE_EXPORT_COMPILE_COMMANDS ON)
88

99
option(BUILD_UCM_STORE "build ucm store module." ON)
10-
option(BUILD_UCM_SPARSE "build ucm sparse module." ON)
10+
option(BUILD_UCM_SPARSE "build ucm sparse module." OFF)
1111
option(BUILD_UNIT_TESTS "build all unit test suits." OFF)
1212
option(BUILD_NUMA "build numactl library." OFF)
1313
option(DOWNLOAD_DEPENDENCE "download dependence by cmake." ON)

MANIFEST.in

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,7 @@
1-
recursive-include ucm/csrc *
2-
exclude CMakeLists.txt
1+
include LICENSE
2+
include pyproject.toml
3+
include CMakeLists.txt
4+
include requirements.txt
5+
6+
recursive-include examples *
7+
recursive-include benchmarks *

README.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
<p align="center">
22
<picture>
3-
<source media="(prefers-color-scheme: dark)" srcset="docs/source/logos/UCM-dark.png">
4-
<img alt="UCM" src="docs/source/logos/UCM-light.png" width=50%>
3+
<source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/ModelEngine-Group/unified-cache-management/main/docs/source/logos/UCM-dark.png">
4+
<img alt="UCM" src="https://raw.githubusercontent.com/ModelEngine-Group/unified-cache-management/main/docs/source/logos/UCM-light.png" width=50%>
55
</picture>
66
</p>
77

@@ -32,7 +32,7 @@ Sparse KV cache have many different choices. Recently paper point out that there
3232
and all models. So better to build a common framework then different sparse algorithms can be plugin to it like KV
3333
connector for PC.
3434

35-
![architecture.png](./docs/source/_static/images/idea.png)
35+
![architecture.png](https://raw.githubusercontent.com/ModelEngine-Group/unified-cache-management/main/docs/source/_static/images/idea.png)
3636

3737
All gray boxes in the diagram represent existing classes in vLLM version 0.9.2, while the green boxes indicate newly added components by UCM.
3838
The light green boxes demonstrate potential future subclass extensions based on this framework.
@@ -68,7 +68,7 @@ in either a local filesystem for single-machine scenarios or through NFS mount p
6868

6969
## Quick Start
7070

71-
please refer to [Quick Start](./docs/source/getting-started/quick_start.md).
71+
please refer to [Quick Start](https://ucm.readthedocs.io/en/latest/getting-started/quick_start.html).
7272

7373
---
7474

@@ -85,8 +85,8 @@ please refer to [Quick Start](./docs/source/getting-started/quick_start.md).
8585
1. For technical questions and feature requests, please use GitHub [Issues](https://github.com/ModelEngine-Group/unified-cache-management/issues).
8686
2. WeChat technical discussion group: Scan the QR code below.
8787

88-
<img src="docs/source/_static/images/qrcode_for_wechat.png" alt="wechat-gh" width="40%">
88+
<img src="https://raw.githubusercontent.com/ModelEngine-Group/unified-cache-management/main/docs/source/_static/images/qrcode_for_wechat.png" alt="wechat-gh" width="40%">
8989

9090
## License
9191

92-
UCM is licensed under the MIT with additional conditions. Please read the [LICENSE](./LICENSE) file for details.
92+
UCM is licensed under the MIT with additional conditions. Please read the [LICENSE](https://raw.githubusercontent.com/ModelEngine-Group/unified-cache-management/main/LICENSE) file for details.

docs/source/getting-started/installation_gpu.md

Lines changed: 0 additions & 86 deletions
This file was deleted.

docs/source/getting-started/installation_npu.md

Lines changed: 0 additions & 88 deletions
This file was deleted.

docs/source/getting-started/quick_start.md

Lines changed: 0 additions & 119 deletions
This file was deleted.

0 commit comments

Comments
 (0)