Skip to content

Commit dfd89e4

Browse files
authored
User updating
1 parent bc76d2c commit dfd89e4

File tree

2 files changed

+1
-7
lines changed

2 files changed

+1
-7
lines changed

README.md

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -20,21 +20,18 @@ Notes:
2020
* System roles may not work well on certain models.
2121
* Certain models do not support chat templates (which will be used when the system role is not "None").
2222
* Certain devices and dtype options may not work for certain models. "Audo" in dtype does not work for all models either.
23-
* For example, `Gemma-3-1B-It` does not work on WebGPU, and `SmolLM2-1.7B-Instruct` only works when dtype is set to `int8`, `uint8`, `bnb4` or `q4f16`.
23+
* For example, `Gemma-3-1B-It` does not work on WebGPU, and `SmolLM2-1.7B-Instruct` only works for dtype = `int8`, `uint8`, `bnb4` or `q4f16`.
2424
* After loading a model, you must refresh the page to load a different one. There is no way to release the old model from the memory, and trying to load more than two models proved to be problematic.
2525

2626
```json
2727
{
2828
"models": {
2929
"SmolLM2-135M-Instruct": "HuggingFaceTB/SmolLM2-135M-Instruct",
3030
"SmolLM2-360M-Instruct": "HuggingFaceTB/SmolLM2-360M-Instruct",
31-
"codegen-350M-mono": "Xenova/codegen-350M-mono",
3231
"Qwen2.5-0.5B-Instruct": "Mozilla/Qwen2.5-0.5B-Instruct",
3332
"Qwen3-0.6B": "onnx-community/Qwen3-0.6B-ONNX",
3433
"Gemma-3-1B-It": "onnx-community/gemma-3-1b-it-ONNX",
3534
"Falcon3-1B-Instruct": "onnx-community/Falcon3-1B-Instruct",
36-
"AMD-OLMo-1B-SFT-DPO": "onnx-community/AMD-OLMo-1B-SFT-DPO",
37-
"ZR1-1.5B": "onnx-community/ZR1-1.5B-ONNX",
3835
"SmolLM2-1.7B-Instruct": "HuggingFaceTB/SmolLM2-1.7B-Instruct",
3936
"Phi-3-mini-4k-Instruct (3.8B)": "Xenova/Phi-3-mini-4k-instruct"
4037
},

src/model/Config.json

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,13 +2,10 @@
22
"models": {
33
"SmolLM2-135M-Instruct": "HuggingFaceTB/SmolLM2-135M-Instruct",
44
"SmolLM2-360M-Instruct": "HuggingFaceTB/SmolLM2-360M-Instruct",
5-
"codegen-350M-mono": "Xenova/codegen-350M-mono",
65
"Qwen2.5-0.5B-Instruct": "Mozilla/Qwen2.5-0.5B-Instruct",
76
"Qwen3-0.6B": "onnx-community/Qwen3-0.6B-ONNX",
87
"Gemma-3-1B-It": "onnx-community/gemma-3-1b-it-ONNX",
98
"Falcon3-1B-Instruct": "onnx-community/Falcon3-1B-Instruct",
10-
"AMD-OLMo-1B-SFT-DPO": "onnx-community/AMD-OLMo-1B-SFT-DPO",
11-
"ZR1-1.5B": "onnx-community/ZR1-1.5B-ONNX",
129
"SmolLM2-1.7B-Instruct": "HuggingFaceTB/SmolLM2-1.7B-Instruct",
1310
"Phi-3-mini-4k-Instruct (3.8B)": "Xenova/Phi-3-mini-4k-instruct"
1411
},

0 commit comments

Comments
 (0)