Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 6 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -1347,6 +1347,12 @@ simulator focused on mobile cloud/edge/iot infrastructures.
to the ONNX and deployed with DeepSparse for GPU-class performance on CPU
hardware.

- [Shimmy](https://github.com/ptsochantaris/shimmy): Self-hosted AI inference
server with OpenAI API compatibility, designed for edge deployment. Supports
multiple model formats (GGUF, Core ML, etc.) with built-in streaming and
multimodal capabilities. Lightweight and fast for resource-constrained edge
environments.

- [SparseZoo](https://github.com/neuralmagic/sparsezoo): SparseZoo is a
constantly-growing repository of sparsified (pruned and pruned-quantized)
models with matching sparsification recipes for neural networks. It simplifies
Expand Down