diff --git a/README.md b/README.md index c31b433..0291ca3 100644 --- a/README.md +++ b/README.md @@ -1347,6 +1347,12 @@ simulator focused on mobile cloud/edge/iot infrastructures. to the ONNX and deployed with DeepSparse for GPU-class performance on CPU hardware. +- [Shimmy](https://github.com/ptsochantaris/shimmy): Self-hosted AI inference + server with OpenAI API compatibility, designed for edge deployment. Supports + multiple model formats (GGUF, Core ML, etc.) with built-in streaming and + multimodal capabilities. Lightweight and fast for resource-constrained edge + environments. + - [SparseZoo](https://github.com/neuralmagic/sparsezoo): SparseZoo is a constantly-growing repository of sparsified (pruned and pruned-quantized) models with matching sparsification recipes for neural networks. It simplifies