Search. Pull. Build.

LiteLLM is a unified interface to call 100+ LLMs using the OpenAI format, providing a proxy server for multiple LLM providers.

Latest tag: v1.87.0 + 101 more tags

lmcache-vllm-openai

Last changed

LMCache is an LLM serving engine extension that stores and reuses KV caches across requests to reduce time-to-first-token (TTFT) and increase throughput. It integrates with vLLM to provide GPU-accelerated inference with shared KV cache management.

Latest tag: v0.4.6 + 17 more tags

nemo

Last changed

NVIDIA NeMo Framework is an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere.

Latest tag: v2.7.3 + 77 more tags

nvidia-container-toolkit

Last changed

The NVIDIA Container Toolkit allows users to build and run GPU accelerated containers.

Latest tag: v1.19.1 + 63 more tags

nvidia-gpu-driver

Last changed

Tools necessary for GPU and feature discovery for NVIDIA GPU driver container that allows the provisioning of the NVIDIA driver through the use of containers.

Latest tag: v550.54.14 + 13 more tags

ollama

Last changed

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Latest tag: v0.24.0 + 267 more tags

34 images

The trusted source for open source

Talk to an expert

Privacy

Terms

© 2026 Chainguard, Inc. All Rights Reserved.
Chainguard® and the Chainguard logo are registered trademarks of Chainguard, Inc. in the United States and/or other countries.
The other respective trademarks mentioned on this page are owned by the respective companies and use of them does not imply any affiliation or endorsement.

Search. Pull. Build.

The trusted source for open source

Product

Solutions

Customers

Resources

Company