What is Salad AI Gateway?
Salad AI Gateway is a managed LLM API service that gives you direct access to powerful open-source models through a single, OpenAI-compatible endpoint - with no infrastructure to deploy or manage. It is powered by SaladCloud’s distributed GPU network, bringing the same cost-efficiency of self-hosted models with the simplicity of a hosted API. Any tool or library that supports a custom base URL - Aider, OpenCode, OpenClaw, Cline, Cursor, Goose, Vercel AI SDK, and more - works with Salad AI Gateway out of the box.API Endpoint
Available Models
The following models are available during the closed beta:| Model | Description |
|---|---|
qwen3.5-35b-a3b | Qwen 3.5 35B Mixture of Experts - best for agentic tasks and complex reasoning |
qwen3.5-27b | Qwen 3.5 27B - strong balance of capability and speed |
qwen3.5-9b | Qwen 3.5 9B - fastest response times, suited for lighter tasks |
Quick Start
The API follows the OpenAIv1/chat/completions format. Once you have access, send requests using your Salad API key in
the Authorization header:
<your-salad-api-key> with your actual Salad API key.
Use with OpenAI-Compatible Tools
Because Salad AI Gateway exposes a standard OpenAI-compatible endpoint, you can use it with any tool that supports a custom base URL. We have integration guides for many popular agentic tools - Cline, Aider, OpenCode, OpenClaw, Kilo Code, Goose, Hermes Agent, and more. All of those guides apply here with one difference: use standard authentication instead of theSalad-Api-Key custom header. Set your Salad API key as the API key in whichever field the tool provides (the
Authorization: Bearer header), and point the base URL to https://ai.salad.cloud:40404/v1.
Cline
VS Code AI coding agent
Aider
CLI AI pair programmer
OpenCode
Terminal coding agent
OpenClaw
Always-on AI assistant
Kilo Code
VS Code coding agent
Vercel AI SDK
VS Code multi-agent coding
Goose
CLI + desktop AI agent
Hermes Agent
Self-improving AI agent with MCP
Why Salad AI Gateway?
- No Infrastructure - no container groups to deploy, no replicas to manage, no cold starts to worry about
- Always on - our distributed GPU network ensures high availability and reliability
- OpenAI-compatible - easy integration with any OpenAI-compatible tool
- SaladCloud Pricing - 90% cost savings vs traditional providers
- Data Privacy - your data is never used for training
- Powered by Sustainable Computing - utilizing otherwise wasted GPU resources to reduce carbon footprint