Overview
This recipe runsQwen3.5-9B with the official SGLang runtime on a Salad GPU. The model is
downloaded automatically on first startup, and the container exposes an OpenAI-compatible API for tools such as
OpenClaw, OpenCode, and other compatible clients.
This recipe is designed to be easy to use:
- the model is already chosen for you
- it is public by default, so you can test it immediately after deployment
- it is preconfigured for Qwen reasoning and tool-calling
- you can connect it to OpenClaw or any other OpenAI-compatible client
Quick Start
- Open the SaladCloud Portal.
- Open your Organization and click “Deploy Container Group”
- Pick Qwen3.5-9B (SGLang) from the recipe list.
- Enter a Container Group Name.
- Decide whether to enable Require Container Gateway Authentication:
- Disabled: public access.
- Enabled: requests must include your SaladCloud API key.
- Deploy and wait for the first startup to finish.
The model is downloaded from Hugging Face at startup, so it can take several minutes before the deployment becomes
ready.
https://<your-dns>.salad.cloud/v1/chat/completions.
Use With OpenClaw
If you want to connect this recipe to OpenClaw, follow this guide:Current Defaults
The recipe currently defaults to:- Model source:
Qwen/Qwen3.5-9B - Served model name:
qwen3.5-9b - Host bind:
:: - Runtime image:
lmsysorg/sglang:latest-runtime - Context length:
131072 - Tensor parallel size:
1 - Memory fraction:
0.8 - Attention backend:
triton - Reasoning parser:
qwen3 - Tool call parser:
qwen3_coder - Authentication: disabled by default
Qwen/Qwen3.5-9B, with a triton attention backend selected
for better compatibility on Salad GPU nodes.
Thinking Mode
Qwen thinking is enabled by default. If you want a direct response for a specific request, include this field in the JSON body:Authentication
Require Container Gateway Authentication is available in the deployment form and is unchecked by default.- Disabled: anyone with the URL can call the API.
- Enabled: every request must include the
Salad-Api-Keyheader.
Example Request
For Technical Users
If you want to change the model or runtime settings later, open the container group in the SaladCloud Portal and edit Advanced Configuration. Useful environment variables include:SGLANG_MODEL_PATHSGLANG_SERVED_MODEL_NAMESGLANG_CONTEXT_LENGTHSGLANG_MEM_FRACTION_STATICSGLANG_TP_SIZE