集成所需的一切。支持 Anthropic 与 OpenAI 两种协议——同一把 Key,同一套规则。
注册并登录后,进入 控制台 → API Keys → 新建 Key。你会得到一串以 cc_user_ 开头的密钥——请立即复制,它只展示这一次。
基址为 https://api.wdllm.com(或运营方告知你的地址)。
curl https://api.wdllm.com/v1/messages \
-H "x-api-key: $YOUR_KEY" \
-H "anthropic-version: 2023-06-01" \
-H "content-type: application/json" \
-d '{
"model": "claude-sonnet-4-6",
"max_tokens": 512,
"messages": [
{"role": "user", "content": "Hello, world!"}
]
}'
curl https://api.wdllm.com/v1/chat/completions \
-H "Authorization: Bearer $YOUR_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-sonnet-4-6",
"messages": [
{"role": "user", "content": "Hello, world!"}
]
}'
curl https://api.wdllm.com/v1/messages \
-H "x-api-key: $YOUR_KEY" \
-H "anthropic-version: 2023-06-01" \
-d '{"model":"claude-sonnet-4-6","stream":true,"max_tokens":1024,
"messages":[{"role":"user","content":"Tell me a haiku."}]}'
完整可调用的模型见 /pricing 或编程式拉取 /v1/models(实时反映上游 Anthropic 的最新清单)。同一模型可以通过**模型名后缀**调节推理强度,无需任何额外字段——任何 SDK / 客户端都能用。
后缀语法:<base>[-<effort>][-thinking]。两个维度独立,可自由组合:
| 模型名 | effort | 思考 | 适合 |
|---|---|---|---|
claude-opus-4-8 | — | off | 普通问答 · 最低延迟 |
claude-opus-4-8-max | max | off | 要详尽长答但不要慢推理 |
claude-opus-4-8-thinking | — | on | 要推理 · 默认详尽度 |
claude-opus-4-8-xhigh-thinking | xhigh | on | 复杂重构 / 长链推理(火力全开) |
claude-haiku-4-5-thinking | — | on | 老模型(无 effort)开思考 |
low / medium / high / xhigh / max——回答详尽度,开关思考都生效。xhigh 仅 Opus 4.8 / 4.7;max 限 Opus 4.6+/Sonnet 4.6。-thinking 后才思考;老模型(Haiku 4.5/Sonnet 4.5/Opus 4.1)只支持手动 budget,用 -thinking 启用。reasoning_effort(支持 low/medium/high/xhigh/max,是后缀的等价字段);原生 Anthropic 可直接传 output_config.effort + thinking:{type:"adaptive"},网关原样转发不覆盖。curl https://api.wdllm.com/v1/messages \
-H "x-api-key: $YOUR_KEY" \
-H "anthropic-version: 2023-06-01" \
-H "content-type: application/json" \
-d '{
"model": "claude-opus-4-8-xhigh-thinking",
"max_tokens": 4096,
"messages": [
{"role": "user", "content": "Refactor this code for performance."}
]
}'
内联 base64 编码图片:
{
"model": "claude-sonnet-4-6",
"max_tokens": 512,
"messages": [{
"role": "user",
"content": [
{"type": "image", "source": {
"type": "base64",
"media_type": "image/png",
"data": "iVBORw0KG..."
}},
{"type": "text", "text": "What's in this image?"}
]
}]
}
用 cache_control 标记消息块,使其驻留缓存,后续读取仅按 10% 计费:
{
"role": "user",
"content": [
{"type": "text", "text": "Large system context...",
"cache_control": {"type": "ephemeral"}},
{"type": "text", "text": "User question goes here"}
]
}
{
"model": "claude-sonnet-4-6",
"max_tokens": 1024,
"tools": [{
"name": "get_weather",
"description": "Look up current weather",
"input_schema": {
"type": "object",
"properties": {"city": {"type": "string"}},
"required": ["city"]
}
}],
"messages": [{"role":"user","content":"Weather in Tokyo?"}]
}
每次成功请求都会返回:
X-CC-Request-Id —— 我方内部 id(用于后续查询用量)X-CC-Account-Id —— 为你服务的上游 Anthropic 账号request-id —— Anthropic 自己的 id,用于工单升级查询任意历史请求的用量:
GET /v1/usage/{request_id}
→ {
"input_tokens": 781,
"output_tokens": 921,
"cache_read_input_tokens": 0,
"cache_creation": {"ephemeral_5m_input_tokens": 0, ...},
"cost_micro": 18234,
"model": "claude-sonnet-4-6"
}
| 状态码 | 含义 |
|---|---|
401 | 缺少或无效的 API Key。 |
402 | 余额不足(<code class="inline">insufficient_funds</code>)—— 请充值后继续;或请求了未定价模型(<code class="inline">model_not_priced</code>)—— 该模型当前未在定价表中,请联系运营添加。两种情况下网关都不会调用上游、不消耗 token。 |
429 | 触发限流(每 Key RPM 或上游账号 RPM)。 |
502 | 上游 Anthropic 报错;冷却系统会自动改路。 |
503 | 无健康上游账号,请稍后重试。 |
沿用你现有的任意 SDK,只需替换基址:
# Python (anthropic-sdk-python)
client = anthropic.Anthropic(
api_key="cc_user_...",
base_url="https://api.wdllm.com",
)
# Python (openai-python)
client = openai.OpenAI(
api_key="cc_user_...",
base_url="https://api.wdllm.com/v1",
)
# Node.js (@anthropic-ai/sdk)
const client = new Anthropic({ apiKey: process.env.YOUR_KEY,
baseURL: 'https://api.wdllm.com' });
需要这里没有列出的功能?发邮件至 support@wdllm.com,我们乐意帮忙。