Baseten Model APIs
$0.01/call
OpenAI-compatible inference API for high-performance LLMs. Drop-in replacement for OpenAI SDK - just change base_url and api_key. **Supported Models:** | Model | Slug | Context | |-------|------|--------| | DeepSeek V3 0324 | `deepseek-ai/DeepSeek-V3-0324` | 164k | | DeepSeek V3.1 | `deepseek-ai/DeepSeek-V3.1` | 164k | | GLM 4.6 (Zhipu) | `zai-org/GLM-4.6` | 200k | | GLM 4.7 (Zhipu) | `zai-org/GLM-4.7` | 200k | | Kimi K2 0905 | `moonshotai/Kimi-K2-Instruct-0905` | 128k | | Kimi K2 Thinking | `moonshotai/Kimi-K2-Thinking` | 262k | | Kimi K2.5 | `moonshotai/Kimi-K2.5` | 262k | | OpenAI GPT OSS 120B | `openai/gpt-oss-120b` | 128k | **Features:** Chat completions, streaming, tool calling, structured outputs, reasoning modes. **Pricing:** ~$0.60/1M tokens (varies by model)
Connect Baseten Model APIs tools
Cursor
Claude Code
Claude Desktop
Windsurf
VS Code
Cline
Roo Code
ChatGPT
Gemini CLI
Amazon Q
Goose
Augment
n8n
API / cURL
AI SDK
TypeScript SDK
{
"mcpServers": {
"baseten": {
"url": "https://baseten.mcp.xpay.sh/mcp?key=YOUR_API_KEY"
}
}
}Or connect all tools
Access all tools (including Baseten Model APIs) through a single MCP connection.
{
"mcpServers": {
"xpay": {
"url": "https://mcp.xpay.sh/mcp?key=YOUR_API_KEY"
}
}
}Agent Discovery
Machine-readable catalogs for LLM agents and automation.
curl https://hub.xpay.sh/tools/llms.txt
curl https://hub.xpay.sh/tools/agents.txt
curl https://hub.xpay.sh/tools/skill.md
Pricing
Pay per tool call. No subscriptions.
1 Baseten Model APIs tool available
