baseten_chat_completions
Create a chat completion using OpenAI-compatible API. **Supported Models:** - `deepseek-ai/DeepSeek-V3-0324` - DeepSeek V3 0324 (164k context) 🧠 - `deepseek-ai/DeepSeek-V3.1` - DeepSeek V3.1 (164k context) 🧠 - `zai-org/GLM-4.6` - GLM 4.6 (200k context) 🧠 - `zai-org/GLM-4.7` - GLM 4.7 (200k context) 🧠 - `moonshotai/Kimi-K2-Instruct-0905` - Kimi K2 0905 (128k context) - `moonshotai/Kimi-K2-Thinking` - Kimi K2 Thinking (262k context) 🧠 always-on - `moonshotai/Kimi-K2.5` - Kimi K2.5 (262k context) - `openai/gpt-oss-120b` - OpenAI GPT OSS 120B (128k context) 🧠 = Reasoning model. Use `reasoning_effort` param (low/medium/high) to control thinking depth. Response includes `reasoning_content` field with chain-of-thought. Supports streaming, tool calling, structured outputs.
Pricing
Per call
$0.01
Model
flat
Pay only for what you use. No subscriptions.
Inputs
top_logprobs
numberreasoning_effort
stringlogit_bias
objectseed
numberbad
stringskip_special_tokens
booleandocuments
stringpresence_penalty
numberecho
booleantop_p_min
numberearly_stopping
booleantools
stringlogprobs
booleantop_p
numberfrequency_penalty
numberresponse_format
objecttruncate_prompt_tokens
numberbest_of
numberstream
booleantop_k
numberdisaggregated_params
objecttemperature
numbertool_choice
stringmodel *
stringignore_eos
booleanchat_template
stringmax_tokens
numberadd_generation_prompt
booleann
numbermin_tokens
numbermin_p
numberspaces_between_special_tokens
booleanchat_template_args
objectstop
stringparallel_tool_calls
booleaninclude_stop_str_in_output
booleanmessages *
stringbad_token_ids
stringstream_options
objectuser
stringrepetition_penalty
numberlength_penalty
numberstop_token_ids
stringadd_special_tokens
boolean
