curl --request POST \
--url https://api.minimax.io/v1/responses \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: <content-type>' \
--data '
{
"model": "MiniMax-M3",
"input": "Hello!"
}
'{
"id": "abc123",
"object": "response",
"created_at": 1764000000,
"model": "MiniMax-M3",
"status": "completed",
"output": [
{
"id": "abc123_msg",
"type": "message",
"status": "completed",
"role": "assistant",
"content": [
{
"type": "output_text",
"text": "Hello! I'm MiniMax. How can I help you today?",
"annotations": []
}
]
}
],
"output_text": "Hello! I'm MiniMax. How can I help you today?",
"usage": {
"input_tokens": 8,
"input_tokens_details": {
"cached_tokens": 0
},
"output_tokens": 14,
"output_tokens_details": {
"reasoning_tokens": 0
},
"total_tokens": 22
},
"parallel_tool_calls": true,
"store": false,
"truncation": "disabled"
}Create Response
Call MiniMax models via the OpenAI Responses API compatible main endpoint. Generates model replies, supports streaming and non-streaming.
curl --request POST \
--url https://api.minimax.io/v1/responses \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: <content-type>' \
--data '
{
"model": "MiniMax-M3",
"input": "Hello!"
}
'{
"id": "abc123",
"object": "response",
"created_at": 1764000000,
"model": "MiniMax-M3",
"status": "completed",
"output": [
{
"id": "abc123_msg",
"type": "message",
"status": "completed",
"role": "assistant",
"content": [
{
"type": "output_text",
"text": "Hello! I'm MiniMax. How can I help you today?",
"annotations": []
}
]
}
],
"output_text": "Hello! I'm MiniMax. How can I help you today?",
"usage": {
"input_tokens": 8,
"input_tokens_details": {
"cached_tokens": 0
},
"output_tokens": 14,
"output_tokens_details": {
"reasoning_tokens": 0
},
"total_tokens": 22
},
"parallel_tool_calls": true,
"store": false,
"truncation": "disabled"
}Reasoning Control
ForMiniMax-M3, the reasoning field controls whether the response can include reasoning output.
- If
reasoningis omitted, reasoning is on by default and the response includes an output item withtype: "reasoning". - Set
reasoning: {"effort": "none"}to disable reasoning output forMiniMax-M3. - Values
minimal,low,medium, andhighare accepted for compatibility and keep reasoning enabled, but they do not tune MiniMax-M3’s reasoning depth. - For M2.x models, reasoning cannot be disabled;
reasoning: {"effort": "none"}is accepted but reasoning remains on.
{
"model": "MiniMax-M3",
"input": "Which is larger, 9.11 or 9.9?"
}
{
"model": "MiniMax-M3",
"input": "Which is larger, 9.11 or 9.9?",
"reasoning": {
"effort": "none"
}
}
Authorizations
HTTP: Bearer Auth
- Security Scheme Type: http
- HTTP Authorization Scheme: Bearer API_key, used to authenticate your account. View it in Account Management > API Keys
Headers
Media type of the request body. Must be set to application/json
application/json Body
Model name to invoke, e.g. MiniMax-M3
"MiniMax-M3"
Conversation content. Supports either a simple text or a full conversation history array
System instructions
Maximum output token count
Sampling temperature, range (0, 1]
0 <= x <= 1Nucleus sampling, range (0, 1]
0 <= x <= 1Set to true to enable SSE streaming response
Tool list
Show child attributes
Show child attributes
Tool selection strategy: none means no tool will be called; auto lets the model decide whether to call tools
none, auto Request metadata. Both keys and values are strings
Show child attributes
Show child attributes
Prompt cache routing identifier
Output format control
Show child attributes
Show child attributes
Reasoning control. For MiniMax-M3, reasoning is on by default and responses include a reasoning output item. Set effort to none to disable MiniMax-M3 reasoning output. The minimal, low, medium, and high values are accepted for compatibility and keep reasoning enabled; they do not tune MiniMax-M3's reasoning depth. For M2.x models, reasoning cannot be disabled.
Show child attributes
Show child attributes
Response
Successful response
Response ID
"abc123"
Object type, always response
response Response creation time (Unix seconds)
Actual model that processed the request
Response status
completed, incomplete, failed Model output list
Assistant reply
- Message
- Reasoning
- Function Call
Show child attributes
Show child attributes
Convenience field. Concatenation of all text outputs
Show child attributes
Show child attributes
Error info, only returned when status=failed
Show child attributes
Show child attributes
Reason for incompletion, only returned when status=incomplete
Show child attributes
Show child attributes
Whether parallel tool calls are supported
Whether the response is persisted
Context truncation strategy
disabled