Model Advanced Parameters

Each model in model_config.toml can set the extra_params field, allowing you to pass provider-specific additional parameters during API calls.

💡 Tip: All keys other than the three special keys (headers, query, body) are merged into the request body (the OpenAI SDK's extra_body parameter).

API Provider Advanced Config

Advanced Auth Configuration

🏷️ Item	💡 What It Is	📝 How to Fill
`auth_header_name`	Header auth name	`Authorization` (default)
`auth_header_prefix`	Header auth prefix	`Bearer` (default)
`auth_query_name`	Query auth param name	`api_key` (default)

Advanced Parameters

🏷️ Item	💡 Purpose	📊 Recommended
`default_headers`	Default HTTP headers	`{}`
`default_query`	Default query params	`{}`
`organization`	OpenAI org (optional)	`None`
`project`	OpenAI project (optional)	`None`
`model_list_endpoint`	Model list endpoint	`/models`
`reasoning_parse_mode`	Reasoning parse mode	`auto`
`tool_argument_parse_mode`	Tool argument parse mode	`auto`

Runtime Configuration

🏷️ Item	💡 Purpose	📊 Recommended
`timeout`	Timeout	`10` seconds
`max_retry`	Max retries	`2` times
`retry_interval`	Retry interval	`10` seconds
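
To make the interaction between `max_retry` and `retry_interval` concrete, here is a minimal sketch of a retry loop. `call_with_retry` is a hypothetical helper, and it assumes that `max_retry` counts retries after the first attempt; MaiBot's actual counting may differ.

```python
import time


def call_with_retry(send, max_retry=2, retry_interval=10, sleep=time.sleep):
    """Call send() and retry on failure (hypothetical helper)."""
    retries = 0
    while True:
        try:
            return send()
        except Exception:
            # Assumption: max_retry is the number of retries after the first try.
            if retries >= max_retry:
                raise
            retries += 1
            sleep(retry_interval)


calls = []


def flaky():
    """Simulated request that times out twice, then succeeds."""
    calls.append(1)
    if len(calls) < 3:
        raise TimeoutError("simulated timeout")
    return "ok"


# retry_interval=0 keeps the demo fast; the table above recommends 10 seconds.
result = call_with_retry(flaky, max_retry=2, retry_interval=0)
print(result, len(calls))  # ok 3
```

Under this assumption, the recommended values allow up to three attempts with a 10-second pause between them.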

Internal Translation Mechanism

extra_params is an internal MaiBot configuration field. It is not sent as-is to the model provider. Before the request is sent, the client converts it into the extra arguments supported by the corresponding SDK.

The OpenAI-compatible client (client_type = "openai") splits extra_params using these rules:

Syntax	Actual Use
`headers`	Sent as request headers
`query`	Sent as URL query parameters
`body`	Merged into the request body
Other plain keys	Sent as extra request body fields

For example:

toml

extra_params = {headers = {"X-Trace-Id" = "test-001"}, query = {version = "2024-01-01"}, body = {metadata = {source = "maibot"}}, enable_thinking = "false"}

This is converted to request extras similar to:

python

extra_headers = {"X-Trace-Id": "test-001"}
extra_query = {"version": "2024-01-01"}
extra_body = {
    "metadata": {"source": "maibot"},
    "enable_thinking": "false",
}

A common configuration like extra_params = {enable_thinking = "false"} sends enable_thinking as a request body field to the provider, not as a nested {"extra_params": {"enable_thinking": "false"}}.
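
The splitting rules in this section can be sketched as a small function. `split_extra_params` is a hypothetical name, not MaiBot's actual implementation, and letting a plain key win over a same-named key inside `body` is an assumption.

```python
SPECIAL_KEYS = ("headers", "query", "body")


def split_extra_params(extra_params):
    """Split extra_params into (extra_headers, extra_query, extra_body)."""
    extra_headers = dict(extra_params.get("headers", {}))
    extra_query = dict(extra_params.get("query", {}))
    extra_body = dict(extra_params.get("body", {}))
    # Every other key becomes a request body field.
    # Assumption: a plain key overrides a same-named key from "body".
    for key, value in extra_params.items():
        if key not in SPECIAL_KEYS:
            extra_body[key] = value
    return extra_headers, extra_query, extra_body


headers, query, body = split_extra_params({
    "headers": {"X-Trace-Id": "test-001"},
    "query": {"version": "2024-01-01"},
    "body": {"metadata": {"source": "maibot"}},
    "enable_thinking": "false",
})
print(headers)
print(query)
print(body)
```

With the OpenAI Python SDK, the three results correspond to the `extra_headers`, `extra_query`, and `extra_body` request options.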

Model Advanced Parameters

Parameter Overrides

🏷️ Item	💡 What It Is	📝 How to Fill
`temperature`	Model-level temperature	Optional; overrides the task config temperature
`max_tokens`	Model-level max tokens	Optional; overrides the task config max_tokens

Other Advanced Parameters

🏷️ Item	💡 What It Is	📝 How to Fill
`force_stream_mode`	Force streaming	`false` (default); set to `true` if the provider does not support non-streaming responses
`extra_params`	Extra parameters	`{}` (default); custom API parameters, see the scenarios below

Priority Rules

temperature and max_tokens can be written in extra_params as model-level defaults, but the dedicated model fields are recommended:

toml

temperature = 0.7
max_tokens = 4096

This keeps the configuration clearer and avoids confusion with provider-specific request body fields that may use the same names.

When the same parameter exists in multiple places, the priority order is:

1. Values explicitly passed by the current request
2. Dedicated fields in the current model config, such as temperature and max_tokens
3. Same-name fields in the current model's extra_params
4. Defaults from the current task config
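
The priority order above can be sketched as a lookup that walks the four sources in turn. `resolve_param` and the dict shapes are hypothetical, and treating `None` as "not set" is an assumption.

```python
def resolve_param(name, request_args, model_config, task_config):
    """Return the first value found for name, following the priority order."""
    dedicated = {k: v for k, v in model_config.items() if k != "extra_params"}
    sources = (
        request_args,                           # values passed by the request
        dedicated,                              # dedicated model fields
        model_config.get("extra_params", {}),   # same-name keys in extra_params
        task_config,                            # task config defaults
    )
    for source in sources:
        # Assumption: None means "not set", so the next source is consulted.
        if source.get(name) is not None:
            return source[name]
    return None


model = {"temperature": 0.7, "extra_params": {"temperature": 0.2, "max_tokens": 2048}}
task = {"temperature": 1.0, "max_tokens": 1024}

print(resolve_param("temperature", {}, model, task))                    # 0.7
print(resolve_param("max_tokens", {}, model, task))                     # 2048
print(resolve_param("temperature", {"temperature": 0.0}, model, task))  # 0.0
```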

Enabling Thinking Mode

Many large models support a "thinking mode" in which the model reasons at length before answering, which can improve response quality for complex questions.

DeepSeek

toml

[[models]]
name = "deepseek-r1"
model_identifier = "deepseek-reasoner"
api_provider = "deepseek"
visual = false
extra_params = {enable_thinking = true}   # Enable thinking mode

Parameter	Type	Description
`enable_thinking`	`bool`	`true` to enable thinking, `false` to disable

Adjusting Reasoning Depth

OpenAI's reasoning models use the reasoning_effort parameter to control reasoning depth.

Level	Use Case	Characteristics
`none`	Simple Q&A, information retrieval	Fastest, no reasoning
`minimal`	Minimal reasoning	Almost no added latency
`low`	Tool calls, search, multi-step decisions	Light reasoning
`medium`	Planning, complex reasoning (default)	Balance of quality and speed
`high`	Complex debugging, deep planning	Quality prioritized
`xhigh`	Deep research, async tasks	Highest quality, maximum latency

toml

[[models]]
name = "gpt-5"
model_identifier = "gpt-5.5"
api_provider = "openai"
visual = false
extra_params = {reasoning_effort = "medium"}

💡 Recommendation: Use medium for daily use, low for speed-sensitive tasks, high for deep analysis.
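
Because a misspelled level only shows up as an API error at request time, a small pre-flight check can help. The sketch below validates `reasoning_effort` against the levels in the table above. `check_reasoning_effort` is a hypothetical helper, and individual models may accept only a subset of these levels.

```python
REASONING_EFFORT_LEVELS = ("none", "minimal", "low", "medium", "high", "xhigh")


def check_reasoning_effort(extra_params):
    """Return the configured level, or raise if it is not in the table."""
    level = extra_params.get("reasoning_effort")
    if level is not None and level not in REASONING_EFFORT_LEVELS:
        allowed = ", ".join(REASONING_EFFORT_LEVELS)
        raise ValueError(f"unknown reasoning_effort {level!r}; expected one of: {allowed}")
    return level


print(check_reasoning_effort({"reasoning_effort": "medium"}))  # medium

try:
    check_reasoning_effort({"reasoning_effort": "maximum"})
except ValueError as err:
    print(err)
```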

About client_type and Gemini

client_type determines which client MaiBot uses to communicate with the API:

Value	Description
`openai`	OpenAI-compatible interface (default), works with DeepSeek, Alibaba Bailian, OpenAI, etc.
`google`	Google Gemini native interface, supports thinking budget control

Gemini Thinking Configuration

Gemini models use thinking_config in extra_params to control thinking:

toml

[[models]]
name = "gemini-2.5-flash"
model_identifier = "gemini-2.5-flash"
api_provider = "google-gemini"
visual = true
client_type = "google"
extra_params = {thinking_config = {thinking_budget = 4096}}

⚠️ Google API is not directly accessible in China. You'll need a proxy.

Gemini Extra Parameter Fields

When client_type = "google", extra_params does not follow the OpenAI-compatible headers/query/body splitting rules. The Gemini client filters and maps fields according to what it supports:

Content generation: mapped to supported GenerateContentConfig fields
Embeddings: mapped to supported EmbedContentConfig fields

Parameter	Purpose
`thinking_budget`	Thinking budget (token count)
`include_thoughts`	Whether to include thinking process in responses
`enable_google_search`	Enable Google search capability
`task_type`	Embedding task type
`output_dimensionality`	Embedding output dimensionality
`audio_mime_type`	MIME type for audio requests

Custom HTTP Requests

extra_params supports three special keys for precise API request control:

Special Key	Purpose	Example
`headers`	Add HTTP request headers	`{headers = {"X-Custom" = "value"}}`
`query`	Add URL query parameters	`{query = {key = "value"}}`
`body`	Override request body fields	`{body = {field = "value"}}`

toml

[[models]]
name = "custom-model"
model_identifier = "custom-model-v1"
api_provider = "custom"
visual = false
extra_params = {headers = {"X-API-Version" = "2024-06", "X-Priority" = "high"}}

Combining Parameters

You can use multiple parameters together:

toml

[[models]]
name = "gpt-5-advanced"
model_identifier = "gpt-5.5"
api_provider = "openai"
visual = true
extra_params = {reasoning_effort = "high", headers = {"X-Request-ID" = "custom-id", "X-Priority" = "high"}}

Quick Parameter Reference

Parameter	Providers	Type	Description
`enable_thinking`	DeepSeek	`bool`	Enable thinking mode
`reasoning_effort`	OpenAI	`str`	Reasoning depth level
`headers`	All	`dict`	Custom HTTP request headers
`query`	All	`dict`	Custom URL query parameters
`body`	All	`dict`	Custom request body fields
`thinking_config`	Gemini	`dict`	Thinking budget config

⚠️ Note: These parameters ultimately reach the LLM API. Ensure parameter names and value formats match your provider's documentation; otherwise, API calls may fail.
