Skip to content
GET
Lang

List Models(OpenAI Chat Completion Protocol)

GET https://zenmux.ai/api/v1/models

This endpoint is used to retrieve information about the OpenAI-compatible models supported by the platform.

Request params

This endpoint does not require any request parameters.

Returns

Returns a JSON object containing information for all available models.

data array

An array of models, containing detailed information for all available models.

object string

The object type, fixed as "list".

data object

id string

The model’s unique identifier, in the format <provider>/<model_name>.

object string

The object type, fixed as "model".

display_name string

The model’s display name, used for UI presentation.

created integer

The model creation timestamp (Unix timestamp).

owned_by string

The model provider.

input_modalities array

The input modalities supported by the model. Possible values include:

  • "text" - Text input
  • "image" - Image input
  • "video" - Video input
  • "audio" - Audio input
  • "file" - File input

output_modalities array

The output modalities supported by the model. Possible values include:

  • "text" - Text output
  • "image" - Image output
  • "video" - Video output
  • "audio" - Audio output
  • "file" - File output

capabilities object

The model’s capabilities.

capabilities.reasoning boolean

Whether reasoning is supported. true means reasoning is supported; false means it is not.

context_length integer

The context length limit, i.e., the maximum number of tokens the model can process.

pricings object

The pricing object, containing pricing configurations for different types of usage.

pricings.prompt array

Pricing configuration array for processing input text.

pricings.completion array

Pricing configuration array for generated output text.

pricings.input_cache_read array

Pricing configuration array for reading input data from cache.

pricings.input_cache_write_5_min array

Pricing configuration array for writing to cache with a 5-minute retention.

pricings.input_cache_write_1_h array

Pricing configuration array for writing to cache with a 1-hour retention.

pricings.input_cache_write array

Pricing configuration array for writing to cache.

pricings.web_search array

Pricing configuration array for invoking web search (optional; supported by some models).

pricings.internal_reasoning array

Pricing configuration array for the model’s internal reasoning process (optional; supported by some advanced reasoning models). Additional charges apply when the model enables internal chain-of-thought or detailed reasoning processes.

pricings.video array

Pricing configuration array for video output processing (optional; models that support video understanding). Billed by video duration, resolution, or frame count.

pricings.image array

Pricing configuration array for image output processing (optional; models that support image understanding). Typically billed by image count, resolution, or pixel count.

pricings.audio array

Pricing configuration array for audio output processing (optional; models that support audio understanding). Billed by audio duration or processing volume.

pricings.audio_and_video array

Pricing configuration array for generating video content with audio (optional; models that support multimodal audio-video understanding). Applies to scenarios that require analyzing both video frames and audio content. Note: there are two video generation scenarios—silent video uses pricings.video, while video with audio uses pricings.audio_and_video.

Pricing item schema

Each pricing array within the pricings object (such as completion, prompt, etc.) contains one or more pricing configuration objects. Each pricing configuration object includes the following fields:

value number

The discounted effective price. Free services show as 0.

unit string

The pricing unit. Possible values include:

  • "perMTokens" - Per million tokens
  • "perCount" - Per call
  • "perSecond" - Per second (for time-based billing scenarios such as audio and video)

currency string

The currency type, fixed as "USD", meaning US dollars.

conditions object

Pricing conditions (optional), commonly used for tiered pricing.

conditions.prompt_tokens object

Token-usage condition for the input content provided by the user.

conditions.completion_tokens object

Token-usage condition for tokens consumed when the model generates the response.

Pricing conditions schema

When a pricing configuration includes the conditions field, it defines the specific conditions under which the price applies. The condition objects for prompt_tokens and completion_tokens include the following fields:

unit string

The token unit, fixed as "kTokens", meaning thousand tokens (1000 tokens).

gte number

Minimum token count (inclusive). The actual token count must be ≥ this value.

lte number

Maximum token count (inclusive). The actual token count must be ≤ this value.

gt number

Minimum token count (exclusive). The actual token count must be > this value.

lt number

Maximum token count (exclusive). The actual token count must be < this value; null indicates no upper bound.

json
{
  "data": [
    {
      "id": "anthropic/claude-sonnet-4.5",
      "object": "model",
      "display_name": "Anthropic: Claude Sonnet 4.5",
      "created": 1759196009,
      "owned_by": "anthropic",
      "input_modalities": ["text", "image", "file"],
      "output_modalities": ["text"],
      "capabilities": {
        "reasoning": true
      },
      "context_length": 200000,
      "pricings": {
        "completion": [
          {
            "value": 15,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 0,
                "lt": 200
              }
            }
          },
          {
            "value": 22.5,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 200
              }
            }
          }
        ],
        "input_cache_read": [
          {
            "value": 0.3,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 0,
                "lt": 200
              }
            }
          },
          {
            "value": 0.6,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 200
              }
            }
          }
        ],
        "input_cache_write_1_h": [
          {
            "value": 6,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 0,
                "lt": 200
              }
            }
          },
          {
            "value": 12,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 200
              }
            }
          }
        ],
        "input_cache_write_5_min": [
          {
            "value": 3.75,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 0,
                "lt": 200
              }
            }
          },
          {
            "value": 7.5,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 200
              }
            }
          }
        ],
        "prompt": [
          {
            "value": 3,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 0,
                "lt": 200
              }
            }
          },
          {
            "value": 6,
            "unit": "perMTokens",
            "currency": "USD",
            "conditions": {
              "prompt_tokens": {
                "unit": "kTokens",
                "gte": 200
              }
            }
          }
        ],
        "web_search": [
          {
            "value": 0.01,
            "unit": "perCount",
            "currency": "USD"
          }
        ]
      }
    },
    {
      "id": "openai/gpt-5.2",
      "object": "model",
      "display_name": "OpenAI: GPT-5.2",
      "created": 1765438613,
      "owned_by": "openai",
      "input_modalities": ["image", "text", "file"],
      "output_modalities": ["text"],
      "capabilities": {
        "reasoning": true
      },
      "context_length": 400000,
      "pricings": {
        "completion": [
          {
            "value": 14,
            "unit": "perMTokens",
            "currency": "USD"
          }
        ],
        "input_cache_read": [
          {
            "value": 0.175,
            "unit": "perMTokens",
            "currency": "USD"
          }
        ],
        "prompt": [
          {
            "value": 1.75,
            "unit": "perMTokens",
            "currency": "USD"
          }
        ],
        "web_search": [
          {
            "value": 0.01,
            "unit": "perCount",
            "currency": "USD"
          }
        ]
      }
    }
  ],
  "object": "list"
}
cURL
curl https://zenmux.ai/api/v1/models