Skip to content

Subscription Plans

The ZenMux Builder Plan subscription offers individual developers a fixed monthly fee and a predictable way to call AI models—so you can focus on coding and creating without worrying about the cost of each API request.

What Are Flows?

A Flow is ZenMux’s composite billing unit that combines token consumption and per-request overhead. You can think of it as a currency—just as different products have different prices in USD, different AI models consume different numbers of Flows per request. The Flow/USD exchange rate may fluctuate over time, and the latest rate is always published in real time on the Pricing page.

💱 Real-Time Flow/USD Exchange Rate

Real-time Flow/USD exchange rate

💡 Screenshot example

The Flow/USD exchange rate and related data are dynamically calculated. The screenshot is for reference only. Please refer to the real-time data shown on the Pricing page.

Exchange rate notes

The Flow/USD exchange rate is currently anchored at 1 Flow = $0.02525 (approximately 40 Flows = $1). This rate may be adjusted periodically based on market conditions and model pricing changes. The latest exchange rate will always be published and displayed here in real time.

💡 Insider member Flow value

The standard Flow value above applies to all regular subscription users. However, for early Insider members who maintain an active, continuous subscription (no interruptions), each Flow will have a higher USD equivalent value as a loyalty reward—meaning you get more value per Flow consumed.

Important — Abuse policy

For accounts found violating the Builder Plan Terms of Service (e.g., automated abuse, resource hoarding, unauthorized reselling, etc.), the effective Flow value will be reduced below the standard rate. This means the USD equivalent value per Flow will decrease. Please use your subscription responsibly.

📊 Plan Comparison — Monthly Max Flows and USD Equivalent Value

Plan comparison - monthly max Flows and USD equivalent value

💡 Screenshot example

Plan comparison data is dynamically calculated. The screenshot is for reference only. Please refer to the real-time data shown on the Pricing page.

PlanPrice5h Quota (Flows)Weekly Max FlowsMonthly Max FlowsUSD Equivalent ValueValue Multiplier
Free$0/mo550.4216--
Pro$20/mo505042,160$54.552.73x
Max$100/mo3003,02412,960$327.273.27x
Ultra$200/mo8008,06434,560$872.734.36x

USD Equivalent Value = Monthly Max Flows × Flow unit price ($0.02525/Flow)

Value Multiplier = USD Equivalent Value / Plan price — indicates how many times more API value you get compared to the subscription fee

Why Choose the Builder Plan?

💡 Key Benefits

Pain Point ScenarioSubscription Solution
Worried about burning money while vibe codingFixed pricing starting at $20/month—code freely
High cost to learn new techExplore a wide range of AI models at low cost
Messy multi-platform account managementOne API Key to call all models
Diverse use casesCoding + image generation + chat—full coverage

🚀 Three Core Values

  1. Full use-case model coverage

    The Builder Plan covers three major model categories. Whether you’re a developer, designer, product manager, or operator, one subscription meets the full-spectrum needs of Vibe Builders:

    Model CategoryRepresentative Models
    Coding modelsClaude Opus 4.5 / GPT-5.2-Codex / Gemini-3-Pro-Preview ...
    Image generationNanoBananaPro / GPT-Image-1.5 ... (rolling out)
    Text generationGPT-5.2 / Qwen3-Max-Thinking / ERNIE 5.0 ...
  2. An all-star model lineup

    One subscription, orchestrating world-class models (Gemini 2.5 Pro, the GPT‑5 series, Claude Opus/Sonnet 4 series, etc.). Get access to the latest top models immediately—like commanding the strongest compute fleet on the internet.

  3. Seamless IDE integration

    No tool lock-in. One subscription API Key works across mainstream community developer tools such as Claude Code, Cursor, CodeX, and more.

Plan Comparison

subscription-free

Free - Free Trial


subscription-free

Supported models:

  • deepseek/deepseek-chat - DeepSeek-V3.2 (Non-thinking Mode)
  • deepseek/deepseek-reasoner - DeepSeek-R1.8
  • inclusionai/ling-1t - inclusionAI: Ling-1T
  • inclusionai/ling-mini-2.0 - inclusionAI: Ling-Mini-2.0
  • inclusionai/ring-1t - inclusionAI: Ring-1T
  • inclusionai/ring-mini-2.0 - inclusionAI: Ring-Mini-2.0
  • minimax/minimax-m2.1 - MiniMax: MiniMax M2.1
  • stepfun/step-3 - StepFun: Step-3
  • volcengine/doubao-seed-1.8 - VolcanoEngine: Doubao-Seed-1.8
  • xiaomi/mimo-v2-flash - Xiaomi: MiMo-V2-Flash
  • z-ai/glm-4.6v-flash - Z.AI: GLM 4.6V Flash
  • z-ai/glm-4.7 - Z.AI: GLM 4.7

Pro - Top Choice for Developers


subscription-free

Supported models: 70+ premium models, organized by provider below (model slug - description)


Anthropic Claude Series

  • anthropic/claude-opus-4.6 - Claude Opus 4.6
  • anthropic/claude-opus-4.5 - Claude Opus 4.5
  • anthropic/claude-sonnet-4.5 - Claude Sonnet 4.5
  • anthropic/claude-haiku-4.5 - Claude Haiku 4.5
  • anthropic/claude-opus-4.1 - Claude Opus 4.1
  • anthropic/claude-opus-4 - Claude Opus 4
  • anthropic/claude-3.5-sonnet - Claude 3.5 Sonnet
  • anthropic/claude-3.5-haiku - Claude 3.5 Haiku
  • anthropic/claude-3.7-sonnet - Claude 3.7 Sonnet
  • anthropic/claude-sonnet-4 - Claude Sonnet 4

OpenAI GPT Series

  • openai/gpt-5.2 - GPT‑5.2
  • openai/gpt-5.2-chat - GPT‑5.2 Chat
  • openai/gpt-5.2-codex - GPT‑5.2 Codex
  • openai/gpt-5 - GPT‑5
  • openai/gpt-5-chat - GPT‑5 Chat
  • openai/gpt-5-codex - GPT‑5 Codex
  • openai/gpt-5-mini - GPT‑5 Mini
  • openai/gpt-5-nano - GPT‑5 Nano
  • openai/gpt-5.1 - GPT‑5.1
  • openai/gpt-5.1-chat - GPT‑5.1 Chat
  • openai/gpt-5.1-codex - GPT‑5.1 Codex
  • openai/gpt-5.1-codex-mini - GPT‑5.1 Codex Mini
  • openai/gpt-4.1 - GPT‑4.1
  • openai/gpt-4.1-mini - GPT‑4.1 Mini
  • openai/gpt-4.1-nano - GPT‑4.1 Nano
  • openai/gpt-4o - GPT‑4o
  • openai/gpt-4o-mini - GPT‑4o Mini
  • openai/o4-mini - o4-mini

Google Gemini / Gemma Series

  • google/gemini-2.0-flash - Gemini 2.0 Flash
  • google/gemini-2.0-flash-lite-001 - Gemini 2.0 Flash Lite
  • google/gemini-2.5-flash - Gemini 2.5 Flash
  • google/gemini-2.5-flash-lite - Gemini 2.5 Flash Lite
  • google/gemini-2.5-flash-image - Gemini 2.5 Flash Image
  • google/gemini-2.5-pro - Gemini 2.5 Pro
  • google/gemini-3-flash-preview - Gemini 3 Flash Preview
  • google/gemini-3-pro-preview - Gemini 3 Pro Preview
  • google/gemini-3-pro-image-preview - Gemini 3 Pro Image Preview
  • google/gemma-3-12b-it - Gemma 3 12B IT

xAI Grok Series

  • x-ai/grok-4 - Grok 4
  • x-ai/grok-4-fast - Grok 4 Fast
  • x-ai/grok-4-fast-non-reasoning - Grok 4 Fast Non‑Reasoning
  • x-ai/grok-4.1-fast - Grok 4.1 Fast
  • x-ai/grok-4.1-fast-non-reasoning - Grok 4.1 Fast Non‑Reasoning
  • x-ai/grok-code-fast-1 - Grok Code Fast 1

Z.AI GLM Series

  • z-ai/glm-4.6v - GLM 4.6V
  • z-ai/glm-4.6v-flash - GLM 4.6V Flash
  • z-ai/glm-4.7 - GLM 4.7
  • z-ai/glm-4.5 - GLM 4.5
  • z-ai/glm-4.5-air - GLM 4.5 Air
  • z-ai/glm-4.6 - GLM 4.6

DeepSeek Series

  • deepseek/deepseek-chat - DeepSeek Chat
  • deepseek/deepseek-chat-v3.1 - DeepSeek Chat V3.1
  • deepseek/deepseek-v3.2 - DeepSeek V3.2
  • deepseek/deepseek-v3.2-exp - DeepSeek V3.2 Exp
  • deepseek/deepseek-r1-0528 - DeepSeek R1 0528
  • deepseek/deepseek-reasoner - DeepSeek Reasoner

Qwen Series

  • qwen/qwen3-coder - Qwen3 Coder
  • qwen/qwen3-coder-plus - Qwen3 Coder Plus
  • qwen/qwen3-max - Qwen3 Max
  • qwen/qwen3-max-preview - Qwen3 Max Preview
  • qwen/qwen3-vl-plus - Qwen3 VL Plus
  • qwen/qwen3-14b - Qwen3 14B
  • qwen/qwen3-235b-a22b-2507 - Qwen3 235B A22B 2507
  • qwen/qwen3-235b-a22b-thinking-2507 - Qwen3 235B A22B Thinking 2507

Moonshot / Kimi Series

  • moonshotai/kimi-k2.5 - Kimi K2.5
  • moonshotai/kimi-k2-thinking - Kimi K2 Thinking
  • moonshotai/kimi-k2-thinking-turbo - Kimi K2 Thinking Turbo
  • moonshotai/kimi-k2-0711 - Kimi K2 0711
  • moonshotai/kimi-k2-0905 - Kimi K2 0905

Baidu ERNIE Series

  • baidu/ernie-5.0-thinking-preview - ERNIE 5.0 Thinking Preview
  • baidu/ernie-x1.1-preview - ERNIE X1.1 Preview

InclusionAI Series

  • inclusionai/ling-1t - Ling‑1T
  • inclusionai/ling-flash-2.0 - Ling Flash 2.0
  • inclusionai/ling-mini-2.0 - Ling Mini 2.0
  • inclusionai/llada2.0-flash-cap - LLADA 2.0 Flash Cap
  • inclusionai/ming-flash-omni-preview - Ming Flash Omni Preview
  • inclusionai/ring-1t - Ring‑1T
  • inclusionai/ring-flash-2.0 - Ring Flash 2.0
  • inclusionai/ring-mini-2.0 - Ring Mini 2.0

Meta Llama Series

  • meta/llama-3.3-70b-instruct - Llama 3.3 70B Instruct
  • meta/llama-4-scout-17b-16e-instruct - Llama 4 Scout 17B 16E Instruct

Mistral Series

  • mistralai/mistral-large-2512 - Mistral Large 2512

MiniMax Series

  • minimax/minimax-m2-her - MiniMax M2 her
  • minimax/minimax-m2.1 - MiniMax M2.1
  • minimax/minimax-m2 - MiniMax M2

Kuaishou

  • kuaishou/kat-coder-pro-v1 - KAT‑Coder‑Pro‑V1

Stepfun

  • stepfun/step-3 - Step 3

Volcengine Doubao

  • volcengine/doubao-seed-1-6-vision - Doubao Seed 1.6 Vision
  • volcengine/doubao-seed-1.8 - Doubao Seed 1.8
  • volcengine/doubao-seed-code - Doubao Seed Code

Xiaomi

  • xiaomi/mimo-v2-flash - MiMo V2 Flash

Image Generation Models

  • nanobanana/nanobanana-pro - NanoBananaPro (2K resolution, supports multiple aspect ratios such as 16:9)
  • openai/gpt-image-1.5 - GPT-Image-1.5 (coming soon)
  • tencent/hunyuan-image3 - Hunyuan-Image3 (coming soon)

Max - High-Intensity Development


subscription-free

Additional ultra-flagship models:

  • openai/gpt-5.2-pro - GPT-5.2 Pro
  • openai/gpt-5-pro - GPT-5 Pro

Ultra - Professional-Grade Flagship


subscription-free

Supported models: Same as the Max plan, including all premium models and ultra-flagship models.

Usage Limits

⚠️ Important

Subscription plans are designed for personal development, learning/exploration, and vibe coding in non-production scenarios. Please follow the usage guidelines below:

Rate Limits

  • Rate Limit: 10-15 RPM (requests per minute)
  • Quota window: Refreshes within a rolling 5-hour window
  • Weekly limit: Resets within a rolling weekly window

Applicable Scenarios

Allowed:

  • Personal development and learning
  • Vibe coding and rapid prototyping
  • Technical exploration and experimentation
  • Personal projects and non-commercial applications

Not allowed:

  • Production environments that are already live
  • Commercial products or services
  • End-user-facing applications
  • Abusive behaviors such as multi-account pooling/rotation

💡 Production recommendation

If your project is about to go live or is already commercialized, switch to the Pay-As-You-Go usage-based plan to get:

  • Higher SLA coverage
  • More stable service quality
  • More flexible scalability
  • Professional business support

How to Subscribe

Step 1: Review plan details

Visit the ZenMux Pricing page to see detailed information and pricing for all subscription plans.

subscription-free

Step 2: Choose and subscribe

  1. Select the plan that fits your needs on the Pricing page
  2. Click "Get Max" or "Get Ultra" for the corresponding plan (Pro users click "Upgrade")
  3. Complete the payment flow

💡 Public beta seat limit

Subscriptions are now available in public beta, with a total of 999 seats. Once all seats are taken, new sign-ups will be temporarily closed. Subscribe early to secure access.

Step 3: Manage your subscription and get an API Key

After subscribing successfully, visit the Subscription Management page:

subscription-free
  • 📊 View usage

    • Usage and remaining time in the current 5-hour window
    • This week’s cumulative usage stats
    • Flow consumption breakdown
  • 🔑 Get a subscription API Key

    • Generate an API Key dedicated to subscriptions
    • Manage and rotate existing keys
    • View the key’s last used time
  • 💳 Manage subscription

    • View current plan information
    • Upgrade or downgrade your plan
    • View billing history

Extra Usage - Automatic Overage Switching

When enabled, once your Builder Plan subscription quota hits the 5-hour or weekly limit, it automatically switches to your selected Pay As You Go Key to ensure uninterrupted usage; when the quota resets, it automatically switches back to the subscription Key.

💡 Key Benefits

  • Seamless switching - Automatically switches to pay-as-you-go when quota runs out, no manual API Key changes needed
  • Uninterrupted workflows - No impact on your dev/coding/chat flows
  • Automatic recovery - Switches back to subscription billing once quota is restored
  • Flexible control - Enable or disable anytime

Setup Steps

Go to the Subscription Management page and follow the steps below to configure Extra Usage:

Step 1: Enable Extra Usage

In Subscription Management, find the Extra Usage section and toggle the switch on the right to enable it.

Enable Extra Usage

Step 2: Select a Pay As You Go API Key

After toggling, a "Select a Key" dialog will pop up. You can:

  • Choose an existing Pay As You Go API Key from the dropdown
  • Or click "Create new key" to create a new pay-as-you-go API Key
Select API Key

After selecting, click "Select" to confirm.

Step 3: Done

Once enabled, the page will display details of the Pay As You Go API Key you selected, including:

  • API Key name and key value
  • Enablement status
  • Created time and last used time
  • Current amount spent
  • Action options (reselect another key)
Extra Usage Enabled

How It Works

  1. Normal usage - Subscription quota (Flows) is used first
  2. Quota exhausted - When the 5-hour or weekly window quota reaches the limit, it automatically switches to the configured Pay As You Go API Key
  3. Usage-based billing - Calls are charged against your account balance
  4. Automatic recovery - When the subscription quota window resets, it automatically switches back to subscription billing

💡 Recommendations

  • Make sure your Pay As You Go account has sufficient balance to avoid interruptions after switching
  • You can view Extra Usage consumption and spending at any time on the Subscription Management page
  • To change the backup key, click "Reselect" and choose again

⚠️ Billing reminder

After enabling Extra Usage, when your subscription quota runs out, charges will be automatically deducted from your Pay As You Go balance. Monitor your balance to avoid unexpected costs.

Step 4: Use it in developer tools

After you obtain your subscription API Key, you can use it across developer tools and applications.

Same as Pay As You Go

Subscription API Keys work exactly the same way as Pay As You Go, supporting the OpenAI SDK, Anthropic SDK, and direct HTTP calls. The only difference is that you use the subscription-specific API Key (prefixed with sk-ss-v1-), and usage is deducted from your subscription quota rather than your balance.

💡 API call example

For complete API call examples, see the Quickstart guide. Just replace the API Key in the examples with your subscription API Key.

Integrate with mainstream developer tools

Subscription API Keys can be seamlessly integrated into various AI coding tools and apps. For detailed configuration steps, see:

🔧 AI Coding Tools

💬 Knowledge Management & Chat Tools

🤖 AI App Platforms

📚 More integrations

More integration guides are continuously being added. If you need help, visit the Discord community or contact technical support.


Using Subscriptions in Studio Chat

In addition to using your subscription quota via API Key in developer tools, you can also use your subscription quota directly in the ZenMux Studio Chat web app.

Switching billing modes

Studio Chat billing mode switch

On the Studio Chat page, you can choose whether each conversation uses subscription quota or your Pay As You Go balance:

As shown, you can find the Billing Mode option in the conversation settings:

  • Subscription - Uses subscription quota (Flows) and does not consume account balance
  • Pay As You Go - Charges against your account balance based on actual usage

💡 Flexible switching

You can use different billing modes for different conversations. For example:

  • Daily development, learning, and prototype validation → Subscription
  • Production testing and commercial project validation → Pay As You Go

You can switch between the two modes anytime without affecting each other, so you can always pick the best billing method for each scenario.

⚠️ Shared quota pool

Studio Chat and API calls share the same subscription quota pool. Conversations in Studio Chat consume subscription Flows, so allocate your usage accordingly.


FAQ

What’s the difference between subscriptions and pay-as-you-go?

FeatureSubscription (Builder Plan)Pay-As-You-Go
Billing modelFixed monthly feeUsage-based billing
Best forPersonal dev, learningProduction, commercial apps
Cost predictability✅ High (fixed monthly fee)⚠️ Medium (varies with usage)
Rate limits10-15 RPMHigher, customizable
SLAStandardHigher
Value multiplier✅ 2.73-4.36xStandard API pricing

When does the quota reset?

  • 5-hour window: Uses a rolling window mechanism. Metering starts when you send a request and resets every 5 hours.
  • Weekly limit: Metering starts when you send a request and resets every 7 days.

You can view remaining quota and reset times in real time on the Subscription Management page.

Can I use multiple plans at the same time?

No. One account can only have one active subscription plan at a time. If you need higher limits, upgrade to Max or Ultra.

Next Steps

Now that you understand how the Builder Plan subscription works, you can:

Contact Us

If you run into any issues, feel free to reach out via:

For more contact options and details, visit our Contact page.