Subscription Plans
The ZenMux Builder Plan subscription offers individual developers a fixed monthly fee and a predictable way to call AI models—so you can focus on coding and creating without worrying about the cost of each API request.
What Are Flows?
A Flow is ZenMux’s composite billing unit that combines token consumption and per-request overhead. You can think of it as a currency—just as different products have different prices in USD, different AI models consume different numbers of Flows per request. The Flow/USD exchange rate may fluctuate over time, and the latest rate is always published in real time on the Pricing page.
💱 Real-Time Flow/USD Exchange Rate

💡 Screenshot example
The Flow/USD exchange rate and related data are dynamically calculated. The screenshot is for reference only. Please refer to the real-time data shown on the Pricing page.
Exchange rate notes
The Flow/USD exchange rate is currently anchored at 1 Flow = $0.02525 (approximately 40 Flows = $1). This rate may be adjusted periodically based on market conditions and model pricing changes. The latest exchange rate will always be published and displayed here in real time.
💡 Insider member Flow value
The standard Flow value above applies to all regular subscription users. However, for early Insider members who maintain an active, continuous subscription (no interruptions), each Flow will have a higher USD equivalent value as a loyalty reward—meaning you get more value per Flow consumed.
Important — Abuse policy
For accounts found violating the Builder Plan Terms of Service (e.g., automated abuse, resource hoarding, unauthorized reselling, etc.), the effective Flow value will be reduced below the standard rate. This means the USD equivalent value per Flow will decrease. Please use your subscription responsibly.
📊 Plan Comparison — Monthly Max Flows and USD Equivalent Value

💡 Screenshot example
Plan comparison data is dynamically calculated. The screenshot is for reference only. Please refer to the real-time data shown on the Pricing page.
| Plan | Price | 5h Quota (Flows) | Weekly Max Flows | Monthly Max Flows | USD Equivalent Value | Value Multiplier |
|---|---|---|---|---|---|---|
| Free | $0/mo | 5 | 50.4 | 216 | - | - |
| Pro | $20/mo | 50 | 504 | 2,160 | $54.55 | 2.73x |
| Max | $100/mo | 300 | 3,024 | 12,960 | $327.27 | 3.27x |
| Ultra | $200/mo | 800 | 8,064 | 34,560 | $872.73 | 4.36x |
USD Equivalent Value = Monthly Max Flows × Flow unit price ($0.02525/Flow)
Value Multiplier = USD Equivalent Value / Plan price — indicates how many times more API value you get compared to the subscription fee
Why Choose the Builder Plan?
💡 Key Benefits
| Pain Point Scenario | Subscription Solution |
|---|---|
| Worried about burning money while vibe coding | Fixed pricing starting at $20/month—code freely |
| High cost to learn new tech | Explore a wide range of AI models at low cost |
| Messy multi-platform account management | One API Key to call all models |
| Diverse use cases | Coding + image generation + chat—full coverage |
🚀 Three Core Values
Full use-case model coverage
The Builder Plan covers three major model categories. Whether you’re a developer, designer, product manager, or operator, one subscription meets the full-spectrum needs of Vibe Builders:
Model Category Representative Models Coding models Claude Opus 4.5 / GPT-5.2-Codex / Gemini-3-Pro-Preview ... Image generation NanoBananaPro / GPT-Image-1.5 ... (rolling out) Text generation GPT-5.2 / Qwen3-Max-Thinking / ERNIE 5.0 ... An all-star model lineup
One subscription, orchestrating world-class models (Gemini 2.5 Pro, the GPT‑5 series, Claude Opus/Sonnet 4 series, etc.). Get access to the latest top models immediately—like commanding the strongest compute fleet on the internet.
Seamless IDE integration
No tool lock-in. One subscription API Key works across mainstream community developer tools such as Claude Code, Cursor, CodeX, and more.
Plan Comparison

Free - Free Trial

Supported models:
deepseek/deepseek-chat- DeepSeek-V3.2 (Non-thinking Mode)deepseek/deepseek-reasoner- DeepSeek-R1.8inclusionai/ling-1t- inclusionAI: Ling-1Tinclusionai/ling-mini-2.0- inclusionAI: Ling-Mini-2.0inclusionai/ring-1t- inclusionAI: Ring-1Tinclusionai/ring-mini-2.0- inclusionAI: Ring-Mini-2.0minimax/minimax-m2.1- MiniMax: MiniMax M2.1stepfun/step-3- StepFun: Step-3volcengine/doubao-seed-1.8- VolcanoEngine: Doubao-Seed-1.8xiaomi/mimo-v2-flash- Xiaomi: MiMo-V2-Flashz-ai/glm-4.6v-flash- Z.AI: GLM 4.6V Flashz-ai/glm-4.7- Z.AI: GLM 4.7
Pro - Top Choice for Developers

Supported models: 70+ premium models, organized by provider below (model slug - description)
Anthropic Claude Series
anthropic/claude-opus-4.6- Claude Opus 4.6anthropic/claude-opus-4.5- Claude Opus 4.5anthropic/claude-sonnet-4.5- Claude Sonnet 4.5anthropic/claude-haiku-4.5- Claude Haiku 4.5anthropic/claude-opus-4.1- Claude Opus 4.1anthropic/claude-opus-4- Claude Opus 4anthropic/claude-3.5-sonnet- Claude 3.5 Sonnetanthropic/claude-3.5-haiku- Claude 3.5 Haikuanthropic/claude-3.7-sonnet- Claude 3.7 Sonnetanthropic/claude-sonnet-4- Claude Sonnet 4
OpenAI GPT Series
openai/gpt-5.2- GPT‑5.2openai/gpt-5.2-chat- GPT‑5.2 Chatopenai/gpt-5.2-codex- GPT‑5.2 Codexopenai/gpt-5- GPT‑5openai/gpt-5-chat- GPT‑5 Chatopenai/gpt-5-codex- GPT‑5 Codexopenai/gpt-5-mini- GPT‑5 Miniopenai/gpt-5-nano- GPT‑5 Nanoopenai/gpt-5.1- GPT‑5.1openai/gpt-5.1-chat- GPT‑5.1 Chatopenai/gpt-5.1-codex- GPT‑5.1 Codexopenai/gpt-5.1-codex-mini- GPT‑5.1 Codex Miniopenai/gpt-4.1- GPT‑4.1openai/gpt-4.1-mini- GPT‑4.1 Miniopenai/gpt-4.1-nano- GPT‑4.1 Nanoopenai/gpt-4o- GPT‑4oopenai/gpt-4o-mini- GPT‑4o Miniopenai/o4-mini- o4-mini
Google Gemini / Gemma Series
google/gemini-2.0-flash- Gemini 2.0 Flashgoogle/gemini-2.0-flash-lite-001- Gemini 2.0 Flash Litegoogle/gemini-2.5-flash- Gemini 2.5 Flashgoogle/gemini-2.5-flash-lite- Gemini 2.5 Flash Litegoogle/gemini-2.5-flash-image- Gemini 2.5 Flash Imagegoogle/gemini-2.5-pro- Gemini 2.5 Progoogle/gemini-3-flash-preview- Gemini 3 Flash Previewgoogle/gemini-3-pro-preview- Gemini 3 Pro Previewgoogle/gemini-3-pro-image-preview- Gemini 3 Pro Image Previewgoogle/gemma-3-12b-it- Gemma 3 12B IT
xAI Grok Series
x-ai/grok-4- Grok 4x-ai/grok-4-fast- Grok 4 Fastx-ai/grok-4-fast-non-reasoning- Grok 4 Fast Non‑Reasoningx-ai/grok-4.1-fast- Grok 4.1 Fastx-ai/grok-4.1-fast-non-reasoning- Grok 4.1 Fast Non‑Reasoningx-ai/grok-code-fast-1- Grok Code Fast 1
Z.AI GLM Series
z-ai/glm-4.6v- GLM 4.6Vz-ai/glm-4.6v-flash- GLM 4.6V Flashz-ai/glm-4.7- GLM 4.7z-ai/glm-4.5- GLM 4.5z-ai/glm-4.5-air- GLM 4.5 Airz-ai/glm-4.6- GLM 4.6
DeepSeek Series
deepseek/deepseek-chat- DeepSeek Chatdeepseek/deepseek-chat-v3.1- DeepSeek Chat V3.1deepseek/deepseek-v3.2- DeepSeek V3.2deepseek/deepseek-v3.2-exp- DeepSeek V3.2 Expdeepseek/deepseek-r1-0528- DeepSeek R1 0528deepseek/deepseek-reasoner- DeepSeek Reasoner
Qwen Series
qwen/qwen3-coder- Qwen3 Coderqwen/qwen3-coder-plus- Qwen3 Coder Plusqwen/qwen3-max- Qwen3 Maxqwen/qwen3-max-preview- Qwen3 Max Previewqwen/qwen3-vl-plus- Qwen3 VL Plusqwen/qwen3-14b- Qwen3 14Bqwen/qwen3-235b-a22b-2507- Qwen3 235B A22B 2507qwen/qwen3-235b-a22b-thinking-2507- Qwen3 235B A22B Thinking 2507
Moonshot / Kimi Series
moonshotai/kimi-k2.5- Kimi K2.5moonshotai/kimi-k2-thinking- Kimi K2 Thinkingmoonshotai/kimi-k2-thinking-turbo- Kimi K2 Thinking Turbomoonshotai/kimi-k2-0711- Kimi K2 0711moonshotai/kimi-k2-0905- Kimi K2 0905
Baidu ERNIE Series
baidu/ernie-5.0-thinking-preview- ERNIE 5.0 Thinking Previewbaidu/ernie-x1.1-preview- ERNIE X1.1 Preview
InclusionAI Series
inclusionai/ling-1t- Ling‑1Tinclusionai/ling-flash-2.0- Ling Flash 2.0inclusionai/ling-mini-2.0- Ling Mini 2.0inclusionai/llada2.0-flash-cap- LLADA 2.0 Flash Capinclusionai/ming-flash-omni-preview- Ming Flash Omni Previewinclusionai/ring-1t- Ring‑1Tinclusionai/ring-flash-2.0- Ring Flash 2.0inclusionai/ring-mini-2.0- Ring Mini 2.0
Meta Llama Series
meta/llama-3.3-70b-instruct- Llama 3.3 70B Instructmeta/llama-4-scout-17b-16e-instruct- Llama 4 Scout 17B 16E Instruct
Mistral Series
mistralai/mistral-large-2512- Mistral Large 2512
MiniMax Series
minimax/minimax-m2-her- MiniMax M2 herminimax/minimax-m2.1- MiniMax M2.1minimax/minimax-m2- MiniMax M2
Kuaishou
kuaishou/kat-coder-pro-v1- KAT‑Coder‑Pro‑V1
Stepfun
stepfun/step-3- Step 3
Volcengine Doubao
volcengine/doubao-seed-1-6-vision- Doubao Seed 1.6 Visionvolcengine/doubao-seed-1.8- Doubao Seed 1.8volcengine/doubao-seed-code- Doubao Seed Code
Xiaomi
xiaomi/mimo-v2-flash- MiMo V2 Flash
Image Generation Models
nanobanana/nanobanana-pro- NanoBananaPro (2K resolution, supports multiple aspect ratios such as 16:9)openai/gpt-image-1.5- GPT-Image-1.5 (coming soon)tencent/hunyuan-image3- Hunyuan-Image3 (coming soon)
Max - High-Intensity Development

Additional ultra-flagship models:
openai/gpt-5.2-pro- GPT-5.2 Proopenai/gpt-5-pro- GPT-5 Pro
Ultra - Professional-Grade Flagship

Supported models: Same as the Max plan, including all premium models and ultra-flagship models.
Usage Limits
⚠️ Important
Subscription plans are designed for personal development, learning/exploration, and vibe coding in non-production scenarios. Please follow the usage guidelines below:
Rate Limits
- Rate Limit: 10-15 RPM (requests per minute)
- Quota window: Refreshes within a rolling 5-hour window
- Weekly limit: Resets within a rolling weekly window
Applicable Scenarios
✅ Allowed:
- Personal development and learning
- Vibe coding and rapid prototyping
- Technical exploration and experimentation
- Personal projects and non-commercial applications
❌ Not allowed:
- Production environments that are already live
- Commercial products or services
- End-user-facing applications
- Abusive behaviors such as multi-account pooling/rotation
💡 Production recommendation
If your project is about to go live or is already commercialized, switch to the Pay-As-You-Go usage-based plan to get:
- Higher SLA coverage
- More stable service quality
- More flexible scalability
- Professional business support
How to Subscribe
Step 1: Review plan details
Visit the ZenMux Pricing page to see detailed information and pricing for all subscription plans.

Step 2: Choose and subscribe
- Select the plan that fits your needs on the Pricing page
- Click "Get Max" or "Get Ultra" for the corresponding plan (Pro users click "Upgrade")
- Complete the payment flow
💡 Public beta seat limit
Subscriptions are now available in public beta, with a total of 999 seats. Once all seats are taken, new sign-ups will be temporarily closed. Subscribe early to secure access.
Step 3: Manage your subscription and get an API Key
After subscribing successfully, visit the Subscription Management page:

📊 View usage
- Usage and remaining time in the current 5-hour window
- This week’s cumulative usage stats
- Flow consumption breakdown
🔑 Get a subscription API Key
- Generate an API Key dedicated to subscriptions
- Manage and rotate existing keys
- View the key’s last used time
💳 Manage subscription
- View current plan information
- Upgrade or downgrade your plan
- View billing history
Extra Usage - Automatic Overage Switching
When enabled, once your Builder Plan subscription quota hits the 5-hour or weekly limit, it automatically switches to your selected Pay As You Go Key to ensure uninterrupted usage; when the quota resets, it automatically switches back to the subscription Key.
💡 Key Benefits
- Seamless switching - Automatically switches to pay-as-you-go when quota runs out, no manual API Key changes needed
- Uninterrupted workflows - No impact on your dev/coding/chat flows
- Automatic recovery - Switches back to subscription billing once quota is restored
- Flexible control - Enable or disable anytime
Setup Steps
Go to the Subscription Management page and follow the steps below to configure Extra Usage:
Step 1: Enable Extra Usage
In Subscription Management, find the Extra Usage section and toggle the switch on the right to enable it.

Step 2: Select a Pay As You Go API Key
After toggling, a "Select a Key" dialog will pop up. You can:
- Choose an existing Pay As You Go API Key from the dropdown
- Or click "Create new key" to create a new pay-as-you-go API Key

After selecting, click "Select" to confirm.
Step 3: Done
Once enabled, the page will display details of the Pay As You Go API Key you selected, including:
- API Key name and key value
- Enablement status
- Created time and last used time
- Current amount spent
- Action options (reselect another key)

How It Works
- Normal usage - Subscription quota (Flows) is used first
- Quota exhausted - When the 5-hour or weekly window quota reaches the limit, it automatically switches to the configured Pay As You Go API Key
- Usage-based billing - Calls are charged against your account balance
- Automatic recovery - When the subscription quota window resets, it automatically switches back to subscription billing
💡 Recommendations
- Make sure your Pay As You Go account has sufficient balance to avoid interruptions after switching
- You can view Extra Usage consumption and spending at any time on the Subscription Management page
- To change the backup key, click "Reselect" and choose again
⚠️ Billing reminder
After enabling Extra Usage, when your subscription quota runs out, charges will be automatically deducted from your Pay As You Go balance. Monitor your balance to avoid unexpected costs.
Step 4: Use it in developer tools
After you obtain your subscription API Key, you can use it across developer tools and applications.
Same as Pay As You Go
Subscription API Keys work exactly the same way as Pay As You Go, supporting the OpenAI SDK, Anthropic SDK, and direct HTTP calls. The only difference is that you use the subscription-specific API Key (prefixed with sk-ss-v1-), and usage is deducted from your subscription quota rather than your balance.
💡 API call example
For complete API call examples, see the Quickstart guide. Just replace the API Key in the examples with your subscription API Key.
Integrate with mainstream developer tools
Subscription API Keys can be seamlessly integrated into various AI coding tools and apps. For detailed configuration steps, see:
🔧 AI Coding Tools
- Claude Code Integration Guide - Anthropic official CLI tool
- CodeX Integration Guide - Intelligent code editor
- Cline Integration Guide - VS Code AI assistant extension
- VS Code Copilot Integration Guide - GitHub Copilot alternative
- Neovate Integration Guide - Modern AI coding tool
- OpenCode Integration Guide - Open-source AI coding assistant
💬 Knowledge Management & Chat Tools
- Cherry Studio Integration Guide - Desktop AI chat app
- Obsidian Integration Guide - Knowledge management AI plugin
- Sider Integration Guide - Browser sidebar AI assistant
🤖 AI App Platforms
- Dify Integration Guide - LLM app development platform
- Open WebUI Integration Guide - Self-hosted AI chat UI
📚 More integrations
More integration guides are continuously being added. If you need help, visit the Discord community or contact technical support.
Using Subscriptions in Studio Chat
In addition to using your subscription quota via API Key in developer tools, you can also use your subscription quota directly in the ZenMux Studio Chat web app.
Switching billing modes

On the Studio Chat page, you can choose whether each conversation uses subscription quota or your Pay As You Go balance:
As shown, you can find the Billing Mode option in the conversation settings:
- Subscription - Uses subscription quota (Flows) and does not consume account balance
- Pay As You Go - Charges against your account balance based on actual usage
💡 Flexible switching
You can use different billing modes for different conversations. For example:
- Daily development, learning, and prototype validation → Subscription
- Production testing and commercial project validation → Pay As You Go
You can switch between the two modes anytime without affecting each other, so you can always pick the best billing method for each scenario.
⚠️ Shared quota pool
Studio Chat and API calls share the same subscription quota pool. Conversations in Studio Chat consume subscription Flows, so allocate your usage accordingly.
FAQ
What’s the difference between subscriptions and pay-as-you-go?
| Feature | Subscription (Builder Plan) | Pay-As-You-Go |
|---|---|---|
| Billing model | Fixed monthly fee | Usage-based billing |
| Best for | Personal dev, learning | Production, commercial apps |
| Cost predictability | ✅ High (fixed monthly fee) | ⚠️ Medium (varies with usage) |
| Rate limits | 10-15 RPM | Higher, customizable |
| SLA | Standard | Higher |
| Value multiplier | ✅ 2.73-4.36x | Standard API pricing |
When does the quota reset?
- 5-hour window: Uses a rolling window mechanism. Metering starts when you send a request and resets every 5 hours.
- Weekly limit: Metering starts when you send a request and resets every 7 days.
You can view remaining quota and reset times in real time on the Subscription Management page.
Can I use multiple plans at the same time?
No. One account can only have one active subscription plan at a time. If you need higher limits, upgrade to Max or Ultra.
Next Steps
Now that you understand how the Builder Plan subscription works, you can:
- 📚 Read the Quickstart guide to learn detailed API usage
- 🔧 Check Best Practices to integrate your subscription API Key into developer tools
- 💰 Learn about the Pay-as-you-go plan for production billing
- 📊 Visit the Usage Analytics page to monitor subscription quota consumption in real time
- 💵 Review Cost Analysis to learn how to optimize costs
Contact Us
If you run into any issues, feel free to reach out via:
- Official website: https://zenmux.ai
- Support email: [email protected]
- Business email: [email protected]
- Twitter: @ZenMuxAI
- Discord: https://discord.gg/vHZZzj84Bm
For more contact options and details, visit our Contact page.