Intelligent Compute Platform · Unified Model Gateway

One API,
every model.

70+ leading models from 10+ vendors behind one API. OpenAI-compatible. Swap one line of base URL — invoke any model.

✓ OpenAI / Anthropic SDK compatible✓ Per-token, pay-as-you-go✓ Multi-vendor automatic failover✓ Localized support · invoicing
# Just swap base_url and api_key
from openai import OpenAI
client = OpenAI(
    base_url="https://nodekey.xinghanyun.cn/v1",
    api_key="sk-xxxxxxxxxxxxxxxxxxx",
)
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
70+
Models · Leading models
10+
Vendors · Vendors
99.99%
Uptime · Platform uptime
OpenAI-compat
Compatible · Zero-migration SDK
Pricing

Models we support

10+ providers, 70+ active models. Switch provider to see live unit prices.

See all model pricing
More
Loading pricing…

CNY / 1M tokens · Live prices, synced with the pricing center; pay by token, no subscription, smart routing auto-picks the cheapest fitting model.

Why Intelligent Compute Platform

Why Intelligent Compute Platform

Five reasons, one switch

Every AI call burns tokens — the token is the unit of account of the AI era. Intelligent Compute Platform unifies token aggregation, routing, metering, and settlement in one place: broadest coverage, smart routing, transparent pricing, production-grade reliability, localized service — from indie developers to enterprises, the single entry point to AI models.

Breadth
70+
Models

Broadest reach — one interface

70+ leading models from 10+ vendors, all behind a single base URL. No more juggling separate accounts for OpenAI, Anthropic, DeepSeek, or Qwen.

Smart Routing
-30%↓
Avg. cost

Pick the right model per task

Routes by task type, context length, latency, and cost target — light models for intent classification, high-end models for code and reasoning, long-context models for long-form. The same workload can cost 30–60% less.

Pricing
Pay-as-you-go
No lock-in

Transparent, pay-as-you-go

Charged on actual token usage. No subscriptions. Model prices update in real time and roll up into one invoice — easy to expense and split across teams.

Reliability
99.99%
Uptime

Multi-vendor failover, no outages

When one model fails, routes automatically to a same-capability backup. Production SLA targets 99.99% uptime. Edge nodes route locally for lower latency.

Local Service
24/7
Local support

Localized support, top to bottom

Localized documentation, WeChat/WeCom support, B2B invoicing, bank reconciliation, data-processing compliance, private deployment options. Built around what regional enterprises actually need.

Quickstart

From signup to first call in 5 minutes

Three steps to live
01

Sign up and get an API Key

Open the console and create your API Key. Enterprise tier — talk to sales for dedicated quotas and billing.

02

Swap one base URL

If you already use the OpenAI / Anthropic SDK, change only the base URL and API Key. Nothing else needs to move.

03

Top up and start calling

Pay-as-you-go billing. The console shows usage, cost, and routing decisions in real time.

Console

Visible usage, clear bills

Observe, analyze, optimize
Visible usage, clear bills

Real-time usage

See call counts and token consumption by model, project, or API key.

Cost dashboard

Roll up cross-model spend onto one invoice. Split by department, aggregate by day.

Routing observability

Per-call routing decision, latency, and success rate — all visible.

Built for

From indie developers to enterprises

For every scale

Indie developers

One account for every model

  • Pay-as-you-go — no subscription, no minimum
  • One top-up unlocks every model
  • All billing in one wallet
  • 24-hour top-up & settlement
Most popular

AI startups

Ship without delay, fail-over by default

  • 70+ models in one integration, switchable any time
  • Multi-vendor failover keeps your product up
  • Usage dashboard split by project / team
  • Enterprise inquiry response within 4 hours

Enterprises

Private deployment & dedicated SLA

  • Dedicated cluster & private deployment
  • Data redaction + compliance audit logs
  • B2B invoicing, bank reconciliation, annual reports
  • Security & compliance support
Enterprise

Enterprise capabilities

Built for scale & compliance

Dedicated clusters, compliance audit, B2B settlement, and private deployment for mid-to-large enterprises.

01

Dedicated cluster

Isolated compute pool with dedicated rate-limit policies — no noisy neighbors.

02

Enterprise invoicing

B2B transfers, VAT special invoices, bank reconciliation, monthly rollups.

03

SLA & failover drills

Production-grade availability commitment. Quarterly failover drills and DR cutover.

04

Data compliance

Keep data in-region, redaction in transit, audit logs, security-tier compliance support.

05

Private deployment

Run Intelligent Compute Platform's scheduling layer in your own IDC. Models can stay flexible (self-hosted / partner / public).

06

Custom routing

Routing policy tailored to your priorities — business value, region, compliance.

FAQ

Developer questions

Frequently asked
How is Intelligent Compute Platform different from OpenAI's official API?

Intelligent Compute Platform is a unified model gateway — one account, one integration, 70+ models. For regional developers we add localized invoicing, B2B reconciliation, local payment methods, and compliance — removing the friction of cross-border accounts and payments.

Compatible with the OpenAI SDK?

Fully. Existing OpenAI-SDK code only needs the base URL and API Key swapped — call shapes, parameters, and response formats are identical. The native Anthropic spec is also supported.

How does billing work?

Per-token usage is metered consistently across models. Pay-as-you-go after a top-up — no subscription, no minimum. Enterprises get monthly reconciliation and VAT special invoices. See the pricing center for details.

Can you guarantee uptime?

We target 99.99% uptime in production. The multi-vendor routing policy automatically failovers to a same-capability backup on outage — degradation happens transparently at the request level.

Is private deployment supported?

Yes — the enterprise tier supports running Intelligent Compute Platform's scheduler and metering layer in customer IDCs, with model choice still flexible (self-hosted / partner / public). Talk to sales for specifics.

Will my data be used to train models?

Intelligent Compute Platform itself does not store call content and does not train on customer data. Each upstream vendor's data policy applies — we evaluate compliance during onboarding.

One API, every model

Five lines of code to your first call

Sign up, get an API key, swap the base URL — start invoking 70+ leading models.