Recharge credits for an OpenAI-compatible relay

NexRelay uses prepaid credits with separate input, output, and cached read token meters. This is an independent third-party gateway, not an official OpenAI service.

Starter

For prototypes, small tools, and light API usage.

$9recharge

9 USD credits added to wallet

Input allowance: 3M tokens
Output allowance: 600K tokens
Cached read allowance: 6M tokens
Context: 32K
Monthly cap: 10M
Max output: 1K
Rate limit: 20 RPM
Concurrency: 2
High-cost budget: $1.2/mo

Default route: gpt-5.2 -> gpt-5.3 -> gpt-5.4 -> gpt-5.5

OKAll GPT versions available with cost controls
OK32K max context and 1K max output tokens
OK2 concurrent requests, 20 RPM
OKGPT-5.5 limited to short, low-frequency requests

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

Pro

For developers and teams running production workflows.

$29recharge

29 USD credits added to wallet

Input allowance: 18M tokens
Output allowance: 4M tokens
Cached read allowance: 40M tokens
Context: 128K
Monthly cap: 62M
Max output: 4K
Rate limit: 120 RPM
Concurrency: 10
High-cost budget: $8/mo

Default route: gpt-5.3 -> gpt-5.2 -> gpt-5.4 -> gpt-5.5

OKProduction routing across GPT-5.2 to GPT-5.5
OK128K max context and 4K max output tokens
OK10 concurrent requests, 120 RPM
OKBest default for predictable margin and growth

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

Business

For higher concurrency, longer context, and priority routing.

$99recharge

99 USD credits added to wallet

Input allowance: 85M tokens
Output allowance: 18M tokens
Cached read allowance: 200M tokens
Context: 256K
Monthly cap: 303M
Max output: 8K
Rate limit: 600 RPM
Concurrency: 50
High-cost budget: $35/mo

Default route: gpt-5.4 -> gpt-5.5 -> gpt-5.3 -> gpt-5.2

OKPriority routing for high-value workloads
OK256K max context and 8K max output tokens
OK50 concurrent requests, 600 RPM
OKHigher GPT-5.5 budget with strict caps

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

Token meters

Meter	Unit price	Billing note
Input tokens	$1.00 / 1M credits	Billed from reported request usage.
Output tokens	$1.00 / 1M credits	Billed from reported request usage.
Cached read tokens	$0.10 / 1M credits	Cache hits are billed at a lower weighted rate.

Deduction example

A request with 100,000 input tokens, 20,000 output tokens, and 50,000 cached read tokens deducts 125,000 credits, equal to $0.1250.

Formula: input credits + output credits + cached read credits x 0.1.

Billing rules

Each API request has one request id and can be billed at most once.
Balance is checked before the upstream request. Insufficient balance returns 402 and no upstream request is sent.
Successful requests are billed from actual input, output, and cached read token usage when upstream usage is available.
Upstream failures before a usable response are not billed. Partial successful streamed responses can be billed by reported usage.
Retries and abnormal traffic are logged; repeated failures or high retry rates may be rate-limited or reviewed.

OpenAI-compatible API surface

Unified wallet and usage reconciliation

PayPal unavailable; USDT backup payment available