Recharge credits for an OpenAI-compatible relay

NexRelay uses prepaid credits with separate input, output, and cached read token meters. This is an independent third-party gateway, not an official OpenAI service.

Starter

For prototypes, small tools, and light API usage.

$9recharge

9 USD credits added to wallet

Input allowance
3M tokens
Output allowance
600K tokens
Cached read allowance
6M tokens
Context
32K
Monthly cap
10M
Max output
1K
Rate limit
20 RPM
Concurrency
2
High-cost budget
$1.2/mo
Default route: gpt-5.2 -> gpt-5.3 -> gpt-5.4 -> gpt-5.5
  • OKAll GPT versions available with cost controls
  • OK32K max context and 1K max output tokens
  • OK2 concurrent requests, 20 RPM
  • OKGPT-5.5 limited to short, low-frequency requests

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

MOST POPULAR

Pro

For developers and teams running production workflows.

$29recharge

29 USD credits added to wallet

Input allowance
18M tokens
Output allowance
4M tokens
Cached read allowance
40M tokens
Context
128K
Monthly cap
62M
Max output
4K
Rate limit
120 RPM
Concurrency
10
High-cost budget
$8/mo
Default route: gpt-5.3 -> gpt-5.2 -> gpt-5.4 -> gpt-5.5
  • OKProduction routing across GPT-5.2 to GPT-5.5
  • OK128K max context and 4K max output tokens
  • OK10 concurrent requests, 120 RPM
  • OKBest default for predictable margin and growth

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

Business

For higher concurrency, longer context, and priority routing.

$99recharge

99 USD credits added to wallet

Input allowance
85M tokens
Output allowance
18M tokens
Cached read allowance
200M tokens
Context
256K
Monthly cap
303M
Max output
8K
Rate limit
600 RPM
Concurrency
50
High-cost budget
$35/mo
Default route: gpt-5.4 -> gpt-5.5 -> gpt-5.3 -> gpt-5.2
  • OKPriority routing for high-value workloads
  • OK256K max context and 8K max output tokens
  • OK50 concurrent requests, 600 RPM
  • OKHigher GPT-5.5 budget with strict caps

USDT backup payment

Manual/on-chain confirmation, no surprise billing.

Model access is controlled by routing priority, context, budgets, and API key policy.

Token meters

MeterUnit priceBilling note
Input tokens$1.00 / 1M creditsBilled from reported request usage.
Output tokens$1.00 / 1M creditsBilled from reported request usage.
Cached read tokens$0.10 / 1M creditsCache hits are billed at a lower weighted rate.

Deduction example

A request with 100,000 input tokens, 20,000 output tokens, and 50,000 cached read tokens deducts 125,000 credits, equal to $0.1250.

Formula: input credits + output credits + cached read credits x 0.1.

Billing rules

  • Each API request has one request id and can be billed at most once.
  • Balance is checked before the upstream request. Insufficient balance returns 402 and no upstream request is sent.
  • Successful requests are billed from actual input, output, and cached read token usage when upstream usage is available.
  • Upstream failures before a usable response are not billed. Partial successful streamed responses can be billed by reported usage.
  • Retries and abnormal traffic are logged; repeated failures or high retry rates may be rate-limited or reviewed.

OpenAI-compatible API surface

Unified wallet and usage reconciliation

PayPal unavailable; USDT backup payment available