Image gen, video gen, OCR, speech-to-text, text-to-speech, reasoning, code, tool calls — one key for every capability your agent needs.
from openai import OpenAI
# One Relay key. Every model.
client = OpenAI(
api_key="rly_...",
base_url="https://api.relay.ai5labs.com/v1",
)
# Reasoning
client.chat.completions.create(
model="claude-opus-4.8",
messages=[{"role": "user", "content": "..."}],
)
# Code
client.chat.completions.create(
model="claude-sonnet-4.6",
messages=[{"role": "user", "content": "..."}],
)
# Fast & cheap
client.chat.completions.create(
model="deepseek-v3",
messages=[{"role": "user", "content": "..."}],
)Okay — the user wants a refund on order #4523, the WH-1000XM6 headphones at $1,240. First, eligibility. The order shipped 6 days ago and was delivered yesterday. Our policy is 30 days from delivery, so they're comfortably within window. No restocking fee on a first return — that's clean. Pulling the call transcript: they said "the headphones never sounded right". Vague, but consistent with a defect claim, not buyer's remorse. No need to push back or offer a partial. Account history: signed up over a year ago. No prior refund flags, no chargebacks, no suspicious patterns. Three previous orders, all kept. Real customer, not someone gaming the policy. Cost check: $1,240 is under the $2,000 manager-approval threshold. I can process it directly without escalation. So: full refund to the original card, processed today. Email should be short and empathetic — acknowledge the disappointment, confirm the refund timing (5–7 business days), and offer a no-questions return shipping label. Don't speculate about what went wrong; let support follow up if they want a replacement.
{
"repo": "ai5labs/relay-sdk",
"title": "Add retry policy for 429s",
"labels": ["robustness", "p1"],
"body": "Provider 429s currently bubble..."
}Real features that ship with v0.2 — not roadmap promises.
Every response carries a cost and its source — never wrong by default.
Tool deltas keyed by index, thinking blocks preserved.
Pass a Pydantic model, get a validated object back.
Any MCP server, any model, one schema compiler.
Batch multiple requests behind one call, ~50% off.
Retry on a backup model when the primary fails.
Run it yourself with your own keys, or use the hosted Gateway for one bill across every model.
First-mover pricing and a direct engineering channel as we build out the inference tier.
engineering@ai5labs.comPreference for institutional funds with a sovereign-AI thesis, or strategic capital aligned with the IndiaAI Mission's compute deployment.
bryan@ai5labs.com