AKey Registry
Prefix-only preview, label, scope, cap, status. Generate mints a real sl- key as soon as auth service is live.
| Label · Prefix | Scope | Created | Last used | Monthly cap | Status | Actions |
|---|
BPer-key Usage
Prompt + completion tokens split, dollar spend per rate card, call/error/rate-limit counts. Range tabs filter.
Per-model breakdown · this key
CLive test · the killer feature
Paste any sl- key, hit Test. We actually call api.slashed.pro and show the raw response. Quatarly hides this; we surface it.
1. Models discovery GET /v1/models
2. Chat completion · 1 token POST /v1/chat/completions
Deeper proof · 6 tests minting demo key…
No paste needed. Page mints an ephemeral sl-demo-XXX key (60-sec TTL, auto-renews every 50s, rate-gated). Every test below hits the real gateway — what you see is what your code gets.
03Model A/B/C race 3 × POST /v1/chat/completions · Promise.all
Same prompt, three frontier models, fired in parallel. Pick any model, same API surface, half the list price. The killer demo — drop us into your switch statement and route by latency, cost, or quality without writing three SDKs.
04Server-sent streaming POST /v1/chat/completions · stream:true
Same shape as OpenAI's data: {...}\\ndata: [DONE] wire format. Tokens land as they're generated — no buffering, no rebuffering. Real bytes through the SLASHED proxy.
05JSON mode · structured output response_format: json_object
Ask the model for a JSON object, get a JSON object. We pass response_format: {type: "json_object"} through to the upstream, then parse + validate before the bytes ever hit your code. If the model returns malformed JSON, we surface that loudly instead of silently retrying.
06Long context · large window ~100k chars · ~25k tokens
Paste a long doc, ask one question, prove the proxy handles full upstream context without truncation. Same envelope as a 200-byte ping — no special endpoint, no payload chunking, no special headers.
07Code generation · codex models gpt-5.3-codex
The gpt-5.3-codex code model, same chat-completions envelope. Pre-filled with a small Python task — proves you can use SLASHED for inline code-gen at half the OpenAI list price.
08Cleaning layer · raw vs clean diff debug-mode response
This is what you actually buy. Upstream models return inconsistent shapes — prompt-tax preamble ("Sure, here's…"), content: null when there's a tool call, missing usage on streaming, etc. We strip all of it on the wire so your code sees one canonical shape across all 11 frontier models.
waiting…
waiting…
DPer-key audit log
Last 100 calls. Timestamp, model, tokens, status, latency, response_id. Filter by status class. CSV / JSON export.
| Timestamp | Key | Model | Prompt tok | Compl tok | Status | Latency | Response ID |
|---|
ERate-limit dashboard
Token-bucket state per key. The exact X-RateLimit-* headers SLASHED returns. Burst tester to observe backoff.
FHealth check · per key
One-button validation. Runs /v1/models, /v1/chat/completions probe, and scope verification against the model registry.
What this surface exposes · vs Quatarly Per /vs-quatarly receipts · documented Quatarly behavior
SLASHED · /dashboard-key-plug
- ✓ Live test affordance hitting api.slashed.pro
- ✓ Raw JSON viewer for every response
- ✓ Prompt + completion token split per call
- ✓ Per-model cost breakdown per key
- ✓ Audit log w/ response_id, filterable, CSV+JSON export
- ✓ X-RateLimit-* headers surfaced verbatim
- ✓ Burst tester for backoff observation
- ✓ Scope verification against live registry
- ✓ Health check with truncated probe results
- ✓ Generation modal w/ scope + cap + expiry
Quatarly · documented dashboard
- · Key list view, no test affordance
- · Hidden raw JSON behind summary cards
- · Aggregated tokens only — no split
- · Monthly total only — no per-model break
- · No audit log surfaced to user
- · Rate-limit headers stripped upstream
- · No burst tester
- · No scope verification UI
- · Status page only, no per-key probe
- · Generation gated behind sales call