Good morning, there 👋
Sunday, April 5 · Starter Plan
Latency degradation on mistral-large-2
p99 latency spiked to 2,100ms — auto-failover redirecting to claude-3-5-sonnet
6
3 providers
4.2B
this month
$2,979
of $3,000 budget
1,847
running now
Token Usage
Billions of tokens · 30d
Cost by Model
This month · $2,979 total
Daily spend trend
Models
| Model | Status | RPM | p50 (ms) | p99 (ms) | Cost | |
|---|---|---|---|---|---|---|
llama-3.1-70b Meta / Groq | active | 3,840 | 85 | 160 | $210.40 | |
gpt-4o OpenAI | active | 2,840 | 312 | 680 | $1240.50 | |
claude-3-5-sonnet Anthropic | active | 1,920 | 278 | 520 | $890.20 | |
gemini-1.5-pro | active | 1,240 | 195 | 380 | $562.80 | |
phi-3-medium Microsoft | active | 980 | 68 | 120 | $95.60 | |
mistral-large-2 Mistral | degraded | 420 | 890 | 2100 | $180.10 |
Recent Activity
gpt-4o deployed to production edge
3 regions · v2.4.1
mistral-large-2 latency spike
p99 > 2000ms · auto-failover active
Agent workflow 'data-pipeline' completed
1,247 steps · 3.2M tokens
Monthly budget alert: 80% used
$2,352 of $3,000 budget
claude-3-5-sonnet auto-scaled
3 → 10 replicas · high traffic
Agent 'customer-support-v3' updated
New system prompt deployed