
Classification · Routing · Moderation · Extraction · Triage
Free first evolution. No credit card. Your prompt stays yours.
Paste your production prompt and upload labeled examples. No data? No problem. Give us 5 examples and we'll generate 50 for you.
We spawn 15 candidate prompts, score each against your data, and breed the top performers across up to 25 generations. Watch it happen live.
Get the winning prompt with before/after accuracy and 95% CI. Copy it into your production system. Yours to deploy anywhere.
A human-written starting prompt scored 68% on 110 real Capterra app reviews (customer-action routing). The Cambrian-evolved version hit 87% on the same data. Same model. Bred smarter through generations of selection pressure.
Across our benchmark tasks, lifts range from +6% to +34% depending on task difficulty. Average lift: +19 absolute points.
See all 11 case studies →Evolved prompts are plain text. Paste them into Claude, GPT, Gemini, Llama, or any future model. The structural improvements (cascading rules, label definitions, tie-breakers) transfer across providers.
No vendor lock-in. Switch providers any time, your prompt travels with you.
Customer language drifts. Edge cases emerge. Model providers ship new versions. A prompt that hit 89% in Q1 can quietly slip to 80% by Q3. Same prompt, different reality.
Top teams re-evolve every 1-3 months to recapture the last 10-20% of lift portability leaves on the table.
Every LLM provider has a “make this prompt better” button. Here’s what they don’t do.
| Built-in tools | Cambrian Lab | |
|---|---|---|
| Tests on your real data | No | Yes |
| Measurable before/after accuracy | No | Yes, with 95% CI |
| Tests multiple candidates | 1 suggestion | 15 per generation |
| Iterative improvement | One pass | Up to 25 generations |
| Works across providers | Locked in | Any LLM |
| Typical accuracy gain | Marginal | +19 pts average |
Every evolution runs the full 15-agent pipeline across up to 25 generations on your data.
All plans: bring your own LLM API key. Inference runs on your account, we never see or store it. Most evolutions complete in 10-15 minutes and use roughly the equivalent of a small batch evaluation, not continuous usage.
Annual billing saves 2 months. Custom enterprise plans available on request.
Your first evolution is free. Upload your prompt, drop in 20 labeled examples, and watch a measurably better prompt emerge in under 15 minutes.
Evolve my prompt →