Two-step demo: write a cached system prefix, then read it via cacheControl: ephemeral.
Step 1 — Warm cache
Write the stable 1,024+ token system prefix to the cache.
Step 2 — Reuse cache
Send the same prefix again; expect cache read tokens.
This agent runs twice with the same long system prompt. Step 1 writes the prefix to Anthropic's ephemeral cache; Step 2 reads it back so fewer input tokens are recomputed and latency drops.
Step 1 asks
Summarize the refund policy for annual plans in two short bullets.
Step 2 asks
What is the enterprise support SLA for severity-1 incidents? One sentence.