1. Configure

Test endpoints you own or have permission to probe. Burst is capped at 10 requests, paced at 250-1000ms apart, and each request times out at 5s — this is a measurement, not a stress test.

URL or API endpoint

Burst size (requests)

3-10. Higher = better signal on rate-limit threshold.

Inter-request delay (ms)

250-2000ms. Lower = more agent-like cadence.

Rate-limit hygiene

—

Per-request results

Hygiene checks

Reading the score: 25 points each for (a) the response structure stays consistent under burst, (b) Retry-After appears when 429s fire, (c) X-RateLimit-* headers expose remaining quota, (d) the error body is structured JSON with retryable fields. 100 = an agent can recover gracefully without a meltdown.

What this tests

Why it matters for AI agents

Documented 2026 failure pattern: agents repeat-hammer rate-limited endpoints, miss Retry-After guidance, fail to fall back to a backup model, and burn auth cooldowns in 30-60 minute loops. Token-based rate limiting + structured 429 bodies + clear retry guidance is the modern fix.

Retry-After header

RFC 9110 §10.2.3. Either an HTTP-date or seconds-from-now. When a 429 fires without it, agents have to guess the wait time — usually with exponential backoff that's much slower than your actual reset window. A 30-second Retry-After is dramatically better than agent-side guessing.

X-RateLimit-* headers

Conventional (no IETF spec yet): X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Reset. Some hosts use RateLimit (no X-) per IETF Internet-Draft. Either flavor lets a well-behaved agent self-throttle BEFORE hitting the 429.

Structured error body

When a 4xx/5xx fires, the response body should be parseable JSON with a recognizable error structure (RFC 7807 Problem Details, or a vendor convention like {error: {code, message, retry_after}}). HTML error pages force the agent to scrape, which is where most retry-storm meltdowns start.

Disclaimer

This tool fires up to 10 requests at the URL you specify, paced at the delay you set. It is intended for probing endpoints you own or have permission to test. Hard caps prevent it from being used as a stress tester. If you operate the target, run this to characterize your 429 behavior; if you do not, ask first.

Agent Rate-Limit Probe