Gateway Failure Policy

When the gateway cannot call Fairvisor (timeout, DNS, network), choose an explicit policy.

Decision matrix

PolicySecurity postureAvailability impactTypical use
Fail-closedStrongHigher risk of blocking good traffichigh-risk/regulated endpoints
Fail-openWeakerBetter uptime under control-plane failureslow-risk/public traffic

Timeout guidance

  • keep gateway->edge timeout low (typically 200-500 ms)
  • avoid unbounded auth call retries on hot path
  • log and metric every fallback decision
  • write/update/admin endpoints: fail-closed
  • read-only/public endpoints: fail-open (with alerting)

Operational requirement

Document the chosen policy per route and test both normal and failure paths in staging.