Root Cause Analysis — Service Degradation, March 3, 2026
Duration: Approximately 1 hour and 45 minutes of degraded service (8:15 AM – 10:00 AM AST), including ~26 minutes of confirmed downtime.
Impact: Users experienced intermittent errors and slow response times across the platform during the affected window.
What Happened
During a routine production deployment, a database connection pooling configuration that was appropriate for our previous architecture caused connection saturation und...