From September 9 21:45 UTC to September 11 00:30 UTC, a major incident occurred in Cloudflare's infrastructure affecting Adapty services availability. During the incident, a total of 3.3% of clients were affected, primarily in Poland, the United Kingdom, the United States, and Germany. The issue was related to incorrect traffic routing to Cloudflare data centers. The incident was not related to Adapty's internal infrastructure.
2. Timeline
| Time Stamp | Event |
|---|---|
| 9.09 19:40 UTC | Incident start, first alerts received |
| 9.09 19:40 UTC | Automated checks detected api.adapty.io unavailability |
| 9.09 19:41 UTC | Incident escalated to S1, on-call team engaged |
| 9.09 19:45 UTC | Retransmissions detected on front nodes, CF confirmed as source |
| 9.09 20:00 UTC | Diagnostics showed routing through London Heathrow (LHR) for problematic UK traffic |
| 9.09 20:41 UTC | Cache was disabled in the Cloudflare CDN. |
| 10.09 21:13 UTC | Cloudflare proxying was disabled. Traffic was routed directly to the origin. |
| 10.09 21:30 UTC | Issue partially resolved, reduction in 503 errors |
| 11.09 21:35 UTC | Full service restoration, incident end |
3. Three whys
4. Action items
We're sorry and are working hard to implement solutions to prevent this in the future.