Us2 Outage
Incident Report for QLess
Postmortem

We would like to provide additional detail surrounding the downtime which occurred on 3/02/2021

What happened?
At 8:07am PST, the US2 server environment experienced downtime which resulted in all QLess applications becoming temporarily inaccessible on the server.
Duration: 21 minutes.

Cause
As a result of a regular “cache clear" action, QLess service became unresponsive as routinely running threads were blocked from execution.

Remediation
Upon receiving a monitoring alert notification QLess engineers restarted the service.

Prevention
QLess has initiated a complete refactoring of the affected application. The new service application will be more fault-tolerant, available and self-healing.

Posted Mar 02, 2021 - 14:49 PST

Resolved
This incident has been resolved.
Posted Mar 02, 2021 - 11:24 PST
Update
Qless services are now operational
Posted Mar 02, 2021 - 08:31 PST
Investigating
We are currently investigating this issue.
Posted Mar 02, 2021 - 08:22 PST
This incident affected: US2.