Coveralls is in Read Only mode while we work on updating the system. Sorry for the inconvenience.

VIGILANCE! Check this page any time you notice a problem with coveralls

More reports of “Website under heavy load”

Incident Report for Coveralls

Postmortem

We believe this issue has been resolved for now.

The underlying cause still appears to be large spikes in incoming Web traffic from other outlier repositories that we have not yet identified or not yet paused.

Interim Solution:

To reduce the risk of recurrence, we have applied temporary load balancer adjustments that change how requests are distributed, which should lower—if not eliminate—the frequency of 503This website is under heavy loaderrors.

Permanent Solution:

We are also designing a permanent solution to rate-limit abnormal request patterns. This will require coordination at the policy/SLA level before it can be fully implemented.

In the meantime, we will continue to closely monitor traffic and use targeted load balancer and web server configurations to mitigate the impact of outlier traffic spikes.

More details:

For a more detailed assessment / RCA of this incident and its recent, related incidents, see this postmortem.

Update (Thu, Aug 21):

We have identified a different permanent solution, which does not entail changes to SLA-level details for “outlier repos.” We may still implement such a solution, but our alternative solution should avoid further 503 errors and be implemented in the next 48-72 hrs.

Posted Aug 20, 2025 - 15:22 PDT

Resolved

We have implemented an intermediate solution and believe this issue has been resolved for now. To fully resolve the root cause, we will need to implement a more long-term solution, which is currently in design. (Please read postmortem for more information.)

In the meantime, we will be doing our best to monitor for spikes in traffic from outlier repos and manually respond where our intermediate solution may not mitigate as much as we hope it will.
Posted Aug 20, 2025 - 15:21 PDT

Monitoring

A fix has been implemented and we are monitoring the results.
Posted Aug 20, 2025 - 12:08 PDT

Identified

The issue has been identified and a fix is being implemented.
Posted Aug 20, 2025 - 10:11 PDT

Update

We are continuing to investigate this issue.
Posted Aug 20, 2025 - 10:11 PDT

Investigating

We are currently investigating this issue.
Posted Aug 20, 2025 - 08:48 PDT
This incident affected: Coveralls.io Web and Coveralls.io API.