Previous incidents

June 2024
Jun 13, 2024
1 incident

v3 runs are paused due to network issues

Degraded

Resolved Jun 13 at 01:20pm BST

Runs are operating at full speed.

We think this issue was caused by the clean-up operations that clear completed pods. There are far more runs than a week ago, so the list of completed pods can get very large, putting strain on the system, including internal networking. We've increased the frequency of these clean-up operations and are monitoring the load, including networking. After 15 minutes everything looks normal.
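As an illustration only (the names and shapes here are hypothetical, not our actual cluster code), a clean-up pass like this is what keeps the completed-pod list small; running it more frequently bounds how large the list can grow between sweeps:

```python
from dataclasses import dataclass

@dataclass
class Pod:
    name: str
    phase: str  # e.g. "Running", "Succeeded"

def sweep_completed(pods: list[Pod]) -> tuple[list[Pod], list[Pod]]:
    """Split the pod list into (kept, deleted); completed pods are deleted."""
    deleted = [p for p in pods if p.phase == "Succeeded"]
    kept = [p for p in pods if p.phase != "Succeeded"]
    return kept, deleted

pods = [Pod("run-1", "Succeeded"), Pod("run-2", "Running"), Pod("run-3", "Succeeded")]
kept, deleted = sweep_completed(pods)
```

The more runs complete between sweeps, the longer the list each sweep has to scan and transfer, which is the strain described above.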

Jun 12, 2024
1 incident

v3 runs have stopped

Degraded

Resolved Jun 12 at 11:10pm BST

v3 runs are now executing again.

Networking was down because of an issue with BPF. While networking was down, tasks couldn't heartbeat back to the platform. If the platform doesn't receive a heartbeat every 2 minutes, a run is marked as failed. Fewer than 500 runs failed because of this.
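The heartbeat rule above can be sketched as a simple timeout check (a minimal illustration, not our platform code; the run IDs and timestamps are made up):

```python
HEARTBEAT_TIMEOUT_SECS = 120  # a run fails if no heartbeat arrives for 2 minutes

def runs_to_fail(last_heartbeat: dict[str, float], now: float) -> list[str]:
    """Run IDs whose most recent heartbeat is older than the timeout."""
    return [run_id for run_id, ts in last_heartbeat.items()
            if now - ts > HEARTBEAT_TIMEOUT_SECS]

now = 1_000.0
heartbeats = {"run-a": now - 30, "run-b": now - 300}  # run-b stopped heartbeating
```

When networking is down cluster-wide, every running task misses the window at once, which is why the failures clustered in this incident.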

You can filter the runs list by the "System Failure" status to find these runs, then bulk replay them: select all on the current page, move to the next page, and select all again. Use the bottom bar to replay the selection.

Jun 10, 2024
1 incident

v2 runs are slower than normal to start

Degraded

Resolved Jun 10 at 01:30pm BST

v2 p95 start times have been under 2s for 10 minutes, so we're resolving this incident.

We think this was caused by the large number of schedules that all send an event at midday UTC on Mondays. We're looking into what we can do about that.
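One common mitigation for this kind of thundering herd (an assumption on our part, not a committed fix) is deterministic per-schedule jitter, so schedules that nominally fire at the same instant are spread over a short window:

```python
import hashlib

JITTER_WINDOW_SECS = 300  # hypothetical: spread firing over a 5-minute window

def jitter_offset(schedule_id: str, window: int = JITTER_WINDOW_SECS) -> int:
    """Deterministic delay in [0, window): the same schedule always gets the
    same offset, but different schedules spread across the window."""
    digest = hashlib.sha256(schedule_id.encode()).digest()
    return int.from_bytes(digest[:4], "big") % window

offsets = {jitter_offset(f"schedule-{i}") for i in range(1000)}
```

Hashing the schedule ID (rather than picking a random delay each time) keeps each schedule's effective firing time stable from week to week.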

Jun 05, 2024
1 incident

v2 jobs are starting slowly

Degraded

Resolved Jun 05 at 07:35pm BST

Performance metrics for v2 are back to normal. v3 was unimpacted by this issue.

We have identified the underlying performance bottleneck and will publish a full retrospective on this. We can now handle far more v2 load than we could before.

Jun 03, 2024
1 incident

v2 jobs are queued

Degraded

Resolved Jun 03 at 09:00pm BST

Runs have been executing at normal speeds, and queues have been back to their normal size, for a couple of hours, so this incident is marked as resolved.

During this incident, queue times were longer than normal for v2 runs. We've made some minor changes as well as increasing capacity. We're also working on a larger change, which should ship in the next few days, that we think will stop very large spikes in v2 runs from causing these problems.

Jun 01, 2024
1 incident

v2 job backlog

Degraded

Resolved Jun 01 at 09:45pm BST

v2 jobs are now all caught up and processing normally.

May 2024
May 14, 2024
1 incident

The v3 cluster is slow to accept new v3 tasks

Degraded

Resolved May 14 at 08:00am BST

Runs are operating at normal speed again.

There were pods in our cluster in the RunContainerError state, which happens when a run isn’t heartbeating back to the platform. We’ve cleaned these up and are monitoring closely. We’re determining which tasks caused this and what we can do to prevent it from happening in the future.
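To illustrate the cleanup step (a sketch with made-up pod names, not our tooling; the waiting reason mirrors what Kubernetes reports in a container's status):

```python
# Hypothetical pod snapshots; "waiting_reason" stands in for the container
# status waiting reason that Kubernetes reports (e.g. "RunContainerError").
pods = [
    {"name": "task-1", "waiting_reason": None},
    {"name": "task-2", "waiting_reason": "RunContainerError"},
    {"name": "task-3", "waiting_reason": "ImagePullBackOff"},
]

def stuck_pods(pods: list[dict]) -> list[str]:
    """Names of pods stuck in RunContainerError, i.e. candidates for cleanup."""
    return [p["name"] for p in pods if p["waiting_reason"] == "RunContainerError"]
```

In practice the same filter can be expressed against the live cluster state; the point is that only pods in this specific error state were removed.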

May 09, 2024
1 incident

Queues and runs have been processing at good speeds now for several hours on ...

Resolved May 09 at 10:00pm BST

Before we get into what happened, I want to emphasise how important reliability is to us. We've fallen far short of providing you all with a great experience and a reliable service. We're really sorry for the problems this has caused.

All paying customers will get a full refund for the entirety of May.

What caused this?

This issue started with a huge number of queued events in v2. Our internal system that handles concurrency limits on v2 was slowing down the pro...

April 2024
No incidents reported