Previous incidents
Intermittent DNS failures affecting some run executions
Resolved Jan 23, 2026 at 4:19am UTC
Full service has been restored. Task execution is back to normal. If you experienced failures between 01:37 and 04:19 UTC, those runs can be retried successfully now.
What happened: During a period of high activity, a backlog of completed runs ...
1 previous update
Dashboard runs list is delayed
Resolved Jan 21, 2026 at 5:32pm UTC
The root cause of the issue was a failed upgrade of our clickhouse service from 25.6 to 25.8. While the service failed to rollout, the rollback also failed and so our clickhouse service was split between some instances on 25.6 and 25.8, which caused various issues. Our clickhouse provider is continuing to investigate the failed release but we can confirm that the rollback has completed and queries are back to normal.
2 previous updates
Some schedules have stopped working
Resolved Jan 21, 2026 at 11:15am UTC
All schedules have been fully restored.
1 previous update
Issue with task logs
Resolved Jan 16, 2026 at 5:44pm UTC
We have finally been able to provision additional capacity and logs are working again. A full post-mortem will follow.
1 previous update
Dashboard runs list is delayed
Resolved Jan 2, 2026 at 9:16pm UTC
The runs list is now up to do and syncing live updates again.
4 previous updates
Batches are slow to process
Resolved Jan 1, 2026 at 3:10pm UTC
The new batch concurrency processing defaults have brought the processing queue down to zero
2 previous updates
Runs list is delayed
Resolved Dec 17, 2025 at 3:30pm UTC
Runs are now syncing live and the dashboard is back to normal.
1 previous update
Realtime streams v2 is degraded
Resolved Dec 17, 2025 at 8:16am UTC
Fix has been applied and realtime streams v2 is fully operational.
2 previous updates
Dashboard issues due to ClickHouse Cloud
Resolved Dec 16, 2025 at 10:42pm UTC
Operations have returned to normal, we're continuing to investigate the root cause and will provide more detail as we know more.
1 previous update
Dashboard is degraded
Resolved Dec 16, 2025 at 11:05am UTC
The issue in ClickHouse is now resolved. The dashboard is back to being fully operational.
The root cause was a faulty node in the ClickHouse cluster which we couldn't kill. We're speaking to the ClickHouse Cloud team to find out why it happened.
1 previous update
Deployments for new project are failing
Resolved Dec 5, 2025 at 6:52pm UTC
Deployment for new projects have been fully restored.
1 previous update
Runs list is lagging behind
Resolved Dec 4, 2025 at 3:35pm UTC
The runs list has all caught up and the dashboard is no longer displaying stale data. We're continuing to investigate the root cause of this issue
2 previous updates
Realtime delays
Resolved Dec 2, 2025 at 12:51am UTC
Realtime updates back to normal. We're monitoring recovery.
1 previous update
open telemetry logs and spans ingestion issues
Resolved Dec 2, 2025 at 11:18am UTC
We've published a full post-mortem on this incident here: https://trigger.dev/blog/clickhouse-too-many-parts-postmortem
3 previous updates
Dashboard unreliable as we work through clickhouse issues
Resolved Nov 28, 2025 at 7:54pm UTC
We have recovered the clickhouse instance and the dashboard is responsive again and serving queries and data ingestion is back online. There has been some loss of otel data during this downtime but we don't know the extent of it at this moment as we continue to recover and investigate.
1 previous update
Realtime is down
Resolved Nov 23, 2025 at 12:38pm UTC
Realtime is working again. ElectricSQL was only working for the first 10 seconds after the server came up. We updated some settings around the startup sequencing that means it is now working properly and are working with the Electric team to determine why this was happening.
1 previous update