Offline and Disconnected Operation: Keeping Critical Workloads Running Without WAN
Not every edge site has a reliable internet connection, and even the ones that do will occasionally lose it. Offline and disconnected operation is the design requirement that critical edge workloads keep functioning correctly through that gap — not just surviving the outage, but continuing to do their job as if nothing happened.
Store-and-Forward: The Core Pattern
The most common technique for handling disconnection is store-and-forward: when the WAN link is down, data and events are written to a local queue instead of being discarded or blocked. Processing continues locally using that queued data. Once connectivity returns, the queue drains automatically, syncing everything that accumulated during the outage back to the cloud — without any manual intervention or data loss.
What Has to Keep Working Locally
For this pattern to be meaningful, the workloads that actually matter can’t depend on the cloud connection at all. Authentication, core business logic, safety checks, and local decision-making all need to run entirely on local infrastructure. Only non-critical functions — centralized reporting, cross-site analytics, remote configuration changes — should be allowed to simply wait until connectivity resumes.
Real-World Environments That Demand This
- Cruise ships and offshore platforms — often relying on expensive, low-bandwidth satellite links that can drop entirely in poor weather.
- Remote mining and agriculture sites — where cellular coverage may be weak, intermittent, or entirely absent.
- Rural retail locations — where a WAN outage can’t be allowed to stop point-of-sale transactions from processing.
In each case, the business impact of “computing stops when the network does” is unacceptable, so offline operation isn’t optional — it’s a hard requirement.
Current Trends
Conflict-free replicated data types (CRDTs) are gaining traction as the underlying mechanism for syncing data that was modified independently at multiple disconnected sites, resolving conflicts automatically once connectivity returns rather than requiring manual reconciliation. Low-earth-orbit satellite connectivity — Starlink and similar services — is also increasingly deployed as a resilient fallback WAN link for remote sites, reducing how often true offline operation is actually needed, even if the primary design must still assume it will happen.