AWS Outage - What happened?

AWS

On Monday, Oct. 20, 2025, Amazon Web Services (AWS) experienced a large-scale outage that knocked out large portions of the Internet. Below is a technically based overview of what happened and what this means for organizations.

Timeline & core data

  • The outage reportedly began around 07:11 UTC (08:11 Dutch time) on Oct. 20. 
  • AWS indicated that the starting point was in the US-EAST-1 (Virginia) region of their data centers. 
  • The immediate cause: an error in the Domain Name System (DNS) resolution of a key API endpoint of the database service Amazon DynamoDB. In other words, the API servers could not be found because the Internet's ‘phonebook’ function (DNS) failed. 
  • That DNS error made specific AWS services inaccessible, leading to increased error rates and delays in multiple other AWS components. 
  • During the morning/afternoon, AWS indicated that the core issues were “fully resolved” but that there were still systems with backlogs or residual processing, such as AWS Lambda. 
  • The outage affected hundreds to thousands other services and applications worldwide that depend on AWS infrastructure. 

Why this is relevant to our customers

As a customer of Analyst ICT, it is important to understand how such an outage can impact your organization - even though your infrastructure may not be running directly on AWS. At Analyst ICT, we don't build infrastructures on AWS but many software parties do.

Dependencies in the chain

Many modern applications, websites, back-end systems and cloud services are built on (or use) AWS components: storage (S3), compute (EC2, Lambda), databases (DynamoDB, RDS), network, DNS, and so on. When one component fails (as here, the DNS to DynamoDB), the chain reactions cause:

  • Proprietary systems running directly on AWS can fail or become very slow.
  • Third parties (externally delivered apps, SaaS services, links) may fail because they depend on AWS components.
  • You notice not only loss of functionality, but also delays, errors, inaccessibility.
  • Residual effects: buffers, queues, backlogs (e.g., in messages, events) take time to eliminate.

Infrastructure-level risks

  • Concentration risk: one large cloud provider (such as AWS) dominates much of the infrastructure. This makes downtime tangible for many organizations at once. 
  • Chain reactions: a technical error (such as DNS) in the foundation creates effects in upper layers.
  • SLAs and recovery: even if AWS says “recovered,” there may be residual issues. The accompanying recovery process is often slower than the core fix. 

What can organizations do?

Given what has happened, it is wise to consider how you (or Analyst ICT on your behalf) have set up your infrastructure and dependencies. Here are some points of interest:

  1. Mapping dependencies
    • What systems are running on AWS services? Are they critical to your business?
    • What external services or SaaS products do you use that run on AWS?
    • Are there single points of failure through one cloud provider or region?
  2. Resilience and fallback strategies
    • Considering multi-region or multi-cloud approach: if region US-EAST-1 fails, do we have alternatives?
    • For critical services: timely backups, redundancy, fail-over mechanisms.
    • Monitoring for external dependencies: knowing when a third-party service (hosted on AWS) gets problems.
  3. Capital at sight: communication & recovery plan
    • During outages, prompt communication is important - internally and externally (customers).
    • Recovery plan: not just fixes, but aftercare (such as clearing backlogs, emptying queues).
    • Customer service ready: provide clarity, answer questions.
  4. Evaluation and lessons learned
    • Analyze what exactly the impact was on your organization due to this outage.
    • Ask questions, “What went wrong?”, “How quickly did we notice?”, “Which systems had problems?”, “How quickly was recovery noticeable?”
    • Use the outages as input for architecture and process improvement.

The outage at AWS on Oct. 20, 2025 is a harsh reminder that even the largest cloud provider is vulnerable. For organizations - including Analyst ICT's clients - this means above all: be prepared. It's not just about ‘will it ever happen,’ but about How we react when it happens. Structure, understanding dependencies and good plans make the difference.

Would you like us to work together your infrastructure and dependency map Walk through to see where you are vulnerable? We are happy to help with that.

Recent blogs

Macadmins Leiden
Blog
MacAdmins Meeting: What's relevant for your organization?
Last week, we attended the MacAdmins Meeting in Leiden. It's a gathering focused on Apple administration, security, and innovation. What stood out? Developments are moving fast. But more importantly: they are becoming increasingly relevant for SMEs. We'd like to share the key insights with you. What's happening? And what does that mean for your organization? Running AI Locally: Control Over Data and Costs AI is now everywhere. But one question remains central: where does your data reside? A significant topic during the meeting was running AI models (LLMs) locally. Instead of relying on external cloud platforms, more and more...
Microsoft AI
Blog
Microsoft and OpenAI break exclusivity: what does this mean for your IT strategy?
The collaboration between Microsoft and OpenAI – the powerhouse behind ChatGPT and Copilot, among others – is undergoing a fundamental change. Until now, Microsoft was the exclusive partner. But that is no longer the case. And this is more significant than it might seem. What has changed? Microsoft and OpenAI have restructured their partnership. The key changes: Exclusivity is gone OpenAI can now offer its technology through other cloud providers such as Amazon and Google Microsoft remains important, but no longer exclusively Azure remains the primary cloud partner, but not the only one The collaboration continues, but changes financially OpenAI will continue to make payments until 2030…
istorm egypt cairo
Blog
From Cupertino to Cairo: How Apple Connects the World
Sometimes things come together in a special way. Last week, we were in Egypt. A country with a rich history, impressive culture, and – perhaps less known – a fast-growing digital market. What immediately struck us: Egypt doesn't have an official Apple Store yet. And yet, Apple is clearly present there. Not through the well-known flagship stores, but through strong local partners. iStorm in Cairo One of those partners is iStorm, an Apple Premium Reseller with multiple branches in Egypt. And that's precisely where the personal connection was for us. During our visit to Cupertino earlier this year, we met people from...

A newsletter

Superlogic right?