AWS Global Infrastructure: A Foundation for Resilient Architectures
AWS Global Infrastructure
AWS Global Infrastructure: A Foundation for Resilient Architectures
This study guide explores the physical and logical components that comprise the AWS Global Infrastructure, focusing on how to architect for high availability, low latency, and disaster recovery as required for the AWS Certified Solutions Architect - Professional (SAP-C02) exam.
Learning Objectives
- Distinguish between Regions, Availability Zones (AZs), Local Zones, and Edge Locations.
- Explain the concept of Fault Isolation and blast radius reduction.
- Identify the differences between Zonal and Regional AWS services.
- Compare global networking services like AWS Global Accelerator and Amazon CloudFront.
- Evaluate multi-Region strategies for Disaster Recovery (DR) and global data residency.
Key Terms & Glossary
- AWS Region: A physical location in the world where AWS clusters data centers. Each Region is geographically isolated.
- Availability Zone (AZ): One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
- Local Zone: An extension of an AWS Region that places compute, storage, and database services close to large population or industry centers.
- Edge Location: Data centers used by CloudFront and Global Accelerator to deliver content to end users with lower latency.
- Fault Isolation Boundary: A logical or physical segment (like an AZ or Region) designed to limit the impact of a failure.
- RTO (Recovery Time Objective): The maximum acceptable delay between the interruption of service and restoration.
- RPO (Recovery Point Objective): The maximum acceptable amount of data loss measured in time.
The "Big Idea"
[!IMPORTANT] The fundamental philosophy of AWS Global Infrastructure is "Design for Failure." By providing multiple layers of physical and logical isolation (AZs within Regions, Regions across continents), AWS enables architects to build systems where a single component failure—or even a localized natural disaster—does not result in total service downtime.
Formula / Concept Box
| Concept | Metric / Rule | Significance |
|---|---|---|
| AZ Distance | < 100 km (typically) | Ensures single-digit millisecond latency for synchronous replication. |
| ELB SLA | 99.99% Availability | Standard availability target for ALB, NLB, and CLB. |
| Multi-AZ Min. | At least 3 AZs | Standard AWS Region design to ensure quorum and high availability. |
| Blast Radius | Boundary Size | The goal is to minimize this through fault-isolated components. |
Hierarchical Outline
- AWS Global Infrastructure
- Physical Layer
- Regions (e.g., us-east-1, eu-west-1)
- Availability Zones (Interconnected via high-throughput fiber)
- Data Centers (The physical buildings containing servers)
- Edge Network
- Edge Locations (300+ worldwide)
- Regional Edge Caches (Middle-tier caching layer)
- Specialized Infrastructure
- Local Zones (Low latency for specific metro areas)
- Wavelength Zones (Embedded in 5G networks for mobile edge computing)
- Physical Layer
Visual Anchors
AWS Infrastructure Hierarchy
Regional Fault Isolation
\begin{tikzpicture} \draw[thick, dashed] (0,0) rectangle (6,4) node[above right] {AWS Region}; \draw[fill=blue!10] (0.5,0.5) rectangle (2,3.5) node[midway, align=center] {AZ A\ (Isolated Power)}; \draw[fill=blue!10] (2.2,0.5) rectangle (3.7,3.5) node[midway, align=center] {AZ B\ (Isolated Power)}; \draw[fill=blue!10] (3.9,0.5) rectangle (5.4,3.5) node[midway, align=center] {AZ C\ (Isolated Power)}; \draw[<->, thick, orange] (1.25,2) -- (3,2); \draw[<->, thick, orange] (3,2) -- (4.6,2); \node at (3,2.3) [orange] {Low Latency Fiber}; \end{tikzpicture}
Definition-Example Pairs
- Zonal Service: A service where resources are tied to a specific AZ.
- Example: An Amazon EC2 instance or an EBS Volume. If the AZ fails, the instance fails.
- Regional Service: A service that abstracts the AZs and operates across the entire Region automatically.
- Example: Amazon S3 or Amazon DynamoDB. They replicate data across multiple AZs by default.
- Global Service: A service with a single global endpoint that manages resources across all Regions.
- Example: AWS IAM or Amazon Route 53.
Worked Examples
Scenario: Global Low-Latency Trading Platform
Problem: A firm needs to provide a sub-50ms experience for users in New York and London while ensuring that if one Region goes down, the other can take over.
Solution Breakdown:
- Network Entry: Use AWS Global Accelerator. It provides two static Anycast IPs. It routes traffic over the AWS private backbone instead of the public internet.
- Traffic Routing: Use Route 53 with Latency-Based Routing. This ensures a user in London hits the
eu-west-1endpoint while a user in NY hitsus-east-1. - Database Replication: Deploy Amazon Aurora Global Database. This allows for a primary (writer) in
us-east-1and a secondary (reader) ineu-west-1with typical replication latency under 1 second. - Failover: If
us-east-1fails, Global Accelerator and Route 53 health checks detect the failure and reroute all traffic toeu-west-1. Aurora is promoted to primary.
Checkpoint Questions
- What is the primary difference between a Local Zone and an Edge Location?
- Why does AWS recommend a Multi-AZ strategy before a Multi-Region strategy for most workloads?
- Which service would you use to reduce latency for non-HTTP traffic globally?
- True or False: Amazon S3 stores data in only one Availability Zone by default.
▶Click to see answers
- Local Zones allow you to run compute/storage (EC2/RDS) closer to users, whereas Edge Locations are primarily for caching (CloudFront) and networking (Global Accelerator).
- Multi-AZ provides high availability with low latency and synchronous replication, whereas Multi-Region adds complexity, cost, and typically requires asynchronous replication (potential data loss).
- AWS Global Accelerator.
- False. S3 (Standard) replicates data across at least three AZs automatically.
Muddy Points & Cross-Refs
- CloudFront vs. Global Accelerator: Both use the Edge Network. Remember: CloudFront is for content caching (images, video, static/dynamic web content). Global Accelerator is for optimizing the network path to your application using Anycast IPs (good for gaming, IoT, or non-HTTP protocols).
- Regional vs. Zonal Fate: If you deploy an EC2 instance, you are responsible for its