AWS Global Infrastructure and Distributed Computing Study Guide
Distributed computing concepts supported by AWS global infrastructure and edge services
AWS Global Infrastructure and Distributed Computing
This guide explores how AWS provides a globally distributed platform to support high-performing, resilient, and low-latency applications through its physical infrastructure and specialized edge services.
Learning Objectives
After studying this guide, you should be able to:
- Differentiate between Regions, Availability Zones (AZs), and Edge Locations.
- Identify the core AWS services provided at the edge to reduce latency and improve security.
- Explain the importance of decoupling in distributed systems using SQS and SNS.
- Select appropriate scaling strategies (Horizontal vs. Vertical) for distributed workloads.
- Understand how AWS Global Accelerator and Amazon CloudFront optimize global traffic.
Key Terms & Glossary
- Region: A physical location in the world where AWS has multiple Availability Zones. Each region is geographically isolated.
- Availability Zone (AZ): One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
- Edge Location: Sites that AWS uses to cache content and reduce latency for end users, often used by CloudFront and Route 53.
- Decoupling: An architectural approach where components of a system remain independent so that the failure or scaling of one does not directly impact the other.
- Microservices: A design pattern where an application is built as a collection of small, independent services.
The "Big Idea"
[!IMPORTANT] Distributed computing on AWS is built on the principle of geographical abstraction. By moving logic and data closer to the user (via Edge Locations) and spreading workloads across isolated failure zones (Regions/AZs), applications achieve "The Cloud Trinity": Low Latency, High Availability, and Infinite Scalability.
Formula / Concept Box
| Concept | Definition / Rule | Key Metric |
|---|---|---|
| Vertical Scaling | Increasing the "size" of a single resource (e.g., adding RAM to an EC2). | Instance Type (e.g., t3.micro to m5.large) |
| Horizontal Scaling | Increasing the "number" of resources (e.g., adding more EC2 instances). | Instance Count |
| RTO (Recovery Time Objective) | The maximum acceptable delay between the service failure and restoration. | Time (Minutes/Hours) |
| RPO (Recovery Point Objective) | The maximum acceptable amount of data loss measured in time. | Data Age (Seconds/Minutes) |
Hierarchical Outline
- AWS Physical Infrastructure
- Global Regions: Compliance, data sovereignty, and proximity.
- Availability Zones: Fault isolation and high availability within a region.
- Local Zones / Outposts: Extending AWS to specific geographic locations or on-premises.
- Edge Networking & Performance
- Amazon CloudFront: Content Delivery Network (CDN) for caching static/dynamic content.
- AWS Global Accelerator: Uses the AWS private network to optimize the path to your application.
- Amazon Route 53: Scalable DNS and health checking.
- Distributed System Patterns
- Asynchronous Messaging: Using Amazon SQS (queuing) and Amazon SNS (pub/sub).
- Orchestration: Managing containers with Amazon ECS or EKS.
- Serverless: Event-driven computing with AWS Lambda and Fargate.
Visual Anchors
AWS Infrastructure Hierarchy
Request Flow: Edge vs. Origin
Definition-Example Pairs
- Service Quotas: The maximum number of resources you can create in an AWS account.
- Example: An AWS account might have a default limit of 200 running On-Demand EC2 instances per region.
- Read Replicas: A copy of a database used to offload read traffic from the primary instance.
- Example: A WordPress site experiences high traffic; the admin creates an RDS Read Replica so the database can handle more "view" requests without slowing down "write" requests.
- Loose Coupling: Reducing the dependencies between components so they can scale independently.
- Example: A web server places a photo-processing job into an SQS Queue instead of sending it directly to a worker server. If the worker server crashes, the job stays in the queue until a new worker is ready.
Worked Examples
Problem: Global Low-Latency Video Streaming
Scenario: A company based in London wants to serve high-definition video to users in Tokyo and New York with minimal buffering.
Step-by-Step Breakdown:
- Storage: Upload video files to an Amazon S3 bucket in the
eu-west-2(London) region. - Distribution: Create an Amazon CloudFront distribution. Set the S3 bucket as the "Origin."
- Edge Delivery: When a user in Tokyo requests a video, CloudFront routes the request to the Edge Location in Tokyo.
- Caching: If it's the first request, CloudFront fetches the file from London once and caches it in Tokyo. Subsequent Tokyo users get the file instantly from the local cache.
- Security: Attach AWS WAF (Web Application Firewall) to the CloudFront distribution at the edge to block malicious SQL injection attacks before they reach the London origin.
Checkpoint Questions
- Which AWS infrastructure component consists of one or more discrete data centers?
- Answer: Availability Zone (AZ).
- If you need to send a message to multiple subscribers simultaneously (Fan-out pattern), which service should you use?
- Answer: Amazon SNS (Simple Notification Service).
- What is the primary difference between AWS Global Accelerator and Amazon CloudFront?
- Answer: CloudFront caches content at the edge (great for images/video), while Global Accelerator optimizes the network path using Anycast IPs (great for non-HTTP traffic like gaming or VoIP).
- True or False: Availability Zones are connected to each other via low-latency, private fiber-optic networking.
- Answer: True.
Comparison Tables
| Feature | Regions | Availability Zones | Edge Locations |
|---|---|---|---|
| Primary Goal | Compliance & Proximity | Fault Tolerance | Low Latency Delivery |
| Isolation | Completely independent | Power/Network isolated | Shared global network |
| Services Hosted | Most AWS Services | EC2, RDS, VPC Subnets | CloudFront, Route 53, WAF |
| Relationship | Contains 3+ AZs | Contained in a Region | Outside Regions (Near users) |