AWS Global Infrastructure and Distributed Computing

This guide explores how AWS provides a globally distributed platform to support high-performing, resilient, and low-latency applications through its physical infrastructure and specialized edge services.

Learning Objectives

After studying this guide, you should be able to:

Differentiate between Regions, Availability Zones (AZs), and Edge Locations.
Identify the core AWS services provided at the edge to reduce latency and improve security.
Explain the importance of decoupling in distributed systems using SQS and SNS.
Select appropriate scaling strategies (Horizontal vs. Vertical) for distributed workloads.
Understand how AWS Global Accelerator and Amazon CloudFront optimize global traffic.

Key Terms & Glossary

Region: A physical location in the world where AWS has multiple Availability Zones. Each region is geographically isolated.
Availability Zone (AZ): One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
Edge Location: Sites that AWS uses to cache content and reduce latency for end users, often used by CloudFront and Route 53.
Decoupling: An architectural approach where components of a system remain independent so that the failure or scaling of one does not directly impact the other.
Microservices: A design pattern where an application is built as a collection of small, independent services.

The "Big Idea"

[!IMPORTANT] Distributed computing on AWS is built on the principle of geographical abstraction. By moving logic and data closer to the user (via Edge Locations) and spreading workloads across isolated failure zones (Regions/AZs), applications achieve "The Cloud Trinity": Low Latency, High Availability, and Infinite Scalability.

Formula / Concept Box

Concept	Definition / Rule	Key Metric
Vertical Scaling	Increasing the "size" of a single resource (e.g., adding RAM to an EC2).	Instance Type (e.g., t3.micro to m5.large)
Horizontal Scaling	Increasing the "number" of resources (e.g., adding more EC2 instances).	Instance Count
RTO (Recovery Time Objective)	The maximum acceptable delay between the service failure and restoration.	Time (Minutes/Hours)
RPO (Recovery Point Objective)	The maximum acceptable amount of data loss measured in time.	Data Age (Seconds/Minutes)

Hierarchical Outline

AWS Physical Infrastructure
- Global Regions: Compliance, data sovereignty, and proximity.
- Availability Zones: Fault isolation and high availability within a region.
- Local Zones / Outposts: Extending AWS to specific geographic locations or on-premises.
Edge Networking & Performance
- Amazon CloudFront: Content Delivery Network (CDN) for caching static/dynamic content.
- AWS Global Accelerator: Uses the AWS private network to optimize the path to your application.
- Amazon Route 53: Scalable DNS and health checking.
Distributed System Patterns
- Asynchronous Messaging: Using Amazon SQS (queuing) and Amazon SNS (pub/sub).
- Orchestration: Managing containers with Amazon ECS or EKS.
- Serverless: Event-driven computing with AWS Lambda and Fargate.

Visual Anchors

AWS Infrastructure Hierarchy

Loading Diagram...

Request Flow: Edge vs. Origin

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

Definition-Example Pairs

Service Quotas: The maximum number of resources you can create in an AWS account.
- Example: An AWS account might have a default limit of 200 running On-Demand EC2 instances per region.
Read Replicas: A copy of a database used to offload read traffic from the primary instance.
- Example: A WordPress site experiences high traffic; the admin creates an RDS Read Replica so the database can handle more "view" requests without slowing down "write" requests.
Loose Coupling: Reducing the dependencies between components so they can scale independently.
- Example: A web server places a photo-processing job into an SQS Queue instead of sending it directly to a worker server. If the worker server crashes, the job stays in the queue until a new worker is ready.

Worked Examples

Problem: Global Low-Latency Video Streaming

Scenario: A company based in London wants to serve high-definition video to users in Tokyo and New York with minimal buffering.

Step-by-Step Breakdown:

Storage: Upload video files to an Amazon S3 bucket in the eu-west-2 (London) region.
Distribution: Create an Amazon CloudFront distribution. Set the S3 bucket as the "Origin."
Edge Delivery: When a user in Tokyo requests a video, CloudFront routes the request to the Edge Location in Tokyo.
Caching: If it's the first request, CloudFront fetches the file from London once and caches it in Tokyo. Subsequent Tokyo users get the file instantly from the local cache.
Security: Attach AWS WAF (Web Application Firewall) to the CloudFront distribution at the edge to block malicious SQL injection attacks before they reach the London origin.

Checkpoint Questions

Which AWS infrastructure component consists of one or more discrete data centers?
- Answer: Availability Zone (AZ).
If you need to send a message to multiple subscribers simultaneously (Fan-out pattern), which service should you use?
- Answer: Amazon SNS (Simple Notification Service).
What is the primary difference between AWS Global Accelerator and Amazon CloudFront?
- Answer: CloudFront caches content at the edge (great for images/video), while Global Accelerator optimizes the network path using Anycast IPs (great for non-HTTP traffic like gaming or VoIP).
True or False: Availability Zones are connected to each other via low-latency, private fiber-optic networking.
- Answer: True.

Comparison Tables

Feature	Regions	Availability Zones	Edge Locations
Primary Goal	Compliance & Proximity	Fault Tolerance	Low Latency Delivery
Isolation	Completely independent	Power/Network isolated	Shared global network
Services Hosted	Most AWS Services	EC2, RDS, VPC Subnets	CloudFront, Route 53, WAF
Relationship	Contains 3+ AZs	Contained in a Region	Outside Regions (Near users)

AWS Global Infrastructure and Distributed Computing

Learning Objectives

After studying this guide, you should be able to:

Differentiate between Regions, Availability Zones (AZs), and Edge Locations.
Identify the core AWS services provided at the edge to reduce latency and improve security.
Explain the importance of decoupling in distributed systems using SQS and SNS.
Select appropriate scaling strategies (Horizontal vs. Vertical) for distributed workloads.
Understand how AWS Global Accelerator and Amazon CloudFront optimize global traffic.

Key Terms & Glossary

Region: A physical location in the world where AWS has multiple Availability Zones. Each region is geographically isolated.
Availability Zone (AZ): One or more discrete data centers with redundant power, networking, and connectivity in an AWS Region.
Edge Location: Sites that AWS uses to cache content and reduce latency for end users, often used by CloudFront and Route 53.
Decoupling: An architectural approach where components of a system remain independent so that the failure or scaling of one does not directly impact the other.
Microservices: A design pattern where an application is built as a collection of small, independent services.

The "Big Idea"

[!IMPORTANT] Distributed computing on AWS is built on the principle of geographical abstraction. By moving logic and data closer to the user (via Edge Locations) and spreading workloads across isolated failure zones (Regions/AZs), applications achieve "The Cloud Trinity": Low Latency, High Availability, and Infinite Scalability.

Formula / Concept Box

Concept	Definition / Rule	Key Metric
Vertical Scaling	Increasing the "size" of a single resource (e.g., adding RAM to an EC2).	Instance Type (e.g., t3.micro to m5.large)
Horizontal Scaling	Increasing the "number" of resources (e.g., adding more EC2 instances).	Instance Count
RTO (Recovery Time Objective)	The maximum acceptable delay between the service failure and restoration.	Time (Minutes/Hours)
RPO (Recovery Point Objective)	The maximum acceptable amount of data loss measured in time.	Data Age (Seconds/Minutes)

Hierarchical Outline

AWS Physical Infrastructure
- Global Regions: Compliance, data sovereignty, and proximity.
- Availability Zones: Fault isolation and high availability within a region.
- Local Zones / Outposts: Extending AWS to specific geographic locations or on-premises.
Edge Networking & Performance
- Amazon CloudFront: Content Delivery Network (CDN) for caching static/dynamic content.
- AWS Global Accelerator: Uses the AWS private network to optimize the path to your application.
- Amazon Route 53: Scalable DNS and health checking.
Distributed System Patterns
- Asynchronous Messaging: Using Amazon SQS (queuing) and Amazon SNS (pub/sub).
- Orchestration: Managing containers with Amazon ECS or EKS.
- Serverless: Event-driven computing with AWS Lambda and Fargate.

Visual Anchors

AWS Infrastructure Hierarchy

Loading Diagram...

Request Flow: Edge vs. Origin

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

Definition-Example Pairs

Service Quotas: The maximum number of resources you can create in an AWS account.
- Example: An AWS account might have a default limit of 200 running On-Demand EC2 instances per region.
Read Replicas: A copy of a database used to offload read traffic from the primary instance.
- Example: A WordPress site experiences high traffic; the admin creates an RDS Read Replica so the database can handle more "view" requests without slowing down "write" requests.
Loose Coupling: Reducing the dependencies between components so they can scale independently.
- Example: A web server places a photo-processing job into an SQS Queue instead of sending it directly to a worker server. If the worker server crashes, the job stays in the queue until a new worker is ready.

Worked Examples

Problem: Global Low-Latency Video Streaming

Scenario: A company based in London wants to serve high-definition video to users in Tokyo and New York with minimal buffering.

Step-by-Step Breakdown:

Storage: Upload video files to an Amazon S3 bucket in the eu-west-2 (London) region.
Distribution: Create an Amazon CloudFront distribution. Set the S3 bucket as the "Origin."
Edge Delivery: When a user in Tokyo requests a video, CloudFront routes the request to the Edge Location in Tokyo.
Caching: If it's the first request, CloudFront fetches the file from London once and caches it in Tokyo. Subsequent Tokyo users get the file instantly from the local cache.
Security: Attach AWS WAF (Web Application Firewall) to the CloudFront distribution at the edge to block malicious SQL injection attacks before they reach the London origin.

Checkpoint Questions

Which AWS infrastructure component consists of one or more discrete data centers?
- Answer: Availability Zone (AZ).
If you need to send a message to multiple subscribers simultaneously (Fan-out pattern), which service should you use?
- Answer: Amazon SNS (Simple Notification Service).
What is the primary difference between AWS Global Accelerator and Amazon CloudFront?
- Answer: CloudFront caches content at the edge (great for images/video), while Global Accelerator optimizes the network path using Anycast IPs (great for non-HTTP traffic like gaming or VoIP).
True or False: Availability Zones are connected to each other via low-latency, private fiber-optic networking.
- Answer: True.

Comparison Tables

Feature	Regions	Availability Zones	Edge Locations
Primary Goal	Compliance & Proximity	Fault Tolerance	Low Latency Delivery
Isolation	Completely independent	Power/Network isolated	Shared global network
Services Hosted	Most AWS Services	EC2, RDS, VPC Subnets	CloudFront, Route 53, WAF
Relationship	Contains 3+ AZs	Contained in a Region	Outside Regions (Near users)

AWS Global Infrastructure and Distributed Computing Study Guide

AWS Global Infrastructure and Distributed Computing

Learning Objectives

Key Terms & Glossary

The "Big Idea"

Formula / Concept Box

Hierarchical Outline

Visual Anchors

AWS Infrastructure Hierarchy

Request Flow: Edge vs. Origin

Definition-Example Pairs

Worked Examples

Problem: Global Low-Latency Video Streaming

Checkpoint Questions

Comparison Tables

AWS Global Infrastructure and Distributed Computing Study Guide

AWS Global Infrastructure and Distributed Computing

Learning Objectives

Key Terms & Glossary

The "Big Idea"

Formula / Concept Box

Hierarchical Outline

Visual Anchors

AWS Infrastructure Hierarchy

Request Flow: Edge vs. Origin

Definition-Example Pairs

Worked Examples

Problem: Global Low-Latency Video Streaming

Checkpoint Questions

Comparison Tables