Study Guide980 words

Mastering AWS Service Quotas and Resource Constraints

Service quotas and limits

Mastering AWS Service Quotas and Resource Constraints

This study guide explores the mechanisms AWS uses to protect infrastructure and accounts from over-provisioning and abuse. Understanding these limits is critical for designing reliable, scalable architectures at the Professional level.

Learning Objectives

After studying this guide, you should be able to:

  • Differentiate between service quotas (soft limits) and hard limits.
  • Identify environmental constraints that impact workload performance.
  • Describe the tools used to monitor resource usage (Trusted Advisor, Service Quotas API).
  • Explain the architectural impact of hitting a hard limit versus a soft limit.
  • Navigate the process for requesting quota increases.

Key Terms & Glossary

  • Service Quota: A default, pre-defined value for a resource or action in an AWS account (e.g., number of VPCs per Region).
  • Soft Limit: An adjustable service quota that can be increased via a support request or the Service Quotas console.
  • Hard Limit: A fixed service quota that cannot be changed under any circumstances; requires architectural re-design if reached.
  • Environmental Constraint: Physical limitations of the underlying infrastructure, such as network bandwidth or hardware disk throughput.
  • Trusted Advisor: An AWS service that provides real-time guidance to help you provision resources following AWS best practices, including monitoring service limits.

The "Big Idea"

AWS Service Quotas act as guardrails, not just for AWS's protection, but for yours. They prevent "runaway" resource provisioning (like a bug spawning 10,000 EC2 instances) that could lead to catastrophic billing surprises. However, for a Solutions Architect, these guardrails define the boundaries of the sandbox. High-scale designs must be built with these boundaries in mind, ensuring that the architecture can either scale within those limits or be distributed across multiple accounts and regions to bypass them.

Formula / Concept Box

Metric / ToolDetailKey Thresholds
Trusted Advisor AlertMonitoring against service limits80% (Yellow), 100% (Red)
Scope of QuotasUsually per Account per RegionVaries by service
Request MethodService Quotas Console or Support CaseN/A
Refresh RateLimit/Usage data latencyUp to 24 hours

Hierarchical Outline

  • I. Resource Constraints Overview
    • A. Service Quotas (Account-level protections)
    • B. Environmental Constraints (Physical infrastructure limits)
  • II. Service Quotas Deep Dive
    • A. Soft Limits (Adjustable)
      • Example: Number of VPCs per Region (Default: 5).
      • Increase via Service Quotas API/Console.
    • B. Hard Limits (Non-adjustable)
      • Example: Number of IPv6 CIDR blocks per VPC (Fixed at 1).
      • Requires re-design if exceeded.
  • III. Monitoring and Management
    • A. Service Quotas Console (View and request increases).
    • B. AWS Trusted Advisor (Proactive alerting at 80/100% utilization).
    • C. AWS Organizations (Centralized reporting across multiple accounts).
  • IV. Architectural Strategy
    • A. Mitigation of Hard Limits (Horizontal scaling, multi-account strategies).
    • B. Proactive Monitoring (Integration with CloudWatch/EventBridge).

Visual Anchors

Quota Increase Workflow

Loading Diagram...

Utilization Curve vs. Limit

\begin{tikzpicture} % Axis \draw[->] (0,0) -- (6,0) node[right] {Time}; \draw[->] (0,0) -- (0,4) node[above] {Usage};

% Hard Limit Line \draw[red, dashed, thick] (0,3.5) -- (5.5,3.5) node[right] {Hard Limit};

% Soft Limit Line \draw[blue, thick] (0,2.5) -- (5.5,2.5) node[right] {Soft Limit};

% Usage Curve \draw[thick, black] (0,0.5) .. controls (2,0.6) and (3,2.2) .. (5,3.2); \node at (4.5, 1.5) {Workload Growth};

% Markers \filldraw[orange] (3.4, 2.5) circle (2pt) node[above left] {Request Increase}; \end{tikzpicture}

Definition-Example Pairs

  • Soft Limit
    • Definition: A quota that can be increased upon request to AWS support.
    • Example: Increasing the maximum number of Running On-Demand Standard instances from the default to 100 to support a seasonal traffic spike.
  • Hard Limit
    • Definition: A fixed constraint imposed by AWS that cannot be changed regardless of the support plan.
    • Example: A VPC is limited to exactly one primary IPv6 CIDR block. If you need more, you must create a separate VPC.
  • Environmental Constraint
    • Definition: A limitation based on the physical reality of the hardware or networking infrastructure.
    • Example: The 10 Gbps maximum bandwidth of a single Direct Connect connection. To get more, you must aggregate multiple connections (LAG).

Worked Examples

Scenario 1: The VPC Expansion

Problem: A company is migrating 100 legacy applications into a single AWS account in us-east-1. They realize they need 10 VPCs, but the default quota is 5. Solution:

  1. Check the Service Quotas console to confirm the limit is "Adjustable".
  2. Submit a quota increase request for 10 VPCs.
  3. Since this is a soft limit, AWS typically approves it within hours/days.

Scenario 2: The Hard Limit Wall

Problem: A developer wants to associate 3 IPv6 CIDR blocks with a single VPC to segregate traffic for different business units. Solution:

  1. Review documentation; find that 1 IPv6 CIDR per VPC is a hard limit.
  2. Re-design: Instead of CIDR segregation, use multiple VPCs connected via Transit Gateway, or use Subnet-level tagging and Security Groups to achieve logical isolation within the single allowed CIDR.

Checkpoint Questions

  1. What is the default percentage threshold at which Trusted Advisor turns "Yellow" for a service limit check?
  2. True or False: All service quotas are scoped at the Global level.
  3. If you reach a hard limit, what is the only viable path forward for your architecture?
  4. Which tool allows you to see quota usage across an entire AWS Organization from a central location?
Click to see answers
  1. 80%
  2. False (Most are per-Region).
  3. Architectural re-design (e.g., using multiple accounts or different services).
  4. Trusted Advisor (with AWS Organizations integration).

Muddy Points & Cross-Refs

  • Confusion between Quotas and Throttling: Quotas usually refer to resource counts (how many S3 buckets), while Throttling refers to API rate limits (how many PutObject calls per second). Both are limits, but managed differently.
  • Support Plan Dependency: Note that access to the Support API (for automated quota checks) requires a Business or Enterprise support plan.
  • Cross-Ref: See Chapter 3 (Design a Multi-Account AWS Environment) to understand how to bypass account-level quotas using AWS Organizations.

Comparison Tables

FeatureService QuotasEnvironmental Constraints
SourceAdministrative/Policy settingsPhysical hardware/Networking
FlexibilityOften adjustable (Soft limits)Generally fixed by physics/spec
MonitoringTrusted Advisor / Quotas ConsoleCloudWatch Metrics (e.g., Throughput)
ExampleInstances per RegionDisk IOPS / Network Bandwidth
Quota TypeCan be increased?Resolution Strategy
Soft LimitYesRequest via Console/API
Hard LimitNoArchitectural Re-design

Ready to study AWS Certified Solutions Architect - Professional (SAP-C02)?

Practice tests, flashcards, and all study notes — free, no sign-up needed.

Start Studying — Free