Study Guide948 words

Mastering AWS Data Transfer Costs: Architect's Study Guide

Data transfer costs

Mastering AWS Data Transfer Costs: Architect's Study Guide

Understanding data transfer costs is critical for the AWS Certified Solutions Architect - Professional exam. It is often the hidden "tax" that differentiates a technically sound architecture from a cost-optimized one.

Learning Objectives

After studying this guide, you should be able to:

  • Differentiate between Data Transfer In (DTI) and Data Transfer Out (DTO) pricing.
  • Compare the cost-efficiency of AWS Direct Connect (DX) versus VPN and Internet-based transfers.
  • Identify the cost factors associated with storage services (S3 and EBS).
  • Select the most cost-effective networking architecture for cross-region and hybrid-cloud data flows.

Key Terms & Glossary

  • Data Transfer Out (DTO): Data sent from AWS to the internet or to an on-premises location. This is almost always the primary driver of networking costs.
  • Direct Connect (DX): A dedicated network connection from on-premises to AWS that bypasses the public internet, offering consistent performance and lower DTO rates.
  • Port Hours: The hourly fee for maintaining a physical connection at a Direct Connect location.
  • NAT Gateway Processing: The per-GB fee charged for data passing through a NAT Gateway in addition to standard DTO charges.
  • S3 Lifecycle Policy: Rules that automatically transition data to cheaper storage tiers (e.g., S3 Glacier) to reduce storage and potentially retrieval costs.

The "Big Idea"

[!IMPORTANT] The Gravity of Data: In AWS, "Data Transfer In" is generally free, while "Data Transfer Out" is the primary expense. To optimize costs, architects must minimize the distance data travels and choose the "cheapest pipe." Data sent over Direct Connect (DX) is often an order of magnitude cheaper than data sent over the Internet or a VPN.

Formula / Concept Box

Service / ScenarioPricing LogicRule of Thumb
Data Transfer INFree (mostly)Inbound data from internet/other regions is $0/GB
DTO to InternetRegion-dependent tiered pricingExpensive; starts ~$0.09/GB in most regions
DTO over DXRegion + DX Location dependentCheapest; often < $0.02/GB depending on proximity
S3 StorageGB/Month + Requests + DTOLower storage tiers have higher retrieval costs
NAT GatewayHourly fee + Per GB processedAvoid for high-volume DTO; use VPC Endpoints instead

Hierarchical Outline

  • I. Hybrid Connectivity Costs
    • A. AWS Managed VPN
      • Paid per hour + standard DTO rates (same as Internet).
      • Best for low-volume, occasional usage.
    • B. AWS Direct Connect (DX)
      • Paid per port hour + discounted DTO rates.
      • Cost varies by AWS Region and DX Location proximity.
      • Higher resiliency requires multiple connections (2-4 ports).
  • II. Storage Data Transfer
    • A. Amazon EBS
      • Charges: Provisioned GB + Snapshot Storage (S3) + DTO from instance.
    • B. Amazon S3
      • Charges: Storage tier (Standard, IA, Glacier) + Request fees (PUT/GET) + DTO.
      • Retrieval Fees: Charged for IA and Glacier tiers.
  • III. Internal Networking
    • A. Inter-AZ Transfer: Costs apply to data moving between Availability Zones.
    • B. Cross-Region Transfer: Higher costs than Inter-AZ; depends on destination region.

Visual Anchors

Data Transfer Cost Hierarchy

Loading Diagram...

Direct Connect Resiliency vs. Cost

This diagram represents the physical layout required for maximum resiliency as mentioned in the source material.

\begin{tikzpicture} [node distance=2cm, box/.style={rectangle, draw, minimum width=2.5cm, minimum height=1cm, align=center}]

\node[box] (R1) {Region A}; \node[box, right of=R1, xshift=3cm] (R2) {Region B}; \node[box, below of=R1, yshift=-1cm] (DX1) {DX Location 1}; \node[box, below of=R2, yshift=-1cm] (DX2) {DX Location 2}; \node[box, below of=DX1, xshift=2.5cm, yshift=-1cm] (ONPREM) {On-Premises Center};

\draw[<->, thick] (R1) -- (DX1); \draw[<->, thick] (R1) -- (DX2); \draw[<->, thick] (R2) -- (DX1); \draw[<->, thick] (R2) -- (DX2); \draw[<->, dashed] (DX1) -- (ONPREM) node[midway, left] {Port 1+2}; \draw[<->, dashed] (DX2) -- (ONPREM) node[midway, right] {Port 3+4};

\node[draw=red, dashed, fit=(DX1) (DX2) (ONPREM), inner sep=10pt] (label) {}; \node[anchor=north] at (label.south) {Paying for 4 Port Hours for Max Resiliency}; \end{tikzpicture}

Definition-Example Pairs

  • Direct Connect Location Proximity: The physical distance between the AWS Region and the DX facility.
    • Example: Transferring data from an AWS region in Europe to a DX location in Europe is cheaper than transferring from an Asia region to that same European DX location.
  • S3 Retrieval Charges: A per-GB fee for accessing data in archival or infrequent access tiers.
    • Example: If you store 1TB in S3 Glacier Deep Archive, the storage is very cheap ($1/month), but pulling that 1TB out for a restore could cost $20+ in retrieval fees plus DTO.

Worked Examples

Scenario: Comparing VPN vs. Direct Connect

Problem: A company needs to move 50 TB of data per month from us-east-1 to their data center.

  • Option A (VPN): Monthly fee (~$36) + DTO ($0.09/GB).
  • Option B (DX): 1G Port Hour ($216/mo) + DTO ($0.02/GB).

Calculation:

  • VPN Cost: $50,000 GB \times 0.09 = $4,500 + $36 = $4,536$
  • DX Cost: $50,000 GB \times 0.02 = $1,000 + $216 = $1,216$

Result: Even with the base port fee, Direct Connect saves $3,320 per month in this scenario.

Checkpoint Questions

  1. True or False: Data transferred from the internet into an Amazon S3 bucket incurs DTI charges.
  2. Which is cheaper: Data Transfer Out over a VPN or Data Transfer Out over Direct Connect?
  3. What are the three primary cost factors for Amazon EBS volumes?
  4. Why might a company choose a VPN over Direct Connect initially?
Click to see answers
  1. False. Data Transfer In (DTI) from the internet is generally free.
  2. Direct Connect. It is significantly cheaper, sometimes by an order of magnitude.
  3. Provisioned Storage (GB), Snapshot Storage (S3), and Data Transfer Out.
  4. Simplicity and Low Volume. VPNs are easy to set up over existing broadband and have no upfront port costs, making them ideal for low-volume traffic.

Muddy Points & Cross-Refs

  • The NAT Gateway Trap: Many students forget that NAT Gateways charge for processing on top of DTO. If you have massive data flows to S3, use an S3 Gateway Endpoint (which is free) to avoid these processing charges.
  • Cross-Region Replication: In S3, if you enable Cross-Region Replication (CRR), you pay for the DTO from the source region AND the storage in the destination region.

Comparison Tables

FeatureInternet / VPNDirect Connect (DX)VPC Peering / TGW
Setup TimeMinutesWeeks/MonthsMinutes
ConsistencyVariable (Best effort)Highly ConsistentHigh
DTO CostHigh (~$0.09/GB)Lowest (~$0.02/GB)Medium (Regional rates)
SecurityEncrypted (VPN)Private (needs MACsec for encryption)Private (AWS Backbone)
Best ForStartups, Low VolumeEnterprise, Hybrid, Large MigrationInter-VPC Traffic

Ready to study AWS Certified Solutions Architect - Professional (SAP-C02)?

Practice tests, flashcards, and all study notes — free, no sign-up needed.

Start Studying — Free