SageMaker AI Security and Compliance

This guide covers the essential security architectures, compliance frameworks, and monitoring tools required to secure machine learning workflows within Amazon SageMaker AI, aligned with the AWS Certified Machine Learning Engineer – Associate (MLA-C01) exam.

Learning Objectives

After studying this material, you should be able to:

Implement Identity and Access Management (IAM) using the principle of least privilege for ML resources.
Configure Network Isolation using VPCs, security groups, and private subnets.
Manage Data Protection through encryption at rest (AWS KMS) and in transit (TLS).
Audit and Monitor ML Workflows using CloudWatch, CloudTrail, and EventBridge.
Navigate Compliance Frameworks such as HIPAA, PCI DSS, and ISO 27001 using AWS Artifact.

Key Terms & Glossary

Least Privilege: The practice of limiting access rights for users to the bare minimum permissions they need to perform their work.
AWS KMS (Key Management Service): A managed service that makes it easy for you to create and control the cryptographic keys used to encrypt your data.
TLS (Transport Layer Security): A cryptographic protocol designed to provide communications security over a computer network; used for encryption in transit.
VPC (Virtual Private Cloud): A logically isolated section of the AWS Cloud where you can launch AWS resources in a virtual network that you define.
AWS Artifact: A central resource for compliance-related information that provides on-demand access to AWS’ security and compliance reports.

The "Big Idea"

Security in SageMaker is not a final step but a holistic, integrated lifecycle. By applying a "Security-by-Design" strategy, organizations ensure that data is protected from ingestion through training to deployment. This is achieved through Defense in Depth, where multiple layers of security (Identity, Network, Encryption, and Auditing) work together to protect the integrity of the ML model and the sensitivity of the data.

Formula / Concept Box

Concept	Security Application	Key Mechanism
Identity	Who can access the training job?	IAM Roles & Policies
Infrastructure	Is the data moving over the public internet?	VPC Endpoints & Security Groups
Data Protection	Is the model artifact encrypted on S3?	AWS KMS (SSE-KMS)
Auditing	Who deleted the endpoint at 3:00 PM?	AWS CloudTrail
Monitoring	Is the model latency spiking?	Amazon CloudWatch Metrics

Hierarchical Outline

Identity and Access Management (IAM)
- Roles & Policies: Define granular permissions for users and SageMaker services.
- SageMaker Role Manager: Simplifies creating roles with pre-defined personas.
- MFA & SCPs: Multi-Factor Authentication and Service Control Policies for organizational-level guardrails.
Infrastructure Security
- VPC Isolation: Running SageMaker in a private VPC to prevent internet access.
- Security Groups: Acting as virtual firewalls to control inbound/outbound traffic to instances.
- Network ACLs: Subnet-level traffic control.
Data Protection
- Encryption at Rest: Using KMS keys (AWS Managed or BYOK) for S3 buckets and EBS volumes.
- Encryption in Transit: Mandatory TLS for communication between SageMaker and other services.
Logging and Auditing
- CloudWatch: Monitoring performance health and setting operational alarms.
- CloudTrail: Recording API calls for compliance auditing and incident investigation.

Visual Anchors

Security Flowchart

Loading Diagram...

Defense in Depth Layers

\begin{center}

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

\end{center}

Definition-Example Pairs

Compliance Framework: A set of guidelines and best practices for organizations to follow to meet regulatory requirements.
- Example: A healthcare company using SageMaker to process patient records must ensure the environment is HIPAA-compliant to protect sensitive health data (PHI).
BYOK (Bring Your Own Key): Allowing customers to use their own cryptographic keys for encryption rather than default AWS keys.
- Example: A financial institution requires exclusive control over key rotation and deletion for their model artifacts to satisfy PCI DSS requirements.

Worked Examples

Scenario: Securing an S3 Bucket for Training

Goal: Ensure a SageMaker training job can only read from a specific S3 bucket and that all data is encrypted.

Create a KMS Key: Generate a customer-managed key in AWS KMS.
IAM Policy: Create an execution role for SageMaker with the following permissions:
- s3:GetObject and s3:ListBucket limited to the specific bucket ARN.
- kms:Decrypt and kms:GenerateDataKey for the specific KMS Key ARN.
VPC Config: Ensure the training job is launched within a private VPC with an S3 Interface Endpoint to keep traffic off the public internet.

Checkpoint Questions

Which AWS service would you use to download a SOC 2 report for SageMaker?
What is the difference between encryption at rest and encryption in transit in SageMaker?
True or False: Security groups act as a firewall at the subnet level.
How does CloudTrail assist in a security post-mortem after an unauthorized configuration change?

▶Click to see answers

AWS Artifact.
Encryption at rest protects stored data (S3/EBS) using KMS; encryption in transit protects moving data using TLS.
False. Security groups are at the instance/resource level; Network ACLs are at the subnet level.
It provides a detailed record of which IAM user made the API call, from which IP address, and at what time.

Muddy Points & Cross-Refs

KMS Managed vs. Custom Keys: If you need to audit the use of the key or rotate it on your own schedule, use Customer Managed Keys. Default AWS Managed Keys are free but offer less control.
Security Groups vs. NACLs: Remember that Security Groups are stateful (if you allow inbound, outbound is automatically allowed), while Network ACLs are stateless (you must explicitly allow both ways).
Cross-Ref: For deeper network security, see the "VPC Endpoints and PrivateLink" chapter.

Comparison Tables

Monitoring vs. Auditing

Feature	Amazon CloudWatch	AWS CloudTrail
Primary Goal	Performance & Health	Governance & Compliance
Data Type	Metrics, Logs, Alarms	API Call History
Use Case	"Is my CPU usage too high?"	"Who deleted this notebook?"
Real-time?	Yes	Near real-time (usually < 15 mins)

Major Compliance Standards

Standard	Industry / Focus	Key Requirement for SageMaker
HIPAA	Healthcare	Protection of PHI (Patient Data)
PCI DSS	Finance / Payments	Security of Cardholder Data
ISO 27001	International General	Information Security Management Systems
FedRAMP	US Government	Stringent security for cloud services

SageMaker AI Security and Compliance

Learning Objectives

After studying this material, you should be able to:

Implement Identity and Access Management (IAM) using the principle of least privilege for ML resources.
Configure Network Isolation using VPCs, security groups, and private subnets.
Manage Data Protection through encryption at rest (AWS KMS) and in transit (TLS).
Audit and Monitor ML Workflows using CloudWatch, CloudTrail, and EventBridge.
Navigate Compliance Frameworks such as HIPAA, PCI DSS, and ISO 27001 using AWS Artifact.

Key Terms & Glossary

Least Privilege: The practice of limiting access rights for users to the bare minimum permissions they need to perform their work.
AWS KMS (Key Management Service): A managed service that makes it easy for you to create and control the cryptographic keys used to encrypt your data.
TLS (Transport Layer Security): A cryptographic protocol designed to provide communications security over a computer network; used for encryption in transit.
VPC (Virtual Private Cloud): A logically isolated section of the AWS Cloud where you can launch AWS resources in a virtual network that you define.
AWS Artifact: A central resource for compliance-related information that provides on-demand access to AWS’ security and compliance reports.

The "Big Idea"

Formula / Concept Box

Concept	Security Application	Key Mechanism
Identity	Who can access the training job?	IAM Roles & Policies
Infrastructure	Is the data moving over the public internet?	VPC Endpoints & Security Groups
Data Protection	Is the model artifact encrypted on S3?	AWS KMS (SSE-KMS)
Auditing	Who deleted the endpoint at 3:00 PM?	AWS CloudTrail
Monitoring	Is the model latency spiking?	Amazon CloudWatch Metrics

Hierarchical Outline

Identity and Access Management (IAM)
- Roles & Policies: Define granular permissions for users and SageMaker services.
- SageMaker Role Manager: Simplifies creating roles with pre-defined personas.
- MFA & SCPs: Multi-Factor Authentication and Service Control Policies for organizational-level guardrails.
Infrastructure Security
- VPC Isolation: Running SageMaker in a private VPC to prevent internet access.
- Security Groups: Acting as virtual firewalls to control inbound/outbound traffic to instances.
- Network ACLs: Subnet-level traffic control.
Data Protection
- Encryption at Rest: Using KMS keys (AWS Managed or BYOK) for S3 buckets and EBS volumes.
- Encryption in Transit: Mandatory TLS for communication between SageMaker and other services.
Logging and Auditing
- CloudWatch: Monitoring performance health and setting operational alarms.
- CloudTrail: Recording API calls for compliance auditing and incident investigation.

Visual Anchors

Security Flowchart

Loading Diagram...

Defense in Depth Layers

\begin{center}

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

\end{center}

Definition-Example Pairs

Compliance Framework: A set of guidelines and best practices for organizations to follow to meet regulatory requirements.
- Example: A healthcare company using SageMaker to process patient records must ensure the environment is HIPAA-compliant to protect sensitive health data (PHI).
BYOK (Bring Your Own Key): Allowing customers to use their own cryptographic keys for encryption rather than default AWS keys.
- Example: A financial institution requires exclusive control over key rotation and deletion for their model artifacts to satisfy PCI DSS requirements.

Worked Examples

Scenario: Securing an S3 Bucket for Training

Goal: Ensure a SageMaker training job can only read from a specific S3 bucket and that all data is encrypted.

Create a KMS Key: Generate a customer-managed key in AWS KMS.
IAM Policy: Create an execution role for SageMaker with the following permissions:
- s3:GetObject and s3:ListBucket limited to the specific bucket ARN.
- kms:Decrypt and kms:GenerateDataKey for the specific KMS Key ARN.
VPC Config: Ensure the training job is launched within a private VPC with an S3 Interface Endpoint to keep traffic off the public internet.

Checkpoint Questions

Which AWS service would you use to download a SOC 2 report for SageMaker?
What is the difference between encryption at rest and encryption in transit in SageMaker?
True or False: Security groups act as a firewall at the subnet level.
How does CloudTrail assist in a security post-mortem after an unauthorized configuration change?

▶Click to see answers

AWS Artifact.
Encryption at rest protects stored data (S3/EBS) using KMS; encryption in transit protects moving data using TLS.
False. Security groups are at the instance/resource level; Network ACLs are at the subnet level.
It provides a detailed record of which IAM user made the API call, from which IP address, and at what time.

Muddy Points & Cross-Refs

KMS Managed vs. Custom Keys: If you need to audit the use of the key or rotate it on your own schedule, use Customer Managed Keys. Default AWS Managed Keys are free but offer less control.
Security Groups vs. NACLs: Remember that Security Groups are stateful (if you allow inbound, outbound is automatically allowed), while Network ACLs are stateless (you must explicitly allow both ways).
Cross-Ref: For deeper network security, see the "VPC Endpoints and PrivateLink" chapter.

Comparison Tables

Monitoring vs. Auditing

Feature	Amazon CloudWatch	AWS CloudTrail
Primary Goal	Performance & Health	Governance & Compliance
Data Type	Metrics, Logs, Alarms	API Call History
Use Case	"Is my CPU usage too high?"	"Who deleted this notebook?"
Real-time?	Yes	Near real-time (usually < 15 mins)

Major Compliance Standards

Standard	Industry / Focus	Key Requirement for SageMaker
HIPAA	Healthcare	Protection of PHI (Patient Data)
PCI DSS	Finance / Payments	Security of Cardholder Data
ISO 27001	International General	Information Security Management Systems
FedRAMP	US Government	Stringent security for cloud services

SageMaker AI Security and Compliance: A Comprehensive Study Guide

SageMaker AI Security and Compliance

Learning Objectives

Key Terms & Glossary

The "Big Idea"

Formula / Concept Box

Hierarchical Outline

Visual Anchors

Security Flowchart

Defense in Depth Layers

Definition-Example Pairs

Worked Examples

Scenario: Securing an S3 Bucket for Training

Checkpoint Questions

Muddy Points & Cross-Refs

Comparison Tables

Monitoring vs. Auditing

Major Compliance Standards

SageMaker AI Security and Compliance: A Comprehensive Study Guide

SageMaker AI Security and Compliance

Learning Objectives

Key Terms & Glossary

The "Big Idea"

Formula / Concept Box

Hierarchical Outline

Visual Anchors

Security Flowchart

Defense in Depth Layers

Definition-Example Pairs

Worked Examples

Scenario: Securing an S3 Bucket for Training

Checkpoint Questions

Muddy Points & Cross-Refs

Comparison Tables

Monitoring vs. Auditing

Major Compliance Standards