Curriculum Overview: Core Generative AI Concepts (AWS AIF-C01)
This curriculum provides a foundational roadmap for mastering Generative AI (GenAI) within the context of the AWS Certified AI Practitioner (AIF-C01) certification. It bridges the gap between theoretical machine learning and practical application using AWS-managed services.
## Prerequisites
Before diving into Core GenAI Concepts, learners should have a firm grasp of the following fundamental AI/ML topics as outlined in Unit 1 of the AWS curriculum:
- The AI Hierarchy: Understanding that GenAI is a subset of Deep Learning, which is a subset of Machine Learning, which is a subset of Artificial Intelligence.
- Learning Paradigms: Familiarity with Supervised Learning (labeled data), Unsupervised Learning (pattern discovery), and Reinforcement Learning (reward-based).
- Basic Terminology: Knowledge of models, algorithms, training vs. inferencing, and neural network basics (input, hidden, and output layers).
- AWS Cloud Basics: Basic understanding of AWS global infrastructure and identity management (IAM).
## Module Breakdown
| Module ID | Module Name | Focus Area | Difficulty |
|---|---|---|---|
| GEN-01 | Technical Foundations | Tokens, Embeddings, and Vectors | Beginner |
| GEN-02 | Architectures & Models | Transformers, FMs, and Multi-modal systems | Intermediate |
| GEN-03 | The Model Lifecycle | Pre-training, Fine-tuning, and Adaptation | Advanced |
| GEN-04 | Interaction & Controls | Prompt Engineering and Inference Parameters | Intermediate |
| GEN-05 | Business & AWS Strategy | Cost-tradeoffs, Bedrock, and SageMaker | Intermediate |
## Learning Objectives per Module
Module 1: Technical Foundations
- Define Tokenization: Explain how text is broken into discrete units (words, sub-words, or characters) for model processing.
- Explain Embeddings: Describe how text is converted into high-dimensional numerical vectors that capture semantic meaning.
- Chunking: Understand how large datasets are divided into manageable segments for vector search.
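The embedding concept above can be illustrated with a minimal NumPy sketch. The 4-dimensional vectors and their values are invented for illustration; production embedding models produce vectors with hundreds or thousands of dimensions.

```python
import numpy as np

# Toy word embeddings (hypothetical values, 4 dimensions for readability).
embeddings = {
    "cat": np.array([0.9, 0.1, 0.0, 0.2]),
    "dog": np.array([0.8, 0.2, 0.1, 0.3]),
    "car": np.array([0.1, 0.9, 0.8, 0.0]),
}

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine of the angle between two vectors: closer to 1.0 = more similar."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Semantically related words should point in similar directions.
print(cosine_similarity(embeddings["cat"], embeddings["dog"]))  # high
print(cosine_similarity(embeddings["cat"], embeddings["car"]))  # lower
```

Cosine similarity over embedding vectors is the same comparison a vector search performs over chunked documents.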
Module 2: Architectures & Models
- The Transformer Architecture: Describe the self-attention mechanism that allows models to handle long-range dependencies in text.
- Foundation Models (FMs): Identify large, pre-trained models that serve as versatile starting points for specific tasks.
- Specialized Models: Differentiate between Diffusion Models (iterative denoising, common in image generation) and Generative Adversarial Networks (GANs), which train a generator against a discriminator.
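The self-attention mechanism at the heart of the Transformer can be sketched in a few lines of NumPy. This is a single unmasked attention head with randomly initialized weights, purely for intuition; real models use multiple heads, masking, and learned parameters.

```python
import numpy as np

def self_attention(x: np.ndarray, w_q, w_k, w_v) -> np.ndarray:
    """Scaled dot-product self-attention for one sequence (no masking)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)                  # pairwise token affinities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ v                               # each token attends to all tokens

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8                              # 4 tokens, 8-dim embeddings
x = rng.normal(size=(seq_len, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)
print(out.shape)  # one contextualized vector per token
```

Because every token's output is a weighted mix of every other token's value vector, the model can relate words that are far apart in the sequence, which is the "long-range dependency" property noted above.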
Module 3: The Model Lifecycle
- Lifecycle Stages: Map the path from data selection and pre-training to fine-tuning, evaluation, and deployment.
- Adaptation Techniques: Compare Retrieval Augmented Generation (RAG), Fine-tuning, and In-context Learning.
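The retrieval step of RAG can be sketched as a similarity search over embedded chunks followed by prompt assembly. The chunk texts and 2-dimensional vectors below are invented for illustration; in practice a vector store such as a Bedrock Knowledge Base holds model-generated embeddings.

```python
import numpy as np

# Hypothetical document chunks with toy embeddings.
chunks = [
    ("The AIF-C01 exam covers generative AI fundamentals.", np.array([0.9, 0.1])),
    ("Our refund policy allows returns within 30 days.", np.array([0.1, 0.9])),
]

def retrieve(query_vec: np.ndarray, k: int = 1) -> list[str]:
    """Return the k chunk texts most cosine-similar to the query embedding."""
    def score(chunk):
        vec = chunk[1]
        return np.dot(query_vec, vec) / (np.linalg.norm(query_vec) * np.linalg.norm(vec))
    ranked = sorted(chunks, key=score, reverse=True)
    return [text for text, _ in ranked[:k]]

query_vec = np.array([0.8, 0.2])   # stands in for an embedded user question
context = retrieve(query_vec)[0]
prompt = f"Context: {context}\n\nQuestion: What does the exam cover?"
print(prompt)
```

Unlike fine-tuning, this approach injects knowledge at inference time: the model's weights never change, only the prompt does.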
Module 4: Interaction & Controls
- Prompt Engineering: Apply zero-shot, few-shot, and chain-of-thought techniques to guide model output.
- Inference Parameters: Adjust Temperature (T) and Top-P to control the randomness and creativity of model output.

> [!TIP]
> - Lower Temperature (T → 0) = more deterministic/factual output.
> - Higher Temperature (T → 1) = more creative/diverse output.
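How Temperature and Top-P interact can be made concrete with a small sampling sketch. The logit values are invented, and this is a simplified decoder (real inference stacks add repetition penalties, Top-K, and other controls).

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, top_p=1.0, rng=None):
    """Temperature scaling, then nucleus (Top-P) filtering, then sampling."""
    rng = rng or np.random.default_rng()
    scaled = np.asarray(logits, dtype=float) / max(temperature, 1e-8)
    probs = np.exp(scaled - scaled.max())
    probs /= probs.sum()
    # Keep the smallest set of tokens whose cumulative probability >= top_p.
    order = np.argsort(probs)[::-1]
    cumulative = np.cumsum(probs[order])
    keep = order[: int(np.searchsorted(cumulative, top_p)) + 1]
    filtered = np.zeros_like(probs)
    filtered[keep] = probs[keep]
    filtered /= filtered.sum()
    return int(rng.choice(len(probs), p=filtered))

logits = [2.0, 1.0, 0.1, -1.0]   # hypothetical scores for 4 candidate tokens
# Near-zero temperature sharpens the distribution: the top token dominates.
print(sample_next_token(logits, temperature=0.01))
```

Raising the temperature flattens the distribution so lower-scoring tokens get sampled more often, while lowering Top-P trims the long tail of unlikely tokens before sampling.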
## Success Metrics
To demonstrate mastery of this curriculum, learners must be able to evaluate both technical model performance and business impact using the following metrics:
1. Technical Performance Metrics
- ROUGE (Recall-Oriented Understudy for Gisting Evaluation): Used primarily for summarization quality.
- BLEU (Bilingual Evaluation Understudy): Used for assessing translation accuracy.
- BERTScore: Leverages contextual embeddings to find semantic similarity between generated and reference text.
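The intuition behind ROUGE can be shown with a simplified ROUGE-1 recall calculation: the fraction of reference unigrams that appear in the generated text. Real implementations (e.g. the `rouge-score` package) add stemming and clipped counts; this sketch is for intuition only.

```python
def rouge1_recall(reference: str, candidate: str) -> float:
    """Fraction of reference unigrams that also appear in the candidate."""
    ref_tokens = reference.lower().split()
    cand_tokens = set(candidate.lower().split())
    overlap = sum(1 for token in ref_tokens if token in cand_tokens)
    return overlap / len(ref_tokens)

reference = "the cat sat on the mat"
candidate = "a cat sat on a mat"
print(rouge1_recall(reference, candidate))  # 4 of 6 reference tokens matched
```

Because ROUGE is recall-oriented, it rewards summaries that cover the reference content; BLEU, by contrast, is precision-oriented, and BERTScore compares embeddings rather than exact token overlaps.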
2. Business Value Metrics
- ROI & Conversion Rate: Measuring the financial return on implementing GenAI solutions.
- Efficiency Gains: Reduction in time-to-market and operational costs.
- Accuracy & Trust: Tracking the rate of hallucinations (incorrect info presented as fact) and model bias.
## Real-World Application
Industry Use Cases
- Customer Service: Using Amazon Lex and Amazon Bedrock Agents to create responsive, multi-step chatbots.
- Knowledge Management: Implementing RAG using Amazon Bedrock Knowledge Bases to query internal corporate documents.
- Software Development: Utilizing Amazon Q for automated code generation and bug fixing.
Risk & Responsibility
Learners will apply the Responsible AI framework to mitigate risks such as:
- Nondeterminism: Managing the fact that LLMs can produce different outputs for the same prompt.
- Security: Protecting against prompt injection, hijacking, and jailbreaking.
- Compliance: Ensuring data used in fine-tuning meets legal and sustainability standards.
> [!IMPORTANT]
> GenAI is not a "silver bullet." A core skill in this curriculum is performing a cost-benefit analysis to determine when a simple regression model or a non-AI solution is more appropriate than a massive Foundation Model.