Curriculum Overview: Azure Computer Vision Tools & Services

This curriculum provides a structured path to mastering the identification and application of Azure's suite of computer vision tools. Students will learn to distinguish between general-purpose vision APIs and specialized custom modeling services, ensuring they can select the most efficient tool for any visual AI workload.

Prerequisites

Before starting this module, learners should possess:

Fundamental AI Knowledge: Understanding of Unit 1 (AI Workloads) and Unit 2 (Machine Learning Principles) of the AI-900 curriculum.
Conceptual Awareness: Familiarity with what computer vision is (e.g., image classification vs. object detection).
Cloud Basics: A basic understanding of the Azure Portal and the concept of a "Resource."
Mathematical Intuition: A high-level grasp of how Convolutional Neural Networks (CNNs) process pixel data.

Module Breakdown

Module ID	Module Name	Focus Area	Difficulty
CV-01	The Azure AI Vision Service	Object detection, OCR, and spatial analysis	Beginner
CV-02	Azure AI Custom Vision	Training models with proprietary/niche data	Intermediate
CV-03	Specialized Face Analysis	Facial detection and attribute recognition	Beginner
CV-04	Azure AI Services (Multi-Service)	Unified endpoints and cross-functional AI apps	Intermediate

Learning Objectives per Module

CV-01: The Azure AI Vision Service

Identify the primary capabilities of Azure AI Vision, including image tagging and landmark detection.
Describe how Optical Character Recognition (OCR) is used to extract text from images and documents.
Map business requirements (like garage parking tracking) to specific vision features.

CV-02: Azure AI Custom Vision

Distinguish between pre-trained models and custom-trained models.
Define the workflow for tagging images and retraining a model for niche categories (e.g., identifying specific crop diseases).

CV-03: Specialized Face Analysis

Identify the specific capabilities of the Azure AI Face service.
Understand the difference between facial detection (is there a face?) and facial analysis (what are the attributes?).

CV-04: Azure AI Services (Unified Approach)

Describe the benefits of a single endpoint and access key for multi-modal applications.
Analyze scenarios where combining Vision, Translation, and Search is more efficient than using standalone resources.

Visual Selection Guide

Use the following logic to determine which service fits your specific computer vision project:

Loading Diagram...

Success Metrics

To demonstrate mastery of this curriculum, the learner must be able to:

Select the Tool: Correctly identify that Azure AI Vision is the service for object detection and facial analysis in standard scenarios.
Explain the Architecture: Understand that Convolutional Neural Networks (CNNs) power these services, specifically how Pooling Layers reduce feature map size to focus on essential details.
Optimize Management: Articulate why a travel app developer should use the Azure AI Services resource (single endpoint) rather than multiple individual keys.
Differentiate Use Cases: Explain why a generic vision model wouldn't work for agriculture disease detection, necessitating Custom Vision.

Real-World Application

Case Study: The Smart Urban Infrastructure

In a modern "Smart City," Azure Computer Vision tools work in tandem:

Parking Garages: Azure AI Vision tracks available spaces and detects unauthorized vehicles in real-time.
Agriculture: Farmers use mobile apps powered by Custom Vision to take photos of leaves and receive instant diagnoses of fungal infections specific to their local climate.
Retail/Accessibility: OCR (Optical Character Recognition) within the Vision service helps visually impaired shoppers read product labels and prices via their smartphones.

[!IMPORTANT] When designing these solutions, always refer to the Responsible AI principles (Fairness, Privacy, and Security), especially when utilizing facial analysis services.

Visualizing the CNN Backbone

All Azure Vision services rely on deep learning. Below is a simplified representation of how a CNN processes visual data before it reaches the Azure API output.

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

Curriculum Overview: Azure Computer Vision Tools & Services

Prerequisites

Before starting this module, learners should possess:

Fundamental AI Knowledge: Understanding of Unit 1 (AI Workloads) and Unit 2 (Machine Learning Principles) of the AI-900 curriculum.
Conceptual Awareness: Familiarity with what computer vision is (e.g., image classification vs. object detection).
Cloud Basics: A basic understanding of the Azure Portal and the concept of a "Resource."
Mathematical Intuition: A high-level grasp of how Convolutional Neural Networks (CNNs) process pixel data.

Module Breakdown

Module ID	Module Name	Focus Area	Difficulty
CV-01	The Azure AI Vision Service	Object detection, OCR, and spatial analysis	Beginner
CV-02	Azure AI Custom Vision	Training models with proprietary/niche data	Intermediate
CV-03	Specialized Face Analysis	Facial detection and attribute recognition	Beginner
CV-04	Azure AI Services (Multi-Service)	Unified endpoints and cross-functional AI apps	Intermediate

Learning Objectives per Module

CV-01: The Azure AI Vision Service

Identify the primary capabilities of Azure AI Vision, including image tagging and landmark detection.
Describe how Optical Character Recognition (OCR) is used to extract text from images and documents.
Map business requirements (like garage parking tracking) to specific vision features.

CV-02: Azure AI Custom Vision

Distinguish between pre-trained models and custom-trained models.
Define the workflow for tagging images and retraining a model for niche categories (e.g., identifying specific crop diseases).

CV-03: Specialized Face Analysis

Identify the specific capabilities of the Azure AI Face service.
Understand the difference between facial detection (is there a face?) and facial analysis (what are the attributes?).

CV-04: Azure AI Services (Unified Approach)

Describe the benefits of a single endpoint and access key for multi-modal applications.
Analyze scenarios where combining Vision, Translation, and Search is more efficient than using standalone resources.

Visual Selection Guide

Use the following logic to determine which service fits your specific computer vision project:

Loading Diagram...

Success Metrics

To demonstrate mastery of this curriculum, the learner must be able to:

Select the Tool: Correctly identify that Azure AI Vision is the service for object detection and facial analysis in standard scenarios.
Explain the Architecture: Understand that Convolutional Neural Networks (CNNs) power these services, specifically how Pooling Layers reduce feature map size to focus on essential details.
Optimize Management: Articulate why a travel app developer should use the Azure AI Services resource (single endpoint) rather than multiple individual keys.
Differentiate Use Cases: Explain why a generic vision model wouldn't work for agriculture disease detection, necessitating Custom Vision.

Real-World Application

Case Study: The Smart Urban Infrastructure

In a modern "Smart City," Azure Computer Vision tools work in tandem:

Parking Garages: Azure AI Vision tracks available spaces and detects unauthorized vehicles in real-time.
Agriculture: Farmers use mobile apps powered by Custom Vision to take photos of leaves and receive instant diagnoses of fungal infections specific to their local climate.
Retail/Accessibility: OCR (Optical Character Recognition) within the Vision service helps visually impaired shoppers read product labels and prices via their smartphones.

[!IMPORTANT] When designing these solutions, always refer to the Responsible AI principles (Fairness, Privacy, and Security), especially when utilizing facial analysis services.

Visualizing the CNN Backbone

All Azure Vision services rely on deep learning. Below is a simplified representation of how a CNN processes visual data before it reaches the Azure API output.

Compiling TikZ diagram…

⏳

Running TeX engine…

This may take a few seconds

Curriculum Overview: Azure Computer Vision Tools and Services

Curriculum Overview: Azure Computer Vision Tools & Services

Prerequisites

Module Breakdown

Learning Objectives per Module

CV-01: The Azure AI Vision Service

CV-02: Azure AI Custom Vision

CV-03: Specialized Face Analysis

CV-04: Azure AI Services (Unified Approach)

Visual Selection Guide

Success Metrics

Real-World Application

Case Study: The Smart Urban Infrastructure

Visualizing the CNN Backbone

Curriculum Overview: Azure Computer Vision Tools and Services

Curriculum Overview: Azure Computer Vision Tools & Services

Prerequisites

Module Breakdown

Learning Objectives per Module

CV-01: The Azure AI Vision Service

CV-02: Azure AI Custom Vision

CV-03: Specialized Face Analysis

CV-04: Azure AI Services (Unified Approach)

Visual Selection Guide

Success Metrics

Real-World Application

Case Study: The Smart Urban Infrastructure

Visualizing the CNN Backbone