Step 1 — AI/ML Fundamentals

Most people studying for AIF-C01 have used ChatGPT or Alexa, but few can explain the layers underneath them. That gap is exactly what this exam probes first — not your ability to write code, but whether you understand what these systems are, how they learn, and which AWS service handles which job. Let’s build that foundation properly.

Untangling AI, ML, and Deep Learning

These three terms get thrown around interchangeably, and the exam will absolutely test whether you know they’re not the same thing. Think of them as nested circles, each one a subset of the one before it.

┌─────────────────────────────────────────────────────┐
│  Artificial Intelligence                             │
│  Any technique that lets machines mimic behavior     │
│  we'd call "intelligent" (rule engines, planning,     │
│  search algorithms, expert systems, ML...)            │
│                                                        │
│   ┌─────────────────────────────────────────────┐    │
│   │  Machine Learning                            │    │
│   │  Systems that learn patterns from data        │    │
│   │  instead of being explicitly programmed        │    │
│   │                                                │    │
│   │   ┌───────────────────────────────────────┐  │    │
│   │   │  Deep Learning                        │  │    │
│   │   │  ML using multi-layer neural           │  │    │
│   │   │  networks — powers modern vision,      │  │    │
│   │   │  speech, and language models           │  │    │
│   │   │                                        │  │    │
│   │   │   ┌───────────────────────────────┐   │  │    │
│   │   │   │  Generative AI                │   │  │    │
│   │   │   │  Deep learning models that     │   │  │    │
│   │   │   │  create new content: text,     │   │  │    │
│   │   │   │  images, audio, code           │   │  │    │
│   │   │   └───────────────────────────────┘   │  │    │
│   │   └───────────────────────────────────────┘  │    │
│   └─────────────────────────────────────────────┘    │
└─────────────────────────────────────────────────────┘

AI is the umbrella. A 1980s chess program using hand-written rules is AI, but it isn’t ML — nobody trained it on data, someone just wrote “if opponent moves here, respond there.” ML flips that: instead of writing the rules, you feed the system examples and let it discover the rules itself. Deep learning is a specific ML approach built from layered neural networks, and generative AI is deep learning aimed specifically at producing new content rather than just classifying or predicting a number.

You will see distractor answers on the exam that swap these terms deliberately — “which of the following is a subset of machine learning” being one of the more common phrasings. Keep the nesting order memorized and you’ll clear those questions without even reading the options twice.

The Three Ways a Model Learns

Every ML approach fits into one of a small number of learning paradigms. AIF-C01 wants conceptual fluency here — no math, just correct pattern recognition.

Supervised learning — You give the model labeled examples: inputs paired with correct answers. Show it thousands of emails tagged “spam” or “not spam,” and it learns the mapping. This covers most classification and regression tasks: predicting house prices, detecting fraud, classifying images.

Unsupervised learning — No labels at all. The model looks at raw data and finds structure on its own — grouping similar customers together (clustering), or reducing thousands of features down to a handful that matter (dimensionality reduction). You use this when you don’t already know what the “right answer” categories are.

Reinforcement learning — An agent takes actions in an environment and gets rewards or penalties, gradually learning a policy that maximizes reward over time. Think of a robot learning to walk, or a game-playing agent. This paradigm also underlies how many modern chat-style models get fine-tuned to be more helpful and less harmful — a technique broadly known as reinforcement learning from human feedback.

Learning Type	Data Needed	Typical Use Case	Example AWS Service
Supervised	Labeled data	Fraud detection, demand forecasting	SageMaker (built-in algorithms)
Unsupervised	Unlabeled data	Customer segmentation, anomaly detection	SageMaker (clustering algorithms)
Reinforcement	Reward signal, no fixed labels	Robotics, game AI, model alignment	SageMaker RL, Bedrock model tuning

A quick gut-check question for yourself: if someone hands you a spreadsheet of 50,000 past loan applications with an “approved / denied” column already filled in, which paradigm applies? Supervised — the labels already exist, you’re just learning the mapping.

The ML Lifecycle, Start to Finish

Exam questions frequently describe a scenario and ask “which phase of the ML lifecycle does this represent?” So it pays to know the stages cold, and to know they loop rather than run once.

  ┌──────────────┐     ┌──────────────┐     ┌──────────────┐
  │   Business    │────▶│    Data      │────▶│    Data      │
  │   Problem     │     │  Collection  │     │  Preparation │
  │  Framing      │     │              │     │  & Cleaning  │
  └──────────────┘     └──────────────┘     └──────┬───────┘
                                                     │
  ┌──────────────┐     ┌──────────────┐     ┌───────▼──────┐
  │  Monitoring   │◀────│  Deployment  │◀────│    Model     │
  │  & Retraining │     │  & Inference │     │   Training   │
  └──────┬───────┘     └──────────────┘     └──────┬───────┘
         │                                          │
         │              ┌──────────────┐            │
         └─────────────▶│  Evaluation  │◀───────────┘
                        │  & Tuning    │
                        └──────────────┘

Business problem framing — Before touching data, define success. Are you optimizing for accuracy, latency, or cost? A fraud model that’s 99.9% accurate is worthless if it misses the 0.1% of transactions that are actually fraudulent — this is where accuracy alone can lie to you.

Data collection — Pulling raw data from wherever it lives: databases, logs, S3 buckets, third-party feeds.

Data preparation — Cleaning, deduplicating, handling missing values, and splitting into training, validation, and test sets. This step consumes more practitioner time than any other, and the exam knows it — expect at least one question framed around “the model is underperforming because of data quality.”

Model training — Feeding the prepared data through an algorithm so it learns parameters that minimize error.

Evaluation and tuning — Measuring performance against the test set using metrics appropriate to the task (accuracy, precision, recall, F1, RMSE, depending on whether it’s classification or regression), then adjusting hyperparameters and repeating.

Deployment and inference — Putting the trained model where it can serve real predictions, whether that’s real-time (an endpoint responding in milliseconds) or batch (processing a large file overnight).

Monitoring and retraining — Watching for model drift as real-world data shifts away from training data, then retraining before accuracy silently degrades.

Notice the loop back from monitoring into data collection. Models are not “finished” the day they deploy — they decay as the world around them changes, and a mature ML practice budgets for that from day one.

Where Real Organizations Use This

The exam likes to test recognition of use cases, matching a business scenario to the right category of AI/ML solution. A few patterns worth internalizing:

Healthcare — Medical image analysis, patient readmission risk scoring, drug discovery acceleration
Financial services — Fraud detection, credit risk modeling, algorithmic trading signals
Retail — Demand forecasting, personalized recommendations, dynamic pricing
Manufacturing — Predictive maintenance (catching equipment failure before it happens), quality inspection via computer vision
Media and entertainment — Content recommendation, automated captioning, generative content creation
Customer service — Chatbots, sentiment analysis on support tickets, call transcription and summarization

Where AWS Fits: The AI/ML Stack

AWS organizes its AI/ML offerings into three broad layers, each trading flexibility for ease of use in the opposite direction.

┌───────────────────────────────────────────────────────────┐
│ TOP LAYER — AI Services (pre-built, API-driven)            │
│  Rekognition (vision) · Comprehend (NLP) · Textract (OCR)   │
│  Transcribe (speech-to-text) · Polly (text-to-speech)       │
│  Translate · Lex (conversational bots)                      │
│  → No ML expertise required, fastest time to value          │
├───────────────────────────────────────────────────────────┤
│ MIDDLE LAYER — Amazon Bedrock                                │
│  Access foundation models from multiple providers,           │
│  build RAG apps, agents, and custom generative solutions      │
│  → Some prompt/architecture skill needed, no infra to manage │
├───────────────────────────────────────────────────────────┤
│ BOTTOM LAYER — Amazon SageMaker                               │
│  Full ML platform: build, train, tune, deploy custom models  │
│  → Requires ML/data science skill, maximum control            │
└───────────────────────────────────────────────────────────┘

If a scenario says “we need to extract text from scanned invoices quickly, no ML team available,” the answer is Textract — a pre-built AI service, not a custom SageMaker model. If the scenario says “we need full control over model architecture and training data for a proprietary use case,” that points toward SageMaker. If it says “we want to build a chatbot on top of an existing large language model without training anything from scratch,” that’s Bedrock.

A rough mental shortcut that helps on exam day: the higher you go up that stack, the less you build and the faster you ship; the lower you go, the more control you get and the more expertise it demands.

Exam Focus: What Questions Test From This Step

Correctly ordering AI ⊃ ML ⊃ Deep Learning ⊃ Generative AI as nested subsets, not synonyms
Matching a data scenario (labeled vs. unlabeled vs. reward-based) to supervised, unsupervised, or reinforcement learning
Identifying which phase of the ML lifecycle a described activity belongs to, especially data preparation and monitoring/drift
Recognizing that models degrade over time and require retraining — not a “set and forget” exam trap
Choosing the correct AWS layer (AI services vs. Bedrock vs. SageMaker) given a scenario’s skill level and control requirements
Matching business use cases (fraud detection, predictive maintenance, personalization) to the right AI/ML category

Written by NPBlue Cloud Team — Cloud & Platform Engineers who runs production workloads on AWS daily and writes from real deployment experience, not the docs alone.

Reviewed for technical accuracy. Spot an error? Let us know.