Cloud/ AWS / AWS Certified AI Practitioner (AIF-C01) / AI & ML Fundamentals for AWS Certified AI Practitioner (AIF-C01)

AWS Amazon Web Services Foundational Step 1 of 5 106 guides ยท updated 2026

Hands-on guides to compute, storage, databases, networking, and serverless on the world's most widely adopted cloud platform.

Step 1 โ€” AI/ML Fundamentals

Most people studying for AIF-C01 have used ChatGPT or Alexa, but few can explain the layers underneath them. That gap is exactly what this exam probes first โ€” not your ability to write code, but whether you understand what these systems are, how they learn, and which AWS service handles which job. Letโ€™s build that foundation properly.


Untangling AI, ML, and Deep Learning

These three terms get thrown around interchangeably, and the exam will absolutely test whether you know theyโ€™re not the same thing. Think of them as nested circles, each one a subset of the one before it.

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Artificial Intelligence โ”‚
โ”‚ Any technique that lets machines mimic behavior โ”‚
โ”‚ we'd call "intelligent" (rule engines, planning, โ”‚
โ”‚ search algorithms, expert systems, ML...) โ”‚
โ”‚ โ”‚
โ”‚ โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”‚
โ”‚ โ”‚ Machine Learning โ”‚ โ”‚
โ”‚ โ”‚ Systems that learn patterns from data โ”‚ โ”‚
โ”‚ โ”‚ instead of being explicitly programmed โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ Deep Learning โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ ML using multi-layer neural โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ networks โ€” powers modern vision, โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ speech, and language models โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚ Generative AI โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚ Deep learning models that โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚ create new content: text, โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ”‚ images, audio, code โ”‚ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ”‚ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ”‚ โ”‚ โ”‚
โ”‚ โ”‚ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ”‚ โ”‚
โ”‚ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

AI is the umbrella. A 1980s chess program using hand-written rules is AI, but it isnโ€™t ML โ€” nobody trained it on data, someone just wrote โ€œif opponent moves here, respond there.โ€ ML flips that: instead of writing the rules, you feed the system examples and let it discover the rules itself. Deep learning is a specific ML approach built from layered neural networks, and generative AI is deep learning aimed specifically at producing new content rather than just classifying or predicting a number.

You will see distractor answers on the exam that swap these terms deliberately โ€” โ€œwhich of the following is a subset of machine learningโ€ being one of the more common phrasings. Keep the nesting order memorized and youโ€™ll clear those questions without even reading the options twice.


The Three Ways a Model Learns

Every ML approach fits into one of a small number of learning paradigms. AIF-C01 wants conceptual fluency here โ€” no math, just correct pattern recognition.

Supervised learning โ€” You give the model labeled examples: inputs paired with correct answers. Show it thousands of emails tagged โ€œspamโ€ or โ€œnot spam,โ€ and it learns the mapping. This covers most classification and regression tasks: predicting house prices, detecting fraud, classifying images.

Unsupervised learning โ€” No labels at all. The model looks at raw data and finds structure on its own โ€” grouping similar customers together (clustering), or reducing thousands of features down to a handful that matter (dimensionality reduction). You use this when you donโ€™t already know what the โ€œright answerโ€ categories are.

Reinforcement learning โ€” An agent takes actions in an environment and gets rewards or penalties, gradually learning a policy that maximizes reward over time. Think of a robot learning to walk, or a game-playing agent. This paradigm also underlies how many modern chat-style models get fine-tuned to be more helpful and less harmful โ€” a technique broadly known as reinforcement learning from human feedback.

Learning TypeData NeededTypical Use CaseExample AWS Service
SupervisedLabeled dataFraud detection, demand forecastingSageMaker (built-in algorithms)
UnsupervisedUnlabeled dataCustomer segmentation, anomaly detectionSageMaker (clustering algorithms)
ReinforcementReward signal, no fixed labelsRobotics, game AI, model alignmentSageMaker RL, Bedrock model tuning

A quick gut-check question for yourself: if someone hands you a spreadsheet of 50,000 past loan applications with an โ€œapproved / deniedโ€ column already filled in, which paradigm applies? Supervised โ€” the labels already exist, youโ€™re just learning the mapping.


The ML Lifecycle, Start to Finish

Exam questions frequently describe a scenario and ask โ€œwhich phase of the ML lifecycle does this represent?โ€ So it pays to know the stages cold, and to know they loop rather than run once.

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Business โ”‚โ”€โ”€โ”€โ”€โ–ถโ”‚ Data โ”‚โ”€โ”€โ”€โ”€โ–ถโ”‚ Data โ”‚
โ”‚ Problem โ”‚ โ”‚ Collection โ”‚ โ”‚ Preparation โ”‚
โ”‚ Framing โ”‚ โ”‚ โ”‚ โ”‚ & Cleaning โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ผโ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Monitoring โ”‚โ—€โ”€โ”€โ”€โ”€โ”‚ Deployment โ”‚โ—€โ”€โ”€โ”€โ”€โ”‚ Model โ”‚
โ”‚ & Retraining โ”‚ โ”‚ & Inference โ”‚ โ”‚ Training โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚ โ”‚
โ”‚ โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถโ”‚ Evaluation โ”‚โ—€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚ & Tuning โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Business problem framing โ€” Before touching data, define success. Are you optimizing for accuracy, latency, or cost? A fraud model thatโ€™s 99.9% accurate is worthless if it misses the 0.1% of transactions that are actually fraudulent โ€” this is where accuracy alone can lie to you.

Data collection โ€” Pulling raw data from wherever it lives: databases, logs, S3 buckets, third-party feeds.

Data preparation โ€” Cleaning, deduplicating, handling missing values, and splitting into training, validation, and test sets. This step consumes more practitioner time than any other, and the exam knows it โ€” expect at least one question framed around โ€œthe model is underperforming because of data quality.โ€

Model training โ€” Feeding the prepared data through an algorithm so it learns parameters that minimize error.

Evaluation and tuning โ€” Measuring performance against the test set using metrics appropriate to the task (accuracy, precision, recall, F1, RMSE, depending on whether itโ€™s classification or regression), then adjusting hyperparameters and repeating.

Deployment and inference โ€” Putting the trained model where it can serve real predictions, whether thatโ€™s real-time (an endpoint responding in milliseconds) or batch (processing a large file overnight).

Monitoring and retraining โ€” Watching for model drift as real-world data shifts away from training data, then retraining before accuracy silently degrades.

Notice the loop back from monitoring into data collection. Models are not โ€œfinishedโ€ the day they deploy โ€” they decay as the world around them changes, and a mature ML practice budgets for that from day one.


Where Real Organizations Use This

The exam likes to test recognition of use cases, matching a business scenario to the right category of AI/ML solution. A few patterns worth internalizing:


Where AWS Fits: The AI/ML Stack

AWS organizes its AI/ML offerings into three broad layers, each trading flexibility for ease of use in the opposite direction.

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ TOP LAYER โ€” AI Services (pre-built, API-driven) โ”‚
โ”‚ Rekognition (vision) ยท Comprehend (NLP) ยท Textract (OCR) โ”‚
โ”‚ Transcribe (speech-to-text) ยท Polly (text-to-speech) โ”‚
โ”‚ Translate ยท Lex (conversational bots) โ”‚
โ”‚ โ†’ No ML expertise required, fastest time to value โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ MIDDLE LAYER โ€” Amazon Bedrock โ”‚
โ”‚ Access foundation models from multiple providers, โ”‚
โ”‚ build RAG apps, agents, and custom generative solutions โ”‚
โ”‚ โ†’ Some prompt/architecture skill needed, no infra to manage โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ BOTTOM LAYER โ€” Amazon SageMaker โ”‚
โ”‚ Full ML platform: build, train, tune, deploy custom models โ”‚
โ”‚ โ†’ Requires ML/data science skill, maximum control โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

If a scenario says โ€œwe need to extract text from scanned invoices quickly, no ML team available,โ€ the answer is Textract โ€” a pre-built AI service, not a custom SageMaker model. If the scenario says โ€œwe need full control over model architecture and training data for a proprietary use case,โ€ that points toward SageMaker. If it says โ€œwe want to build a chatbot on top of an existing large language model without training anything from scratch,โ€ thatโ€™s Bedrock.

A rough mental shortcut that helps on exam day: the higher you go up that stack, the less you build and the faster you ship; the lower you go, the more control you get and the more expertise it demands.


Exam Focus: What Questions Test From This Step