AI Foundations for PMs
PM 01Chapter 1 of 7

AI vs ML vs Deep Learning

~6 min essentials·18 min full·6 sections

The hierarchy you'll explain a hundred times in your career — AI vs ML vs Deep Learning, and how separating them changes how you scope work and earn engineering trust.

Full — every example, fold, and depth note.

Key takeaway

AI is the umbrella, machine learning is one approach to AI, and deep learning is one approach to ML. Every deep learning system is ML; every ML system is AI; the reverse is never true.

Highlight any sentence below for a plain-English explanation
§1.1·~1 min

What is Artificial Intelligence

More than robots — why every software decision is now an AI decision

Key takeaway

AI is a behaviour label, not a technique. The moment software starts making judgement calls instead of executing instructions, you're inside the AI conversation — even when nothing modern is happening under the hood.

Why this matters for you

The next time a vendor opens a procurement call with 'we're an AI-first platform', you'll know that sentence describes their marketing posture, not their architecture. Your follow-up question — which judgement calls is your software actually making? — sets the tone for the rest of the meeting.

Walk into any product review in 2026 and someone will say their feature is 'AI-powered' before they've described what it does. The phrase is doing a lot of work — and most of it is rhetorical. Artificial intelligence is the umbrella term for any software system that does something we'd call intelligent if a human did it, which is so broad that it captures everything from a thermostat with three temperature rules to GPT-5. What that means in practice is that 'AI' describes the behaviour, not the mechanism. A chess engine that brute-forces every move with hand-coded heuristics is AI. A model trained on every game ever played is also AI. The label tells you nothing about how the thing was built.

§1.2·~1 min

What is Machine Learning

When systems learn from data instead of following rules

Key takeaway

Machine learning is the moment you stop writing rules and start curating data. The system's behaviour becomes a function of what you trained it on — which means your dataset, not your code, is the product surface area.

Why this matters for you

In your next sprint review, when an engineer says 'the model is underperforming on enterprise accounts', the right reflex isn't to ask what's broken in the code. It's to ask what enterprise data was in the training set. ML bugs almost always live in the data first.

Picture two ways to build a spam filter. The first is a senior engineer writing 200 lines of if-statements: block this sender, flag that keyword, score these patterns. The second is showing a system a hundred million emails labelled spam or not-spam and letting it work out the pattern itself. Machine learning is the second approach: instead of programming behaviour directly, you give the system examples and let an algorithm derive the rules from them. The first approach is engineering. The second is closer to teaching, with all the messiness that implies.

§1.3·~1 min

What is Deep Learning

Why neural networks changed everything

Key takeaway

Deep learning is the kind of ML that learns its own features. That single capability is what cracked vision, speech, translation and language — and why the last decade of AI progress looks like a wall, not a slope.

Why this matters for you

When an engineer tells you a feature 'needs a deep learning model', they are telling you three things: it needs a lot of data, it needs a lot of compute, and you will not be able to explain individual decisions to a customer or regulator. Plan for all three before you commit on the roadmap.

For decades, getting an ML system to do anything useful with images, speech or text required a small army of domain experts to hand-design the inputs. You couldn't just hand a model raw pixels — you'd hand it carefully engineered features like edge counts, colour histograms, or speech spectrograms, and the model learned from those. Deep learning collapsed all of that. A deep neural network takes raw inputs — pixels, audio waveforms, characters — and learns the useful features for itself, layer by layer.

Feature Extraction in Deep Learning

Deep learning models automatically build complex concepts from simple ones by passing data through sequential hidden layers, removing the need for human engineers to hand-label features.

Start with broad AIDefine the full automation opportunity before selecting methods.
Narrow to machine learningIdentify tasks where prediction from historical data creates value.
Apply deep learningUse neural networks for language, vision, or speech complexity.
Price the trade-offBalance capability gains against cost, opacity, and dependency risk.
Deploy with controlsSet evaluation, fallback paths, and monitoring before scaling.
§1.4·~1 min

The nested hierarchy explained

How AI, ML and DL relate — and why conflating them costs you credibility

Key takeaway

AI ⊃ ML ⊃ Deep Learning. Three nested circles, in that order, no exceptions. Holding the hierarchy in your head with that level of precision is the cheapest credibility upgrade available to a non-technical PM.

Why this matters for you

In your next architecture review, when a staff engineer says 'this is ML but it's not deep learning', the look on their face will tell you whether you reacted in a way that earned the next ten minutes of their attention. The hierarchy is the price of admission to those conversations.

The AI, ML, and DL Hierarchy

Every deep learning system is machine learning, and every machine learning system is AI, but the reverse is never true.

Artificial Intelligence (outer set)
Any system that performs tasks requiring human-like intelligence.
Rules engines, search, optimization, machine learning
Machine Learning (inside AI)
Models learn patterns from data instead of hard-coded rules.
Supervised learning, unsupervised learning, reinforcement learning
Deep Learning (inside ML)
Neural-network methods for high-dimensional, unstructured signals.
Transformers, large vision models, speech models

The cleanest way to hold these three terms in your head is as three nested circles. AI is the outer circle: any system that behaves intelligently. Machine learning sits inside it: AI systems that learn from data. Deep learning sits inside ML: the subset that uses large neural networks. That asymmetry matters. The terms are not synonyms with different vibes — they describe progressively smaller and more specific commitments about how a system is built.

§1.5·~1 min

Rule-based systems vs learned systems

The question that exposes fake AI in vendor pitches

Key takeaway

If you could write the system as a list of if-statements, it's not machine learning — no matter what the slide says. The distinction is binary, auditable, and the single most useful diagnostic question in your AI vocabulary.

Why this matters for you

In every vendor procurement call you'll run for the rest of your career, this is the question that saves you from buying a rules engine at AI prices. Ask it in the first ten minutes; the answer changes everything that comes after.

A rule-based system is software where every behaviour was written down by a human. If income > $50k AND credit_score > 700 → approve loan. If subject_line contains 'lottery' → flag email. If user hasn't logged in for 14 days → send re-engagement email. A learned system has no explicit rules. It has a trained model — a pile of weights — that maps inputs to outputs. The 'rules' are implicit and statistical, and they shift every time the model is retrained.

The Paradigm Shift in Software Engineering

Traditional software requires a human to explicitly write the rules; machine learning requires a human to provide the correct answers so the algorithm can discover the rules.

Model Confidence
How sure the model sounds
Probability-like confidence can remain high even when answers are wrong.
Actual Correctness
What users truly need
Ground-truth accuracy must be measured independently of model self-belief.
Calibration Controls
How to close the gap
Use abstention rules, verification steps, and human review thresholds.
§1.6·~1 min

PM decision lens: spotting real AI vs marketing AI

The 3 questions to ask before trusting any 'AI-powered' claim

Key takeaway

Three questions, asked in order, separate real AI work from marketing veneer in under twenty minutes. The questions are free to ask and expensive to skip — they are the closest thing the field has to a universal due-diligence script.

Why this matters for you

Next time you sit through a vendor demo or an internal pitch with 'AI-powered' in the deck, ask these three questions in order and watch the room. The PMs who consistently ask them get put on AI strategy committees; the ones who don't get handed the rollout.

Question one: is there a trained model, or are these hand-written rules? This is the rules-versus-learning diagnostic from the previous section, applied as the opening move. If the answer is 'no model, just rules', the conversation is no longer about AI — it's about software, with all the normal evaluation criteria. A useful follow-up: 'what kind of model, and what does it predict?' One sentence of specificity here separates a serious team from a marketing one. You don't need to understand the technique; you need to hear that they do.

As a PM: the version of you that conflates these terms gets handed feature requests. The version that separates them cleanly gets handed strategy.

Stripe Radar — AI as a hybrid product, not a single technique

Stripe Radar markets itself as AI for fraud detection, and it genuinely is — but it sits on top of an explicit rules engine that merchants can edit themselves. The dashboard separates 'rule blocks' from 'model risk score' and shows both for every transaction. That separation is not an implementation detail; it's the product. Merchants tune each layer independently, which builds more trust than hiding the seams ever would. The wider lesson: mature AI products are almost always hybrids, and exposing the layers is a feature.

Concept check · 1 of 6
Multiple choice

A roadmap deck has a slide titled 'AI-powered onboarding'. The feature is a series of if-statements that show different welcome screens based on the user's role, plan, and signup source. Which of these is the most credibility-preserving thing to say in the review?

Portrait of Krishna Kumar, Curator

Vetted by Krishna KumarCurator, FactorBeam