Leader 01Chapter 2 of 8

How AI Models Learn — What Business Leaders Must Understand to Evaluate Vendors

~8 min essentials·23 min full·8 sections

Understanding how models learn changes how you evaluate vendor claims, negotiate contracts, and set expectations for AI performance. The training process determines what the model can and cannot do — and that translates directly into business risk and budget decisions.

Full — every example, fold, and depth note.

Key takeaway

Models learn from data — and the quality, recency, and ownership of that data determines what the model can do for your business. Fine-tuning and transfer learning are the practical levers that make pre-trained models business-ready, but they require data investment and governance that must be planned before deployment.

Highlight any sentence below for a plain-English explanation

§2.1·~1 min

What Training Actually Means

How a model goes from nothing to useful — and why this process determines everything downstream

Key takeaway

Training is the process by which a model learns patterns from data. It is a capital expenditure that happens before you ever use the model. Understanding what was in the training data — and what was not — explains most of what the model can and cannot do.

Why this matters for you

When a vendor says 'our model was trained on industry data', that sentence contains a procurement decision: whose data, how much, how recent, and labelled by whom? Each answer changes your risk and value assessment.

Training a machine learning model is fundamentally an optimisation process. The system is shown millions of examples — input-output pairs — and adjusts its internal parameters until its outputs match the expected results as closely as possible. The result is a set of numerical weights that encode the learned patterns. Before purchasing any AI tool, ask: what problem was the training data designed to solve, and does that match our problem?

§2.2·~1 min

Parameters — The Weight of Learning

What model size means for capability, cost, and vendor pricing

Key takeaway

Parameters are the numerical values that encode what a model has learned. More parameters generally mean more capability — and more compute cost. The parameter count in vendor specifications is a pricing and capability signal, not just a technical specification.

Why this matters for you

Vendors price model access partly on parameter count. CFOs reviewing AI tool budgets and CTOs selecting model tiers need to understand what they are buying when they see '7B', '70B', or '400B' parameter models — and whether the capability difference justifies the cost difference.

A model's parameters are numerical weights that determine its behaviour on any input. Think of parameters as the model's encoded memory of everything it learned during training. A 7-billion parameter model has 7 billion such weights. A 400-billion parameter model has 400 billion. Each additional parameter requires storage and compute to run. Do not select AI models primarily on parameter count. Select on benchmark performance for your specific task at your required cost and latency.

§2.3·~1 min

Training Data is Destiny

Why the data the model learned from determines what your business can rely on it to do

Key takeaway

A model can only perform well on patterns that were present in its training data. Training data determines capability ceilings, failure modes, demographic representation, and the domains where the model is reliable. Business leaders must understand training data provenance before deploying AI in consequential decisions.

Why this matters for you

Most AI project failures that are labelled 'the model didn't work' are actually 'the training data did not match the deployment context'. This is a procurement and scoping failure that business leaders can prevent.

Training data defines the distribution the model understands. A model trained on English-language customer service transcripts from US retail will perform poorly on Australian English, on B2B contexts, or on multilingual interactions — because those patterns were not in the training distribution. Before any AI deployment, define your actual deployment distribution — the language, context, domain, and population — and verify that the training data covered it.

Training Data is Destiny

A model can only perform well on patterns that were present in its training data. Training data determines capability ceilings, failure modes, demographic…

Initialize parametersStart with trainable weights that encode model behavior.

Run forward passGenerate predictions from current weights on training examples.

Compute lossMeasure prediction error against labeled outcomes.

Backpropagate gradientsCalculate how each weight contributed to the error.

Update and repeatAdjust weights over many batches until validation performance stabilizes.

§2.4·~1 min

Overfitting — When the Model Memorises Instead of Learns

The performance illusion that collapses in production — and how to catch it before you sign

Key takeaway

Overfitting occurs when a model performs well on its training data but fails on new data it has never seen. It is the most common technical reason AI tools look impressive in vendor demos and disappoint in production. Business leaders can catch it by insisting on out-of-sample performance data.

Why this matters for you

A vendor who shows you only training performance is showing you a model's memory, not its intelligence. Production performance on your data is the only number that matters — and the gap between demo and production is where AI projects most commonly fail.

An overfit model has memorised the training examples rather than learning the underlying pattern. Imagine a model trained to identify at-risk customers. If overfit, it recognises the specific customers it saw in training — but when presented with a new at-risk customer who looks slightly different, it misclassifies confidently. Always request test-set performance — performance on data the model never saw during training — from any AI vendor. Training performance without test performance is not evidence.

§2.5·~1 min

Fine-Tuning — Making a General Model Specific

The practical mechanism behind most 'custom AI' offers — what it costs and what it delivers

Key takeaway

Fine-tuning takes a pre-trained general model and continues training it on domain-specific data to improve performance in a target area. Most 'custom AI' vendor offers are fine-tuning, not training from scratch. Understanding this distinction clarifies cost expectations, ownership questions, and performance limits.

Why this matters for you

Fine-tuning is the most commercially viable path to domain-specific AI for mid-market organisations. But it requires data investment, raises data rights questions, and has performance ceilings set by the base model. Leaders who understand this negotiate better and set correct expectations.

Fine-tuning does not build a new model — it adapts an existing one. A vendor offering 'custom AI trained on your industry data' almost certainly means: we took GPT-4 or Llama or a similar foundation model and continued training it on domain-specific examples. The result inherits the base model's capabilities — and its limitations. When vendors offer custom AI, clarify: is this fine-tuning on a foundation model, and which model? The base model's capabilities and licensing terms apply to your 'custom' tool.

Fine-Tuning — Making a General Model Specific

Fine-tuning takes a pre-trained general model and continues training it on domain-specific data to improve performance in a target area. Most 'custom AI'…

Start from base modelRent a strong general-purpose foundation model

Curate domain dataCollect labelled examples for your target task

Fine-tune weightsContinue training on domain-specific corpus

Evaluate liftCompare against prompt-only baseline on golden set

Deploy with monitoringTrack drift and retrain when performance drops

§2.6·~1 min

Transfer Learning — Standing on Capable Shoulders

How pre-trained knowledge becomes business-ready capability without prohibitive cost

Key takeaway

Transfer learning is the mechanism that makes practical AI feasible for organisations without Google-scale compute budgets. A model trained on vast general knowledge transfers that learning to a specific domain with comparatively modest additional training. Almost all enterprise AI tools today are transfer learning applications.

Why this matters for you

Understanding transfer learning explains why AI tools improve rapidly, why base model provider relationships matter, and why the vendor landscape is structured around a small number of foundation model providers whose capabilities become everyone else's starting point.

Transfer learning means reusing knowledge learned in one context to accelerate learning in another. A language model pre-trained on billions of web pages, books, and code has learned grammar, facts, reasoning patterns, and domain vocabulary. When fine-tuned on medical text, it does not need to re-learn English grammar or clinical terminology structure from scratch — it transfers that knowledge. The practical implication: your AI project timeline should not include building foundational capabilities. Budget for domain adaptation, not capability creation from zero.

§2.7·~1 min

Training Data in the Tool You Are Already Using

Where the model underlying your vendor's product learned — and what that means for its behaviour

Key takeaway

Every AI tool your organisation uses is downstream of training decisions made by someone else. The tool's behaviour — its strengths, its biases, its knowledge cutoffs, its demographic performance — reflects choices about training data that you can investigate but did not make. Leaders who ask these questions before deployment are exercising fiduciary responsibility.

Why this matters for you

Your legal, compliance, HR, and operations teams are making decisions based on AI tool outputs. Those outputs reflect training data choices that affect accuracy, fairness, and regulatory compliance. 'The vendor is responsible' is not an adequate governance position.

The AI tools your organisation uses contain embedded training decisions affecting their behaviour. A vendor's document classifier was trained on specific document types — likely English-language, US or EU format, from specific industries. A vendor's people analytics tool was trained on specific workforce demographics. These training choices determine where the tool is reliable and where it is not. Add 'model card or data sheet provision' to your AI vendor onboarding checklist as a standard governance requirement.

§2.8·~1 min

BL Vendor Training Questions

The eight questions every business leader should ask before signing an AI vendor contract

Key takeaway

Eight questions, asked in the right order, surface the training-related risks in any AI vendor engagement. They take thirty minutes and require no technical background. Leaders who run them consistently avoid the most common AI procurement failures.

Why this matters for you

Training quality determines product quality. Vendors who cannot answer these questions clearly are concealing limitations that will appear in production. Vendors who answer them well earn faster trust and accelerate the governance process.

Questions one through three: understanding what was learned and from what. One: what data was the model trained on — what domain, geography, language, time period, and demographic coverage? Two: what is the knowledge or data cutoff, and what is the update or retraining schedule? Three: was the training data labelled by humans or automatically, and what was the error rate in labelling? If a vendor cannot answer these three questions with specifics, the governance burden shifts to you — and the deployment risk increases proportionally.

Real product examples

As a business leader: you own budget, risk, and adoption — not model weights. Every section ends with a decision you can make in your next leadership meeting.

Bloomberg GPT — purpose-built financial training

Bloomberg trained a 50B parameter model on 700B tokens of financial documents — earnings transcripts, analyst reports, regulatory filings, news. The performance advantage on financial NLP tasks reflects the training specialisation. Finance leaders evaluating AI for financial analysis should ask whether the model was trained on domain-appropriate data — not just whether it is a 'large language model'.

Concept check · 1 of 6

Multiple choice

A vendor offers 'custom AI trained on your industry data' for your HR use case. What is the most important clarifying question?

Vetted by Krishna KumarCurator, FactorBeam