Book a consult

Loading scheduler…

← Back to Blog
Measurement

Retail Demand Forecasting: A CPG Guide

Retail demand forecasting is the practice of estimating how many units of a product will sell, at which retailers, over a future window. For a consumer packaged goods brand, that forecast is the number the plant schedules against, the number finance plans cash against, and the number the sales team promises a buyer. Get it roughly right and the shelf stays stocked without a warehouse full of slow movers. Get it wrong in either direction and you are paying for the mistake, in lost sales on one side or write-offs on the other.

The hard part is not the math. Plenty of tools will fit a curve. The hard part is feeding the forecast the right inputs, choosing a method that matches the data you have, and being honest about how wrong the forecast tends to be. Each of those has a CPG trap that turns a reasonable forecast into a misleading one.

What goes into a retail demand forecast

A demand forecast is only as good as what you feed it. Four inputs do most of the work, and a forecast that ignores any of them will drift in predictable ways.

  • POS history. Point-of-sale data, what actually rang up at the register, is the spine of the forecast. Shipment data tells you what left your warehouse, which is a noisier proxy because it moves in case-pack lumps and reflects retailer ordering behavior, not shopper demand. Where you can get it, sell-through POS is the cleaner signal. For more on reading that signal, see CPG sales velocity.
  • Seasonality. Most CPG categories breathe on an annual cycle: sunscreen in summer, soup in winter, anything with a holiday hook in Q4. A forecast that does not separate the seasonal swing from the underlying trend will read a December spike as growth and a January drop as a problem, when both are just the calendar.
  • The promotion calendar. Promotions distort sell-through more than almost anything else. A feature-and-display week can move several times a base week's volume, then pull a dip behind it as shoppers who pantry-loaded stay home. The forecast needs to know which past weeks were promoted and which future weeks are planned, or it will treat promo spikes as the new normal.
  • Distribution changes. Gaining or losing stores changes the baseline in a way that has nothing to do with per-store demand. Add 400 doors and total volume jumps even if velocity is flat. A forecast built on raw totals will misread a distribution event as a demand event, which is one of the more common ways a CPG forecast goes sideways.

Baseline volume vs. promoted volume

The single most useful split in cpg demand forecasting is separating baseline from promoted volume. Baseline is what would have sold at the everyday shelf price with no promotion running. Promoted, or incremental, volume is the lift on top of that, driven by a temporary price reduction, a feature in the circular, or an end-cap display.

These two behave differently, so they should be forecast differently. Baseline is relatively stable and responds to slow-moving things: distribution, brand momentum, price position versus the category. Promoted volume is event-driven and spiky, and depends on the mechanics of each planned promotion. If you forecast the blended total as one series, every promoted week looks like a random outlier and the model either over-smooths the spikes or chases them. Split the two, forecast the baseline cleanly, then layer planned promotional lift on top using the promo calendar. The result is a forecast you can explain to a planner.

Decomposing baseline and incremental volume is also the foundation of any honest promotion read: a promotion that sold a lot of units but mostly pulled forward baseline demand is not the win the gross number suggests.

Forecasting methods, from simple to complex

There is a tendency to reach for the most sophisticated method available. Resist it. The right method is the simplest one that captures the structure in your data, and for a lot of SKUs that is not machine learning.

MethodWhat it doesWhen to use it
Moving averageAverages recent periods to project the next oneStable, low-volume SKUs with no strong seasonality or promo activity
Exponential smoothingWeights recent periods more heavily, with optional trend and seasonal termsItems with a clear trend or repeating seasonal pattern and modest promo effects
Statistical (ARIMA, regression)Models trend, seasonality, and the effect of known drivers like price or promoSKUs with enough history and identifiable drivers you can quantify
Machine learningLearns nonlinear patterns across many SKUs and features at onceLarge assortments where cross-SKU patterns and many interacting drivers matter and you have the data volume to train on

A few honest caveats. A moving average is not a punchline; for a steady, mature item it is often within a point or two of anything fancier, and it never surprises you. Exponential smoothing is the workhorse most planning systems run by default. Statistical models earn their keep when you can attribute swings to named causes, which is what the baseline-versus-promoted split sets up. Machine learning pays off on big, varied assortments where it can borrow signal across similar SKUs, but it needs clean, deep data and it is harder to explain when a planner asks why the number moved. More method does not mean more accuracy.

Measuring forecast accuracy: MAPE and bias

You cannot improve a forecast you do not measure, and you need two different measurements because they answer two different questions. The first is how far off you are. The second is whether you are consistently too high or too low.

MAPE, mean absolute percentage error, answers the first. For each period you take the absolute gap between forecast and actual, express it as a percent of actual, and average across periods. A MAPE of 20% means the forecast misses by about a fifth on a typical period, regardless of direction. It is the most quoted number in demand planning because it is easy to explain and compares across SKUs of different sizes. Its weakness: it blows up on low-volume items, where a small unit miss is a huge percentage, so it flatters high-volume SKUs and punishes the long tail.

Bias answers the second question and is the one teams skip, to their cost. Bias is the average signed error: are the misses canceling out, or do they pile up on one side? A forecast can have a respectable MAPE and still be badly biased if it is always a little high. Persistent positive bias quietly builds excess inventory; persistent negative bias quietly builds stockouts. Track both. A forecast that is unbiased and reasonably accurate beats one that is slightly more accurate on average but always leans the same way, because biased error compounds in the warehouse.

Set a realistic bar. Accuracy varies by category, horizon, and how promoted the item is. A stable staple over a short horizon can run a low MAPE; a heavily promoted item over a 13-week horizon will run much higher, and that is not a failure, it is the nature of the demand. What you are watching is the trend in your own numbers over time, not a benchmark someone quoted at a conference.

How the forecast connects to S&OP and inventory

A demand forecast is not a deliverable on its own. It is the input that drives two downstream processes, and its whole value is measured by how well those run.

The first is sales and operations planning. S&OP is the monthly cycle where commercial, supply, and finance reconcile one demand number, then plan production, capacity, and cash against it. The demand forecast is the starting point of that conversation. If it is biased or volatile, every downstream plan inherits the problem, and the meeting turns into an argument about whose number is right instead of what to do about it. The mechanics of that cycle are covered in sales and operations planning.

The second is inventory. Retail inventory forecasting translates the demand forecast, plus lead times and a service-level target, into how much stock to hold and when to reorder. Safety stock exists to absorb forecast error, so a more accurate, less biased forecast lets you hold less inventory at the same service level. That is the direct line from forecast quality to working capital, and it is why supply teams care about your MAPE as much as you do. For how these signals fit together, see supply chain analytics.

Common failure modes in retail demand forecasting

Most bad forecasts are not bad math. They are a handful of structural traps that show up again and again.

  • New items. A SKU with no sales history has nothing to extrapolate from, so a naive model forecasts zero or guesses wildly. New items need a different approach: analogs from similar products, planned distribution and velocity assumptions, and a willingness to revise weekly as real data arrives. Treating a launch like an established item is a guaranteed miss.
  • Promo distortion. If past promotions are not flagged in the history, the model bakes promoted weeks into the baseline and over-forecasts the quiet weeks that follow. Worse, it can miss the post-promotion dip entirely. Clean promo flags on history are the fix, and they are tedious to maintain, which is why this trap is so common.
  • Hierarchy mismatches. Demand can be forecast at the item level, the brand level, by retailer, by region, or for the total. These do not automatically agree. A bottom-up sum of SKU forecasts often will not match a top-down brand forecast, and if the planning system reconciles them carelessly, accuracy at one level is bought by distorting another. Decide which level is the source of truth and reconcile deliberately.
  • Dirty or misaligned inputs. POS and shipment data measure different things, syndicated extracts lag, and retailer calendars do not all line up. A forecast built on inputs that quietly disagree will be confidently wrong. Reconciling those sources before they hit the model is unglamorous and it is where a lot of accuracy actually lives.

That last point is where tooling helps. Scout reconciles POS and shipment data into one consistent view, so the series feeding a forecast reflect demand rather than a tangle of mismatched ordering and scanning artifacts. A cleaner input does not replace a planner's judgment, but it removes a class of errors before they reach the model.

If you are still mapping the data sources behind all this, the primers on syndicated data and SPINS data cover where retail POS and panel numbers come from, and the CPG glossary defines the terms.

Frequently asked questions

What is a good forecast accuracy for CPG?
There is no single number, because accuracy depends on the category, the horizon, and how promoted the item is. A stable staple over a short horizon can run a low MAPE, while a heavily promoted item over a 13-week horizon will run much higher, and that is expected rather than a failure. The useful question is whether your own MAPE and bias are improving over time, not whether you hit a benchmark from a conference slide.
What is the difference between baseline and promoted volume?
Baseline is what would sell at the everyday shelf price with no promotion running. Promoted, or incremental, volume is the lift on top of that from a temporary price cut, a feature, or a display. Baseline is relatively stable and moves with distribution and price position; promoted volume is spiky and event-driven. Forecasting them separately, then layering planned promotional lift on a clean baseline, produces a forecast you can actually explain.
Should I forecast from POS or shipment data?
POS, where you can get it. Point-of-sale data reflects actual shopper demand, while shipment data reflects retailer ordering and moves in case-pack lumps, so it is a noisier proxy. Many brands reconcile both because shipments are easier to obtain and POS is the cleaner demand signal. Aligning the two before they reach the model removes a common source of forecast error.

See this on your own data

Scout gives CPG sales teams the analytics infrastructure they need — without spreadsheets.

Get a 15-min demo