AI Demand Forecasting for FMCG in India — The Pawanshree Story

May 20, 2026·8 min read
Praveen Rai

CEO, Sort String Solutions LLP

The conversation that turned five years of SalesPort data into a productised AI module happened on a Wednesday afternoon in March 2026. We had run a demand-forecasting prototype against six months of Pawanshree Dairy's distribution data — SKU-level dispatch volumes across their 140 collection routes, trained on weather, day-of-week, festival calendar, and primary-sales-velocity features. The model beat their previous manual forecast by 22% on MAPE for fast-moving SKUs.

The MD looked at the dashboard, paused for about 10 seconds, and said: "We will love to pay extra for this."

That single sentence converted "we have data" from an internal capability into a productised module set. This post walks through what that means in practice — what the data actually looks like, why classical ML beats LLM-everything for this problem, and how it ships to clients.

## The data already exists — that's the wedge

The deepest moat in any AI-for-FMCG product is not the model architecture. It's the data depth. Most standalone AI vendors targeting Indian FMCG (Aforza, Wiz.ai, RELEX, o9 Solutions) spend the first 6-12 months of an engagement just getting clean operational data flowing — fixing schema mismatches, deduplicating orders, reconciling primary vs secondary, untangling scheme attribution.

For SalesPort clients, that data is already flowing. Every order, every dispatch, every scheme application, every GPS ping from a salesperson's phone, every farmer milk-collection event — already structured, already validated, already in a database we control.

Across our 45 deployments:

  • 49 Lakh+ orders with 1.96 Crore order line items
  • 11.44 Lakh dispatches with full vehicle + GPS + delivery traces
  • 17.43 Lakh schemes auto-applied with slab tier + claim window + price impact
  • ₹2,677 Crore of payment flows across distributor and retailer wallets
  • 21.64 Crore GPS data points from 2.3 Lakh daily active mobile users
  • ₹803 Crore of milk procurement across 83,785 farmer accounts

That depth — and the fact that it's per-client isolated, audit-trailed, and structurally clean — is what makes the AI modules ship in weeks, not quarters.

## Why XGBoost, not GPT-4

A common question we get from technical buyers: are you using LLMs for this?

For demand forecasting, no. The problem shape is structured numerical prediction — given SKU + retailer + day-of-week + weather + holiday-calendar + promotional-status, predict next-week dispatch volume. Classical gradient-boosted regression (XGBoost) handles this better than any LLM-based approach. The features are numeric, the target is numeric, the relationships are mostly local and non-linear, and the data is tabular.
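To make "gradient-boosted regression on tabular features" concrete, here is a toy sketch in plain Python. It is not XGBoost and not the SalesPort pipeline: the feature columns, the synthetic demand rule, and every name in it are assumptions made purely for illustration. It shows only the core idea that XGBoost builds on, which is that each small tree is fitted to the residual errors left by the trees before it.

```python
from dataclasses import dataclass
from itertools import product


@dataclass
class Stump:
    """One-split regression tree: the weak learner being boosted."""
    feature: int
    threshold: float
    left: float    # value used when row[feature] <= threshold
    right: float   # value used otherwise

    def predict(self, row):
        return self.left if row[self.feature] <= self.threshold else self.right


def fit_stump(X, residuals):
    """Return the single split that minimises squared error on the residuals."""
    best, best_err = None, float("inf")
    for f in range(len(X[0])):
        # Drop the largest value so the right-hand side is never empty.
        for t in sorted({row[f] for row in X})[:-1]:
            left = [r for row, r in zip(X, residuals) if row[f] <= t]
            right = [r for row, r in zip(X, residuals) if row[f] > t]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if err < best_err:
                best, best_err = Stump(f, t, lm, rm), err
    return best


def fit_boosted(X, y, rounds=100, learning_rate=0.3):
    """Gradient boosting for squared loss: each stump fits the current residuals."""
    base = sum(y) / len(y)
    preds = [base] * len(y)
    stumps = []
    for _ in range(rounds):
        residuals = [yi - pi for yi, pi in zip(y, preds)]
        stump = fit_stump(X, residuals)
        if stump is None:  # no feature has more than one distinct value
            break
        stumps.append(stump)
        preds = [p + learning_rate * stump.predict(row) for p, row in zip(preds, X)]
    return base, stumps, learning_rate


def predict(model, row):
    base, stumps, learning_rate = model
    return base + learning_rate * sum(s.predict(row) for s in stumps)


# Synthetic rows: (day_of_week, max_temp_c, is_festival, on_promo). All values invented.
X = [list(r) for r in product(range(7), (28, 38), (0, 1), (0, 1))]
# Invented demand rule with threshold effects, standing in for real dispatch volumes.
y = [100 + 40 * fest + 25 * promo + 15 * (temp > 35) + 10 * (dow == 6)
     for dow, temp, fest, promo in X]

model = fit_boosted(X, y)
# Difference between a festival day and an otherwise identical ordinary day.
festival_lift = predict(model, [2, 28, 1, 0]) - predict(model, [2, 28, 0, 0])
```

On this synthetic data the learned `festival_lift` should land close to the 40 units built into the invented rule, with no feature scaling or hand-written interaction terms. A production library such as XGBoost adds regularisation, deeper trees, and much faster training on top of the same residual-fitting loop.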

For the WhatsApp order bot — where retailers text natural-language orders to the brand's WhatsApp Business number — yes, we use LLMs (specifically Anthropic Claude via API). The problem there is natural language understanding, which is genuinely what LLMs are good at.

The principle: use LLMs where they add value, classical ML where it does better. Most AI-for-FMCG buzz mixes these up. We don't.

## What the Pawanshree forecast actually looks like

The Pawanshree deployment runs daily SKU-level forecasts at 30/60/90-day horizons across 140 collection routes. Each route has 6-12 active SKUs. The forecast updates every night at 2 AM after the previous day's actuals close.

The dashboard shows three lines per SKU per route:

1. Yesterday's actual: what actually dispatched
2. Forecast for today: what the model predicts
3. Forecast variance: the gap between forecast and actual for the most recent 7 days
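The forecast-variance line can be sketched in a few lines of Python. The post does not say how the 7-day gap is aggregated, so the signed daily gap and the mean absolute gap used here are assumptions, as are the function name and the sample volumes.

```python
from statistics import mean


def forecast_variance(forecast, actual, window=7):
    """Signed daily gaps (forecast minus actual) and their mean absolute size
    over the most recent `window` days."""
    if len(forecast) != len(actual):
        raise ValueError("forecast and actual must cover the same days")
    gaps = [f - a for f, a in zip(forecast[-window:], actual[-window:])]
    return gaps, mean(abs(g) for g in gaps)


# Hypothetical daily volumes for one SKU on one route, oldest day first.
forecast = [520, 540, 500, 610, 580, 560, 530, 545]
actual = [500, 555, 510, 590, 600, 560, 520, 550]

gaps, mean_abs_gap = forecast_variance(forecast, actual)
```

Keeping the sign on the daily gaps lets an ops team see whether the model is consistently over- or under-forecasting a route, while the mean absolute gap gives one number to watch.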

The operations team uses the forecast for two decisions: (a) procurement planning at the plant level (how much raw milk to receive), and (b) route-level dispatch sizing (so cold trucks don't run half-empty).

Initial pilot improvement was 22% MAPE reduction on fast-moving SKUs and 14% on slow-movers. After six months of production use the model has retrained on the new data and the lift has grown modestly — fast-moving SKUs now at 28% MAPE reduction, slow-movers at 18%.
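For readers unfamiliar with the metric, here is a minimal sketch of how MAPE and a "MAPE reduction" can be computed. The post does not state whether its percentages are relative reductions or percentage-point differences; this sketch assumes relative reduction against the manual baseline. The numbers are invented and are not Pawanshree data.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent. Days with zero actuals are skipped."""
    pairs = [(a, f) for a, f in zip(actual, forecast) if a != 0]
    if not pairs:
        raise ValueError("MAPE is undefined when every actual is zero")
    return 100 * sum(abs(a - f) / abs(a) for a, f in pairs) / len(pairs)


def relative_reduction(baseline_mape, model_mape):
    """Percentage by which the model's MAPE is lower than the baseline's."""
    return 100 * (baseline_mape - model_mape) / baseline_mape


# Invented volumes for one SKU over four days.
actual = [100, 200, 400, 500]
manual_forecast = [120, 170, 440, 450]
model_forecast = [110, 190, 420, 480]

baseline = mape(actual, manual_forecast)   # (20% + 15% + 10% + 10%) / 4
improved = mape(actual, model_forecast)    # (10% + 5% + 5% + 4%) / 4
reduction = relative_reduction(baseline, improved)
```

With these invented numbers the baseline MAPE is 13.75% and the model MAPE is 6.0%, a relative reduction of roughly 56%. Expressed in percentage points the same improvement is 7.75, which is why it matters which convention a reported figure uses.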

## The productisation — what every SalesPort client gets

The Pawanshree custom build became the template for what every SalesPort client gets as a +₹25K/month add-on. The deployment is configuration-driven, not a new engineering project per client. The model retrains automatically on each client's data; no client's data leaks into another's model.

Three deployment phases:

1. Data validation (week 1): confirm 12+ months of clean data, check for SKU master alignment, validate scheme attribution
2. Initial training (week 2): fit XGBoost models per major SKU cluster, validate against held-out test sets, surface variance warnings to the client's ops team
3. Production rollout (weeks 3-4): wire the forecasts into the Live Sales Analytics Dashboard, train the client's ops team on interpreting forecast variance
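As an illustration of what the week-1 checks might look like, here is a small sketch covering two of them: history depth and SKU master alignment. The row shape, function names, and SKU codes are hypothetical, and scheme-attribution validation is left out because the post gives no detail on it.

```python
from datetime import date


def months_of_history(order_dates):
    """Calendar-month span between the first and last order (a coarse measure)."""
    first, last = min(order_dates), max(order_dates)
    return (last.year - first.year) * 12 + (last.month - first.month)


def validate_client_data(order_rows, sku_master, min_months=12):
    """order_rows: list of (order_date, sku_code). Returns a list of issue strings."""
    if not order_rows:
        return ["no order history"]
    issues = []
    months = months_of_history([d for d, _ in order_rows])
    if months < min_months:
        issues.append(f"only {months} months of history (need {min_months}+)")
    # Every SKU seen in orders should exist in the client's SKU master.
    unknown = sorted({sku for _, sku in order_rows} - set(sku_master))
    if unknown:
        issues.append(f"SKUs missing from master: {', '.join(unknown)}")
    return issues


# Hypothetical sample: 13 months of history, one SKU absent from the master.
rows = [
    (date(2025, 1, 10), "MILK-500ML"),
    (date(2025, 7, 22), "CURD-200G"),
    (date(2026, 2, 3), "GHEE-1L"),
]
sku_master = {"MILK-500ML", "CURD-200G"}

issues = validate_client_data(rows, sku_master)
```

Returning a list of issues rather than raising on the first failure means the client's team sees every gap in one pass.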

For clients with fewer than 12 months of SalesPort data, the forecasting module can still light up at lower confidence, typically with a MAPE reduction of 10-15% on fast-movers. The 22%+ numbers from Pawanshree come from a deeper data history.

## What ships next

Demand forecasting is the first of seven AI modules. The roadmap:

  • Q2 2026 (now): Demand Forecasting + Live Sales Analytics Dashboard
  • Q3 2026: Route Optimisation + Trade Promotion ROI Engine + WhatsApp Order Bot
  • Q4 2026: Image Recognition (Perfect Store) for shelf audits
  • Q1 2027: Distributor Credit Scoring

The pattern is the same across all of them — productise data the client already owns, ship as an add-on to existing SalesPort AMC, light up in weeks instead of quarters.

The wedge that the Pawanshree MD opened with "we will love to pay extra for this" is the same wedge for every existing SalesPort client. Data they're already generating, productised into AI modules they actually use.

## Frequently Asked Questions

How is SalesPort's AI demand forecasting different from standalone vendors like RELEX or Aforza?

Two differences. (1) The data is already there: SalesPort clients have been generating structured distribution data for 12-60+ months, so there's no 6-month data-engineering project to get clean inputs. (2) Pricing: RELEX/Aforza enterprise deployments run ₹3-15 Lakh/month, while SalesPort's demand-forecasting add-on is +₹25K/month for existing clients. The trade-off is model breadth: enterprise vendors aggregate across thousands of clients, whereas we model per-client. For mid-market Indian FMCG, per-client models do better in our experience.

What kind of MAPE improvement should I expect from AI demand forecasting?

Depends on your data depth and current baseline. Across deployments with 12+ months of SalesPort data, we typically see 15-25% MAPE reduction on fast-moving SKUs (8+ sales events per week) and 8-15% on slow-movers (1-7 events per week). Pawanshree's deeper data history (60+ months) gives them 28% on fast-movers, 18% on slow-movers. Clients with less than 12 months of data should expect 10-15% lift initially, with the model improving as more data accumulates.
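The thresholds and ranges in this answer can be written down as a small lookup, shown here as a sketch. The function names are hypothetical, the "inactive" bucket is an assumption for SKUs below one sales event per week, and the under-12-months range is applied to fast-movers only because the post gives no separate figure for slow-movers.

```python
def classify_sku(sales_events_per_week):
    """Bucket a SKU using the thresholds quoted in the answer above."""
    if sales_events_per_week >= 8:
        return "fast-mover"
    if sales_events_per_week >= 1:
        return "slow-mover"
    return "inactive"  # assumption: the post defines no bucket below 1 event per week


def expected_mape_reduction(months_of_data, mover_class):
    """Return the quoted (low, high) percent range, or None where no figure is given."""
    if mover_class == "fast-mover":
        return (15, 25) if months_of_data >= 12 else (10, 15)
    if mover_class == "slow-mover" and months_of_data >= 12:
        return (8, 15)
    return None


# A SKU selling 10 times a week at a client with 24 months of data.
expected = expected_mape_reduction(24, classify_sku(10))
```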

Does the AI forecasting use LLMs like ChatGPT or Claude?

Not for demand forecasting itself. We use XGBoost (gradient-boosted regression) because the problem shape — structured numerical prediction from tabular features — is what classical ML handles best. For the WhatsApp order bot (where natural-language retailer messages need to be parsed into structured orders), yes, we use Anthropic Claude via API. We pick the right tool per module rather than putting LLMs on everything.

Can the model run on-premise for compliance-heavy deployments?

Yes. For on-premise SalesPort deployments (NDDB-affiliated dairies, large cooperatives, enterprises with strict data-residency requirements), the AI modules deploy on the same infrastructure as the core platform. Model training runs once per month per client; inference is daily. Per-client data isolation is the default — no cross-client model mixing, on-premise or cloud.

How long until the demand-forecasting module is in production?

4 weeks for existing SalesPort clients with 12+ months of data. Week 1 is data validation and SKU master alignment, week 2 is initial model training and held-out validation, weeks 3-4 are production rollout and ops-team training on interpreting the dashboard. The module is available now (Q2 2026); see the [AI module suite](/ai) page for the full roadmap.
