

Velocity has been the default agile forecasting tool for two decades, and in my experience it is also one of the most-misused metrics in the field. By 2026, what I’ve found is that AI-augmented forecasting offers a more honest alternative - probabilistic forecasts that show the distribution of outcomes rather than a single number that creates false confidence.
In this guide I explain the modern AI velocity forecasting approach I use, when I reach for it, the math behind it (without overcomplicating), and how I communicate forecasts to stakeholders without losing the room. By the end you’ll know whether and how to upgrade your team’s forecasting practice.
Classic velocity is the average story points completed per sprint over recent sprints. The lie:
Teams that stake commitments on classic velocity routinely miss commitments by 20-40% and lose stakeholder trust.
| Approach | What it measures | When to use |
| Monte Carlo | Probabilistic completion dates | Forecasting when N stories will complete |
| Throughput | Stories per sprint, distribution | Capacity planning |
| Cycle time | Days per story, distribution | Identifying flow bottlenecks |
Strong teams use all three for different questions.
Monte Carlo simulation runs thousands of synthetic future sprints based on historical throughput. Output: probability of completing N stories by date X.
How it works:
Stakeholders see the distribution, not a fake-precise single date.
The math is simple enough that anyone can run it in a spreadsheet. The benefit is enormous: honest forecasting beats fake precision every quarter.
Simpler than Monte Carlo, often sufficient. Track:
Use percentiles: “We complete at least 12 stories per sprint 80% of the time. We complete 16+ only 30% of the time.”
This framing is more honest than “our velocity is 14 points”.
Cycle time = days from starting a story to completing it. AI helps by:
A useful prompt:
“From this cycle time data, identify: median, p85, p95. Surface the top 5 outlier stories and likely causes. Propose 3 process changes that would tighten the distribution.”
Each approach answers a different question:
Strong teams report all three quarterly. Each surfaces different signals about team health and predictability.
| Tool | Strength |
| ActionableAgile | Strong Monte Carlo, mature |
| Twin (formerly Agile Forecast) | Lightweight, web-based |
| Native Jira AI | Increasingly offers forecasting |
| Linear AI | Built-in throughput views |
| Custom dashboards (Tableau, Hex, Looker) with AI summarisation | Most flexible |
For most teams, the native PM tool’s forecasting is enough to start. Specialised tools earn their cost when forecasting is mission-critical.
The hardest part of probabilistic forecasting is communication. Patterns that work:
Stakeholders who learn to read these forecasts become better decision-makers.
Most stakeholders haven’t seen probabilistic forecasting. Investment in education pays off:
After 2-3 quarters, most stakeholders prefer probabilistic forecasts because they enable better business planning.
For multi-team programs:
PI planning in SAFe especially benefits from program-level Monte Carlo.
Week 1: pull last 20 sprints of data. Compute throughput distribution and cycle time distribution.
Week 2: introduce Monte Carlo for one upcoming forecast. Compare to classic velocity prediction.
Week 3: run weekly forecast updates. Communicate distribution to stakeholders.
Week 4: evaluate accuracy. Refine the model. Decide on permanent adoption.
By week 4 most teams have abandoned classic velocity for forecasting and use it only as a retrospective metric.
In regulated industries (finance, healthcare, government), forecasting carries audit weight:
Probabilistic forecasting is auditable; ad-hoc velocity guesses often aren’t.
Paul Lister, an Agilist and a Certified Scrum Trainer (CST) with 20+ years of experience, coaches Scrum courses, co-founded the Surrey & Sussex Agile meetup. He also writes short stories, novels, and have directed and produced short films.
QUICK FACTS
For relative sizing within a team, yes. For forecasting, story counts plus cycle time are better. Story points are useful inputs, not outputs.