How long should you backtest? A practical answer
A year is the popular default for backtest length. A year is often not enough. The real question isn't time — it's how many trades you generate and how many market regimes you cover. Here's how to think about it without rules of thumb that betray you.
Time is the wrong unit
Backtest “length” is almost always discussed in months or years. That's the wrong frame. A strategy that makes one trade per week over five years (260 trades) is better-validated than one that makes one trade per month over twenty years (240 trades). What matters is the trade count and the variety of market regimes encountered, not the calendar duration.
Minimum trade counts
For statistical significance, most systematic strategies need 100–200 trades minimum. Below that, the signal-to-noise ratio is too low to distinguish a real edge from luck. With 30 trades, even a 65% win rate could plausibly be from chance. With 200 trades, that same win rate is meaningful evidence of an edge. See risk of ruin and statistical significance for the math behind these thresholds.
Regime coverage matters more than calendar time
A backtest only validates a strategy against the regimes contained in the sample. If your year of data was all uptrend, you don't know whether the strategy survives a downtrend, a chop, or a high-volatility crash. Different markets cycle through regimes on different timescales — index futures might see all four major regime types in 2–3 years; commodity markets often need 5+ years; FX pairs vary widely.
| Regime type | Characteristics | Typical time to encounter |
|---|---|---|
| Uptrend | Sustained directional move | Months |
| Downtrend | Sustained reverse move | Months |
| Range | Bound between defined levels | Months |
| High volatility | Wide bars, frequent reversals | Weeks to months |
| Quiet | Tight ranges, low volume | Months to years |
| Crisis | Sudden gap, liquidity stress | Years apart |
Length by trade frequency
| Strategy frequency | Recommended minimum backtest | Why |
|---|---|---|
| Scalping (10+ trades/day) | 3–6 months | Generates 10,000+ trades; sample size not the constraint |
| Intraday (1–5 trades/day) | 1–2 years | 500–1,000 trades; covers 2–3 regime cycles |
| Swing (1–3 trades/week) | 3–5 years | 200–500 trades; multiple full regime cycles |
| Position (1–3 trades/month) | 7–10 years | 100–200 trades; full bull/bear cycles |
Special considerations for prop firm strategies
Prop firm evaluations typically last 30–60 days. A 1-year backtest that produces 500 trades gives roughly 40–80 trades per evaluation window — enough to see whether the strategy will reach the target within the time limit. A 1-year backtest that produces 50 trades gives 4–8 trades per evaluation, way too few to validate the strategy will reliably hit the target on time. For low-frequency prop strategies, you need 3–5+ years of backtest data. See time-to-first-payout for how this plays out in practice.
When you can't get enough data
If your strategy is so new or so low-frequency that you can't reach 100+ trades on available data, you have two honest options:
- Trade it forward in paper / sim until enough trades accumulate to validate.
- Accept that what you have is a hypothesis, not a tested strategy, and size accordingly.
Pretending a 30-trade backtest validates a strategy is how people blow up accounts. The 12-month sample window used in the Puravida Edge methodology generates 500–2,000+ trades per portfolio depending on strategy frequency — well above the statistical significance threshold across multiple regime cycles.
FAQ
Is one year of data enough to backtest a trading strategy?
It depends on trade count. A high-frequency strategy generating 1,000+ trades in one year is well-validated; a low-frequency strategy generating 30 trades in one year is not. The minimum useful sample is roughly 100–200 trades across multiple regime types.
How many trades do I need to validate a trading strategy?
Most systematic strategies need 100–200 trades minimum for statistical significance. Below that, you can't reliably distinguish a real edge from random luck. For high-confidence validation, 500+ trades across multiple market regimes is the practical standard.
Can I backtest on demo or simulation data?
Yes — demo / sim is a legitimate way to accumulate trade history when historical data isn't enough. The key difference vs live: execution costs and slippage are usually unrealistically favorable in sim. Apply realistic commissions and slippage assumptions when interpreting sim results.
Not financial advice. Performance figures referenced are hypothetical, modeled outputs (1,500-path Monte Carlo on a 12-month sample). Past performance does not guarantee future results. Tool names are referenced for education; verify current features and prop-firm rules directly.