backtesting-py-oracle

安装量: 56
排名: #13247

安装

npx skills add https://github.com/terrylica/cc-skills --skill backtesting-py-oracle
backtesting.py Oracle Validation for Range Bar Patterns
Configuration and anti-patterns for using backtesting.py to validate ClickHouse SQL sweep results. Ensures bit-atomic replicability between SQL and Python trade evaluation.
Companion skills
:
clickhouse-antipatterns
(SQL correctness, AP-16) |
sweep-methodology
(sweep design) |
rangebar-eval-metrics
(evaluation metrics)
Validated
Gen600 oracle verification (2026-02-12) — 3 assets, 5 gates, ALL PASS. Critical Configuration (NEVER omit) from backtesting import Backtest bt = Backtest ( df , Strategy , cash = 100_000 , commission = 0 , hedging = True ,

REQUIRED: Multiple concurrent positions

exclusive_orders

False ,

REQUIRED: Don't auto-close on new signal

)
Why
SQL evaluates each signal independently (overlapping trades allowed). Without
hedging=True
, backtesting.py skips signals while a position is open, producing fewer trades than SQL. This was discovered when SOLUSDT produced 105 Python trades vs 121 SQL trades — 16 signals were silently skipped.
Anti-Patterns (Ordered by Severity)
BP-01: Missing Multi-Position Mode (CRITICAL)
Symptom
Python produces fewer trades than SQL. Gate 1 (signal count) fails.
Root Cause
Default
exclusive_orders=True
prevents opening new positions while one is active.
Fix
Always use
hedging=True, exclusive_orders=False
.
BP-02: ExitTime Sort Order (CRITICAL)
Symptom
Entry prices appear mismatched (Gate 3 fails) even though both SQL and Python use the same price source.
Root Cause
:
stats._trades
is sorted by ExitTime, not EntryTime. When overlapping trades exit in a different order than they entered, trade[i] no longer maps to signal[i].
Fix
:
trades
=
stats
.
_trades
.
sort_values
(
"EntryTime"
)
.
reset_index
(
drop
=
True
)
BP-03: NaN Poisoning in Rolling Quantile (CRITICAL)
Symptom
Cross-asset tests fail with far fewer Python trades. Feature quantile becomes NaN and propagates forward indefinitely.
Root Cause
:
np.percentile
with NaN inputs returns NaN. If even one NaN feature value enters the rolling window, all subsequent quantiles become NaN, making all subsequent filter comparisons fail.
Fix
Skip NaN values when building the signal window: def _rolling_quantile_on_signals ( feature_arr , is_signal_arr , quantile_pct , window = 1000 ) : result = np . full ( len ( feature_arr ) , np . nan ) signal_values = [ ] for i in range ( len ( feature_arr ) ) : if is_signal_arr [ i ] : if len ( signal_values )

0 : window_data = signal_values [ - window : ] result [ i ] = np . percentile ( window_data , quantile_pct * 100 )

Only append non-NaN values (matches SQL quantileExactExclusive NULL handling)

if
not
np
.
isnan
(
feature_arr
[
i
]
)
:
signal_values
.
append
(
feature_arr
[
i
]
)
return
result
BP-04: Data Range Mismatch (MODERATE)
Symptom
Different signal counts between SQL and Python for assets with early data (BNB, XRP).
Root Cause
:
load_range_bars()
defaults to
start='2020-01-01'
but SQL has no lower bound.
Fix
Always pass
start='2017-01-01'
to cover all available data.
BP-05: Margin Exhaustion with Overlapping Positions (MODERATE)
Symptom
Orders canceled with insufficient margin. Fewer trades than expected.
Root Cause
With
hedging=True
and default full-equity sizing, overlapping positions exhaust available margin.
Fix
Use fixed fractional sizing: self . buy ( size = 0.01 )

1% equity per trade

BP-06: Signal Timestamp vs Entry Timestamp (LOW)
Symptom
Gate 2 (timestamp match) fails because SQL uses signal bar timestamps while Python uses entry bar timestamps.
Root Cause
SQL outputs the signal detection bar's
timestamp_ms
. Python's
EntryTime
is the fill bar (next bar after signal). These differ by 1 bar.
Fix
Record signal bar timestamps in the strategy's next() method:

Before calling self.buy()

self
.
_signal_timestamps
.
append
(
int
(
self
.
data
.
index
[
-
1
]
.
timestamp
(
)
*
1000
)
)
5-Gate Oracle Validation Framework
Gate
Metric
Threshold
What it catches
1
Signal Count
<5% diff
Missing signals, filter misalignment
2
Timestamp Match
>95%
Timing offset, warmup differences
3
Entry Price
>95%
Price source mismatch, sort ordering
4
Exit Type
>90%
Barrier logic differences
5
Kelly Fraction
<0.02
Aggregate outcome alignment
Expected residual
1-2 exit type mismatches per asset at TIME barrier boundary (bar 50). SQL uses fwd_closes[max_bars] , backtesting.py closes at current bar price. Impact on Kelly < 0.006. Strategy Architecture: Single vs Multi-Position Mode Constructor Use Case Position Sizing Single-position hedging=False (default) Champion 1-bar hold Full equity Multi-position hedging=True, exclusive_orders=False SQL oracle validation Fixed fractional ( size=0.01 ) Multi-Position Strategy Template class Gen600Strategy ( Strategy ) : def next ( self ) : current_bar = len ( self . data ) - 1

1. Register newly filled trades and set barriers

for trade in self . trades : tid = id ( trade ) if tid not in self . _known_trades : self . _known_trades . add ( tid ) self . _trade_entry_bar [ tid ] = current_bar actual_entry = trade . entry_price if self . tp_mult

0 : trade . tp = actual_entry * ( 1.0 + self . tp_mult * self . threshold_pct ) if self . sl_mult

0 : trade . sl = actual_entry * ( 1.0 - self . sl_mult * self . threshold_pct )

2. Check time barrier for each open trade

for trade in list ( self . trades ) : tid = id ( trade ) entry_bar = self . _trade_entry_bar . get ( tid , current_bar ) if self . max_bars

0 and ( current_bar - entry_bar ) = self . max_bars : trade . close ( ) self . _trade_entry_bar . pop ( tid , None )

3. Check for new signal (no position guard — overlapping allowed)

if self . _is_signal [ current_bar ] : self . buy ( size = 0.01 ) Data Loading from data_loader import load_range_bars df = load_range_bars ( symbol = "SOLUSDT" , threshold = 1000 , start = "2017-01-01" ,

Cover all available data

end

"2025-02-05" ,

Match SQL cutoff

extra_columns

[ "volume_per_trade" , "lookback_price_range" ] ,

Gen600 features

) Project Artifacts (rangebar-patterns repo) Artifact Path Oracle comparison script scripts/gen600_oracle_compare.py Gen600 strategy (reference) backtest/backtesting_py/gen600_strategy.py SQL oracle query template sql/gen600_oracle_trades.sql Oracle validation findings findings/2026-02-12-gen600-oracle-validation.md Backtest CLAUDE.md backtest/CLAUDE.md ClickHouse AP-16 .claude/skills/clickhouse-antipatterns/SKILL.md Fork source ~/fork-tools/backtesting.py/

返回排行榜