Worker Benchmarks Skill Run comprehensive performance benchmarks for the agentic-flow worker system. Quick Start
Run full benchmark suite
npx agentic-flow workers benchmark
Run specific benchmark
- npx agentic-flow workers benchmark
- --type
- trigger-detection
- npx agentic-flow workers benchmark
- --type
- registry
- npx agentic-flow workers benchmark
- --type
- agent-selection
- npx agentic-flow workers benchmark
- --type
- concurrent
- Benchmark Types
- 1. Trigger Detection (
- trigger-detection
- )
- Tests keyword detection speed across 12 worker triggers.
- Target
-
- p95 < 5ms
- Iterations
-
- 1000
- Metrics
-
- latency, throughput, histogram
- 2. Worker Registry (
- registry
- )
- Tests CRUD operations on worker entries.
- Target
-
- p95 < 10ms
- Iterations
-
- 500 creates, gets, updates
- Metrics
-
- per-operation latency breakdown
- 3. Agent Selection (
- agent-selection
- )
- Tests performance-based agent selection.
- Target
-
- p95 < 1ms
- Iterations
-
- 1000
- Metrics
-
- selection confidence, agent scores
- 4. Model Cache (
- cache
- )
- Tests model caching performance.
- Target
-
- p95 < 0.5ms
- Metrics
-
- hit rate, cache size, eviction stats
- 5. Concurrent Workers (
- concurrent
- )
- Tests parallel worker creation and updates.
- Target
-
- < 1000ms for 10 workers
- Metrics
-
- per-worker latency, memory usage
- 6. Memory Key Generation (
- memory-keys
- )
- Tests memory pattern key generation.
- Target
-
- p95 < 0.1ms
- Iterations
-
- 5000
- Metrics
-
- unique patterns, throughput
- Output Format
- ═══════════════════════════════════════════════════════════
- 📈 BENCHMARK RESULTS
- ═══════════════════════════════════════════════════════════
- ✅ Trigger Detection
- Operation: detect
- Count: 1,000
- Avg: 0.045ms | p95: 0.120ms (target: 5ms)
- Throughput: 22,222 ops$s
- Memory Δ: 0.12MB
- ✅ Worker Registry
- Operation: crud
- Count: 1,500
- Avg: 1.234ms | p95: 3.456ms (target: 10ms)
- Throughput: 810 ops$s
- Memory Δ: 2.34MB
- ───────────────────────────────────────────────────────────
- 📊 SUMMARY
- ───────────────────────────────────────────────────────────
- Total Tests: 6
- Passed: 6 | Failed: 0
- Avg Latency: 0.567ms
- Total Duration: 2345ms
- Peak Memory: 8.90MB
- ═══════════════════════════════════════════════════════════
- Integration with Settings
- Benchmark thresholds are configured in
- .claude$settings.json
- :
- {
- "performance"
- :
- {
- "benchmarkThresholds"
- :
- {
- "triggerDetection"
- :
- {
- "p95Ms"
- :
- 5
- }
- ,
- "workerRegistry"
- :
- {
- "p95Ms"
- :
- 10
- }
- ,
- "agentSelection"
- :
- {
- "p95Ms"
- :
- 1
- }
- ,
- "memoryKeyGeneration"
- :
- {
- "p95Ms"
- :
- 0.1
- }
- ,
- "concurrentWorkers"
- :
- {
- "totalMs"
- :
- 1000
- }
- }
- }
- }
- Programmatic Usage
- import
- {
- workerBenchmarks
- ,
- runBenchmarks
- }
- from
- 'agentic-flow$workers$worker-benchmarks'
- ;
- // Run full suite
- const
- suite
- =
- await
- runBenchmarks
- (
- )
- ;
- console
- .
- log
- (
- suite
- .
- summary
- )
- ;
- // Run individual benchmarks
- const
- triggerResult
- =
- await
- workerBenchmarks
- .
- benchmarkTriggerDetection
- (
- 1000
- )
- ;
- const
- registryResult
- =
- await
- workerBenchmarks
- .
- benchmarkRegistryOperations
- (
- 500
- )
- ;
- Performance Optimization Tips
- Model Cache
-
- Enable with
- CLAUDE_FLOW_MODEL_CACHE_MB=512
- Parallel Workers
-
- Enable with
- CLAUDE_FLOW_WORKER_PARALLEL=true
- Warning Suppression
-
- Enable with
- CLAUDE_FLOW_SUPPRESS_WARNINGS=true
- SQLite WAL Mode
- Automatic for better concurrent performance