screen-control-operator-v3

安装量: 46
排名: #16055

安装

npx skills add https://github.com/breverdbidder/life-os --skill screen-control-operator-v3

Screen Control Operator V3 Built 3 weeks before Claude Cowork announcement. Now with feature parity + our advantages. Core Advantages Over Claude Cowork Feature Claude Cowork Screen Control Operator V3 Vision Method Screenshots CDP + Accessibility Tree Speed 1-5 seconds 50-200ms (10x faster) Cost Vision tokens ($$$) Text tokens only ($) Reliability ~85% OCR 100% semantic queries Skill Recording ✅ Yes ✅ Yes Parallel Execution ✅ Yes ✅ Yes Domain Skills Generic Foreclosure-specific Smart Router N/A 90% FREE tier When to Use This Skill Trigger phrases: "control my screen" "record a workflow" "replay this skill" "verify the Lovable preview" "test the scraper" "debug DOM selectors" "inspect page structure" "run BECA lookup" "search BCPAO" "autonomous browser testing" Quick Start from screen_control_operator_v3 import ScreenControlOperatorV3 operator = ScreenControlOperatorV3 ( headless = False ) operator . launch ( )

Option 1: Record new skill

skill

operator . record_skill ( "My Workflow" , domain = "foreclosure" ) operator . save_skill ( skill , "my_workflow.json" )

Option 2: Play recorded skill

result

operator . play_skill_file ( "my_workflow.json" , variables = { "case_number" : "2025-CA-001234" } )

Option 3: Use pre-built foreclosure skills

result

operator . lookup_beca_case ( "2025-CA-001234" ) result = operator . lookup_bcpao_property ( "12-34-56-78-90" ) result = operator . search_acclaimweb_liens ( "SMITH JOHN" )

Option 4: Parallel execution

results

operator . execute_parallel ( [ { "skill" : skill1 , "variables" : { "case" : "001" } } , { "skill" : skill2 , "variables" : { "case" : "002" } } , { "skill" : skill3 , "variables" : { "case" : "003" } } , ] ) operator . close ( ) CLI Commands

Record new skill

python screen_control_operator_v3.py record \ --name "BECA Deep Search" \ --domain foreclosure \ --output skills/beca_deep.json \ --start-url "https://vmatrix1.brevardclerk.us/beca/"

Play skill with variables

python screen_control_operator_v3.py play \ --skill skills/beca_deep.json \ --vars '{"case_number": "2025-CA-001234"}'

Pre-built skills

python screen_control_operator_v3.py beca --case "2025-CA-001234" python screen_control_operator_v3.py bcpao --parcel "12-34-56-78-90"

Inspect page DOM (NOT screenshot)

python screen_control_operator_v3.py inspect \ --url "https://example.com" \ --output structure.json Pre-Built Foreclosure Skills Skill ID Name Variables Description beca_lookup_001 BECA Case Lookup case_number Search BECA, extract judgment bcpao_lookup_001 BCPAO Property parcel_id Get property details, photo acclaimweb_lien_001 Lien Search party_name Search recorded liens realforeclose_list_001 Auction List auction_date Get auction calendar Skill Recording (Cowork Feature) How Recording Works Launch browser and navigate to starting page Call record_skill(name, domain) Perform your workflow manually Press Enter or Ctrl+C to stop Skill is captured with: All navigation events All click events (with selectors) All type events (with values) Success criteria (inferred) What Gets Recorded { "skill_id" : "abc123def456" , "name" : "BECA Deep Search" , "domain" : "foreclosure" , "actions" : [ { "action_type" : "navigate" , "url" : "https://vmatrix1.brevardclerk.us/beca/" } , { "action_type" : "type" , "selector" : "#caseNumber" , "value" : "{{case_number}}" , "element_info" : { "label" : "Case Number" , "role" : "textbox" } } , { "action_type" : "click" , "selector" : "button[type='submit']" , "element_info" : { "label" : "Search" , "role" : "button" } } ] , "variables" : { "case_number" : "" } , "success_criteria" : [ { "type" : "element_visible" , "selector" : ".case-details" } ] } Parallel Execution (Cowork Feature) Execute multiple skills simultaneously in isolated browser contexts: executor = ParallelExecutor ( max_parallel = 5 ) executor . launch ( ) tasks = [ { "skill" : beca_skill , "variables" : { "case" : "2025-CA-001" } } , { "skill" : beca_skill , "variables" : { "case" : "2025-CA-002" } } , { "skill" : beca_skill , "variables" : { "case" : "2025-CA-003" } } , { "skill" : beca_skill , "variables" : { "case" : "2025-CA-004" } } , { "skill" : beca_skill , "variables" : { "case" : "2025-CA-005" } } , ] results = executor . execute_parallel ( tasks )

All 5 run simultaneously in separate contexts

DOM Inspection (Our Advantage) Get page structure via accessibility tree (NOT screenshots): structure = operator . get_page_structure ( )

Returns:

{ "url" : "https://example.com" , "title" : "Page Title" , "elements" : [ { "tag" : "BUTTON" , "role" : "button" , "label" : "Submit" , "testid" : "submit-btn" } , { "tag" : "INPUT" , "role" : "textbox" , "label" : "Search" , "id" : "search-input" } , . . . ] } Find element by natural language: selector = operator . find_element_semantic ( "search button" )

Returns: "[data-testid='search-btn']" or "#search-button" or similar

Smart Router Integration Operation Model Tier Cost Page navigation FREE (Gemini) $0 Element finding FREE (Gemini) $0 Skill recording FREE (Gemini) $0 Skill playback FREE (Gemini) $0 Error recovery BALANCED (Sonnet) $ Result: 95%+ operations in FREE tier GitHub Actions Integration

.github/workflows/browser_agent.yml

name : Browser Agent Daily on : schedule : - cron : '0 4 * * *'

11 PM EST

jobs : scrape : runs-on : ubuntu - latest steps : - uses : actions/checkout@v4 - name : Set up Python uses : actions/setup - python@v4 with : python-version : '3.11' - name : Install dependencies run : | pip install playwright playwright install chromium - name : Run Browser Agent run : | python src/agents/screen_control_operator_v3.py beca \ --case "${{ github.event.inputs.case_number }}" Error Recovery When element not found, V3 tries multiple strategies: data-testid (most reliable) aria-label (semantic) role + text (accessibility) ID (traditional) Original selector (fallback) Text content (last resort) This is why we're 100% reliable vs Cowork's ~85%. Dependencies pip install playwright --break-system-packages playwright install chromium File Locations Script: src/agents/screen_control_operator_v3.py Skills: skills/*.json Skill Lib: .claude/skills/screen-control-operator-v3/SKILL.md Built by Claude AI Architect + Ariel Shapira December 25, 2025 (original) → January 15, 2026 (V3) We are the team of Claude innovators!

返回排行榜