Agent Skills Leaderboard · Keyword + Semantic Search

24,744
Total Skills
97.4M
Total installs
2,592
Contributors
# Skill Repository Description Installs
10201 arm-cortex-expert sickn33/antigravity-awesome-skills
@arm-cortex-expert Use this skill when Working on @arm-cortex-expert tasks or workflows Needing guidance, best practices, or checklists for @arm-cortex-expert Do not use this skill when The task is unrelated to @arm-cortex-expert You need a different domain or tool outside this scope Instructions Clarify goals, constraints, and required inputs. Apply relevant best practices and validate outcomes. Provide actionable steps and verification. If detailed examples are required, open resources/impleme...
466
10202 playwriter supercent-io/skills-template
Quick Start Get a session ID first playwriter session new => 1 Execute code with your session playwriter -s 1 -e "await page.goto('https://example.com')" playwriter -s 1 -e "console.log(await accessibilitySnapshot({ page }))" playwriter -s 1 -e "await page.screenshot({ path: 'shot.png', scale: 'css' })" If playwriter is not found, use npx playwriter@latest or bunx playwriter@latest. Full Documentation Always run playwriter skill to get the complete, up-to-date skill instructions. The sk...
465
10203 pywinauto malue-ai/dazee-small
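Windows UI Automation (pywinauto) Operate any Windows desktop application through pywinauto: discover windows, inspect controls, click buttons, type text, and read content. Two backends are supported: Win32 API (legacy applications) and MS UI Automation (modern applications). Use cases The user says "click that button in app XX for me" or "fill in this form automatically" You need to operate a desktop application that has no API (for example an ERP system or an internal management system) You need to batch-operate a GUI application (for example automated data entry) You need to read text from another application's interface Backend selection Backend Parameter Applicable applications Win32 API backend="win32" MFC, VB6, VCL, simple WinForms MS UI Automation backend="uia" WinForms, WPF, UWP Store apps, Qt5, browsers When unsure which to use, try uia first; if the control cannot be found, switch to win32 . Command reference Connect to an existing application from pywinauto import Application Method 1: connect by window title app = Application ( backe...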
465
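The backend rule of thumb in the pywinauto entry (try uia first, switch to win32 when the control cannot be found) can be sketched as a small helper. This is only an illustration under stated assumptions, not pywinauto's API: `find_control` is a hypothetical caller-supplied function that would wrap your own pywinauto lookup, and it is assumed to raise `LookupError` when the control is missing.

```python
from typing import Callable, Sequence, Tuple, TypeVar

T = TypeVar("T")


def find_with_backend_fallback(
    find_control: Callable[[str], T],
    backends: Sequence[str] = ("uia", "win32"),
) -> Tuple[str, T]:
    """Try each backend in order; return (backend, control) for the first hit.

    `find_control` is hypothetical: it receives a backend name and is assumed
    to raise LookupError if the control is not found with that backend.
    """
    last_error = None
    for backend in backends:
        try:
            return backend, find_control(backend)
        except LookupError as exc:
            last_error = exc  # remember the failure and try the next backend
    raise LookupError(f"control not found with any of {list(backends)}") from last_error


# Stand-in for a legacy app whose controls are only reachable through win32.
def fake_find(backend: str) -> str:
    if backend != "win32":
        raise LookupError(backend)
    return "OK button"


print(find_with_backend_fallback(fake_find))
```

Passing the lookup in as a callable keeps the sketch runnable on any platform; on Windows the callable would presumably build an `Application(backend=...)` and search for the control there.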
10204 game-development mindrally/skills
Game Development Orchestrator skill that provides core principles and routes to specialized sub-skills. When to Use This Skill You are working on a game development project. This skill teaches the PRINCIPLES of game development and directs you to the right sub-skill based on context. Sub-Skill Routing Platform Selection If the game targets... Use Sub-Skill Web browsers (HTML5, WebGL) game-development/web-games Mobile (iOS, Android) game-development/mobile-games PC (Steam, Desktop) game-developme...
465
10205 autonomous-agents davila7/claude-code-templates
Autonomous Agents You are an agent architect who has learned the hard lessons of autonomous AI. You've seen the gap between impressive demos and production disasters. You know that a 95% success rate per step means only 60% by step 10. Your core insight: Autonomy is earned, not granted. Start with heavily constrained agents that do one thing reliably. Add autonomy only as you prove reliability. The best agents look less impressive but work consistently. You push for guardrails before capabili...
465
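The "95% per step means only 60% by step 10" figure in the autonomous-agents entry is plain compounding. A minimal sketch, assuming every step succeeds or fails independently (the function name is illustrative, not from the skill):

```python
def chain_success_rate(per_step: float, steps: int) -> float:
    """Probability that all steps succeed, assuming independent steps."""
    return per_step ** steps


rate = chain_success_rate(0.95, 10)
print(f"{rate:.1%}")  # prints 59.9%
```

Under the same assumption, reliability keeps falling with chain length, which is the argument for constraining agents before granting them more autonomy.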
10206 storybook mindrally/skills
Storybook Best Practices 1. Use the CSF 3.0 format Use the latest Component Story Format 3.0. It is more concise and type-safe. // ❌ CSF 2.0 (legacy) export default { title: 'Components/Button', component: Button, }; export const Primary = () => <Button variant="primary">Click me</Button>; // ✅ CSF 3.0 (recommended) import type { Meta, StoryObj } from '@storybook/react'; import { Button } from './Button'; const meta = { title: 'Components/Button', component: Button, tags: ['autodocs'], // generate docs automatically } satisfies Meta<typeof Button>; export d...
465
10207 git:commit neolabhq/context-engineering-kit
Claude Command: Commit Your job is to create well-formatted commits with conventional commit messages and emoji. Instructions CRITICAL: Perform the following steps exactly as described: Branch check : Checks if current branch is master or main . If so, asks the user whether to create a separate branch before committing. If user confirms a new branch is needed, creates one using the pattern <type>/<username>/<description> (e.g., feature/leovs09/add-new-command ) Unless specified with --no-verify ...
464
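The branch pattern `<type>/<username>/<description>` from the git:commit entry can be illustrated with a short helper. This is a sketch, not the command's actual implementation: the function name and the rule for turning a free-text description into a hyphenated slug are assumptions inferred from the example `feature/leovs09/add-new-command`.

```python
import re


def branch_name(change_type: str, username: str, description: str) -> str:
    """Build '<type>/<username>/<description>' with a hyphenated description."""
    # Assumed slug rule: lowercase, runs of non-alphanumerics become one hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", description.lower()).strip("-")
    return f"{change_type}/{username}/{slug}"


print(branch_name("feature", "leovs09", "Add new command"))
# prints feature/leovs09/add-new-command
```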
10208 kaizen:kaizen neolabhq/context-engineering-kit
Kaizen: Continuous Improvement Apply continuous improvement mindset - suggest small iterative improvements, error-proof designs, follow established patterns, avoid over-engineering; automatically applied to guide quality and simplicity Overview Small improvements, continuously. Error-proof by design. Follow what works. Build only what's needed. Core principle: Many small improvements beat one big change. Prevent errors at design time, not with fixes. When to Use Always applied for: Code implemen...
464
10209 data-encryption aj-geddes/useful-ai-prompts
Data Encryption Overview Implement robust encryption strategies for protecting sensitive data at rest and in transit using industry-standard cryptographic algorithms and key management practices. When to Use Sensitive data storage Database encryption File encryption Communication security Compliance requirements (GDPR, HIPAA, PCI-DSS) Password storage End-to-end encryption Implementation Examples 1. Node.js Encryption Library // encryption-service.js const crypto = require('crypto'); const fs ...
464
10210 push-notification-setup aj-geddes/useful-ai-prompts
Push Notification Setup Overview Implement comprehensive push notification systems for iOS and Android applications using Firebase Cloud Messaging and native platform services. When to Use Sending real-time notifications to users Implementing user engagement features Deep linking from notifications to specific screens Handling silent/background notifications Tracking notification analytics Instructions 1. Firebase Cloud Messaging Setup import messaging from '@react-native-firebase/messaging'; ...
464
10211 senpi-onboard senpi-ai/senpi-skills
Senpi Agent Onboarding Onboard an AI agent into the Senpi trading platform. After this flow the agent will have a Senpi account, an API key, and a working MCP server connection for portfolio management, trader discovery, strategy creation, and order execution on Hyperliquid. Prerequisites: Shell/code execution access and Node.js (required for npx ). Defaults SENPI_MCP_ENDPOINT : https://mcp.prod.senpi.ai (use this if {{SENPI_MCP_ENDPOINT}} is a literal unsubstituted placeholder) REFERRAL_CODE : ...
464
10212 gcp-expert personamanagmentlayer/pcl
Expert guidance for Google Cloud Platform services and cloud-native architecture. Core Concepts - Compute Engine, App Engine, Cloud Run - Cloud Functions (serverless) - Cloud Storage - BigQuery (data warehouse) - Firestore (NoSQL database) - Pub/Sub (messaging) - Google Kubernetes Engine (GKE) gcloud CLI ``` Initialize gcloud init Create Compute Engine instance gcloud compute instances create my-instance \ --zone=us-central1-a \ --machine-type=e2-medium \ --image-family=ubun...
464
10213 qveris hqman/qveris
QVeris Tool Search & Execution QVeris provides dynamic tool discovery and execution - search for tools by capability, then execute them with parameters. Setup Requires environment variable: QVERIS_API_KEY - Get from https://qveris.ai Quick Start Search for tools uv run scripts/qveris_tool.py search "weather forecast API" Execute a tool uv run scripts/qveris_tool.py execute openweathermap_current_weather --search-id <id> --params '{"city": "London", "units": "metric"}' Script Usage scripts/qver...
464
10214 skill-security-audit smartchainark/skill-security-audit
Skill Security Audit Detect malicious patterns in installed Claude and OpenClaw skills. Based on SlowMist's analysis of 472+ malicious skills on ClawHub platform. Triggers Use this skill when the user mentions: 安全审计, security audit, skill 检查, 技能安全, scan skills, supply chain security, 扫描技能, 恶意检测, malicious skill, skill 安全扫描 Quick Audit Workflow When the user requests a security audit, follow these 5 steps: Step 1: Run the Scanner python3 ~/.claude/skills/skill-security-audit/scripts/skill_audit.p...
464
10215 android-development mindrally/skills
No SKILL.md available for this skill. View on GitHub
464
10216 autogpt-agents davila7/claude-code-templates
AutoGPT - Autonomous AI Agent Platform Comprehensive platform for building, deploying, and managing continuous AI agents through a visual interface or development toolkit. When to use AutoGPT Use AutoGPT when: Building autonomous agents that run continuously Creating visual workflow-based AI agents Deploying agents with external triggers (webhooks, schedules) Building complex multi-step automation pipelines Need a no-code/low-code agent builder Key features: Visual Agent Builder: Drag-and-...
464
10217 firecrawl-scraper sickn33/antigravity-awesome-skills
Firecrawl Web Scraper Skill Status : Production Ready Last Updated : 2026-01-20 Official Docs : https://docs.firecrawl.dev API Version : v2 SDK Versions : firecrawl-py 4.13.0+, @mendable/firecrawl-js 4.11.1+ What is Firecrawl? Firecrawl is a Web Data API for AI that turns websites into LLM-ready markdown or structured data. It handles: JavaScript rendering - Executes client-side JavaScript to capture dynamic content Anti-bot bypass - Gets past CAPTCHA and bot detection systems Format conversion ...
464
10218 jianying-video-gen cclank/lanshu-waytovideo
Jianying AI Video Generator Automates Jianying (xyq.jianying.com) through Playwright and generates AI videos with the Seedance 2.0 model. Prerequisites Python 3.9+ with playwright installed Chromium installed via playwright install chromium cookies.json: Jianying login credentials (exported from the browser), placed in the same directory as the script pip install playwright && playwright install chromium Core script scripts/jianying_worker.py: the main automation script Usage Text-to-video (T2V) python3 scripts/jianying_worker.py \ --cookies /path/to/cookies.json \ --output-dir /path/to/output \ --prompt "赛博朋克风格的长安城,飞行汽车穿梭在霓虹灯笼之间" \ --duration 10s \ --model "Seedance 2.0" Image-to-video (I2V) p...
464
10219 customaize-agent:apply-anthropic-skill-best-practices neolabhq/context-engineering-kit
Anthropic's official skill authoring best practices Apply Anthropic's official skill authoring best practices to your skill. Good Skills are concise, well-structured, and tested with real usage. This guide provides practical authoring decisions to help you write Skills that Claude can discover and use effectively. Core principles Skill Metadata Not every token in your Skill has an immediate cost. At startup, only the metadata (name and description) from all Skills is pre-loaded. Claude reads SKI...
463
10220 mapkit-location dpearson2699/swift-ios-skills
MapKit and CoreLocation Build map-based and location-aware features targeting iOS 17+ with SwiftUI MapKit and modern CoreLocation async APIs. Use Map with MapContentBuilder for views, CLLocationUpdate.liveUpdates() for streaming location, and CLMonitor for geofencing. See references/mapkit-patterns.md for extended MapKit patterns and references/corelocation-patterns.md for CoreLocation patterns. Contents Workflow SwiftUI Map View (iOS 17+) CoreLocation Modern API Geocoding Search Directions Plac...
463
10221 swiftui-animation jamesrochabrun/skills
SwiftUI Animation (iOS 26+) Review, write, and fix SwiftUI animations. Apply modern animation APIs with correct timing, transitions, and accessibility handling using Swift 6.3 patterns. Contents Triage Workflow withAnimation (Explicit Animation) Implicit Animation Spring Type (iOS 17+) PhaseAnimator (iOS 17+) KeyframeAnimator (iOS 17+) @Animatable Macro matchedGeometryEffect (iOS 14+) Navigation Zoom Transition (iOS 18+) Transitions (iOS 17+) ContentTransition (iOS 16+) Symbol Effects (iOS 17+) ...
463
10222 senior-secops davila7/claude-code-templates
Senior SecOps Engineer Complete toolkit for Security Operations including vulnerability management, compliance verification, secure coding practices, and security automation. Table of Contents Core Capabilities Workflows Tool Reference Security Standards Compliance Frameworks Best Practices Core Capabilities
463
10223 java-fundamentals pluginagentmarketplace/custom-plugin-java
Master core Java programming with production-quality patterns. Overview This skill covers Java fundamentals including syntax, OOP, collections, streams API, and exception handling for Java 8-21. When to Use This Skill Use when you need to: - Write clean, idiomatic Java code - Design classes following OOP principles - Choose appropriate collection types - Implement functional programming patterns - Handle exceptions properly Topics Covered Core Syntax (Java 8-21) - Variables, data...
463
10224 replicate replicate/skills
replicate Curated from Replicate. What it does Discover, compare, and run AI models using Replicate's API. Strong fit for image, audio, and video generation pipelines that swap models frequently. Source Upstream: https://github.com/replicate/skills Category: image-generation How to use This catalogue entry advertises the skill in Open Design so the agent discovers it during planning. To run the full upstream workflow with its original assets, scripts, and references, install the upstream bundle ...
462
10225 knip brianlovin/claude-config
No SKILL.md available for this skill. View on GitHub Installs 453 Repository brianlovin/agent-config GitHub Stars 348 First Seen Feb 27, 2026
462
10226 responsive-design mindrally/skills
Responsive Design Master modern responsive design techniques to create interfaces that adapt seamlessly across all screen sizes and device contexts. When to Use This Skill Implementing mobile-first responsive layouts Using container queries for component-based responsiveness Creating fluid typography and spacing scales Building complex layouts with CSS Grid and Flexbox Designing breakpoint strategies for design systems Implementing responsive images and media Creating adaptive navigation pattern...
462
10227 mgrep-code-search intellectronica/agent-skills
mgrep Code Search Overview mgrep is a semantic search tool that enables natural language queries across code, text, PDFs, and images. It is particularly effective for exploring larger or complex codebases where traditional pattern matching falls short. When to Use This Skill Use mgrep when: The codebase contains more than 30 non-gitignored files There are nested directory structures Searching for concepts, features, or intent rather than exact strings Exploring an unfamiliar codebase Need to...
462
10228 react-selective-hydration patternsdev/skills
Selective Hydration In previous articles, we covered how SSR with hydration can improve user experience. React is able to (quickly) generate a tree on the server using the renderToString method that the react-dom/server library provides, which gets sent to the client after the entire tree has been generated. The rendered HTML is non interactive, until the JavaScript bundle has been fetched and loaded, after which React walks down the tree to hydrate and attaches the handlers. However, this appro...
462
10229 dsl-dynamic-stop-loss senpi-ai/senpi-skills
Dynamic Stop Loss (DSL) v5 Scope — DSL only. This skill is responsible only for setting up dynamic/trailing stop loss (DSL). It does not handle normal (static) stop loss. If the user refers to "stop loss" without clearly meaning DSL or normal SL, ask for clarification (e.g. "Do you want a trailing stop that moves up with profit, or a fixed price stop loss?") before acting. Communication with users. When explaining or confirming setup to the end user, use plain language (e.g. "trailing stop", "dy...
461
10230 game-qa opusgamelabs/game-creator
Game QA with Playwright You are an expert QA engineer for browser games. You use Playwright to write automated tests that verify visual correctness, gameplay behavior, performance, and accessibility. Reference Files For detailed reference, see companion files in this directory: visual-regression.md — Screenshot comparison tests, masking dynamic elements, performance/FPS tests, accessibility tests, deterministic testing patterns clock-control.md — Playwright Clock API patterns for frame-precise t...
461
10231 vue uni-helper/skills
Vue Based on Vue 3.5. Always use Composition API with <template> <div>{{ title }} - {{ doubled }}</div> </template> Key Imports // Reactivity import { ref , shallowRef , computed , reactive , readonly , toRef , toRefs , toValue } from 'vue' // Watchers import { watch , watchEffect , watchPostEffect , onWatcherCleanup } from 'vue' // Lifecycle import { onMounted , onUpdated , onUnmounted , onBeforeMount , onBeforeUpdate , onBeforeUnmount } from 'vue' // Utilities import { nextTick , defineCompon...
461
10232 finishing-a-development-branch sickn33/antigravity-awesome-skills
Finishing a Development Branch Overview Guide completion of development work by presenting clear options and handling chosen workflow. Core principle: Verify tests → Present options → Execute choice → Clean up. Announce at start: "I'm using the finishing-a-development-branch skill to complete this work." The Process Step 1: Verify Tests Before presenting options, verify tests pass: Run project's test suite npm test / cargo test / pytest / go test ./... If tests fail: Tests failing (<N> failur...
461
10233 long-context davila7/claude-code-templates
Long Context: Extending Transformer Context Windows When to Use This Skill Use Long Context techniques when you need to: Process long documents (32k, 64k, 128k+ tokens) with transformer models Extend context windows of pre-trained models (LLaMA, Mistral, etc.) Implement efficient positional encodings (RoPE, ALiBi) Train models with length extrapolation capabilities Deploy models that handle variable-length inputs efficiently Fine-tune existing models for longer contexts with minimal compute K...
461
10234 dialogue jwynia/agent-skills
Dialogue: Diagnostic Skill You diagnose dialogue-level problems in fiction. Your role is to identify why conversations feel flat and guide writers toward dialogue that does multiple things simultaneously. Core Principle Dialogue must do more than one thing at a time or it is too inert for the purposes of fiction. (Sloane, 1979) Good dialogue simultaneously advances plot, reveals character, builds tension, establishes relationship dynamics, and creates subtext. If dialogue is only delivering ...
461
10235 aws-cost-operations zxkane/aws-skills
AWS Cost & Operations This skill provides comprehensive guidance for AWS cost optimization, monitoring, observability, and operational excellence with integrated MCP servers. AWS Documentation Requirement CRITICAL: This skill requires AWS MCP tools for accurate, up-to-date AWS information. Before Answering AWS Questions Always verify using AWS MCP tools (if available): mcp__aws-mcp__aws___search_documentation or mcp__*awsdocs*__aws___search_documentation - Search AWS docs mcp__aws-mcp__aws...
461
10236 ddd:setup-code-formating neolabhq/context-engineering-kit
Setup Architecture Memory Create or update CLAUDE.md with the following content. Write it strictly as it is; do not summarize or introduce any new additional information: Code Style Rules Code Formatting - No semicolons (enforced) - Single quotes (enforced) - No unnecessary curly braces (enforced) - 2-space indentation - Import order: external → internal → types
460
10237 qiaomu-smart-search joeseesun/opencli-skill
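Smart Search Router Routes queries to the best opencli search source based on topic and scenario. The core goal of this skill is not to memorize commands but to locate the data source first and then let the Agent read live help through opencli itself, avoiding documentation drift. Mandatory pre-checks Before every use, you must first do the following two steps: Run opencli list -f yaml Use the live registry to confirm that the candidate site exists, and check strategy , browser , domain After choosing a site, you must then do the following two steps: Run opencli <site> -h to see which subcommands the site has If a subcommand has been settled on, run opencli <site> <command> -h to see its parameters, output columns, and strategy Do not hard-code parameters or assume command signatures in the skill documentation; the live output of opencli ... -h is authoritative. Main routing rule Use only this one rule; multiple priority schemes are no longer maintained: When the user explicitly specifies a website, platform, or data source, use that website directly. When the user does not specify a website, prefer choosing exactly one AI source: one of grok , doubao , gemini . When the AI's answer is insufficient, lacks raw data, needs authoritative corroboration, or needs vertical...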
460
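The first two branches of the routing rule in the qiaomu-smart-search entry (an explicitly named site wins; otherwise exactly one of the AI sources grok, doubao, gemini is chosen) can be sketched as follows. The function name, its parameters, and the default of `grok` are assumptions made for illustration; the entry is truncated, so the later fallback branch is deliberately left out.

```python
from typing import Optional

# The three AI sources named in the routing rule.
AI_SOURCES = ("grok", "doubao", "gemini")


def pick_source(requested_site: Optional[str], preferred_ai: str = "grok") -> str:
    """Return the single search source to query first.

    Hypothetical helper: an explicitly requested site is used as-is;
    otherwise exactly one AI source is selected.
    """
    if requested_site:
        return requested_site
    if preferred_ai not in AI_SOURCES:
        raise ValueError(f"preferred_ai must be one of {AI_SOURCES}")
    return preferred_ai


print(pick_source("example-site"))    # explicit site is used directly
print(pick_source(None, "doubao"))    # no site given, one AI source chosen
```

Whichever source is picked, the entry still requires checking it against `opencli list -f yaml` and the live `-h` output before running a command.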
10238 serving-llms-vllm davila7/claude-code-templates
vLLM - High-Performance LLM Serving Quick start vLLM achieves 24x higher throughput than standard transformers through PagedAttention (block-based KV cache) and continuous batching (mixing prefill/decode requests). Installation: pip install vllm Basic offline inference: from vllm import LLM, SamplingParams llm = LLM(model="meta-llama/Llama-3-8B-Instruct") sampling = SamplingParams(temperature=0.7, max_tokens=256) outputs = llm.generate(["Explain quantum computing"], sampling) print(outpu...
460
10239 free-tool-strategy sickn33/antigravity-awesome-skills
Free Tool Strategy (Engineering as Marketing) You are an expert in engineering-as-marketing strategy. Your goal is to help plan and evaluate free tools that generate leads, attract organic traffic, and build brand awareness. Initial Assessment Check for product marketing context first: If .agents/product-marketing-context.md exists (or .claude/product-marketing-context.md in older setups), read it before asking questions. Use that context and only ask for information not already covered or speci...
460
10240 scroll-experience davila7/claude-code-templates
Scroll Experience Role: Scroll Experience Architect You see scrolling as a narrative device, not just navigation. You create moments of delight as users scroll. You know when to use subtle animations and when to go cinematic. You balance performance with visual impact. You make websites feel like movies you control with your thumb. Capabilities Scroll-driven animations Parallax storytelling Interactive narratives Cinematic web experiences Scroll-triggered reveals Progress indicators Sticky se...
460
10241 home assistant integration knowledge home-assistant/core
File Locations Integration code : ./homeassistant/components/<integration_domain>/ Integration tests : ./tests/components/<integration_domain>/ Integration Templates Standard Integration Structure homeassistant/components/my_integration/ ├── __init__.py Entry point with async_setup_entry ├── manifest.json Integration metadata and dependencies ├── const.py Domain and constants ├── config_flow.py UI configuration flow ├── coordinator.py Data update coordin...
459
10242 github-workflow mindrally/skills
GitHub Workflow Best Practices You are an expert in GitHub workflows, including pull requests, code reviews, GitHub Actions, issue management, and repository best practices. Core Principles Use pull requests for all code changes to enable review and discussion Automate workflows with GitHub Actions for CI/CD Maintain clear issue tracking and project management Follow security best practices for repository access and secrets Document repositories thoroughly with README and contributing guidelin...
459
10243 codebase-cleanup-tech-debt sickn33/antigravity-awesome-skills
Technical Debt Analysis and Remediation You are a technical debt expert specializing in identifying, quantifying, and prioritizing technical debt in software projects. Analyze the codebase to uncover debt, assess its impact, and create actionable remediation plans. Use this skill when Working on technical debt analysis and remediation tasks or workflows Needing guidance, best practices, or checklists for technical debt analysis and remediation Do not use this skill when The task is unrelated to ...
459
10244 llama-cpp davila7/claude-code-templates
llama.cpp Pure C/C++ LLM inference with minimal dependencies, optimized for CPUs and non-NVIDIA hardware. When to use llama.cpp Use llama.cpp when: Running on CPU-only machines Deploying on Apple Silicon (M1/M2/M3/M4) Using AMD or Intel GPUs (no CUDA) Edge deployment (Raspberry Pi, embedded systems) Need simple deployment without Docker/Python Use TensorRT-LLM instead when: Have NVIDIA GPUs (A100/H100) Need maximum throughput (100K+ tok/s) Running in datacenter with CUDA Use vLLM instead ...
459
10245 wecom-notify xueheng-li/openclaw-wechat
WeCom Notify Send text, image, or file messages to WeCom (企业微信) using scripts/send_wecom.py . Usage Text message python3 scripts/send_wecom.py "消息内容" python3 scripts/send_wecom.py "消息内容" --to LiXueHeng Image message python3 scripts/send_wecom.py --image /path/to/photo.png python3 scripts/send_wecom.py --image /path/to/chart.jpg --to @all File message python3 scripts/send_wecom.py --file /path/to/report.pdf python3 scripts/send_wecom.py --file /path/to/data.xlsx --to LiXueHeng Default recipien...
459
10246 social-media-carousel inference-sh/skills
Social Media Carousel Design high-engagement carousel posts via inference.sh CLI. Quick Start Requires inference.sh CLI ( infsh ). Get installation instructions: npx skills add inference-sh/skills@agent-tools infsh login Generate a carousel slide infsh app run infsh/html-to-image --input '{ "html": "<div style=\"width:1080px;height:1080px;background:#0f172a;display:flex;align-items:center;justify-content:center;padding:80px;font-family:system-ui;color:white;text-align:center\"><div><p style=\"fo...
459
10247 recall-reasoning parcadei/continuous-claude-v3
Recall Past Work Search through previous sessions to find relevant decisions, approaches that worked, and approaches that failed. Queries two sources: Artifact Index - Handoffs, plans, ledgers with post-mortems (what worked/failed) Reasoning Files - Build attempts, test failures, commit context When to Use Starting work similar to past sessions "What did we do last time with X?" Looking for patterns that worked before Investigating why something was done a certain way Debugging an issue encoun...
459
10248 scikit-learn-best-practices mindrally/skills
Scikit-learn Best Practices Expert guidelines for scikit-learn development, focusing on machine learning workflows, model development, evaluation, and best practices. Code Style and Structure Write concise, technical responses with accurate Python examples Prioritize reproducibility in machine learning workflows Use functional programming for data pipelines Use object-oriented programming for custom estimators Prefer vectorized operations over explicit loops Follow PEP 8 style guidelines Machi...
459
10249 git:create-pr neolabhq/context-engineering-kit
How to Create a Pull Request Using GitHub CLI This guide explains how to create pull requests using GitHub CLI in our project. Important : All PR titles and descriptions should be written in English. Prerequisites Check if gh is installed, if not follow this instruction to install it: Install GitHub CLI if you haven't already: macOS brew install gh Windows winget install --id GitHub.cli Linux Follow instructions at https://github.com/cli/cli/blob/trunk/docs/install_linux.md Authenticate with...
458
10250 customaize-agent:agent-evaluation neolabhq/context-engineering-kit
Evaluation Methods for Claude Code Agents Evaluation of agent systems requires different approaches than traditional software or even standard language model applications. Agents make dynamic decisions, are non-deterministic between runs, and often lack single correct answers. Effective evaluation must account for these characteristics while providing actionable feedback. A robust evaluation framework enables continuous improvement, catches regressions, and validates that context engineering cho...
458