canghe-image-gen

Installs: 147
Rank: #5847

Install

npx skills add https://github.com/freestylefly/canghe-skills --skill canghe-image-gen

Image Generation (AI SDK)

Official API-based image generation. Supports OpenAI, Google, DashScope (Alibaba Tongyi Wanxiang), and Canghe providers.

Script Directory

Agent Execution:
SKILL_DIR = this SKILL.md file's directory
Script path = ${SKILL_DIR}/scripts/main.ts

Preferences (EXTEND.md)

Use Bash to check whether EXTEND.md exists (priority order):

Check project-level first

test -f .canghe-skills/canghe-image-gen/EXTEND.md && echo "project"

Then user-level (cross-platform: $HOME works on macOS/Linux/WSL)

test -f "$HOME/.canghe-skills/canghe-image-gen/EXTEND.md" && echo "user"

┌─────────────────────────────────────────────────┬───────────────────┐
│ Path                                            │ Location          │
├─────────────────────────────────────────────────┼───────────────────┤
│ .canghe-skills/canghe-image-gen/EXTEND.md       │ Project directory │
├─────────────────────────────────────────────────┼───────────────────┤
│ $HOME/.canghe-skills/canghe-image-gen/EXTEND.md │ User home         │
└─────────────────────────────────────────────────┴───────────────────┘

┌───────────┬─────────────────────────────┐
│ Result    │ Action                      │
├───────────┼─────────────────────────────┤
│ Found     │ Read, parse, apply settings │
├───────────┼─────────────────────────────┤
│ Not found │ Use defaults                │
└───────────┴─────────────────────────────┘
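
The lookup order in the two tables above (project-level first, then user-level, otherwise defaults) can be sketched as a small pure function. This is only an illustration: `findExtendMd`, the `ExtendLocation` type, and the injected `exists` predicate are hypothetical names, not taken from scripts/main.ts.

```typescript
// Hypothetical helper: names and return shape are illustrative only.
type ExtendLocation = { scope: "project" | "user"; path: string };

function findExtendMd(
  home: string,
  exists: (path: string) => boolean, // injected so the sketch needs no file-system API
): ExtendLocation | null {
  const rel = ".canghe-skills/canghe-image-gen/EXTEND.md";
  const candidates: ExtendLocation[] = [
    { scope: "project", path: rel }, // checked first
    { scope: "user", path: `${home}/${rel}` }, // checked second
  ];
  for (const candidate of candidates) {
    if (exists(candidate.path)) return candidate;
  }
  return null; // not found: the caller uses defaults
}
```

A caller would pass a real existence check for `exists` and treat a `null` result as "use defaults".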

EXTEND.md Supports

Default provider | Default quality | Default aspect ratio | Default image size | Default models

Schema: references/config/preferences-schema.md

Usage

Basic

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

With aspect ratio

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

High quality

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

From prompt files

npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

With reference images (Google multimodal or OpenAI edits)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

With reference images (explicit provider/model)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --provider google --model gemini-3-pro-image-preview --ref source.png

Specific provider

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai

DashScope (Alibaba Tongyi Wanxiang)

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "一只可爱的猫" --image out.png --provider dashscope

Canghe third-party gateway

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "一只可爱的猫" --image out.png --provider canghe
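
Options

┌───────────────────────────────────────────┬───────────────────────────────────────────────────────────────────────────┐
│ Option                                    │ Description                                                               │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --prompt, -p                              │ Prompt text                                                               │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --promptfiles                             │ Read prompt from files (concatenated)                                     │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --image                                   │ Output image path (required)                                              │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --provider google|openai|dashscope|canghe │ Force provider (default: google)                                          │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --model, -m                               │ Model ID (--ref with OpenAI requires GPT Image model, e.g. gpt-image-1.5) │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --ar                                      │ Aspect ratio (e.g., 16:9, 1:1, 4:3)                                       │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --size                                    │ Size (e.g., 1024x1024)                                                    │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --quality normal|2k                       │ Quality preset (default: 2k)                                              │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --imageSize 1K|2K|4K                      │ Image size for Google (default: from quality)                             │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --ref                                     │ Reference images. Supported by Google multimodal, OpenAI edits (GPT Image │
│                                           │ models), and Canghe (image_url). If provider omitted: Google first, then  │
│                                           │ OpenAI, then Canghe                                                       │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --n                                       │ Number of images                                                          │
├───────────────────────────────────────────┼───────────────────────────────────────────────────────────────────────────┤
│ --json                                    │ JSON output                                                               │
└───────────────────────────────────────────┴───────────────────────────────────────────────────────────────────────────┘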
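
Environment Variables

┌───────────────────────┬─────────────────────────────────────────────────────────────┐
│ Variable              │ Description                                                 │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ OPENAI_API_KEY        │ OpenAI API key                                              │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ GOOGLE_API_KEY        │ Google API key                                              │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ DASHSCOPE_API_KEY     │ DashScope API key (Alibaba Cloud)                           │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ CANGHE_API_KEY        │ Canghe API key                                              │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ OPENAI_IMAGE_MODEL    │ OpenAI model override                                       │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ GOOGLE_IMAGE_MODEL    │ Google model override                                       │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ DASHSCOPE_IMAGE_MODEL │ DashScope model override (default: z-image-turbo)           │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ CANGHE_IMAGE_MODEL    │ Canghe model override (default: gemini-3-pro-image-preview) │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ OPENAI_BASE_URL       │ Custom OpenAI endpoint                                      │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ GOOGLE_BASE_URL       │ Custom Google endpoint                                      │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ DASHSCOPE_BASE_URL    │ Custom DashScope endpoint                                   │
├───────────────────────┼─────────────────────────────────────────────────────────────┤
│ CANGHE_BASE_URL       │ Custom Canghe endpoint (default: https://api.canghe.ai/v1)  │
└───────────────────────┴─────────────────────────────────────────────────────────────┘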

Load Priority

CLI args > EXTEND.md > env vars > /.canghe-skills/.env > ~/.canghe-skills/.env
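
One way to read the load priority is "for each setting, the first source in the chain that defines it wins". The sketch below illustrates that rule only; `resolveSettings`, the `Settings` shape, and the sample values are hypothetical, and which setting each source can actually carry is an assumption.

```typescript
// Hypothetical setting keys, chosen for illustration only.
const KEYS = ["model", "quality", "ar"] as const;
type Key = (typeof KEYS)[number];
type Settings = Partial<Record<Key, string>>;

// `sourcesHighToLow` is ordered like the chain above:
// CLI args, EXTEND.md, env vars, /.canghe-skills/.env, ~/.canghe-skills/.env
function resolveSettings(sourcesHighToLow: Settings[]): Settings {
  const result: Settings = {};
  for (const key of KEYS) {
    // The first source that defines the key wins; later sources are ignored.
    const winner = sourcesHighToLow.find((source) => source[key] !== undefined);
    if (winner !== undefined) result[key] = winner[key];
  }
  return result;
}

const example = resolveSettings([
  { model: "model-from-cli" }, // CLI args
  { quality: "normal", ar: "16:9" }, // EXTEND.md
  { model: "model-from-env", quality: "2k" }, // env vars
  {}, // /.canghe-skills/.env
  { ar: "1:1" }, // ~/.canghe-skills/.env
]);
// example is { model: "model-from-cli", quality: "normal", ar: "16:9" }
```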

Provider Selection

--ref provided + no --provider → auto-select Google first, then OpenAI, then Canghe
--provider specified → use it (if --ref, must be google or openai or canghe)
Only one API key available → use that provider
Multiple available → default to Google
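
The four rules above can be sketched as one decision function. This is a hedged reading, not the script's actual logic: `selectProvider` and `SelectionInput` are hypothetical names, "available" is assumed to mean "the provider's API key is set", and the fallback used when several keys are set but Google's is not is a guess the source does not specify.

```typescript
type Provider = "google" | "openai" | "dashscope" | "canghe";

interface SelectionInput {
  provider?: Provider; // value of --provider, if given
  hasRef: boolean; // whether --ref was passed
  available: Provider[]; // assumption: providers whose API key is set
}

// Providers that accept reference images, in auto-select order.
const REF_ORDER: Provider[] = ["google", "openai", "canghe"];

function selectProvider(input: SelectionInput): Provider {
  const { provider, hasRef, available } = input;
  if (provider !== undefined) {
    if (hasRef && REF_ORDER.indexOf(provider) === -1) {
      throw new Error(`--ref is not supported with provider "${provider}"`);
    }
    return provider;
  }
  if (available.length === 0) throw new Error("No API key available");
  if (hasRef) {
    const match = REF_ORDER.find((p) => available.indexOf(p) !== -1);
    if (match === undefined) throw new Error("No available provider supports --ref");
    return match;
  }
  if (available.length === 1) return available[0];
  // Assumption: without a Google key, fall back to the first available provider.
  return available.indexOf("google") !== -1 ? "google" : available[0];
}
```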
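
Quality Presets

┌──────────────┬──────────────────┬─────────────┬─────────────────────────────────────┐
│ Preset       │ Google imageSize │ OpenAI Size │ Use Case                            │
├──────────────┼──────────────────┼─────────────┼─────────────────────────────────────┤
│ normal       │ 1K               │ 1024px      │ Quick previews                      │
├──────────────┼──────────────────┼─────────────┼─────────────────────────────────────┤
│ 2k (default) │ 2K               │ 2048px      │ Covers, illustrations, infographics │
└──────────────┴──────────────────┴─────────────┴─────────────────────────────────────┘

Google imageSize: Can be overridden with --imageSize 1K|2K|4K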
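
As a minimal sketch of how the preset table and the --imageSize override might combine, the lookup below encodes the two rows and lets an explicit image size win. `PRESETS` and `resolveGoogleImageSize` are hypothetical names used only for illustration.

```typescript
type Quality = "normal" | "2k";
type ImageSize = "1K" | "2K" | "4K";

// Values copied from the Quality Presets table.
const PRESETS: Record<Quality, { googleImageSize: ImageSize; openaiPx: number }> = {
  normal: { googleImageSize: "1K", openaiPx: 1024 },
  "2k": { googleImageSize: "2K", openaiPx: 2048 },
};

// An explicit --imageSize overrides the preset; the default preset is 2k.
function resolveGoogleImageSize(quality: Quality = "2k", imageSize?: ImageSize): ImageSize {
  return imageSize ?? PRESETS[quality].googleImageSize;
}
```

With no arguments the function yields "2K"; passing "normal" yields "1K", and passing "normal" together with "4K" yields "4K".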

Aspect Ratios

Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 2.35:1

Google multimodal: uses imageConfig.aspectRatio
Google Imagen: uses aspectRatio parameter
OpenAI: maps to closest supported size
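
The source does not say how "maps to closest supported size" is computed. One plausible approach, sketched here under that caveat, is to pick the candidate size whose width/height ratio is nearest to the requested one on a log scale. `parseRatio`, `closestSize`, and the candidate list passed in by the caller are all hypothetical; check the provider's documentation for the sizes a given model really accepts.

```typescript
// Parses "16:9" or "2.35:1" into a width/height ratio; returns null if malformed.
function parseRatio(ar: string): number | null {
  const m = /^(\d+(?:\.\d+)?):(\d+(?:\.\d+)?)$/.exec(ar);
  if (m === null) return null;
  const w = Number(m[1]);
  const h = Number(m[2]);
  return w > 0 && h > 0 ? w / h : null;
}

// Returns the "WxH" entry closest to the requested ratio, or null when the
// ratio is invalid or no sizes are given (the caller can then use its default).
function closestSize(ar: string, sizes: string[]): string | null {
  const target = parseRatio(ar);
  if (target === null || sizes.length === 0) return null;
  let best = sizes[0];
  let bestDist = Infinity;
  for (const size of sizes) {
    const [w, h] = size.split("x").map(Number);
    // Log distance treats 2:1 and 1:2 as equally far from 1:1.
    const dist = Math.abs(Math.log(w / h) - Math.log(target));
    if (dist < bestDist) {
      best = size;
      bestDist = dist;
    }
  }
  return best;
}
```

For instance, with the illustrative candidate list ["1024x1024", "1536x1024", "1024x1536"], a request for 16:9 resolves to "1536x1024" and 9:16 to "1024x1536".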
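
Generation Mode

Default: Sequential generation (one image at a time). This ensures stable output and easier debugging.

Parallel Generation: Use only when the user explicitly requests parallel/concurrent generation.

┌──────────────────────┬───────────────────────────────────────────────┐
│ Mode                 │ When to Use                                   │
├──────────────────────┼───────────────────────────────────────────────┤
│ Sequential (default) │ Normal usage, single images, small batches    │
├──────────────────────┼───────────────────────────────────────────────┤
│ Parallel             │ User explicitly requests, large batches (10+) │
└──────────────────────┴───────────────────────────────────────────────┘

Parallel Settings (when requested):

┌─────────────────────────┬────────────────────────────────────────────────────┐
│ Setting                 │ Value                                              │
├─────────────────────────┼────────────────────────────────────────────────────┤
│ Recommended concurrency │ 4 subagents                                        │
├─────────────────────────┼────────────────────────────────────────────────────┤
│ Max concurrency         │ 8 subagents                                        │
├─────────────────────────┼────────────────────────────────────────────────────┤
│ Use case                │ Large batch generation when user requests parallel │
└─────────────────────────┴────────────────────────────────────────────────────┘

Agent Implementation (parallel mode only):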

Launch multiple generations in parallel using Task tool

Each Task runs as background subagent with run_in_background=true

Collect results via TaskOutput when all complete
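
The concurrency limits above (4 recommended, 8 maximum, sequential unless parallel is requested) can be illustrated with a simple planning helper that groups jobs into batches. This is a simplified, wave-based sketch rather than how the Task tool schedules subagents; `planBatches` and its parameters are hypothetical.

```typescript
const RECOMMENDED_CONCURRENCY = 4;
const MAX_CONCURRENCY = 8;

// Splits jobs into batches; each batch would be launched together.
function planBatches<T>(
  jobs: T[],
  parallelRequested: boolean,
  concurrency: number = RECOMMENDED_CONCURRENCY,
): T[][] {
  // Sequential mode runs one job per batch; parallel mode clamps to 1..8.
  const width = parallelRequested
    ? Math.max(1, Math.min(Math.floor(concurrency), MAX_CONCURRENCY))
    : 1;
  const batches: T[][] = [];
  for (let i = 0; i < jobs.length; i += width) {
    batches.push(jobs.slice(i, i + width));
  }
  return batches;
}
```

Ten jobs in parallel mode with the default concurrency give batches of 4, 4, and 2; a requested concurrency of 20 is clamped to 8.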

Error Handling

Missing API key → error with setup instructions
Generation failure → auto-retry once
Invalid aspect ratio → warning, proceed with default
Reference images with unsupported provider/model → error with fix hint (switch to Google multimodal or OpenAI GPT Image edits)

Extension Support

Custom configurations via EXTEND.md. See Preferences section for paths and supported options.
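
The "auto-retry once" rule under Error Handling amounts to at most two attempts per generation. A minimal sketch, assuming a synchronous `attempt` callback for brevity (a real API call would be asynchronous); `withOneRetry` is a hypothetical name.

```typescript
// Runs `attempt`; if it throws, runs it exactly one more time.
function withOneRetry<T>(attempt: () => T): T {
  try {
    return attempt();
  } catch {
    // Second and final attempt: an error here propagates to the caller.
    return attempt();
  }
}
```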
