Excalidraw Diagram Creator

Generate

.excalidraw

JSON files that

argue visually

, not just display information.

Setup:

If the user asks you to set up this skill (renderer, dependencies, etc.), see

README.md

for instructions.

Customization

All colors and brand-specific styles live in one file:

references/color-palette.md

. Read it before generating any diagram and use it as the single source of truth for all color choices — shape fills, strokes, text colors, evidence artifact backgrounds, everything.

To make this skill produce diagrams in your own brand style, edit

color-palette.md

. Everything else in this file is universal design methodology and Excalidraw best practices.

Core Philosophy

Diagrams should ARGUE, not DISPLAY.

A diagram isn't formatted text. It's a visual argument that shows relationships, causality, and flow that words alone can't express. The shape should BE the meaning.

The Isomorphism Test

If you removed all text, would the structure alone communicate the concept? If not, redesign.

The Education Test

Could someone learn something concrete from this diagram, or does it just label boxes? A good diagram teaches—it shows actual formats, real event names, concrete examples.

Depth Assessment (Do This First)

Before designing, determine what level of detail this diagram needs:

Simple/Conceptual Diagrams

Use abstract shapes when:

Explaining a mental model or philosophy

The audience doesn't need technical specifics

The concept IS the abstraction (e.g., "separation of concerns")

Comprehensive/Technical Diagrams

Use concrete examples when:

Diagramming a real system, protocol, or architecture

The diagram will be used to teach or explain (e.g., YouTube video)

The audience needs to understand what things actually look like

You're showing how multiple technologies integrate

For technical diagrams, you MUST include evidence artifacts

(see below).

Research Mandate (For Technical Diagrams)

Before drawing anything technical, research the actual specifications.

If you're diagramming a protocol, API, or framework:

Look up the actual JSON/data formats

Find the real event names, method names, or API endpoints

Understand how the pieces actually connect

Use real terminology, not generic placeholders

Bad: "Protocol" → "Frontend"

Good: "AG-UI streams events (RUN_STARTED, STATE_DELTA, A2UI_UPDATE)" → "CopilotKit renders via createA2UIMessageRenderer()"

Research makes diagrams accurate AND educational.

Evidence Artifacts

Evidence artifacts are concrete examples that prove your diagram is accurate and help viewers learn. Include them in technical diagrams.

Types of evidence artifacts

(choose what's relevant to your diagram):

Artifact Type

When to Use

How to Render

Code snippets

APIs, integrations, implementation details

Dark rectangle + syntax-colored text (see color palette for evidence artifact colors)

Data/JSON examples

Data formats, schemas, payloads

Dark rectangle + colored text (see color palette)

Event/step sequences

Protocols, workflows, lifecycles

Timeline pattern (line + dots + labels)

UI mockups

Showing actual output/results

Nested rectangles mimicking real UI

Real input content

Showing what goes IN to a system

Rectangle with sample content visible

API/method names

Real function calls, endpoints

Use actual names from docs, not placeholders

Example

For a diagram about a streaming protocol, you might show:

The actual event names from the spec (not just "Event 1", "Event 2")

A code snippet showing how to connect

What the streamed data actually looks like

Example

For a diagram about a data transformation pipeline:

Show sample input data (actual format, not "Input")

Show sample output data (actual format, not "Output")

Show intermediate states if relevant

The key principle:

show what things actually look like

, not just what they're called.

Multi-Zoom Architecture

Comprehensive diagrams operate at multiple zoom levels simultaneously. Think of it like a map that shows both the country borders AND the street names.

Level 1: Summary Flow

A simplified overview showing the full pipeline or process at a glance. Often placed at the top or bottom of the diagram.

Example

:

Input → Processing → Output

or

Client → Server → Database

Level 2: Section Boundaries

Labeled regions that group related components. These create visual "rooms" that help viewers understand what belongs together.

Example

Grouping by responsibility (Backend / Frontend), by phase (Setup / Execution / Cleanup), or by team (User / System / External)

Level 3: Detail Inside Sections

Evidence artifacts, code snippets, and concrete examples within each section. This is where the educational value lives.

Example

Inside a "Backend" section, you might show the actual API response format, not just a box labeled "API Response"

For comprehensive diagrams, aim to include all three levels.

The summary gives context, the sections organize, and the details teach.

Bad vs Good

Bad (Displaying)

Good (Arguing)

5 equal boxes with labels

Each concept has a shape that mirrors its behavior

Card grid layout

Visual structure matches conceptual structure

Icons decorating text

Shapes that ARE the meaning

Same container for everything

Distinct visual vocabulary per concept

Everything in a box

Free-floating text with selective containers

Simple vs Comprehensive (Know Which You Need)

Simple Diagram

Comprehensive Diagram

Generic labels: "Input" → "Process" → "Output"

Specific: shows what the input/output actually looks like

Named boxes: "API", "Database", "Client"

Named boxes + examples of actual requests/responses

"Events" or "Messages" label

Timeline with real event/message names from the spec

"UI" or "Dashboard" rectangle

Mockup showing actual UI elements and content

~30 seconds to explain

~2-3 minutes of teaching content

Viewer learns the structure

Viewer learns the structure AND the details

Simple diagrams

are fine for abstract concepts, quick overviews, or when the audience already knows the details.

Comprehensive diagrams

are needed for technical architectures, tutorials, educational content, or when you want the diagram itself to teach.

Container vs. Free-Floating Text

Not every piece of text needs a shape around it.

Default to free-floating text. Add containers only when they serve a purpose.

Use a Container When...

Use Free-Floating Text When...

It's the focal point of a section

It's a label or description

It needs visual grouping with other elements

It's supporting detail or metadata

Arrows need to connect to it

It describes something nearby

The shape itself carries meaning (decision diamond, etc.)

Typography alone creates sufficient hierarchy

It represents a distinct "thing" in the system

It's a section title, subtitle, or annotation

Typography as hierarchy

Use font size, weight, and color to create visual hierarchy without boxes. A 28px title doesn't need a rectangle around it.

The container test

For each boxed element, ask "Would this work as free-floating text?" If yes, remove the container.

Design Process (Do This BEFORE Generating JSON)

Step 0: Assess Depth Required

Before anything else, determine if this needs to be:

Simple/Conceptual

Abstract shapes, labels, relationships (mental models, philosophies)

Comprehensive/Technical

Concrete examples, code snippets, real data (systems, architectures, tutorials)

If comprehensive

Do research first. Look up actual specs, formats, event names, APIs.

Step 1: Understand Deeply

Read the content. For each concept, ask:

What does this concept

DO

? (not what IS it)

What relationships exist between concepts?

What's the core transformation or flow?

What would someone need to SEE to understand this?

(not just read about)

Step 2: Map Concepts to Patterns

For each concept, find the visual pattern that mirrors its behavior:

If the concept...

Use this pattern

Spawns multiple outputs

Fan-out

(radial arrows from center)

Combines inputs into one

Convergence

(funnel, arrows merging)

Has hierarchy/nesting

Tree

(lines + free-floating text)

Is a sequence of steps

Timeline

(line + dots + free-floating labels)

Loops or improves continuously

Spiral/Cycle

(arrow returning to start)

Is an abstract state or context

Cloud

(overlapping ellipses)

Transforms input to output

Assembly line

(before → process → after)

Compares two things

Side-by-side

(parallel with contrast)

Separates into phases

Gap/Break

(visual separation between sections)

Step 3: Ensure Variety

For multi-concept diagrams:

each major concept must use a different visual pattern

. No uniform cards or grids.

Step 4: Sketch the Flow

Before JSON, mentally trace how the eye moves through the diagram. There should be a clear visual story.

Step 5: Generate JSON

Only now create the Excalidraw elements.

See below for how to handle large diagrams.

Step 6: Render & Validate (MANDATORY)

After generating the JSON, you MUST run the render-view-fix loop until the diagram looks right. This is not optional — see the

Render & Validate

section below for the full process.

Large / Comprehensive Diagram Strategy

For comprehensive or technical diagrams, you MUST build the JSON one section at a time.

Do NOT attempt to generate the entire file in a single pass. This is a hard constraint — Claude Code has a ~32,000 token output limit per response, and a comprehensive diagram easily exceeds that in one shot. Even if it didn't, generating everything at once leads to worse quality. Section-by-section is better in every way.

The Section-by-Section Workflow

Phase 1: Build each section

Create the base file

with the JSON wrapper (

type

,

version

,

appState

,

files

) and the first section of elements.

Add one section per edit.

Each section gets its own dedicated pass — take your time with it. Think carefully about the layout, spacing, and how this section connects to what's already there.

Use descriptive string IDs

(e.g.,

"trigger_rect"

,

"arrow_fan_left"

) so cross-section references are readable.

Namespace seeds by section

(e.g., section 1 uses 100xxx, section 2 uses 200xxx) to avoid collisions.

Update cross-section bindings

as you go. When a new section's element needs to bind to an element from a previous section (e.g., an arrow connecting sections), edit the earlier element's

boundElements

array at the same time.

Phase 2: Review the whole

After all sections are in place, read through the complete JSON and check:

Are cross-section arrows bound correctly on both ends?

Is the overall spacing balanced, or are some sections cramped while others have too much whitespace?

Do IDs and bindings all reference elements that actually exist?

Fix any alignment or binding issues before rendering.

Phase 3: Render & validate

Now run the render-view-fix loop from the Render & Validate section. This is where you'll catch visual issues that aren't obvious from JSON — overlaps, clipping, imbalanced composition.

Section Boundaries

Plan your sections around natural visual groupings from the diagram plan. A typical large diagram might split into:

Section 1

Entry point / trigger

Section 2

First decision or routing

Section 3

Main content (hero section — may be the largest single section)

Section 4-N

Remaining phases, outputs, etc.

Each section should be independently understandable: its elements, internal arrows, and any cross-references to adjacent sections.

What NOT to Do

Don't generate the entire diagram in one response.

You will hit the output token limit and produce truncated, broken JSON. Even if the diagram is small enough to fit, splitting into sections produces better results.

Don't use a coding agent

to generate the JSON. The agent won't have sufficient context about the skill's rules, and the coordination overhead negates any benefit.

Don't write a Python generator script.

The templating and coordinate math seem helpful but introduce a layer of indirection that makes debugging harder. Hand-crafted JSON with descriptive IDs is more maintainable.

Visual Pattern Library

Fan-Out (One-to-Many)

Central element with arrows radiating to multiple targets. Use for: sources, PRDs, root causes, central hubs.

○

↗

□ → ○

↘

○

Convergence (Many-to-One)

Multiple inputs merging through arrows to single output. Use for: aggregation, funnels, synthesis.

○ ↘

○ → □

○ ↗

Tree (Hierarchy)

Parent-child branching with connecting lines and free-floating text (no boxes needed). Use for: file systems, org charts, taxonomies.

label

├── label

│ ├── label

│ └── label

└── label

Use

line

elements for the trunk and branches, free-floating text for labels.

Spiral/Cycle (Continuous Loop)

Elements in sequence with arrow returning to start. Use for: feedback loops, iterative processes, evolution.

□ → □

↑ ↓

□ ← □

Cloud (Abstract State)

Overlapping ellipses with varied sizes. Use for: context, memory, conversations, mental states.

Assembly Line (Transformation)

Input → Process Box → Output with clear before/after. Use for: transformations, processing, conversion.

○○○ → [PROCESS] → □□□

chaos order

Side-by-Side (Comparison)

Two parallel structures with visual contrast. Use for: before/after, options, trade-offs.

Gap/Break (Separation)

Visual whitespace or barrier between sections. Use for: phase changes, context resets, boundaries.

Lines as Structure

Use lines (type:

line

, not arrows) as primary structural elements instead of boxes:

Timelines

Vertical or horizontal line with small dots (10-20px ellipses) at intervals, free-floating labels beside each dot

Tree structures

Vertical trunk line + horizontal branch lines, with free-floating text labels (no boxes needed)

Dividers

Thin dashed lines to separate sections

Flow spines

A central line that elements relate to, rather than connecting boxes

Timeline: Tree:

●─── Label 1 │

│ ├── item

●─── Label 2 │ ├── sub

│ │ └── sub

●─── Label 3 └── item

Lines + free-floating text often creates a cleaner result than boxes + contained text.

Shape Meaning

Choose shape based on what it represents—or use no shape at all:

Concept Type

Shape

Why

Labels, descriptions, details

none

(free-floating text)

Typography creates hierarchy

Section titles, annotations

none

(free-floating text)

Font size/weight is enough

Markers on a timeline

small

ellipse

(10-20px)

Visual anchor, not container

Start, trigger, input

ellipse

Soft, origin-like

End, output, result

ellipse

Completion, destination

Decision, condition

diamond

Classic decision symbol

Process, action, step

rectangle

Contained action

Abstract state, context

overlapping

ellipse

Fuzzy, cloud-like

Hierarchy node

lines + text (no boxes)

Structure through lines

Rule

Default to no container. Add shapes only when they carry meaning. Aim for <30% of text elements to be inside containers.

Color as Meaning

Colors encode information, not decoration. Every color choice should come from

references/color-palette.md

— the semantic shape colors, text hierarchy colors, and evidence artifact colors are all defined there.

Key principles:

Each semantic purpose (start, end, decision, AI, error, etc.) has a specific fill/stroke pair

Free-floating text uses color for hierarchy (titles, subtitles, details — each at a different level)

Evidence artifacts (code snippets, JSON examples) use their own dark background + colored text scheme

Always pair a darker stroke with a lighter fill for contrast

Do not invent new colors.

If a concept doesn't fit an existing semantic category, use Primary/Neutral or Secondary.

Modern Aesthetics

For clean, professional diagrams:

Roughness

roughness: 0

— Clean, crisp edges. Use for modern/technical diagrams.

roughness: 1

— Hand-drawn, organic feel. Use for brainstorming/informal diagrams.

Default to 0

for most professional use cases.

Stroke Width

strokeWidth: 1

— Thin, elegant. Good for lines, dividers, subtle connections.

strokeWidth: 2

— Standard. Good for shapes and primary arrows.

strokeWidth: 3

— Bold. Use sparingly for emphasis (main flow line, key connections).

Opacity

Always use

opacity: 100

for all elements.

Use color, size, and stroke width to create hierarchy instead of transparency.

Small Markers Instead of Shapes

Instead of full shapes, use small dots (10-20px ellipses) as:

Timeline markers

Bullet points

Connection nodes

Visual anchors for free-floating text

Layout Principles

Hierarchy Through Scale

Hero

300×150 - visual anchor, most important

Primary

180×90

Secondary

120×60

Small

60×40

Whitespace = Importance

The most important element has the most empty space around it (200px+).

Flow Direction

Guide the eye: typically left→right or top→bottom for sequences, radial for hub-and-spoke.

Connections Required

Position alone doesn't show relationships. If A relates to B, there must be an arrow.

Text Rules

CRITICAL

The JSON

text

property contains ONLY readable words.

{

"id"

:

"myElement1"

,

"text"

:

"Start"

,

"originalText"

:

"Start"

}

Settings:

fontSize: 16

,

fontFamily: 3

,

textAlign: "center"

,

verticalAlign: "middle"

JSON Structure

{

"type"

:

"excalidraw"

,

"version"

:

2

,

"source"

:

"https://excalidraw.com"

,

"elements"

:

[

...

]

,

"appState"

:

{

"viewBackgroundColor"

:

"#ffffff"

,

"gridSize"

:

20

}

,

"files"

:

{

}

Element Templates

See

references/element-templates.md

for copy-paste JSON templates for each element type (text, line, dot, rectangle, arrow). Pull colors from

references/color-palette.md

based on each element's semantic purpose.

Render & Validate (MANDATORY)

You cannot judge a diagram from JSON alone. After generating or editing the Excalidraw JSON, you MUST render it to PNG, view the image, and fix what you see — in a loop until it's right. This is a core part of the workflow, not a final check.

How to Render

cd

.claude/skills/excalidraw-diagram/references

&&

uv run python render_excalidraw.py

<

path-to-file.excalidraw

>

This outputs a PNG next to the

.excalidraw

file. Then use the

Read tool

on the PNG to actually view it.

The Loop

After generating the initial JSON, run this cycle:

1. Render & View

— Run the render script, then Read the PNG.

2. Audit against your original vision

— Before looking for bugs, compare the rendered result to what you designed in Steps 1-4. Ask:

Does the visual structure match the conceptual structure you planned?

Does each section use the pattern you intended (fan-out, convergence, timeline, etc.)?

Does the eye flow through the diagram in the order you designed?

Is the visual hierarchy correct — hero elements dominant, supporting elements smaller?

For technical diagrams: are the evidence artifacts (code snippets, data examples) readable and properly placed?

3. Check for visual defects:

Text clipped by or overflowing its container

Text or shapes overlapping other elements

Arrows crossing through elements instead of routing around them

Arrows landing on the wrong element or pointing into empty space

Labels floating ambiguously (not clearly anchored to what they describe)

Uneven spacing between elements that should be evenly spaced

Sections with too much whitespace next to sections that are too cramped

Text too small to read at the rendered size

Overall composition feels lopsided or unbalanced

4. Fix

— Edit the JSON to address everything you found. Common fixes:

Widen containers when text is clipped

Adjust

x

/

y

coordinates to fix spacing and alignment

Add intermediate waypoints to arrow

points

arrays to route around elements

Reposition labels closer to the element they describe

Resize elements to rebalance visual weight across sections

5. Re-render & re-view

— Run the render script again and Read the new PNG.

6. Repeat

— Keep cycling until the diagram passes both the vision check (Step 2) and the defect check (Step 3). Typically takes 2-4 iterations. Don't stop after one pass just because there are no critical bugs — if the composition could be better, improve it.

When to Stop

The loop is done when:

The rendered diagram matches the conceptual design from your planning steps

No text is clipped, overlapping, or unreadable

Arrows route cleanly and connect to the right elements

Spacing is consistent and the composition is balanced

You'd be comfortable showing it to someone without caveats

First-Time Setup

If the render script hasn't been set up yet:

cd

.claude/skills/excalidraw-diagram/references

uv

sync

uv run playwright

install

chromium

Quality Checklist

Depth & Evidence (Check First for Technical Diagrams)

Research done

Did you look up actual specs, formats, event names?

Evidence artifacts

Are there code snippets, JSON examples, or real data?

Multi-zoom

Does it have summary flow + section boundaries + detail?

Concrete over abstract

Real content shown, not just labeled boxes?

Educational value

Could someone learn something concrete from this?

Conceptual

Isomorphism

Does each visual structure mirror its concept's behavior?

Argument

Does the diagram SHOW something text alone couldn't?

Variety

Does each major concept use a different visual pattern?

No uniform containers

Avoided card grids and equal boxes?

Container Discipline

Minimal containers

Could any boxed element work as free-floating text instead?

Lines as structure

Are tree/timeline patterns using lines + text rather than boxes?

Typography hierarchy

Are font size and color creating visual hierarchy (reducing need for boxes)?

Structural

Connections

Every relationship has an arrow or line

Flow

Clear visual path for the eye to follow

Hierarchy

Important elements are larger/more isolated

Technical

Text clean

:

text

contains only readable words

Font

:

fontFamily: 3

Roughness

:

roughness: 0

for clean/modern (unless hand-drawn style requested)

Opacity

:

opacity: 100

for all elements (no transparency)

Container ratio

<30% of text elements should be inside containers

Visual Validation (Render Required)

Rendered to PNG

Diagram has been rendered and visually inspected

No text overflow

All text fits within its container

No overlapping elements

Shapes and text don't overlap unintentionally

Even spacing

Similar elements have consistent spacing

Arrows land correctly

Arrows connect to intended elements without crossing others

Readable at export size

Text is legible in the rendered PNG
Balanced composition: No large empty voids or overcrowded regions

安装