Generate publication-ready scientific figures for research papers. Use this skill when:

- Creating any figures for manuscripts (Nature, Science, Cell quality)
- Generating data plots from CSV/Excel/JSON datasets
- Creating conceptual diagrams, flowcharts, or schematics
- Composing multi-panel figures with letter labels (A, B, C, D)
- Visualizing architectures, timelines, or workflows

This skill handles THREE types of figures:

1. DATA PLOTS - Line, scatter, bar, violin, heatmap, etc. from tabular data
2. SCHEMATICS - AI-generated conceptual diagrams and illustrations
3. COMPOSITIONS - Multi-panel figures combining plots and/or schematics

Prerequisites:

- For plots: pandas, matplotlib, seaborn
- For schematics: OPENROUTER_AI_API_KEY environment variable
- For composition: matplotlib or PIL
This skill inherits all available tools. When active, it can use any tool Claude has access to.
Bundled files:

- assets/palettes.json
- assets/paper.mplstyle
- examples/compose_2x2_figure.json
- examples/fig_bar_comparison.json
- examples/fig_heatmap_corr.json
- examples/fig_line_stress_strain.json
- examples/fig_multipanel_2x2.json
- examples/fig_violin_groups.json
- references/plot_catalog.md
- references/plot_spec.md
- references/qa_checklist.md
- references/schematic_guide.md
- references/style_guide.md
- scripts/figure_compose.py
- scripts/generate_image.py
- scripts/infer_dataset.py
- scripts/plot_cli.py
- scripts/validate_figure.py

Generate publication-ready figures for research papers: data plots, conceptual schematics, and multi-panel compositions.
This skill provides three complementary tools:
| Tool | Purpose | Input |
|---|---|---|
plot_cli.py | Data visualization | CSV/Excel/JSON → plots |
generate_image.py | Conceptual diagrams | Text prompt → schematic |
figure_compose.py | Multi-panel figures | Images → composed figure |
skills/scientific-figures/scripts/
├── plot_cli.py # Data plots (seaborn/matplotlib)
├── generate_image.py # AI-generated schematics
├── figure_compose.py # Multi-panel composition
├── validate_figure.py # Quality validation
└── infer_dataset.py # Dataset analysis
# Create PlotSpec and render
python skills/scientific-figures/scripts/plot_cli.py --spec my_plot.json
python skills/scientific-figures/scripts/generate_image.py \
"Create a publication-quality schematic showing [concept]..." \
--output fig1_schematic.png
# Compose panels A, B, C, D into a 2x2 figure
python skills/scientific-figures/scripts/figure_compose.py \
--panels fig1a.png fig1b.png fig1c.png fig1d.png \
--layout 2x2 \
--output Figure1.pdf
### Data Plots (`plot_cli.py`)

Generate publication-ready plots from tabular data using seaborn/matplotlib.
| Kind | Use Case |
|---|---|
line | Trends, time series, stress-strain |
scatter | Correlations, x-y relationships |
bar | Group comparisons, counts |
point | Group means with CI/SEM |
box | Distribution comparisons |
violin | Distribution shapes |
swarm | Individual points (small N) |
strip | Jittered points (larger N) |
hist | Single distribution |
kde | Density estimation |
heatmap | Matrices, correlations |
regplot | Scatter with regression |
{
"data": [{"id": "main", "path": "data.csv"}],
"figure": {
"layout": {"type": "single", "figsize_in": [3.35, 2.4]},
"panels": [{
"id": "A",
"kind": "line",
"data_id": "main",
"mapping": {"x": "time", "y": "value", "hue": "group"},
"labels": {"xlabel": "Time (s)", "ylabel": "Value (units)"}
}]
},
"output": {"outdir": "outputs/fig1", "basename": "fig1", "formats": ["pdf", "svg", "png"]}
}
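Before rendering, it can help to sanity-check a spec file against the structure shown above. A minimal sketch (the required keys are inferred from this example rather than from the full schema in references/plot_spec.md):

```python
import json
import sys

def check_plotspec(path):
    """Lightly sanity-check a PlotSpec file against the structure shown above."""
    with open(path) as f:
        spec = json.load(f)
    problems = []
    for key in ("data", "figure", "output"):
        if key not in spec:
            problems.append(f"missing top-level key: {key}")
    data_ids = {d.get("id") for d in spec.get("data", [])}
    for panel in spec.get("figure", {}).get("panels", []):
        if panel.get("data_id") not in data_ids:
            problems.append(f"panel {panel.get('id')}: unknown data_id {panel.get('data_id')!r}")
        if "mapping" not in panel:
            problems.append(f"panel {panel.get('id')}: no mapping defined")
    return problems

if __name__ == "__main__":
    issues = check_plotspec(sys.argv[1])
    print("\n".join(issues) if issues else "spec looks OK")
```

Run it on the spec file (e.g., my_plot.json) before invoking plot_cli.py; it only catches structural slips, not rendering issues.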
# Render from PlotSpec
python skills/scientific-figures/scripts/plot_cli.py --spec figure.json
# Analyze dataset first
python skills/scientific-figures/scripts/infer_dataset.py data.csv --suggest-plots
See references/plot_spec.md for full schema and references/plot_catalog.md for all plot types.
### Schematics (`generate_image.py`)

Generate AI-powered conceptual diagrams for research papers.
python skills/scientific-figures/scripts/generate_image.py "prompt" --output figure.png
Every prompt MUST include:

- The content and layout to depict
- A style reference (target journal quality)
- Color specifications (hex codes where exact colors matter)
- Background and label requirements (white background, sans-serif labels)
- The aspect ratio

Example prompts:
Architecture Diagram:
Create a publication-quality neural network architecture diagram.
Vertical stack: Input layer (bottom), 3 hidden layers (middle), Output layer (top).
Arrows showing data flow between layers.
Style: Nature Machine Intelligence quality, clean boxes with thin borders.
Colors: Input (blue #2166AC), hidden (gray #878787), output (orange #D6604D).
White background, sans-serif labels. Aspect ratio: 4:3.
Timeline:
Create a publication-quality timeline showing evolution of deep learning 2012-2024.
Horizontal timeline with milestones: AlexNet 2012, ResNet 2015, Transformer 2017, GPT-3 2020.
Style: Nature journal, clean, white background.
Color coding by era. Sans-serif labels. Aspect ratio: 3:1.
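To keep schematic prompts consistent, a small helper can assemble the required elements (content and layout, style, colors, background and labels, aspect ratio) into one string. This is a sketch, not part of the shipped scripts:

```python
def build_schematic_prompt(content, style="Nature journal quality, clean, thin borders",
                           colors=None, aspect_ratio="4:3"):
    """Assemble a schematic prompt containing all required elements."""
    parts = [f"Create a publication-quality {content}.", f"Style: {style}."]
    if colors:
        color_str = ", ".join(f"{role} ({hexcode})" for role, hexcode in colors.items())
        parts.append(f"Colors: {color_str}.")
    parts.extend(["White background, sans-serif labels.", f"Aspect ratio: {aspect_ratio}."])
    return " ".join(parts)

# Example: rebuild something close to the architecture-diagram prompt above
print(build_schematic_prompt(
    "neural network architecture diagram; vertical stack with input layer (bottom), "
    "3 hidden layers (middle), output layer (top); arrows showing data flow",
    colors={"input": "#2166AC", "hidden": "#878787", "output": "#D6604D"},
))
```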
See references/schematic_guide.md for detailed prompt templates.
### Multi-Panel Composition (`figure_compose.py`)

Combine multiple images into publication-ready composite figures with panel labels.
| Layout | Use Case |
|---|---|
1x2 | Two panels side by side |
2x1 | Two panels stacked |
2x2 | Four-panel grid |
1x3 | Three panels in a row |
2x3 | Six-panel grid |
3x2 | Six panels (3 rows × 2 cols) |
# Basic: auto-labels A, B, C...
python skills/scientific-figures/scripts/figure_compose.py \
--panels fig1a.png fig1b.png fig1c.png fig1d.png \
--layout 2x2 \
--output Figure1.pdf
# Custom labels
python skills/scientific-figures/scripts/figure_compose.py \
--panels schematic.png plot.png \
--layout 1x2 \
--labels "a" "b" \
--output Figure2.pdf
# No labels
python skills/scientific-figures/scripts/figure_compose.py \
--panels img1.png img2.png img3.png \
--layout 1x3 \
--no-labels \
--output Figure3.png
# Custom size
python skills/scientific-figures/scripts/figure_compose.py \
--panels a.png b.png c.png d.png \
--layout 2x2 \
--figsize 7 5 \
--dpi 600 \
--output Figure4.pdf
{
"panels": [
{"path": "fig1a_schematic.png", "label": "A"},
{"path": "fig1b_data.png", "label": "B"},
{"path": "fig1c_results.png", "label": "C"},
{"path": "fig1d_comparison.png", "label": "D"}
],
"layout": {"rows": 2, "cols": 2},
"style": {
"figsize_in": [7, 5],
"dpi": 300,
"label_fontsize": 14,
"label_fontweight": "bold",
"label_position": "upper-left",
"spacing": {"wspace": 0.05, "hspace": 0.05}
},
"output": {"path": "Figure1.pdf"}
}
python skills/scientific-figures/scripts/figure_compose.py --spec compose_spec.json
CRITICAL: Individual panels must be generated at the correct size/aspect ratio BEFORE composition to ensure proper fit and consistent quality.
When planning a multi-panel figure, calculate individual panel sizes from the target figure dimensions, the grid layout (rows × columns), and the spacing and margins between panels:
Formula:
panel_width = (figure_width - (ncols-1) × spacing - 2 × margin) / ncols
panel_height = (figure_height - (nrows-1) × spacing - 2 × margin) / nrows
| Final Figure | Layout | Panel Size (inches) | Panel Size (pixels @ 300 DPI) |
|---|---|---|---|
| 7" × 5" (double col) | 1×2 | 3.3 × 4.5 | 990 × 1350 |
| 7" × 5" (double col) | 2×1 | 6.5 × 2.2 | 1950 × 660 |
| 7" × 5" (double col) | 2×2 | 3.3 × 2.2 | 990 × 660 |
| 7" × 6" (double col) | 2×2 | 3.3 × 2.7 | 990 × 810 |
| 7" × 4" (double col) | 1×3 | 2.1 × 3.5 | 630 × 1050 |
| 7" × 6" (double col) | 2×3 | 2.1 × 2.7 | 630 × 810 |
| 3.5" × 5" (single col) | 2×1 | 3.2 × 2.2 | 960 × 660 |
| Panel Content | Recommended Aspect Ratio |
|---|---|
| Standard plot | 4:3 (1.33) |
| Wide timeline | 3:1 |
| Square heatmap | 1:1 |
| Workflow diagram | 16:9 (1.78) |
| Tall schematic | 3:4 (0.75) |
# Target: 7" × 6" double-column figure with 2×2 layout
figure_width = 7.0 # inches
figure_height = 6.0
nrows, ncols = 2, 2
spacing = 0.3 # inches between panels
margin = 0.1 # inches around edge
# Calculate panel dimensions
panel_width = (figure_width - (ncols-1)*spacing - 2*margin) / ncols
# = (7.0 - 0.3 - 0.2) / 2 = 3.25 inches
panel_height = (figure_height - (nrows-1)*spacing - 2*margin) / nrows
# = (6.0 - 0.3 - 0.2) / 2 = 2.75 inches
# Generate each panel at this size:
# - PlotSpec: "figsize_in": [3.25, 2.75]
# - Schematic: specify "Aspect ratio: 1.18:1" (3.25/2.75)
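The same arithmetic as a reusable helper (a sketch that mirrors the formula above; the spacing and margin defaults are the values used in the worked example):

```python
def panel_size(figure_w, figure_h, nrows, ncols, spacing=0.3, margin=0.1):
    """Return (panel_width, panel_height) in inches for an evenly spaced grid."""
    panel_w = (figure_w - (ncols - 1) * spacing - 2 * margin) / ncols
    panel_h = (figure_h - (nrows - 1) * spacing - 2 * margin) / nrows
    return panel_w, panel_h

w, h = panel_size(7.0, 6.0, nrows=2, ncols=2)
print(f"figsize_in: [{w:.2f}, {h:.2f}]")                         # figsize_in: [3.25, 2.75]
print(f"pixels @ 300 DPI: {round(w * 300)} x {round(h * 300)}")  # 975 x 825
print(f"aspect ratio: {w / h:.2f}:1")                            # 1.18:1
```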
Before composing, ensure each panel meets the checklist below (a quick verification sketch follows it):
[ ] Generated at correct dimensions for target layout
[ ] Has consistent DPI (300+ for print)
[ ] Uses matching font sizes across all panels
[ ] Has appropriate margins (not too tight to edges)
[ ] White/transparent background (for seamless composition)
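A quick way to verify these points programmatically before calling figure_compose.py — a minimal sketch using Pillow. The expected panel size and the 300 DPI threshold are assumptions to adapt to your target layout, and DPI is only checked when the image carries that metadata:

```python
from PIL import Image

def check_panel(path, expected_in=(3.25, 2.75), min_dpi=300):
    """Report size/DPI mismatches for one panel image."""
    img = Image.open(path)
    issues = []
    dpi = img.info.get("dpi", (None, None))[0]   # present only if the file stores DPI metadata
    if dpi is not None and dpi < min_dpi:
        issues.append(f"DPI {dpi} < {min_dpi}")
    exp_px = (round(expected_in[0] * min_dpi), round(expected_in[1] * min_dpi))
    if abs(img.width - exp_px[0]) > 0.05 * exp_px[0] or abs(img.height - exp_px[1]) > 0.05 * exp_px[1]:
        issues.append(f"size {img.width}x{img.height}px, expected ~{exp_px[0]}x{exp_px[1]}px")
    if img.mode not in ("RGBA", "LA") and "transparency" not in img.info:
        issues.append("no alpha channel (fine if the background is pure white)")
    return issues

for p in ["fig1a.png", "fig1b.png", "fig1c.png", "fig1d.png"]:
    print(p, check_panel(p) or "OK")
```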
IMPORTANT: After generating any figure, Claude MUST validate the output quality before presenting to the user.
Run the validation script on generated outputs:
python skills/scientific-figures/scripts/validate_figure.py --outdir outputs/fig1/
After generating a figure, Claude should use the Read tool to view the image and check:
[ ] Image is not pixelated or blurry
[ ] Lines are crisp and well-defined
[ ] Text is sharp and readable
[ ] No compression artifacts
[ ] All axis labels present with units
[ ] All text is readable (not too small)
[ ] No overlapping text
[ ] Labels are not cut off at edges
[ ] Font is consistent (sans-serif)
[ ] Elements are well-balanced
[ ] Adequate whitespace
[ ] No elements extending beyond bounds
[ ] Proper alignment of multi-panel figures
[ ] Panel labels (A, B, C) visible and correctly positioned
[ ] Colors match specification
[ ] Sufficient contrast for readability
[ ] Colorblind-safe palette used
[ ] Background is clean (white, no artifacts)
[ ] All requested elements present
[ ] Data appears correctly plotted
[ ] Legend matches data
[ ] Scale/axes appropriate
1. GENERATE figure using appropriate script
2. READ the output image file to visually inspect
3. RUN validate_figure.py for automated checks
4. EVALUATE against checklist above
5. IF issues found:
- Identify specific problems
- Adjust parameters/spec
- RE-GENERATE
- REPEAT validation
6. ONLY present to user when quality confirmed
| Issue | Likely Cause | Fix |
|---|---|---|
| Text too small | Font size not scaled for figure size | Increase label_fontsize or font_scale |
| Blurry output | Low DPI | Set dpi: 600 for line art |
| Cut-off labels | Figure too small or tight bbox | Increase figsize or pad_inches |
| Overlapping legend | Auto-placement failed | Specify legend.loc explicitly |
| Misaligned panels | Inconsistent panel sizes | Pre-calculate sizes (see above) |
| Colors wrong | Palette not applied | Explicitly set palette in spec |
| Pixelated schematic | AI generated low-res | Request "high resolution" in prompt |
Is the figure publication-ready?
│
├─ Text readable at print size?
│ └─ NO → Increase font sizes, regenerate
│
├─ All elements present?
│ └─ NO → Update spec/prompt, regenerate
│
├─ Colors correct & accessible?
│ └─ NO → Fix palette specification, regenerate
│
├─ Layout balanced?
│ └─ NO → Adjust figsize or spacing, regenerate
│
├─ Resolution sufficient (300+ DPI)?
│ └─ NO → Increase DPI setting, regenerate
│
└─ ALL YES → Present to user
Create Figure 1 with: (A) overview schematic, (B) data plot, (C) results plot, (D) comparison
python skills/scientific-figures/scripts/generate_image.py \
"Create a publication-quality overview schematic showing machine learning pipeline for materials discovery. Horizontal flow: Data (left) → Features → Model → Predictions (right). Style: Nature journal, clean, white background, blue-orange palette." \
--output panels/fig1a_overview.png
Create fig1bcd.json:
{
"data": [{"id": "main", "path": "results.csv"}],
"figure": {
"layout": {"type": "grid", "nrows": 1, "ncols": 3, "figsize_in": [9, 3]},
"panels": [
{"id": "B", "kind": "scatter", "data_id": "main", "mapping": {"x": "predicted", "y": "actual"}, "labels": {"xlabel": "Predicted", "ylabel": "Actual"}},
{"id": "C", "kind": "bar", "data_id": "main", "mapping": {"x": "method", "y": "accuracy"}, "labels": {"xlabel": "Method", "ylabel": "Accuracy"}},
{"id": "D", "kind": "violin", "data_id": "main", "mapping": {"x": "group", "y": "error"}, "labels": {"xlabel": "Group", "ylabel": "Error"}}
]
},
"output": {"outdir": "panels", "basename": "fig1bcd", "formats": ["png"]}
}
python skills/scientific-figures/scripts/plot_cli.py --spec fig1bcd.json
python skills/scientific-figures/scripts/figure_compose.py \
--panels panels/fig1a_overview.png panels/fig1b.png panels/fig1c.png panels/fig1d.png \
--layout 2x2 \
--figsize 7 6 \
--output Figure1.pdf
python skills/scientific-figures/scripts/validate_figure.py --outdir panels/
| Target | Width | Use |
|---|---|---|
| Single column | 85mm (3.35in) | Most single panels |
| 1.5 column | 114mm (4.5in) | Wide panels |
| Double column | 170mm (6.7in) | Multi-panel figures |
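When a journal specifies widths in millimetres, the conversion (25.4 mm per inch) gives figsize_in and pixel targets directly. A small sketch; the example heights are illustrative:

```python
def mm_to_figsize(width_mm, height_mm, dpi=300):
    """Convert journal dimensions in mm to (inches, pixels) at the given DPI."""
    w_in, h_in = width_mm / 25.4, height_mm / 25.4
    return (round(w_in, 2), round(h_in, 2)), (round(w_in * dpi), round(h_in * dpi))

print(mm_to_figsize(85, 60))    # single column:  ((3.35, 2.36), (1004, 709))
print(mm_to_figsize(170, 120))  # double column:  ((6.69, 4.72), (2008, 1417))
```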
Use colorblind-safe palettes (e.g., the seaborn `colorblind` palette; definitions are provided in assets/palettes.json).

| Format | Use |
|---|---|
| PDF | Primary vector (print) |
| SVG | Editable vector |
| PNG | Raster (≥300 DPI) |
IMPORTANT: Every figure generated MUST have an accompanying description file saved alongside it. This is critical for drafting manuscript captions and for reproducibility.
For every figure, Claude MUST save:
| File | Content |
|---|---|
{basename}.png/pdf | The figure image |
{basename}_description.md | Full description (see template below) |
After generating any figure (plot, schematic, or composition), Claude MUST create a {basename}_description.md file with this structure:
# Figure: {Title}
## Caption (for manuscript)
{One paragraph suitable for a journal figure caption. Include: what is shown,
key visual encodings, sample sizes, statistical measures, and main finding.}
## Panel Descriptions
{For multi-panel figures, describe each panel A, B, C, D...}
### Panel A: {Title}
- **Type**: {plot type or "AI-generated schematic"}
- **Content**: {what is visualized}
- **X-axis**: {variable (units)}
- **Y-axis**: {variable (units)}
- **Color/Hue**: {what color encodes}
- **Key finding**: {main takeaway}
## Technical Details
- **Data source**: {filename or "AI-generated"}
- **Dimensions**: {width × height in inches}
- **Resolution**: {DPI}
- **Output formats**: {PDF, PNG, SVG}
## Prompt Used (for schematics only)
{The exact prompt used to generate AI schematics, for reproducibility}
# Figure: Panel B - Method Accuracy Comparison
## Caption (for manuscript)
Classification accuracy comparison across deep learning methods (Baseline, CNN,
Transformer, Hybrid) evaluated on ImageNet and CIFAR-100 datasets. Error bars
indicate 95% confidence intervals (n=3 independent runs per condition). The
Hybrid CNN-Transformer model achieves significantly higher accuracy than other
methods on both datasets.
## Panel Descriptions
### Panel B: Accuracy Bar Chart
- **Type**: Grouped bar plot with error bars
- **Content**: Mean accuracy per method and dataset
- **X-axis**: Method (categorical)
- **Y-axis**: Accuracy (0-1 scale)
- **Color/Hue**: Dataset (blue=ImageNet, orange=CIFAR-100)
- **Error bars**: 95% CI
- **Key finding**: Hybrid model achieves ~94% accuracy on ImageNet
## Technical Details
- **Data source**: results.csv (24 observations)
- **Dimensions**: 3.3 × 2.7 inches
- **Resolution**: 300 DPI
- **Output formats**: PNG
# Figure: Panel A - Model Architecture
## Caption (for manuscript)
Architecture of the Hybrid CNN-Transformer model for image classification,
showing data flow from input image (224×224 RGB) through CNN backbone
(3 convolutional blocks), patch embedding, transformer encoder (6 blocks
with multi-head self-attention), to classification head. Dashed line
indicates skip connection from CNN features to transformer output.
## Panel Descriptions
### Panel A: Architecture Schematic
- **Type**: AI-generated conceptual diagram
- **Content**: Neural network architecture visualization
- **Layout**: Vertical stack, bottom-to-top data flow
- **Color coding**: Blue (input), Gray (CNN), Orange (Transformer), Green (output)
- **Key elements**: CNN blocks, patch embedding, transformer encoder, skip connection
## Technical Details
- **Data source**: AI-generated (Gemini 3 Pro Image)
- **Dimensions**: 3.3 × 2.7 inches
- **Resolution**: Native AI output
- **Output formats**: PNG
## Prompt Used
Create a publication-quality architecture diagram showing a Hybrid CNN-Transformer
model for image classification. Layout: Vertical stack, bottom-to-top data flow...
[full prompt here]
# Figure 1: Hybrid CNN-Transformer Performance Analysis
## Caption (for manuscript)
**Figure 1. Performance analysis of the Hybrid CNN-Transformer architecture.**
(A) Model architecture showing CNN backbone, patch embedding, and transformer
encoder with skip connections. (B) Classification accuracy across methods on
ImageNet and CIFAR-100 (error bars: 95% CI, n=3). (C) F1 score distributions
showing improved consistency with advanced architectures. (D) Accuracy vs.
training time trade-off demonstrating the Hybrid model's efficiency advantage.
## Panel Descriptions
### Panel A: Architecture Schematic
[description...]
### Panel B: Accuracy Comparison
[description...]
### Panel C: F1 Distribution
[description...]
### Panel D: Efficiency Trade-off
[description...]
## Technical Details
- **Layout**: 2×2 grid
- **Dimensions**: 7 × 6 inches (double column)
- **Resolution**: 300 DPI
- **Output formats**: PDF, PNG
1. GENERATE figure (plot/schematic/composition)
2. VALIDATE quality (visual inspection + validate_figure.py)
3. WRITE description file ← MANDATORY, do not skip!
4. Present both image AND description to user
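Step 3 can be scripted. A minimal sketch that writes {basename}_description.md following the template shown earlier; the metadata in the example call is illustrative:

```python
from pathlib import Path

def write_description(basename, title, caption, panels, technical, outdir="."):
    """Write {basename}_description.md next to the figure, following the template above."""
    lines = [f"# Figure: {title}", "", "## Caption (for manuscript)", caption, "",
             "## Panel Descriptions"]
    for label, fields in panels.items():
        fields = dict(fields)  # avoid mutating the caller's dict
        lines += ["", f"### Panel {label}: {fields.pop('title')}"]
        lines += [f"- **{k}**: {v}" for k, v in fields.items()]
    lines += ["", "## Technical Details"]
    lines += [f"- **{k}**: {v}" for k, v in technical.items()]
    path = Path(outdir) / f"{basename}_description.md"
    path.write_text("\n".join(lines) + "\n")
    return path

write_description(
    "fig1b", "Panel B - Method Accuracy Comparison",
    "Classification accuracy comparison across methods...",
    panels={"B": {"title": "Accuracy Bar Chart", "Type": "Grouped bar plot",
                  "Key finding": "Hybrid model achieves highest accuracy"}},
    technical={"Data source": "results.csv", "Resolution": "300 DPI"},
)
```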
| File | Purpose |
|---|---|
references/plot_spec.md | PlotSpec JSON schema |
references/plot_catalog.md | All plot types with examples |
references/style_guide.md | Typography, colors, sizing |
references/schematic_guide.md | Schematic prompt templates |
references/qa_checklist.md | Quality assurance checklist |
assets/paper.mplstyle | Matplotlib style file |
assets/palettes.json | Color palette definitions |
# For AI-generated schematics
export OPENROUTER_AI_API_KEY="your_api_key"
# Core (for plots)
pip install pandas matplotlib seaborn
# For image composition
pip install pillow
# Optional (for Parquet files)
pip install pyarrow
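To confirm the environment is ready before generating anything, a quick self-check sketch (the package names and environment variable are the ones listed above; nothing else is assumed):

```python
import importlib.util
import os

def check_environment():
    """Verify the optional dependencies and the schematic API key."""
    for pkg, purpose in [("pandas", "plots"), ("matplotlib", "plots"), ("seaborn", "plots"),
                         ("PIL", "composition"), ("pyarrow", "Parquet files, optional")]:
        status = "OK" if importlib.util.find_spec(pkg) else "MISSING"
        print(f"{pkg:<12} ({purpose}): {status}")
    key = "set" if os.environ.get("OPENROUTER_AI_API_KEY") else "NOT SET (schematics unavailable)"
    print(f"OPENROUTER_AI_API_KEY: {key}")

check_environment()
```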