Skill

dataset-evaluation

Validates dataset formatting and quality for SageMaker model fine-tuning (SFT, DPO, RLVR). Detects file format, checks schema compliance against model and technique, reports readiness.

Python

data-engineering

ai-ml

Popularity

Parent stars

801

Parent forks

119

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/sagemaker-ai:dataset-evaluation

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Follow the workflow shown below. Locate the dataset, check the file type, and resolve any issues with missing files or wrong file types. Determine the fine-tuning model and fine-tuning strategy. Run the appropriate validation based on the model family. Summarize the results: is the dataset ready for fine-tuning?

Supporting Files

references/custom-scorer-evaluation-dataset-formats.md

references/strategy_data_requirements.md

scripts/format_detector.py

SKILL.md

72 lines · ~1.3k tokens

Stats

Language: Shell

Maintenance: Excellent

Last Commit: Jun 10, 2026

Workflow Instruction

Prerequisites

The SDK environment has been verified (SDK version, region, execution role). If this has not been done, activate the sdk-getting-started skill first.

Workflow

1. Locate Dataset:
- The full path may be a local file path or an S3 URI.
- Resolve the full path to the dataset file, confirm that read permissions are available, and help the user if the file is not found.
2. Determine strategy and model:
- File formatting depends on the currently selected fine-tuning strategy and fine-tuning base model.
- If the strategy and model are already known from the conversation context (e.g., selected via the model-selection and finetuning-technique skills), use them.
- If not available in context, activate the model-selection and/or finetuning-technique skills to determine them before proceeding.
- Exception: If the user is validating an evaluation dataset (not a training dataset), neither model nor technique is required, because the format detector can validate eval format (query/response structure) independently. Do not block on model-selection or finetuning-technique for eval dataset validation.
3. Check File Formatting: Run the tool format_detector.py to make sure the file conforms to formatting requirements.
- Pass the full path directly to the format_detector script as an argument
- Do not pass the model and strategy as arguments
- Do not download data from S3
- Do not make local copies of data
4. Summarize Results: Tell the user if their data is ready
- Examine the output of format_detector and compare it to the known strategy and model
- Important: training datasets and evaluation datasets have different format requirements.
  - Training datasets must match the fine-tuning strategy format per references/strategy_data_requirements.md
  - Evaluation datasets (for model evaluation) must match one of the SageMaker evaluation dataset formats.
  - Custom Scorer evaluation datasets have scorer-specific requirements. If the dataset is intended for Custom Scorer evaluation (Prime Math, Prime Code, or Custom Lambda), read references/custom-scorer-evaluation-dataset-formats.md and validate against the scorer-specific schema. The scorer type should be known from conversation context (determined in the model-evaluation skill).
- Report back to the user whether their current dataset is valid for its intended purpose
- Warn the user if their dataset is valid, but for a different strategy or model
- Warn the user if their dataset is not valid for any strategy/model pair
- If the user plans to fine-tune a model with the evaluated dataset, it needs to be uploaded to an S3 bucket in the same region as the planned training job (usually the default region). Warn the user if this is NOT the case.
- If the dataset is NOT in the necessary format, recommend transforming it using the dataset-transformation skill, wait for user confirmation, and update the plan based on their response
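
The reporting rules in the Summarize Results step can be sketched as a small decision function. This is an illustration only. It assumes the detector output has already been reduced to a set of (strategy, model) pairs that the dataset satisfies, which is a hypothetical representation and not the documented output of format_detector.py. The names Verdict and summarize, and the model labels in the usage lines, are likewise invented for the example.

```python
from enum import Enum
from typing import Set, Tuple

# (strategy, model) pair, e.g. ("SFT", "model-a"). Hypothetical representation.
Pair = Tuple[str, str]


class Verdict(Enum):
    READY = "valid for the intended strategy and model"
    VALID_FOR_OTHER = "valid, but for a different strategy or model"
    INVALID = "not valid for any strategy/model pair"


def summarize(intended: Pair, satisfied: Set[Pair]) -> Verdict:
    """Map the intended pair and the satisfied pairs to one of three outcomes."""
    if intended in satisfied:
        return Verdict.READY
    if satisfied:
        # The file matches some known format, just not the one the user needs.
        return Verdict.VALID_FOR_OTHER
    return Verdict.INVALID


# Usage with made-up model labels:
ready = summarize(("SFT", "model-a"), {("SFT", "model-a")})
other = summarize(("DPO", "model-a"), {("SFT", "model-a")})
invalid = summarize(("SFT", "model-a"), set())
```

Only the second and third outcomes call for a warning; the third is also the case where recommending the dataset-transformation skill applies.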

Messages to the User

Introduction: "This skill checks the structure of your dataset for model fine-tuning."
File types: This skill applies to files that are formatted according to the Amazon SageMaker AI Developer Guide.

Resources

scripts/format_detector.py is a self-contained format validation script that can be run independently
model-selection and finetuning-technique skills should have already determined the base model and fine-tuning strategy
references/strategy_data_requirements.md contains data format requirements per strategy

Script Details

scripts/format_detector.py is a self-contained format validation script that can be run independently:

# With the file path argument identified in workflow step 1
python scripts/format_detector.py local_path/to/dataset
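
The same invocation can be sketched from Python. This is a minimal sketch, assuming it runs from the skill's root directory so that scripts/format_detector.py resolves. The helpers classify_dataset_path, build_detector_command, and run_detector are hypothetical names and not part of the skill. Following the workflow, the path is passed through unchanged, S3 data is not downloaded, and model and strategy are not passed as arguments. The script's output format is not described here, so the sketch returns the raw process result without parsing it.

```python
import subprocess
import sys
from pathlib import Path
from typing import List
from urllib.parse import urlparse


def classify_dataset_path(path: str) -> str:
    """Return "s3" for an S3 URI, otherwise "local". Hypothetical helper."""
    parsed = urlparse(path)
    if parsed.scheme == "s3":
        if not parsed.netloc:
            raise ValueError(f"S3 URI has no bucket name: {path!r}")
        return "s3"
    return "local"


def build_detector_command(
    dataset_path: str, script: str = "scripts/format_detector.py"
) -> List[str]:
    """Build the argument list; the dataset path is the only script argument."""
    if classify_dataset_path(dataset_path) == "local":
        # Fail early so the user can be helped when the file is not found.
        if not Path(dataset_path).is_file():
            raise FileNotFoundError(f"Dataset not found: {dataset_path}")
    # S3 URIs are passed as-is: no download, no local copy.
    return [sys.executable, script, dataset_path]


def run_detector(dataset_path: str) -> subprocess.CompletedProcess:
    """Run the detector and capture stdout/stderr as text for later inspection."""
    cmd = build_detector_command(dataset_path)
    return subprocess.run(cmd, capture_output=True, text=True, check=False)


# Example (not executed here): result = run_detector("s3://my-bucket/train.jsonl")
# then inspect result.returncode and result.stdout.
```

Using check=False keeps a non-zero exit code from raising, so the caller can report the script's own messages to the user.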

References

scripts/format_detector.py — Self-contained format validation script
references/strategy_data_requirements.md — Data format requirements per strategy
