From sagemaker-ai
Validates dataset formatting and quality for SageMaker model fine-tuning (SFT, DPO, RLVR). Detects file format, checks schema compliance against model and technique, reports readiness.
How this skill is triggered — by the user, by Claude, or both
Slash command
/sagemaker-ai:dataset-evaluationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Follow the workflow shown below. Locate the dataset, check the file type, and resolve any issues with missing files or wrong file types. Determine the fine-tuning model and fine-tuning strategy. Run the appropriate validation based on the model family. Summarize the results: is the dataset ready for fine-tuning?
Follow the workflow shown below. Locate the dataset, check the file type, and resolve any issues with missing files or wrong file types. Determine the fine-tuning model and fine-tuning strategy. Run the appropriate validation based on the model family. Summarize the results: is the dataset ready for fine-tuning?
sdk-getting-started skill first.Locate Dataset:
Determine strategy and model:
Check File Formatting: Run the tool format_detector.py to make sure the file conforms to formatting requirements.
Summarize Results: Tell the user if their data is ready
references/strategy_data_requirements.mdreferences/custom-scorer-evaluation-dataset-formats.md and validate against the scorer-specific schema. The scorer type should be known from conversation context (determined in the model-evaluation skill).# With the file path argument identified in workflow step 1
python scripts/format_detector.py local_path/to/dataset
scripts/format_detector.py — Self-contained format validation scriptreferences/strategy_data_requirements.md — Data format requirements per strategynpx claudepluginhub awslabs/agent-plugins --plugin sagemaker-aiData validation and pipeline testing utilities for ML training projects. Validates datasets, model checkpoints, training pipelines, and dependencies. Use when validating training data, checking model outputs, testing ML pipelines, verifying dependencies, debugging training failures, or ensuring data quality before training.
Generates code to transform ML datasets between training/evaluation schemas (OpenAI, SageMaker, HuggingFace, Bedrock, VERL, JSONL).
Uploads, validates, and manages datasets for DataRobot projects. Handles file uploads, data quality checks, schema review, and prediction dataset preparation.