Community Plugin

simpo-training

Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when you want simpler, faster training than DPO/PPO.

1.0.0

Updated 25 days ago

Capabilities

Commands

Agents

Skills

Hooks

MCP Servers

Install

Add the repository(one-time)

/plugin marketplace add zechenzhangAGI/AI-research-SKILLs

Install the plugin

/plugin install simpo-training@zechenzhangAGI/AI-research-SKILLs

Component Details

No components detected in this plugin's metadata.

Stats

Stars00123456789

MaintenanceGood

Last Commit25 days ago

Links

View on GitHub

View README

Plugin Marketplace JSON

Similar Plugins

learning-output-style

Interactive learning mode that requests meaningful code contributions at decision points (mimics the unshipped Learning output style)

45.7K

code-review

Automated code review for pull requests using multiple specialized agents with confidence-based scoring

simpo-training

Similar Plugins

learning-output-style

code-review

feature-dev

pr-review-toolkit