slime Documentation
slime is an LLM post-training framework for RL scaling, providing two core capabilities:
High-Performance Training: Supports efficient training in various modes by connecting Megatron with SGLang;
Flexible Data Generation: Enables arbitrary training data generation workflows through custom data generation interfaces and server-based engines.
Get Started
Other Usage
Advanced Features
Developer Guide
Hardware Platforms