Skip to main content
Back to top
Ctrl
+
K
Get Started
Quick Start
Usage Guide
FAQ
Dense
Qwen3-4B with 8xH100
GLM4-9B with 8xH100
MoE
Qwen3-30B-A3B with 8xH100
GLM-4.5 with 64xH100
DeepSeek R1 with 128xH100
Advanced Features
Reproducibility
Speculative Decoding – Usage Guide
Other Usage
SFT Qwen3-4B-Base
Search-R1 lite
Fully Asynchronous Rollout Example
Retool: from SFT to RL
Multi-Agent RL
Developer Guide
Debugging
Hardware Platforms
AMD
Blogs
v0.1.0: Redefining High-Performance RL Training Frameworks
slime: An SGLang-Native Post-Training Framework for RL Scaling
Repository
Open issue
Index