Select a trainer
Updated on 26 Jun 2025

| Trainer | Description | Supported Data Format |
| --- | --- | --- |
| Pre-training | Initial training phase using large unlabeled data for language understanding | Corpus |
| SFT | Supervised fine-tuning trainer, aligns model behavior using labeled data | Alpaca / ShareGPT / ShareGPT_Image |
| DPO | Direct Preference Optimization trainer, aligns model with human preference signals directly | ShareGPT / ShareGPT_Image |
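
The trainer-to-format mapping above can be checked before a job is submitted. The following is a minimal sketch that assumes nothing about the platform's actual API: the `SUPPORTED_FORMATS` dictionary and both helper functions are hypothetical names introduced only for illustration, and the dictionary simply restates the table.

```python
# Hypothetical lookup table restating the trainer table; not a platform API.
SUPPORTED_FORMATS = {
    "Pre-training": {"Corpus"},
    "SFT": {"Alpaca", "ShareGPT", "ShareGPT_Image"},
    "DPO": {"ShareGPT", "ShareGPT_Image"},
}


def is_supported(trainer: str, data_format: str) -> bool:
    """Return True if the trainer accepts the given data format."""
    if trainer not in SUPPORTED_FORMATS:
        raise ValueError(f"Unknown trainer: {trainer!r}")
    return data_format in SUPPORTED_FORMATS[trainer]


def trainers_for_format(data_format: str) -> list[str]:
    """List the trainers that accept the given data format, in table order."""
    return [
        trainer
        for trainer, formats in SUPPORTED_FORMATS.items()
        if data_format in formats
    ]


if __name__ == "__main__":
    print(is_supported("DPO", "Alpaca"))
    print(trainers_for_format("ShareGPT"))
```

A check like this makes a mismatch visible early, for example an Alpaca dataset paired with the DPO trainer, which the table lists as accepting only ShareGPT and ShareGPT_Image.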