core.builders.rl
core.builders.rl
Builder for RLHF trainers
Classes
Name | Description |
---|---|
HFPPOTrainerBuilder | HF Factory class for PPO Trainer |
HFRLTrainerBuilder | Trainer factory class for TRL-based RLHF trainers (e.g. DPO) |
HFPPOTrainerBuilder
=None) core.builders.rl.HFPPOTrainerBuilder(cfg, model, tokenizer, processor
HF Factory class for PPO Trainer
HFRLTrainerBuilder
=None) core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor
Trainer factory class for TRL-based RLHF trainers (e.g. DPO)