core.builders.rl

core.builders.rl

Builder for RLHF trainers

Classes

Name Description
HFPPOTrainerBuilder HF Factory class for PPO Trainer
HFRLTrainerBuilder Trainer factory class for TRL-based RLHF trainers (e.g. DPO)

HFPPOTrainerBuilder

core.builders.rl.HFPPOTrainerBuilder(cfg, model, tokenizer, processor=None)

HF Factory class for PPO Trainer

HFRLTrainerBuilder

core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor=None)

Trainer factory class for TRL-based RLHF trainers (e.g. DPO)