core.builders.rl

core.builders.rl

Builder for RLHF trainers

Classes

Name Description
HFRLTrainerBuilder Trainer factory class for TRL-based RLHF trainers (e.g. DPO)

HFRLTrainerBuilder

core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor=None)

Trainer factory class for TRL-based RLHF trainers (e.g. DPO)