core.builders.rl
core.builders.rl
Builder for RLHF trainers
Classes
Name | Description |
---|---|
HFRLTrainerBuilder | Trainer factory class for TRL-based RLHF trainers (e.g. DPO) |
HFRLTrainerBuilder
=None) core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor
Trainer factory class for TRL-based RLHF trainers (e.g. DPO)