core.trainers.trl

Module for TRL trainer wrappers, including the PPO, CPO, KTO, ORPO, PRM, and reward trainers

Classes

Name                  Description
AxolotlCPOTrainer     Extend the base CPOTrainer for axolotl helpers
AxolotlKTOTrainer     Extend the base KTOTrainer for axolotl helpers
AxolotlORPOTrainer    Extend the base ORPOTrainer for axolotl helpers
AxolotlPRMTrainer     Extend the base trl.PRMTrainer for axolotl helpers
AxolotlRewardTrainer  Extend the base RewardTrainer for axolotl helpers
TRLPPOTrainer         Wrapper for TRL PPO trainer to handle customizations

AxolotlCPOTrainer

core.trainers.trl.AxolotlCPOTrainer(*args, **kwargs)

Extend the base CPOTrainer for axolotl helpers

AxolotlKTOTrainer

core.trainers.trl.AxolotlKTOTrainer(*args, **kwargs)

Extend the base KTOTrainer for axolotl helpers

AxolotlORPOTrainer

core.trainers.trl.AxolotlORPOTrainer(*args, **kwargs)

Extend the base ORPOTrainer for axolotl helpers

AxolotlPRMTrainer

core.trainers.trl.AxolotlPRMTrainer(*args, **kwargs)

Extend the base trl.PRMTrainer for axolotl helpers

AxolotlRewardTrainer

core.trainers.trl.AxolotlRewardTrainer(*args, **kwargs)

Extend the base RewardTrainer for axolotl helpers
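
The Axolotl* trainers above all accept (*args, **kwargs) and are described as extending a TRL base trainer with helpers. The following is a minimal sketch of that general pattern, not the actual axolotl implementation. BaseTrainer, HelperMixin, ExampleCPOTrainer, tag_names, and describe are hypothetical stand-ins chosen for illustration, so the example runs without trl installed.

```python
class BaseTrainer:
    """Hypothetical stand-in for a TRL base trainer such as CPOTrainer."""

    def __init__(self, model=None, args=None, **kwargs):
        self.model = model
        self.args = args
        self.extra = kwargs

    def train(self):
        # A real trainer would run the optimization loop here.
        return "trained"


class HelperMixin:
    """Hypothetical mixin standing in for project-specific helpers."""

    tag_names = ["example"]

    def describe(self):
        return f"{type(self).__name__} tags={self.tag_names}"


class ExampleCPOTrainer(HelperMixin, BaseTrainer):
    """Adds helpers to the base trainer and forwards all arguments unchanged."""

    tag_names = ["example", "cpo"]

    def __init__(self, *args, **kwargs):
        # The mixin defines no __init__, so this resolves to BaseTrainer.__init__.
        super().__init__(*args, **kwargs)


trainer = ExampleCPOTrainer(model="toy-model", args={"lr": 1e-5})
print(trainer.describe())
print(trainer.train())
```

Listing the mixin before the base class lets the helper methods take precedence in the method resolution order, while the (*args, **kwargs) constructor keeps the subclass compatible with whatever arguments the base trainer accepts.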

TRLPPOTrainer

core.trainers.trl.TRLPPOTrainer()

Wrapper for TRL PPO trainer to handle customizations