core.trainers.ebft
core.trainers.ebft
EBFT (Energy-Based Fine-Tuning) Strategy for training
Two modes: - structured: For QA data with prompt/completion splits. Uses GRPOTrainer + vLLM. - strided: For unstructured text (raw code, prose). Uses strided block-parallel generation.
Classes
| Name | Description |
|---|---|
| EBFTStrategy | Strategy for EBFT training — dispatches between structured and strided modes. |
EBFTStrategy
core.trainers.ebft.EBFTStrategy()Strategy for EBFT training — dispatches between structured and strided modes.
Methods
| Name | Description |
|---|---|
| set_training_args_kwargs | Map axolotl YAML config fields to training args kwargs. |
set_training_args_kwargs
core.trainers.ebft.EBFTStrategy.set_training_args_kwargs(cfg)Map axolotl YAML config fields to training args kwargs.