prompt_strategies.dpo.chat_template
prompt_strategies.dpo.chat_template
DPO prompt strategies for using tokenizer chat templates.
Functions
| Name | Description | 
|---|---|
| argilla_chat | DPO chat template strategy for argilla-style datasets. | 
argilla_chat
prompt_strategies.dpo.chat_template.argilla_chat(cfg, dataset_idx=0, **kwargs)DPO chat template strategy for argilla-style datasets.
For argilla-style datasets where chosen/rejected contain full conversations instead of single response messages. Extracts the conversation history from the chosen field and formats both chosen/rejected responses using the configured chat template.
Parameters
| Name | Type | Description | Default | 
|---|---|---|---|
| cfg | Configuration object containing chat_template and dataset settings | required | |
| dataset_idx | Index of the dataset in the config (default: 0) | 0 | |
| **kwargs | Additional keyword arguments (unused) | {} | 
Returns
| Name | Type | Description | 
|---|---|---|
| tuple | (transform_fn, dataset_kwargs) where: - transform_fn: Function to transform dataset samples - dataset_kwargs: Dict with ‘remove_columns’ specifying columns to drop | 
Dataset format
{ “chosen”: [ {“role”: “user”, “content”: “…”}, {“role”: “assistant”, “content”: “…”} ], “rejected”: [ {“role”: “user”, “content”: “…”}, {“role”: “assistant”, “content”: “…”} ] }