prompt_strategies.dpo.chatml
prompt_strategies.dpo.chatml
DPO strategies for chatml
Functions
Name | Description |
---|---|
argilla_chat | for argilla/dpo-mix-7k conversations |
icr | chatml transforms for datasets with system, input, chosen, rejected |
intel | For Intel Orca DPO Pairs |
ultra | for ultrafeedback binarized conversations |
argilla_chat
**kwargs) prompt_strategies.dpo.chatml.argilla_chat(cfg,
for argilla/dpo-mix-7k conversations
icr
**kwargs) prompt_strategies.dpo.chatml.icr(cfg,
chatml transforms for datasets with system, input, chosen, rejected ex. https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs
intel
**kwargs) prompt_strategies.dpo.chatml.intel(cfg,
For Intel Orca DPO Pairs
ultra
**kwargs) prompt_strategies.dpo.chatml.ultra(cfg,
for ultrafeedback binarized conversations