prompt_strategies.dpo.llama3
prompt_strategies.dpo.llama3
DPO strategies for llama-3 chat template
Functions
Name | Description |
---|---|
argilla_chat | for argilla/dpo-mix-7k conversations |
icr | chatml transforms for datasets with system, input, chosen, rejected |
intel | For Intel Orca DPO Pairs |
ultra | for ultrafeedback binarized conversations |
argilla_chat
**kwargs) prompt_strategies.dpo.llama3.argilla_chat(cfg,
for argilla/dpo-mix-7k conversations
icr
**kwargs) prompt_strategies.dpo.llama3.icr(cfg,
chatml transforms for datasets with system, input, chosen, rejected ex. https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs
intel
**kwargs) prompt_strategies.dpo.llama3.intel(cfg,
For Intel Orca DPO Pairs
ultra
**kwargs) prompt_strategies.dpo.llama3.ultra(cfg,
for ultrafeedback binarized conversations