Home
Getting Started
Quickstart
Installation
Inference and Merging
Command Line Interface (CLI)
Config Reference
API Reference
Dataset Formats
Pre-training
Instruction Tuning
Conversation
Stepwise Supervised Format
Template-Free
Custom Pre-Tokenized Dataset
Deployments
Docker
Multi-GPU
Multi Node
Ray Train
AMD GPUs on HPC Systems
Mac M-series
How To Guides
MultiModal / Vision Language Models (BETA)
RLHF (Beta)
Reward Modelling
Learning Rate Groups
LoRA Optimizations
Core Concepts
Batch size vs Gradient accumulation
Dataset Preprocessing
Multipack (Sample Packing)
Advanced Features
FDSP + QLoRA
Unsloth
PyTorch ao
Custom Integrations
Troubleshooting
FAQ
Debugging
NCCL
On this page
prompt_strategies.dpo.passthrough
prompt_strategies.dpo.passthrough
prompt_strategies.dpo.passthrough
DPO prompt strategies passthrough/zero-processing strategy