Loads one or more training or evaluation datasets for RL training using paired
preference data, calling axolotl.utils.data.rl.load_prepare_preference_datasets.
Optionally logs debug information.
Parameters
| Name | Type | Description | Default |
|------|------|-------------|---------|
| cfg | DictDefault | Dictionary mapping axolotl config keys to values. | required |
| cli_args | Union[PreprocessCliArgs, TrainerCliArgs] | Command-specific CLI arguments. | required |
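The call shape described above can be sketched without axolotl installed. This is a rough illustration only, not the library's implementation: `AttrDict`, `FakeCliArgs`, `FakeDatasetMeta`, `load_preference_datasets_sketch`, the config keys `num_rows` and `val_size`, and the `debug` flag are all hypothetical stand-ins chosen for this example. The prompt/chosen/rejected row layout is likewise an assumption about what paired preference data looks like.

```python
from dataclasses import dataclass
from typing import Any, Optional


class AttrDict(dict):
    """Hypothetical stand-in for DictDefault: missing keys read as None."""

    def __getattr__(self, name: str) -> Any:
        return self.get(name)


@dataclass
class FakeCliArgs:
    """Hypothetical stand-in for PreprocessCliArgs / TrainerCliArgs."""

    debug: bool = False  # assumed flag that gates the debug logging


@dataclass
class FakeDatasetMeta:
    """Hypothetical stand-in for TrainDatasetMeta (dataset fields only)."""

    train_dataset: list
    eval_dataset: Optional[list] = None


def load_preference_datasets_sketch(
    *, cfg: AttrDict, cli_args: FakeCliArgs
) -> FakeDatasetMeta:
    # Stand-in for the real loader call: build paired preference rows in memory.
    rows = [
        {
            "prompt": f"question {i}",
            "chosen": f"good answer {i}",
            "rejected": f"bad answer {i}",
        }
        for i in range(cfg.num_rows or 4)
    ]
    # Hold out the last `val_size` rows for evaluation, if requested.
    val_size = cfg.val_size or 0
    train = rows[: len(rows) - val_size]
    eval_rows = rows[len(rows) - val_size:] if val_size else None
    # Optional debug output, controlled by the CLI arguments.
    if cli_args.debug:
        print(f"train rows: {len(train)}, first row: {train[0]}")
    return FakeDatasetMeta(train_dataset=train, eval_dataset=eval_rows)


meta = load_preference_datasets_sketch(
    cfg=AttrDict(num_rows=5, val_size=2),
    cli_args=FakeCliArgs(debug=True),
)
```

Both arguments are keyword-only in this sketch so that the config and the CLI arguments cannot be swapped by accident. Whether the real function enforces this is not stated in the text above.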
Returns
Name
Type
Description
TrainDatasetMeta
Dataclass with fields for training and evaluation datasets and the computed