core.builders.rl
core.builders.rl
Builder for RLHF trainers
Classes
| Name | Description |
|---|---|
| HFRLTrainerBuilder | Trainer factory class for TRL-based RLHF trainers (e.g. DPO) |
HFRLTrainerBuilder
core.builders.rl.HFRLTrainerBuilder(cfg, model, tokenizer, processor=None)Trainer factory class for TRL-based RLHF trainers (e.g. DPO)