"Reward modeling for mitigating toxicity in transformer-based language models."

Farshid Faal, Ketra A. Schmitt, Jia Yuan Yu (2023)

Details and statistics

DOI: 10.1007/S10489-022-03944-Z

access: closed

type: Journal Article

metadata version: 2023-03-28