Novelty Rewards
Researchers have developed a machine learning technique that generates more diverse test prompts, making it easier to probe AI chatbots for toxic responses and guard against them.