Reinforcement learning. People like being told they are right, and when a respon...

		GuB-42 2 days ago \| parent \| context \| favorite \| on: Claude's new constitution Reinforcement learning. People like being told they are right, and when a response contains that formulation, on average, given the choice, people will pick it more often than a response that doesn't, and the LLM will adapt.