I think it just means that you can objectively score an answer as being correct ...

		ks2048 on Jan 29, 2025 \| parent \| context \| favorite \| on: An analysis of DeepSeek's R1-Zero and R1 I think it just means that you can objectively score an answer as being correct or not. (e.g. if the generated program passes some tests; a discovered proof is valid, etc).