So we have 3 options:
- t3 is now included in the corpus
- t3 was used for RL
- o1 generalizes better