Reverse Engineering OpenAI’s o1

Q* (Q-star) to Strawberry to o1

image.png

Tree of Thoughts

image.png

Process Reward Model (Let’s verify step by step)

image.png

image.png