----------------------
Open AI releases paper + dataset
Let’s Verify Step by Step
trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final…
----------------------
via AI News - Telegram Channel