calculuschild@lemm.ee to Technology@beehaw.org • [Fortune] Over just a few months, ChatGPT went from correctly answering a simple math problem 98% of the time to just 2%, study finds
1 year ago
My understanding is that this claim is basically entirely false. The researchers' tests had some glaring errors that, when corrected, show GPT-4 is getting slightly better at math, if anything. This video describes some of the issues: https://youtu.be/YSokS2ivf7U
TL;DR: The researchers gave the old and new versions of GPT questions from two different pools, so it’s no surprise they got worse answers.
The problem is they aren’t comparing apples to apples. They asked each version of GPT a different pool of questions. (Edited my post to make this clear).
Once you ask them the same questions, it becomes clear that ChatGPT isn’t getting worse at math, because it has been terrible all along.
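To make the apples-to-apples point concrete, here is a toy sketch (not the researchers' actual setup). Everything in it is made up for illustration: the primality task, the `toy_model` stand-in, and both question pools. It only shows that the same unchanged "model" can score very differently depending on which pool you hand it, so a score gap between two pools says nothing by itself about the model changing.

```python
def is_prime(n: int) -> bool:
    """Ground truth used to grade answers."""
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n ** 0.5) + 1))


def accuracy(model, pool) -> float:
    """Fraction of questions in the pool that the model answers correctly."""
    return sum(model(n) == is_prime(n) for n in pool) / len(pool)


def toy_model(n: int) -> bool:
    """Hypothetical stand-in for a chatbot: always answers 'yes, it is prime'."""
    return True


# Two hypothetical question pools with different compositions.
pool_a = [7, 11, 13, 17]   # all primes
pool_b = [7, 11, 12, 15]   # half primes, half composites

# Same model, different pools: the scores differ even though nothing changed.
score_a = accuracy(toy_model, pool_a)
score_b = accuracy(toy_model, pool_b)
print(score_a, score_b)
```

A fair comparison would call `accuracy` for both versions on one shared pool, so any difference in score comes from the models rather than from the questions.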