Hosted on MSN
ChatGPT just announced it can pass the 'how many "r"s in strawberry' test, but users found otherwise
ChatGPT passes “strawberry” test but fails when switched to “cranberry” AI still struggles with simple letter-counting despite broader improvements Reasoning tests like “car wash” still expose gaps in ...
Confident mistakes – or lies, if you will – are a common problem of large language models used in AI chatbots, with one common shortcoming of ChatGPT being that it would frequently miscount the number ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results