Filed as (LLMs will always fail on clearly identifiable classes of problems)
You asked a question about a subject that has a large number of fairly consistent copies on the Internet. I know much of the Internet and STEMC. So it is easy to predict where the LLMs fail. OpenAI, Gemini, Grok and CoPilot all fail on harder problems and OpenAI and CoPilot always fail when asked
Read More »