In an Oxford study, LLMs correctly identified medical conditions 94.9% of the time when given test scenarios directly, vs. 34.5% when prompted by human subjects (Nick Mokey/VentureBeat)

https://www.techmeme.com/feed.xml Hits: 14
Summary

— Also: another paper that seals the deal — The Apple paper on limitations in the “reasoning” of Large Reasoning Models, which raised challenges for the latest scaling hypothesis, has clearly touched a nerve.

First seen: 2025-06-14 22:00

Last seen: 2025-06-15 11:01