Anthropic's Alignment Science team: "legibility" or "faithfulness" of reasoning models' Chain-of-Thought can't be trusted and models may actively hide reasoning (Emilia David/VentureBeat)
https://www.techmeme.com/feed.xmlHits: 24
Summary
— A deal to spin off the U.S. assets of TikTok was put on hold after China indicated that it would not approve …