DeepMind claims its newest AI tool is a whiz at math and science problems

https://techcrunch.com/feed/

Summary

Google’s AI R&D lab DeepMind says it has developed a new AI system to tackle problems with “machine-gradable” solutions. In experiments, the system, called AlphaEvolve, could help optimize some of the infrastructure Google uses to train its AI models, DeepMind said. The company says it’s building a user interface for interacting with AlphaEvolve, and plans to launch an early access program for selected academics ahead of a possible broader rollout.

Most AI models hallucinate. Owing to their probabilistic architectures, they confidently make things up sometimes. In fact, newer AI models like OpenAI’s o3 hallucinate more than their predecessors, illustrating the challenging nature of the issue.

AlphaEvolve introduces a clever mechanism to cut down on hallucinations: an automatic evaluation system. The system uses models to generate, critique, and arrive at a pool of possible answers to a question, and automatically evaluates and scores the answers for accuracy.

[Image: DeepMind’s AlphaEvolve system is designed to be used by domain experts, the lab says. Image credits: DeepMind]

AlphaEvolve isn’t the first system to take this tack. Researchers, including a team at DeepMind several years ago, have applied similar techniques in various math domains. But DeepMind claims AlphaEvolve’s use of “state-of-the-art” models (specifically Gemini models) makes it significantly more capable than earlier instances of AI.

To use AlphaEvolve, users must prompt the system with a problem, optionally including details like instructions, equations, code snippets, and relevant literature. They must also provide a mechanism for automatically assessing the system’s answers in the form of a formula.

Because AlphaEvolve can only solve problems that it can self-evaluate, the system can only work with certain types of problems, specifically those in fields like computer science and system optimization. In another major limitation, AlphaEvolve can only describe solutions as algorithms, making it a poor fit...
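
To make the idea of "machine-gradable" problems concrete, here is a rough Python sketch of a generate-and-score loop of the kind described above. It is an illustration only, not DeepMind's implementation: the names `propose`, `evolve`, and `score` are hypothetical, random mutation stands in for the Gemini models that would propose candidates, and the toy scoring formula plays the role of the evaluation mechanism a user would supply.

```python
import random
from typing import Callable, List, Tuple

Candidate = List[float]


def propose(parent: Candidate, rng: random.Random) -> Candidate:
    """Hypothetical stand-in for the model: perturb one value of a parent."""
    child = list(parent)
    i = rng.randrange(len(child))
    child[i] += rng.gauss(0.0, 0.5)
    return child


def evolve(
    evaluate: Callable[[Candidate], float],
    seed: Candidate,
    generations: int = 200,
    pool_size: int = 8,
    rng_seed: int = 0,
) -> Tuple[Candidate, float]:
    """Keep a small pool of scored candidates and return the best one found."""
    rng = random.Random(rng_seed)
    pool = [(evaluate(seed), seed)]
    for _ in range(generations):
        _, parent = rng.choice(pool)           # pick a candidate to build on
        child = propose(parent, rng)           # generate a variant
        pool.append((evaluate(child), child))  # score it automatically
        pool.sort(key=lambda pair: pair[0], reverse=True)
        del pool[pool_size:]                   # keep only the top scorers
    best_score, best = pool[0]
    return best, best_score


def score(candidate: Candidate) -> float:
    """Toy user-supplied evaluator: higher is better, peak at (3, -1)."""
    x, y = candidate
    return -((x - 3.0) ** 2 + (y + 1.0) ** 2)


best, best_score = evolve(score, seed=[0.0, 0.0])
print(best, best_score)
```

The point of the sketch is that every candidate gets a score from a formula rather than from the model's own say-so, so a wrong answer simply scores poorly and drops out of the pool. It also shows why the approach is limited to problems where such a formula can be written down.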

First seen: 2025-05-14 15:35

Last seen: 2025-05-14 20:36
