6 / 780

Why Google DeepMind Just Abandoned Single-Score AI Testing

TL;DR

Google DeepMind has introduced a new framework for evaluating Artificial General Intelligence (AGI), shifting from traditional benchmarks to a multidimensional approach. This framework examines AI systems across ten cognitive dimensions, including perception, reasoning and social cognition, to create a detailed profile of their capabilities. For example, an AI might demonstrate strong problem-solving skills but show […] The post Why Google DeepMind Just Abandoned Single-Score AI Testing appeared first on Geeky Gadgets.

Nauti's Take

DeepMind's multidimensional AGI framework is an overdue move – a single benchmark score never really told you what a model can do. Ten cognitive dimensions finally offer a more honest picture, but there's a real catch: whoever defines the metrics defines what counts as 'intelligent'.

A genuine step forward – and a reminder that measurement frameworks are never neutral.

Video

Sources