Hunger Games for AI
OK really it's called "Survival Game" but that's the basic idea. Some Cornell researchers decided to put various LLMs through their paces, giving them a limited number of tries to solve a variety of problems, and eliminating those that failed in successive rounds of evaluation. I do NOT have the time to read this comprehensively, but from the intro:
Our results show that while AI systems achieve the Autonomous Level in simple tasks, they are still far from it in more complex tasks, such as vision, search, recommendation, and language. While scaling current AI technologies might help, this would come at an astronomical cost. Projections suggest that achieving the Autonomous Level for general tasks would require 1026 parameters. To put this into perspective, loading such a massive model requires so many H100 GPUs that their
total value is 4 × 107 times that of Apple Inc.’s market value. Even with Moore’s Law, supporting such a parameter scale would take 70 years. This staggering cost highlights the complexity of human tasks and the inadequacies of current AI technologies.
https://arxiv.org/pdf/2502.18858
Check it out! Tell me what jumped out at you.