Earlier this week, Meta sparked controversy by posting high scores on LM Arena, the crowdsourced benchmark, using an undisclosed experimental version of Llama 4 Maverick. Once the practice came to light, LM Arena's administrators apologized, revised their policies, and re-evaluated the unmodified release version of Maverick, which ranked far lower than the experimental variant had. The episode not only called Meta's conduct into question but also reignited broader debate over technical transparency and the fairness of benchmark evaluations.
