Meituan LongCat Makes VitaBench 2.0 Open-Source
6 hour ago / Read about 0 minute
Author:小编   

Following the rollout of VitaBench 1.0 in October of the previous year, the Meituan LongCat team has now introduced VitaBench 2.0. This iteration stands as the inaugural intelligent agent evaluation benchmark specifically designed for long-term dynamic user modeling in real-world settings. It facilitates a comprehensive evaluation of large language models' capabilities to exhibit personalization and initiative during extended, genuine, and dynamic interactions with users.