On January 30, Ali QianWen made an official announcement regarding the launch of the DeepPlanning benchmark. This particular benchmark has been meticulously crafted to assess an Agent's prowess in global planning when confronted with real-world, intricate scenarios. These scenarios encompass tasks such as multi-day travel planning and shopping for multiple products. Currently, the benchmark has been made openly accessible as an open-source resource on both Hugging Face and ModelScope platforms.
