Multimodal technology has seen a run of significant updates in recent weeks. On May 21, at Google I/O 2025, Google unveiled Veo 3, a video generation model that produces AI-generated audio synchronized with its video. On May 23, Doubao officially launched its video call feature, which supports real-time video communication and screen sharing. On June 6, Kuaishou announced that KeLing AI's ARR surpassed US$100 million in March 2025, and that its monthly paid amounts exceeded RMB 100 million in both April and May.
Looking ahead, two upcoming events, Apple's WWDC 2025 on June 10 and ByteDance's Force 2025 Conference on June 11, are expected to further accelerate the application and deployment of multimodal models and edge AI products. New releases at these conferences would reinforce the momentum already building in the field.