Recently, Kling AI has rolled out an innovative digital human feature. Users are empowered to upload an image of a character and then combine it with either text or audio to craft a digital human video. This video boasts a high - definition resolution of 1080p, a smooth frame rate of 48FPS, and can extend up to a maximum duration of 1 minute.
This remarkable function is built upon the seamless and in - depth integration of multimodal understanding models and video generation models. Such integration allows for highly accurate lip - syncing, as well as precise control over the digital human's emotions and movements. Moreover, it is versatile enough to support multiple characters simultaneously and is compatible with a variety of languages, including Chinese, English, Japanese, and Korean.
When coupled with membership discounts, the cost of generating these videos can plummet to as low as 0.12 yuan per second. This significantly reduces the financial barriers that have long hindered entry into the industry. The digital human videos generated through this feature are well - suited for a range of scenarios, such as advertising campaigns, e - commerce promotions, and educational materials. At present, the product has entered the public beta testing phase, inviting users to experience and provide feedback.