Google Releases New AI Creation Tools to Accelerate Multimodal Content Generation

2 days ago

Author: Editor

At its recent I/O developer conference, Google announced upgrades to a series of AI creation tools built on the Gemini model family, aiming to lower the barrier to multimedia content generation and improve efficiency. For video and multimodal creation, Google introduced the new Gemini Omni model, which accepts text, image, audio, and video inputs and can generate coherent video content. Its most notable feature is conversational editing: users describe the changes they want in natural language, such as changing characters, adjusting lighting, or altering scenes, and the model carries out the edits automatically.
