As reported by CSDN, the MMLab team at Nanyang Technological University has recently unveiled Hand2World, a model that enables AI-driven world models to generate first-person interactive video in real time, taking mid-air hand gestures as the sole input. The work marks a shift for world models from ‘passive observation’ to ‘active engagement’ and directly addresses the hand-eye interaction problem.

Technically, Hand2World uses projections rendered from 3D hand meshes as control signals and pixel-level Plücker ray embeddings to precisely encode camera motion, which decouples hand movement from head (viewpoint) rotation. The model supports streaming output, allowing continuous interaction of unbounded duration, and substantially improves the visual quality and 3D consistency of the generated video, as demonstrated on three major benchmarks.
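To make "projections rendered from 3D hand meshes as control signals" concrete, here is a minimal NumPy sketch, not the team's released code: the function name `project_hand_vertices` and its inputs `V` (camera-frame mesh vertices) and `K` (pinhole intrinsics) are assumptions for illustration. It projects each vertex into the image and splats it into a 2D occupancy map of the kind a generator could be conditioned on; a real pipeline would rasterize full triangles into a depth or silhouette map.

```python
import numpy as np

def project_hand_vertices(V, K, H, W):
    """Crude control-signal sketch (hypothetical, not Hand2World's code).

    V: (N, 3) hand-mesh vertices in the camera frame with z > 0.
    K: (3, 3) pinhole intrinsics.
    Returns an (H, W) binary map marking projected vertex locations.
    """
    uvw = V @ K.T                      # (N, 3) homogeneous pixel coords
    uv = uvw[:, :2] / uvw[:, 2:3]      # perspective divide -> (u, v)

    mask = np.zeros((H, W), dtype=np.float32)
    u = np.round(uv[:, 0]).astype(int)
    v = np.round(uv[:, 1]).astype(int)
    inside = (u >= 0) & (u < W) & (v >= 0) & (v < H)
    mask[v[inside], u[inside]] = 1.0   # splat vertices that land in frame
    return mask
```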

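As for the Plücker ray embeddings, these are a common way in recent camera-controllable video models to give the network a per-pixel description of the camera: each pixel's viewing ray is encoded by the 6D coordinates (m, d) with direction d and moment m = o × d, where o is the camera center. The NumPy sketch below is illustrative only (it is not Hand2World's code), and the assumed conventions are a 3×3 intrinsics matrix K and world-to-camera extrinsics R, t with x_cam = R·x_world + t.

```python
import numpy as np

def plucker_ray_embedding(K, R, t, H, W):
    """Per-pixel Plücker ray map for one camera pose (illustrative sketch)."""
    o = -R.T @ t  # camera center in world coordinates

    # Pixel-center grid in homogeneous image coordinates
    u, v = np.meshgrid(np.arange(W) + 0.5, np.arange(H) + 0.5)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)   # (H, W, 3)

    # Back-project to world-frame ray directions: d = R^T K^{-1} p
    d = pix @ np.linalg.inv(K).T @ R                   # (H, W, 3)
    d /= np.linalg.norm(d, axis=-1, keepdims=True)

    m = np.cross(o, d)   # moment vector, constant along each ray
    return np.concatenate([m, d], axis=-1)             # (H, W, 6)
```

Because this embedding changes only with camera pose and not with scene content, it gives the generator a motion signal that is independent of the hand-control channel, which fits the report's description of separating hand movement from head rotation.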