According to AI Base, ByteDance and Nanyang Technological University have jointly released StoryMem, an open-source AI video generation framework. StoryMem turns existing single-shot diffusion models into a coherent long-form video generation system: its 'Memory-to-Video (M2V)' mechanism enables multi-shot sequences that run longer than one minute. The framework maintains a dynamic memory bank of keyframe information; combined with lightweight LoRA fine-tuning, this preserves a high degree of cross-shot consistency in character appearance, scene style, and narrative logic. Compared with existing methods, StoryMem improves consistency metrics by 29%. The team has also released ST-Bench, a dataset of 300 multi-shot story prompts, to support standardized evaluation. The tech community has already begun integrating the framework into ComfyUI.
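The memory-bank idea described above can be illustrated with a minimal sketch: keep a bounded store of keyframes from previously generated shots and feed them as conditioning context when generating the next shot. This is a hypothetical Python illustration of the concept only; the class and function names (`KeyframeMemoryBank`, `generate_shot`) are assumptions for clarity and do not reflect StoryMem's actual API.

```python
from collections import deque

class KeyframeMemoryBank:
    """Illustrative memory bank in the spirit of Memory-to-Video (M2V).

    Stores a bounded number of keyframes (represented here as plain
    feature vectors) from previously generated shots. All names and
    structure are assumptions, not StoryMem's real interface.
    """

    def __init__(self, capacity: int = 8):
        # Oldest keyframes are evicted first once capacity is reached.
        self.frames = deque(maxlen=capacity)

    def add_keyframes(self, shot_id: int, keyframes):
        for kf in keyframes:
            self.frames.append((shot_id, kf))

    def memory_context(self):
        # Everything retained so far conditions the next shot.
        return [kf for _, kf in self.frames]


def generate_shot(prompt: str, memory: KeyframeMemoryBank) -> dict:
    """Stand-in for a single-shot diffusion call conditioned on memory."""
    context = memory.memory_context()
    # A real model would cross-attend to `context`; here we only record
    # how many keyframes were available as conditioning.
    return {"prompt": prompt, "conditioned_on": len(context)}


bank = KeyframeMemoryBank(capacity=4)
shots = ["hero enters the tavern", "close-up of the hero", "wide shot of the street"]
for i, prompt in enumerate(shots):
    shot = generate_shot(prompt, bank)
    # Pretend one keyframe was extracted from each generated shot.
    bank.add_keyframes(i, [[float(i)] * 3])

print(shot["conditioned_on"])  # keyframes seen by the final shot
```

Because each shot is generated after earlier keyframes are banked, later shots see progressively more context, which is the intuition behind cross-shot consistency in this style of pipeline.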
