From 'Data Fusion' to 'Native Architecture': SenseTime Launches NEO Architecture, Pushing the Efficiency Envelope of Multimodal Models

2025-12-01 / Read about 0 minute

Author：小编

In a collaborative effort with S-Lab at Nanyang Technological University, SenseTime Technology has officially unveiled and made open-source a cutting-edge multimodal model architecture known as NEO. This innovative architecture is set to serve as the cornerstone for the next generation of SenseNova's multimodal models, embodying a native unified design ethos. Leveraging pivotal technologies such as native primitives, positional encoding, and hybrid attention mechanisms, NEO efficiently integrates images and text within a singular Transformer framework, thereby redefining the limits of efficiency in multimodal model architectures.

Previous page：Paiwo AI Unveils PixVerse V5.5 AI Video Generative...

Next page：Masayoshi Son Breaks Silence on NVIDIA Share Sale:...

Return to List

Hot Reading

2 day ago

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

2 day ago

Amazon stuck with months of repairs after drone strikes on data centers

2 day ago

Beyond Lovable and Mistral: 21 European startups to watch

2 day ago

Study: AI models that consider user's feeling are more likely to make errors