On April 24th, the DeepSeek-V4 series models were officially launched. To meet the new computational demands introduced by the CSA/HCA hybrid attention mechanism, Intellifusion utilized its in-house-developed GPNPU architecture and the IFWA intelligent fusion software stack, complemented by the PyTorch plugin torch_ifwa, to conduct a comprehensive adaptation verification of key mechanisms for the GPNPU platform. This verification primarily centered on the computational characteristics inherent to the CSA/HCA hybrid attention mechanism, showcasing the IFWA software stack’s ability to swiftly respond to innovative attention structures and the GPNPU architecture’s potential to adapt to the evolving landscape of cutting-edge large models. This achievement sets a solid foundation for the subsequent engineering deployment, operator optimization, and performance validation of the DeepSeek-V4 series models on the GPNPU platform.
