On April 3, Maxvision Technology, in collaboration with the Shanghai AI Laboratory, officially launched the Kernel-Smith, a high-performance GPU operator generation system. This system ingeniously combines a 'stable evaluation-driven evolutionary agent' with an 'evolution-oriented post-training paradigm.' By utilizing the Intern-S1-Pro large model for in-depth, customized training, it empowers the large model to evolve into an 'operator optimization expert.' Presently, the high-performance operators automatically generated by Kernel-Smith have already found practical applications. They have accelerated the new architecture Engram of DeepSeek and have been seamlessly integrated into DLBlas. Furthermore, these operators have been deployed in mainstream production-grade inference engines such as SGLang and LMDeploy, marking a significant leap from laboratory evaluation to state-of-the-art model development and production-level deployment.
