Ark of Infinity Releases General Audio Model GPA, Achieving Unification of ASR/TTS/VC Three Tasks
2026-01-20 / Read about 0 minute
Author:小编   

Ark of Infinity recently released the General Audio Large Model (GPA), which employs a unified autoregressive Transformer architecture. It integrates three major functions—speech recognition, speech synthesis, and voice conversion—into a single framework, breaking through the traditional dispersed Pipeline design pattern of speech systems.