The Audio, Speech and Language Processing (ASLP) Research Group at Northwestern Polytechnical University, in collaboration with institutions like Hill Shell and the AI Research Institute of China Telecom, has officially released the first large-scale, multi-dimensionally annotated Sichuan dialect speech dataset—WenetSpeech-Chuan, as an open-source resource. This comprehensive dataset encompasses 10,000 hours of speech data, spanning nine key domains. It also features multi-dimensional annotations, including Automatic Speech Recognition (ASR) transcriptions, speaker characteristics, and speech quality assessments.
