Hugging Face has lately made the smol-audio codebase publicly available as open source, equipping developers with an all-inclusive toolkit for the further development and local deployment of audio models. This toolkit is designed to support the fine-tuning of prominent speech-based large models, thereby expediting the iterative process and broadening the application scope of audio models.
