The Evals tool from OpenAI now boasts built-in audio input and scoring functionalities, enabling the direct assessment of a model's audio-based responses. This eliminates the necessity for converting audio into text prior to evaluation. This enhancement significantly simplifies the evaluation workflow for models focused on speech recognition and generation, thereby boosting their efficiency, precision, and dependability. The tool proves especially valuable in contexts like the creation and refinement of intelligent voice assistants, gauging the performance of speech recognition technologies, and ensuring the quality of audio content production. For detailed guidance on leveraging these capabilities, users are encouraged to consult the official Cookbook.