On April 8, Step-Star released its latest multimodal reasoning model, Step-R1-V-Mini, which accepts image and text input and produces text output. According to the announcement, the model follows instructions well and has strong general-purpose capabilities: it can perceive image details with high precision and carry out complex reasoning tasks.
