In Stanford's latest evaluation of large language models on medical tasks, DeepSeek R1 came out on top with a 66% win rate, a result that surprised many overseas internet users. Rather than relying on traditional medical licensing exam questions, the evaluation focuses on real-world clinical scenarios, which makes its results more relevant to actual practice.