Workflow
“最强编码模型”上线,Claude 核心工程师独家爆料:年底可全天候工作,DeepSeek不算前沿
SKLTYSeek .(SKLTY) 36氪·2025-05-23 18:47

| Claude | | Claude | Claude | OpenAl o3 | OpenAl | Gemini 2.5 Pro | | --- | --- | --- | --- | --- | --- | --- | | Opus 4 | | Sonnet 4 | Sonnet 3.7 | | GPT-4.1 | Preview (05-06) | | Agentic coding | 72.5% / | 72.7%/ | 62.3% / | 69.1% | 54.6% | | | SWE-bench Verified15 | 79.4% | 80.2% | 70.3% | | | 63.2% | | Agentic terminal coding | 43.2% / | 35.5% / | 35.2% | 30.2% | 30.3% | 25.3% | | Terminal-bench2.8 | 50.0% | 41.3% | | | | | | Graduate-level reasoning | 79.6% / | 75.4%/ | 78.2% | 83.3% | 66.3% | 83.0% | ...