Home
News
Business Stories AI Technology Travel Visa Asia Business Registration Telecommunication Medical Services
About Us
Home News AI Technology OpenAI o4-proto-4 achieves 82% autonomous resolution on full SWE-Bench Verified + Multi-File

OpenAI o4-proto-4 achieves 82% autonomous resolution on full SWE-Bench Verified + Multi-File

258    2026-02-25

[AI-NEWS]

Date: 2026-02-25

Content: o4-proto-4 sets new internal record by resolving 82% of SWE-Bench Verified multi-file tasks completely autonomously (design → code → test → debug → commit) over multi-hour sessions without any human edits or guidance.

Keywords : autonomous software engineering, SWE-Bench Verified, multi-file resolution, o4-proto-4, end-to-end coding agent

Previous article
Mistral Mathstral 2.0 sets new open-source SOTA on MATH and GSM8K-Hard
Next article
OpenClaw may be the most important software release in history
new
OpenClaw may be the most important software release in history Google Gemini 3 User Base Surges, Challenging OpenAI's Dominance Claude Free Users Surge 60%+; Anthropic Grows Despite Pentagon Ban Risks YuanLab Open-Sources Yuan3.0 Ultra, Joins Top 3 Trillion-Parameter Open Multimodal Models Globally OpenAI Launches GPT-5.4 with Native Computer Control, Deep Integration into Excel & Google Sheets Tesla Unveils Optimus Gen-3 Humanoid Robot, Announces Mass Production Plan Google DeepMind Unveils Math Reasoning Agent Aletheia, Solving World-Class Problems Edge AI Explodes, Honor Launches Magic8 Pro Redefining Mobile Imaging Open-Source LLM Architecture Boom: In-Depth Review of 10 New 2026 Models Mistral Releases Local Voice Model: Privacy-Focused, Low-Latency Real-Time Conversation
Email subscription
About
Navigation
News
©bizyet.com