OpenAI releases OpenAI o3 and o4-mini models

OpenAI News:

Today, we’re releasing OpenAI o3 and o4-mini, the latest in our o-series of models trained to think for longer before responding. These are the smartest models we’ve released to date, representing a step change in ChatGPT's capabilities for everyone from curious users to advanced researchers. For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images. Critically, these models are trained to reason about when and how to use tools to produce detailed and thoughtful answers in the right output formats, typically in under a minute, to solve more complex problems. This allows them to tackle multi-faceted questions more effectively, a step toward a more agentic ChatGPT that can independently execute tasks on your behalf. The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness.

What’s changed

OpenAI o3 is our most powerful reasoning model that pushes the frontier across coding, math, science, visual perception, and more. It sets a new SOTA on benchmarks including Codeforces, SWE-bench (without building a custom model-specific scaffold), and MMMU. It’s ideal for complex queries requiring multi-faceted analysis and whose answers may not be immediately obvious. It performs especially strongly at visual tasks like analyzing images, charts, and graphics. In evaluations by external experts, o3 makes 20 percent fewer major errors than OpenAI o1 on difficult, real-world tasks—especially excelling in areas like programming, business/consulting, and creative ideation. Early testers highlighted its analytical rigor as a thought partner and emphasized its ability to generate and critically evaluate novel hypotheses—particularly within biology, math, and engineering contexts.

OpenAI o4-mini is a smaller model optimized for fast, cost-efficient reasoning—it achieves remarkable performance for its size and cost, particularly in math, coding, and visual tasks. It is the best-performing benchmarked model on AIME 2024 and 2025. In expert evaluations, it also outperforms its predecessor, o3‑mini, on non-STEM tasks as well as domains like data science. Thanks to its efficiency, o4-mini supports significantly higher usage limits than o3, making it a strong high-volume, high-throughput option for questions that benefit from reasoning.

External expert evaluators rated both models as demonstrating improved instruction following and more useful, verifiable responses than their predecessors, thanks to improved intelligence and the inclusion of web sources. Compared to previous iterations of our reasoning models, these two models should also feel more natural and conversational, especially as they reference memory and past conversations to make responses more personalized and relevant.

https://openai.com/index/introducing-o3-and-o4-mini/

Steve C
