OpenAI has launched GPT-5.2, its smartest synthetic intelligence mannequin but, with efficiency positive factors throughout writing, coding, and reasoning benchmarks. The launch comes simply days after CEO Sam Altman internally declared a “code crimson,” a company-wide push to enhance ChatGPT amid intense competitors from rivals.
“We introduced this code crimson to essentially sign to the corporate that we need to marshal assets in a single explicit space, and that is a approach to actually outline priorities,” mentioned OpenAI’s CEO of purposes, Fidji Simo, in a briefing with reporters on Thursday. “We’ve got had a rise in assets centered on ChatGPT on the whole.”
Simo denied that OpenAI had moved up GPT-5.2’s launch in mild of its code crimson, claiming the corporate has been engaged on this mannequin’s launch for months. Nevertheless, she mentioned the extra assets round ChatGPT have been “useful.”
Whereas OpenAI’s fashions and merchandise have been thought-about best-in-class when ChatGPT launched in 2022, that’s now not a settled matter. The startup now faces an array of worthy challengers, maybe none extra threatening than Google, whose just lately launched Gemini 3 mannequin was obtained properly by the tech business. Google’s Gemini app has grown at a formidable fee over the past 12 months, now with greater than 650 million month-to-month lively customers, in comparison with OpenAI’s 800 million weekly lively customers. That strain has pressured OpenAI to rein in a few of its most formidable tasks, together with its work on introducing advertisements to ChatGPT, and to refocus on enhancing its core expertise and merchandise.
Very like the corporate’s current mannequin launches, GPT-5.2 is delivery as a collection of fashions: Instantaneous, which responds quicker and is healthier for information-finding; Pondering, which excels at coding, math, and planning; and Professional, essentially the most highly effective tier of OpenAI’s fashions that delivers larger accuracy on tough questions.
OpenAI calls GPT-5.2 its finest mannequin but for on a regular basis skilled use. GPT-5.2 Pondering notched the very best scores so far on GDPval, an OpenAI benchmark that compares efficiency between AI fashions and human professionals throughout 44 real-world occupations. The corporate says the mannequin beat human professionals in over 70 p.c of duties, and accomplished them 11 instances quicker.
OpenAI’s post-training lead Max Schwarzer says the brand new launch must also supply a considerable discount in hallucinations. The corporate says GPT-5.2 Pondering hallucinated 38 p.c lower than GPT-5.1 on benchmarks measuring solutions to factual questions.
The corporate is bringing GPT-5.2 to each ChatGPT customers and builders on OpenAI’s API product. OpenAI says the brand new collection of fashions “brings clear positive factors throughout on a regular basis and superior use circumstances.”