The new ChatGPT Images is here
OpenAI News我们今天推出了新版 ChatGPT Images,搭载我们迄今最强的图像生成模型。凭借更可靠的指令执行和更精确的编辑能力, ChatGPT Images 能在保持面部相似度等关键细节一致的同时完成你所要求的改动——生成速度最高可达 4 倍加速,让你更快迭代、少些等待。
这是我们迄今最通用的文本到图像模型:转换更具表现力、密集文本渲染更佳、结果更自然。无论是微小修正还是彻底重塑,你只需口述想法,或在新的 Images 体验中选用预设风格与灵感, ChatGPT 会把剩下的工作做好,产出既实用又有吸引力、更符合你意图的图像。
新的图像模型和体验将于今日开始在 ChatGPT 向所有用户推送,并以 GPT‑Image‑1.5 的形式在 API 中提供。
——
匹配你意图的结果
模型现在能更可靠地遵循指令——连细枝末节也能对上你的要求——在修改你指定内容的同时,能在输入、输出与后续编辑之间保持光线、构图和人物相貌等元素的一致性。
这就能产出更符合你意图的结果:更实用的照片修图、更可信的服装与发型试穿,以及在保留原图神髓的前提下施加风格滤镜或概念性转换。综合这些改进, ChatGPT 可以成为口袋里的创作工作室,既能做务实的改动,也能完成富有表现力的再创作。
编辑
该模型擅长多种编辑方式,能进行添加、删除、合成、融合与置换等,从而在保留图像特色的同时实现你想要的变化。
创意转换
模型在创意转换方面表现突出,能改变或新增元素(例如文字与版式),让概念更易呈现,同时维护重要细节。
指令遵循
与 GPT Image 1.0 相比,模型在遵循指令方面有明显提升。
文本渲染
在文本渲染上,模型进一步进步,能够处理更密集、更小尺寸的文字。
——
一个新的创作空间
除了在聊天中通过描述生成图像,我们还在 ChatGPT 侧边栏推出了专门的 Images 体验,旨在让探索与尝试图像更快更便捷。该体验提供预设滤镜和热门提示,帮助触发灵感;并支持一次性的人像上传(one‑time likeness upload),方便你在后续创作中重复使用个人形象,而无需再次翻看相册。
这些升级让你能更好地把想象变成图像——从小修小改到全面重塑均可。图像渲染速度最高提升至四倍,你还可以在已有图像生成进行中继续发起新的生成,让你在不必等待的情况下探索更多想法。
——
附加质量改进
模型在多项细节上也有改进,使输出更可直接使用,比如对许多小面孔的呈现以及整体结果的自然度都有提升。
在一些示例中,我们展示了更精细的多面孔渲染、更真实的外观以及更生动的图形表现;尽管仍存在科学性小误差,但准确率提高到约 70%,并减少了过早裁剪的问题。
——
GPT Image 1.5 在 API 中
作为 API 版本的 GPT‑Image‑1.5 同样带来与 ChatGPT Images 相同的改进:比 GPT Image 1 在图像保留和编辑方面更强。
在多轮编辑中,你会看到品牌标志与关键视觉元素保留得更一致,使其特别适合市场与品牌工作(如图形与 Logo 设计)、以及电商团队从单一原图生成整套产品图片(不同变体、场景与角度)。
在 GPT Image 1.5 中,图像输入和输出的成本相比 GPT Image 1 降低了约 20%,这意味着在相同预算下你可以生成并迭代更多图像。
你可以在 OpenAI Playground 试用该模型,或参阅提示指南获取灵感。
包括创意工具、电商、营销软件等在内的企业和初创公司已在使用 GPT Image 1.5,例如 Wix、Canva、Higgsfield、Figma、Weave 和 Envato。
“ GPT Image 1.5 能生成高保真图像,严格遵循提示,保留构图、光线与细节。结果清晰、真实且可靠,有助于像 Wix 这样的平台把概念更快推向生产。根据我们的测试与典型用例, GPT Image 1.5 的一致性与质量足以使其成为今日旗舰级的图像生成模型之一。”—— Hila Gat,Wix 人工智能研究与数据科学负责人
——
可用性
新版 ChatGPT Images 正在对全球所有 ChatGPT 用户与 API 用户逐步上线,覆盖多个接入面。你无需选择特定设置即可使用它。此前年内推出的 ChatGPT Images 版本将以自定义 GPT 的形式继续对所有用户可用。
我们认为图像生成的潜力尚处于起步阶段。今天的更新是向前的一大步,未来还会推出更精细的编辑能力以及跨语言、更丰富更细致的输出。
Today, we’re releasing a new version of ChatGPT Images, powered by our new flagship image generation model. Now, whether you’re creating something from scratch or editing a photo, you’ll get the output you’re picturing. It makes precise edits while keeping details intact, and generates images up to 4x faster. Alongside, we’re introducing a new Images feature within ChatGPT, designed to make image generation delightful—to spark inspiration and make creative exploration effortless.
The new Images model and feature are rolling out today in ChatGPT for all users, and in the API as GPT Image 1.5.
Precise edits that preserve what matter
Now, when you ask for edits to an uploaded image, the model adheres to your intent more reliably—down to the small details—changing only what you ask for while keeping elements like lighting, composition, and people’s appearance consistent across inputs, outputs, and subsequent edits.
This unlocks results that match your intent—more useful photo edits, more believable clothing and hairstyle try-ons, alongside stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.
Editing
The model excels at different types of editing—including adding, subtracting, combining, blending, and transposing—so you get the changes you want without losing what makes the image special.
From party to livestreamLA skateboarding



Combine the two men and the dog in a 2000s film camera-style photo of them looking bored at a kids birthday party.

Add chaotic kids in the background throwing things and screaming.

Change the man on the left to a hand-drawn retro anime style, the dog to plushie style, keep the man on the right and background scenery the way they are.


Put them all in OpenAI sweaters that look like this.

Now remove the two men, just keep the dog, and put them in an OpenAI livestream that looks like the attached image.

Creative transformations
The model’s creativity shines through transformations that change and add elements—like text and layout—to bring ideas to life, while preserving important details. These transformations work for both simple and more intricate concepts, and are easy to try using preset styles and ideas in the new ChatGPT Images feature—no written prompt required.
Movie poster80s fitness instructorGlam dollOrnamentFashion adDress-up characterPaintingDrink ad

Make an old school golden age hollywood movie poster of a movie called 'codex' from the image of these two men. feel free to change their costumes to fit the times
Change the names of the actors to Wojciech Zaremba (left) and Greg Brockman (right)
Directed by Sam Altman, produced by Fidji Simo. A Feel the AGI Pictures Production.
Read more

Instruction following
The model follows instructions more reliably than our initial version. This enables more precise edits as well as more intricate original compositions, where relationships between elements are preserved as intended.
New
draw a 6x6 grid
Make a 6 (columns) by 6 (rows) grid grid of:
Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog
Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope
Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z
Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet
Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet
Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14
Read more

Previous
draw a 6x6 grid
Make a 6 (columns) by 6 (rows) grid grid of:
Row 1: the Greek letter beta, a beach ball, a lemon, a robot, a fish tank, a frog
Row 2: a praying mantis, an expensive watch, a baththub, a pair of sunglasses, a colorful butterfly, an envelope
Row 3: a stamp, a picture frame, a steaming dumpling, the word "miracle", a pair of skis, the letter Z
Row 4: a toilet, a subway token, a mute icon, a bottle of perfume, a dragonfly, a skateboard helmet
Row 5: a Bluetooth icon, the number 13, a green heart, a rubik's cube, a Canada goose, a soldier's helmet
Row 6: a white dog, a life jacket, a knot, a keyboard, a tissue box, the number 14
Read more

Text rendering
The model takes another step ahead in text rendering, capable of handling denser and smaller text.
Markdown renderingCalorie infographicCoding
There is a newspaper on a desk. The newspaper shows the markdown below laid out as a **natural** newspaper article. Preserve all content, formatting, and numbers exactly. The image should be tall.
# Introducing GPT‑5.2
### *The most advanced frontier model for professional work and long-running agents*
**December 11, 2025**
---
We are introducing **GPT‑5.2**, the most capable model series yet for professional knowledge work.
Already, the average ChatGPT Enterprise user says AI saves them 40–60 minutes a day, and heavy users say it saves them more than 10 hours a week. We designed GPT‑5.2 to unlock even more economic value for people; it’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects.
GPT‑5.2 sets a new state of the art across many benchmarks, including GDPval, where it outperforms industry professionals at well-specified knowledge work tasks spanning 44 occupations.
---
## Benchmark highlights
| Benchmark | Domain | GPT‑5.2 Thinking | GPT‑5.1 Thinking |
|---|---|---:|---:|
| GDPval (wins or ties) | Knowledge work tasks | **70.9%** | 38.8% (GPT‑5) |
| SWE-Bench Pro (public) | Software engineering | **55.6%** | 50.8% |
| SWE-bench Verified | Software engineering | **80.0%** | 76.3% |
| GPQA Diamond (no tools) | Science questions | **92.4%** | 88.1% |
| CharXiv Reasoning (w/ Python) | Scientific figure questions | **88.7%** | 80.3% |
| AIME 2025 (no tools) | Competition math | **100.0%** | 94.0% |
| FrontierMath (Tier 1–3) | Advanced mathematics | **40.3%** | 31.0% |
| FrontierMath (Tier 4) | Advanced mathematics | **14.6%** | 12.5% |
| ARC-AGI-1 (Verified) | Abstract reasoning | **86.2%** | 72.8% |
| ARC-AGI-2 (Verified) | Abstract reasoning | **52.9%** | 17.6% |
---
Notion, Box, Shopify, Harvey, and Zoom observed that GPT‑5.2 demonstrates state-of-the-art long-horizon reasoning and tool-calling performance. Databricks, Hex, and Triple Whale found GPT‑5.2 to be exceptional at agentic data science and document analysis tasks. Cognition, Warp, Charlie Labs, JetBrains, and Augment Code report that GPT‑5.2 delivers state-of-the-art agentic coding performance, with measurable improvements in areas such as interactive coding, code reviews, and bug finding.
In ChatGPT, GPT‑5.2 Instant, Thinking, and Pro will begin rolling out today, starting with paid plans. In the API, they are available now to all developers.
Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision—making it better at executing complex, real-world tasks end-to-end than any previous model.
Read more

Now change the article to the markdown below:
# Introducing GPT‑Image-1.5
### *The new and improved ChatGPT Images*
**December 16, 2025**
---
Today, we’re introducing a new and improved version of ChatGPT Images, powered by our best image generation model yet. With stronger instruction following and more precise editing, ChatGPT Images delivers the changes you ask for while keeping important details like facial likeness consistent across edits—now with generation speeds up to **4× faster**, making it easier to iterate and explore ideas with less waiting.
This is our most capable general-purpose text-to-image model to date, with more expressive transformations, improved dense text rendering, and more natural-looking results. Whether you’re making a tiny fix or a total reinvention, you can simply say what you want—or choose from preset styles and ideas in the new Images experience—and ChatGPT handles the rest, delivering results that are both useful and compelling, and better match your intent.
The new Images model and experience is beginning to roll out today in ChatGPT for all users, and in the API as **GPT‑Image-1.5**.
---
## Results that match your intent
The model now follows instructions more reliably—down to the small details—changing what you ask for while able to keep elements like lighting, composition, and likeness consistent across inputs, outputs, and subsequent edits.
This unlocks results that match your intent—more useful photo edits, more believable clothing and hairstyle try-ons, alongside stylistic filters and conceptual transformations that retain the essence of the original image. Together, these improvements mean ChatGPT can act as a creative studio in your pocket, capable of both practical edits and expressive reimaginings.
### Editing
The model excels at different types of editing so you get the changes you want without losing what makes the image special.
### Creative Transformations
The model’s creativity shines with creative transformations, changing and adding elements—like text and layout—that help the concept come to life while maintaining important details.
### Instruction Following
The model is able to better follow instructions versus GPT Image 1.0.
### Text Rendering
The model takes another step ahead in text rendering, capable of handling denser and smaller text.
---
## A new creation space
In addition to asking for images through ChatGPT by describing what you’d like to see, we’re also introducing a dedicated Images experience in the ChatGPT sidebar to make exploring and trying images easier and quicker. This includes preset filters and trending prompts to jump-start inspiration, as well as a one-time likeness upload so you can reuse your appearance across future creations without the need to go through your camera roll again.
Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings. Images now render up to four times faster, and you can continue generating new images while others are still in progress—so you can explore more ideas without waiting.
Read more

Additional quality improvements
The model also improves on additional dimensions that translate to more immediately usable outputs, like rendering many small faces and how natural outputs look.
1970s LondonMany small facesDiver playing pianoPhoto with glare
New
make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…
Read more

Previous
make a scene in chelsea, london in the 1970s, photorealistic, everything in focus, with tons of people, and a bus with an advertisement for "ImageGen 1.5" with the OpenAI logo and subtitle "Create what you imagine". Hyper-realistic amateur photography, iPhone snapshot quality…
Read more

A new creation space
In addition to generating images by describing what you’d like to see in a message, we’re introducing a dedicated home for Images in ChatGPT—available in the sidebar through the mobile app and on chatgpt.com—to make exploring and trying images faster and easier. It includes dozens of preset filters and prompts to jump-start inspiration, updated regularly to reflect emerging trends.
Together, these upgrades let you create images that better match your vision, from small edits to full reimaginings.
Improvements and limitations
We reran many of the examples from our initial image generation launch to evaluate performance. The model shows clear improvements across a range of cases, though results remain imperfect. While this release represents meaningful progress, there is still significant room for improvement in future iterations.
Deep sea poster (Improvement)World capitals (Improvement)Styles (Limitation)Multiple faces (Limitation)Multilingual (Limitation)
New
create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

Previous
create a poster of deep sea creatures at different depths, with a vertical ocean cutaway, styled in a beautiful japanese detailed anime style

Still some scientific inaccuracies, but ~70% correct and much more vivid graphics, avoids premature cropping.
GPT Image 1.5 in the API
GPT Image 1.5 in the API delivers all the same improvements as ChatGPT Images: it’s stronger at image preservation and editing than GPT Image 1.
You’ll see more consistent preservation of branded logos and key visuals across edits, making it well suited for marketing and brand work like graphics and logo creation, and for ecommerce teams generating full product image catalogs (variants, scenes, and angles) from a single-source image.
Image inputs and outputs are now 20% cheaper in GPT Image 1.5 as compared to GPT Image 1, so you can generate and iterate on more images with the same budget.
You can try the new model in the OpenAI Playground or read the prompt guide for inspiration.
Enterprises and startups across industries, including creative tools, e-commerce, marketing software, and more are already using GPT Image 1.5.
WixCanvaHiggsfieldFigma WeaveEnvato
New

Previous

“GPT Image 1.5 generates high-fidelity images with strong prompt adherence, preserving composition, lighting, and fine-grained detail. The results are clean, realistic, and reliable, supporting faster concept-to-production workflows on platforms like Wix. Based on our testing and the main use cases we see at Wix, the consistency and quality compete to make it one of the flagship image generation models today.”
— Hila Gat, Head of AI Research and Data Science at Wix
Availability
The new ChatGPT Images is rolling out now to all ChatGPT users and API users globally today across surfaces. It works across models, so you don’t need to select anything in order to use it. The version of ChatGPT Images that launched earlier this year will remain available to all users as a custom GPT.
We believe we’re still at the beginning of what image generation can enable. Today’s update is a meaningful step forward with more to come, from finer-grained edits to richer, more detailed outputs across languages.
Generated by RSStT. The copyright belongs to the original author.