ChatGPT Images 2.0發布,帶來精準編輯與思考級圖像生成
AI 語音朗讀 · Edge TTS
ChatGPT Images 2.0發布,帶來精準編輯與思考級圖像生成。
OpenAI於2026年4月21日推出「ChatGPT Images 2.0」,這款頂尖圖像模型能處理複雜視覺任務,產生精準且立即可用的視覺內容,具備更銳利的編輯、更豐富的布局,以及思考級人工智慧。
精準指令遵循與物件配置
ChatGPT Images 2.0在詳細指令遵循、精確放置與關聯物件、渲染密集文字等方面實現階躍式進步,並支援跨寬高比生成。它利用擴充的視覺與世界知識,自動填補提示中的空白,讓使用者以更少提示獲得更智慧的圖像。
更高精度與控制力
此模型能概念化更複雜的圖像,並有效實現該願景。它嚴格遵循指令、保留要求細節,並渲染常讓圖像模型崩潰的細微元素,包括:
- 小型文字
- 圖示
- 使用者介面元素
- 密集構圖
- 細膩風格限制
所有輸出最高支援2K解析度,例如能精準渲染米粒細節。
跨語言強大表現
ChatGPT Images 2.0能產生非英文文字的圖像,不僅渲染正確,還確保語言流暢連貫。這提升模型全球實用性,讓使用者以日常語言創作適用視覺內容。
風格精進與寫實度
模型更擅長捕捉照片、電影靜態畫、像素藝術、漫畫等獨特視覺語言的定義特徵,在紋理、光線、構圖與細節上展現更高一致性。這特別適用於遊戲原型設計、分鏡腳本、行銷創意,以及特定媒介或類型的asset創作。
靈活寬高比支援
ChatGPT Images 2.0支援寬達3:1或高達1:3的寬高比,能直接生成適合寬幅橫幅、簡報投影片、海報或社群圖形等格式的輸出。
視覺思考夥伴功能
這是OpenAI首款具思考能力的圖像模型。在ChatGPT中選用思考模型時,Images 2.0能:
- 搜尋網路即時資訊
- 從單一提示產生多張獨特圖像
- 雙重檢查自身輸出
- 甚至生成可功能運作的QR碼
這讓它承擔從idea到圖像的更多重任,尤其在精準度、最新資訊、一致性與視覺凝聚力至關重要的情境。
真實世界智慧升級
ChatGPT Images 2.0更新知識截止至2025年12月,具備End to End (端到端)處理任務的智慧,從文案撰寫、分析到設計構圖皆游刃有餘。
立即可用性與存取方式
ChatGPT Images 2.0即日起對所有ChatGPT與Codex使用者開放。具思考功能的圖像限ChatGPT Plus、Pro及Business使用者(Enterprise即將支援)。行動裝置端請更新至最新App版本。底層模型「gpt-image-2」已透過API提供。詳見https://openai.com/index/introducing-chatgpt-images-2-0/ 與 https://chatgpt.com/images/。
Introducing ChatGPT Images 2.0
— OpenAI (@OpenAI) April 21, 2026
A state-of-the-art image model that can take on complex visual tasks and produce precise, immediately usable visuals, with sharper editing, richer layouts, and thinking-level intelligence.
Video made with ChatGPT Images pic.twitter.com/3aWfXakrcR
ChatGPT Images 2.0 is a step change in detailed instruction following, placing and relating objects accurately, and rendering dense text, with the ability to generate across aspect ratios.
— OpenAI (@OpenAI) April 21, 2026
It’s also accurate across languages and uses its expanded visual and world knowledge to…
Greater Precision and Control
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 can conceptualize more sophisticated images, and then actually bring that vision to life effectively.
It’s able to follow instructions, preserve requested details, and render the fine-grained elements that often break image… pic.twitter.com/n29165pV9Q
Stronger Across Languages
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 can produce images with non-English text that’s not only rendered correctly but with language that flows coherently.
This makes the model more globally useful and helps people create visuals that work in the languages they actually… pic.twitter.com/51k3xScOXm
Stylistic Sophistication and Photo Realism
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 is better able to capture the defining characteristics of photos, as well as cinematic stills, pixel art, manga, and other distinctive visual languages, with greater consistency in texture, lighting, composition, and… pic.twitter.com/iFDG48EdgE
Flexible Aspect Ratios
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 supports aspect ratios as wide as 3:1 and as tall as 1:3.
It can generate outputs that are ready to fit the formats you need, from wide banners and presentation slides to posters and social graphics. pic.twitter.com/747WjjzhYr
A Visual Thought Partner
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 is our first image model with thinking capabilities.
When a thinking model is selected in ChatGPT, Images 2.0 can search the web for real-time information, create multiple distinct images from one prompt, double-check its own outputs,… pic.twitter.com/QjnGJ8MnJa
Real-World Intelligence
— OpenAI (@OpenAI) April 21, 2026
ChatGPT Images 2.0 has an updated knowledge cutoff of December 2025 and intelligence that allows it to expertly handle tasks end-to-end, from copywriting to analysis to design composition. pic.twitter.com/gMZaNtCt76
ChatGPT Images 2.0 is available starting today to all ChatGPT and Codex users.
— OpenAI (@OpenAI) April 21, 2026
Images with thinking are available to ChatGPT Plus, Pro, and Business users (Enterprise soon). On mobile, make sure you update to the latest version of the app.
The underlying model, gpt-image-2, is…
