OpenAI's GPT Image 1.5 advances conversational photo editing amid ethical concerns

Building on yesterday's ChatGPT image upgrade, OpenAI has detailed GPT Image 1.5, a multimodal model enabling precise conversational photo edits. It responds to rivals like Google's Nano Banana while introducing safeguards against misuse.

OpenAI's image update, rolled out December 16 and detailed further on December 17, introduces GPT Image 1.5, a native multimodal system that treats text prompts and image pixels as unified tokens. This enables conversational edits, such as altering poses, removing objects, adjusting clothing, or refining details while preserving faces, and builds on the faster generation and improved instruction-following highlighted previously.

The model is four times faster than its predecessor and 20% cheaper through the API, and it is integrated into a new ChatGPT sidebar space with presets and prompts. Fidji Simo, OpenAI's CEO of applications, noted: "Creating and editing images is a different kind of task and deserves a space built for visuals."

The release counters Google's Nano Banana, praised for realistic edits and text rendering since August, and its upgraded successor, Nano Banana Pro. GPT Image 1.5 improves in these areas but still lags in some drawing styles and scientific accuracy.

More capable editing heightens ethical risks, including deepfakes and non-consensual content. OpenAI deploys filters for sexual and violent material, attaches C2PA metadata (which can be removed), and says refinements are ongoing. Broader issues include creator backlash over likenesses and copyrights. OpenAI's deal with Disney for character use in 2026 stands in contrast to lawsuits such as the one from Ziff Davis.

OpenAI maintains: "We believe we’re still at the beginning of what image generation can enable," signaling more multimodal advancements.
