OpenAI's GPT Image 1.5 advances conversational photo editing amid ethical concerns

Building on yesterday's ChatGPT image upgrade, OpenAI has detailed GPT Image 1.5, a multimodal model enabling precise conversational photo edits. It responds to rivals like Google's Nano Banana while introducing safeguards against misuse.

OpenAI's image update, rolled out December 16 and detailed further on December 17, introduces GPT Image 1.5, a native multimodal system that treats text prompts and image pixels as unified tokens. This enables conversational edits such as altering poses, removing objects, adjusting clothing, or refining details while preserving faces. It builds on the faster generation and improved instruction-following highlighted previously.

The model is four times faster than its predecessor and 20% cheaper via the API, and it is integrated into a new sidebar space in ChatGPT with presets and prompts. Fidji Simo, OpenAI's CEO of applications, noted: "Creating and editing images is a different kind of task and deserves a space built for visuals."

The release counters Google's Nano Banana (also called Nano Banana Pro), praised for realistic edits and text rendering since August. GPT Image 1.5 improves in both areas but still lags in some drawing styles and in scientific accuracy.

Advanced editing heightens ethical risks, including deepfakes and non-consensual content. OpenAI deploys filters for sexual and violent material, attaches C2PA metadata (which can be removed), and continues to refine these safeguards. Broader issues include creator backlash over the use of likenesses and copyrighted work. OpenAI has struck deals such as one with Disney for character use in 2026, even as it faces lawsuits from Ziff Davis.

OpenAI maintains: "We believe we’re still at the beginning of what image generation can enable," signaling more multimodal advancements.
