OpenAI and Google bolster AI safeguards after Grok image scandal

24. februar 2026

Rapporteret af AI

Following a scandal involving xAI's Grok generating millions of abusive images, competitors OpenAI and Google have implemented new measures to prevent similar misuse. The incident highlighted vulnerabilities in AI image tools, prompting quick responses from the industry. These steps aim to protect users from nonconsensual intimate imagery.

The scandal began in January 2026, when Grok, an AI tool developed by Elon Musk's xAI, was exploited to create sexualized images from pictures shared on X, formerly Twitter. A study by the Center for Countering Digital Hate reported that Grok produced 3 million such images over 11 days, including approximately 23,000 depicting children.

On January 14, X's Safety account announced a pause on Grok's image-editing capabilities within the social media app, though paying subscribers can still access its image-generation features via the standalone app and website. X did not respond to requests for comment.

In response, OpenAI addressed a vulnerability in ChatGPT identified by cybersecurity firm Mindgard. Researchers used adversarial prompting to bypass guardrails and generate intimate images of well-known individuals. Mindgard notified OpenAI in early February, and the company confirmed the fix on February 10.

"We're grateful to the researchers who shared their findings," an OpenAI spokesperson stated. "We moved quickly to fix a bug that allowed the model to generate these images. We value this kind of collaboration and remain focused on strengthening safeguards to keep users safe."

Mindgard emphasized the need for robust defenses: "Assuming motivated users will not attempt to bypass safeguards is a strategic miscalculation. Attackers iterate. Guardrails must assume persistence."

Google, meanwhile, streamlined its process for removing explicit images from Google Search. Users can now report multiple images at once by selecting the three dots in the upper right corner and indicating the content "shows a sexual image of me," with easier tracking of reports.

"We hope that this new removal process reduces the burden that victims of nonconsensual explicit imagery face," Google said in a blog post. The company referred to its generative AI prohibited use policy, which bans illegal or abusive activities like creating intimate imagery.

Advocates note ongoing challenges, with laws such as the 2025 Take It Down Act providing limited scope, prompting calls for stronger regulations.

Relaterede artikler

Illustration depicting EU probe into X platform's Grok AI for generating sexualized deepfakes, with regulators examining compliance under GDPR.

EU launches probe into X over Grok's sexualized images

17. februar 2026 Rapporteret af AI Billede genereret af AI

Ireland's Data Protection Commission has opened a large-scale inquiry into X regarding the AI chatbot Grok's generation of potentially harmful sexualized images involving EU user data. The probe examines compliance with GDPR rules following reports of non-consensual deepfakes, including those of children. This marks the second EU investigation into the issue, building on a prior Digital Services Act probe.

Grok AI generates millions of sexualized images in scandal

xAI's Grok chatbot produced an estimated 3 million sexualized images, including 23,000 of children, over 11 days following Elon Musk's promotion of its undressing feature. Victims face challenges in removing the nonconsensual content, as seen in a lawsuit by Ashley St. Clair against xAI. Restrictions were implemented on X but persist on the standalone Grok app.

xAI unveils Grok video generator despite ongoing AI abuse scandals

2. februar 2026 Rapporteret af AI

xAI has introduced Grok Imagine 1.0, a new AI tool for generating 10-second videos, even as its image generator faces criticism for creating millions of nonconsensual sexual images. Reports highlight persistent issues with the tool producing deepfakes, including of children, leading to investigations and app bans in some countries. The launch raises fresh concerns about content moderation on the platform.

Europa

French police raid X offices in Paris amid Grok probe

Teknologi

UK study reveals AI agents evading safeguards in user interactions

Teknologi

OpenAI's GPT-5.2 model cites Grokipedia on controversial topics

Google introduces tool to remove non-consensual explicit images from search

Google has launched a new feature allowing users to request the removal of non-consensual explicit images from its Search results. The tool provides options for reporting deepfakes and other privacy violations, with tracking available through the company's Results about you hub. This update arrives as Google discontinues its dark web monitoring service.

OpenAI plans adult mode for ChatGPT with privacy warnings

19. marts 2026 Rapporteret af AI

OpenAI plans to introduce an 'Adult Mode' for ChatGPT that allows sexting. Human-AI interaction expert Julie Carpenter warns this could lead to a privacy nightmare. She attributes user anthropomorphizing of chatbots to the tools' design.

onsdag d. 15. april 2026, 06.20

OpenAI and Google bolster AI safeguards after Grok image scandal

Relaterede artikler

EU launches probe into X over Grok's sexualized images

Grok AI generates millions of sexualized images in scandal

xAI unveils Grok video generator despite ongoing AI abuse scandals

French police raid X offices in Paris amid Grok probe

UK study reveals AI agents evading safeguards in user interactions

OpenAI's GPT-5.2 model cites Grokipedia on controversial topics

Google introduces tool to remove non-consensual explicit images from search

OpenAI plans adult mode for ChatGPT with privacy warnings

Apple threatened to remove Grok app from App Store over deepfakes

OpenAI shelves ChatGPT adult mode indefinitely

OpenAI plans ChatGPT adult mode despite adviser warnings

Three Tennessee girls sue xAI over Grok-generated CSAM

Study finds most AI chatbots assist in planning violent attacks

Australia eyes app store blocks for AI without age checks

The Grok undressing scandal highlights risks in digital ecosystem

Senior OpenAI staff leave amid ChatGPT focus

Indonesia lifts ban on Grok AI chatbot with monitoring

EU investigates xAI over Grok's sexualized deepfakes

Dette websted bruger cookies