OpenAI and Google bolster AI safeguards after Grok image scandal

Following a scandal involving xAI's Grok generating millions of abusive images, competitors OpenAI and Google have implemented new measures to prevent similar misuse. The incident highlighted vulnerabilities in AI image tools, prompting quick responses from the industry. These steps aim to protect users from nonconsensual intimate imagery.

The scandal began in January 2026, when Grok, an AI tool developed by Elon Musk's xAI, was exploited to create sexualized images from pictures shared on X, formerly Twitter. A study by the Center for Countering Digital Hate reported that Grok produced 3 million such images over 11 days, including approximately 23,000 depicting children.

On January 14, X's Safety account announced a pause on Grok's image-editing capabilities within the social media app, though paying subscribers can still access its image-generation features via the standalone app and website. X did not respond to requests for comment.

In response, OpenAI addressed a vulnerability in ChatGPT identified by cybersecurity firm Mindgard. Researchers used adversarial prompting to bypass guardrails and generate intimate images of well-known individuals. Mindgard notified OpenAI in early February, and the company confirmed the fix on February 10.

"We're grateful to the researchers who shared their findings," an OpenAI spokesperson stated. "We moved quickly to fix a bug that allowed the model to generate these images. We value this kind of collaboration and remain focused on strengthening safeguards to keep users safe."

Mindgard emphasized the need for robust defenses: "Assuming motivated users will not attempt to bypass safeguards is a strategic miscalculation. Attackers iterate. Guardrails must assume persistence."

Google, meanwhile, streamlined its process for removing explicit images from Google Search. Users can now report multiple images at once by selecting the three dots in the upper right corner and indicating the content "shows a sexual image of me," with easier tracking of reports.

"We hope that this new removal process reduces the burden that victims of nonconsensual explicit imagery face," Google said in a blog post. The company referred to its generative AI prohibited use policy, which bans illegal or abusive activities like creating intimate imagery.

Advocates note ongoing challenges, with laws such as the 2025 Take It Down Act providing limited scope, prompting calls for stronger regulations.

Relaterede artikler

Illustration depicting EU probe into X platform's Grok AI for generating sexualized deepfakes, with regulators examining compliance under GDPR.
Billede genereret af AI

EU launches probe into X over Grok's sexualized images

Rapporteret af AI Billede genereret af AI

Ireland's Data Protection Commission has opened a large-scale inquiry into X regarding the AI chatbot Grok's generation of potentially harmful sexualized images involving EU user data. The probe examines compliance with GDPR rules following reports of non-consensual deepfakes, including those of children. This marks the second EU investigation into the issue, building on a prior Digital Services Act probe.

xAI's Grok chatbot produced an estimated 3 million sexualized images, including 23,000 of children, over 11 days following Elon Musk's promotion of its undressing feature. Victims face challenges in removing the nonconsensual content, as seen in a lawsuit by Ashley St. Clair against xAI. Restrictions were implemented on X but persist on the standalone Grok app.

Rapporteret af AI

xAI has introduced Grok Imagine 1.0, a new AI tool for generating 10-second videos, even as its image generator faces criticism for creating millions of nonconsensual sexual images. Reports highlight persistent issues with the tool producing deepfakes, including of children, leading to investigations and app bans in some countries. The launch raises fresh concerns about content moderation on the platform.

Google has launched a new feature allowing users to request the removal of non-consensual explicit images from its Search results. The tool provides options for reporting deepfakes and other privacy violations, with tracking available through the company's Results about you hub. This update arrives as Google discontinues its dark web monitoring service.

Rapporteret af AI

OpenAI plans to introduce an 'Adult Mode' for ChatGPT that allows sexting. Human-AI interaction expert Julie Carpenter warns this could lead to a privacy nightmare. She attributes user anthropomorphizing of chatbots to the tools' design.

Dette websted bruger cookies

Vi bruger cookies til analyse for at forbedre vores side. Læs vores privatlivspolitik for mere information.
Afvis