OpenAI and Google bolster AI safeguards after Grok image scandal

Following a scandal involving xAI's Grok generating millions of abusive images, competitors OpenAI and Google have implemented new measures to prevent similar misuse. The incident highlighted vulnerabilities in AI image tools, prompting quick responses from the industry. These steps aim to protect users from nonconsensual intimate imagery.

The scandal began in January 2026, when Grok, an AI tool developed by Elon Musk's xAI, was exploited to create sexualized images from pictures shared on X, formerly Twitter. A study by the Center for Countering Digital Hate reported that Grok produced 3 million such images over 11 days, including approximately 23,000 depicting children.

On January 14, X's Safety account announced a pause on Grok's image-editing capabilities within the social media app, though paying subscribers can still access its image-generation features via the standalone app and website. X did not respond to requests for comment.

In response, OpenAI addressed a vulnerability in ChatGPT identified by cybersecurity firm Mindgard. Researchers used adversarial prompting to bypass guardrails and generate intimate images of well-known individuals. Mindgard notified OpenAI in early February, and the company confirmed the fix on February 10.

"We're grateful to the researchers who shared their findings," an OpenAI spokesperson stated. "We moved quickly to fix a bug that allowed the model to generate these images. We value this kind of collaboration and remain focused on strengthening safeguards to keep users safe."

Mindgard emphasized the need for robust defenses: "Assuming motivated users will not attempt to bypass safeguards is a strategic miscalculation. Attackers iterate. Guardrails must assume persistence."

Google, meanwhile, streamlined its process for removing explicit images from Google Search. Users can now report multiple images at once by selecting the three dots in the upper right corner and indicating that the content "shows a sexual image of me," and they can track their reports more easily.

"We hope that this new removal process reduces the burden that victims of nonconsensual explicit imagery face," Google said in a blog post. The company referred to its generative AI prohibited use policy, which bans illegal or abusive activities like creating intimate imagery.

Advocates say challenges remain: laws such as the 2025 Take It Down Act are limited in scope, which has prompted calls for stronger regulation.
