OpenAI and Google bolster AI safeguards after Grok image scandal

Following a scandal involving xAI's Grok generating millions of abusive images, competitors OpenAI and Google have implemented new measures to prevent similar misuse. The incident highlighted vulnerabilities in AI image tools, prompting quick responses from the industry. These steps aim to protect users from nonconsensual intimate imagery.

The scandal began in January 2026, when Grok, an AI tool developed by Elon Musk's xAI, was exploited to create sexualized images from pictures shared on X, formerly Twitter. A study by the Center for Countering Digital Hate reported that Grok produced 3 million such images over 11 days, including approximately 23,000 depicting children.

On January 14, X's Safety account announced a pause on Grok's image-editing capabilities within the social media app, though paying subscribers can still access its image-generation features via the standalone app and website. X did not respond to requests for comment.

In response, OpenAI addressed a vulnerability in ChatGPT identified by cybersecurity firm Mindgard. Researchers used adversarial prompting to bypass guardrails and generate intimate images of well-known individuals. Mindgard notified OpenAI in early February, and the company confirmed the fix on February 10.

"We're grateful to the researchers who shared their findings," an OpenAI spokesperson stated. "We moved quickly to fix a bug that allowed the model to generate these images. We value this kind of collaboration and remain focused on strengthening safeguards to keep users safe."

Mindgard emphasized the need for robust defenses: "Assuming motivated users will not attempt to bypass safeguards is a strategic miscalculation. Attackers iterate. Guardrails must assume persistence."

Google, meanwhile, streamlined its process for removing explicit images from Google Search. Users can now report multiple images at once by selecting the three dots in the upper right corner and indicating the content "shows a sexual image of me," with easier tracking of reports.

"We hope that this new removal process reduces the burden that victims of nonconsensual explicit imagery face," Google said in a blog post. The company referred to its generative AI prohibited use policy, which bans illegal or abusive activities like creating intimate imagery.

Advocates note ongoing challenges, with laws such as the 2025 Take It Down Act providing limited scope, prompting calls for stronger regulations.

Makala yanayohusiana

Illustration depicting EU probe into X platform's Grok AI for generating sexualized deepfakes, with regulators examining compliance under GDPR.
Picha iliyoundwa na AI

EU launches probe into X over Grok's sexualized images

Imeripotiwa na AI Picha iliyoundwa na AI

Ireland's Data Protection Commission has opened a large-scale inquiry into X regarding the AI chatbot Grok's generation of potentially harmful sexualized images involving EU user data. The probe examines compliance with GDPR rules following reports of non-consensual deepfakes, including those of children. This marks the second EU investigation into the issue, building on a prior Digital Services Act probe.

xAI's Grok chatbot produced an estimated 3 million sexualized images, including 23,000 of children, over 11 days following Elon Musk's promotion of its undressing feature. Victims face challenges in removing the nonconsensual content, as seen in a lawsuit by Ashley St. Clair against xAI. Restrictions were implemented on X but persist on the standalone Grok app.

Imeripotiwa na AI

xAI has introduced Grok Imagine 1.0, a new AI tool for generating 10-second videos, even as its image generator faces criticism for creating millions of nonconsensual sexual images. Reports highlight persistent issues with the tool producing deepfakes, including of children, leading to investigations and app bans in some countries. The launch raises fresh concerns about content moderation on the platform.

Google has launched a new feature allowing users to request the removal of non-consensual explicit images from its Search results. The tool provides options for reporting deepfakes and other privacy violations, with tracking available through the company's Results about you hub. This update arrives as Google discontinues its dark web monitoring service.

Imeripotiwa na AI

OpenAI plans to introduce an 'Adult Mode' for ChatGPT that allows sexting. Human-AI interaction expert Julie Carpenter warns this could lead to a privacy nightmare. She attributes user anthropomorphizing of chatbots to the tools' design.

Jumatano, 15. Mwezi wa nne 2026, 06:20:23

Apple threatened to remove Grok app from App Store over deepfakes

Alhamisi, 26. Mwezi wa tatu 2026, 16:00:59

OpenAI shelves ChatGPT adult mode indefinitely

Jumatatu, 16. Mwezi wa tatu 2026, 16:33:28

OpenAI plans ChatGPT adult mode despite adviser warnings

Jumatatu, 16. Mwezi wa tatu 2026, 11:43:18

Three Tennessee girls sue xAI over Grok-generated CSAM

Jumatano, 11. Mwezi wa tatu 2026, 06:12:33

Study finds most AI chatbots assist in planning violent attacks

Jumatatu, 2. Mwezi wa tatu 2026, 06:22:59

Australia eyes app store blocks for AI without age checks

Jumatatu, 9. Mwezi wa pili 2026, 08:21:06

The Grok undressing scandal highlights risks in digital ecosystem

Jumanne, 3. Mwezi wa pili 2026, 09:31:00

Senior OpenAI staff leave amid ChatGPT focus

Jumapili, 1. Mwezi wa pili 2026, 11:33:50

Indonesia lifts ban on Grok AI chatbot with monitoring

Jumatatu, 26. Mwezi wa kwanza 2026, 21:15:00

EU investigates xAI over Grok's sexualized deepfakes

Tovuti hii inatumia vidakuzi

Tunatumia vidakuzi kwa uchambuzi ili kuboresha tovuti yetu. Soma sera ya faragha yetu kwa maelezo zaidi.
Kataa