Wikimedia foundation partners with ai firms for wikipedia data access

The Wikimedia Foundation has announced new licensing deals with major AI companies including Microsoft, Meta, and Amazon to provide paid access to Wikipedia content. These partnerships aim to offset rising infrastructure costs caused by AI scraping. The deals mark a shift from unauthorized data use to commercial API access through Wikimedia Enterprise.

On January 15, 2026, the Wikimedia Foundation revealed partnerships with AI developers such as Microsoft, Meta, Amazon, Perplexity, and Mistral AI as part of Wikipedia's 25th anniversary celebrations. These companies, previously known for scraping Wikipedia's vast repository of 65 million articles without permission, have now joined the nonprofit's commercial subsidiary, Wikimedia Enterprise. The program offers high-throughput APIs for faster, higher-volume access to Wikipedia and related projects like Wikivoyage, Wikibooks, and Wikiquote, helping to sustain the organization's operations amid surging costs.

The initiative addresses a growing financial strain on the foundation, which relies primarily on small public donations. Last year, Wikimedia raised alarms about an existential threat from reduced website traffic due to large language models (LLMs) and AI chatbots summarizing content without directing users to the source. In April 2025, bandwidth for downloading multimedia content increased by 50 percent since January 2024, with bots accounting for 65 percent of the most expensive infrastructure requests despite comprising only 35 percent of total pageviews. By October 2025, human traffic had declined about 8 percent year-over-year after improved bot-detection measures revealed many 'visitors' were automated scrapers.

This traffic drop disrupts Wikipedia's traditional feedback loop, where readers become editors or donors, enhancing content quality. Meanwhile, AI firms use the human-curated data to power tools like Microsoft Copilot and OpenAI's ChatGPT. Lane Becker, president of Wikimedia Enterprise, emphasized the importance of financial support: “Wikipedia is a critical component of these tech companies’ work that they need to figure out how to support financially... all our Big Tech partners really see the need for them to commit to sustaining Wikipedia's work.”

Wikipedia founder Jimmy Wales supports AI training on the data but insists on compensation: “I’m very happy personally that AI models are training on Wikipedia data because it’s human curated... You should probably chip in and pay for your fair share of the cost that you’re putting on us.” The new deals join earlier ones, such as Google's 2022 agreement, though financial terms remain undisclosed. The foundation has faced internal resistance to its own AI experiments, pausing a pilot for AI-generated summaries in June after editor backlash.

相关文章

Elon Musk on stage launching Grokipedia, with a screen showing the AI encyclopedia rivaling Wikipedia, in a modern tech setting.
AI 生成的图像

Musk’s Grokipedia launches as AI-built rival to Wikipedia

由 AI 报道 AI 生成的图像 事实核查

Elon Musk has launched Grokipedia, an AI-generated online encyclopedia tied to his xAI chatbot Grok, positioning it as a challenger to Wikipedia. Musk said on X that his goal is to build “an open source, comprehensive collection of all knowledge,” after repeatedly criticizing what he calls Wikipedia’s left-leaning bias.

Meta has agreed to a three-year AI licensing deal with News Corp, paying up to $50 million annually for content from The Wall Street Journal and other brands. The arrangement allows Meta to use the material in its AI chatbot responses and for training models. News Corp confirmed the deal, highlighting its strategy of partnering with AI firms or pursuing legal action against unauthorized use.

由 AI 报道

Encyclopedia Britannica and its subsidiary Merriam-Webster have sued OpenAI, alleging copyright infringement for using their content to train AI models like ChatGPT without permission, as well as trademark infringement from the AI falsely attributing hallucinations to Britannica. The suit claims ChatGPT reproduces verbatim or near-verbatim portions, summaries, or abridgments of their works, cannibalizing traffic to their sites.

日本经济产业省将为国内企业提供资金支持,用于处理海量数据以供机器学习使用。重点针对制造业数据,以提升本土开发AI的性能,从而加强产品竞争力和生产力。该省计划从2026财年开始5年内投资1万亿日元。

由 AI 报道

A Cornell University study reveals that AI tools like ChatGPT have increased researchers' paper output by up to 50%, particularly benefiting non-native English speakers. However, this surge in polished manuscripts is complicating peer review and funding decisions, as many lack substantial scientific value. The findings highlight a shift in global research dynamics and call for updated policies on AI use in academia.

South African news organizations are grappling with the misuse of their content by social media accounts posing as legitimate news sites. Journalists highlight the erosion of ethical standards and call for stronger regulations on digital platforms. The rise of AI-generated content adds further challenges to the industry.

由 AI 报道

The Writers Guild of America plans to demand compensation for scripts used to train AI models during upcoming contract talks with studios. Negotiations with the Alliance of Motion Picture and Television Producers are set to begin next week, amid concerns over health fund deficits and other issues from the 2023 strike. Union leaders emphasize the need for fair payments while noting that AI protections secured previously have held up.

 

 

 

此网站使用 cookie

我们使用 cookie 进行分析以改进我们的网站。阅读我们的 隐私政策 以获取更多信息。
拒绝