Wikimedia foundation partners with ai firms for wikipedia data access

The Wikimedia Foundation has announced new licensing deals with major AI companies including Microsoft, Meta, and Amazon to provide paid access to Wikipedia content. These partnerships aim to offset rising infrastructure costs caused by AI scraping. The deals mark a shift from unauthorized data use to commercial API access through Wikimedia Enterprise.

On January 15, 2026, the Wikimedia Foundation revealed partnerships with AI developers such as Microsoft, Meta, Amazon, Perplexity, and Mistral AI as part of Wikipedia's 25th anniversary celebrations. These companies, previously known for scraping Wikipedia's vast repository of 65 million articles without permission, have now joined the nonprofit's commercial subsidiary, Wikimedia Enterprise. The program offers high-throughput APIs for faster, higher-volume access to Wikipedia and related projects like Wikivoyage, Wikibooks, and Wikiquote, helping to sustain the organization's operations amid surging costs.

The initiative addresses a growing financial strain on the foundation, which relies primarily on small public donations. Last year, Wikimedia raised alarms about an existential threat from reduced website traffic due to large language models (LLMs) and AI chatbots summarizing content without directing users to the source. In April 2025, bandwidth for downloading multimedia content increased by 50 percent since January 2024, with bots accounting for 65 percent of the most expensive infrastructure requests despite comprising only 35 percent of total pageviews. By October 2025, human traffic had declined about 8 percent year-over-year after improved bot-detection measures revealed many 'visitors' were automated scrapers.

This traffic drop disrupts Wikipedia's traditional feedback loop, where readers become editors or donors, enhancing content quality. Meanwhile, AI firms use the human-curated data to power tools like Microsoft Copilot and OpenAI's ChatGPT. Lane Becker, president of Wikimedia Enterprise, emphasized the importance of financial support: “Wikipedia is a critical component of these tech companies’ work that they need to figure out how to support financially... all our Big Tech partners really see the need for them to commit to sustaining Wikipedia's work.”

Wikipedia founder Jimmy Wales supports AI training on the data but insists on compensation: “I’m very happy personally that AI models are training on Wikipedia data because it’s human curated... You should probably chip in and pay for your fair share of the cost that you’re putting on us.” The new deals join earlier ones, such as Google's 2022 agreement, though financial terms remain undisclosed. The foundation has faced internal resistance to its own AI experiments, pausing a pilot for AI-generated summaries in June after editor backlash.

관련 기사

Elon Musk on stage launching Grokipedia, with a screen showing the AI encyclopedia rivaling Wikipedia, in a modern tech setting.
AI에 의해 생성된 이미지

Musk’s Grokipedia launches as AI-built rival to Wikipedia

AI에 의해 보고됨 AI에 의해 생성된 이미지 사실 확인됨

Elon Musk has launched Grokipedia, an AI-generated online encyclopedia tied to his xAI chatbot Grok, positioning it as a challenger to Wikipedia. Musk said on X that his goal is to build “an open source, comprehensive collection of all knowledge,” after repeatedly criticizing what he calls Wikipedia’s left-leaning bias.

Meta has agreed to a three-year AI licensing deal with News Corp, paying up to $50 million annually for content from The Wall Street Journal and other brands. The arrangement allows Meta to use the material in its AI chatbot responses and for training models. News Corp confirmed the deal, highlighting its strategy of partnering with AI firms or pursuing legal action against unauthorized use.

AI에 의해 보고됨

Encyclopedia Britannica and its subsidiary Merriam-Webster have sued OpenAI, alleging copyright infringement for using their content to train AI models like ChatGPT without permission, as well as trademark infringement from the AI falsely attributing hallucinations to Britannica. The suit claims ChatGPT reproduces verbatim or near-verbatim portions, summaries, or abridgments of their works, cannibalizing traffic to their sites.

일본 경제산업성은 국내 기업의 머신러닝용 대량 데이터 처리에 재정 지원을 제공한다. 제조업 데이터에 초점을 맞춰 국산 AI 성능을 강화하고 제품 경쟁력 및 생산성을 높인다. 2026 회계연도부터 5년간 1조 엔을 투자할 계획이다.

AI에 의해 보고됨

A Cornell University study reveals that AI tools like ChatGPT have increased researchers' paper output by up to 50%, particularly benefiting non-native English speakers. However, this surge in polished manuscripts is complicating peer review and funding decisions, as many lack substantial scientific value. The findings highlight a shift in global research dynamics and call for updated policies on AI use in academia.

South African news organizations are grappling with the misuse of their content by social media accounts posing as legitimate news sites. Journalists highlight the erosion of ethical standards and call for stronger regulations on digital platforms. The rise of AI-generated content adds further challenges to the industry.

AI에 의해 보고됨

The Writers Guild of America plans to demand compensation for scripts used to train AI models during upcoming contract talks with studios. Negotiations with the Alliance of Motion Picture and Television Producers are set to begin next week, amid concerns over health fund deficits and other issues from the 2023 strike. Union leaders emphasize the need for fair payments while noting that AI protections secured previously have held up.

 

 

 

이 웹사이트는 쿠키를 사용합니다

사이트를 개선하기 위해 분석을 위한 쿠키를 사용합니다. 자세한 내용은 개인정보 보호 정책을 읽으세요.
거부