Wikipedia Collaborates with Kaggle to Offer AI Developers Structured Data, Combating Bot Scraping
2025-04-17 / Read about 0 minute
Author:小编   

The Wikimedia Foundation has announced a strategic partnership with Kaggle, a Google-owned data science community platform, to release a beta dataset comprising 'Structured Wikipedia Content in English and French'. This initiative is designed to enhance AI model training datasets while simultaneously mitigating the risk of AI developers plagiarizing content from Wikipedia. Kaggle, renowned for hosting a myriad of machine learning datasets, plays a pivotal role in this endeavor.