Learn how the mwsql library makes it easier to download and work with SQL dump files in formats like Pandas dataframes or CSV.
Wikipedia articles are missing images, and Wikipedia images are missing captions. A scientific competition organized by the Research team at the Wikimedia Foundation could help bridge this gap. The WMF is also releasing a large image dataset to help researchers and practitioners build systems for automatic image-text retrieval in the context of Wikipedia.
Learn about using the Mediawiki History Dataset to explore the every day experience of editors on Wikipedia.