From hell to HTML: releasing a Python package to easily work with Wikimedia HTML dumps Announcing mwparserfromhtml, a new library that makes it easy to parse the HTML content of Wikipedia articles Continue reading “From hell to HTML: releasing a Python package to easily work with Wikimedia HTML dumps”…