• Skip to main navigation
  • Skip to main content
  • Skip to footer
  • Home
  • About
  • Further Reading
  • Skip to menu toggle button

[[WM:TECHBLOG]]

Open Source for Open Knowledge

Tag: MediaWiki

Purification of mercury by distillation

From hell to HTML: releasing a Python package to easily work with Wikimedia HTML dumps

Announcing mwparserfromhtml, a new library that makes it easy to parse the HTML content of Wikipedia articles
Continue reading “From hell to HTML: releasing a Python package to easily work with Wikimedia HTML dumps”…
Posted on: February 24, 2023 Last updated on: February 24, 2023 Comments: 0 Martin Gerlach, Isaac Johnson and Nazia Tasnim
Long exposure of highway with downtown skyscraper in the background of an unidentified city. There are green grass hills on both sides and a bright moon in the sky.

Perf Matters at Wikipedia in 2016

Looking back at our ups and downs. 
Continue reading “Perf Matters at Wikipedia in 2016”…
Posted on: December 8, 2022 Last updated on: March 16, 2023 Timo Tijhof

Web Perf Hero: Valentín Gutierrez

Today we celebrate two numbers: 25% lower latency for ATS backend requests at the p75, and up to 1000X reduction of ATS disk read latency at the p999.
Continue reading “Web Perf Hero: Valentín Gutierrez”…
Posted on: November 21, 2022 Last updated on: March 16, 2023 Timo Tijhof

What it takes to parse MediaWiki page titles… in Rust

MediaWiki page titles are the primary identifiers for all wiki content – learn how they are validated, normalized and parsed and what it took to do so in Rust.
Continue reading “What it takes to parse MediaWiki page titles… in Rust”…
Posted on: April 26, 2022 Last updated on: April 27, 2022 Comments: 0 Kunal Mehta
https://commons.wikimedia.org/wiki/File:Wikimedia_Foundation_Servers_2015-54.jpg

June 2021 Datacenter Switchover

In June 2021, most user traffic was switched from our primary Virginia datacenter to our secondary one in Texas. This post covers how the swtichover went and the issues that came up.
Continue reading “June 2021 Datacenter Switchover”…
Posted on: July 23, 2021 Last updated on: July 26, 2021 Comments: 1 Kunal Mehta

Alone together: Wikimedia Hackathon 2021

Wikimedia’s 2021 Hackathon brought participants from all over the globe together online. This post explored the event and its outcomes.
Continue reading “Alone together: Wikimedia Hackathon 2021”…
Posted on: June 30, 2021 Last updated on: August 6, 2021 Comments: 2 Joaquin Oltra Hernandez

Profiling PHP in production at scale

We built an efficient sampling profiler for PHP. It runs continually in production on live requests, and generates trace logs and flame graphs.
Continue reading “Profiling PHP in production at scale”…
Posted on: March 3, 2021 Last updated on: March 16, 2023 Timo Tijhof

2020: The Year in Vue

A look back at a year of experiences of using Vue within the Structured Data team.
Continue reading “2020: The Year in Vue”…
Posted on: January 22, 2021 Last updated on: November 23, 2021 Comments: 9 Eric Gardner and Anne Tomasevich

Web performance case study: Wikipedia page previews

Preview popups are common and requires careful scripting and styling; they can generate useful learning about performance as a reference for other front-end tasks.
Continue reading “Web performance case study: Wikipedia page previews”…
Posted on: November 23, 2020 Last updated on: March 16, 2023 Noam Rosenthal

Impact of using HTTP connection pooling for PHP applications at scale

This post explores the challenges of running PHP applications at a large scale and discusses the effect of using Envoy on MediaWiki applications.
Continue reading “Impact of using HTTP connection pooling for PHP applications at scale”…
Posted on: October 26, 2020 Last updated on: December 1, 2020 Comments: 1 Giuseppe Lavagetto
1 2 Next page »

Recent Posts

  • From hell to HTML: releasing a Python package to easily work with Wikimedia HTML dumps
  • Perf Matters at Wikipedia in 2016
  • How we’re building our Kubernetes pipeline in GitLab
  • Web Perf Hero: Valentín Gutierrez
  • HTTP/2 performance revisited

Archives

  • February 2023
  • December 2022
  • November 2022
  • October 2022
  • July 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021
  • August 2021
  • July 2021
  • June 2021
  • May 2021
  • March 2021
  • February 2021
  • January 2021
  • December 2020
  • November 2020
  • October 2020
  • September 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • December 2019
  • October 2019
  • September 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019
  • December 2018
  • October 2018
  • September 2018
  • June 2018
  • February 2018
  • January 2018
  • November 2017
  • July 2017
  • June 2017
  • December 2014

Categories

  • Architecture Team
  • Cloud Services
  • Community
  • Growth Team
  • Infastructure
  • Learning & How To
  • Machine Learning
  • Parsing
  • Performance Team
  • Release Engineering
  • Research & Analytics
  • Search Platform
  • Site Reliability Engineering
  • Uncategorized
  • Wikidata
  • Wikimedia Product
Wikimedia Foundation Logo

Privacy Policy | About

Wikipedia® and other Wikimedia project names and logos are registered trademarks of the Wikimedia Foundation, a non-profit organization.

Unless otherwise stated content is licensed under a CC BY-SA 4.0 international license.

Powered by WordPress.com VIP, Automattic Privacy Notice.

Learn more about the
Wikimedia Foundation

Follow us on Twitter @wikimediatech