Wikimedia servers
This page is outdated, but if it were updated, it might still be useful. Please help by correcting, augmenting and revising the text into an up-to-date form.
Wikipedia and the other Wikimedia projects are run from several racks full of servers. See also Wikimedia's technical blog.
System architecture
Network topology
The network topology is described in the article Server layout diagrams.
Software
- Our DNS servers run gdnsd. We use geographical DNS to distribute requests among our four data centers (three in the US, one in Europe) depending on the client's location.
- We use Linux Virtual Server (LVS) on commodity servers to load balance incoming requests. LVS is also used as an internal load balancer to distribute MediaWiki requests. For back-end monitoring and failover, we have our own system called PyBal.
- For regular MediaWiki web requests (articles/API) we use Varnish caching proxy servers in front of Apache HTTP Server.
- All our servers run either Ubuntu Server or Debian.
- For distributed object storage we use Swift.
- Our main web application is MediaWiki, which is written in PHP (~70 %) and JavaScript (~30 %).[1]
- Our structured data is stored in MariaDB.[2] We group wikis into clusters, and each cluster is served by several MariaDB servers, replicated in a single-master configuration.
- We use Memcached for caching of database query and computation results.
- For full-text search we use Elasticsearch (Extension:CirrusSearch).
- https://noc.wikimedia.org/conf/ - Wikimedia configuration files.
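The geographical DNS step above can be sketched as a simple region-to-site lookup. This is an illustrative mapping only: gdnsd derives the answer from GeoIP data, and the real site lists live in the gdnsd configuration, so the regions and preferences below are assumptions for the example.

```python
# Hypothetical region -> edge-site preference table; the real mapping is
# maintained in the gdnsd GeoIP configuration, not hard-coded like this.
DATACENTERS = {
    "north-america": ["eqiad", "codfw", "ulsfo"],
    "europe": ["esams"],
    "east-asia": ["ulsfo"],
}
DEFAULT_SITE = "eqiad"  # fall back to the primary data center

def resolve_site(region):
    """Return the preferred edge site for a client's region."""
    sites = DATACENTERS.get(region)
    return sites[0] if sites else DEFAULT_SITE
```

A client resolved to "europe" would be directed to the esams cache cluster, while a region with no dedicated cache falls back to the primary data center.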
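The single-master MariaDB setup mentioned above implies that writes must go to one server while reads can be spread over replicas. The following is a minimal sketch of that routing decision; the hostnames are hypothetical, and the real cluster layouts live in the MediaWiki configuration rather than in code like this.

```python
import random

# Hypothetical hostnames for one wiki cluster; real layouts are defined
# in the MediaWiki/operations configuration, not hard-coded.
MASTER = "db-master.example"
REPLICAS = ["db-replica1.example", "db-replica2.example"]

WRITE_VERBS = {"INSERT", "UPDATE", "DELETE", "REPLACE", "CREATE", "ALTER", "DROP"}

def pick_server(sql):
    """Route writes to the single master; spread reads over replicas."""
    first_word = sql.lstrip().split(None, 1)[0].upper()
    if first_word in WRITE_VERBS:
        return MASTER
    return random.choice(REPLICAS)
```

In a single-master configuration this keeps the replication stream consistent: only one server accepts writes, and read replicas may lag slightly behind it.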
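Caching query and computation results in Memcached typically follows the cache-aside pattern: check the cache, and on a miss compute the value and store it with a TTL. The sketch below simulates this with an in-process stand-in for a Memcached client (the real service is accessed over the network via a client library); the key scheme and the database query are hypothetical.

```python
import time

class FakeMemcached:
    """In-process stand-in for a Memcached client, for illustration only."""

    def __init__(self):
        self._store = {}

    def get(self, key):
        value, expires = self._store.get(key, (None, 0.0))
        if value is not None and time.monotonic() < expires:
            return value
        return None

    def set(self, key, value, ttl=300):
        self._store[key] = (value, time.monotonic() + ttl)

def query_database(article_id):
    # Hypothetical expensive database query.
    return {"id": article_id, "title": f"Article {article_id}"}

cache = FakeMemcached()

def get_article(article_id):
    """Cache-aside: try the cache first, fall back to the database."""
    key = f"article:{article_id}"
    cached = cache.get(key)
    if cached is not None:
        return cached, "hit"
    row = query_database(article_id)
    cache.set(key, row, ttl=300)
    return row, "miss"
```

The first lookup misses and populates the cache; repeat lookups within the TTL are served from memory without touching the database.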
Hosting
- See also: wikitech:Clusters
As of October 2014, we have the following colocation facilities (each name combines an abbreviation of the facility's company with the code of a nearby airport):
- eqiad
- Equinix in Ashburn, Virginia.
- esams
- EvoSwitch in Amsterdam, the Netherlands (cache).
- knams
- Kennisnet in Amsterdam, the Netherlands (networking for the above).
- ulsfo
- United Layer in San Francisco (cache; USA West and East Asia).
- codfw
- CyrusOne in Carrollton, Texas.
The backend web and database servers are in Ashburn, with Carrollton set to handle emergency failover in the future. Carrollton was chosen for this role as a result of the 2013 Datacenter RfC. At EvoSwitch, we have a Varnish cache cluster and several miscellaneous servers. The Kennisnet location is now used only for network access and routing.
Ashburn (eqiad) became the primary data center in January 2013, taking over from Tampa (pmtpa and sdtpa), which had been the main data center since 2004. Around April 2014, sdtpa (Equinix, formerly Switch and Data, in Tampa, Florida; it provided networking for pmtpa) was shut down, followed by pmtpa (Hostway, formerly PowerMedium, in Tampa, Florida) in October 2014.
In the past we have had other caching locations, such as Seoul (yaseo, Yahoo!) and Paris (lopar, Lost Oasis); the reach target in the WMF 2010–2015 strategic plan includes "additional caching centers in key locations to manage increased traffic from Latin America, Asia and the Middle East, as well as to ensure reasonable and consistent load times no matter where a reader is located".
EvoSwitch and Kennisnet are recognised as benefactors for their in-kind donations. See the current list of benefactors.
A list of servers and their functions used to be available on the server roles page; no such list is currently maintained publicly (the private racktables tool may have one). It is, however, possible to see a compact table of all servers grouped by type on Icinga (click a group for a list of its servers, then click a name for that machine's details). The Puppet configuration also provides a fairly good reference for what software each server runs.
Status and problems
You can check one of the following sites to see whether the Wikimedia servers are overloaded, or simply to see how they are doing.
- Ganglia
- Grafana
- Icinga (temporarily restricted)
- Networking latency
- Gdash (being deprecated)
- http://status.wikimedia.org/ (up/down indicators are not to be trusted, although newer external availability metrics are based on Catchpoint data)
If you are seeing errors in real time, visit #wikimedia-tech on irc.freenode.net. Check the topic to see whether someone is already looking into the problem you are having. If not, please report your problem to the channel. It helps to report specific symptoms, including the exact text of any error messages, what you were doing right before the error, and, if you can tell, which server(s) are generating the error.
See also
More hardware info
- Technical FAQ - How about the hardware?
- Your donations at work: new servers for Wikipedia, by Brion Vibber, 02-12-2009
- wikitech:Clusters - technical and usually more up-to-date information on the Wikimedia clusters
Admin logs
- Server admin log - Documents server changes (especially software changes)
Offsite traffic pages
Long-term planning
Out of date information
Energy consumption
Useful information about other sites
- Evolution of LiveJournal systems:
- 04/2004 MySQLCon 2004 PDF/SXI
- 07/2004 OSCON 2004 PDF/SXI
- 11/2004 LISA 2004 PDF/SXI
- 04/2005 MySQLCon 2005 PDF/PPT/SXI
- journals to watch for system details: Brad (Fitzpatrick), lj_backend, lj_maintenance
- Google cluster architecture (PDF)
- MySQL User's Conference 2004 blog highlights
References
1. See MediaWiki analysis, MediaWiki WMF-supported extensions analysis.
2. "Wikipedia Adopts MariaDB". Wikimedia blog, Wikimedia Foundation, Inc. 2013-04-22. Retrieved 2014-07-20.