We’ve written a fair amount about the geographic and linguistic clusters of Wikipedia authors but were reminded today (via New York Times “Room for Debate” forum“) that there are plenty of other clusters along social and economic dimensions. Last year a survey of Wikipedia users was conducted which highlights some interesting fissures within the user group.

One of the most provocative findings (and the one highlighted by the New York Times forum) is that less than 15 percent of the regular contributors to Wikipedia are women. This really grabs one’s attention but a closer look at the data report (see also here and here) makes us wonder if this figure accurately reflects the Wikipedia community. Some of the questions are:

  • What was the sampling method used? Nothing is listed in the reports.
  • What is the bias in the sample? For example, Russia and Russian speakers are the largest language and country groups represented in the survey even though the Russian section of Wikipedia is only the 8th largest linguistic group. (English, German, French, Italian, Polish, Japanese and Spanish are all larger).
  • Did women have a lower participation rate then men in the survey? There were three times as many male respondents as female respondents. Does this accurately reflect the makeup of the Wikipedia audience? Given the unexpected results for language and country, it is not clear if there might be gender bias as well.

All this said, we find the question of an imbalance in gender participation very intriguing and important. We just don’t know if the survey methods used are such that we can be confident in the magnitude of the highlighted differences. Anyone who can shed some light on this would be more than welcome to comment.


Note: This post was originally published on the Floating Sheep blog on . It might have been updated since then in its original location. The post gives the views of the author(s), and not necessarily the position of the Oxford Internet Institute.