Skip down to main content

Accessing Research Data from the Social Web

Key Information

Course details
Digital Social Research Option Paper Group B; Hilary Term
Reading list
View now
Dr Luc Rocher


In the space of a few years data sourced from the “social web” (platforms such as Twitter, Facebook and Google) has come to occupy an important place in social science research. The astonishing uptake of social platforms around the world, and the relatively open, structured nature of the data they produce, offers the tantalising possibility of studying a wide variety of different social research questions in a way that previously would have been impossible.

Already a number of fascinating studies have used data from such platforms to shed new light on diverse concepts such as information diffusion, political behaviour and mobilisation in social movements. However, the appearance of such data sources creates a challenge for the social sciences. If researchers are to be able to access and manipulate it, they will need to add elementary computer programming skills to their methodological toolkit, which are often overlooked in traditional undergraduate and master’s courses. This four week methods course aims to address this need.

Those following the course will learn the essentials of programming in Python, the language of choice in the growing field of computational social science. In particular, the focus will be on using the language to access data from a diverse variety of sources on the social web (from Twitter and Facebook to Reddit, Youtube or the front page of the New York Times), and to transform this material into datasets which are amenable to traditional social science analysis.


By the end of the course, students will:

  • Have knowledge of the range of social science research taking place using data from the social web;
  • Have a grasp of key methodological issues involved in the collection and processing of these types of data, and the key challenges in using social data to answer social research questions;
  • Have a solid grounding in the use of the Python programming language for social research.

Past Projects

Students taking this course have looked at the diffusion of hashtags on Instagram, the use of Twitter for prediction, spread of discourses on social media and the language behind political satire.