Skip down to main content

Computational Methods for the Social Sciences

Key Information

Course details
Methods Option course for MSc, Hilary Term
Assessment
Coursework submission
Reading list
View now
Tutor
Dr Fabian Stephany

About

Social science research increasingly rely on computational methods to study online culture, communities, and human behaviours, drawing from larger and larger amounts of data available online. The uptake of platforms (such as Twitter, Facebook and Google) and public websites around the world—and the relatively open, structured nature of the data they produce—now allows researchers to study many new social research questions.

However, the way these messy data are structured, disseminated, and available creates challenge for the social sciences. This course teaches how to collect and wrangle structured and unstructured data from social websites and platforms. In particular, the focus will be on using Python to access data from a diverse variety of sources on the social web (from Twitter to Reddit, from Wikipedia to the front page of the New York Times), and transforming this material into datasets which are amenable to traditional social science analysis. Once the data has been collected, the course familiarizes students with the variety of approaches for processing and preparing data for analysis.

Learning objectives

  • Have a grasp of key methodological issues involved in the collection and processing of these types of data, and the key challenges in using online social data to answer social research questions
  • Have a solid grounding in the use of the Python programming language to access and pre-process social data
  • Be able to parse text files in a manner suitable for Natural Language Processing
  • Be able to reshape JSON data such as tweets and other API-based data into rectangular structures amenable to analysis using data frames.
  • Understand how to apply regular expressions to string text
  • Appreciate, understand, and tame Unicode data such as © and TM
  • Be able to transfer data between R and Python
Privacy Overview
Oxford Internet Institute

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies
  • moove_gdrp_popup -  a cookie that saves your preferences for cookie settings. Without this cookie, the screen offering you cookie options will appear on every page you visit.

This cookie remains on your computer for 365 days, but you can adjust your preferences at any time by clicking on the "Cookie settings" link in the website footer.

Please note that if you visit the Oxford University website, any cookies you accept there will appear on our site here too, this being a subdomain. To control them, you must change your cookie preferences on the main University website.

Google Analytics

This website uses Google Tags and Google Analytics to collect anonymised information such as the number of visitors to the site, and the most popular pages. Keeping these cookies enabled helps the OII improve our website.

Enabling this option will allow cookies from:

  • Google Analytics - tracking visits to the ox.ac.uk and oii.ox.ac.uk domains

These cookies will remain on your website for 365 days, but you can edit your cookie preferences at any time via the "Cookie Settings" button in the website footer.