Skip down to main content

Introduction to Natural Language Processing for the Social Sciences

Key Information

Course details
Option course for MSc, Hilary Term
Assessment
Coursework submission
Reading list
View now
Tutor
Dr Scott A. Hale

About

Language is a rich source of information about people’s knowledge and opinions, and also about their goals, social identities, and social alliances. Large-scale analysis of language data is invaluable for research in the social sciences. In this option/elective, students will learn to use statistical natural language processing tools that will enable them to analyse the content of documents in order to address a wide variety of different questions.

This course will develop conceptual and technical tools for large-scale analysis of linguistic data such as document collections, transcripts, and blogs. The first part of the course introduces the statistical structure of the lexicon and models for text creation, including the baseline Naïve Bag of Words model as well as more realistic models that include effects of social and pragmatic context. Then, we turn to algorithms for clustering, classifying, and discriminating different types of documents on the basis of the words and word sequences that they contain.  These are applied to characterise the topics of different documents as well as the socio-indexical traits of speakers/authors. Lastly, we bring these ideas together in tools for analysing the spread of memes and opinions through repeated interactions in linguistic communities.

Learning Outcomes

At the end of the course, students will:

  • Understand the basic statistical properties of words and word sequences in natural language, and the typical challenges they present.
  • Learn how topical and social context affect word frequencies at different time scales.
  • Learn methods for document clustering and classification.
  • Acquire tools for analysing lexical dynamics to answer questions about conceptual and social trends.
Privacy Overview
Oxford Internet Institute

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies
  • moove_gdrp_popup -  a cookie that saves your preferences for cookie settings. Without this cookie, the screen offering you cookie options will appear on every page you visit.

This cookie remains on your computer for 365 days, but you can adjust your preferences at any time by clicking on the "Cookie settings" link in the website footer.

Please note that if you visit the Oxford University website, any cookies you accept there will appear on our site here too, this being a subdomain. To control them, you must change your cookie preferences on the main University website.

Google Analytics

This website uses Google Tags and Google Analytics to collect anonymised information such as the number of visitors to the site, and the most popular pages. Keeping these cookies enabled helps the OII improve our website.

Enabling this option will allow cookies from:

  • Google Analytics - tracking visits to the ox.ac.uk and oii.ox.ac.uk domains

These cookies will remain on your website for 365 days, but you can edit your cookie preferences at any time via the "Cookie Settings" button in the website footer.