Skip down to main content

Monitoring Online Hate Speech

Monitoring Online Hate Speech

Overview

This project seeks to develop methodologies and tools for the monitoring, analysis, and measurement of trends in online hate speech in different languages, countries, and contexts, with a particular focus on the emerging trends of hate speech regarding COVID-19. This involves the undertaking a multi-stage programme of research, development, and policy engagement.

Background Literature Review and Toolkit Development

The initial stage of literature review would look at the state of the art of knowledge on producers of hate speech and hateful content, and also current understanding of consumer dynamics (i.e. who is listening to hate speech and how they are affected). There would be a particular focus on exploring the current state knowledge regarding the circulation of online hate speech related to the COVID-19 pandemic.

This stage would also place a major focus on computational methods for detecting and classifying pieces of text as ‘hateful’, particularly in multi-lingual and multi-country contexts. We would comprehensively review this work, identifying both methods and techniques and also open-source libraries and existing platforms. We would also set out what is known about the accuracy of these systems and address generic means of understanding accuracy in such classification systems.

Finally, we would look at what is known about detecting prevalence on social platforms, and tackle issues such as sampling strategy, as well as summarising the latest measurements that have been released about the prevalence of hateful issues and content. Sampling on social media is a complex challenge, as social platforms tend to divide themselves into a small number of highly used groups and pages and a long tail of smaller ones

Digital Platform Development

This activity, which is the core of the project, encompasses the development of a prototype digital platform which would flexibly allow for the measurement of trends in online hate speech in different languages, countries and contexts. Such a tool could be tuned to different local contexts and environments using local political and linguistic knowledge, and be used to serve up rapid reports on trends in online hate speech.

The core of the tool would be a multi-platform monitoring system which would systematically query the Application Programming Interfaces [APIs] of major social platforms (Facebook, Twitter, Instagram, Youtube and Reddit) for relevant social media data. This tool will incorporate a keyword search-based capacity; keywords would be defined by country or region of interest, but could, for example, contain political or religious topics, or other identity based topics which are fertile grounds for hate speech, as well as coronavirus-related keywords.

By querying for content relating to these topics over an extended period of time, we can build up a dataset of political, religious, coronavirus or other content relating to the country or region of interest. Then, a limited amount of manual content analysis can be applied to determine the level of hate speech around the topic of interest. This will be combined with large group search capacity and the monitoring of media outlets known for promoting hateful or divisive content. The output of this strand of work will be a tool that, with some input from a country expert, can be tuned to detect hate speech in different regions or countries of interest.

Key Information

Funder:
  • UNESCO
  • Project dates:
    July 2021 - December 2021

    Related Topics:

    Privacy Overview
    Oxford Internet Institute

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

    Strictly Necessary Cookies
    • moove_gdrp_popup -  a cookie that saves your preferences for cookie settings. Without this cookie, the screen offering you cookie options will appear on every page you visit.

    This cookie remains on your computer for 365 days, but you can adjust your preferences at any time by clicking on the "Cookie settings" link in the website footer.

    Please note that if you visit the Oxford University website, any cookies you accept there will appear on our site here too, this being a subdomain. To control them, you must change your cookie preferences on the main University website.

    Google Analytics

    This website uses Google Tags and Google Analytics to collect anonymised information such as the number of visitors to the site, and the most popular pages. Keeping these cookies enabled helps the OII improve our website.

    Enabling this option will allow cookies from:

    • Google Analytics - tracking visits to the ox.ac.uk and oii.ox.ac.uk domains

    These cookies will remain on your website for 365 days, but you can edit your cookie preferences at any time via the "Cookie Settings" button in the website footer.