Big UK Domain Data for the Arts and Humanities

Overview

Background

Web archives are an increasingly important resource for arts and humanities researchers, yet we have neither the expertise nor the tools to use them effectively. Both the data itself and the process of collection are poorly understood, and it is possible to draw only the broadest of conclusions from current analysis. The Big UK Domain Data for the Arts and Humanities project will work with the dataset derived from the UK domain crawl from 1996 to 2013 (the year in which legal deposit legislation was extended to cover digital materials), totalling approximately 65 terabytes and constituting many billions of words. For the arts and humanities, this is very big data indeed.

Objectives

A key objective of the project will be to develop a theoretical and methodological framework within which to study this data, which will be applicable to the much larger on-going UK domain crawl, as well as in other national contexts. Researchers will work with developers at the British Library to co-produce tools which will support their requirements, testing different methods and approaches.

A major study of the history of UK web space from 1996 to 2013, including language, file formats, the development of multimedia content, shifts in power and access, and so on, will be complemented by a series of sub-projects from a range of disciplines, for example contemporary history, literature, gender studies and material culture.

Project outputs will include a suite of tools associated with the 1996-2013 dataset; a series of case studies produced by the sub-projects; an online training course for arts and humanities researchers; peer-reviewed journal articles; and a monograph on the history of the UK web during this period.

Collaborators and funding

Big UK Domain Data for the Arts and Humanities is a collaboration between the British Library, the Institute of Historical Research, University of London, the Oxford Internet Institute and Aarhus University. It is funded by the Arts and Humanities Research Council as part of its Digital Transformations in the Arts and Humanities theme.

Key Information

Funder:
  • Arts and Humanities Research Council (AHRC)

Project dates:
  • January 2014 - March 2015