Skip down to main content

Using Large Language Models for Research Evaluation in the REF

With Prof Mike Thelwall
Recorded:
31 Jan 2025
Speakers:
With Prof Mike Thelwall

Evaluating the quality of research outputs is time consuming and difficult, not just for the 1000+ REF evaluators but also for internal departmental REF selection procedures. In REF2021, 11 out of 34 Units of Assessment consulted citation-based indicators to support their decisions and they may also have been used by departments selecting their best outputs. Nevertheless, it has recently emerged that ChatGPT can give stronger evidence than citation-based indicators in most fields. This has created the possibility to extend the use of indicators to support research quality judgements to more fields and perhaps also in a slightly stronger role. This talk will describe the results of experiments that have shown this capability. Frustratingly from a qualitative perspective, despite statistically significant score predictions by ChatGPT and plausible reports about rigour, originality and significance, its evaluations seem to be too imprecise to provide strong evidence of its predicted scores. It is not clear whether this is a large language model limitation or a more fundamental property of academic expert review. ChatGPT’s other quirks include a liking for post-humanism, being unimpressed with articles in The Lancet, and being willing to hallucinate detailed evaluations of non-existent papers.

Mike Thelwall is a social scientist and professor of data science at the University of Sheffield, specialising in research evaluation. He has advised the REF on the selection and use of citation-based indicators and is consulting about a potential transition to large language models. He is a senior associate editor of the Journal of the Association for Information Science and Technology, and his current book, Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence, is free at https://arxiv.org/abs/2407.00135.

Speaker

Mike Thelwall

Prof Mike Thelwall

social scientist and professor of data science , University of Sheffield

Mike specialises in research evaluation. He has advised the REF on the selection and use of citation-based indicators and is consulting about a potential transition to LLMs.

Privacy Overview
Oxford Internet Institute

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies
  • moove_gdrp_popup -  a cookie that saves your preferences for cookie settings. Without this cookie, the screen offering you cookie options will appear on every page you visit.

This cookie remains on your computer for 365 days, but you can adjust your preferences at any time by clicking on the "Cookie settings" link in the website footer.

Please note that if you visit the Oxford University website, any cookies you accept there will appear on our site here too, this being a subdomain. To control them, you must change your cookie preferences on the main University website.

Google Analytics

This website uses Google Tags and Google Analytics to collect anonymised information such as the number of visitors to the site, and the most popular pages. Keeping these cookies enabled helps the OII improve our website.

Enabling this option will allow cookies from:

  • Google Analytics - tracking visits to the ox.ac.uk and oii.ox.ac.uk domains

These cookies will remain on your website for 365 days, but you can edit your cookie preferences at any time via the "Cookie Settings" button in the website footer.