IOP Publishing's discovery that researchers are split down the middle on the merits of using AI in peer review is not surprising given the complexity of the issue.
The publisher's August survey of just under 350 physics researchers found that 41 per cent were positive about the use of AI in peer review and 37 per cent were negative.
The case for using AI is obvious. With more than 30,000 journals in existence, the process is highly human-intensive. To illustrate the point: if the average journal published 50 papers per year, with each submitted paper reviewed by two independent referees and each review requiring four hours of effort, that translates into 12 million hours of peer-review work annually.
Moreover, that figure does not include the work of the editorial boards that oversee the review process or the editorial staff who process the papers, nor the time editors spend finding suitable reviewers willing to take on a manuscript. It also ignores the fact that many rejected papers are resubmitted, and re-reviewed, elsewhere, creating a multiplier effect that may increase the annual reviewing burden by a factor of five or more.
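The arithmetic above can be checked with a few lines of code. All the inputs are the article's illustrative assumptions, not measured data:

```python
# Back-of-envelope estimate of the annual peer-review burden, using the
# article's illustrative figures (every value here is an assumption).
journals = 30_000          # rough count of scholarly journals in existence
papers_per_journal = 50    # average papers published per journal per year
reviewers_per_paper = 2    # independent referees per submission
hours_per_review = 4       # effort per review, in hours

annual_hours = journals * papers_per_journal * reviewers_per_paper * hours_per_review
print(f"{annual_hours:,} hours per year")  # 12,000,000 hours per year

# Many rejected papers are resubmitted and re-reviewed elsewhere; a
# multiplier of five would push the total toward 60 million hours.
print(f"{annual_hours * 5:,} hours including resubmissions")
```

Even before counting editorial labour, the base figure alone is equivalent to roughly 6,000 people working full-time on nothing but refereeing.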
Replacing human referees with generative AI would therefore ease a reviewing burden that is commonly agreed to be verging on unsustainable.
Then there is the issue of speed. Many journals promise rapid review, yet speed must be balanced against the quality and depth of the reviews provided. With AI-generated reviews, time would no longer be an issue.
Furthermore, if AI peer reviewing became widely available, researchers could use it to evaluate their manuscripts before submitting to a journal. Incorporating such a system into preprint repositories would facilitate this, making peer review part of the research process itself by offering suggestions that shape the final product.
But, of course, there are also challenges to adopting AI. The purpose of peer review is to answer three questions about a manuscript. Is the research new? Are the results correct? And do they add intellectual value to a field or provide benefits within a discipline or beyond?
Most research incrementally builds on existing results, and AI systems are well suited to making such evaluations. They can respond to an informal checklist of measures that capture what a manuscript builds on and how well it has used the scientific method to achieve its objectives. This is no different from how a human peer reviewer would proceed.
However, the answers to the second and third questions are highly dependent on the field of study and the type of research conducted. Although the scientific method likely underpins most research and discovery in STEM disciplines and some of the social sciences, variations in theoretical, experimental and data-analytics research make a one-size-fits-all approach to AI peer review problematic.
In addition, if the research breaks new ground, delivering a quantum shift in thinking, such evaluations would be more difficult because the existing literature would provide no foundation against which to judge the new ideas.
Perhaps the most difficult role for AI would be assessing the third question, pertaining to the value and benefits of the research. Although such evaluation is highly subjective, it often provides the insight that is at the core of peer review's value.
Donald Trump's recent executive order calls for the adoption of "unbiased peer review" to improve the research process, including how research is disseminated and evaluated. That could be read as an implicit call for the adoption of AI peer review. But, of course, bias is always in the eye of the beholder. While Trump and his MAGA allies might see research on gender or climate change as being of little value, others will disagree. AIs are no more "unbiased" than humans in that sense, as Elon Musk's Grok AI has aptly demonstrated.
Moreover, AI systems must be trained with data, which itself may, depending on your opinion, be biased or contaminated with information that is demonstrably false. Although AI systems look smart, they do nothing more than regurgitate what they learned when trained. As the data-modelling adage goes: "garbage in, garbage out".
To return to the issue of recognising the value of groundbreaking research, it is possible that an AI's training data could inadvertently produce a "groupthink" assessment, uprating research that methodically builds on existing knowledge while failing to recognise the benefits of out-of-the-box ideas, potentially disincentivising research creativity.
In my view, AI is likely to fall short when it comes to assessing the value and significance of research. But we should not rely on hunches. To test this possible limitation, an AI peer-review process should be run in parallel with human peer review, with the human reviewers allowed to see the AI review only after they complete their own assessments. It may well turn out that humans and AIs agree with each other much more frequently in some fields than in others, with the former fields more suited to a switch to AI reviewing.
Ultimately, AI's most appropriate role might be to support human peer review rather than replace it, picking up the more perfunctory issues while the human reviewer connects the dots and has the final say. But we don't know. And the bottom line is that we must proceed with caution until we do.
Let's hold off on implementing AI peer review until, no pun intended, it can itself be peer-reviewed to ensure it meets the very standards that authors and editors rightly expect.
is founder professor in computer science at the University of Illinois Urbana-Champaign.