A digital illustration showing a scientist looking at a computer screen filled with glowing, floating citations—some fading into question marks—against a backdrop of a research lab and data streams.
A digital illustration showing a scientist looking at a computer screen filled with glowing, floating citations—some fading into question marks—against a backdrop of a research lab and data streams.

The rise of AI-invented citations shows a growing crack in research integrity, useful context for a colleague or student following science ethics.

AI Is Inventing Fake Science Citations Story flow and key facts

A new study by researchers from Cornell and UCLA has uncovered a growing problem in scientific publishing: AI-generated fake citations. Analyzing over 111 million references from 2.5 million papers, the team identified 146,900 citations that do not correspond to any real publication—hallucinations produced by large language models like ChatGPT and Gemini. These false references were found across four major scientific repositories: arXiv, bioRxiv, SSRN, and PubMed Central, which host early versions of research papers before formal peer review.

The study highlights a sharp increase in non-existent citations following the widespread adoption of AI tools in 2023. While some unmatched references were due to typos, many were entirely fabricated, suggesting researchers are using AI to draft citations without proper verification. This trend threatens the integrity of the scholarly record, which relies on accurate referencing for peer review and cumulative knowledge.

In response, arXiv has announced stricter policies, including banning authors who submit papers with hallucinated citations. Experts warn that unchecked AI-generated content adds noise to the scientific corpus, making it harder to identify valid research and potentially misdirecting future studies. The findings underscore the need for vigilance as AI becomes more embedded in academic workflows.

Facts

  • A study by Cornell and UCLA researchers found 146,900 AI-generated fake citations in scientific papers.
  • The fake citations were identified across four major repositories: arXiv, bioRxiv, SSRN, and PubMed Central.
  • The rise in hallucinated references correlates with widespread AI adoption starting in 2023.
  • arXiv has announced it will ban authors who submit papers with hallucinated or unchecked AI-generated content.
  • Unverified AI citations threaten trust in peer review and the foundation of cumulative scientific knowledge.

Canto visual news explainer. AI tools may assist production. Editorial policy