Skip to content
world

ArXiv Will Ban Researchers Who Upload AI Slop Papers

ArXiv, one of the world's most influential academic preprint platforms, is cracking down on researchers who upload papers riddled with unchecked AI-generated content. Starting now, authors caught submitting work with hallucinated references or unreviewed LLM output face a one-year ban from the platform.

·ottown·3 min read
ArXiv Will Ban Researchers Who Upload AI Slop Papers
152

The Academic World Has an AI Slop Problem

If you've spent any time browsing academic preprints lately, you've probably noticed something off — papers that read smoothly but cite studies that don't exist, or contain stray phrases like "As an AI language model, I cannot..." buried mid-paragraph. It's a growing problem, and ArXiv, the go-to repository for cutting-edge research in physics, computer science, mathematics, and beyond, has finally had enough.

The platform is now threatening year-long bans for researchers who submit papers containing what officials are calling "AI slop" — work that shows clear evidence the authors never bothered to review what their AI tools actually produced.

What Counts as a Violation

Thomas Dietterich, ArXiv's section chair for computer science, laid out the specifics in a post on X. The ban applies when there is "incontrovertible evidence that the authors did not check the results of LLM generation" — the clearest examples being hallucinated citations (references to papers that simply don't exist) and LLM meta-comments left intact in the final submission (think: "Sure, here's a draft of your introduction...").

In other words, ArXiv isn't banning AI assistance outright. Researchers can still use large language models to help draft, edit, or structure their work. What they can't do is treat AI output as a finished product and hit submit without reading it.

The Consequences

Researchers who violate the new policy will be banned from ArXiv for one year. That's a significant penalty in academic circles — ArXiv is often where breakthroughs are published first, sometimes months or years before formal peer review. Being locked out means losing access to one of the fastest and most widely-read distribution channels in science.

Beyond the ban, future submissions from offending researchers will need to have been accepted at "a reputable peer-reviewed venue" before ArXiv will host them. That's a substantially higher bar than the current system, where most papers are posted with minimal vetting.

Why This Matters

The move reflects a broader reckoning in academia over how to handle generative AI. Journals like Nature and Science have already issued their own AI disclosure policies, but enforcement has been inconsistent. ArXiv's approach is more direct: demonstrate you didn't review your AI's work, and you lose access.

Critics of the policy might argue that hallucinated references can slip through even when authors are being diligent — AI tools are notoriously unreliable when it comes to citation accuracy. But the more egregious cases Dietterich is targeting — where LLM commentary is literally left in the text — are hard to defend as anything other than negligence.

A Line in the Sand

For the global research community, ArXiv's ban signals that the honeymoon period for unchecked AI use in academic writing is over. The platform hosts over two million papers and receives thousands of new submissions every day. Even if only a fraction of those contain AI slop, the cumulative effect on the quality and reliability of the scientific record is real.

As AI writing tools become cheaper and more capable, expect other major preprint servers and journals to follow ArXiv's lead. The message is simple: use AI if you want, but own what you publish.

Source: The Verge

Stay in the know, Ottawa

Get the best local news, new restaurant openings, events, and hidden gems delivered to your inbox every week.