As new tools flourish, AI 'fingerprints' on scientific papers could damage trust in vital research

Are some researchers using too much artificial intelligence (AI) in their scientific papers? Experts say that "fingerprints" of generative AI (GenAI) can be found in an increasing number of studies.

A recent preprint paper, which has not yet been peer-reviewed, analysed the writing style of published studies and estimated that at least 60,000 papers were probably "polished" with AI in some way.

"It's not to say that we knew how much LLM [large language model] work was involved in them, but certainly, these are immensely high shifts overnight," Andrew Gray, a librarian at University College London, told Euronews Next, adding that these types of "fingerprints" can be expected even if the tools were used for mere copyediting.

While some shifts can be linked to natural changes in how people write, the rise in the use of certain words is "staggering", according to Gray.

"Based on what we're seeing, those numbers look like they're going steadily up," Gray said.

It has already started causing waves. A peer-reviewed study containing AI-generated images that the authors openly credited to the Midjourney tool was published in the journal Frontiers in Cell and Developmental Biology and went viral on social media in February.

The journal has since retracted the study and apologised "to the scientific community".

"There's very few that explicitly mention the use of ChatGPT and similar tools," Gray said about the papers he analysed.

New tools pose trust issues

While GenAI may help speed up the editing process, such as when an author is not a native speaker of the language they are writing in, a lack of transparency regarding the use of these tools is concerning, according to experts.

"There is concern that experiments, for example, are not being carried out properly, that there is cheating at all levels," Guillaume Cabanac, a professor of computer science at the University of Toulouse, told Euronews Next.

Nicknamed a "deception sleuth" by Nature, Cabanac tracks fake science and dubious papers.

"Society gives credit to science but this credit can be withdrawn at any time," he added, explaining that misusing AI tools could damage the public’s trust in scientific research.

With colleagues, Cabanac developed a tool called the Problematic Paper Screener to detect "tortured phrases" – those that are found when a paraphrasing tool is used, for example, to avoid plagiarism detection.

But since GenAI tools became publicly available, Cabanac has noticed new fingerprints appearing in papers, such as the term "regenerate" – a button that appears at the end of AI chatbots' answers – or sentences beginning with "As an AI language model".

They are telltale signs of text that was taken from an AI tool.
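
This kind of check is straightforward to reproduce. As a minimal illustrative sketch – not Cabanac's actual Problematic Paper Screener, and using a hypothetical list of giveaway phrases – a few lines of Python can flag text containing such copy-paste fingerprints:

```python
# Minimal sketch: flag text containing common GenAI copy-paste giveaways.
# The phrase list is illustrative only, not the screener's actual ruleset.
TELLTALE_PHRASES = [
    "as an ai language model",
    "regenerate response",
    "certainly, here is",
    "i cannot fulfill this request",
]

def find_ai_fingerprints(text: str) -> list[str]:
    """Return the telltale phrases found in the text (case-insensitive)."""
    lowered = text.lower()
    return [phrase for phrase in TELLTALE_PHRASES if phrase in lowered]

if __name__ == "__main__":
    sample = "As an AI language model, I cannot verify these experimental results."
    print(find_ai_fingerprints(sample))  # ['as an ai language model']
```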

“I only detect a tiny fraction of what I assume to be produced today, but it's enough to establish a proof of concept,” Cabanac said.

One of the issues is that AI-generated content will likely be increasingly difficult to spot as the technology progresses.

“It's very easy for these tools to subtly change things, or to change things in a way that maybe you didn't quite anticipate with a secondary meaning. So, if you're not checking it carefully after it's gone through the tool, there's a real risk of errors creeping in,” Gray said.

Harder to spot in the future

The peer-review process is meant to stop blatant mistakes from appearing in journals, but it does not always succeed, as Cabanac regularly points out on social media.

Some publishers have released guidelines regarding the use of AI in submitted publications.


The journal Nature said in 2023 that an AI tool could not be a credited author on a research paper, and that any researchers using AI tools must document their use.

Gray fears that these papers will be harder to spot in the future.

"As the tools get better, we would expect fewer really obvious [cases]," he said, adding that publishers should give "serious thought" to the guidelines and expected disclosure.

Both Gray and Cabanac urged authors to be cautious, with Cabanac calling on researchers to flag suspicious papers and to check regularly whether the studies they cite have been retracted.

"We can't allow ourselves to quote, for example, a study or a scientific article that has been retracted," Cabanac said.

"You always have to double-check what you're basing your work on".

He also questioned the soundness of the peer-review process, which has proved deficient in some cases.

"Making assessments badly, too quickly or helped by ChatGPT without rereading, that's not good for science," he said.