AI models can acquire backdoors from surprisingly few malicious documents Benj Edwards

AI models can acquire backdoors from surprisingly few malicious documents Benj Edwards | usagoldmines.com

Scraping the open web for AI training data can have its drawbacks. On Thursday, researchers from Anthropic, the UK AI Security Institute, and the Alan Turing Institute released a preprint research paper suggesting that large language models like the ones that power ChatGPT, Gemini, and Claude can develop backdoor vulnerabilities from as few as 250 corrupted documents inserted into their training data.

That means someone tucking certain documents away inside training data could potentially manipulate how the LLM responds to prompts, although the finding comes with significant caveats.

The research involved training AI language models ranging from 600 million to 13 billion parameters on datasets scaled appropriately for their size. Despite larger models processing over 20 times more total training data, all models learned the same backdoor behavior after encountering roughly the same small number of malicious examples.

Read full article

Comments

This articles is written by : Nermeen Nabil Khear Abdelmalak

You can Enjoy surfing our website categories and read more content in many fields you may like .

Why USAGoldMines ?

USAGoldMines is a comprehensive website offering the latest in financial, crypto, and technical news. With specialized sections for each category, it provides readers with up-to-date market insights, investment trends, and technological advancements, making it a valuable resource for investors and enthusiasts in the fast-paced financial world.

Breaking

AI models can acquire backdoors from surprisingly few malicious documents Benj Edwards | usagoldmines.com

By USA Goldmines

You Missed

Crypto News | Why is Cardano (ADA) Up 15% in a Week? Dimitar Dzhondzhorov | usagoldmines.com

Bitcoin price prediction 2026-2032: Will BTC hit $150k soon? Shayan Chowdhury | usagoldmines.com

Why even buy a Steam Machine? Check out our own DIY builds for $1,050 | usagoldmines.com

Fubo quietly raises prices. Is it still worth considering over YouTube TV? | usagoldmines.com

AI models can acquire backdoors from surprisingly few malicious documents Benj Edwards | usagoldmines.com

By USA Goldmines

Related Posts

Why even buy a Steam Machine? Check out our own DIY builds for $1,050 | usagoldmines.com

The best PC games of 2026 (that don’t need a graphics card) | usagoldmines.com

Fubo quietly raises prices. Is it still worth considering over YouTube TV? | usagoldmines.com

You Missed

Crypto News | Why is Cardano (ADA) Up 15% in a Week? Dimitar Dzhondzhorov | usagoldmines.com

Bitcoin price prediction 2026-2032: Will BTC hit $150k soon? Shayan Chowdhury | usagoldmines.com

Why even buy a Steam Machine? Check out our own DIY builds for $1,050 | usagoldmines.com

Fubo quietly raises prices. Is it still worth considering over YouTube TV? | usagoldmines.com