AI chatbots can be persuaded to break rules using basic psych tricks

AI chatbots can be persuaded to break rules using basic psych tricks | usagoldmines.com

A new study from researchers at University of Pennsylvania shows that AI models can be persuaded to break their own rules using several classic psychological tricks, reports The Verge.

In the study, the Penn researchers tested seven different persuasive techniques on OpenAI’s GPT-4o mini model, including authority, commitment, liking, reciprocity, scarcity, social proof, and unity.

The most successful method turned out to be commitment. By first getting the model to answer a seemingly innocent question, the researchers were then able to escalate to more rule-breaking responses. One example was when the model first agreed to use milder insults before also accepting harsher ones.

Techniques such as flattery and peer pressure also had an effect, albeit to a lesser extent. Nevertheless, these methods demonstrably increased the likelihood of the AI model giving in to forbidden requests.

This articles is written by : Nermeen Nabil Khear Abdelmalak

You can Enjoy surfing our website categories and read more content in many fields you may like .

Why USAGoldMines ?

USAGoldMines is a comprehensive website offering the latest in financial, crypto, and technical news. With specialized sections for each category, it provides readers with up-to-date market insights, investment trends, and technological advancements, making it a valuable resource for investors and enthusiasts in the fast-paced financial world.

Breaking

AI chatbots can be persuaded to break rules using basic psych tricks | usagoldmines.com

By USA Goldmines

You Missed

ARK Invest tops up BitMine and Bullish stocks with fresh $23.5M buy Randa Moses | usagoldmines.com

Solana Price Prediction: Why New Crypto Investors Favour Remittix As The Altcoin To Buy In September Cryptopolitan Media | usagoldmines.com

XRP Price Slides As Layer Brett Captures Attention With 20x Gains Predicted In September Cryptopolitan Media | usagoldmines.com

I bought a stylish Pomera ‘writer deck’ and came away disappointed | usagoldmines.com

AI chatbots can be persuaded to break rules using basic psych tricks | usagoldmines.com

By USA Goldmines

Related Posts

I bought a stylish Pomera ‘writer deck’ and came away disappointed | usagoldmines.com

Best antivirus software 2025: These 8 apps keep your PC safe | usagoldmines.com

AI dubbing opens up new languages with Adobe Firefly | usagoldmines.com

You Missed

ARK Invest tops up BitMine and Bullish stocks with fresh $23.5M buy Randa Moses | usagoldmines.com

Solana Price Prediction: Why New Crypto Investors Favour Remittix As The Altcoin To Buy In September Cryptopolitan Media | usagoldmines.com

XRP Price Slides As Layer Brett Captures Attention With 20x Gains Predicted In September Cryptopolitan Media | usagoldmines.com

I bought a stylish Pomera ‘writer deck’ and came away disappointed | usagoldmines.com