Tech and AIDeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other...

DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

-


The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tianamen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.



Source link

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest news

Cardano Foundation Reveals 15% Bitcoin Holdings in $659M Asset Report

The Cardano Foundation just revealed it is holding almost $100 million in Bitcoin. About 15% of its $659...

Marc Andreessen reportedly told group chat that universities will ‘pay the price’ for DEI

Venture capitalist Marc Andreessen sharply criticized universities including Stanford and MIT, along with the National Science Foundation, in...

Q3 Bitcoin Mining Map Exposes Silent Surge in Russia, China, While US Dips Slightly

As the second quarter of 2025 wound down, the U.S. held onto its top spot for the highest...

Bitcoin’s Record Quarter Met With Silence From Elite Media

Market intelligence firm Bitcoin Perception reported that mainstream media coverage of Bitcoin in Q2 2025 remained deeply polarized....

Advertisement

XRP Whales Break Records as Price Jumps 20%—Santiment Flags ‘Very Positive Sign’

XRP surges to $2.8 as the number of wallets holding at least 1 million of the token increases...

Must read

Cardano Foundation Reveals 15% Bitcoin Holdings in $659M Asset Report

The Cardano Foundation just revealed it is holding...

Marc Andreessen reportedly told group chat that universities will ‘pay the price’ for DEI

Venture capitalist Marc Andreessen sharply criticized universities including...

You might also likeRELATED
Recommended to you