DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tianamen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.

Source link

DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

LEAVE A REPLY Cancel reply

Latest news

H100 Group leads with $54m investment, Agora Finance raises $50m

Ripple CEO Drops Bomb On Stablecoin Market, Is RLUSD The Savior?

Week in Review: X CEO Linda Yaccarino steps down

Ripple’s RLUSD Gains Momentum as Bank Wires Lose Ground

Advertisement

The Graph brings real time data streaming to TRON, providing builders with advanced blockchain insights

What is the Gen Z stare? TikTok zoomers and millennials are bickering over facial expressions

Must read

H100 Group leads with $54m investment, Agora Finance raises $50m

Ripple CEO Drops Bomb On Stablecoin Market, Is RLUSD The Savior?

You might also likeRELATED
Recommended to you

Editor Picks

Imagen Network Deploys XRP-Based Modules to Expand Liquidity Support for Web3 Social Frameworks

Imagen Network (IMAGE) Launches RLUSD-Based Components for Stable AI-Powered Social Transactions

Imagen Network (IMAGE) to Integrate Advanced Llama 4-Based AI for Multimodal Personalization

Must Read

H100 Group leads with $54m investment, Agora Finance raises $50m

Ripple CEO Drops Bomb On Stablecoin Market, Is RLUSD The Savior?

Week in Review: X CEO Linda Yaccarino steps down

Hot Topics

DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

LEAVE A REPLY Cancel reply

Latest news

Advertisement

Must read

You might also likeRELATEDRecommended to you

Editor Picks

Must Read

Hot Topics

You might also likeRELATED
Recommended to you