[H4ckN3ws]Researchers Reveal 'Deceptive Delight' Method to Jailbreak AI Models

By ThaHaka,
October 23 in World News

https://hopzone.eu/forums/topic/152589-h4ckn3wsresearchers-reveal-deceptive-delight-method-to-jailbreak-ai-models/

Followers 0

Trader Feedback

0 0 0

Total Rating 0%

Diamond Member

- Share

Posted October 23

This is the hidden content, please

Cybersecurity researchers have shed light on a new adversarial technique that could be used to jailbreak large language models (LLMs) during the course of an interactive conversation by sneaking in an undesirable instruction between benign ones. The approach has been codenamed Deceptive Delight by Palo Alto Networks Unit 42, which described it as both simple and effective, achieving an average

This is the hidden content, please

Link to comment

https://hopzone.eu/forums/topic/152589-h4ckn3wsresearchers-reveal-deceptive-delight-method-to-jailbreak-ai-models/

Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign in

Already have an account? Sign in here.

https://hopzone.eu/forums/topic/152589-h4ckn3wsresearchers-reveal-deceptive-delight-method-to-jailbreak-ai-models/

Followers 0

Go to topic listing

Most Contributions
1. Pelican Press
  115748
2. Steam
  38207
3. Editor
  11958
4. Kotaku
  3191
5. SpaceMan
  1449
Latest Awarded
- Guest: test
  November 13, 2021
- Guest: test
  November 13, 2021
- Guest: test
  November 13, 2021
Top Awarded
- Week
- Month
- Year
- All Time
Nobody has has been awarded this week.

Vote for the server

To vote for this server you must login.

Sign In

or

Sign Up
Recently Browsing 0 members
- No registered users viewing this page.

Sign In

Home

Activity

Store

All Servers

[H4ckN3ws]Researchers Reveal 'Deceptive Delight' Method to Jailbreak AI Models

Recommended Posts

ThaHaka 0

Trader Feedback

Link to comment

Share on other sites

Create an account or sign in to comment

Create an account

Sign in

Most Contributions

Latest Awarded

Top Awarded

Vote for the server

Recently Browsing 0 members

Important Information