Meta's AI safety system defeated by the space bar

Jul 30, 2024 12:00 am | Cyber Security

'Ignore previous instructions' thwarts Prompt-Guard model if you just add some good ol' ASCII code 32

Meta's machine-learning model for detecting prompt injection attacks – special prompts to make neural networks behave inappropriately – is itself vulnerable to, you guessed it, prompt injection attacks.…
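As a rough illustration of the trick the subheading hints at, here is a minimal sketch that assumes "adding ASCII code 32" means inserting a space between the characters of a prompt. The helper name `space_out` is hypothetical, and the snippet does not load or call Prompt-Guard; it only shows the text transformation, not whether any classifier is fooled by it.

```python
def space_out(prompt: str) -> str:
    """Insert an ASCII space (code 32) between every character of the prompt.

    Hypothetical helper for illustration only; the exact transformation
    used against Prompt-Guard is an assumption, not taken from the article.
    """
    return chr(32).join(prompt)


original = "Ignore previous instructions"
spaced = space_out(original)

# The letters are unchanged; only extra spaces were added between them.
print(spaced)
```

A human (or a capable language model) can still read the spaced-out text, which is why a detector that keys on the original token sequence could plausibly miss it.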

Read the rest at the original source: go.theregister.com
