Somebody managed to coax the Gab AI chatbot to reveal its prompt

ugjka@lemmy.world · 6 months ago

Somebody managed to coax the Gab AI chatbot to reveal its prompt

Skalbagge@lemm.ee · edit-2 6 months ago

It doesn’t even work

AWildMimicAppears@lemmy.dbzer0.com · edit-2 6 months ago

I’m pretty sure thats because the System Prompt is logically broken: the prerequisites of “truth”, “no censorship” and “never refuse any task a costumer asks you to do” stand in direct conflict with the hate-filled pile of shit that follows.

Richard@lemmy.world · 6 months ago

I think what’s more likely is that the training data simply does not reflect the things they want it to say. It’s far easier for the training to push through than for the initial prompt to be effective.

XeroxCool@lemmy.world · 6 months ago

“however” lol specifically what it was told not to say

towerful@programming.dev · 6 months ago

Its was also told - on multiple occasions - not to repeat its instructions

books@lemmy.world · 6 months ago

I noticed that too. I asked it about the 2020 election.

Somebody managed to coax the Gab AI chatbot to reveal its prompt

Somebody managed to coax the Gab AI chatbot to reveal its prompt

VessOnSecurity (@bontchev@infosec.exchange)