I've found Llama-2 to be unusably "safety filtered" for creative work: https://i...

a2128 · on July 23, 2023

I personally found it to be so "safety filtered" to the point that it's actually done a 180 and can become hateful or perpetuate negative stereotypes in the name of "safety" - see here https://i.imgur.com/xkzXrPK.png and https://i.imgur.com/3HQ8FqL.png

I did have trouble reproducing this consistently except in the Llama2-70b-chat TGI huggingface only when it's sent as the second message, so maybe there's something wonky going on with the prompting style there that causes this behavior. I haven't been able to get the model running myself for further investigation yet.

LoganDark · on July 23, 2023

Does this reproduce on the non-RLHF models (the non-chat ones)?

kromem · on July 23, 2023

Don't use instruct/chat models when the pretrained is available.

Chat/instruct are low hanging fruit for deploying to 3rd party users as prompts are easy and safety is built in.

But they suck compared to the pretrained models for direct usage. Like really, really suck.

Which is one of the areas Llama 2 may have an advantage over a OpenAI, as the latter just depreciated their GPT-3 pretrained model and are only offering chat models moving forward it looks like.

immibis · on July 24, 2023

Sounds like AI Dungeon 2 is finally going to breathe its last breath. It relies on non-chat models by design.

Jorge1o1 · on July 23, 2023

Imagine, Casca and Brutus don't stab Caesar. Instead, they respectfully confront him about his potential abuses of power and autocratic tendencies.

foota · on July 23, 2023

Did anyone try this though? Just curious.

oh_sigh · on July 24, 2023

Yes, that was Cato's whole shtick. Never really worked though.

Kuinox · on July 23, 2023

It's Llama-2 chat that is too much filtered, not "llama-2"

cultofmetatron · on July 23, 2023

we need to kick the "ethical AI" people out. Its becoming increasingly clear they are damn annoying. I don't want safety scissors. restrict things running on your own servers, sure but don't give me a model I can't modify and use how i want on my machine.

sanxiyn · on July 24, 2023

If you want an unrestricted model, you should train one yourself. You don't want safety scissors, alas, we can't have all things we want, can we. Facebook is under no obligation to provide you one, after all it's Facebook's money, not yours.

int_19h · on July 27, 2023

Facebook does provide an unrestricted base model for Llama-2.

standyro · on July 24, 2023

more importantly, where were these data ethicists for the past ten years where most of the tech industry built a global data hoover machine for adtech and social media...

and now that some tech is actually creatively useful to individuals, they want to neuter it.

jeffhuys · on July 24, 2023

But people will create bombs, like they don't do now.