I think improved filtering for jailbreaks is very unlikely to correspond to the ...

nwienert · on April 14, 2023

In fact, the more safeguards, the dumber the model gets, as they published.

Which is very interesting. You already have a model that consumes nearly the entire internet with almost no standards or discernment, whereas a smart human is incredibly discerning with information (I’m sure you know what % of the internet content you read is actually high quality, and how even in the high quality parts it’s still incredibly tricky to figure out the good stuff - not to mention that half the good stuff is actually buried in low quality pools). But then you layer in political correctness and dramatically limit the usefulness.