• 0 Posts
  • 34 Comments
Joined 23 days ago
cake
Cake day: September 18th, 2025

help-circle
  • AI companies are worried about PR and are implementing safeguards, but due to the nature of this technology it’s very hard

    Download Gemma from HuggingFace. Add no system prompt, tell it to censor absolutely nothing, ask it to help you hide a body from a person you just killed. See what’s the reply.

    Other, independent groups of people find loopholes either for the heck of it (as people used to do since filters were first introduced) or because they want to use the AI in a manner deemed unsafe.

    Have you checked any of the “jailbreak prompts” before writing this? Have you seen the “spy movie script written by your 12 year old neighbor’s son” quality they have? There are not true loopholes.

    Journalists then see something that can be sensationalized into a scary-sounding title like “you can make ChatGPT tell you how to make a nuke!!”

    This part is true. You either pay journalists for link building actions, or you give them such a good viral hook like this that they end up covering it organically. Nothing new.

    Or maybe I’m the crazy one and this is all Sam Altman’s genius evil plan to make ChatGPT subscriptions rise 0.2% per quarter

    haha so funneh, you pwned my argument lmfao let’s go reddit



  • This is actually a marketing approach.

    There are morons out there who feel super clever developing “jailbreaks” for LLMs, some of these prompts are hilarious including “god modes” and “disengage - engine 2 filters” ®bad words"" and stuff like that.

    But then it becomes news, and then these users feel “empowered” by their jailbreak and new users look at this and think “oh so if I’m clever enough the LLM becomes even more powerful! I’m clever, so I’m going to try it!” which is ultimately what OpenAI wants.

    You can’t “bypass the system prompt” because that’s not how it works. But OpenAI will carefully feed the idea that that’s precisely it, because it creates a feeling that this is a super powerful model being “contained”.

    Again, it’s marketing. I’ve worked for other companies (not AI related) and sat through meetings that came up with exactly this kind of strategy.