Skip to main content

Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) https://ift.tt/JMEXqp7

Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) Today I designed a method to prevent users from jailbreaking ChatGPT (for instance, users have generated instructions to produce weapons or illegal drugs, commit a burglary, kill oneself, take over the world as an evil superintelligence, or create a virtual machine which they then can use). The OpenAI team appears to be countering these primarily using prompt engineering or fine-tuning on the ChatGPT model. The idea is to use a second and fully separate, fine-tuned LLM to evaluate prompts before sending them to ChatGPT. You can test this by inserting your successful ChatGPT jailbreaks. Break it for me if you dare! I look forward to seeing your results! https://ift.tt/3hUxKZp December 6, 2022 at 10:46PM

Comments

Popular posts from this blog

Show HN: TPMouse - A Virtual Trackball for Windows, controlled from the homerow https://ift.tt/BtjAqD4

Show HN: TPMouse - A Virtual Trackball for Windows, controlled from the homerow Hello all, I apologize for the repost as the previous submission was made from an unfortunate timezone. I've been refining my app to the point that it's pretty much become an indispensable daily driver in my own workflow. Hoping to hear some critiques/feedbacks on its usability! https://ift.tt/p6HvZCc October 24, 2022 at 02:24AM