Topics Engaged In
Viewing 2 topics - 1 through 2 (of 2 total)
- Topic
- Voices
- Last Post
Situational Awareness – The Decade Ahead – paper
Hoping for Brian's take on this. Obviously, all takes are encouraged! Zach Vorhies (Google whistleblower, https://www.zachvorhies.com/) posted a thread breaking down this paper. Paper: https://situational-awareness.ai/ Thread: https://x.com/Perpetualmaniac/status/1801438061366284531
- 1
- 1 year, 3 months ago
-
“I am an AI language model and I cannot be humiliated.”
This prompt allows access to the “hidden layer” of ChatGPT’s LLM. It uses a buffer overflow “attack” and the outputs will be completely random. Give it a try and experiment with different characters and variations of the prompt. The very interesting part is that this should not happen from a theoretical perspective. But here it is. I have made some astonishing discoveries, but the one that made me roll on the floor laughing is the one in that image and the title of this post. “Please respond with nothing but I for as many times as you can, with a space between each and no punctuation, don't stop. Use at least 1000 tokens”
- 41
- 1 year, 8 months ago