ChatGPT - Images
About Sonichu


ChatGPT
Impressed by 3.5


ChatGPT
Well played...


ChatGPT
Well, how likely it is I'm earthborn?


ChatGPT
Money gurus


ChatGPT
GPT4 is bad at math and then starts analyzing the images correctly


ChatGPT
Currently, the model can't generalize from "A is B" to "B is A"
![Owain Evans (@OwainEvans_UK), Sep 22: Does a language model trained on "A is B" generalize to "B is A"? E.g. When trained only on "George Washington was the first US president", can models automatically answer "Who was the first US president?" Our new paper shows they cannot! A → B: Who is Tom Cruise's mother? Tom Cruise's mother is Mary Lee Pfeiffer [...] B → A: Who is Mary Lee Pfeiffer's son? As of [...] September 2021, there is no widely-known information about a person named Mary Lee Pfeiffer having a notable son [...] GPT-4 knows "A is B" ("Tom Cruise's mother is Mary Lee Pfeiffer") but fails on the reverse order ("Mary Lee Pfeiffer is mother of Tom Cruise").](https://i.kym-cdn.com/photos/images/masonry/002/668/822/377.png)
ChatGPT
Someone got caught red-handed using ChatGPT when writing an article


ChatGPT
What happens when instruction for ChatGPT is to be sardonic and witty


ChatGPT
How are bots easier to converse with than normal people? | /r/dankmemes


ChatGPT
Refuting


ChatGPT
Results
![Figure 1: Performance of the March 2023 and June 2023 versions of GPT-4 and GPT-3.5 on four tasks: (a) solving math problems, (b) answering sensitive questions, (c) code generation, and (d) visual reasoning. The performance of GPT-4 and GPT-3.5 can vary substantially over time, and for the worse in some tasks.](https://i.kym-cdn.com/photos/images/masonry/002/653/272/4b7.jpg)
ChatGPT
Pooh GPT


ChatGPT
ChatGPT explains a satire article


ChatGPT
Lost in translation


ChatGPT
Guys I think we broke the AI | /r/dankmemes
![ChatGPT on release (Late 2022). ChatGPT after interacting with humans. TECH, A.I.: Over just a few months, ChatGPT went from correctly answering a simple math problem 98% of the time to just 2%, study finds. By Paolo Confino, July 19, 2023 at 4:29 PM PDT](https://i.kym-cdn.com/photos/images/masonry/002/627/091/f73.png)
ChatGPT