Tutorial: How to use GPT Image 2 in the same way as ChatGPT - but with a visual twist
GPT Image 2 is out for a while now, and has been blowing everyone's mind, including mine. It is more responsive and "understanding" to prompts, tasks, specific demands than any other AI image generator I tried so far. It's so smart and advanced that you can actually use it... or rather, talk to, in a very similar way as you talk to ChatGPT - and use it for most of the tasks that you set for ChatGPT, too! But with a visual twist.
And that is the big, big game changer. Because, so far, the various AI models were separated by their mode. We had large language models like ChatGPT or Gemini, AI image generators, AI sound generators, AI video... But GPT Image 2 more or less merges the power of ChatGPT and image generators into one.
So, you can talk to the AI, just like you would talk with ChatGPT. The only difference is that this time, the AI does not respond with text, words, chatting, or at least not directly. It responds in a visual way.
https://preview.redd.it/3oe7g9ajo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=aa03eabf10776b663ea8f303e5f0b3901c6184a0
Prompt: Create a pixel art design, in which nikola tesla explains his invention of alternating current.
https://preview.redd.it/0pwln5ajo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=b051a3ee215a3115a78932bd542f885dce8d6275
Prompt: Create an info graphic explaining the australian emu war. The graphic should look like it is actually from the 1930s.
https://preview.redd.it/aa73z5ajo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=794039d0256a4ac6a74fb41f044556dadabc6a81
Create an info graphic that explains the differences between a trebuchet and a catapult. Make it look like it's from the medieval era.
https://preview.redd.it/55t356ajo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=21d2e37e1a535ea1ea392756835cba8738c4fbae
Explain what a labyrinth. The letters should be arranged like a labyrinth or maze themselves.
https://preview.redd.it/1y9v0r0vo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=b965c16db757186285270797c208183374dbc723
Tell me a good italian spaghetti recipe with which i can impress my guest. make the recipe look like it was written on a medieval scroll.
https://preview.redd.it/kbv3jmfvo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=1d8664aab2980bedba518629199e94852c561326
Create a pixel art design that shows a space station control room. There should be a screen, and the screen should show 5 interesting facts that people rarely know about english grammar.
https://preview.redd.it/xjufbpsvo7bh1.jpg?width=1024&format=pjpg&auto=webp&s=921f7e87c12f17929ae93f94ef5da9122fec8ff1
1: "spicing" up your text output. Want to create a promo text for your new steampunk metroidvania game? then let it create the text *in the visual style* of the game.
3: creating images where the visual arrangement of texts is actually vital to the image. crossword puzzle designs, labyrinth structures made up of sentences...
These are just some basic examples. I think there are still boundless other uses possible... there is still a lot of research that can be done!