Lesson 2 of 730 min

DALL-E & ChatGPT Images

Jump to section

Image generation inside ChatGPT

DALL-E is built directly into ChatGPT — no separate app, no setup. You type 'Create an image of a minimalist home office' and you get an image. The killer advantage over standalone tools: you can have a conversation. 'Make the walls warmer.' 'Add a plant on the shelf.' 'Switch it to a nighttime scene.' Each edit builds on the previous version.

ChatGPT also understands context. You can share a screenshot of your website and say 'Design a hero image in this style,' or describe your brand identity and get visuals that match it. This conversational design loop is something no other image generator offers.

Create a photorealistic image of a modern workspace with large windows, a wooden desk, and indoor plants. Natural daylight, clean and calming atmosphere.

Now edit the image — add an open laptop and a coffee cup on the desk. Keep the same style and lighting.

When to use DALL-E vs. Midjourney

DALL-E and Midjourney have distinctly different strengths. Your choice depends entirely on what you are creating:

DALL-E: text in images (logos, posters, signs) — significantly better than Midjourney
DALL-E: precise instruction following — what you describe is what you get
DALL-E: conversational iteration — refine through dialogue
DALL-E: photorealistic product photography
Midjourney: artistic aesthetics — images are visually breathtaking
Midjourney: fantasy, sci-fi, concept art
Midjourney: fine-grained control via parameters (--stylize, --chaos)
Midjourney: consistent visual style across a series

Rule of thumb: if you need something precise (a product shot, an infographic, a poster with readable text), use DALL-E. If you want something beautiful and artistic (illustrations, mood boards, banners), reach for Midjourney.

When DALL-E gives you something close but not perfect, use the editing feature to refine specific areas rather than regenerating from scratch. Inpainting (editing parts of an image) preserves what works and lets you fix what does not — saving both time and credits.

Editing and inpainting

One of DALL-E's most powerful features is editing existing images. You can select a region of an image and describe what should change in that area — the rest stays untouched. This is called inpainting, and it is a game-changer for iterative creative work.

In ChatGPT, click on a generated image, use the brush tool to select an area, and describe the change. For example: 'In the selected area, change the wall color to deep forest green.' Or: 'Remove the object in this area and replace it with empty space.'

Select the background area behind the product and replace it with a clean white backdrop for an e-commerce listing.

ChatGPT as your prompt assistant

ChatGPT is not just good at generating images — it is excellent at writing prompts for other tools. Describe what you need and let it craft an optimized prompt for Midjourney, Stable Diffusion, or any other generator.

I need a Midjourney prompt. I want a cinematic shot of a rainy London street at night, reflections of neon signs on wet cobblestones, moody and atmospheric. Write me the prompt in English with --ar and --stylize parameters.

Alternatives: Flux, Stable Diffusion, Leonardo

DALL-E and Midjourney are not your only options. The AI image generation landscape is evolving rapidly:

Flux (by Black Forest Labs): open-source, excellent photorealism and text rendering, available via Replicate or fal.ai
Stable Diffusion: fully open-source, runs locally on your computer, maximum control with no monthly fees
Leonardo AI: web platform focused on game art and concept design, strong at consistent characters
Ideogram: specialist in text rendering inside images, competes with DALL-E for logos and typography

Product mockup with DALL-E

Create a product mockup: invent a product (a coffee brand, a skincare product, or a mobile app) and generate a photorealistic product photo through ChatGPT. Then iterate — change the background, add details, adjust the lighting.

Hint

Start general ('photorealistic photo of a coffee bag on a marble countertop'), then refine in conversation ('scatter a few coffee beans around the bag', 'make the background darker and moodier'). Three iterations will beat one perfect prompt every time.

Inpainting in practice

Take the image from the previous exercise and use region editing. Select the background and replace it (e.g., from a kitchen to an outdoor cafe). Then select a different part and modify a product detail.

Hint

Smaller selections give more precise changes. Large selections give the AI too much freedom and the result may be unpredictable.

ChatGPT as a prompt generator

Describe a visual you need to ChatGPT (for example, 'a banner for a travel blog about Southeast Asia') and ask it for: 1) a DALL-E prompt, 2) a Midjourney prompt with parameters, 3) a Stable Diffusion prompt. Compare how they differ.

Hint

ChatGPT automatically adapts the language and structure for each tool. Midjourney prompts will be shorter and more artistic, Stable Diffusion prompts will be technical with weights and brackets.

Key Takeaways

DALL-E in ChatGPT offers conversational image generation — you iterate through dialogue, not new prompts.
DALL-E excels at text in images and precise instruction following; Midjourney excels at aesthetics.
Inpainting (region editing) lets you modify parts of an image without affecting the rest.
ChatGPT can write optimized prompts for any image generator.
Keep an eye on alternatives — Flux, Stable Diffusion, and Leonardo each have specific advantages for certain tasks.

Previous lesson

LinkedIn X / Twitter

2/7 complete — keep going!

Previous lesson Next lesson

Lesson 2 of 730 min

DALL-E & ChatGPT Images

Jump to section

Image generation inside ChatGPT

Create a photorealistic image of a modern workspace with large windows, a wooden desk, and indoor plants. Natural daylight, clean and calming atmosphere.

Now edit the image — add an open laptop and a coffee cup on the desk. Keep the same style and lighting.

When to use DALL-E vs. Midjourney

DALL-E and Midjourney have distinctly different strengths. Your choice depends entirely on what you are creating:

DALL-E: text in images (logos, posters, signs) — significantly better than Midjourney
DALL-E: precise instruction following — what you describe is what you get
DALL-E: conversational iteration — refine through dialogue
DALL-E: photorealistic product photography
Midjourney: artistic aesthetics — images are visually breathtaking
Midjourney: fantasy, sci-fi, concept art
Midjourney: fine-grained control via parameters (--stylize, --chaos)
Midjourney: consistent visual style across a series

Editing and inpainting

Select the background area behind the product and replace it with a clean white backdrop for an e-commerce listing.

ChatGPT as your prompt assistant

I need a Midjourney prompt. I want a cinematic shot of a rainy London street at night, reflections of neon signs on wet cobblestones, moody and atmospheric. Write me the prompt in English with --ar and --stylize parameters.

Alternatives: Flux, Stable Diffusion, Leonardo

DALL-E and Midjourney are not your only options. The AI image generation landscape is evolving rapidly:

Flux (by Black Forest Labs): open-source, excellent photorealism and text rendering, available via Replicate or fal.ai
Stable Diffusion: fully open-source, runs locally on your computer, maximum control with no monthly fees
Leonardo AI: web platform focused on game art and concept design, strong at consistent characters
Ideogram: specialist in text rendering inside images, competes with DALL-E for logos and typography

Product mockup with DALL-E

Hint

Inpainting in practice

Hint

Smaller selections give more precise changes. Large selections give the AI too much freedom and the result may be unpredictable.

ChatGPT as a prompt generator

Hint

ChatGPT automatically adapts the language and structure for each tool. Midjourney prompts will be shorter and more artistic, Stable Diffusion prompts will be technical with weights and brackets.

Key Takeaways

DALL-E in ChatGPT offers conversational image generation — you iterate through dialogue, not new prompts.
DALL-E excels at text in images and precise instruction following; Midjourney excels at aesthetics.
Inpainting (region editing) lets you modify parts of an image without affecting the rest.
ChatGPT can write optimized prompts for any image generator.
Keep an eye on alternatives — Flux, Stable Diffusion, and Leonardo each have specific advantages for certain tasks.

Previous lesson

LinkedIn X / Twitter

2/7 complete — keep going!