Midjourney V6 UPDATES! - Comic Books & Compare with DALL-E 3

Snowball AI
25 Dec 2023 · 11:01

TLDR

Midjourney V6 introduces significant updates: it understands prompts better and can now draw text. The new version excels at realistic images, where the video rates it above DALL-E 3. Features include more accurate prompt following, longer prompts, better coherence, and improved prompting and remix for greater image control. Users can have Midjourney draw text by placing it within quotation marks in the prompt; one example renders the words "book review" in a monospaced font. Improved upscalers increase resolution by 2x, and a new version of the 'describe' feature is on the way. The video also compares Midjourney V6 with DALL-E 3, showcasing the strengths of each across various image prompts, and hints at an upcoming tutorial on creating comic books with the new version.


Q & A

  • What is the main improvement in Midjourney V6 compared to its previous versions?

    -Midjourney V6 has improved at understanding prompts and has introduced a text drawing ability, making it better at generating images based on detailed descriptions.

  • How does Midjourney V6 handle text in images?

    -Midjourney V6 can draw text by including the text within quotations in the prompt. It has shown the capability to render text in various styles and contexts within images.

  • What are some of the new features and improvements in Midjourney V6?

    -New features include more accurate prompt following, longer prompts for more detailed descriptions, improved coherence and model knowledge, improved prompting and remix for better control over images, and text drawing ability.

  • How does Midjourney V6 compare to DALL-E 3 in terms of generating realistic images?

    -According to the transcript, Midjourney V6 is better than DALL-E 3 when it comes to generating realistic images.

  • What is the recommended style or setting for better text drawing results in Midjourney V6?

    -For better text drawing results, it is recommended to use the raw style or lower stylize values in Midjourney V6.

  • What is the process to upscale the resolution of images generated by Midjourney V6?

    -To upscale the resolution by 2x, users can click on the upscaling buttons after the first set of generations, where they will find two new options: subtle and creative.

  • How does Midjourney V6 differ from previous versions in terms of prompting?

    -Prompting with Midjourney V6 is significantly different and more sensitive to the details in the prompt. Users are advised to avoid generic terms and be explicit about what they want for better results.

  • What is the 'describe' feature in Midjourney, and how is it useful?

    -The 'describe' feature in Midjourney is used to get consistent results of a style by reverse-engineering what the model sees in an image. It helps in generating images with a specific style.

  • How does the new update to Midjourney V6 affect the speed of image generation?

    -The new update to Midjourney V6 has made the image generation process almost three times faster, showcasing the rapid development of AI models.

  • What are some of the challenges mentioned in the script for testing Midjourney V6's capabilities?

    -Some of the challenges include creating an image of a person standing upside down with their arms up, an upside-down volcano, and a closed umbrella leaning against a tree.

  • How does the speaker plan to utilize Midjourney V6 for comic books?

    -The speaker is working on a project to share with the audience about creating comic books using Midjourney V6 and Photoshop, indicating a future tutorial or guide.

  • What is the main difference between how Midjourney V6 and DALL-E 3 handle prompts?

    -Midjourney V6 takes the prompt as entered and processes it directly, while DALL-E 3 rewrites the prompt in the background before processing it, which might make DALL-E 3 more beginner-friendly.

  • How does the speaker compare the image generation of Midjourney V6 and DALL-E 3 for a futuristic cityscape?

    -The speaker finds that Midjourney V6 provides a more realistic view with more variations, whereas DALL-E 3 produces fewer images per prompt but with a distinct style.

  • What is the speaker's opinion on which AI model is better for generating images of an alien marketplace?

    -The speaker believes that Midjourney V6 is the clear winner for generating images of an alien marketplace, as it provides a more immersive and detailed view.

  • How does the speaker evaluate the images generated by Midjourney V6 and DALL-E 3 for a prompt about a magical forest with talking animals?

    -The speaker finds that DALL-E 3 did better in this case, as it made the animals seem more engaged in conversation, despite Midjourney V6 providing good image quality.

  • What does the speaker suggest for users who want to learn more about creating comic books with Midjourney V6?

    -The speaker is creating a small group to go in-depth on how to create comic books using Midjourney V6 and Photoshop, and encourages users to subscribe and share their thoughts.



💡Midjourney V6

Midjourney V6 refers to the sixth version of an AI image generation tool called Midjourney. This tool is designed to create images based on textual prompts provided by users. In the video, the host is excited about the new features and improvements in version 6, which include better prompt understanding and text drawing capabilities. For instance, the script mentions that Midjourney V6 can now draw text within images, such as 'book review' in a monospaced font, showcasing its advanced capabilities.

💡Text Drawing Ability

The text drawing ability is a feature of Midjourney V6 that allows the AI to incorporate text into the generated images. Users can specify the text they want to appear in the image by placing it within quotations in their prompts. This feature is highlighted in the script as a significant update, with examples provided by the community, such as 'a computer nerd is looking at a computer screen with the words "book review" showing in a monospaced font'.
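As a rough illustration of the quoting convention described above, here is a minimal Python sketch that assembles such a prompt string. The helper name `build_text_prompt` and the stylize value of 50 are assumptions made for illustration; the video does not give exact values. The sketch only builds the string you would paste after `/imagine`; it does not call Midjourney.

```python
from typing import Optional


def build_text_prompt(scene: str, text: str, *, raw: bool = True,
                      stylize: Optional[int] = 50) -> str:
    """Assemble a prompt that asks for the given text, wrapped in quotes.

    Hypothetical helper: it only formats a string and talks to no service.
    """
    if '"' in text:
        # Quotes delimit the text to draw, so nested quotes would be ambiguous.
        raise ValueError("text must not contain double quotes")
    parts = [f'{scene} with the words "{text}"', "--v 6"]
    if raw:
        parts.append("--style raw")           # raw style, suggested for text
    if stylize is not None:
        parts.append(f"--stylize {stylize}")  # a low value; 50 is illustrative
    return " ".join(parts)


print(build_text_prompt(
    "a computer nerd is looking at a computer screen", "book review"))
```

Passing `raw=False` or `stylize=None` drops the corresponding parameter, which makes it easy to compare results with and without the recommended settings.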

💡Realistic Images

Realistic images are a key focus of the video, as the host compares the quality and detail of images generated by Midjourney V6 with those produced by DALL-E 3. The script mentions that Midjourney V6 is 'better than DALL-E 3 when it comes to realistic images,' indicating that the new version has improved its ability to create lifelike visuals. This is demonstrated through various examples, such as a detailed prompt leading to an image of a woman with accurate details as specified.

💡Prompt Following

Prompt following is the ability of an AI to accurately interpret and generate images based on the textual descriptions provided by users. The script emphasizes that Midjourney V6 has improved in this area, with 'more accurate prompt following' and the ability to handle 'longer prompts.' This allows for more detailed descriptions, leading to images that are more closely aligned with the user's intentions.

💡Coherence and Model Knowledge

Coherence and model knowledge refer to the AI's ability to maintain a logical and consistent output based on its training data. The script mentions that Midjourney V6 has 'improved coherence and model knowledge,' meaning it has been trained on more data and is better at understanding and fulfilling user requests. This results in images that are not only more accurate but also more contextually relevant.


💡Remix

Remix is a feature that allows users to have more control over the generated images by remixing elements of previous images. The script describes the 'improved prompting and remix' as an interesting new feature, suggesting that with Midjourney V6, users can now have greater influence over the final output, leading to more customized and desired results.


💡Upscaling

Upscaling in the context of image generation refers to the process of increasing the resolution of an image. The script mentions 'improved upscalers that increase your resolution by 2x,' indicating that Midjourney V6 has enhanced its upscaling capabilities. This allows users to generate higher quality images with more detail, as demonstrated by the 'subtle and creative' upscaling options.
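To make the 2x figure concrete, the short Python sketch below works out the resulting dimensions. The 1024 x 1024 starting size is an assumption used for illustration, not a number taken from the video, and `upscaled_size` is a hypothetical helper.

```python
def upscaled_size(width: int, height: int, factor: int = 2) -> tuple:
    """Return the pixel dimensions after scaling each side by `factor`."""
    return width * factor, height * factor


base = (1024, 1024)  # assumed starting size, for illustration only
new_w, new_h = upscaled_size(*base)
print(new_w, new_h)                          # 2048 2048
print((new_w * new_h) // (base[0] * base[1]))  # 4: four times as many pixels
```

Doubling each side quadruples the total pixel count, which is why a 2x upscale takes noticeably more work than the original generation.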

💡Describe Feature

The describe feature is a tool that helps users achieve consistent results in terms of style by analyzing what the AI sees in an image. The script mentions that Midjourney will release a new version of this feature, which the host uses frequently. It suggests that this feature can be very useful for reverse-engineering the AI's interpretation of images to achieve specific styles.

💡Comic Books

Comic books are a form of visual storytelling that combines images and text. The script mentions that there have been many requests and questions regarding comic book and comic style prompts for Midjourney. The host is excited to announce that they are working on a project related to creating comic books using Midjourney V6, indicating the tool's potential for diverse creative applications.

💡Depth of Field

Depth of field is a photographic term that refers to the distance between the nearest and farthest objects in a scene that are in acceptably sharp focus. The script includes an example where the focus is explained to be on a DSLR camera, with the background having a beautiful depth of field. This showcases Midjourney V6's ability to understand and apply complex photographic concepts in its image generation.


💡Photorealism

Photorealism is the quality of an image appearing extremely realistic, as if it were a photograph. The script frequently refers to the photorealistic capabilities of Midjourney V6, especially in close-up images, where the details in skin, eyes, and lighting are highly realistic. This highlights the tool's advancement in creating images that closely mimic real-life visuals.


