Civitai Beginners Guide To AI Art // #5 Prompting Principles // ft. Pookienumnums

16 May 2024 · 14:34

TLDR: In this fifth installment of the Civitai Beginners Guide to AI Art, community member and AI art veteran Pookienumnums shares fundamental principles of prompting for AI image generation. The guide corrects a common misconception about how AI creates images: rather than assembling a collage of existing artworks, the model draws on a library of patterns learned by training on billions of captioned images. Pookienumnums explains the importance of prompt structure, the difference between the 'BLIP' and 'Waifu Diffusion' captioning styles, and how to use positive and negative prompts to steer the AI toward the desired image. The tutorial also touches on how the sampling method, CFG setting, and seed selection affect the final artwork, and encourages experimentation to find one's own art style.


Q & A

  • What is the main focus of the 'Civitai Beginners Guide To AI Art' series?

    -The series focuses on teaching the principles of AI art creation, including the art of prompting AI to generate desired images.

  • Who is Pookienumnums and what is their role in the video?

    -Pookienumnums is a member of the Civitai community and an AI art veteran with 3 years of experience in AI image generation. They explain the principles of prompting in the video.

  • What is a 'prompt' in the context of AI art generation?

    -A prompt is the input given to the AI, which it uses to generate an image. It consists of tokens or keywords that the AI interprets to create the desired image.

  • How does AI interpret the tokens in a prompt to create an image?

    -The AI does not stitch together existing images. Instead, it starts with noise and gradually removes it, revealing patterns that match the words in the prompt, based on its training on billions of images and their captions.

  • What are the two major prompting styles mentioned in the script?

    -The two major prompting styles are 'BLIP', which uses natural language captions, and 'Waifu Diffusion', which uses tokens separated by commas to describe images.

  • What is the significance of 'latent space' in AI image generation?

    -Latent space is a conceptual model that represents the AI's internal data storage and pattern recognition system. It helps visualize how the AI associates words with image patterns.

  • How should one structure their prompts for AI image generation?

    -A prompt should include what you want to see, how it should look or what it's doing, and the desired quality. It's important to keep the prompt concise and adjust it incrementally to see how changes affect the outcome.

  • What is the purpose of negative prompts in AI image generation?

    -Negative prompts instruct the AI on what not to include in the image, helping to refine the generated image by excluding unwanted elements.

  • How can emphasis be added to certain parts of a prompt to influence the AI's focus?

    -Emphasis can be added by wrapping a token in parentheses. For finer control, follow the token with a colon and a weight between 0 and 2, for example (coffee shop:1.3).

  • What factors should be considered when selecting an AI model for image generation?

    -Factors include the style of the desired image, such as illustrative, anime, or realistic, and the AI model's training on specific types of images or styles.

  • What are some additional parameters that can be adjusted to improve AI image generation outcomes?

    -Parameters such as the sampling method, CFG (which controls how closely the AI adheres to the prompt), sampling steps (which affect how refined the image becomes and how long it takes to render), and seed (which determines the starting point of image generation) can be adjusted for better results.
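
The emphasis syntax described in the Q&A above can be illustrated with a small parser. This is only a sketch: `parse_prompt` is a hypothetical helper, not part of any tool shown in the video, and the default weight of 1.1 for bare parentheses is an assumption borrowed from the AUTOMATIC1111 web UI convention; other front ends may behave differently. Nested parentheses are not handled.

```python
import re
from typing import List, Tuple

# Matches "(token)" or "(token:1.3)". Hypothetical pattern for illustration only.
_WEIGHTED = re.compile(r"^\((.+?)(?::\s*([0-9]*\.?[0-9]+))?\)$")


def parse_prompt(prompt: str) -> List[Tuple[str, float]]:
    """Split a comma-separated prompt into (token, weight) pairs."""
    parsed: List[Tuple[str, float]] = []
    for chunk in prompt.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue  # ignore empty pieces such as a trailing comma
        match = _WEIGHTED.match(chunk)
        if match is None:
            parsed.append((chunk, 1.0))  # plain token, neutral weight
            continue
        text = match.group(1).strip()
        raw_weight = match.group(2)
        # Assumption: bare parentheses mean a mild boost of 1.1.
        weight = float(raw_weight) if raw_weight is not None else 1.1
        if not 0.0 <= weight <= 2.0:
            raise ValueError(f"weight for {text!r} should be between 0 and 2")
        parsed.append((text, weight))
    return parsed


print(parse_prompt("man, (coffee shop:1.3), (high quality)"))
```

Seeing the prompt as a list of weighted tokens makes it easier to adjust one emphasis value at a time and compare the results.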



💡Prompting Principles

Prompting Principles are the fundamental guidelines and strategies for communicating effectively with AI to generate desired images. In the video, Pookienumnums explains these principles to help viewers construct prompts that guide the AI in creating art. The concept is central to the video's theme, which is to teach beginners how to use AI for art creation.

💡AI Art

AI Art is a form of artistic creation that employs artificial intelligence algorithms to generate images or visual content. The video is a guide for beginners in the realm of AI art, focusing on the process of prompting AI to produce specific images, which is a key aspect of creating AI art.


💡Tokens

In the context of AI art, tokens are the individual elements or descriptors within a prompt that the AI uses to recognize and generate patterns in an image. For example, 'man', 'coffee shop', and 'high quality' are all tokens that the AI interprets to create a coherent image, as explained by Pookienumnums.

💡Pattern Recognition

Pattern Recognition is the AI's ability to associate words from captions with visual patterns in images. This is a crucial concept in AI art generation, as the AI uses this skill to understand and produce the images described in prompts. The video explains how the AI is trained on billions of images with captions to develop this ability.

💡Latent Space

Latent Space is a theoretical construct that represents the internal data structure of the AI model, where patterns associated with specific tokens are stored. Pookienumnums uses the analogy of a spider web to describe how closely related prompts have stronger connections in this space, which affects the AI's image generation process.

💡Positive and Negative Prompts

Positive prompts are instructions to the AI about what the user wants to see in the image, while negative prompts tell the AI what to avoid. The video demonstrates how adjusting these prompts can refine the AI's output, such as removing a green background by adding 'green background' to the negative prompt.
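
The refinement described above, where an unwanted element is moved into the negative prompt, might be tracked like this. `PromptPair` and its methods are hypothetical names for illustration; image generators typically just expose two text fields, one for each prompt.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class PromptPair:
    """Holds the positive and negative prompts as lists of tokens."""
    positive: List[str] = field(default_factory=list)
    negative: List[str] = field(default_factory=list)

    def exclude(self, token: str) -> None:
        """Add a token to the negative prompt, skipping blanks and duplicates."""
        token = token.strip()
        if token and token not in self.negative:
            self.negative.append(token)

    def as_strings(self) -> Tuple[str, str]:
        """Return (positive, negative) as comma-separated prompt strings."""
        return ", ".join(self.positive), ", ".join(self.negative)


pair = PromptPair(positive=["cartoon girl", "Street Fighter"])
pair.exclude("green background")  # the unwanted element seen in the first result
print(pair.as_strings())
```

Changing one token at a time, then regenerating, makes it clear which change caused which difference in the output.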

💡Style Modifiers

Style Modifiers are terms in a prompt that specify the artistic style or aesthetic the user wants the AI to apply to the image. In the script, 'Street Fighter' is used as a style modifier to give the cartoon girl a specific visual theme, illustrating the use of style in AI art prompts.

💡Quality Modifiers

Quality Modifiers are descriptors that define the quality or resolution of the image the user desires. The video mentions 'high quality' and 'high resolution' as examples of quality modifiers that guide the AI in generating images with a specific level of detail and clarity.
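
Putting the pieces together, a comma-separated prompt can be assembled from a subject, descriptive or style tokens, and quality modifiers. The sketch below assumes the tag-style (comma-separated) format; `build_prompt` is a hypothetical helper, and the example tokens are the ones mentioned in this summary.

```python
from typing import Iterable


def build_prompt(subject: str,
                 details: Iterable[str] = (),
                 quality: Iterable[str] = ()) -> str:
    """Join subject, detail/style tokens, and quality modifiers with commas."""
    parts = [subject, *details, *quality]
    # Drop empty entries so stray blanks do not produce ", ," in the prompt.
    cleaned = [part.strip() for part in parts if part and part.strip()]
    return ", ".join(cleaned)


# Start short, then add one modifier at a time and compare the results.
print(build_prompt("man", ["coffee shop"], ["high quality"]))
print(build_prompt("man", ["coffee shop"], ["high quality", "high resolution"]))
```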


💡CFG

CFG, short for classifier-free guidance (the setting is often labeled 'CFG scale'), is a parameter that determines how strictly the AI adheres to the prompt. A lower CFG gives the AI more flexibility, while a higher value makes it follow the prompt more literally. The video suggests starting with a CFG between 7 and 10 for a balanced result.
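
In Stable Diffusion-style tools, the CFG value is usually the scale used in classifier-free guidance: at each step the model makes one prediction with the prompt and one without, and the scale controls how far the result is pushed toward the prompted prediction. The video does not go into this, so the following is only a sketch of the standard formula on plain lists of numbers. `apply_cfg` is a hypothetical name, and real samplers apply the same arithmetic to large tensors.

```python
from typing import List, Sequence


def apply_cfg(uncond: Sequence[float],
              cond: Sequence[float],
              scale: float) -> List[float]:
    """Combine unconditional and prompt-conditioned predictions.

    guided = uncond + scale * (cond - uncond)
    """
    if len(uncond) != len(cond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


uncond = [0.0, 1.0]  # toy prediction without the prompt
cond = [1.0, 3.0]    # toy prediction with the prompt
for scale in (1.0, 7.0, 10.0):
    print(scale, apply_cfg(uncond, cond, scale))
```

With a scale of 1 the result equals the prompted prediction; larger values exaggerate the difference between the two predictions, which is why high CFG values follow the prompt more literally.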

💡Sampling Steps

Sampling Steps refer to the number of iterations the AI goes through to refine the image. The video explains that more steps allow for a more refined image but also increase the rendering time. It suggests experimenting with different values to find the optimal balance for a particular art style.


💡Seed

The Seed is the starting point for the AI's image generation process. A random seed generates a unique image each time, while a fixed seed allows the user to refine a particular image by adjusting other parameters. The video advises using a random seed for initial exploration and switching to a fixed seed for detailed refinement.
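
Why a fixed seed reproduces the same starting point can be sketched with Python's standard `random` module. `initial_noise` is a hypothetical helper: real generators draw a much larger noise image from their own random number generator, but the principle of seeding is the same.

```python
import random
from typing import List, Optional


def initial_noise(seed: Optional[int], size: int = 4) -> List[float]:
    """Return a small list of Gaussian noise values for the given seed.

    Passing None lets Python pick a fresh random state each time,
    which mirrors the 'random seed' setting.
    """
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(size)]


# Fixed seed: identical starting noise, so only other settings change the image.
print(initial_noise(1234) == initial_noise(1234))

# Random seed: a new starting point on every call, useful for exploration.
print(initial_noise(None))
```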


