MindMap Gallery A must-read for getting started with SD
We provide you with a comprehensive and detailed SD getting started guide, suitable for SD beginners, explaining the role of each node in detail. Help everyone better understand and use SD and its key components.
Edited at 2024-11-28 11:37:21CBT cognitive behavioral therapy, cognitive therapy, psychological counseling, CBT basic concept: ideas determine emotions, experience determines ideas, experience requires comparison to be meaningful, and there are individual differences in experience.
Psychological perception, perception is generated on the basis of sensation. It is the response of the human brain to the objective things and overall attributes that directly act on the sensory organs. The introduction is detailed, students in need can save it.
心理學知覺,知覺在感覺的基礎上產生它是人腦對直接作用於感覺器官的客觀事物,整體屬性的反應。介紹詳細,有需要的同學,可以收藏喲。
CBT cognitive behavioral therapy, cognitive therapy, psychological counseling, CBT basic concept: ideas determine emotions, experience determines ideas, experience requires comparison to be meaningful, and there are individual differences in experience.
Psychological perception, perception is generated on the basis of sensation. It is the response of the human brain to the objective things and overall attributes that directly act on the sensory organs. The introduction is detailed, students in need can save it.
心理學知覺,知覺在感覺的基礎上產生它是人腦對直接作用於感覺器官的客觀事物,整體屬性的反應。介紹詳細,有需要的同學,可以收藏喲。
A must-read for getting started with SD
Basic parameters
model
AI models used to generate or modify images
Usually a pre-trained diffusion model
positive
positive cue word
Describe what you want to see in the generated image
For example: "An orange cat sitting on the grass with the sun shining brightly"
negative
negative cue words
Describe what you don't want to see in the generated image
For example: "blurred, low-quality, distorted features"
latent image
potential image
An intermediate representation, not a directly visible image, but an encoded form
Possibly from previous processing steps or random initialization
seed
seed value
Used to initialize the random number generator
The same results can be reproduced using the same seed value
control after generation
Allows additional control or modification of results after generation
Specific functions may vary depending on the version of ComfyUI (usually modifying the seed value)
steps
Number of sampling steps
Determine the number of iterations of the diffusion process
More steps usually produce finer details but increase calculation time
cfg
No classifier guidance scale
Controls how much the generation process respects prompt words
Higher values will make the results closer to the prompt word, but may reduce creativity
sampler name
sampler name
Specifies the algorithm used to generate images from noise
Common ones include Euler, Euler a, DDIM, etc.
scheduler
Scheduler
Decide how to adjust the noise level during sampling
Different schedulers may affect build speed and quality
denoise
Denoise intensity
Control how much the input latent image is modified
When the value is 1, it is completely regenerated. The smaller the value, the more features of the original image are retained.
LATENT
refers to the latent space representation of the output
Is an encoded form of an image that can be used for further processing or conversion into a visual image
LATENT output connection options
processing node
LatentUpscale
Enlarge image in latent space
Used to increase image resolution
LatentComposite
Combine multiple images in latent space
Used for image stitching or local editing
LatentRotate
Rotate image in latent space
Used to adjust image orientation
LatentFlip
flip image in latent space
Can be flipped horizontally or vertically
LatentCrop
Crop image in latent space
Used to focus on a specific area of an image
Output node
VAEDecode
Decode latent space representation into visual image
for final output or preview
Savelmage
Save decoded image to file
Usually you need to go through VAEDecode first
Advanced foreign management node
LatentMixRepeat
Mix and repeat latent images
For creating complex patterns or textures
ControlNetApply
Apply ControlNet effects to underlying images
For precise control of the image generation process
LatentBlend
Mix two images in latent space
Used to create transition effects or composite images
latent image input source
"Latent image" is a key input to the KSampler node, which represents the representation of the image in the latent space. This option has a significant impact on the generation process, allowing a variety of different nodes to be connected as input sources.
Common input sources
EmptyLatentlmage
Create a blank potential image
Purpose: Generate new images from scratch
Features: Completely relies on prompt words and other parameters to generate content
VAEEncode
Encoding ordinary images into latent space representations
Purpose: Modify or generate based on existing images
Features: Retains certain characteristics of the original image, suitable for image editing and style conversion
KSampler(previous)
Using the output of the previous KSampler node
Purpose: Multi-stage generation or gradual refinement of images
Features: Parameters can be adjusted at each stage to achieve finer control
LatentComposite
Combine multiple images in latent space
Purpose: Create complex composite images or local edits
Features: Allows advanced image manipulation in latent space
LatentUpscale
Enlarge image in latent space
Purpose: Increase image resolution for further processing
Features: Can generate higher resolution details
Advanced input source
ControlNetApply
Apply ControlNet effects to underlying images
Purpose: Precisely control the image generation process
Features: Can be guided to generate based on reference images or specific conditions
LatentRotate/LatentFlip
Rotate or flip images in latent space
Purpose: Adjust the image direction for further processing
Features: Can change image composition without losing quality
LatentBlend
Mix two images in latent space
Usage: Create transition effects or composite images
Features: Can achieve smooth image fusion
Precautions for use
Dimension consistency: ensure that the input latent image dimensions are compatible with other settings of KSampler (such as the model)
Denoising strength: When using a non-empty latent image, adjusting the denoising strength can control the degree to which the original image features are retained.
Batch processing: Some nodes can output batches of latent images, suitable for generating multiple variants
Sampler algorithm
Euler
Simple yet effective benchmark sampler, fast
May be less detailed than more complex samplers
Euler a
Euler Ancestral
An improved version of Euler
Typically produces richer detail while maintaining speed
DDIM
Denoising Diffusion Implicit Models
Deterministic sampler
Produce stable and consistent results
Typically faster than other samplers while maintaining image quality
DPM series
Includes variants such as DPM 2M and DPM SDE
Typically produces high-quality results, especially when working with intricate details
LCM
Latent Consistency Model
Newer sampler designed for fast inference
Produce high-quality images in fewer sampling steps
Particularly suitable for real-time or near-real-time applications
UniPC
High efficiency sampler
Produce high-quality results in fewer steps
Strike a good balance between speed and quality
DDPM
One of the basic samplers for diffusion models
May not be as fast as some newer samplers
Typically produces stable and high-quality results
Sampleri selection advice
Getting Started: Start with Euler or Euler a. These samplers are easy to use, produce results quickly, and help understand basic concepts.
Looking for speed: Consider LCM or UniPC. These samplers can significantly reduce processing time while maintaining good image quality.
High-quality output: Try the DPM Series or DDIM. These samplers generally produce more detailed, higher quality images and are particularly suitable for projects that require fine detail.
Balance speed and quality: UniPC or DDIM may be good choices, they offer a good balance between efficiency and output quality.
Specific styles or themes: Certain samplers may perform better at generating specific types of images. For example, DPM may perform well on landscape images that require rich detail.
ComfyUI KSampler Scheduler options
In ComfyUI, KSampler is a powerful node for AI art generation. Among them, scheduler is a key parameter that determines how to manage the noise reduction steps during the generation process. Different schedulers can have a significant impact on build speed and final results.
Normal
Description: Standard scheduler, suitable for most situations.
Features: Consistent noise reduction rate throughout the entire sampling process.
Applicable scenarios: General purpose, suitable for most model and generation tasks.
Karras
Description: An optimized scheduler based on the work of Karras et al.
Features: Dynamically adjusts the noise reduction rate at different stages of the sampling process, often producing higher quality results.
Applicable scenarios: When pursuing higher quality output, especially in high-resolution or detailed image generation.
Exponential
Description: Use an exponential function to schedule noise reduction.
Features: More noise reduction in early steps, less noise in later steps.
Suitable for: When you want to quickly establish image structure in the early stages of the generation process.
Simple
Description: A simplified scheduling method.
Features: Linear noise reduction, light computational burden.
Applicable scenarios: When rapid generation is required or when computing resources are limited.
DDIM Uniform
Description: Uniform sampling based on the Denoising Diffusion Implicit Model (DDIM).
Features: Provides a more consistent sampling process, potentially leading to more stable results.
Applicable scenarios: When more predictable and consistent generation results are needed.