【AI视频】革命性突破!最全无闪烁AI视频制作教程 真正生产力 Stable diffusion + EbSynth + ControlNet

魔都老王Shanghai Lao wong
28 Jun 202309:05

TLDRIn this engaging tutorial, the creator showcases a breakthrough in video generation using Stable Diffusion, addressing a common issue of flickering in videos produced by earlier versions. The video demonstrates a seamless process to create high-quality videos without flickering, leveraging tools like isnet_Pro for background control and ebsynth for keyframe enhancement. The creator walks viewers through downloading necessary software, setting up the environment, and using plugins to control the video's appearance effectively. By following a step-by-step guide, including keyframe selection, image redrawing, and video compilation, the audience learns to produce smooth, high-quality videos rapidly. This tutorial promises a foolproof method for creating flicker-free videos, encouraging viewers to try it out and share their feedback.


  • 🎥 The video demonstrates a significant improvement in stable diffusion technology for video generation, addressing previous issues like flickering frames.
  • 🚀 The new stable diffusion process has greatly reduced the time required for video generation, eliminating the need for extensive waiting periods.
  • 🔗 The video creation process involves using a series of tools and plugins, including ebsynth, FFMPEG, and stable diffusion with specific plugins.
  • 📥 Users are guided through downloading and installing necessary software and plugins, with detailed instructions for setup and configuration.
  • 🖼️ The process starts with deconstructing a reference video into individual frames, extracting keyframes, and then redrawing them to minimize flickering.
  • 🎨 The redrawing of keyframes is done using a control net with multiple settings to refine the output, including soft edge and lineart options.
  • 📸 After redrawing, intermediate frames are generated to create a smooth transition between keyframes, resulting in a complete video sequence.
  • 🌈 Color correction and dimension adjustments are optional steps in the process, depending on the desired outcome.
  • 🎞️ The final step involves using the EBSynth program to compile the frames into a finished video file in MP4 format.
  • 📌 The tutorial encourages viewers to experiment with the process, seek help in the comments section if needed, and engage with the content by liking, saving, and following.

Q & A

  • What is the main issue with videos generated by previous versions of Stable Diffusion?

    -The main issue with videos generated by previous versions of Stable Diffusion is the flickering of the的画面 due to the frame-by-frame synthesis process, where differences between consecutive frames cause the flicker.

  • What tool was used to control the background and reduce flickering in the previous video?

    -The previous video used a tool called isnet_Pro to control the background and reduce flickering.

  • How has the new video generation process improved in terms of flickering?

    -The new video generation process has significantly improved by virtually eliminating flickering, providing a much smoother visual experience.

  • What is the role of FFMPEG in the video generation process described in the script?

    -FFMPEG is used in the video generation process as a tool for handling video and audio files, allowing for tasks such as conversion, editing, and compression.

  • What is the purpose of installing the background control plugin and Stable Diffusion's plugin?

    -The background control plugin and Stable Diffusion's plugin are installed to enhance the video generation process by providing additional functionalities such as background control and advanced image processing capabilities.

  • How does the process of extracting keyframes from a video sequence work?

    -The process of extracting keyframes from a video sequence involves analyzing the sequence to identify and select frames that are representative of the content, using parameters like minimum and maximum keyframe intervals to determine the number of keyframes extracted.

  • What are the steps involved in the video making process using the new Stable Diffusion plugin?

    -The steps involved in the video making process using the new Stable Diffusion plugin include setting up the project path, uploading materials, extracting keyframes, redrawing keyframes with control nets, generating in-between frames, color correction (optional), resizing, and finally合成 the video files into a complete video sequence.

  • How does the Control Net setting in Stable Diffusion contribute to the video generation process?

    -The Control Net setting in Stable Diffusion allows for the use of additional control networks, such as soft edge and lineart, which can refine the generation process by ensuring better edge detection and matching, leading to smoother transitions and more coherent visuals in the final video.

  • What is the significance of the 'mask' settings in the Stable Diffusion plugin?

    -The 'mask' settings in the Stable Diffusion plugin are used to define the level of detail and precision in the generated images, with lower values allowing for more generalized features and higher values providing more detailed and specific elements.

  • How long does the video generation process typically take with the new Stable Diffusion plugin?

    -The video generation process with the new Stable Diffusion plugin is significantly faster than previous methods, as it does not require the use of high-end hardware like a 4090 GPU for dozens of hours to complete.

  • What type of video file is produced at the end of the video generation process described in the script?

    -At the end of the video generation process, an MP4 format video file is produced, which can include background music or be generated without it, depending on the user's choice.



🎥 Introduction to Stable Diffusion Video Generation

The paragraph introduces the process of video generation using Stable Diffusion, highlighting the improvements over previous versions. It discusses the issue of flickering in earlier videos due to frame-by-frame synthesis and the introduction of tools like isnet_Pro to control the background and reduce flickering. The speaker then promises a smooth demonstration of how to create such videos without the need for high-end hardware like a 4090 graphics card. The explanation includes a brief overview of the video generation principle and a step-by-step guide on downloading necessary files and setting up the environment.


🛠️ Detailed Setup and Video Production Process

This paragraph delves into the detailed steps of setting up the environment for video production with Stable Diffusion. It covers the installation of ebsynth, FFMPEG, and background control plugins, as well as the configuration of system environment variables. The speaker then explains the installation and application of Stable Diffusion plugins, including settings adjustments and the use of control nets. The paragraph concludes with a comprehensive guide on video production, from extracting keyframes to final video generation, including the use of masks, seed selection, and color correction. The speaker also provides tips on generating the final video files and encourages viewers to engage with the content.



💡Stable Diffusion

Stable Diffusion (SD) is an AI-based tool used for generating digital content, such as images and videos, by synthesizing visuals based on textual descriptions or modifying existing visuals. In the context of the video, Stable Diffusion is highlighted for its advancements in video generation, overcoming previous limitations like frame flickering. The narrator describes using Stable Diffusion to create a video that significantly differs from earlier versions, emphasizing the elimination of flickering and the enhancement in processing speed.


Flickering in video generation refers to the rapid variation in brightness between frames, leading to an unstable or flashing visual effect. This problem is commonly associated with AI-generated videos where consecutive frames lack consistency. The script addresses this issue by noting improvements in the latest video generation methods, which have reduced flickering, thereby enhancing the visual quality of the generated content.

💡Frame-by-frame synthesis

Frame-by-frame synthesis is a process used in video generation where each frame of the video is individually created or modified. This approach can lead to flickering due to inconsistencies between frames. The video script discusses this method in the context of explaining the causes of flickering in earlier AI-generated videos and how new techniques have mitigated this issue.


isnet_Pro is mentioned as a tool that helps control the background in videos to reduce flickering. It represents a step in the evolution of video generation technologies, providing users with the means to generate smoother videos. The script highlights its use in creating a video with minimal flickering, showcasing it as an example of technological advancement in the field.


Ebsynth is a software tool used for frame interpolation and video effects, applying style transfer techniques to videos. In the video, Ebsynth's website is visited to download software necessary for the video generation process, indicating its role in achieving the final video effect by enhancing consistency between frames.


FFMPEG is a free, open-source software project that consists of a vast library of tools for handling video, audio, and other multimedia files and streams. The script describes the process of downloading and installing FFMPEG, highlighting its importance in processing and converting video files, which is a critical step in the video generation workflow described in the video.

💡Control net

Control net, within the context of the video, refers to a feature or plugin for Stable Diffusion that allows for enhanced control over the video generation process. The narrator explains setting this feature to '3' to enable multiple control windows, which facilitates finer manipulation of the video generation parameters. This adjustment plays a crucial role in achieving the desired video quality and effect.

💡Key frames

Key frames in video editing and generation refer to significant frames that mark the beginning or end of a transition or a significant visual change. In the script, the process involves extracting key frames from a video sequence for detailed editing or re-drawing, which then serve as reference points for interpolating the frames in between. This technique helps in creating smoother transitions and reducing flickering.

💡Frame interpolation

Frame interpolation is a technique used to generate intermediate frames between two existing frames in a video, aiming to create a smoother motion or transition. In the context of the video, this process is used after re-drawing key frames to fill in the gaps, thereby enhancing the fluidity of the generated video and reducing flickering.

💡Video synthesis

Video synthesis refers to the process of creating a new video from existing materials, which can include modifying frames, adding effects, or generating entirely new content based on algorithms. The script discusses video synthesis in the context of using Stable Diffusion and other tools to transform a raw video into a polished, AI-generated piece with minimal flickering and enhanced visual appeal.


The video showcased is generated using the latest version of stable diffusion, which has significantly improved from previous versions.

The main issue with earlier stable diffusion-generated videos was flickering due to frame-by-frame synthesis.

The use of tools like isnet_Pro has allowed for background control and reduction of flickering in videos.

The new stable diffusion video generation is flicker-free and has significantly faster processing times.

The tutorial begins with downloading the ebsynth software and exploring its official demonstration videos.

Installing FFMPEG is necessary for video processing, and instructions are provided for Windows users.

A background control plugin is installed to enhance video generation capabilities.

The stable diffusion plugin installation process is detailed, including the necessary settings adjustments.

The video production process involves converting a reference video into frames, extracting keyframes, and redrawing them.

The plugin's working principle is explained through a reference image that outlines the entire video production process.

A new folder is created for the project, and materials are uploaded for video production.

The first step in video production involves setting parameters and generating an image sequence with masks.

Keyframes are extracted from the image sequence with adjustable intervals for optimization.

The third step involves redrawing keyframes with various settings and parameters for enhanced image quality.

Color correction is an optional step in the process, which can be skipped based on user preference.

The fourth step is尺寸调整 (dimension adjustment), which is not needed if default settings are used.

The fifth step generates EBS files, which are then processed using the initially downloaded program.

The final step compiles the frames into a complete video in MP4 format, resulting in two video files, one with background music.

The tutorial concludes with an invitation for viewers to try the process themselves and engage in discussions.