Heygen 101 - Learning the Basics

27 Apr 2023 · 37:05

TLDR: Heygen is a platform that uses AI to create engaging, professional videos, and Heygen 101 is an introductory session on how it works. Co-founder and CEO Josh traces the journey from the idea's inception three years ago to the current product. Users can select from hundreds of templates, customize avatars, and generate videos in over 50 languages. With a focus on ease of use, Heygen is designed to let businesses and individuals create high-quality videos without technical expertise. Pricing starts at $24 per month on an annual plan, and additional features include voice cloning and AI script enhancement. Josh also discusses upcoming features, including team collaboration tools and improvements to avatar quality and customization. The platform is aimed at those looking to scale their video production and strengthen their marketing strategies.


  • 🚀 **Heygen 101 Overview**: Heygen is a platform that uses AI to create content, specifically spokesperson videos, and is designed to be user-friendly for non-technical individuals.
  • 🤖 **AI Technology**: The technology behind Heygen started with generative adversarial networks (GANs) and has evolved to include features like face swapping and Disney-style image transformations.
  • 📈 **Market Trend**: There is a significant market trend towards video marketing, with 75% of companies looking to start or expand their video marketing efforts.
  • 💡 **Idea Origin**: The idea for Heygen began about three years ago, with the founders recognizing the potential of AI in content creation despite limited technology advancements at the time.
  • 💻 **User Interface**: Heygen's interface is straightforward, allowing users to select templates, customize avatars, and type in the text the avatar will speak.
  • 🌐 **Language Support**: Heygen supports over 50 languages, enabling the creation of videos for a global audience.
  • 🧥 **Customization**: Users can customize avatars with different outfits, and an upcoming feature will allow for the generation of outfits based on user logos or preferences.
  • ⏱️ **Video Generation Time**: Video generation takes approximately five minutes per video minute, with shorter videos rendering more quickly.
  • 📊 **Pricing Model**: Heygen offers a sliding scale pricing model, starting at $24 per month for 10 minutes of video rendering, which adjusts based on the user's needs and budget.
  • 🔍 **Script Enhancement**: Heygen integrates with AI scriptwriting tools like GPT to improve the engagement and quality of the video scripts.
  • ➕ **Future Features**: Upcoming features include team collaboration, voice cloning, and end-to-end video generation capabilities, enhancing the platform's utility for businesses.
  • 📱 **Mobile Integration**: There are plans to reduce the data recording requirement for avatar creation, potentially enabling mobile phone users to create avatars more easily and at a lower cost.

Q & A

  • What is the core idea behind Heygen and how did it originate?

    -Heygen is a platform that uses AI to create content, particularly videos. The idea originated roughly three years ago when the founders, Josh and Wayne, decided to leverage AI technology to create content. They saw potential in generative adversarial networks (GANs) to transform user images into various styles and believed that generating entire videos was technically feasible.

  • How easy is it for someone to create a video using Heygen?

    -Creating a video with Heygen is quite straightforward. Users can select from hundreds of templates designed for different use cases, each with a spokesperson avatar. The platform allows for easy text input, voice selection, and customization of video elements. It's designed to be user-friendly, even for those who aren't technologically savvy.

  • What are the language capabilities of Heygen's avatars?

    -Heygen supports over 50 different languages, allowing users to create videos with avatars that speak various languages, including Greek, to cater to diverse audiences.

  • Can users customize the appearance of the avatars on Heygen?

    -Yes, users can change the outfits of the avatars to some extent using presets available on the platform. Additionally, Heygen is introducing a feature that will allow users to generate custom outfits for the avatars, including the possibility of adding a company logo to a T-shirt.

  • How long does it take to generate a video on Heygen?

    -The video generation time on Heygen is approximately five minutes for every one minute of original video. For shorter videos, like 10 seconds, the rendering and generation process would take about 50 seconds.

  • What is the cost associated with using Heygen for video creation?

    -Heygen offers a pricing model where users are charged based on the length of the video they render. The starting plan, called Mini, costs $24 per month on an annual purchase or $30 per month on a month-to-month basis. This works out to between $2 and $3 for every minute rendered.

  • What future features is Heygen planning to introduce?

    -Heygen is planning to introduce several features, including the ability to generate talking photos from a single image, voice cloning for a more personalized avatar, team collaboration features, and improvements in avatar quality and capacity. They are also working on reducing the data recording time for creating an avatar and enabling end-to-end video generation.

  • How can businesses with limited marketing budgets benefit from Heygen?

    -Businesses with limited marketing budgets can start with Heygen's Mini plan, which allows for the creation of short videos at an affordable cost. This enables them to test the effectiveness of video marketing and adjust their strategy based on audience engagement and ROI without a significant financial commitment.

  • What is the process for creating a custom avatar on Heygen?

    -To create a custom avatar, users need to record themselves speaking for two minutes, ensuring they look at the camera and speak clearly. After providing consent for Heygen to use the footage, users upload the video or send it via email. Heygen then processes the footage and creates the custom avatar, which typically takes three to five business days.

  • Can Heygen's avatars be integrated with other tools like screen recorders?

    -Heygen doesn't currently have a built-in screen recording feature. Users can use third-party screen recording software and then upload the footage to Heygen's platform to include in their videos.

  • Who are the ideal candidates for using Heygen?

    -Ideal candidates for using Heygen include individuals or teams looking to scale video production, marketers seeking innovative ways to improve user engagement or acquisition, and businesses interested in leveraging video marketing to increase ROI.

  • How can users get started with Heygen and what support is available?

    -Users can get started with Heygen by accessing a free trial on their website, heygen.com. For additional support or questions, users can reach out through various social media platforms including Twitter, LinkedIn, Instagram, Reddit, and Facebook, where Heygen has an active presence and provides assistance through admins.



🚀 Introduction to Heygen and AI Video Creation

Josh, the co-founder and CEO of Heygen, discusses the inception of the company around three years ago with the aim of using AI to create content. Despite the limited state of the technology in 2020, the team believed in the future potential of AI. They used generative adversarial networks (GANs) to manipulate images and saw the possibility of generating entire videos. Heygen's first product focuses on creating spokesperson videos with AI, and Josh shares his background in engineering and how the timing was perfect for him and his co-founder Wayne to collaborate.


🎬 Heygen's Video Creation Process and Features

The video script explains the user-friendly process of creating videos with Heygen. Users can choose from hundreds of templates for various purposes, each featuring an AI-generated spokesperson avatar. The platform allows customization of video elements, text, and avatar speech. It also includes a feature for recording or uploading a user's voice. The script highlights the platform's support for over 50 languages and the ability to create avatars with different looks and outfits, including custom outfits with logos.


🌐 Language Support and Future Updates

Josh confirms that Heygen supports over 50 languages, allowing users to create videos in various languages, including Greek. The platform offers a multitude of avatars with different appearances and outfits, and an upcoming feature will enable users to generate custom outfits for their avatars. Josh also discusses the video generation process, which takes about five minutes per video minute, and mentions an AI script feature that improves script engagement. The company is also working on team collaboration features and enhancing avatar quality and interaction.
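The five-to-one ratio quoted in the session can be turned into a quick estimate. The sketch below is only illustrative: the function name is hypothetical, it is not part of any Heygen API, and it assumes render time scales linearly with video length, which the session describes only as an approximation.

```python
def estimate_render_seconds(video_seconds: float, ratio: float = 5.0) -> float:
    """Estimate render time, assuming about `ratio` seconds of
    processing per second of finished video (the figure quoted in
    the session; linear scaling is an assumption)."""
    if video_seconds < 0:
        raise ValueError("video length cannot be negative")
    return video_seconds * ratio


# A 10-second clip, the example given in the Q & A
print(estimate_render_seconds(10))        # 50.0 seconds

# A one-minute video, expressed in minutes
print(estimate_render_seconds(60) / 60)   # 5.0 minutes
```

Actual render times may differ; the summary notes that shorter videos tend to render more quickly.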


📈 Business Applications and Pricing Structure

The discussion focuses on how small businesses with limited marketing budgets can benefit from Heygen. The platform offers a scalable pricing structure, starting with a plan at $24 per month for annual subscribers or $30 per month for monthly subscribers. The cost is based on the rendering time of the video, and the platform encourages small businesses to start with a small plan and upgrade as needed. Josh also teases upcoming features like voice cloning and end-to-end video generation.
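The per-minute figure of roughly $2 to $3 follows from the plan prices. The sketch below reproduces that arithmetic under two assumptions: that the starting plan includes the 10 minutes of rendering per month mentioned in the takeaways, and that the same allowance applies to both billing options. The function and constant names are hypothetical.

```python
def cost_per_minute(monthly_price: float, included_minutes: float) -> float:
    """Effective price per rendered minute for a plan."""
    if included_minutes <= 0:
        raise ValueError("included minutes must be positive")
    return monthly_price / included_minutes


# Assumption: 10 rendered minutes per month on the starting plan
MINI_INCLUDED_MINUTES = 10

annual_billing = cost_per_minute(24, MINI_INCLUDED_MINUTES)
monthly_billing = cost_per_minute(30, MINI_INCLUDED_MINUTES)

print(f"Annual billing:  ${annual_billing:.2f} per minute")   # $2.40
print(f"Monthly billing: ${monthly_billing:.2f} per minute")  # $3.00
```

Current prices and allowances may have changed since the session, so the numbers above should be checked against the plan actually purchased.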


🤖 Custom Avatars and Video Marketing Benefits

Josh explains the process of creating a custom avatar that looks and sounds like the user, which involves recording a two-minute video speaking directly to the camera and then sending the footage to Heygen. He emphasizes the cost efficiency of using custom avatars compared to hiring actors or production crews. The video script also touches on the importance of video marketing and how Heygen can help businesses that want to scale their video production or innovate their marketing strategies.


📝 Answering Audience Questions

The video concludes with Josh and Cat addressing audience questions. Topics include the integration of Heygen with GPT for scriptwriting, enterprise plans for teams requiring more than five hours of video creation, and the possibility of integrating screen recordings with avatars. Josh also confirms that more avatars will be added to the platform soon. The session ends with an invitation for viewers to try Heygen with a free trial and to reach out with questions on various social media platforms.


🎵 Closing Remarks

The video script ends with closing music and a sign-off, thanking the audience for their participation and indicating that similar sessions will be held in the future.




💡AI

AI, or Artificial Intelligence, refers to the simulation of human intelligence in machines that are programmed to think like humans and mimic their actions. In the context of the video, AI is used to create content, particularly videos, by generating images and spokesperson avatars. It's central to the Heygen platform's ability to produce videos efficiently and cost-effectively.

💡Generative Adversarial Network (GAN)

A Generative Adversarial Network is a type of AI algorithm used in unsupervised machine learning. It consists of two parts: the generator, which creates new data, and the discriminator, which evaluates it. In the video, the first generation of GANs is mentioned as a technology that can transform user images into different styles, like 'babyface' or 'Disney style,' showcasing the potential of AI in video creation.
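The generator-versus-discriminator idea can be illustrated with the two loss functions used in a standard GAN setup. This is a textbook sketch, not a description of Heygen's models: the function names are hypothetical, and the inputs are simply the probabilities a discriminator might assign to an image being real.

```python
import math


def bce(prob: float, target: float) -> float:
    """Binary cross-entropy for a single predicted probability."""
    eps = 1e-12  # guards against log(0)
    return -(target * math.log(prob + eps)
             + (1 - target) * math.log(1 - prob + eps))


def discriminator_loss(d_real: float, d_fake: float) -> float:
    """Low when real images score near 1 and generated images near 0."""
    return bce(d_real, 1.0) + bce(d_fake, 0.0)


def generator_loss(d_fake: float) -> float:
    """Low when the discriminator scores generated images near 1."""
    return bce(d_fake, 1.0)


# A discriminator that separates real from fake well has a lower loss
print(discriminator_loss(0.9, 0.1) < discriminator_loss(0.5, 0.5))  # True

# A generator whose output fools the discriminator has a lower loss
print(generator_loss(0.9) < generator_loss(0.1))                    # True
```

During training the two networks are updated in turn, each trying to lower its own loss, which is what pushes the generated images toward looking realistic.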


💡Avatar

In the context of the video, an avatar refers to a digital representation or a virtual character that can be customized to resemble a real person. These avatars are used in Heygen's platform to act as spokespersons in videos, making it easier for users to create professional video content without the need for actual human presenters.

💡Video Generation

Video generation is the process of creating videos using software or AI systems. In the script, Heygen's platform is described as facilitating easy video generation where users can input text, choose an avatar, and generate a video. This process is significant as it democratizes video creation, making it accessible to those without extensive video production experience.


💡Chatbot

A chatbot is an AI program designed to simulate conversation with human users. In the video, the integration of a chatbot is discussed in the context of using the talking photo feature, which could potentially allow avatars to interact with users in a chat format, enhancing the engagement and utility of the platform.

💡Voice Clone

Voice cloning is a technology that allows the creation of a synthetic voice that sounds like a specific person. In the video, Heygen's new feature of voice cloning is mentioned, which will enable avatars to not only look like a user but also sound like them, adding a personal touch to the video content created.


💡Template

A template in the context of the video refers to a pre-designed video layout or structure that users can choose from to create their content. The Heygen platform offers various templates for different use cases, such as advertising, e-commerce, and learning development, which users can customize with their content.

💡Screen Recording

Screen recording is the process of capturing a digital recording of the computer screen's output. In the video, it is mentioned that users can upload screen recordings as an asset in Heygen's platform, which can then be integrated into the video creation process, adding a dynamic element to the final video product.

💡Enterprise Plan

An Enterprise Plan is a tier of service typically designed for larger organizations or businesses that require more extensive features, higher usage limits, or specialized services. In the video, it is stated that Heygen offers an Enterprise Plan for customers who need more than the standard five hours of video creation time.

💡Team Collaboration

Team collaboration refers to the process where multiple individuals work together on a project or task. The video discusses an upcoming feature for Heygen that will allow team collaboration, enabling different team members to work together on a single video project within the platform.

💡Custom Avatar

A custom avatar is a unique digital representation created specifically for an individual or brand. In the video, the process of creating a custom avatar is explained, where users record themselves speaking for two minutes, which is then used to generate an avatar that looks and sounds like the user, providing a personalized experience in video content.


Heygen is a platform that allows users to create videos using AI technology.

The idea for Heygen started around three years ago with the aim to use AI for content creation.

In 2020, the founders saw potential in generative adversarial networks for creating images and videos.

Heygen's first product enables the creation of spokesperson videos using AI.

The co-founders, Josh and Wayne, have known each other for over a decade and share a background in engineering and product development.

The platform is user-friendly, making it accessible to those who are not technologically savvy.

Heygen offers a variety of templates for different use cases, including advertising, e-commerce, and learning.

Each template comes with a spokesperson avatar, simplifying the video creation process.

Users can type in text for the avatar to speak, preview voices, and adjust pronunciation.

Heygen supports over 50 languages, allowing for a wide range of video creation possibilities.

The platform will soon allow users to generate outfits for avatars, including custom logos.

Video generation time is approximately five minutes per video minute, depending on the complexity.

Heygen is integrating with large language models to improve script writing and make videos more engaging.

The platform offers a sliding scale pricing model, starting at $24 per month for the Mini plan.

Upcoming features include the ability to create talking photos from a single image and voice cloning for personalized avatars.

Heygen is planning to introduce team collaboration features and improve avatar quality and interaction.

Custom avatar creation involves recording two minutes of footage speaking directly to the camera, which is then used to create a personalized avatar.

The platform is particularly useful for businesses looking to scale their video marketing efforts without high production costs.

Heygen offers a free trial for users to explore its capabilities and create videos efficiently.