Wan2.1

CHECKPOINT
原创


更新

Use Wan2.1 to generate videos for free. Online training is now supported!

In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Wan2.1 offers these key features:

  • 👍 SOTA Performance: Wan2.1 consistently outperforms existing open-source models and state-of-the-art commercial solutions across multiple benchmarks.

  • 👍 Supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes (without optimization techniques like quantization). Its performance is even comparable to some closed-source models.

  • 👍 Multiple Tasks: Wan2.1 excels in Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, advancing the field of video generation.

  • 👍 Visual Text Generation: Wan2.1 is the first video model capable of generating both Chinese and English text, featuring robust text generation that enhances its practical applications.

  • 👍 Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080P videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.

This repository features our T2V-14B model, which establishes a new SOTA performance benchmark among both open-source and closed-source models. It demonstrates exceptional capabilities in generating high-quality visuals with significant motion dynamics. It is also the only video model capable of producing both Chinese and English text and supports video generation at both 480P and 720P resolutions.

版本详情

WAN_2_1_14B
have fun!

项目权限

    使用权限

  • 在吐司在线使用

  • 在 吐司 作为在线训练的底模

  • 使用时无需注明出处

  • 用于模型融合

  • 分享融合模型时使用不同的许可

    商用许可

  • 生成的内容用于商业用途

  • 作为生成服务来商用

  • 转售模型或出售融合模型

相关帖子