ModelScope

ModelScope

Alibaba Cloud
Wan2.2

Wan2.2

Alibaba

About

This model is based on a multi-stage text-to-video generation diffusion model, which inputs a description text and returns a video that matches the text description. Only English input is supported. This model is based on a multi-stage text-to-video generation diffusion model, which inputs a description text and returns a video that matches the text description. Only English input is supported. The text-to-video generation diffusion model consists of three sub-networks: text feature extraction, text feature-to-video latent space diffusion model, and video latent space to video visual space. The overall model parameters are about 1.7 billion. Support English input. The diffusion model adopts the Unet3D structure, and realizes the function of video generation through the iterative denoising process from the pure Gaussian noise video.

About

ModelsLab is an innovative AI company that provides a comprehensive suite of APIs designed to transform text into various forms of media, including images, videos, audio, and 3D models. Their services enable developers and businesses to create high-quality visual and auditory content without the need to maintain complex GPU infrastructures. ModelsLab's offerings include text-to-image, text-to-video, text-to-speech, and image-to-image generation, all of which can be seamlessly integrated into diverse applications. Additionally, they offer tools for training custom AI models, such as fine-tuning Stable Diffusion models using LoRA methods. Committed to making AI accessible, ModelsLab supports users in building next-generation AI products efficiently and affordably.

About

Pony Diffusion is a versatile text-to-image diffusion model designed to generate high-quality, non-photorealistic images across various styles. It offers a user-friendly interface where users simply input descriptive text prompts and the model creates vivid visuals ranging from stylized pony-themed artwork to dynamic fantasy scenes. The fine-tuned model uses a dataset of approximately 80,000 pony-related images to optimize relevance and aesthetic consistency. It incorporates CLIP-based aesthetic ranking to evaluate image quality during training and supports a “scoring” system to guide output quality. The workflow is straightforward; craft a descriptive prompt, run the model, and save or share the generated image. The service clarifies that the model is trained to produce SFW content and is available under an OpenRAIL-M license, thereby allowing users to freely use, redistribute, and modify the outputs subject to certain guidelines.

About

Wan2.2 is a major upgrade to the Wan suite of open video foundation models, introducing a Mixture‑of‑Experts (MoE) architecture that splits the diffusion denoising process across high‑noise and low‑noise expert paths to dramatically increase model capacity without raising inference cost. It harnesses meticulously labeled aesthetic data, covering lighting, composition, contrast, and color tone, to enable precise, controllable cinematic‑style video generation. Trained on over 65 % more images and 83 % more videos than its predecessor, Wan2.2 delivers top performance in motion, semantic, and aesthetic generalization. The release includes a compact, high‑compression TI2V‑5B model built on an advanced VAE with a 16×16×4 compression ratio, capable of text‑to‑video and image‑to‑video synthesis at 720p/24 fps on consumer GPUs such as the RTX 4090. Prebuilt checkpoints for T2V‑A14B, I2V‑A14B, and TI2V‑5B stack enable seamless integration.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Users interested in an open source text-to-video AI video generation model

Audience

Developers, businesses, and content creators seeking to integrate advanced AI-driven media generation into their applications and services

Audience

Artists, illustrators, hobbyists and creatives in need of a tool to generate stylized, high-quality illustrations and creative visuals without deep technical or art-software expertise

Audience

Researchers and developers in computer vision and generative AI seeking a solution for high‑quality, efficient video synthesis

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

$7/month
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 1.0 / 5
ease 1.0 / 5
features 1.0 / 5
design 1.0 / 5
support 1.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Alibaba Cloud
China
modelscope.cn/

Company Information

ModelsLab
Founded: 2022
United States
modelslab.com

Company Information

Pony Diffusion
United States
ponydiffusion.com

Company Information

Alibaba
Founded: 1999
China
wan.video

Alternatives

Alternatives

Alternatives

Alternatives

LTX

LTX

Lightricks
YandexART

YandexART

Yandex
Imagen

Imagen

Google
ModelScope

ModelScope

Alibaba Cloud
AiBlocks

AiBlocks

BHAI
Kling 2.5

Kling 2.5

Kuaishou Technology

Categories

Categories

Categories

Categories

Integrations

01.AI
CodeQwen
ComfyUI
Fuser
Lucy Edit AI
Qwen
Qwen-7B
Qwen-Image
Qwen2
Qwen2-VL
Qwen2.5
Qwen2.5-1M
Qwen2.5-Coder
Qwen2.5-VL
Qwen3
SiliconFlow
VisionStory
WaveSpeedAI
Yi-Large
graphis

Integrations

01.AI
CodeQwen
ComfyUI
Fuser
Lucy Edit AI
Qwen
Qwen-7B
Qwen-Image
Qwen2
Qwen2-VL
Qwen2.5
Qwen2.5-1M
Qwen2.5-Coder
Qwen2.5-VL
Qwen3
SiliconFlow
VisionStory
WaveSpeedAI
Yi-Large
graphis

Integrations

01.AI
CodeQwen
ComfyUI
Fuser
Lucy Edit AI
Qwen
Qwen-7B
Qwen-Image
Qwen2
Qwen2-VL
Qwen2.5
Qwen2.5-1M
Qwen2.5-Coder
Qwen2.5-VL
Qwen3
SiliconFlow
VisionStory
WaveSpeedAI
Yi-Large
graphis

Integrations

01.AI
CodeQwen
ComfyUI
Fuser
Lucy Edit AI
Qwen
Qwen-7B
Qwen-Image
Qwen2
Qwen2-VL
Qwen2.5
Qwen2.5-1M
Qwen2.5-Coder
Qwen2.5-VL
Qwen3
SiliconFlow
VisionStory
WaveSpeedAI
Yi-Large
graphis
Claim ModelScope and update features and information
Claim ModelScope and update features and information
Claim ModelsLab and update features and information
Claim ModelsLab and update features and information
Claim Pony Diffusion and update features and information
Claim Pony Diffusion and update features and information
Claim Wan2.2 and update features and information
Claim Wan2.2 and update features and information