Introduction
CogKit is an open-source project that gives researchers and developers a user-friendly interface to ZhipuAI's CogView (image generation) and CogVideoX (video generation) models. It streamlines multimodal tasks such as text-to-image (T2I), text-to-video (T2V), and image-to-video (I2V). Users must comply with applicable legal and ethical guidelines to ensure responsible use.
Supported Models
Please refer to the Model Card for more details.
Environment Testing
This repository has been tested on 8×A100 GPUs with CUDA 12.4 and Python 3.10.16.
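To compare a local setup against the tested configuration, a small check along the following lines may help. This is a minimal sketch, not part of CogKit: the names `TESTED_PYTHON`, `TESTED_CUDA`, `python_matches`, and `describe_environment` are hypothetical, and the CUDA and GPU details are only filled in on the assumption that PyTorch is installed.

```python
import sys

# Versions reported as tested in this README (major.minor for Python).
TESTED_PYTHON = (3, 10)
TESTED_CUDA = "12.4"


def python_matches(version_info=sys.version_info, tested=TESTED_PYTHON):
    """Return True when the major.minor Python version equals the tested one."""
    return tuple(version_info[:2]) == tuple(tested)


def describe_environment():
    """Collect local Python, CUDA, and GPU details (PyTorch is optional)."""
    info = {
        "python": ".".join(str(part) for part in sys.version_info[:3]),
        "python_matches_tested": python_matches(),
        "cuda": None,
        "cuda_matches_tested": False,
        "gpu_count": 0,
    }
    try:
        import torch  # assumption: PyTorch is installed in this environment
    except ImportError:
        return info  # without PyTorch, CUDA and GPU details stay unknown
    info["cuda"] = torch.version.cuda  # None on CPU-only builds
    info["cuda_matches_tested"] = info["cuda"] == TESTED_CUDA
    info["gpu_count"] = torch.cuda.device_count()
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

A mismatch does not necessarily mean the code will fail; it only indicates that the setup differs from the one the repository was tested on.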