Skip to main content

Introduction

CogKit is an open-source project that provides a user-friendly interface for researchers and developers to utilize ZhipuAI's CogView (image generation) and CogVideoX (video generation) models. It streamlines multimodal tasks such as text-to-image(T2I), text-to-video(T2V), and image-to-video(I2V). Users must comply with legal and ethical guidelines to ensure responsible implementation.

Supported Models

Please refer to the Model Card for more details.

Environment Testing

This repository has been tested in environments with 8×A100 GPUs, using CUDA 12.4, Python 3.10.16.