Stars
A unified framework for 3D content generation.
Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
[ArXiv 2024] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation"
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
A comprehensive collection of IQA papers
[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
[ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Collection of recent shadow removal works, including papers, codes, datasets, and metrics.
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
An innovative method designed to augment the capabilities of existing video diffusion models
This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"
A collection of resources on controllable generation with text-to-image diffusion models.
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
[ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Training code for the videocrafter.
data pipeline code of large video generation model
Official code of SmartEdit [CVPR-2024 Highlight]
[SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
Official Code for MotionCtrl [SIGGRAPH 2024]