

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 591 41 Updated Oct 31, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

329 18 Updated Oct 19, 2024

The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"

Python 175 17 Updated Oct 30, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 826 70 Updated Nov 17, 2024

[NeurIPS'24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.

Python 66 2 Updated Jul 27, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Tr…

Jupyter Notebook 381 20 Updated Sep 24, 2024

Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning

Jupyter Notebook 1,859 306 Updated May 14, 2024

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

Python 3,710 338 Updated Aug 7, 2024

Code for NeurIPS 2023 paper "Restart Sampling for Improving Generative Processes"

Python 147 2 Updated Dec 6, 2023

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,213 68 Updated Jul 11, 2024

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Python 266 9 Updated Jul 14, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,460 1,623 Updated Nov 12, 2024

LogAI - An open-source library for log analytics and intelligence

Python 440 64 Updated Nov 14, 2024

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Python 1,360 180 Updated Sep 15, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,932 973 Updated Oct 11, 2024

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Python 132 4 Updated Mar 8, 2023

Repo for external large-scale work

Python 6,515 725 Updated Apr 27, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,307 167 Updated Aug 1, 2024

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Jupyter Notebook 113 8 Updated Jun 12, 2023

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,223 5,404 Updated Nov 17, 2024

Deep Learning Examples

Jupyter Notebook 811 108 Updated Oct 18, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,816 644 Updated Aug 5, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,399 10,169 Updated Jun 18, 2024

OmniXAI: A Library for eXplainable AI

Jupyter Notebook 876 94 Updated Jul 23, 2024
Python 16 3 Updated Apr 10, 2022

PyTorch code for MUST

Python 105 12 Updated Mar 8, 2023

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 927 88 Updated Sep 29, 2022

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 7,340 1,224 Updated Jul 23, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,192 2,550 Updated Nov 9, 2024