Stars
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"
Codebase for Aria - an Open Multimodal Native MoE
[NeurIPS '24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Tr…
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine. (See the usage sketch after this list.)
Code for NeurIPS 2023 paper "Restart Sampling for Improving Generative Processes"
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. (See the LoRA sketch after this list.)
LogAI - An open-source library for log analytics and intelligence
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
LAVIS - A One-stop Library for Language-Vision Intelligence
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
EVA Series: Visual Representation Fantasies from BAAI
BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. (See the text-to-image sketch after this list.)
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
A latent text-to-image diffusion model
OmniXAI: A Library for eXplainable AI
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
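
The URL-to-dataset entry above is img2dataset, which also exposes a Python entry point. A minimal sketch, assuming the `download` function and the parameter names shown in its README; the input file name is hypothetical and exact arguments may differ by version:

```python
# Minimal sketch: turn a text file of image URLs into packaged shards
# with img2dataset. Parameter names follow its README; treat them as
# assumptions and check the installed version's docs.
from img2dataset import download

download(
    url_list="myimglist.txt",       # one image URL per line (hypothetical file)
    output_folder="image_dataset",  # where shards are written
    output_format="webdataset",     # package images as .tar shards
    image_size=256,                 # resize images on the fly
    processes_count=8,              # parallel download processes
    thread_count=32,                # download threads per process
)
```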
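For the 🤗 PEFT entry, a minimal LoRA sketch: wrap a base model with a low-rank adapter so only the adapter weights train. The `gpt2` checkpoint and the `c_attn` target module are illustrative choices, not anything prescribed by the list above:

```python
# Minimal sketch: attach a LoRA adapter to a causal LM via 🤗 PEFT.
# Model (gpt2) and target module (c_attn) are illustrative assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")
config = LoraConfig(
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor applied to the update
    target_modules=["c_attn"],  # GPT-2's fused attention projection
    lora_dropout=0.05,
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only adapter weights are trainable
```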
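And for 🤗 Diffusers, a minimal text-to-image sketch; the `runwayml/stable-diffusion-v1-5` checkpoint and the prompt are placeholder choices:

```python
# Minimal sketch: generate one image from a text prompt with a
# 🤗 Diffusers pipeline. Checkpoint and prompt are placeholders.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")  # move to GPU; use "cpu" (with float32) if none

image = pipe("a photograph of an astronaut riding a horse").images[0]
image.save("astronaut.png")
```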