-
Peking University
- Beijing
- https://blog.idejie.com
Highlights
- Pro
Lists (9)
Sort Name ascending (A-Z)
Starred repositories
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The paper collections for the autoregressive models in vision.
[CVPR 2024] A world model for autonomous driving.
A suite of image and video neural tokenizers
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
SEED-Story: Multimodal Long Story Generation with Large Language Model
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception