Stars
🌐 Jekyll is a blog-aware static site generator in Ruby
Sampling with gradient-based Markov Chain Monte Carlo approaches
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
AlexeyAB / ImageNetModel
Forked from YehLi/ImageNetModelOfficial ImageNet Model repository
Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
✨✨Latest Papers on Vision Mamba and Related Areas
Implement the paper "Self-Attention with Relative Position Representations"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
[CVPR 2024] Code release for TransNeXt model
torch-optimizer -- collection of optimizers for Pytorch
torch-optimizer -- collection of optimizers for Pytorch
Implementation of a Transformer, but completely in Triton
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Natural Language Processing Tutorial for Deep Learning Researchers
Simple XLNet implementation with Pytorch Wrapper
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
This repository contains materials from the author's deep learning course at UC Berkeley lectured by Prof. Sahai, including coursework, assignments, code, and notes, among other materials
Solutions for CS224W Winter 2021 Colab