rwightman

Ross Wightman rwightman

Computer Vision @huggingface. Always learning, constantly curious. Building ML/AI systems, watching loss curves.

6.5k followers · 39 following

@huggingface
Vancouver, BC, Canada
13:19 (UTC -07:00)
rwightman.com
@wightmanr

Sponsoring

Achievements

x2 x3 x4 x4

Achievements

x2 x3 x4 x4

Highlights

Stars

dnth / supercharge-your-pytorch-image-models-blogpost

Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations

Jupyter Notebook 19 Updated Oct 4, 2024

google-research / scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,291 428 Updated Oct 11, 2024

marimo-team / marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 6,709 237 Updated Oct 13, 2024

albumentations-team / albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,118 1,639 Updated Oct 10, 2024

haritheja-e / robot-utility-models

Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.

Python 158 5 Updated Oct 7, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,369 1,549 Updated Oct 7, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 632 23 Updated Oct 13, 2024

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Python 1,983 34 Updated Jun 6, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,724 113 Updated Sep 19, 2024

pytorch / torchtitan

A native PyTorch Library for large model training

Python 2,391 175 Updated Oct 10, 2024

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 6,801 608 Updated Oct 13, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 10,408 683 Updated Oct 11, 2024

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,710 426 Updated Oct 10, 2024

huggingface / chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python 151 10 Updated Apr 3, 2024

alenic / timm-models-explorer

Timm model explorer

Python 36 1 Updated Apr 12, 2024

pypdfium2-team / pypdfium2

Python bindings to PDFium

Python 363 16 Updated Sep 19, 2024

huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,986 139 Updated Oct 11, 2024

pytorch / tensordict

TensorDict is a pytorch dedicated tensor container.

Python 819 66 Updated Oct 11, 2024

riccardomusmeci / mlx-llm

Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.

Python 318 28 Updated Aug 24, 2024

JoaoFelipe / apted

Python APTED algorithm for the Tree Edit Distance

Python 85 13 Updated Nov 8, 2017

hsouri / Battle-of-the-Backbones

192 5 Updated Nov 2, 2023

crypdick / timm-lr-scheduler-explorer

A dashboard for exploring timm learning rate schedulers

Python 18 Updated Jul 6, 2023

Luodian / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,560 241 Updated Mar 5, 2024

michellelychan / posenet-pytorch

A PyTorch port of Google TensorFlow.js PoseNet (Real-time Human Pose Estimation)

Python 31 3 Updated Jun 10, 2023

google-research / pix2struct

Python 593 54 Updated Oct 7, 2024

mlfoundations / datacomp

DataComp: In search of the next generation of multimodal datasets

Python 648 54 Updated Jan 2, 2024

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,838 1,035 Updated Apr 8, 2024

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,692 281 Updated Aug 31, 2024

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,142 254 Updated Sep 6, 2023

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,686 5,789 Updated Aug 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ross Wightman rwightman

Sponsoring

Achievements

Achievements

Highlights

Block or report rwightman

Stars

dnth / supercharge-your-pytorch-image-models-blogpost

google-research / scenic

marimo-team / marimo

albumentations-team / albumentations

haritheja-e / robot-utility-models

jacobgil / pytorch-grad-cam

lucidrains / transfusion-pytorch

yuweihao / MambaOut

cambrian-mllm / cambrian

pytorch / torchtitan

huggingface / lerobot

VikParuchuri / surya

mindee / doctr

huggingface / chug

alenic / timm-models-explorer

pypdfium2-team / pypdfium2

huggingface / datatrove

pytorch / tensordict

riccardomusmeci / mlx-llm

JoaoFelipe / apted

hsouri / Battle-of-the-Backbones

crypdick / timm-lr-scheduler-explorer

Luodian / Otter

michellelychan / posenet-pytorch

google-research / pix2struct

mlfoundations / datacomp

Stability-AI / StableLM

mlfoundations / open_flamingo

lucidrains / musiclm-pytorch

karpathy / nanoGPT