Stars
A concise but complete full-attention transformer with a set of promising experimental features from various papers
llama3 implementation one matrix multiplication at a time
Universal local privilege escalation Proof-of-Concept exploit for CVE-2024-1086, working on most Linux kernels between v5.14 and v6.6, including Debian, Ubuntu, and KernelCTF. The success rate is 9…
Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
What would you do with 1000 H100s...
CLIP inference in plain C/C++ with no extra dependencies
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
We write your reusable computer vision tools. 💜
RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.
🦜🔗 Build context-aware reasoning applications
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️♂️
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
A playbook for systematically maximizing the performance of deep learning models.
Memory mapped numpy arrays of varying shapes
freddy1020 / GPU-Puzzles
Forked from srush/GPU-PuzzlesSolve puzzles. Learn CUDA.
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model