Highlights
LLMs
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance and fast inference.
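A minimal sketch of the idea behind RWKV's time mixing, heavily simplified (the real model adds a per-token bonus term, token shifting, channel mixing, and numerical-stability tricks): each output is an exponentially decayed weighted average over past values, so inference runs step by step like an RNN while the same sums can be evaluated for all positions at once during training.

```python
import numpy as np

def wkv(k, v, w):
    """Simplified RWKV-style time mixing for one channel.
    k, v: (T,) keys and values; w > 0 is a learned decay rate.
    num/den carry exponentially decayed running sums, so each step
    costs O(1) at inference time (RNN-style)."""
    num, den = 0.0, 0.0
    out = np.empty(len(v), dtype=float)
    for t in range(len(v)):
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
        out[t] = num / den
    return out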
ChatRWKV is like ChatGPT, but powered by the RWKV (100% RNN) language model and open source.
Official inference library for Mistral models
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
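The core trick, roughly: keep the first few tokens (the "attention sinks") plus a sliding window of recent tokens in the KV cache, and evict everything in between, so generation length is no longer bounded by cache size. A sketch of that eviction policy (function name and defaults here are illustrative, not the repo's API):

```python
def kept_positions(seq_len, n_sink=4, window=1020):
    """Sketch of a StreamingLLM-style cache policy: keep the first
    n_sink "attention sink" tokens plus the most recent `window`
    tokens; evict the middle once the cache budget is exceeded."""
    if seq_len <= n_sink + window:
        return list(range(seq_len))
    return list(range(n_sink)) + list(range(seq_len - window, seq_len))

# e.g. kept_positions(5000) keeps positions 0-3 and 3980-4999
```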
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
An Autonomous LLM Agent for Complex Task Solving
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
An LLM-powered advanced RAG pipeline built from scratch
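A retrieve-then-generate loop is the heart of any RAG pipeline. A minimal sketch, where embed() and llm() are hypothetical stand-ins for an embedding model and a chat model (swap in whichever providers you use):

```python
import numpy as np

def top_k(query_vec, doc_vecs, docs, k=3):
    """Rank documents by cosine similarity to the query embedding."""
    sims = doc_vecs @ query_vec / (
        np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec) + 1e-9)
    return [docs[i] for i in np.argsort(-sims)[:k]]

def rag_answer(question, docs, doc_vecs, embed, llm):
    """Retrieve the most relevant chunks, then ground the answer on them.
    embed() and llm() are hypothetical callables, not a specific API."""
    context = "\n\n".join(top_k(embed(question), doc_vecs, docs))
    prompt = f"Answer using only this context:\n{context}\n\nQ: {question}"
    return llm(prompt)
```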
Run Mixtral-8x7B models in Colab or on consumer desktops
Large World Model: modeling text and video with millions of tokens of context
Hackable and optimized Transformer building blocks with composable construction.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
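The algorithm itself fits in a few lines: repeatedly find the most frequent adjacent pair of token ids and replace it with a new id. A rough sketch of the training loop (not the repo's code; ids 256+ leave room for the 256 raw byte values):

```python
from collections import Counter

def bpe_train(ids, num_merges):
    """Toy BPE trainer: each round merges the most frequent adjacent
    pair of token ids into a freshly minted id."""
    merges = {}
    for new_id in range(256, 256 + num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)
        merges[pair] = new_id
        out, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                out.append(new_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
    return ids, merges

ids, merges = bpe_train(list("aaabdaaabac".encode()), num_merges=3)
```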
The official PyTorch implementation of Google's Gemma models
A lightweight, standalone C++ inference engine for Google's Gemma models.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0 licensed.
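For reference, the LoRA idea in miniature: freeze the pretrained weight and learn a low-rank additive update, so fine-tuning trains only r * (d_in + d_out) parameters per layer instead of d_in * d_out. A minimal sketch, not lit-llama's implementation:

```python
import numpy as np

class LoRALinear:
    """LoRA sketch: the frozen weight W is augmented with a trainable
    low-rank update scale * (B @ A). B starts at zero, so the layer
    initially behaves exactly like the frozen original."""
    def __init__(self, W, r=8, alpha=16):
        d_out, d_in = W.shape
        self.W = W                                # frozen pretrained weight
        self.A = np.random.randn(r, d_in) * 0.01  # trainable down-projection
        self.B = np.zeros((d_out, r))             # trainable up-projection
        self.scale = alpha / r

    def __call__(self, x):
        return self.W @ x + self.scale * (self.B @ (self.A @ x))
```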
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A.
A minimal implementation of a GPT-like transformer using only NumPy (<650 lines).
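Remarkably little NumPy is needed for the core operation. A sketch of single-head causal self-attention (simplified relative to a full GPT block, which adds multiple heads, projections, and residual connections):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def causal_self_attention(q, k, v):
    """Single-head causal attention: position t attends only to
    positions <= t. q, k, v have shape (T, d)."""
    T, d = q.shape
    scores = q @ k.T / np.sqrt(d)
    scores[np.triu(np.ones((T, T), dtype=bool), k=1)] = -1e10  # mask future
    return softmax(scores) @ v
```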
BAML is a language that helps you get structured data from LLMs with the best DX possible. Works with all programming languages. Check out the promptfiddle.com playground.