Highlights
Stars
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
an implementation of Self-Extend, to expand the context window via grouped attention
AirLLM 70B inference with single 4GB GPU
Finetune Llama 3.2, Mistral, Phi, Qwen & Gemma LLMs 2-5x faster with 80% less memory
Routing and navigation for your React Native apps
📚 Freely available programming books
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Access large language models from the command-line
Official repository for LongChat and LongEval
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
A high-throughput and memory-efficient inference and serving engine for LLMs
AlpinDale / gptq-gptj
Forked from IST-DASLab/gptqCode for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
Checks Alexa's top 1M websites for the presence of OpenAI's new .well-known/ai-plugin.json files
👻 Experimental library for scraping websites using OpenAI's GPT API.
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
antimatter15 / alpaca.cpp
Forked from ggerganov/llama.cppLocally run an Instruction-Tuned Chat-Style LLM
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html