SGLang is a fast serving framework for large language models and vision language models.

Python 6,104 510 Updated Nov 18, 2024

Python library implementing a trie data structure.

Python 39 8 Updated Mar 26, 2024

Python library implementing a trie data structure.

Python 816 132 Updated Apr 10, 2021

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 631 80 Updated Nov 15, 2024

A summary of existing representative LLM text datasets.

1,006 107 Updated Sep 4, 2024

High-quality datasets, tools, and concepts for LLM fine-tuning.

2,008 174 Updated Oct 25, 2024

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).

Python 275 28 Updated Jun 1, 2023

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers.

347 100 Updated Oct 16, 2023

A quick guide (especially) for trending instruction finetuning datasets

2,641 169 Updated Nov 28, 2023

Awesome Knowledge-Distillation. A categorized collection of knowledge distillation papers (2014-2021).

2,497 337 Updated May 30, 2023
Jupyter Notebook 154 6 Updated Oct 21, 2024

CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices (COLM 2024)

Python 5 Updated Oct 30, 2024

Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Python 51 3 Updated Nov 1, 2024

CoreNet: A library for training deep neural networks

Jupyter Notebook 6,982 541 Updated Oct 14, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

470 21 Updated Nov 12, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

1 Updated Jun 14, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 28,175 2,793 Updated Nov 18, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 960 79 Updated Jul 23, 2024

LLM training in simple, raw C/CUDA

Cuda 24,442 2,764 Updated Oct 2, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 39,153 4,140 Updated Jul 28, 2024

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 780 61 Updated Aug 27, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,942 154 Updated Mar 27, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,470 1,625 Updated Nov 18, 2024

An automated 8-bit quantization conversion tool for PyTorch (post-training quantization based on KL divergence)

Python 33 2 Updated Nov 17, 2019

Multi-Candidate Speculative Decoding

Python 28 5 Updated Apr 22, 2024
Python 1,271 172 Updated Nov 18, 2024

Cool Papers - Immersive Paper Discovery

JavaScript 401 5 Updated Nov 1, 2024

Chinese version of llm-numbers

108 5 Updated Dec 25, 2023

How to optimize some algorithms in CUDA.

Cuda 1,592 132 Updated Nov 12, 2024