
A golang-based data loader which can be used from Python. Focused on a VectorDB stack at the moment, fetching and processing data per sample at GB/s speeds.

Go 54 Updated Oct 2, 2024

A language model programming library.

Python 4,248 240 Updated Oct 3, 2024

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Python 276 18 Updated Sep 13, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 12,639 1,469 Updated Oct 3, 2024

Optimizing inference proxy for LLMs

Python 940 91 Updated Oct 2, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,445 138 Updated Sep 24, 2024

aider is AI pair programming in your terminal

Python 19,870 1,815 Updated Oct 3, 2024

Efficient Triton Kernels for LLM Training

Python 3,116 159 Updated Oct 3, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,212 114 Updated Oct 1, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 564 23 Updated Sep 21, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 211 6 Updated Sep 11, 2024

BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground

Rust 1,080 34 Updated Oct 3, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 823 64 Updated Sep 24, 2024
Python 26 1 Updated Sep 23, 2024

Efficient and general syntactical decoding for Large Language Models

Python 181 10 Updated Sep 28, 2024

BullMQ - Message Queue and Batch processing for NodeJS and Python based on Redis

TypeScript 6,004 387 Updated Oct 3, 2024

Build AI Assistants with memory, knowledge and tools.

Python 11,238 1,669 Updated Oct 3, 2024

TypeScript notebook for developers

TypeScript 176 4 Updated Oct 1, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,240 205 Updated Oct 3, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,334 391 Updated Sep 28, 2024

GPTQ based LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 95 19 Updated Sep 30, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 686 34 Updated Sep 19, 2024

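A Python-based web automation tool. It can both control the browser and send and receive data packets, combining the convenience of browser automation with the efficiency of requests. It is powerful, with many user-friendly designs and convenience features built in. The syntax is concise and elegant, and little code is needed.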

Python 7,893 754 Updated Sep 18, 2024

Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling

Python 8,434 2,788 Updated Oct 3, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 11,884 1,075 Updated Oct 3, 2024

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

TypeScript 2,773 354 Updated Oct 3, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 515 40 Updated Oct 3, 2024

The lightweight, user-friendly, distributed relational database built on SQLite.

Go 15,619 710 Updated Oct 1, 2024

🛏 An HTML to Markdown converter written in JavaScript

HTML 8,734 875 Updated Jul 30, 2024