Stars
A generalist foundation model for healthcare capable of handling diverse medical data modalities.
Neural Code Intelligence Survey 2024; Reading lists and resources
A curated list of practical guide resources for Medical LLMs (Medical LLMs Tree, Tables, and Papers)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
The First Multimodal Search Engine Pipeline and Benchmark for LMMs
Code for EMNLP 2024 paper "DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering"
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering
Lightweight tool to identify Data Contamination in LLMs evaluation
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app with one click.
[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces inference latency by up to 10x for pre-filling on an A100 whil…
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LoFiT: Localized Fine-tuning on LLM Representations
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.
✨✨Latest Advances on Multimodal Large Language Models
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
A Comprehensive Benchmark for Code Information Retrieval.
This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"
A high-throughput and memory-efficient inference and serving engine for LLMs
llama3 implementation one matrix multiplication at a time
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
DSPy: The framework for programming—not prompting—language models