Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrates with arXiv and Papers with Code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…
150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
O1 Replication Journey: A Strategic Progress Report – Part I
A platform for developers to simulate a research community
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
Lean 4 programming language and theorem prover
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
This is the repository for the Tool Learning survey.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬