Hytn

Haotian Chen Hytn

I'm currently a postdoc at Tsinghua University.

Stars

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,630 247 Updated Nov 17, 2024

Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 17,209 4,159 Updated Oct 28, 2024

MLSysOps / MLE-agent

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…

Python 1,093 49 Updated Nov 15, 2024

shashankvemuri / Finance

150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data

Python 2,181 58 Updated Aug 20, 2024

wilsonfreitas / awesome-quant

A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)

Python 18,193 2,634 Updated Nov 17, 2024

DAMO-NLP-SG / CoI-Agent

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 357 18 Updated Nov 11, 2024

LiqiangJing / DSBench

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

Jupyter Notebook 35 2 Updated Oct 20, 2024

suragnair / alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 3,892 1,036 Updated Jun 6, 2024

microsoft / RD-Agent

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 1,137 85 Updated Nov 15, 2024

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,144 120 Updated Nov 15, 2024

WecoAI / aideml

AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.

Python 585 64 Updated Nov 3, 2024

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 510 54 Updated Nov 1, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey: A Strategic Progress Report – Part I

1,299 34 Updated Oct 28, 2024

Open-Source-O1 / Open-O1

Python 807 25 Updated Oct 17, 2024

ulab-uiuc / research-town

A platform for developers to simulate research community

Python 88 10 Updated Nov 16, 2024

siegelz / core-bench

Jupyter Notebook 17 4 Updated Nov 16, 2024

facebookresearch / habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python 1,981 494 Updated Nov 15, 2024

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,503 472 Updated Jan 8, 2024

openai / weak-to-strong

Python 2,505 306 Updated May 19, 2024

ezelikman / quiet-star

Code for Quiet-STaR

Python 651 88 Updated Aug 21, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

5,175 286 Updated Nov 11, 2024

microsoft / WindowsAgentArena

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 475 47 Updated Nov 15, 2024

HCPLab-SYSU / Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

783 53 Updated Nov 1, 2024

leanprover / lean4

Lean 4 programming language and theorem prover

Lean 4,710 424 Updated Nov 17, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 34,395 4,242 Updated Nov 16, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,144 2,191 Updated Nov 16, 2024

Paitesanshi / LLM-Agent-Survey

2,589 153 Updated Nov 12, 2024

reworkd / AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 31,810 9,243 Updated Oct 7, 2024

quchangle1 / LLM-Tool-Survey

This is the repository for the Tool Learning survey.

249 10 Updated Nov 6, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,179 1,145 Updated Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly