An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,630 247 Updated Nov 17, 2024

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 17,209 4,159 Updated Oct 28, 2024

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…

Python 1,093 49 Updated Nov 15, 2024

150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data

Python 2,181 58 Updated Aug 20, 2024

A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)

Python 18,193 2,634 Updated Nov 17, 2024

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 357 18 Updated Nov 11, 2024

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

Jupyter Notebook 35 2 Updated Oct 20, 2024

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 3,892 1,036 Updated Jun 6, 2024

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 1,137 85 Updated Nov 15, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,144 120 Updated Nov 15, 2024

AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.

Python 585 64 Updated Nov 3, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 510 54 Updated Nov 1, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,299 34 Updated Oct 28, 2024
Python 807 25 Updated Oct 17, 2024

A platform for developers to simulate research community

Python 88 10 Updated Nov 16, 2024
Jupyter Notebook 17 4 Updated Nov 16, 2024

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python 1,981 494 Updated Nov 15, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,503 472 Updated Jan 8, 2024
Python 2,505 306 Updated May 19, 2024

Code for Quiet-STaR

Python 651 88 Updated Aug 21, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

5,175 286 Updated Nov 11, 2024

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 475 47 Updated Nov 15, 2024

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

783 53 Updated Nov 1, 2024

Lean 4 programming language and theorem prover

Lean 4,710 424 Updated Nov 17, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 34,395 4,242 Updated Nov 16, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,144 2,191 Updated Nov 16, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 31,810 9,243 Updated Oct 7, 2024

This is the repository for the Tool Learning survey.

249 10 Updated Nov 6, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,179 1,145 Updated Nov 8, 2024