Josh Purtell JoshuaPurtell

💭

Working

AI Agent Research

39 followers · 103 following

Pinned

Apropos Public

A framework for rapidly building compound AI systems

Python 3

craftaxlm Public

A wrapper around the Craftax agent benchmark, for evaluating digital agents over extremely long time horizons

Python 1

LRCBench Public

Evals of language models' ability to reason over long contexts.

Python 8

SmallBench Public

Small, simple agent task environments for training and evaluation

Python 16

icl-bench Public

Evaluating Language Models' Ability to Learn In Context

Python

jazyk Public

Simple LM API for production

Python