Welcome to the official repository for AIDE, an AI system that can automatically solve data science tasks at a human level, and with human input, it can perform even better. We believe giving developers and researchers direct access to AIDE locally, with local compute and choice to use their own LLM keys, is the most straightforward way to make it useful. That's why we'll open-source it, and the tentative timeline is it will arrive before the end of April. Currently, this repository serves as a gallery showcasing its solutions for 60+ Kaggle competitions we tested.
AIDE is an AI-powered data science assistant that can autonomously understand task requirements, design, and implement solutions. By leveraging large language models and innovative agent architectures, such as the Solution Space Tree Search algorithm, AIDE has achieved human-level performance on a wide range of data science tasks, outperforming over 50% of human data scientists on Kaggle competitions.
Domain | Task | Top% | Solution Link | Competition Link |
---|---|---|---|---|
Urban Planning | Forecast city bikeshare system usage | 5% | link | link |
Physics | Predicting Critical Heat Flux | 56% | link | link |
Genomics | Classify bacteria species from genomic data | 0% | link | link |
Agriculture | Predict blueberry yield | 58% | link | link |
Healthcare | Predict disease prognosis | 0% | link | link |
Economics | Predict monthly microbusiness density in a given area | 35% | link | link |
Cryptography | Decrypt shakespearean text | 91% | link | link |
Data Science Education | Predict passenger survival on Titanic | 78% | link | link |
Software Engineering | Predict defects in c programs given various attributes about the code | 0% | link | link |
Real Estate | Predict the final price of homes | 5% | link | link |
Real Estate | Predict house sale price | 36% | link | link |
Entertainment Analytics | Predict movie worldwide box office revenue | 62% | link | link |
Entertainment Analytics | Predict scoring probability in next 10 seconds of a rocket league match | 21% | link | link |
Environmental Science | Predict air pollution levels | 12% | link | link |
Environmental Science | Classify forest categories using cartographic variables | 55% | link | link |
Computer Vision | Predict the probability of machine failure | 32% | link | link |
Computer Vision | Identify handwritten digits | 14% | link | link |
Manufacturing | Predict missing values in dataset | 70% | link | link |
Manufacturing | Predict product failures | 48% | link | link |
Manufacturing | Cluster control data into different control states | 96% | link | link |
Natural Language Processing | Classify toxic online comments | 78% | link | link |
Natural Language Processing | Predict passenger transport to an alternate dimension | 59% | link | link |
Natural Language Processing | Classify sentence sentiment | 42% | link | link |
Natural Language Processing | Predict whether a tweet is about a real disaster | 48% | link | link |
Business Analytics | Predict total sales for each product and store in the next month | 87% | link | link |
Business Analytics | Predict book sales for 2021 | 66% | link | link |
Business Analytics | Predict insurance claim amount | 80% | link | link |
Business Analytics | Minimize penalty cost in scheduling families to santa's workshop | 100% | link | link |
Business Analytics | Predict yearly sales for learning modules | 26% | link | link |
Business Analytics | Binary classification of manufacturing machine state | 60% | link | link |
Business Analytics | Forecast retail store sales | 36% | link | link |
Business Analytics | Predict reservation cancellation | 54% | link | link |
Finance | Predict the probability of an insurance claim | 13% | link | link |
Finance | Predict loan loss | 0% | link | link |
Finance | Predict a continuous target | 42% | link | link |
Finance | Predict customer churn | 24% | link | link |
Finance | Predict median house value | 58% | link | link |
Finance | Predict closing price movements for nasdaq listed stocks | 99% | link | link |
Finance | Predict taxi fare | 100% | link | link |
Finance | Predict insurance claim probability | 62% | link | link |
Biotech | Predict cat in dat | 66% | link | link |
Biotech | Predict the biological response of molecules | 62% | link | link |
Biotech | Predict medical conditions | 92% | link | link |
Biotech | Predict wine quality | 61% | link | link |
Biotech | Predict binary target without overfitting | 98% | link | link |
Biotech | Predict concrete strength | 86% | link | link |
Biotech | Predict crab age | 46% | link | link |
Biotech | Predict enzyme characteristics | 10% | link | link |
Biotech | Classify activity state from sensor data | 51% | link | link |
Biotech | Predict horse health outcomes | 86% | link | link |
Biotech | Predict the mohs hardness of a mineral | 64% | link | link |
Biotech | Predict cirrhosis patient outcomes | 51% | link | link |
Biotech | Predict obesity risk | 62% | link | link |
Biotech | Classify presence of feature in data | 66% | link | link |
Biotech | Predict patient's smoking status | 40% | link | link |