DuckDB-powered analytics for Postgres

Rust 158 10 Updated Oct 1, 2024

Postgres for Search and Analytics

Rust 5,924 171 Updated Oct 3, 2024
Python 3 Updated Oct 3, 2024

Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"

Python 23 5 Updated Sep 25, 2023

Easy to use, state-of-the-art Neural Machine Translation for 100+ languages

Python 1,156 113 Updated Dec 21, 2023

Things you can do with the token embeddings of an LLM

Python 1,214 35 Updated Oct 4, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,994 217 Updated Oct 1, 2024

Questions? Contact me at @DhruvAtreja1

TypeScript 108 24 Updated Sep 11, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,830 397 Updated Oct 2, 2024

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 964 55 Updated Sep 29, 2024

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…

Python 933 114 Updated Sep 19, 2024

Notebooks for training universal 0-shot classifiers on many different tasks

Jupyter Notebook 102 8 Updated Apr 3, 2024

A sentence embedding approach more effective than Sentence-BERT

Python 353 24 Updated Nov 9, 2022

Experiments with several semantic matching models and a comparison of the results.

Python 154 14 Updated Jun 12, 2023

🧑‍🚀 Summary of the world's best LLM resources.

1,501 195 Updated Sep 30, 2024

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,767 255 Updated Jun 12, 2023

Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey".

253 19 Updated Sep 25, 2024

Code implementation of synthetic continued pretraining

Python 34 5 Updated Oct 3, 2024

A webpage proxy that sends requests through Chromium (puppeteer). It can be used to bypass Cloudflare anti-bot / anti-DDoS protection from any application (like curl)

JavaScript 363 80 Updated Sep 20, 2024

Brand new TTS solution

Python 12,830 961 Updated Oct 3, 2024

🌦️ A catalogue and categorization of AI-based weather forecasting models.

Python 108 4 Updated Sep 27, 2024

Tools for Crawlers

Python 22 7 Updated Dec 25, 2023

A standalone version of the readability lib

JavaScript 8,804 599 Updated Sep 26, 2024

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 4,037 258 Updated Oct 3, 2024

A web crawler and scraper for Rust

Rust 1,007 92 Updated Oct 3, 2024

Analysis of bot protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

JavaScript 4,088 226 Updated Jul 17, 2024

Python scraper based on AI

Python 14,708 1,203 Updated Oct 3, 2024

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

Python 279 29 Updated Aug 13, 2024

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)

120 10 Updated Sep 18, 2024