Skip to content
View holdenk's full-sized avatar

Sponsors

@clstaudt

Organizations

@sparklingpandas @high-performance-spark @scalingpythonml @PigsCanFlyLabs

Block or report holdenk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Let's RAG it RAW without fancy frameworks

Jupyter Notebook 26 2 Updated Sep 15, 2024

A collection of learning resources for curious software engineers

Python 46,756 3,727 Updated Nov 11, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,898 465 Updated May 3, 2024

pyspark methods to enhance developer productivity 📣 👯 🎉

Python 643 99 Updated Oct 12, 2024

Apache Spark Connect Client for Golang

Go 161 32 Updated Nov 7, 2024

A Python Library to support running data quality rules while the spark job is running⚡

Python 163 39 Updated Nov 11, 2024

A tool to validate data, built around Apache Spark.

Scala 101 34 Updated Nov 17, 2024

8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier

C 14 11 Updated Apr 26, 2023

LLM finetuned for medical question answering

Python 490 58 Updated Sep 7, 2023

English SDK for Apache Spark

Python 839 128 Updated Jun 12, 2024

Python Stream Processing

Python 1,563 64 Updated Nov 15, 2024

A modular implementation of timely dataflow in Rust

Rust 3,296 272 Updated Nov 12, 2024

State of the Art Natural Language Processing

Scala 3,871 713 Updated Nov 17, 2024

Your self-hosted, globally interconnected microblogging community

Ruby 47,158 6,986 Updated Nov 18, 2024

A POC for multilingual UDFs in KSQL

Shell 3 Updated Mar 16, 2019

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 37,293 5,849 Updated Nov 19, 2024

Prototype implementation of Service-Level Fault Injection Testing in Python.

Python 67 2 Updated Nov 5, 2022

Replaces the factory firmware on the SwitchBot Plug Mini via OTA, enabling the use of Tasmota without disassembling the unit.

C 107 18 Updated Jul 21, 2024

A Label Printer Application

C 241 31 Updated Nov 16, 2024

lakeFS - Data version control for your data lake | Git for data

Go 4,461 356 Updated Nov 18, 2024
Scala 13 Updated Sep 20, 2023

Java imap nio client that is designed to scale well for thousands of connections per machine and reduce contention when using large number of threads and cpus.

Java 57 50 Updated Aug 23, 2023

Inofficial Qualcomm Firehose / Sahara / Streaming / Diag Tools :)

Python 1,656 385 Updated Oct 12, 2024

Reverse Engineering Furby Connect's Bluetooth Protocol and Update Format

JavaScript 477 82 Updated Jan 16, 2024

Open source version of Arrow Connect Platform developed by Arrow Electronics

Java 6 1 Updated Jan 12, 2023

A PowerDNS pipe dynamic backend to serve dnswall style A, AAAA and PTR DNS records for any given CIDR ranges.

Python 22 10 Updated Aug 5, 2024

Main repository for the Howlr application

JavaScript 47 15 Updated Feb 26, 2022
Kotlin 4 1 Updated Oct 29, 2020

:octocat: GitHub Action to build and deploy a Jekyll site to GitHub Pages 🧪

Shell 24 9 Updated Apr 30, 2022
Next