Skip to content
View fiskrt's full-sized avatar

Block or report fiskrt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,675 397 Updated Oct 17, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,509 1,083 Updated May 23, 2024

LLM training in simple, raw C/CUDA

Cuda 24,081 2,697 Updated Oct 2, 2024

Universal local privilege escalation Proof-of-Concept exploit for CVE-2024-1086, working on most Linux kernels between v5.14 and v6.6, including Debian, Ubuntu, and KernelCTF. The success rate is 9…

C 2,267 297 Updated Apr 17, 2024

Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.

1,931 91 Updated Sep 20, 2024

structured outputs for llms

Python 7,798 622 Updated Oct 17, 2024

llama-cpp-python-exploit

Python 15 Updated Oct 14, 2023

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,732 1,114 Updated Sep 24, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 55,082 5,678 Updated Aug 24, 2024

Run inference on MPT-30B using CPU

Python 573 94 Updated Jun 30, 2023

[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

Python 314 4 Updated Nov 23, 2023

What would you do with 1000 H100s...

Jupyter Notebook 890 52 Updated Jan 10, 2024

CLIP inference in plain C/C++ with no extra dependencies

C++ 446 30 Updated Aug 18, 2024

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Python 1,461 124 Updated Sep 11, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 5,971 516 Updated Sep 6, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 49,424 4,794 Updated Sep 19, 2024

We write your reusable computer vision tools. 💜

Python 23,704 1,772 Updated Oct 18, 2024

WALDSAC (RRANSAC with SPRT)

MATLAB 5 Updated Mar 26, 2023

RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.

C++ 1,644 412 Updated Oct 17, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 93,624 15,089 Updated Oct 18, 2024

Repo for external large-scale work

Python 6,502 726 Updated Apr 27, 2024

Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️

JavaScript 4,330 389 Updated May 10, 2024

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 649 55 Updated May 7, 2024

A playbook for systematically maximizing the performance of deep learning models.

26,877 2,233 Updated Jun 18, 2024

Memory mapped numpy arrays of varying shapes

Python 280 11 Updated Jun 19, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 1 Updated Jul 20, 2022

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,746 191 Updated Mar 15, 2024