Skip to content
View Leeon-K's full-sized avatar
  • BUAA
  • Beijing China
  • 22:36 (UTC +08:00)
  • X @Lick

Highlights

  • Pro

Block or report Leeon-K

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
22 stars written in C++
Clear filter

LLM inference in C/C++

C++ 67,337 9,673 Updated Nov 5, 2024

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,237 5,583 Updated Nov 5, 2024

🔥 Linux下C++轻量级WebServer服务器

C++ 16,769 3,940 Updated Jul 5, 2024

Development repository for the Triton language and compiler

C++ 13,298 1,628 Updated Nov 5, 2024

A C++ header-only HTTP/HTTPS server and client library

C++ 13,066 2,298 Updated Nov 2, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,570 973 Updated Nov 5, 2024

机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶

C++ 8,009 2,778 Updated Jul 9, 2024

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)

C++ 6,962 1,609 Updated Sep 24, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 5,901 666 Updated Nov 4, 2024

Transformer related optimization, including BERT, GPT

C++ 5,863 891 Updated Mar 27, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,616 958 Updated Oct 29, 2024

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

C++ 4,763 542 Updated Oct 24, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,308 339 Updated Oct 24, 2024

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 2,514 282 Updated Oct 26, 2024

TinySTL is a subset of STL(cut some containers and algorithms) and also a superset of STL(add some other containers and algorithms)

C++ 2,321 634 Updated Oct 27, 2018

MLIR For Beginners tutorial

C++ 808 67 Updated Sep 30, 2024

🚧《C++并发编程实战》的读书笔记,供以后工作中查阅。

C++ 330 130 Updated Jun 10, 2018

Hands-On Practical MLIR Tutorial

C++ 329 46 Updated Oct 20, 2023

KnowledgeDistillation Layer (Caffe implementation)

C++ 89 40 Updated Jun 8, 2017

PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)

C++ 1 Updated Mar 26, 2024
C++ 1 Updated Jan 10, 2024