Lists (1)
Sort Name ascending (A-Z)
Stars
Open source platform for the machine learning lifecycle
Apple Silicon Guide. Learn all about the A17 Pro, A16 Bionic, R1, M1-series, M2-series, and M3-series chips. Along with all the Devices, Operating Systems, Tools, Gaming, and Software that Apple Si…
Code for the manim-generated scenes used in 3blue1brown videos
CVPR 2024: AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Images to inference with no labeling (use foundation models to train supervised models).
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集
An open-source RAG-based tool for chatting with your documents.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Information Retrieval from Audio via Knowledge Graph
A modular graph-based Retrieval-Augmented Generation (RAG) system
A programming framework for agentic AI 🤖
Penpot: The open-source design tool for design and code collaboration
real time face swap and one-click video deepfake with only a single image
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
Unlock the hidden Camera Profiles in the Adobe Lightroom and Adobe Camera Raw