- telephony
- Nanking@China
- http://www.cnblogs.com/lightsong/
- @lightsongtree
Starred repositories
A Node-Based Frontend for CrewAI: Revolutionizing AI Workflow Creation
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Offline Text To Speech synthesis for python
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
An extremely fast Python package and project manager, written in Rust.
A multimodal RAG-based generative AI digital assistant that combines text generation, vision QA, and code generation.
Headless chrome/chromium automation library (unofficial port of puppeteer)
A natural language interface for computers
A state-of-the-art open visual language model (multimodal pre-trained model)
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Build real-time multimodal AI applications 🤖🎙️📹
Knowledge work automation with AI agents
the first library to let you embed a developer agent in your own app!
An Open-Ended Embodied Agent with Large Language Models
A tutorial based on MetaGPT to help you quickly understand the concepts of agents and multi-agent systems and get started with development.
🐋 A Dockerfile for nginx-rtmp-module + FFmpeg from source with basic settings for streaming HLS. Built on Alpine Linux.
Docker image with Nginx using the nginx-rtmp-module module for live multimedia (video) streaming.
Start building LLM-empowered multi-agent applications in an easier way.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.