Starred repositories
Create an open source toy dataset for finetuning LLMs with reasoning abilities
Wiktionary dump file parser and multilingual data extractor
A Python script that summarizes a YouTube video given its URL
An Autonomous LLM Agent that runs on WizardCoder-15B
Heuristic Imperatives Assessment Framework - Assessing Ethical Alignment in AI: A Framework for Measuring Adherence to Heuristic Imperatives
Gladdis (Generative Language Artificial Dedicated & Diligent Intelligence System) - it's an AI chatbot.
Autonomous Task Orchestration Manager (ATOM) Framework sample code set
A chat interface that uses the REMO memory system with LangFlow
The AI-native open-source embedding database
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
[NeurIPS'21] Projected GANs Converge Faster
Downloads and archives content from Reddit
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop words and vocabulary.
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Starter code for Stanford CS224n default final project on SQuAD 2.0
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Physically Based Shading and Deferred Rendering for the Panda3D game engine