For Personal Use
- Istanbul
Starred repositories
Draw images in your ANSI terminal with true color
Fast and accurate automatic speech recognition (ASR) for edge devices
Automate browser-based workflows with LLMs and Computer Vision
Keyword spotting and forced alignment in any language
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
File explorer made with C89, with web technologies as the frontend
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
On-device streaming text-to-speech engine powered by deep learning
Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Extract metadata, transcripts, and comments from YouTube videos with optional audio transcription using OpenAI Whisper API
Attention-based Adaptive filter designing for keyword classification
Official repository of TACos: Learning Temporally Structured Embeddings for Few-Shot Keyword Spotting with Dynamic Time Warping
Image keyword spotting (KWS) system for handwritten documents that utilizes quaternion-analysis based methods to create a parameter-efficient network. It works as a tool to retrieve scanned pages …
Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining".
Official code for Metric learning for user-defined keyword spotting
In this repository, I implement a system for detecting specific spoken words in a speech signal. When reading a speech signal, I detect not only the presence, but also the time position of the keywor…
Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.