nyimbi
  • Datacraft
  • Nairobi

Stars

NLP

Translation Tools
29 repositories

Toolkit for training/converting LibreTranslate compatible language models 🚂

Python 48 11 Updated Oct 25, 2024

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.

Python 9,707 876 Updated Nov 16, 2024

Training open neural machine translation models

Makefile 334 41 Updated Aug 16, 2024

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,774 2,251 Updated Jun 27, 2024

Open-source offline translation library written in Python

Python 3,910 285 Updated Oct 2, 2024

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2,431 449 Updated Aug 9, 2024

Open Source Neural Machine Translation in Torch (deprecated)

Lua 2,386 466 Updated Feb 19, 2020

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Python 2,389 374 Updated Aug 26, 2021

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,196 464 Updated Aug 7, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,211 191 Updated Mar 1, 2024

Neural machine translation and sequence learning using TensorFlow

Python 1,458 392 Updated Oct 14, 2023

Multilingual word vectors in 78 languages

Jupyter Notebook 1,197 121 Updated Mar 10, 2023

Open-Source Neural Machine Translation in TensorFlow

Python 797 269 Updated Dec 9, 2022

An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group

Python 705 197 Updated Apr 26, 2022

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

566 7 Updated Jun 7, 2024

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing

HTML 554 93 Updated Nov 3, 2024

Open neural machine translation models and web services

Python 620 71 Updated Oct 7, 2024

A list of Neural MT implementations

359 69 Updated Jul 27, 2022

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Python 913 78 Updated Nov 14, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,364 426 Updated Nov 13, 2024

Text compression for generating keyboard expansions

Python 1,410 28 Updated Sep 30, 2023

Hausa-NMT: Empirical Study of Neural Machine Translation for English-Hausa-English

Jupyter Notebook 14 4 Updated Oct 20, 2020

AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/

Jupyter Notebook 46 40 Updated Jan 10, 2024

Facebook Low Resource (FLoRes) MT Benchmark

Python 704 123 Updated Nov 20, 2023

Fast inference engine for Transformer models

C++ 3,403 303 Updated Nov 5, 2024

Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads through yt-dlp integration

JavaScript 757 105 Updated Mar 16, 2023

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 350 47 Updated Nov 7, 2023

A collection of links and notes on forced alignment tools

Python 873 86 Updated Nov 10, 2021

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 3,305 273 Updated Nov 11, 2024