- University of California, Berkeley
- Berkeley, CA
- jbkwon.com
- @djbkwon
Starred repositories
Official Implementation of weights2weights
Export iMessage data + run iMessage Diagnostics
Instant voice cloning by MIT and MyShell.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
SoftwareImpacts / SIMPAC-2020-63
Forked from sooftware/kospeech
Open-Source Toolkit for End-to-End Korean Speech Recognition.
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
A multi-voice TTS system trained with an emphasis on quality
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Seq2Seq model for McCune-Reischauer Romanization of Korean
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🔊 Text-Prompted Generative Audio Model
Foundational model for human-like, expressive TTS
RetinaFace: Deep Face Detection Library for Python
😌 Automatically detects and crops faces from batches of pictures.
A community-maintained Python framework for creating mathematical animations.
Robust Speech Recognition via Large-Scale Weak Supervision
Set of tools to assess and improve LLM security.