danieljbk
🧠
Thinking hard, or hardly thinking..?
  • University of California, Berkeley
  • Berkeley, CA
  • 12:14 (UTC -07:00)
  • X @djbkwon

Starred repositories

Official Implementation of weights2weights

Jupyter Notebook 115 3 Updated Sep 27, 2024

Export iMessage data + run iMessage Diagnostics

Rust 2,963 120 Updated Oct 6, 2024

Instant voice cloning by MIT and MyShell.

Python 29,159 2,860 Updated Aug 21, 2024

LLM101n: Let's build a Storyteller

29,355 1,606 Updated Aug 1, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,314 1,280 Updated Sep 14, 2024
Python 5,097 851 Updated Oct 14, 2024

Open-Source Toolkit for End-to-End Korean Speech Recognition.

Python 4 2 Updated Dec 14, 2020

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

Python 199 13 Updated Mar 13, 2023
Jupyter Notebook 10 Updated Jan 13, 2022

A monitor of resources

C++ 20,357 627 Updated Sep 24, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,071 1,801 Updated Aug 19, 2024

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Python 507 49 Updated Apr 2, 2023

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,801 2,459 Updated Oct 15, 2024

Seq2Seq model for McCune Reischauer Romanization of Korean

Python 5 2 Updated Aug 12, 2018

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,406 256 Updated Oct 11, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,461 304 Updated Jan 4, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,719 5,798 Updated Aug 19, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,691 4,199 Updated Aug 19, 2024

Foundational model for human-like, expressive TTS

Python 3,800 654 Updated Jul 30, 2024

4M: Massively Multimodal Masked Modeling

Python 1,579 93 Updated Oct 7, 2024

RetinaFace: Deep Face Detection Library for Python

Python 1,184 153 Updated Aug 18, 2024

Chrome Dino Game AI (NEAT)

JavaScript 57 14 Updated Jun 4, 2024

😌 Automatically detects and crops faces from batches of pictures.

Python 634 119 Updated Jan 17, 2023

Animation engine for explanatory math videos

Python 67,565 6,048 Updated Oct 15, 2024

A community-maintained Python framework for creating mathematical animations.

Python 23,938 1,686 Updated Oct 14, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 69,518 8,189 Updated Sep 30, 2024

Set of tools to assess and improve LLM security.

Python 2,622 439 Updated Oct 14, 2024

Inference code for CodeLlama models

Python 15,950 1,852 Updated Aug 12, 2024