BigJoon 🐍

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 7,085 836 Updated Nov 13, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 9,082 857 Updated Nov 13, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 7,488 699 Updated Nov 13, 2024

Official inference repo for FLUX.1 models

Python 15,888 1,153 Updated Nov 14, 2024

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 626 35 Updated Aug 6, 2024

A better way to make GUIs for your python apps

HTML 427 158 Updated Nov 7, 2020

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 5,941 596 Updated Sep 26, 2024

Realtime Voice Changer

Python 16,437 1,798 Updated Nov 14, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,664 922 Updated Apr 23, 2024

Instant voice cloning by MIT and MyShell.

Python 29,788 2,929 Updated Aug 21, 2024

Industry leading face manipulation platform

Python 19,631 2,987 Updated Nov 14, 2024

DeepFaceLab is the leading software for creating deepfakes.

Python 16,291 6 Updated Nov 13, 2024

Real-time face swap for PC streaming or video calls

Python 26,845 30 Updated Nov 8, 2024

one-click face swap

Python 28,562 6,972 Updated Aug 19, 2024

one-click deepfake (face swap)

Python 201 79 Updated Jun 22, 2023

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,408 1,399 Updated Nov 1, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 97,913 7,791 Updated Nov 15, 2024

Korean model evaluation using a self-built Korean evaluation dataset

Python 30 3 Updated May 31, 2024

Singing Voice Conversion via diffusion model

Jupyter Notebook 56 19 Updated Aug 24, 2023

🙌 OpenHands: Code Less, Make More

Python 36,519 4,149 Updated Nov 15, 2024

An experimental Rust native UI framework

Rust 3,695 115 Updated Nov 14, 2024

A data-first Rust-native UI design toolkit.

Rust 9,565 568 Updated Oct 25, 2024

The swiss army knife of lossless video/audio editing

TypeScript 28,030 1,356 Updated Nov 11, 2024

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 232 16 Updated May 19, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 71,247 8,455 Updated Nov 13, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,148 552 Updated Oct 19, 2024

Open-source simulator for autonomous driving research.

C++ 11,375 3,694 Updated Nov 15, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,638 746 Updated Jun 24, 2024

Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.

Jupyter Notebook 515 156 Updated May 25, 2024

This repository contains the official implementation of the research paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" (CVPR 2024)

Python 616 37 Updated Oct 14, 2024