Skip to content
View actuy's full-sized avatar

Highlights

  • Pro

Organizations

@INA-ZJU

Block or report actuy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LPIPS metric. pip install lpips

Python 3,616 499 Updated Jul 2, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,812 725 Updated Oct 1, 2024

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Python 5,496 909 Updated Oct 19, 2023

Pytorch implementation of the CREPE pitch tracker

Python 399 61 Updated Jun 17, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,539 569 Updated Jul 2, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,324 285 Updated Aug 15, 2024

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python 2,186 141 Updated Dec 12, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,407 3,835 Updated Sep 29, 2024

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Python 677 76 Updated Jan 6, 2024

Ongoing research training transformer models at scale

Python 10,142 2,280 Updated Oct 1, 2024

Example models using DeepSpeed

Python 6,016 1,020 Updated Sep 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,945 4,057 Updated Sep 30, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,079 293 Updated Jun 22, 2024

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Python 429 29 Updated Feb 4, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,434 953 Updated Aug 5, 2024

Code for Motion Representations for Articulated Animation paper

Jupyter Notebook 1,232 349 Updated Mar 1, 2024

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Jupyter Notebook 3,440 558 Updated Feb 10, 2024

本人的科研经验

5,589 335 Updated Sep 28, 2024

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

Python 25 2 Updated Jul 19, 2022

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,079 333 Updated Aug 4, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,669 1,235 Updated Aug 21, 2024

Faster Whisper transcription with CTranslate2

Python 11,620 965 Updated Aug 21, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,919 504 Updated Jul 27, 2024

An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.

Python 52 12 Updated Sep 14, 2022

Easily train a good VC model with voice data <= 10 mins!

Python 23,439 3,490 Updated Sep 5, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 69,664 7,623 Updated Sep 30, 2024

speech self-supervised representations

Python 460 36 Updated Apr 27, 2023

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Python 353 30 Updated Sep 11, 2023

基于 OpenAI API 的文本翻译、文本润色、语法纠错 Bob 插件,让我们一起迎接不需要巴别塔的新时代!Licensed under CC BY-NC-SA 4.0

TypeScript 5,525 257 Updated Aug 10, 2024

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 141 11 Updated Feb 11, 2023
Next