actuy

Chen Zhang actuy

61 followers · 17 following

https://actuy.github.io/

Achievements

Highlights

Organizations

Stars

richzhang / PerceptualSimilarity

LPIPS metric. pip install lpips

Python 3,616 499 Updated Jul 2, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,812 725 Updated Oct 1, 2024

rtqichen / torchdiffeq

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Python 5,496 909 Updated Oct 19, 2023

maxrmorrison / torchcrepe

Pytorch implementation of the CREPE pitch tracker

Python 399 61 Updated Jun 17, 2024

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,539 569 Updated Jul 2, 2024

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,324 285 Updated Aug 15, 2024

IDEA-Research / DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python 2,186 141 Updated Dec 12, 2023

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,407 3,835 Updated Sep 29, 2024

Weizhi-Zhong / IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Python 677 76 Updated Jan 6, 2024

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 10,142 2,280 Updated Oct 1, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 6,016 1,020 Updated Sep 17, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,945 4,057 Updated Sep 30, 2024

baichuan-inc / Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Python 4,079 293 Updated Jun 22, 2024

FlagAI-Open / Aquila2

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Python 429 29 Updated Feb 4, 2024

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,434 953 Updated Aug 5, 2024

snap-research / articulated-animation

Code for Motion Representations for Articulated Animation paper

Jupyter Notebook 1,232 349 Updated Mar 1, 2024

yoyo-nb / Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Jupyter Notebook 3,440 558 Updated Feb 10, 2024

pengsida / learning_research

本人的科研经验

5,589 335 Updated Sep 28, 2024

k-m-irfan / simplified_mediapipe_face_landmarks

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

Python 25 2 Updated Jul 19, 2022

GuyTevet / motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,079 333 Updated Aug 4, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,669 1,235 Updated Aug 21, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,620 965 Updated Aug 21, 2024

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,919 504 Updated Jul 27, 2024

CODEJIN / Glow_TTS

An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.

Python 52 12 Updated Sep 14, 2022

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 23,439 3,490 Updated Sep 5, 2024

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 69,664 7,623 Updated Sep 30, 2024

auspicious3000 / contentvec

speech self-supervised representations

Python 460 36 Updated Apr 27, 2023

facebookresearch / muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Python 353 30 Updated Sep 11, 2023

openai-translator / bob-plugin-openai-translator

基于 OpenAI API 的文本翻译、文本润色、语法纠错 Bob 插件，让我们一起迎接不需要巴别塔的新时代！Licensed under CC BY-NC-SA 4.0

TypeScript 5,525 257 Updated Aug 10, 2024

revsic / torch-nansypp

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 141 11 Updated Feb 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chen Zhang actuy

Achievements

Achievements

Highlights

Organizations

Block or report actuy

Stars

richzhang / PerceptualSimilarity

THUDM / CogVideo

rtqichen / torchdiffeq

maxrmorrison / torchcrepe

Zejun-Yang / AniPortrait

Tencent / HunyuanDiT

IDEA-Research / DWPose

RVC-Boss / GPT-SoVITS

Weizhi-Zhong / IP_LAP

NVIDIA / Megatron-LM

microsoft / DeepSpeedExamples

microsoft / DeepSpeed

baichuan-inc / Baichuan2

FlagAI-Open / Aquila2

OpenTalker / video-retalking

snap-research / articulated-animation

yoyo-nb / Thin-Plate-Spline-Motion-Model

pengsida / learning_research

k-m-irfan / simplified_mediapipe_face_landmarks

GuyTevet / motion-diffusion-model

m-bain / whisperX

SYSTRAN / faster-whisper

jik876 / hifi-gan

CODEJIN / Glow_TTS

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

nomic-ai / gpt4all

auspicious3000 / contentvec

facebookresearch / muavic

openai-translator / bob-plugin-openai-translator

revsic / torch-nansypp