tuteng0915
  • Tsinghua U
  • Beijing


[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Python 155 9 Updated Apr 5, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,979 2,151 Updated Nov 11, 2024

ImageBind: One Embedding Space to Bind Them All

Python 8,361 769 Updated Jul 31, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 236 16 Updated Mar 25, 2024

Evaluation functions for music/audio information retrieval/signal processing algorithms.

Python 613 115 Updated Nov 12, 2024

A curated list of Video to Audio Generation

9 1 Updated Oct 17, 2024

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Python 147 21 Updated Jul 30, 2024

Manually annotated chord data set of US pop songs and Popular Music Collection of RWC Music Database

Python 83 13 Updated Apr 9, 2013

LoRA & Dreambooth training scripts and GUI using kohya-ss's trainer, for diffusion models.

Python 4,619 568 Updated Oct 31, 2024

A large-scale dataset of caption-annotated MIDI files.

Python 49 1 Updated Jul 23, 2024
Jupyter Notebook 152 10 Updated Jul 5, 2024

This repository contains metadata for the WavCaps dataset and code for downstream tasks.

Python 205 12 Updated Jul 25, 2024

The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.

Jupyter Notebook 140 5 Updated Dec 22, 2023

Stable Diffusion web UI

Python 142,976 26,951 Updated Nov 6, 2024
Python 13 Updated Mar 6, 2024

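Extract WeChat chat history and export it to HTML, Word, and Excel documents for permanent storage; analyze the chat history to generate an annual chat report; use the chat data to train a personal AI chat assistant.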

Python 34,617 3,620 Updated Sep 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 34,428 4,245 Updated Nov 16, 2024

A curated list of awesome 3d generation papers

1,078 53 Updated Mar 9, 2023
Python 2 Updated Nov 24, 2023

Responsive Resume CV Website Using HTML, CSS, and JavaScript

HTML 290 166 Updated Mar 31, 2024

A modern static resume template and theme. Powered by Jekyll and GitHub pages.

HTML 2,094 1,398 Updated Jun 15, 2024

[ICCV 2023] Online Clustered Codebook

Python 145 11 Updated Sep 19, 2024

Longformer: The Long-Document Transformer

Python 2,046 276 Updated Feb 8, 2023

Code for our ACL21 paper: Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization

Python 94 7 Updated Aug 2, 2021

Unsupervised Extractive Summarization based on Position-Augmented Centrality

Python 124 27 Updated Sep 6, 2021

Official code repo for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

Python 950 78 Updated Aug 3, 2022

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,723 1,851 Updated Jun 27, 2024