Rongjiehuang
@RongjiehuangFocusing on multimodal synthesis (speech/audio/sing), speech translation, and self-supervised learning.
Language Breakdown
Lines of code distribution across 9 owned repositories
I-Shaped Developer
I-shapedSpecialist — deep expertise in Jupyter Notebook
Collaboration Network
Global Impact visualization
Repos
36
PRs
0
Growth
+18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Claude
@claude
Uriel Singer
@urielsinger
Shunyu Yao
@ysymyth
Piotr Dollar
@pdollar
Tianhong Li
@LTH14
Top Repositories
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
PyTorch Implementation of FastDiff (IJCAI'22)
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
PyTorch Implementation of Multi-Singer (ACM-MM'21)
List of direct speech-to-speech translation papers.
An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/
基于Mask R-CNN的水下垃圾检测
Project page for SingGAN (ACM-MM' 2022): Generative Adversarial Network For High-Fidelity Singing Voice Generation
A collection of resources and papers on Diffusion Models
Open Source Impact
Contributions to external projects
No external contributions found.