Welcome to the demo page for the paper “High Fidelity Compression Algorithm with Improved RVQGAN”. Here, we provide samples from our ablation studies and other competitive baselines.
<aside> <img src="/icons/light-bulb_gray.svg" alt="/icons/light-bulb_gray.svg" width="40px" /> Please click on the following links to listen to more samples and view visualizations
</aside>
Comparison with leading methods
Effects of balanced data-sampling
Comparison with EnCodec at 24kHz
<aside> <img src="/icons/light-bulb_gray.svg" alt="/icons/light-bulb_gray.svg" width="40px" /> Note that while EnCodec simplifies the problem by downsampling the input audio to 24kHz, the proposed method works natively in the 44.1kHz domain, retaining the details and brightness of full bandwidth.
</aside>
Original
EnCodec@24kbps
Ours@8kbps
Original
EnCodec@24kbps
Ours@8kbps