Skip to content

Commit 99cd725

Browse files
committed
update readme
1 parent f435a58 commit 99cd725

File tree

1 file changed

+14
-8
lines changed

1 file changed

+14
-8
lines changed

README.md

Lines changed: 14 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,22 @@
1-
# Descript Audio Codec (.dac)
2-
3-
4-
1+
# Descript Audio Codec (.dac): High-Fidelity Audio Compression with Improved RVQGAN
52

63
This repository contains training and inference scripts
74
for the Descript Audio Codec (.dac), a high fidelity general
8-
neural audio codec.
5+
neural audio codec, introduced in the paper titled **High-Fidelity Audio Compression with Improved RVQGAN**.
96

10-
![](https://static.arxiv.org/static/browse/0.3.4/images/icons/favicon-16x16.png) [arXiv Paper Link](http://arxiv.org/abs/2306.06546) <br>
7+
![](https://static.arxiv.org/static/browse/0.3.4/images/icons/favicon-16x16.png) [arXiv Paper: High-Fidelity Audio Compression with Improved RVQGAN
8+
](http://arxiv.org/abs/2306.06546) <br>
119
📈 [Demo Site](https://descript.notion.site/Descript-Audio-Codec-11389fce0ce2419891d6591a68f814d5)<br>
1210
[Model Weights](https://github.com/descriptinc/descript-audio-codec/releases/download/0.0.1/weights.pth)
1311

12+
### Results
13+
14+
<p align="left">
15+
<img src="./assets/comparsion_stats.png" alt="Comparison of compressions approaches. Our model achieves a higher compression factor compared to all baseline methods. Our model has a ~90x compression factor compared to 32x compression factor of EnCodec and 64x of SoundStream. Note that we operate at a target bitrate of 8 kbps, whereas EnCodec operates at 24 kbps and SoundStream at 6 kbps. We also operate at 44.1 kHz, whereas EnCodec operates at 48 kHz and SoundStream operates at 24 kHz." width=35%></p>
16+
17+
<p align="left">
18+
<img src="./assets/objective_comparisons.png" width=75%></p>
19+
1420
## Usage
1521

1622
### Installation
@@ -22,9 +28,9 @@ pip install .
2228

2329
### Weights
2430
Weights are released as part of this repo under MIT license.
25-
They are automatically downloaded when you first run `encode` or `decode` command. They can be cached locally with
31+
They are automatically downloaded when you first run `encode` or `decode` command. They can be cached locally with
2632
```
27-
python3 -m dac download
33+
python3 -m dac download
2834
```
2935
We provide a Dockerfile that installs all required dependencies for encoding and decoding. The build process caches model weights inside the image. This allows the image to be used without an internet connection. [Please refer to instructions below.](#docker-image)
3036

0 commit comments

Comments
 (0)