lucadellalib
/

focalcodec_50hz_2k_causal

@@ -1,8 +1,8 @@
 ---
-license: apache-2.0
-library_name: torch
 base_model:
 - microsoft/wavlm-large
 pipeline_tag: audio-to-audio
 ---
@@ -20,15 +20,86 @@ This repository contains the **50 Hz causal checkpoint with a codebook size of 2
 - 🌐 **Project Page**: https://lucadellalib.github.io/focalcodec-web/
 - 💾 **GitHub**: https://github.com/lucadellalib/focalcodec
 <img src="focalcodec-stream.png" width="700">
 ---------------------------------------------------------------------------------------------------------
 ## ▶️ Quickstart
-See the readme at: https://github.com/lucadellalib/focalcodec
 ---------------------------------------------------------------------------------------------------------
@@ -47,6 +118,7 @@ See the readme at: https://github.com/lucadellalib/focalcodec
     author  = {Luca {Della Libera} and Cem Subakan and Mirco Ravanelli},
     journal = {arXiv preprint arXiv:2509.16195},
     year    = {2025},
 }
 ```

 ---
 base_model:
 - microsoft/wavlm-large
+library_name: torch
+license: apache-2.0
 pipeline_tag: audio-to-audio
 ---
 - 🌐 **Project Page**: https://lucadellalib.github.io/focalcodec-web/
+- 🔊 **Downstream Tasks**: https://github.com/lucadellalib/audiocodecs
 - 💾 **GitHub**: https://github.com/lucadellalib/focalcodec
 <img src="focalcodec-stream.png" width="700">
 ---------------------------------------------------------------------------------------------------------
+## 🛠️ Installation
+First of all, install [Python 3.8 or later](https://www.python.org). Then, open a terminal and run:
+```bash
+pip install huggingface-hub safetensors sounddevice soundfile torch torchaudio
+```
+---------------------------------------------------------------------------------------------------------
 ## ▶️ Quickstart
+**NOTE**: the `audios` directory contains audio samples that you can download and use to test the codec.
+You can easily load the model using `torch.hub` without cloning the repository:
+```python
+import torch
+import torchaudio
+# Load FocalCodec model
+codec = torch.hub.load(
+    repo_or_dir="lucadellalib/focalcodec",
+    model="focalcodec",
+    config="lucadellalib/focalcodec_50hz",
+    force_reload=True,  # Fetch the latest FocalCodec version from Torch Hub
+)
+codec.eval().requires_grad_(False)
+# Load and preprocess the input audio
+audio_file = "audios/librispeech-dev-clean/251-118436-0003.wav"
+sig, sample_rate = torchaudio.load(audio_file)
+sig = torchaudio.functional.resample(sig, sample_rate, codec.sample_rate_input)
+# Encode audio into tokens
+toks = codec.sig_to_toks(sig)  # Shape: (batch, time)
+print(toks.shape)
+print(toks)
+# Convert tokens to their corresponding binary spherical codes
+codes = codec.toks_to_codes(toks)  # Shape: (batch, code_time, log2 codebook_size)
+print(codes.shape)
+print(codes)
+# Decode tokens back into a waveform
+rec_sig = codec.toks_to_sig(toks)
+# Save the reconstructed audio
+rec_sig = torchaudio.functional.resample(rec_sig, codec.sample_rate_output, sample_rate)
+torchaudio.save("reconstruction.wav", rec_sig, sample_rate)
+```
+Alternatively, you can install FocalCodec as a standard Python package using `pip`:
+```bash
+pip install focalcodec@git+https://github.com/lucadellalib/focalcodec.git@main#egg=focalcodec
+```
+Once installed, you can import it in your scripts:
+```python
+import focalcodec
+config = "lucadellalib/focalcodec_50hz"
+codec = focalcodec.FocalCodec.from_pretrained(config)
+```
+Check the code documentation for more details on model usage and available configurations.
+**NOTE**: the initial **v0.0.1** release is still available at https://github.com/lucadellalib/focalcodec/tree/v0.0.1.
+It can be loaded via `torch.hub` as `repo_or_dir="lucadellalib/focalcodec:v0.0.1"`, or installed via `pip` as
+`focalcodec@git+https://github.com/lucadellalib/focalcodec.git@v0.0.1#egg=focalcodec`.
 ---------------------------------------------------------------------------------------------------------
     author  = {Luca {Della Libera} and Cem Subakan and Mirco Ravanelli},
     journal = {arXiv preprint arXiv:2509.16195},
     year    = {2025},
 }
 ```