metadata
language:
- en
license: cc-by-nc-4.0
tags:
- prosody
- speech
- tts
- llm
pipeline_tag: text-to-speech
ProsodyLM
This repository contains the model checkpoints and sample training data for
the paper ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models.
π Repository structure
llm/: ProsodyLM checkpoint and tokenizertts/: TTS checkpoint and speaker embeddingsdata/: A small-scale sample dataset (same format as the real training data)
π Citation
If you use this resource, please cite the paper above.
License: CC BY-NC 4.0