prosodylm / README.md
auspicious3000's picture
update metadata
d9ea2d0
metadata
language:
  - en
license: cc-by-nc-4.0
tags:
  - prosody
  - speech
  - tts
  - llm
pipeline_tag: text-to-speech

ProsodyLM

This repository contains the model checkpoints and sample training data for
the paper ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models.

πŸ“ Repository structure

  • llm/: ProsodyLM checkpoint and tokenizer
  • tts/: TTS checkpoint and speaker embeddings
  • data/: A small-scale sample dataset (same format as the real training data)

πŸ”— Citation

If you use this resource, please cite the paper above.


License: CC BY-NC 4.0