Deepak Singh Rawat
commited on
Commit
·
749bdc7
1
Parent(s):
47962d5
Add Huggingface Spaces link
Browse files
README.md
CHANGED
|
@@ -10,6 +10,8 @@ tags:
|
|
| 10 |
|
| 11 |
An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
|
| 12 |
|
|
|
|
|
|
|
| 13 |
# Model Details
|
| 14 |
|
| 15 |
The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.
|
|
|
|
| 10 |
|
| 11 |
An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
|
| 12 |
|
| 13 |
+
## Live demo on Hugging Face Spaces: https://huggingface.co/spaces/deepklarity/poster2plot
|
| 14 |
+
|
| 15 |
# Model Details
|
| 16 |
|
| 17 |
The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.
|