Training in progress, epoch 1
Browse files- README.md +78 -0
- added_tokens.json +102 -0
- config.json +61 -0
- eval_loss.png +0 -0
- generation_config.json +10 -0
- model.safetensors +3 -0
- run_stats.json +19 -0
- samples_all.txt +570 -0
- special_tokens_map.json +125 -0
- spiece.model +3 -0
- tokenizer_config.json +941 -0
- trainer_log_history.csv +22 -0
- training_args.bin +3 -0
- training_loss.png +0 -0
README.md
ADDED
|
@@ -0,0 +1,78 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: transformers
|
| 3 |
+
license: apache-2.0
|
| 4 |
+
base_model: google/flan-t5-small
|
| 5 |
+
tags:
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
metrics:
|
| 8 |
+
- rouge
|
| 9 |
+
model-index:
|
| 10 |
+
- name: flan-t5-small-prompt-compression
|
| 11 |
+
results: []
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
| 15 |
+
should probably proofread and complete it, then remove this comment. -->
|
| 16 |
+
|
| 17 |
+
# flan-t5-small-prompt-compression
|
| 18 |
+
|
| 19 |
+
This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
|
| 20 |
+
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 0.5181
|
| 22 |
+
- Rouge1: 0.8820
|
| 23 |
+
- Rouge2: 0.7104
|
| 24 |
+
- Rougel: 0.8485
|
| 25 |
+
- Rougelsum: 0.8488
|
| 26 |
+
- Comp Ratio Mean: 0.6611
|
| 27 |
+
- Comp Ratio P90: 0.7674
|
| 28 |
+
- Pct Violations: 0.0
|
| 29 |
+
|
| 30 |
+
## Model description
|
| 31 |
+
|
| 32 |
+
More information needed
|
| 33 |
+
|
| 34 |
+
## Intended uses & limitations
|
| 35 |
+
|
| 36 |
+
More information needed
|
| 37 |
+
|
| 38 |
+
## Training and evaluation data
|
| 39 |
+
|
| 40 |
+
More information needed
|
| 41 |
+
|
| 42 |
+
## Training procedure
|
| 43 |
+
|
| 44 |
+
### Training hyperparameters
|
| 45 |
+
|
| 46 |
+
The following hyperparameters were used during training:
|
| 47 |
+
- learning_rate: 0.0001
|
| 48 |
+
- train_batch_size: 8
|
| 49 |
+
- eval_batch_size: 8
|
| 50 |
+
- seed: 42
|
| 51 |
+
- optimizer: Use adafactor and the args are:
|
| 52 |
+
No additional optimizer arguments
|
| 53 |
+
- lr_scheduler_type: linear
|
| 54 |
+
- lr_scheduler_warmup_ratio: 0.1
|
| 55 |
+
- num_epochs: 10
|
| 56 |
+
|
| 57 |
+
### Training results
|
| 58 |
+
|
| 59 |
+
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Comp Ratio Mean | Comp Ratio P90 | Pct Violations |
|
| 60 |
+
|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:---------------:|:--------------:|:--------------:|
|
| 61 |
+
| 1.2576 | 1.0 | 1594 | 0.6457 | 0.8528 | 0.6587 | 0.8197 | 0.8199 | 0.6626 | 0.7736 | 0.0 |
|
| 62 |
+
| 0.7688 | 2.0 | 3188 | 0.5727 | 0.8689 | 0.6851 | 0.8345 | 0.8349 | 0.6647 | 0.7694 | 0.0 |
|
| 63 |
+
| 0.6591 | 3.0 | 4782 | 0.5405 | 0.8750 | 0.6963 | 0.8413 | 0.8417 | 0.6684 | 0.7692 | 0.0 |
|
| 64 |
+
| 0.5957 | 4.0 | 6376 | 0.5333 | 0.8771 | 0.7002 | 0.8438 | 0.8440 | 0.6600 | 0.7660 | 0.0 |
|
| 65 |
+
| 0.548 | 5.0 | 7970 | 0.5212 | 0.8792 | 0.7059 | 0.8467 | 0.8470 | 0.6617 | 0.7648 | 0.0004 |
|
| 66 |
+
| 0.5139 | 6.0 | 9564 | 0.5196 | 0.8799 | 0.7064 | 0.8472 | 0.8473 | 0.6597 | 0.7636 | 0.0 |
|
| 67 |
+
| 0.4862 | 7.0 | 11158 | 0.5144 | 0.8805 | 0.7076 | 0.8473 | 0.8474 | 0.6656 | 0.7705 | 0.0004 |
|
| 68 |
+
| 0.466 | 8.0 | 12752 | 0.5157 | 0.8819 | 0.7098 | 0.8489 | 0.8492 | 0.6622 | 0.7674 | 0.0 |
|
| 69 |
+
| 0.4499 | 9.0 | 14346 | 0.5156 | 0.8816 | 0.7096 | 0.8486 | 0.8489 | 0.6604 | 0.7660 | 0.0 |
|
| 70 |
+
| 0.4393 | 10.0 | 15940 | 0.5181 | 0.8820 | 0.7104 | 0.8485 | 0.8488 | 0.6611 | 0.7674 | 0.0 |
|
| 71 |
+
|
| 72 |
+
|
| 73 |
+
### Framework versions
|
| 74 |
+
|
| 75 |
+
- Transformers 4.57.1
|
| 76 |
+
- Pytorch 2.6.0+cu124
|
| 77 |
+
- Datasets 4.4.1
|
| 78 |
+
- Tokenizers 0.22.1
|
added_tokens.json
ADDED
|
@@ -0,0 +1,102 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"<extra_id_0>": 32099,
|
| 3 |
+
"<extra_id_10>": 32089,
|
| 4 |
+
"<extra_id_11>": 32088,
|
| 5 |
+
"<extra_id_12>": 32087,
|
| 6 |
+
"<extra_id_13>": 32086,
|
| 7 |
+
"<extra_id_14>": 32085,
|
| 8 |
+
"<extra_id_15>": 32084,
|
| 9 |
+
"<extra_id_16>": 32083,
|
| 10 |
+
"<extra_id_17>": 32082,
|
| 11 |
+
"<extra_id_18>": 32081,
|
| 12 |
+
"<extra_id_19>": 32080,
|
| 13 |
+
"<extra_id_1>": 32098,
|
| 14 |
+
"<extra_id_20>": 32079,
|
| 15 |
+
"<extra_id_21>": 32078,
|
| 16 |
+
"<extra_id_22>": 32077,
|
| 17 |
+
"<extra_id_23>": 32076,
|
| 18 |
+
"<extra_id_24>": 32075,
|
| 19 |
+
"<extra_id_25>": 32074,
|
| 20 |
+
"<extra_id_26>": 32073,
|
| 21 |
+
"<extra_id_27>": 32072,
|
| 22 |
+
"<extra_id_28>": 32071,
|
| 23 |
+
"<extra_id_29>": 32070,
|
| 24 |
+
"<extra_id_2>": 32097,
|
| 25 |
+
"<extra_id_30>": 32069,
|
| 26 |
+
"<extra_id_31>": 32068,
|
| 27 |
+
"<extra_id_32>": 32067,
|
| 28 |
+
"<extra_id_33>": 32066,
|
| 29 |
+
"<extra_id_34>": 32065,
|
| 30 |
+
"<extra_id_35>": 32064,
|
| 31 |
+
"<extra_id_36>": 32063,
|
| 32 |
+
"<extra_id_37>": 32062,
|
| 33 |
+
"<extra_id_38>": 32061,
|
| 34 |
+
"<extra_id_39>": 32060,
|
| 35 |
+
"<extra_id_3>": 32096,
|
| 36 |
+
"<extra_id_40>": 32059,
|
| 37 |
+
"<extra_id_41>": 32058,
|
| 38 |
+
"<extra_id_42>": 32057,
|
| 39 |
+
"<extra_id_43>": 32056,
|
| 40 |
+
"<extra_id_44>": 32055,
|
| 41 |
+
"<extra_id_45>": 32054,
|
| 42 |
+
"<extra_id_46>": 32053,
|
| 43 |
+
"<extra_id_47>": 32052,
|
| 44 |
+
"<extra_id_48>": 32051,
|
| 45 |
+
"<extra_id_49>": 32050,
|
| 46 |
+
"<extra_id_4>": 32095,
|
| 47 |
+
"<extra_id_50>": 32049,
|
| 48 |
+
"<extra_id_51>": 32048,
|
| 49 |
+
"<extra_id_52>": 32047,
|
| 50 |
+
"<extra_id_53>": 32046,
|
| 51 |
+
"<extra_id_54>": 32045,
|
| 52 |
+
"<extra_id_55>": 32044,
|
| 53 |
+
"<extra_id_56>": 32043,
|
| 54 |
+
"<extra_id_57>": 32042,
|
| 55 |
+
"<extra_id_58>": 32041,
|
| 56 |
+
"<extra_id_59>": 32040,
|
| 57 |
+
"<extra_id_5>": 32094,
|
| 58 |
+
"<extra_id_60>": 32039,
|
| 59 |
+
"<extra_id_61>": 32038,
|
| 60 |
+
"<extra_id_62>": 32037,
|
| 61 |
+
"<extra_id_63>": 32036,
|
| 62 |
+
"<extra_id_64>": 32035,
|
| 63 |
+
"<extra_id_65>": 32034,
|
| 64 |
+
"<extra_id_66>": 32033,
|
| 65 |
+
"<extra_id_67>": 32032,
|
| 66 |
+
"<extra_id_68>": 32031,
|
| 67 |
+
"<extra_id_69>": 32030,
|
| 68 |
+
"<extra_id_6>": 32093,
|
| 69 |
+
"<extra_id_70>": 32029,
|
| 70 |
+
"<extra_id_71>": 32028,
|
| 71 |
+
"<extra_id_72>": 32027,
|
| 72 |
+
"<extra_id_73>": 32026,
|
| 73 |
+
"<extra_id_74>": 32025,
|
| 74 |
+
"<extra_id_75>": 32024,
|
| 75 |
+
"<extra_id_76>": 32023,
|
| 76 |
+
"<extra_id_77>": 32022,
|
| 77 |
+
"<extra_id_78>": 32021,
|
| 78 |
+
"<extra_id_79>": 32020,
|
| 79 |
+
"<extra_id_7>": 32092,
|
| 80 |
+
"<extra_id_80>": 32019,
|
| 81 |
+
"<extra_id_81>": 32018,
|
| 82 |
+
"<extra_id_82>": 32017,
|
| 83 |
+
"<extra_id_83>": 32016,
|
| 84 |
+
"<extra_id_84>": 32015,
|
| 85 |
+
"<extra_id_85>": 32014,
|
| 86 |
+
"<extra_id_86>": 32013,
|
| 87 |
+
"<extra_id_87>": 32012,
|
| 88 |
+
"<extra_id_88>": 32011,
|
| 89 |
+
"<extra_id_89>": 32010,
|
| 90 |
+
"<extra_id_8>": 32091,
|
| 91 |
+
"<extra_id_90>": 32009,
|
| 92 |
+
"<extra_id_91>": 32008,
|
| 93 |
+
"<extra_id_92>": 32007,
|
| 94 |
+
"<extra_id_93>": 32006,
|
| 95 |
+
"<extra_id_94>": 32005,
|
| 96 |
+
"<extra_id_95>": 32004,
|
| 97 |
+
"<extra_id_96>": 32003,
|
| 98 |
+
"<extra_id_97>": 32002,
|
| 99 |
+
"<extra_id_98>": 32001,
|
| 100 |
+
"<extra_id_99>": 32000,
|
| 101 |
+
"<extra_id_9>": 32090
|
| 102 |
+
}
|
config.json
ADDED
|
@@ -0,0 +1,61 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"T5ForConditionalGeneration"
|
| 4 |
+
],
|
| 5 |
+
"classifier_dropout": 0.0,
|
| 6 |
+
"d_ff": 1024,
|
| 7 |
+
"d_kv": 64,
|
| 8 |
+
"d_model": 512,
|
| 9 |
+
"decoder_start_token_id": 0,
|
| 10 |
+
"dense_act_fn": "gelu_new",
|
| 11 |
+
"dropout_rate": 0.05,
|
| 12 |
+
"dtype": "float32",
|
| 13 |
+
"eos_token_id": 1,
|
| 14 |
+
"feed_forward_proj": "gated-gelu",
|
| 15 |
+
"initializer_factor": 1.0,
|
| 16 |
+
"is_encoder_decoder": true,
|
| 17 |
+
"is_gated_act": true,
|
| 18 |
+
"layer_norm_epsilon": 1e-06,
|
| 19 |
+
"model_type": "t5",
|
| 20 |
+
"n_positions": 512,
|
| 21 |
+
"num_decoder_layers": 8,
|
| 22 |
+
"num_heads": 6,
|
| 23 |
+
"num_layers": 8,
|
| 24 |
+
"output_past": true,
|
| 25 |
+
"pad_token_id": 0,
|
| 26 |
+
"relative_attention_max_distance": 128,
|
| 27 |
+
"relative_attention_num_buckets": 32,
|
| 28 |
+
"task_specific_params": {
|
| 29 |
+
"summarization": {
|
| 30 |
+
"early_stopping": true,
|
| 31 |
+
"length_penalty": 2.0,
|
| 32 |
+
"max_length": 200,
|
| 33 |
+
"min_length": 30,
|
| 34 |
+
"no_repeat_ngram_size": 3,
|
| 35 |
+
"num_beams": 4,
|
| 36 |
+
"prefix": "summarize: "
|
| 37 |
+
},
|
| 38 |
+
"translation_en_to_de": {
|
| 39 |
+
"early_stopping": true,
|
| 40 |
+
"max_length": 300,
|
| 41 |
+
"num_beams": 4,
|
| 42 |
+
"prefix": "translate English to German: "
|
| 43 |
+
},
|
| 44 |
+
"translation_en_to_fr": {
|
| 45 |
+
"early_stopping": true,
|
| 46 |
+
"max_length": 300,
|
| 47 |
+
"num_beams": 4,
|
| 48 |
+
"prefix": "translate English to French: "
|
| 49 |
+
},
|
| 50 |
+
"translation_en_to_ro": {
|
| 51 |
+
"early_stopping": true,
|
| 52 |
+
"max_length": 300,
|
| 53 |
+
"num_beams": 4,
|
| 54 |
+
"prefix": "translate English to Romanian: "
|
| 55 |
+
}
|
| 56 |
+
},
|
| 57 |
+
"tie_word_embeddings": false,
|
| 58 |
+
"transformers_version": "4.57.1",
|
| 59 |
+
"use_cache": true,
|
| 60 |
+
"vocab_size": 32128
|
| 61 |
+
}
|
eval_loss.png
ADDED
|
generation_config.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"decoder_start_token_id": 0,
|
| 3 |
+
"eos_token_id": [
|
| 4 |
+
1
|
| 5 |
+
],
|
| 6 |
+
"no_repeat_ngram_size": 3,
|
| 7 |
+
"num_beams": 4,
|
| 8 |
+
"pad_token_id": 0,
|
| 9 |
+
"transformers_version": "4.57.1"
|
| 10 |
+
}
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:60eff07c615fdd9b287f4df102b2a2c8a837674b2783898a642df0ea449445a5
|
| 3 |
+
size 307867048
|
run_stats.json
ADDED
|
@@ -0,0 +1,19 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"dataset_stats": {
|
| 3 |
+
"rows_before_filter": 15000.0,
|
| 4 |
+
"rows_after_filter": 15000.0,
|
| 5 |
+
"dropped_for_length": 0.0,
|
| 6 |
+
"max_input_length": 256.0,
|
| 7 |
+
"max_target_length": 128.0,
|
| 8 |
+
"short_threshold": 40.0
|
| 9 |
+
},
|
| 10 |
+
"baseline_scores": {
|
| 11 |
+
"rouge1": 0.25675665009084964,
|
| 12 |
+
"rouge2": 0.13416016149259882,
|
| 13 |
+
"rougeL": 0.23903438122862936,
|
| 14 |
+
"rougeLsum": 0.23864111489830342,
|
| 15 |
+
"comp_ratio_mean": 0.2556183630363363,
|
| 16 |
+
"comp_ratio_p90": 0.535846560846561,
|
| 17 |
+
"pct_violations": 0.0
|
| 18 |
+
}
|
| 19 |
+
}
|
samples_all.txt
ADDED
|
@@ -0,0 +1,570 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
|
| 3 |
+
Epoch 1 — 7650 steps
|
| 4 |
+
----------------------------------------------------------------------------------------------------
|
| 5 |
+
Sample 1
|
| 6 |
+
Input(16 tok): summarize: What is measured on the Gay-Lussac scale...
|
| 7 |
+
Pred (1 tok): ...
|
| 8 |
+
Ref (10 tok): Gay-Lussac scale measures what...
|
| 9 |
+
----------------------------------------------------------------------------------------------------
|
| 10 |
+
Sample 2
|
| 11 |
+
Input(41 tok): summarize: I would greatly appreciate it if you could kindly provide information about the teams that are regarded as the rivals of the Air Force Falcons football team....
|
| 12 |
+
Pred (1 tok): ...
|
| 13 |
+
Ref (8 tok): Air Force Falcons football rivals...
|
| 14 |
+
----------------------------------------------------------------------------------------------------
|
| 15 |
+
Sample 3
|
| 16 |
+
Input(37 tok): summarize: I would greatly appreciate it if you could inform me how many teaspoons correspond to a single tablespoon, thank you very much....
|
| 17 |
+
Pred (1 tok): ...
|
| 18 |
+
Ref (7 tok): Teaspoons per tablespoon...
|
| 19 |
+
----------------------------------------------------------------------------------------------------
|
| 20 |
+
|
| 21 |
+
|
| 22 |
+
Epoch 2 — 15300 steps
|
| 23 |
+
----------------------------------------------------------------------------------------------------
|
| 24 |
+
Sample 1
|
| 25 |
+
Input(18 tok): summarize: Was gilt als Emma Edmondsons beste Arbeit als Schauspielerin?...
|
| 26 |
+
Pred (1 tok): ...
|
| 27 |
+
Ref (9 tok): Beste Schauspielarbeit von Emma Edmondson...
|
| 28 |
+
----------------------------------------------------------------------------------------------------
|
| 29 |
+
Sample 2
|
| 30 |
+
Input(24 tok): summarize: Gib mir eine Liste indischer Gerichte, die ich für eine Hausparty zubereiten kann....
|
| 31 |
+
Pred (1 tok): ...
|
| 32 |
+
Ref (9 tok): Liste indische Gerichte für Hausparty...
|
| 33 |
+
----------------------------------------------------------------------------------------------------
|
| 34 |
+
Sample 3
|
| 35 |
+
Input(50 tok): summarize: Pourriez-vous s'il vous plaît me conseiller sur quel auteur russe je devrais lire en premier afin de commencer à découvrir la littérature russe ?...
|
| 36 |
+
Pred (1 tok): ...
|
| 37 |
+
Ref (10 tok): Premier auteur russe à lire...
|
| 38 |
+
----------------------------------------------------------------------------------------------------
|
| 39 |
+
|
| 40 |
+
|
| 41 |
+
Epoch 3 — 22950 steps
|
| 42 |
+
----------------------------------------------------------------------------------------------------
|
| 43 |
+
Sample 1
|
| 44 |
+
Input(15 tok): summarize: Summarize best way to climb Mount Everest...
|
| 45 |
+
Pred (1 tok): ...
|
| 46 |
+
Ref (11 tok): Summarize best way to summit Everest...
|
| 47 |
+
----------------------------------------------------------------------------------------------------
|
| 48 |
+
Sample 2
|
| 49 |
+
Input(32 tok): summarize: Could you please, at your earliest convenience, provide me with some information regarding the identity of Marshall Strickland?...
|
| 50 |
+
Pred (1 tok): ...
|
| 51 |
+
Ref (6 tok): Who is Marshall Strickland...
|
| 52 |
+
----------------------------------------------------------------------------------------------------
|
| 53 |
+
Sample 3
|
| 54 |
+
Input(81 tok): summarize: Would you be so kind as to classify each of the following countries according to whether they follow left-hand or right-hand traffic conventions, that is, based on the side of the road on which vehicles typically travel: USA, Spain, UK, India, Singapore, Switzerland, Australia, Egypt, France, and Italy?...
|
| 55 |
+
Pred (1 tok): ...
|
| 56 |
+
Ref (32 tok): Classify driving side (left/right) for: USA, Spain, UK, India, Singapore, Switzerland, Australia, Egypt, France, Italy...
|
| 57 |
+
----------------------------------------------------------------------------------------------------
|
| 58 |
+
|
| 59 |
+
|
| 60 |
+
Epoch 1 — 7650 steps
|
| 61 |
+
----------------------------------------------------------------------------------------------------
|
| 62 |
+
Sample 1
|
| 63 |
+
Input(16 tok): summarize: What is measured on the Gay-Lussac scale...
|
| 64 |
+
Pred (12 tok): What is measured on Gay-Lussac scale...
|
| 65 |
+
Ref (10 tok): Gay-Lussac scale measures what...
|
| 66 |
+
----------------------------------------------------------------------------------------------------
|
| 67 |
+
Sample 2
|
| 68 |
+
Input(41 tok): summarize: I would greatly appreciate it if you could kindly provide information about the teams that are regarded as the rivals of the Air Force Falcons football team....
|
| 69 |
+
Pred (9 tok): Air Force Falcons football team rivals...
|
| 70 |
+
Ref (8 tok): Air Force Falcons football rivals...
|
| 71 |
+
----------------------------------------------------------------------------------------------------
|
| 72 |
+
Sample 3
|
| 73 |
+
Input(37 tok): summarize: I would greatly appreciate it if you could inform me how many teaspoons correspond to a single tablespoon, thank you very much....
|
| 74 |
+
Pred (11 tok): How many teaspoons are single tablespoons...
|
| 75 |
+
Ref (7 tok): Teaspoons per tablespoon...
|
| 76 |
+
----------------------------------------------------------------------------------------------------
|
| 77 |
+
|
| 78 |
+
|
| 79 |
+
Epoch 2 — 15300 steps
|
| 80 |
+
----------------------------------------------------------------------------------------------------
|
| 81 |
+
Sample 1
|
| 82 |
+
Input(18 tok): summarize: Was gilt als Emma Edmondsons beste Arbeit als Schauspielerin?...
|
| 83 |
+
Pred (10 tok): Emma Edmondsons beste Arbeit als Schauspielerin...
|
| 84 |
+
Ref (9 tok): Beste Schauspielarbeit von Emma Edmondson...
|
| 85 |
+
----------------------------------------------------------------------------------------------------
|
| 86 |
+
Sample 2
|
| 87 |
+
Input(24 tok): summarize: Gib mir eine Liste indischer Gerichte, die ich für eine Hausparty zubereiten kann....
|
| 88 |
+
Pred (9 tok): Liste indischer Gerichte für Hausparty...
|
| 89 |
+
Ref (9 tok): Liste indische Gerichte für Hausparty...
|
| 90 |
+
----------------------------------------------------------------------------------------------------
|
| 91 |
+
Sample 3
|
| 92 |
+
Input(50 tok): summarize: Pourriez-vous s'il vous plaît me conseiller sur quel auteur russe je devrais lire en premier afin de commencer à découvrir la littérature russe ?...
|
| 93 |
+
Pred (22 tok): Quel auteur russe devrais lire en premier pour découvrir la littérature russe...
|
| 94 |
+
Ref (10 tok): Premier auteur russe à lire...
|
| 95 |
+
----------------------------------------------------------------------------------------------------
|
| 96 |
+
|
| 97 |
+
|
| 98 |
+
Epoch 3 — 22950 steps
|
| 99 |
+
----------------------------------------------------------------------------------------------------
|
| 100 |
+
Sample 1
|
| 101 |
+
Input(15 tok): summarize: Summarize best way to climb Mount Everest...
|
| 102 |
+
Pred (12 tok): Summarize best way to climb Mount Everest...
|
| 103 |
+
Ref (11 tok): Summarize best way to summit Everest...
|
| 104 |
+
----------------------------------------------------------------------------------------------------
|
| 105 |
+
Sample 2
|
| 106 |
+
Input(32 tok): summarize: Could you please, at your earliest convenience, provide me with some information regarding the identity of Marshall Strickland?...
|
| 107 |
+
Pred (6 tok): Marshall Strickland identity...
|
| 108 |
+
Ref (6 tok): Who is Marshall Strickland...
|
| 109 |
+
----------------------------------------------------------------------------------------------------
|
| 110 |
+
Sample 3
|
| 111 |
+
Input(81 tok): summarize: Would you be so kind as to classify each of the following countries according to whether they follow left-hand or right-hand traffic conventions, that is, based on the side of the road on which vehicles typically travel: USA, Spain, UK, India, Singapore, Switzerland, Australia, Egypt, France, and Italy?...
|
| 112 |
+
Pred (37 tok): Classify each country as left-hand or right-hand traffic conventions: USA, Spain, UK, India, Singapore, Switzerland, Australia, Egypt, France, Italy...
|
| 113 |
+
Ref (32 tok): Classify driving side (left/right) for: USA, Spain, UK, India, Singapore, Switzerland, Australia, Egypt, France, Italy...
|
| 114 |
+
----------------------------------------------------------------------------------------------------
|
| 115 |
+
|
| 116 |
+
|
| 117 |
+
Epoch 4 — 30600 steps
|
| 118 |
+
----------------------------------------------------------------------------------------------------
|
| 119 |
+
Sample 1
|
| 120 |
+
Input(24 tok): summarize: In welcher Episode erkennt Jon Snow die Autorität von Daenerys Targaryen an?...
|
| 121 |
+
Pred (14 tok): Episode von Jon Snow Autorität von Daenerys Targaryen...
|
| 122 |
+
Ref (20 tok): In welcher Episode erkennt Jon Snow die Autorität von Daenerys Targaryen an...
|
| 123 |
+
----------------------------------------------------------------------------------------------------
|
| 124 |
+
Sample 2
|
| 125 |
+
Input(34 tok): summarize: Wie würden Sie die folgenden Sportarten in zwei Gruppen einteilen: Baseball, Bogenschießen, Zehnkampf und Hockey?...
|
| 126 |
+
Pred (23 tok): Wie diese Sportarten in zwei Gruppen einteilen: Baseball, Bogenschießen, Zehnkampf, Hockey...
|
| 127 |
+
Ref (19 tok): Teile in zwei Gruppen: Baseball, Bogenschießen, Zehnkampf, Hockey...
|
| 128 |
+
----------------------------------------------------------------------------------------------------
|
| 129 |
+
Sample 3
|
| 130 |
+
Input(28 tok): summarize: Veuillez me donner une liste des cinq principales attractions touristiques à visiter en Europe....
|
| 131 |
+
Pred (11 tok): Liste 5 principales attractions touristiques en Europe...
|
| 132 |
+
Ref (9 tok): Top 5 attractions touristiques en Europe...
|
| 133 |
+
----------------------------------------------------------------------------------------------------
|
| 134 |
+
|
| 135 |
+
|
| 136 |
+
Epoch 5 — 38250 steps
|
| 137 |
+
----------------------------------------------------------------------------------------------------
|
| 138 |
+
Sample 1
|
| 139 |
+
Input(17 tok): summarize: ¿Qué es una clique en teoría de grafos?...
|
| 140 |
+
Pred (13 tok): Qué es una clique en teoría de grafos...
|
| 141 |
+
Ref (11 tok): Definir clique en teoría de grafos...
|
| 142 |
+
----------------------------------------------------------------------------------------------------
|
| 143 |
+
Sample 2
|
| 144 |
+
Input(16 tok): summarize: ¿En qué año se jugó el Ice Bowl?...
|
| 145 |
+
Pred (9 tok): Qué año se jugó Ice Bowl...
|
| 146 |
+
Ref (5 tok): Año Ice Bowl...
|
| 147 |
+
----------------------------------------------------------------------------------------------------
|
| 148 |
+
Sample 3
|
| 149 |
+
Input(20 tok): summarize: ¿Podría explicarme qué es un flat white, por favor?...
|
| 150 |
+
Pred (7 tok): Qué es un flat white...
|
| 151 |
+
Ref (6 tok): Definición de flat white...
|
| 152 |
+
----------------------------------------------------------------------------------------------------
|
| 153 |
+
|
| 154 |
+
|
| 155 |
+
Epoch 6 — 45900 steps
|
| 156 |
+
----------------------------------------------------------------------------------------------------
|
| 157 |
+
Sample 1
|
| 158 |
+
Input(30 tok): summarize: Basierend auf dem untenstehenden Absatz, beschreiben Sie bitte zwei geeignete Anwendungen von Edelgasen....
|
| 159 |
+
Pred (18 tok): Aus dem Absatz, beschreiben Sie zwei geeignete Edelgas-Anwendungen....
|
| 160 |
+
Ref (17 tok): Aus dem Absatz unten zwei Anwendungen von Edelgasen beschreiben....
|
| 161 |
+
----------------------------------------------------------------------------------------------------
|
| 162 |
+
Sample 2
|
| 163 |
+
Input(10 tok): summarize: ¿Qué es PySpark?...
|
| 164 |
+
Pred (3 tok): PySpark...
|
| 165 |
+
Ref (6 tok): Explica PySpark...
|
| 166 |
+
----------------------------------------------------------------------------------------------------
|
| 167 |
+
Sample 3
|
| 168 |
+
Input(70 tok): summarize: I would greatly appreciate it if you could please identify all albums released by the band Radiohead, noting both their titles and their respective years of release, and then list them neatly in the format {name} - {year}, separated by semicolons....
|
| 169 |
+
Pred (25 tok): List Radiohead albums with titles and release years; format {name} - {year}, semicolons...
|
| 170 |
+
Ref (25 tok): Radiohead albums: list {name} - {year} for all albums; semicolon-separated...
|
| 171 |
+
----------------------------------------------------------------------------------------------------
|
| 172 |
+
|
| 173 |
+
|
| 174 |
+
Epoch 7 — 53550 steps
|
| 175 |
+
----------------------------------------------------------------------------------------------------
|
| 176 |
+
Sample 1
|
| 177 |
+
Input(66 tok): summarize: Classifiez les modèles d’avions ci-dessous en fonction des entreprises qui les ont créés : Next-Generation 737, A220, 737 MAX, A350, A330, 747-8, 767, A320, 777, 777X, A380, 787....
|
| 178 |
+
Pred (52 tok): Classifiez les modèles d’avions par entreprises : Next-Generation 737, A220, 737 MAX, A350, A330, 747-8, 767, A320, 777, 777X, A380, 787....
|
| 179 |
+
Ref (48 tok): Classifiez ces avions par constructeur : Next-Generation 737, A220, 737 MAX, A350, A330, 747-8, 767, A320, 777, 777X, A380, 787....
|
| 180 |
+
----------------------------------------------------------------------------------------------------
|
| 181 |
+
Sample 2
|
| 182 |
+
Input(20 tok): summarize: Who composed the 'Moonlight Sonata', and when?...
|
| 183 |
+
Pred (11 tok): Who composed Moonlight Sonata and when...
|
| 184 |
+
Ref (10 tok): Moonlight Sonata: composer and date...
|
| 185 |
+
----------------------------------------------------------------------------------------------------
|
| 186 |
+
Sample 3
|
| 187 |
+
Input(26 tok): summarize: What Breaking Bad actor guest starred on Season 6, Episode 2 "Drive" of The X-Files?...
|
| 188 |
+
Pred (21 tok): What Breaking Bad actor guest starred on Season 6, Episode 2 "Drive" of X-Files...
|
| 189 |
+
Ref (23 tok): Which Breaking Bad actor guest-starred on X-Files S6E2 'Drive'...
|
| 190 |
+
----------------------------------------------------------------------------------------------------
|
| 191 |
+
|
| 192 |
+
|
| 193 |
+
Epoch 8 — 61200 steps
|
| 194 |
+
----------------------------------------------------------------------------------------------------
|
| 195 |
+
Sample 1
|
| 196 |
+
Input(55 tok): summarize: Considering the increasing presence of plastics in our natural surroundings, could you kindly share your thoughts on whether LEGO bricks remain a suitable and beneficial toy choice for children today?...
|
| 197 |
+
Pred (12 tok): Is LEGO bricks useful for children today...
|
| 198 |
+
Ref (19 tok): Are LEGO bricks still good kids' toy amid rising plastic pollution...
|
| 199 |
+
----------------------------------------------------------------------------------------------------
|
| 200 |
+
Sample 2
|
| 201 |
+
Input(39 tok): summarize: Quels éléments devrais-je prendre en compte lorsque je décide entre une voiture électrique ou une voiture à essence ?...
|
| 202 |
+
Pred (20 tok): Quels éléments prendre en compte quand je décide électrique ou essence...
|
| 203 |
+
Ref (17 tok): Facteurs à considérer pour choisir voiture électrique ou essence...
|
| 204 |
+
----------------------------------------------------------------------------------------------------
|
| 205 |
+
Sample 3
|
| 206 |
+
Input(23 tok): summarize: Quel type de musique est présenté dans l’album The Great Ray Charles ?...
|
| 207 |
+
Pred (14 tok): Quel genre de musique est présenté dans The Great Ray Charles...
|
| 208 |
+
Ref (12 tok): Genre de l’album The Great Ray Charles...
|
| 209 |
+
----------------------------------------------------------------------------------------------------
|
| 210 |
+
|
| 211 |
+
|
| 212 |
+
Epoch 9 — 68850 steps
|
| 213 |
+
----------------------------------------------------------------------------------------------------
|
| 214 |
+
Sample 1
|
| 215 |
+
Input(28 tok): summarize: ¿Podrías decirme si Twitter llega alguna vez a incumplir sus propias reglas?...
|
| 216 |
+
Pred (10 tok): Twitter a incumplir sus reglas...
|
| 217 |
+
Ref (11 tok): Twitter incumple sus propias reglas...
|
| 218 |
+
----------------------------------------------------------------------------------------------------
|
| 219 |
+
Sample 2
|
| 220 |
+
Input(38 tok): summarize: I would greatly appreciate it if you could kindly inform me about the bird that produces the largest egg among all avian species....
|
| 221 |
+
Pred (6 tok): Largest avian egg...
|
| 222 |
+
Ref (6 tok): Bird with largest egg...
|
| 223 |
+
----------------------------------------------------------------------------------------------------
|
| 224 |
+
Sample 3
|
| 225 |
+
Input(36 tok): summarize: I would greatly appreciate it if you could take a moment to describe what a pot hole refers to, in clear terms....
|
| 226 |
+
Pred (5 tok): What is pot hole...
|
| 227 |
+
Ref (5 tok): What is pot hole...
|
| 228 |
+
----------------------------------------------------------------------------------------------------
|
| 229 |
+
|
| 230 |
+
|
| 231 |
+
Epoch 10 — 76500 steps
|
| 232 |
+
----------------------------------------------------------------------------------------------------
|
| 233 |
+
Sample 1
|
| 234 |
+
Input(46 tok): summarize: I would greatly appreciate it if you could kindly furnish me with information regarding the ten latest winners of the Men's Boat Race held between Oxford and Cambridge....
|
| 235 |
+
Pred (16 tok): List 10 latest Oxford–Cambridge Men's Boat Race winners...
|
| 236 |
+
Ref (16 tok): List last 10 winners of Oxford–Cambridge Men's Boat Race...
|
| 237 |
+
----------------------------------------------------------------------------------------------------
|
| 238 |
+
Sample 2
|
| 239 |
+
Input(19 tok): summarize: What is the best way to evaluate the performance of my marketing spend?...
|
| 240 |
+
Pred (9 tok): Best way to evaluate marketing spend performance...
|
| 241 |
+
Ref (9 tok): Best way to evaluate marketing spend performance...
|
| 242 |
+
----------------------------------------------------------------------------------------------------
|
| 243 |
+
Sample 3
|
| 244 |
+
Input(11 tok): summarize: What is a PFD?...
|
| 245 |
+
Pred (4 tok): PFD definition...
|
| 246 |
+
Ref (4 tok): PFD definition...
|
| 247 |
+
----------------------------------------------------------------------------------------------------
|
| 248 |
+
|
| 249 |
+
|
| 250 |
+
Epoch 1 — 8074 steps
|
| 251 |
+
----------------------------------------------------------------------------------------------------
|
| 252 |
+
Sample 1
|
| 253 |
+
Input(23 tok): summarize: Welche sind einige der Möglichkeiten, wie die Gesellschaft verbessert werden kann?...
|
| 254 |
+
Pred (6 tok): From s -...
|
| 255 |
+
Ref (12 tok): Liste einige Möglichkeiten zur Verbesserung der Gesellschaft...
|
| 256 |
+
----------------------------------------------------------------------------------------------------
|
| 257 |
+
Sample 2
|
| 258 |
+
Input(39 tok): summarize: Según el párrafo que aparece a continuación, ¿cuáles son los parques nacionales más y menos populares de los Estados Unidos?...
|
| 259 |
+
Pred (3 tok): What,...
|
| 260 |
+
Ref (22 tok): Según párrafo abajo, parques nacionales EE.UU. más y menos populares...
|
| 261 |
+
----------------------------------------------------------------------------------------------------
|
| 262 |
+
Sample 3
|
| 263 |
+
Input(19 tok): summarize: Wie wird die Reihenfolge des NFL-Drafts festgelegt?...
|
| 264 |
+
Pred (17 tok): Classruption,,,?s, oderncestoNFLerst,tailsNFLNFL...
|
| 265 |
+
Ref (11 tok): Wie Reihenfolge NFL-Draft festgelegt...
|
| 266 |
+
----------------------------------------------------------------------------------------------------
|
| 267 |
+
|
| 268 |
+
|
| 269 |
+
Epoch 2 — 16148 steps
|
| 270 |
+
----------------------------------------------------------------------------------------------------
|
| 271 |
+
Sample 1
|
| 272 |
+
Input(55 tok): summarize: Tell me which of these terms are related to artificial intelligence versus gardening: tilling, seed, gradient descent, production, Bayesian optimization, genetically modified organism, heirloom, transfer learning...
|
| 273 |
+
Pred (4 tok): What ,...
|
| 274 |
+
Ref (43 tok): Classify terms as AI-related or gardening: tilling; seed; gradient descent; production; Bayesian optimization; genetically modified organism; heirloom; transfer learning...
|
| 275 |
+
----------------------------------------------------------------------------------------------------
|
| 276 |
+
Sample 2
|
| 277 |
+
Input(17 tok): summarize: Who were the major players in the Watergate conspiracy?...
|
| 278 |
+
Pred (2 tok): What...
|
| 279 |
+
Ref (8 tok): Watergate conspiracy major players...
|
| 280 |
+
----------------------------------------------------------------------------------------------------
|
| 281 |
+
Sample 3
|
| 282 |
+
Input(19 tok): summarize: What is the average lifespan of a Golden Retriever?...
|
| 283 |
+
Pred (2 tok): What...
|
| 284 |
+
Ref (9 tok): Golden Retriever average lifespan...
|
| 285 |
+
----------------------------------------------------------------------------------------------------
|
| 286 |
+
|
| 287 |
+
|
| 288 |
+
Epoch 3 — 24222 steps
|
| 289 |
+
----------------------------------------------------------------------------------------------------
|
| 290 |
+
Sample 1
|
| 291 |
+
Input(19 tok): summarize: Wie sollte ich auswählen, welchen Käse ich kaufen soll?...
|
| 292 |
+
Pred (2 tok): List...
|
| 293 |
+
Ref (9 tok): Wie Käse zum Kaufen auswählen...
|
| 294 |
+
----------------------------------------------------------------------------------------------------
|
| 295 |
+
Sample 2
|
| 296 |
+
Input(31 tok): summarize: ¿Cuáles son algunos de los principales riesgos relacionados con los grandes modelos de lenguaje?...
|
| 297 |
+
Pred (2 tok): From...
|
| 298 |
+
Ref (16 tok): Principales riesgos de los grandes modelos de lenguaje...
|
| 299 |
+
----------------------------------------------------------------------------------------------------
|
| 300 |
+
Sample 3
|
| 301 |
+
Input(33 tok): summarize: ما أكثر ما يزعجك عند استخدام فلاتر البريد الإلكتروني لتنظيم صندوق الوارد لديك؟...
|
| 302 |
+
Pred (5 tok): How, ,...
|
| 303 |
+
Ref (30 tok): ما أكثر ما يزعجك عند استخدام فلاتر البريد الإلكتروني لتنظيم صندوق الوارد لديك؟...
|
| 304 |
+
----------------------------------------------------------------------------------------------------
|
| 305 |
+
|
| 306 |
+
|
| 307 |
+
Epoch 1 — 8074 steps
|
| 308 |
+
----------------------------------------------------------------------------------------------------
|
| 309 |
+
Sample 1
|
| 310 |
+
Input(23 tok): summarize: Welche sind einige der Möglichkeiten, wie die Gesellschaft verbessert werden kann?...
|
| 311 |
+
Pred (6 tok): From s -...
|
| 312 |
+
Ref (12 tok): Liste einige Möglichkeiten zur Verbesserung der Gesellschaft...
|
| 313 |
+
----------------------------------------------------------------------------------------------------
|
| 314 |
+
Sample 2
|
| 315 |
+
Input(39 tok): summarize: Según el párrafo que aparece a continuación, ¿cuáles son los parques nacionales más y menos populares de los Estados Unidos?...
|
| 316 |
+
Pred (3 tok): What,...
|
| 317 |
+
Ref (22 tok): Según párrafo abajo, parques nacionales EE.UU. más y menos populares...
|
| 318 |
+
----------------------------------------------------------------------------------------------------
|
| 319 |
+
Sample 3
|
| 320 |
+
Input(19 tok): summarize: Wie wird die Reihenfolge des NFL-Drafts festgelegt?...
|
| 321 |
+
Pred (17 tok): Classruption,,,?s, oderncestoNFLerst,tailsNFLNFL...
|
| 322 |
+
Ref (11 tok): Wie Reihenfolge NFL-Draft festgelegt...
|
| 323 |
+
----------------------------------------------------------------------------------------------------
|
| 324 |
+
|
| 325 |
+
|
| 326 |
+
Epoch 2 — 16148 steps
|
| 327 |
+
----------------------------------------------------------------------------------------------------
|
| 328 |
+
Sample 1
|
| 329 |
+
Input(55 tok): summarize: Tell me which of these terms are related to artificial intelligence versus gardening: tilling, seed, gradient descent, production, Bayesian optimization, genetically modified organism, heirloom, transfer learning...
|
| 330 |
+
Pred (4 tok): What ,...
|
| 331 |
+
Ref (43 tok): Classify terms as AI-related or gardening: tilling; seed; gradient descent; production; Bayesian optimization; genetically modified organism; heirloom; transfer learning...
|
| 332 |
+
----------------------------------------------------------------------------------------------------
|
| 333 |
+
Sample 2
|
| 334 |
+
Input(17 tok): summarize: Who were the major players in the Watergate conspiracy?...
|
| 335 |
+
Pred (2 tok): What...
|
| 336 |
+
Ref (8 tok): Watergate conspiracy major players...
|
| 337 |
+
----------------------------------------------------------------------------------------------------
|
| 338 |
+
Sample 3
|
| 339 |
+
Input(19 tok): summarize: What is the average lifespan of a Golden Retriever?...
|
| 340 |
+
Pred (2 tok): What...
|
| 341 |
+
Ref (9 tok): Golden Retriever average lifespan...
|
| 342 |
+
----------------------------------------------------------------------------------------------------
|
| 343 |
+
|
| 344 |
+
|
| 345 |
+
Epoch 3 — 24222 steps
|
| 346 |
+
----------------------------------------------------------------------------------------------------
|
| 347 |
+
Sample 1
|
| 348 |
+
Input(19 tok): summarize: Wie sollte ich auswählen, welchen Käse ich kaufen soll?...
|
| 349 |
+
Pred (2 tok): List...
|
| 350 |
+
Ref (9 tok): Wie Käse zum Kaufen auswählen...
|
| 351 |
+
----------------------------------------------------------------------------------------------------
|
| 352 |
+
Sample 2
|
| 353 |
+
Input(31 tok): summarize: ¿Cuáles son algunos de los principales riesgos relacionados con los grandes modelos de lenguaje?...
|
| 354 |
+
Pred (2 tok): From...
|
| 355 |
+
Ref (16 tok): Principales riesgos de los grandes modelos de lenguaje...
|
| 356 |
+
----------------------------------------------------------------------------------------------------
|
| 357 |
+
Sample 3
|
| 358 |
+
Input(33 tok): summarize: ما أكثر ما يزعجك عند استخدام فلاتر البريد الإلكتروني لتنظيم صندوق الوارد لديك؟...
|
| 359 |
+
Pred (5 tok): How, ,...
|
| 360 |
+
Ref (30 tok): ما أكثر ما يزعجك عند استخدام فلاتر البريد الإلكتروني لتنظيم صندوق الوارد لديك؟...
|
| 361 |
+
----------------------------------------------------------------------------------------------------
|
| 362 |
+
|
| 363 |
+
|
| 364 |
+
Epoch 1 — 1594 steps
|
| 365 |
+
----------------------------------------------------------------------------------------------------
|
| 366 |
+
Sample 1
|
| 367 |
+
Input(64 tok): How does the estimated weight and speed of Santa’s sleigh—320,000 tons moving at about 650 miles (1,050 km) per second—compare to real-world physics limitations, and what would this mean for reindeer like Dasher, Dancer, and Rudolph?...
|
| 368 |
+
Pred (49 tok): How Santa sleigh weight speed 320,000 tons moving 650 miles (1,050 km) per second compares to real-world physics limitations, what this mean for reindeer like Dasher, Dancer, Rudolph...
|
| 369 |
+
Ref (37 tok): How Santa sleigh 320000 tons 650 miles 1050 km per second compares real-world physics limits and implications for reindeer Dasher Dancer Rudolph...
|
| 370 |
+
----------------------------------------------------------------------------------------------------
|
| 371 |
+
Sample 2
|
| 372 |
+
Input(39 tok): During the 2012 Christmas season, what were some of the main highlights of the White House holiday decorations preview on November 28, including the replica of Bo, the Obama's Portuguese Water Dog?...
|
| 373 |
+
Pred (20 tok): Main highlights White House holiday decorations preview November 28 including replica Bo Obama's Portuguese Water Dog replica...
|
| 374 |
+
Ref (22 tok): Main highlights White House holiday decorations preview Nov 28 2012 Christmas season including Bo replica Obama's Portuguese Water Dog...
|
| 375 |
+
----------------------------------------------------------------------------------------------------
|
| 376 |
+
Sample 3
|
| 377 |
+
Input(35 tok): What challenges did early participants like Charles Nimmo identify regarding the scalability and international cooperation required for Project Loon to become a viable global Internet solution?...
|
| 378 |
+
Pred (28 tok): What challenges early participants like Charles Nimmo identified regarding scalability international cooperation required for Project Loon become viable global Internet solution...
|
| 379 |
+
Ref (29 tok): What challenges Charles Nimmo, other early participants identified about scalability, international cooperation needed for Project Loon global Internet viability...
|
| 380 |
+
----------------------------------------------------------------------------------------------------
|
| 381 |
+
|
| 382 |
+
|
| 383 |
+
Epoch 2 — 3188 steps
|
| 384 |
+
----------------------------------------------------------------------------------------------------
|
| 385 |
+
Sample 1
|
| 386 |
+
Input(30 tok): Why is access to the Spireworks app in New York City currently restricted to invited users, and how does this exclusivity affect public participation?...
|
| 387 |
+
Pred (20 tok): Why Spireworks app New York City restricted to invited users, how exclusivity affect public participation...
|
| 388 |
+
Ref (19 tok): Why Spireworks app access NYC restricted to invited users how exclusivity affects public participation...
|
| 389 |
+
----------------------------------------------------------------------------------------------------
|
| 390 |
+
Sample 2
|
| 391 |
+
Input(57 tok): How did Sergio Aguero’s performance in Manchester City’s 4-1 victory over Sunderland at the Stadium of Light in December 2014 help the team end its six-year winless streak at that venue since Sheikh Mansour’s 2008 takeover?...
|
| 392 |
+
Pred (40 tok): How Sergio Aguero performance Manchester City 4-1 win over Sunderland Stadium of Light December 2014 helped team end six-year winless streak since Sheikh Mansour 2008 takeover...
|
| 393 |
+
Ref (41 tok): How Sergio Aguero performance in Manchester City 4-1 win over Sunderland Stadium of Light December 2014 helped end six-year winless streak there since Sheikh Mansour 2008 takeover...
|
| 394 |
+
----------------------------------------------------------------------------------------------------
|
| 395 |
+
Sample 3
|
| 396 |
+
Input(42 tok): In what ways did the Greater Manchester Police investigate the 2013 online trolling campaign against Michael Le Vell, and what legal challenges did they face in determining whether the material broke contempt of court laws?...
|
| 397 |
+
Pred (31 tok): How Greater Manchester Police investigate 2013 online trolling campaign against Michael Le Vell and what legal challenges they faced determining material broke contempt of court laws...
|
| 398 |
+
Ref (30 tok): How Greater Manchester Police investigated 2013 online trolling campaign against Michael Le Vell and legal challenges determining if material broke contempt of court laws...
|
| 399 |
+
----------------------------------------------------------------------------------------------------
|
| 400 |
+
|
| 401 |
+
|
| 402 |
+
Epoch 3 — 4782 steps
|
| 403 |
+
----------------------------------------------------------------------------------------------------
|
| 404 |
+
Sample 1
|
| 405 |
+
Input(45 tok): Why did Edward Snowden’s father, Lon Snowden, travel to Moscow in October 2013, and what did he say about the possibility of his son returning to the United States after being granted asylum in Russia?...
|
| 406 |
+
Pred (28 tok): Why Edward Snowden father Lon Snowden travel Moscow October 2013 what he say about possibility son returning United States after asylum granted Russia...
|
| 407 |
+
Ref (23 tok): Why Lon Snowden travel Moscow October 2013 what he say about Edward Snowden return US after asylum Russia...
|
| 408 |
+
----------------------------------------------------------------------------------------------------
|
| 409 |
+
Sample 2
|
| 410 |
+
Input(48 tok): What role did U.S. Secretary of State John Kerry and U.K. Foreign Secretary William Hague play in coordinating potential international financial support for Ukraine in late February 2014, and what conditions did they set for such assistance?...
|
| 411 |
+
Pred (29 tok): Role John Kerry and William Hague in coordinating potential international financial support for Ukraine late February 2014 and what conditions they set for such assistance...
|
| 412 |
+
Ref (23 tok): Role John Kerry, William Hague coordinating international financial support Ukraine late February 2014, conditions set for assistance...
|
| 413 |
+
----------------------------------------------------------------------------------------------------
|
| 414 |
+
Sample 3
|
| 415 |
+
Input(55 tok): How did the 2008 arrest of Ricardo Gutierrez Vargas, Mexico’s Interpol chief, raise concerns that Interpol’s communications systems and databases may have been compromised, leading the agency’s France-based headquarters to send investigators to Mexico?...
|
| 416 |
+
Pred (33 tok): How 2008 arrest Ricardo Gutierrez Vargas Mexico Interpol chief raised concerns Interpol communications systems databases compromised leading France headquarters to send investigators to Mexico...
|
| 417 |
+
Ref (32 tok): How 2008 arrest Ricardo Gutierrez Vargas Mexico Interpol chief raised concerns Interpol communications systems databases compromised leading France headquarters send investigators to Mexico...
|
| 418 |
+
----------------------------------------------------------------------------------------------------
|
| 419 |
+
|
| 420 |
+
|
| 421 |
+
Epoch 4 — 6376 steps
|
| 422 |
+
----------------------------------------------------------------------------------------------------
|
| 423 |
+
Sample 1
|
| 424 |
+
Input(56 tok): What were the circumstances that led former Premier League footballer Marlon King to plead guilty to dangerous driving at Nottingham Crown Court in March 2014, and what injuries resulted from the crash on the A46/A17 in Winthorpe, Nottinghamshire?...
|
| 425 |
+
Pred (39 tok): What circumstances led former Premier League footballer Marlon King plead guilty dangerous driving Nottingham Crown Court March 2014 and what injuries caused crash A46/A17 Winthorpe Nottinghamshire...
|
| 426 |
+
Ref (47 tok): What circumstances led former Premier League footballer Marlon King to plead guilty to dangerous driving at Nottingham Crown Court March 2014 and what injuries resulted from crash on A46/A17 in Winthorpe Nottinghamshire...
|
| 427 |
+
----------------------------------------------------------------------------------------------------
|
| 428 |
+
Sample 2
|
| 429 |
+
Input(42 tok): How did the emotional impact of being unable to attend her uncle’s funeral in Hampshire serve as a turning point in Kim Freshwater’s decision to lose weight and reclaim her health?...
|
| 430 |
+
Pred (22 tok): How emotional impact unable attend uncle funeral Hampshire shaped Kim Freshwater decision lose weight reclaim health...
|
| 431 |
+
Ref (16 tok): How missing uncle funeral Hampshire became turning point Kim Freshwater weight loss health recovery...
|
| 432 |
+
----------------------------------------------------------------------------------------------------
|
| 433 |
+
Sample 3
|
| 434 |
+
Input(46 tok): What role did Robert Seldon Lady, the former CIA base chief in Milan, play in the 2003 extraordinary rendition of Abu Omar, and why did an Italian court convict him and 22 other Americans in 2009?...
|
| 435 |
+
Pred (32 tok): Role Robert Seldon Lady former CIA base chief Milan in 2003 extraordinary Abu Omar rendition and why Italian court convict him 22 other Americans 2009...
|
| 436 |
+
Ref (31 tok): What role Robert Seldon Lady former CIA base chief Milan played 2003 extraordinary rendition Abu Omar and why Italian court convicted him and 22 Americans 2009...
|
| 437 |
+
----------------------------------------------------------------------------------------------------
|
| 438 |
+
|
| 439 |
+
|
| 440 |
+
Epoch 5 — 7970 steps
|
| 441 |
+
----------------------------------------------------------------------------------------------------
|
| 442 |
+
Sample 1
|
| 443 |
+
Input(63 tok): What evidence did Mexican federal police reportedly seize during the August 2010 searches of Jean Baptiste Kingery Moinssonm’s properties in Mazatlan, Sinaloa, and how might those items be connected to alleged arms trafficking activities for the Sinaloa cartel?...
|
| 444 |
+
Pred (46 tok): What evidence Mexican federal police reportedly seize August 2010 searches Jean Baptiste Kingery Moinssonm properties Mazatlan Sinaloa, how items link alleged arms trafficking Sinalona cartel...
|
| 445 |
+
Ref (46 tok): What evidence Mexican federal police reportedly seized August 2010 searches Jean Baptiste Kingery Moinssonm properties Mazatlan Sinaloa, how items connected alleged arms trafficking Sinaloa cartel...
|
| 446 |
+
----------------------------------------------------------------------------------------------------
|
| 447 |
+
Sample 2
|
| 448 |
+
Input(39 tok): According to World Bank data cited in 2024, why does 90% of Uganda’s population still lack access to electricity, and what economic and educational effects does this have on rural families?...
|
| 449 |
+
Pred (21 tok): Why 90% Uganda population lack electricity per World Bank 2024 data, what economic educational effects on rural families...
|
| 450 |
+
Ref (20 tok): Why 90% Uganda population lack electricity per 2024 World Bank data, economic educational effects on rural families...
|
| 451 |
+
----------------------------------------------------------------------------------------------------
|
| 452 |
+
Sample 3
|
| 453 |
+
Input(50 tok): Why has the Battersea neighborhood in South-West London, despite its affluent reputation, been described as a crime hotspot in 2011, and what other high-profile violent incidents occurred there around that time?...
|
| 454 |
+
Pred (33 tok): Why Battersea South-West London despite affluent reputation described crime hotspot 2011 what other high-profile violent incidents occurred there...
|
| 455 |
+
Ref (32 tok): Why Battersea South-West London affluent area called crime hotspot 2011, what other high-profile violent incidents occurred there then...
|
| 456 |
+
----------------------------------------------------------------------------------------------------
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
Epoch 6 — 9564 steps
|
| 460 |
+
----------------------------------------------------------------------------------------------------
|
| 461 |
+
Sample 1
|
| 462 |
+
Input(40 tok): Why did Pat Ekins decide to sell her house in Middlesbrough, North Yorkshire, without informing her children, and how did this decision affect her relationship with her family afterward?...
|
| 463 |
+
Pred (26 tok): Why Pat Ekins sell house Middlesbrough North Yorkshire without informing children, how decision affect relationship with family afterward...
|
| 464 |
+
Ref (24 tok): Why Pat Ekins sell house Middlesbrough North Yorkshire without telling children, how decision affect family relationship afterward...
|
| 465 |
+
----------------------------------------------------------------------------------------------------
|
| 466 |
+
Sample 2
|
| 467 |
+
Input(57 tok): What role did former CIA operative John Kiriakou play in the 2002 capture of al Qaeda member Abu Zubaydah in Faisalabad, Pakistan, and how did he verify Zubaydah’s identity after the raid?...
|
| 468 |
+
Pred (45 tok): Role former CIA operative John Kiriakou in 2002 capture al Qaeda member Abu Zubaydah Faisalabad Pakistan, how he verify Zubaydadah identity after raid...
|
| 469 |
+
Ref (44 tok): What role former CIA operative John Kiriakou play 2002 capture al Qaeda member Abu Zubaydah Faisalabad Pakistan how he verify Zubaydah identity after raid...
|
| 470 |
+
----------------------------------------------------------------------------------------------------
|
| 471 |
+
Sample 3
|
| 472 |
+
Input(51 tok): What symptoms did players and spectators experience from the carbon monoxide exposure during the Dells Ducks and Ice Hawks hockey game at the Poppy Waterman Ice Rink in Lake Delton, Wisconsin, and how did medical teams respond?...
|
| 473 |
+
Pred (37 tok): What symptoms players spectators experienced from carbon monoxide exposure Dells Ducks Ice Hawks hockey Poppy Waterman Ice Rink Lake Delton Wisconsin and how medical teams responded...
|
| 474 |
+
Ref (38 tok): What symptoms players spectators had from carbon monoxide exposure during Dells Ducks Ice Hawks hockey game Poppy Waterman Ice Rink Lake Delton Wisconsin how medical teams responded...
|
| 475 |
+
----------------------------------------------------------------------------------------------------
|
| 476 |
+
|
| 477 |
+
|
| 478 |
+
Epoch 7 — 11158 steps
|
| 479 |
+
----------------------------------------------------------------------------------------------------
|
| 480 |
+
Sample 1
|
| 481 |
+
Input(57 tok): What economic impact did Illinois officials in 2009 predict from the federal government’s purchase and expansion of the Thomson Correctional Center, including estimates for job creation, local investment, and the use of the facility to relieve overcrowding in other U.S. prisons?...
|
| 482 |
+
Pred (32 tok): What economic impact Illinois officials 2009 predicted from federal purchase expansion Thomson Correctional Center including job creation local investment facility relief overcrowding other US prisons...
|
| 483 |
+
Ref (36 tok): What economic impact Illinois officials 2009 predicted from federal purchase expansion Thomson Correctional Center including job creation local investment use to relieve overcrowding other U.S. prisons...
|
| 484 |
+
----------------------------------------------------------------------------------------------------
|
| 485 |
+
Sample 2
|
| 486 |
+
Input(46 tok): What evidence did South Korean intelligence officials cite in their 2012 report suggesting that North Korea was preparing for a third nuclear test at the Punggye-ri site, and how did satellite imagery support these claims?...
|
| 487 |
+
Pred (31 tok): What evidence South Korean intelligence officials cite 2012 report suggesting North Korea prepared third nuclear test Punggye-ri site, how satellite imagery supported claims...
|
| 488 |
+
Ref (31 tok): What evidence South Korean intelligence cite 2012 report suggesting North Korea preparing third nuclear test Punggye-ri site, how satellite imagery supported claims...
|
| 489 |
+
----------------------------------------------------------------------------------------------------
|
| 490 |
+
Sample 3
|
| 491 |
+
Input(56 tok): According to the China Internet Network Information Center, China had nearly 600 million internet users in 2014. How does this number compare to the 254 million internet users in the United States reported by the Harvard Business Review, and what does it suggest about the growth of internet connectivity in China?...
|
| 492 |
+
Pred (56 tok): According to China Internet Network Information Center, China had nearly 600 million internet users in 2014, how does this number compare to the 254 million Internet users in the United States reported by the Harvard Business Review, and what does it suggest about the growth of internet connectivity in China?...
|
| 493 |
+
Ref (32 tok): How 2014 China 600M internet users per China Internet Network Information Center compare to US 254M per Harvard Business Review, what it suggest about China internet growth...
|
| 494 |
+
----------------------------------------------------------------------------------------------------
|
| 495 |
+
|
| 496 |
+
|
| 497 |
+
Epoch 8 — 12752 steps
|
| 498 |
+
----------------------------------------------------------------------------------------------------
|
| 499 |
+
Sample 1
|
| 500 |
+
Input(45 tok): What is 'The Message,' the modern-language Bible translation by Eugene Peterson that reportedly stopped two bullets during the February 2014 shooting of Dayton bus driver Rickey Wagoner?...
|
| 501 |
+
Pred (30 tok): What modern-language Bible translation by Eugene Peterson stopping two bullets during February 2014 shooting Dayton bus driver Rickey Wagoner...
|
| 502 |
+
Ref (38 tok): What is The Message modern-language Bible translation by Eugene Peterson that reportedly stopped two bullets during February 2014 shooting of Dayton bus driver Rickey Wagoner...
|
| 503 |
+
----------------------------------------------------------------------------------------------------
|
| 504 |
+
Sample 2
|
| 505 |
+
Input(44 tok): How did Generals Lloyd Austin and Martin Dempsey differ or agree on the potential role of U.S. troops in the 2015 offensive to liberate Mosul, Iraq, from ISIS control?...
|
| 506 |
+
Pred (29 tok): How Lloyd Austin Martin Dempsey differ or agree on potential US troops role 2015 offensive liberating Mosul Iraq from ISIS control...
|
| 507 |
+
Ref (28 tok): How Lloyd Austin Martin Dempsey differ or agree on U.S. troop role in 2015 Mosul offensive against ISIS...
|
| 508 |
+
----------------------------------------------------------------------------------------------------
|
| 509 |
+
Sample 3
|
| 510 |
+
Input(45 tok): What role did Philly Tech Week play in helping Frank Lee secure approval from Brandywine Realty Trust CEO Gerard Sweeney to stage the giant Pong game on the Circa Centre building in Philadelphia?...
|
| 511 |
+
Pred (36 tok): How Philly Tech Week helped Frank Lee secure approval from Brandywine Realty Trust CEO Gerard Sweeney to stage giant Pong game Circa Centre building Philadelphia...
|
| 512 |
+
Ref (34 tok): How Philly Tech Week helped Frank Lee get Brandywine Realty Trust CEO Gerard Sweeney approval to stage giant Pong on Circa Centre Philadelphia...
|
| 513 |
+
----------------------------------------------------------------------------------------------------
|
| 514 |
+
|
| 515 |
+
|
| 516 |
+
Epoch 9 — 14346 steps
|
| 517 |
+
----------------------------------------------------------------------------------------------------
|
| 518 |
+
Sample 1
|
| 519 |
+
Input(62 tok): How did the CIA respond to allegations from Majid Shoukat Khan’s lawyers in 2012 that he had been tortured in secret CIA prisons prior to being transferred to Guantanamo Bay, and what justification did the agency provide for its interrogation program?...
|
| 520 |
+
Pred (40 tok): How CIA respond 2012 allegations Majid Shoukat Khan tortured secret CIA prisons before transfer Guantanamo Bay, what justification agency gave for interrogation program...
|
| 521 |
+
Ref (39 tok): How CIA respond 2012 to Majid Shoukat Khan lawyer allegations torture in secret prisons before Guantanamo transfer, what justification agency gave for interrogation program...
|
| 522 |
+
----------------------------------------------------------------------------------------------------
|
| 523 |
+
Sample 2
|
| 524 |
+
Input(49 tok): In a 2014 survey conducted by the UK energy company E.ON, why did the British public vote online shopping as the most important technological development of the 21st century, ranking it above innovations like internet banking and mobile internet devices?...
|
| 525 |
+
Pred (27 tok): Why British public vote online shopping most important technological development 21st century 2014 E.ON survey ranking it above internet banking mobile devices...
|
| 526 |
+
Ref (28 tok): Why British public in 2014 E.ON survey ranked online shopping top 21st-century tech over internet banking, mobile internet devices...
|
| 527 |
+
----------------------------------------------------------------------------------------------------
|
| 528 |
+
Sample 3
|
| 529 |
+
Input(54 tok): What was the significance of German tennis player Martin Emmrich proposing to Dutch player Michaella Krajicek on the court at the Topshelf Open in Rosmalen, Netherlands in June 2014, and how did the crowd react to the proposal?...
|
| 530 |
+
Pred (32 tok): Significance Martin Emmrich proposing to Michaella Krajicek on Topshelf Open Rosmalen Netherlands June 2014 and crowd reaction...
|
| 531 |
+
Ref (40 tok): Significance of German tennis player Martin Emmrich proposing to Dutch player Michaella Krajicek on court at Topshelf Open Rosmalen Netherlands June 2014 and crowd reaction...
|
| 532 |
+
----------------------------------------------------------------------------------------------------
|
| 533 |
+
|
| 534 |
+
|
| 535 |
+
Epoch 10 — 15940 steps
|
| 536 |
+
----------------------------------------------------------------------------------------------------
|
| 537 |
+
Sample 1
|
| 538 |
+
Input(49 tok): How did Southern California artist Sandow Birk’s surfing trips in countries such as Indonesia, India, and Morocco influence his decision to begin the 'American Quran' art project after the September 11, 2001 attacks in the United States?...
|
| 539 |
+
Pred (26 tok): How Sandow Birk surfing trips Indonesia India Morocco influenced decision start American Quran art project after September 11 2001 US attacks...
|
| 540 |
+
Ref (26 tok): How Sandow Birk surfing trips Indonesia India Morocco influenced decision start American Quran project after September 11 2001 attacks United States...
|
| 541 |
+
----------------------------------------------------------------------------------------------------
|
| 542 |
+
Sample 2
|
| 543 |
+
Input(77 tok): How did stuntman Monte Perlin’s early life in Lake Arrowhead, California, and his experience riding motocross bikes from the age of 10 help prepare him for his later work on major films like 'Spider-Man,' 'Star Trek,' and 'Indiana Jones and the Kingdom of the Crystal Skull'?...
|
| 544 |
+
Pred (39 tok): How Monte Perlin early life Lake Arrowhead California and riding motocross bikes age 10 prepared him for later films Spider-Man Star Trek Indiana Jones Kingdom of the Crystal Skull...
|
| 545 |
+
Ref (41 tok): How Monte Perlin early life Lake Arrowhead California and motocross experience from age 10 prepared him for stunt work on films Spider-Man Star Trek Indiana Jones and Kingdom of Crystal Skull...
|
| 546 |
+
----------------------------------------------------------------------------------------------------
|
| 547 |
+
Sample 3
|
| 548 |
+
Input(55 tok): How did critical recognition, such as Rolling Stone naming The Kills’ 2011 album 'Blood Pressures' one of the best albums of the year, influence the band’s sense of identity as an underground act gaining mainstream attention?...
|
| 549 |
+
Pred (37 tok): How critical recognition like Rolling Stone naming The Kills 2011 album Blood Pressures one of best albums of year influenced band sense of identity as underground act gaining mainstream attention...
|
| 550 |
+
Ref (28 tok): How Rolling Stone naming The Kills 2011 album Blood Pressures best of year affected band identity as underground act gaining mainstream attention...
|
| 551 |
+
----------------------------------------------------------------------------------------------------
|
| 552 |
+
|
| 553 |
+
|
| 554 |
+
Epoch 1 — 1594 steps
|
| 555 |
+
----------------------------------------------------------------------------------------------------
|
| 556 |
+
Sample 1
|
| 557 |
+
Input(64 tok): How does the estimated weight and speed of Santa’s sleigh—320,000 tons moving at about 650 miles (1,050 km) per second—compare to real-world physics limitations, and what would this mean for reindeer like Dasher, Dancer, and Rudolph?...
|
| 558 |
+
Pred (49 tok): How Santa sleigh weight speed 320,000 tons moving 650 miles (1,050 km) per second compares to real-world physics limitations, what this mean for reindeer like Dasher, Dancer, Rudolph...
|
| 559 |
+
Ref (37 tok): How Santa sleigh 320000 tons 650 miles 1050 km per second compares real-world physics limits and implications for reindeer Dasher Dancer Rudolph...
|
| 560 |
+
----------------------------------------------------------------------------------------------------
|
| 561 |
+
Sample 2
|
| 562 |
+
Input(39 tok): During the 2012 Christmas season, what were some of the main highlights of the White House holiday decorations preview on November 28, including the replica of Bo, the Obama's Portuguese Water Dog?...
|
| 563 |
+
Pred (20 tok): Main highlights White House holiday decorations preview November 28 including replica Bo Obama's Portuguese Water Dog replica...
|
| 564 |
+
Ref (22 tok): Main highlights White House holiday decorations preview Nov 28 2012 Christmas season including Bo replica Obama's Portuguese Water Dog...
|
| 565 |
+
----------------------------------------------------------------------------------------------------
|
| 566 |
+
Sample 3
|
| 567 |
+
Input(35 tok): What challenges did early participants like Charles Nimmo identify regarding the scalability and international cooperation required for Project Loon to become a viable global Internet solution?...
|
| 568 |
+
Pred (28 tok): What challenges early participants like Charles Nimmo identified regarding scalability international cooperation required for Project Loon become viable global Internet solution...
|
| 569 |
+
Ref (29 tok): What challenges Charles Nimmo, other early participants identified about scalability, international cooperation needed for Project Loon global Internet viability...
|
| 570 |
+
----------------------------------------------------------------------------------------------------
|
special_tokens_map.json
ADDED
|
@@ -0,0 +1,125 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"additional_special_tokens": [
|
| 3 |
+
"<extra_id_0>",
|
| 4 |
+
"<extra_id_1>",
|
| 5 |
+
"<extra_id_2>",
|
| 6 |
+
"<extra_id_3>",
|
| 7 |
+
"<extra_id_4>",
|
| 8 |
+
"<extra_id_5>",
|
| 9 |
+
"<extra_id_6>",
|
| 10 |
+
"<extra_id_7>",
|
| 11 |
+
"<extra_id_8>",
|
| 12 |
+
"<extra_id_9>",
|
| 13 |
+
"<extra_id_10>",
|
| 14 |
+
"<extra_id_11>",
|
| 15 |
+
"<extra_id_12>",
|
| 16 |
+
"<extra_id_13>",
|
| 17 |
+
"<extra_id_14>",
|
| 18 |
+
"<extra_id_15>",
|
| 19 |
+
"<extra_id_16>",
|
| 20 |
+
"<extra_id_17>",
|
| 21 |
+
"<extra_id_18>",
|
| 22 |
+
"<extra_id_19>",
|
| 23 |
+
"<extra_id_20>",
|
| 24 |
+
"<extra_id_21>",
|
| 25 |
+
"<extra_id_22>",
|
| 26 |
+
"<extra_id_23>",
|
| 27 |
+
"<extra_id_24>",
|
| 28 |
+
"<extra_id_25>",
|
| 29 |
+
"<extra_id_26>",
|
| 30 |
+
"<extra_id_27>",
|
| 31 |
+
"<extra_id_28>",
|
| 32 |
+
"<extra_id_29>",
|
| 33 |
+
"<extra_id_30>",
|
| 34 |
+
"<extra_id_31>",
|
| 35 |
+
"<extra_id_32>",
|
| 36 |
+
"<extra_id_33>",
|
| 37 |
+
"<extra_id_34>",
|
| 38 |
+
"<extra_id_35>",
|
| 39 |
+
"<extra_id_36>",
|
| 40 |
+
"<extra_id_37>",
|
| 41 |
+
"<extra_id_38>",
|
| 42 |
+
"<extra_id_39>",
|
| 43 |
+
"<extra_id_40>",
|
| 44 |
+
"<extra_id_41>",
|
| 45 |
+
"<extra_id_42>",
|
| 46 |
+
"<extra_id_43>",
|
| 47 |
+
"<extra_id_44>",
|
| 48 |
+
"<extra_id_45>",
|
| 49 |
+
"<extra_id_46>",
|
| 50 |
+
"<extra_id_47>",
|
| 51 |
+
"<extra_id_48>",
|
| 52 |
+
"<extra_id_49>",
|
| 53 |
+
"<extra_id_50>",
|
| 54 |
+
"<extra_id_51>",
|
| 55 |
+
"<extra_id_52>",
|
| 56 |
+
"<extra_id_53>",
|
| 57 |
+
"<extra_id_54>",
|
| 58 |
+
"<extra_id_55>",
|
| 59 |
+
"<extra_id_56>",
|
| 60 |
+
"<extra_id_57>",
|
| 61 |
+
"<extra_id_58>",
|
| 62 |
+
"<extra_id_59>",
|
| 63 |
+
"<extra_id_60>",
|
| 64 |
+
"<extra_id_61>",
|
| 65 |
+
"<extra_id_62>",
|
| 66 |
+
"<extra_id_63>",
|
| 67 |
+
"<extra_id_64>",
|
| 68 |
+
"<extra_id_65>",
|
| 69 |
+
"<extra_id_66>",
|
| 70 |
+
"<extra_id_67>",
|
| 71 |
+
"<extra_id_68>",
|
| 72 |
+
"<extra_id_69>",
|
| 73 |
+
"<extra_id_70>",
|
| 74 |
+
"<extra_id_71>",
|
| 75 |
+
"<extra_id_72>",
|
| 76 |
+
"<extra_id_73>",
|
| 77 |
+
"<extra_id_74>",
|
| 78 |
+
"<extra_id_75>",
|
| 79 |
+
"<extra_id_76>",
|
| 80 |
+
"<extra_id_77>",
|
| 81 |
+
"<extra_id_78>",
|
| 82 |
+
"<extra_id_79>",
|
| 83 |
+
"<extra_id_80>",
|
| 84 |
+
"<extra_id_81>",
|
| 85 |
+
"<extra_id_82>",
|
| 86 |
+
"<extra_id_83>",
|
| 87 |
+
"<extra_id_84>",
|
| 88 |
+
"<extra_id_85>",
|
| 89 |
+
"<extra_id_86>",
|
| 90 |
+
"<extra_id_87>",
|
| 91 |
+
"<extra_id_88>",
|
| 92 |
+
"<extra_id_89>",
|
| 93 |
+
"<extra_id_90>",
|
| 94 |
+
"<extra_id_91>",
|
| 95 |
+
"<extra_id_92>",
|
| 96 |
+
"<extra_id_93>",
|
| 97 |
+
"<extra_id_94>",
|
| 98 |
+
"<extra_id_95>",
|
| 99 |
+
"<extra_id_96>",
|
| 100 |
+
"<extra_id_97>",
|
| 101 |
+
"<extra_id_98>",
|
| 102 |
+
"<extra_id_99>"
|
| 103 |
+
],
|
| 104 |
+
"eos_token": {
|
| 105 |
+
"content": "</s>",
|
| 106 |
+
"lstrip": false,
|
| 107 |
+
"normalized": false,
|
| 108 |
+
"rstrip": false,
|
| 109 |
+
"single_word": false
|
| 110 |
+
},
|
| 111 |
+
"pad_token": {
|
| 112 |
+
"content": "<pad>",
|
| 113 |
+
"lstrip": false,
|
| 114 |
+
"normalized": false,
|
| 115 |
+
"rstrip": false,
|
| 116 |
+
"single_word": false
|
| 117 |
+
},
|
| 118 |
+
"unk_token": {
|
| 119 |
+
"content": "<unk>",
|
| 120 |
+
"lstrip": false,
|
| 121 |
+
"normalized": false,
|
| 122 |
+
"rstrip": false,
|
| 123 |
+
"single_word": false
|
| 124 |
+
}
|
| 125 |
+
}
|
spiece.model
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d60acb128cf7b7f2536e8f38a5b18a05535c9e14c7a355904270e15b0945ea86
|
| 3 |
+
size 791656
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,941 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": true,
|
| 3 |
+
"added_tokens_decoder": {
|
| 4 |
+
"0": {
|
| 5 |
+
"content": "<pad>",
|
| 6 |
+
"lstrip": false,
|
| 7 |
+
"normalized": false,
|
| 8 |
+
"rstrip": false,
|
| 9 |
+
"single_word": false,
|
| 10 |
+
"special": true
|
| 11 |
+
},
|
| 12 |
+
"1": {
|
| 13 |
+
"content": "</s>",
|
| 14 |
+
"lstrip": false,
|
| 15 |
+
"normalized": false,
|
| 16 |
+
"rstrip": false,
|
| 17 |
+
"single_word": false,
|
| 18 |
+
"special": true
|
| 19 |
+
},
|
| 20 |
+
"2": {
|
| 21 |
+
"content": "<unk>",
|
| 22 |
+
"lstrip": false,
|
| 23 |
+
"normalized": false,
|
| 24 |
+
"rstrip": false,
|
| 25 |
+
"single_word": false,
|
| 26 |
+
"special": true
|
| 27 |
+
},
|
| 28 |
+
"32000": {
|
| 29 |
+
"content": "<extra_id_99>",
|
| 30 |
+
"lstrip": false,
|
| 31 |
+
"normalized": false,
|
| 32 |
+
"rstrip": false,
|
| 33 |
+
"single_word": false,
|
| 34 |
+
"special": true
|
| 35 |
+
},
|
| 36 |
+
"32001": {
|
| 37 |
+
"content": "<extra_id_98>",
|
| 38 |
+
"lstrip": false,
|
| 39 |
+
"normalized": false,
|
| 40 |
+
"rstrip": false,
|
| 41 |
+
"single_word": false,
|
| 42 |
+
"special": true
|
| 43 |
+
},
|
| 44 |
+
"32002": {
|
| 45 |
+
"content": "<extra_id_97>",
|
| 46 |
+
"lstrip": false,
|
| 47 |
+
"normalized": false,
|
| 48 |
+
"rstrip": false,
|
| 49 |
+
"single_word": false,
|
| 50 |
+
"special": true
|
| 51 |
+
},
|
| 52 |
+
"32003": {
|
| 53 |
+
"content": "<extra_id_96>",
|
| 54 |
+
"lstrip": false,
|
| 55 |
+
"normalized": false,
|
| 56 |
+
"rstrip": false,
|
| 57 |
+
"single_word": false,
|
| 58 |
+
"special": true
|
| 59 |
+
},
|
| 60 |
+
"32004": {
|
| 61 |
+
"content": "<extra_id_95>",
|
| 62 |
+
"lstrip": false,
|
| 63 |
+
"normalized": false,
|
| 64 |
+
"rstrip": false,
|
| 65 |
+
"single_word": false,
|
| 66 |
+
"special": true
|
| 67 |
+
},
|
| 68 |
+
"32005": {
|
| 69 |
+
"content": "<extra_id_94>",
|
| 70 |
+
"lstrip": false,
|
| 71 |
+
"normalized": false,
|
| 72 |
+
"rstrip": false,
|
| 73 |
+
"single_word": false,
|
| 74 |
+
"special": true
|
| 75 |
+
},
|
| 76 |
+
"32006": {
|
| 77 |
+
"content": "<extra_id_93>",
|
| 78 |
+
"lstrip": false,
|
| 79 |
+
"normalized": false,
|
| 80 |
+
"rstrip": false,
|
| 81 |
+
"single_word": false,
|
| 82 |
+
"special": true
|
| 83 |
+
},
|
| 84 |
+
"32007": {
|
| 85 |
+
"content": "<extra_id_92>",
|
| 86 |
+
"lstrip": false,
|
| 87 |
+
"normalized": false,
|
| 88 |
+
"rstrip": false,
|
| 89 |
+
"single_word": false,
|
| 90 |
+
"special": true
|
| 91 |
+
},
|
| 92 |
+
"32008": {
|
| 93 |
+
"content": "<extra_id_91>",
|
| 94 |
+
"lstrip": false,
|
| 95 |
+
"normalized": false,
|
| 96 |
+
"rstrip": false,
|
| 97 |
+
"single_word": false,
|
| 98 |
+
"special": true
|
| 99 |
+
},
|
| 100 |
+
"32009": {
|
| 101 |
+
"content": "<extra_id_90>",
|
| 102 |
+
"lstrip": false,
|
| 103 |
+
"normalized": false,
|
| 104 |
+
"rstrip": false,
|
| 105 |
+
"single_word": false,
|
| 106 |
+
"special": true
|
| 107 |
+
},
|
| 108 |
+
"32010": {
|
| 109 |
+
"content": "<extra_id_89>",
|
| 110 |
+
"lstrip": false,
|
| 111 |
+
"normalized": false,
|
| 112 |
+
"rstrip": false,
|
| 113 |
+
"single_word": false,
|
| 114 |
+
"special": true
|
| 115 |
+
},
|
| 116 |
+
"32011": {
|
| 117 |
+
"content": "<extra_id_88>",
|
| 118 |
+
"lstrip": false,
|
| 119 |
+
"normalized": false,
|
| 120 |
+
"rstrip": false,
|
| 121 |
+
"single_word": false,
|
| 122 |
+
"special": true
|
| 123 |
+
},
|
| 124 |
+
"32012": {
|
| 125 |
+
"content": "<extra_id_87>",
|
| 126 |
+
"lstrip": false,
|
| 127 |
+
"normalized": false,
|
| 128 |
+
"rstrip": false,
|
| 129 |
+
"single_word": false,
|
| 130 |
+
"special": true
|
| 131 |
+
},
|
| 132 |
+
"32013": {
|
| 133 |
+
"content": "<extra_id_86>",
|
| 134 |
+
"lstrip": false,
|
| 135 |
+
"normalized": false,
|
| 136 |
+
"rstrip": false,
|
| 137 |
+
"single_word": false,
|
| 138 |
+
"special": true
|
| 139 |
+
},
|
| 140 |
+
"32014": {
|
| 141 |
+
"content": "<extra_id_85>",
|
| 142 |
+
"lstrip": false,
|
| 143 |
+
"normalized": false,
|
| 144 |
+
"rstrip": false,
|
| 145 |
+
"single_word": false,
|
| 146 |
+
"special": true
|
| 147 |
+
},
|
| 148 |
+
"32015": {
|
| 149 |
+
"content": "<extra_id_84>",
|
| 150 |
+
"lstrip": false,
|
| 151 |
+
"normalized": false,
|
| 152 |
+
"rstrip": false,
|
| 153 |
+
"single_word": false,
|
| 154 |
+
"special": true
|
| 155 |
+
},
|
| 156 |
+
"32016": {
|
| 157 |
+
"content": "<extra_id_83>",
|
| 158 |
+
"lstrip": false,
|
| 159 |
+
"normalized": false,
|
| 160 |
+
"rstrip": false,
|
| 161 |
+
"single_word": false,
|
| 162 |
+
"special": true
|
| 163 |
+
},
|
| 164 |
+
"32017": {
|
| 165 |
+
"content": "<extra_id_82>",
|
| 166 |
+
"lstrip": false,
|
| 167 |
+
"normalized": false,
|
| 168 |
+
"rstrip": false,
|
| 169 |
+
"single_word": false,
|
| 170 |
+
"special": true
|
| 171 |
+
},
|
| 172 |
+
"32018": {
|
| 173 |
+
"content": "<extra_id_81>",
|
| 174 |
+
"lstrip": false,
|
| 175 |
+
"normalized": false,
|
| 176 |
+
"rstrip": false,
|
| 177 |
+
"single_word": false,
|
| 178 |
+
"special": true
|
| 179 |
+
},
|
| 180 |
+
"32019": {
|
| 181 |
+
"content": "<extra_id_80>",
|
| 182 |
+
"lstrip": false,
|
| 183 |
+
"normalized": false,
|
| 184 |
+
"rstrip": false,
|
| 185 |
+
"single_word": false,
|
| 186 |
+
"special": true
|
| 187 |
+
},
|
| 188 |
+
"32020": {
|
| 189 |
+
"content": "<extra_id_79>",
|
| 190 |
+
"lstrip": false,
|
| 191 |
+
"normalized": false,
|
| 192 |
+
"rstrip": false,
|
| 193 |
+
"single_word": false,
|
| 194 |
+
"special": true
|
| 195 |
+
},
|
| 196 |
+
"32021": {
|
| 197 |
+
"content": "<extra_id_78>",
|
| 198 |
+
"lstrip": false,
|
| 199 |
+
"normalized": false,
|
| 200 |
+
"rstrip": false,
|
| 201 |
+
"single_word": false,
|
| 202 |
+
"special": true
|
| 203 |
+
},
|
| 204 |
+
"32022": {
|
| 205 |
+
"content": "<extra_id_77>",
|
| 206 |
+
"lstrip": false,
|
| 207 |
+
"normalized": false,
|
| 208 |
+
"rstrip": false,
|
| 209 |
+
"single_word": false,
|
| 210 |
+
"special": true
|
| 211 |
+
},
|
| 212 |
+
"32023": {
|
| 213 |
+
"content": "<extra_id_76>",
|
| 214 |
+
"lstrip": false,
|
| 215 |
+
"normalized": false,
|
| 216 |
+
"rstrip": false,
|
| 217 |
+
"single_word": false,
|
| 218 |
+
"special": true
|
| 219 |
+
},
|
| 220 |
+
"32024": {
|
| 221 |
+
"content": "<extra_id_75>",
|
| 222 |
+
"lstrip": false,
|
| 223 |
+
"normalized": false,
|
| 224 |
+
"rstrip": false,
|
| 225 |
+
"single_word": false,
|
| 226 |
+
"special": true
|
| 227 |
+
},
|
| 228 |
+
"32025": {
|
| 229 |
+
"content": "<extra_id_74>",
|
| 230 |
+
"lstrip": false,
|
| 231 |
+
"normalized": false,
|
| 232 |
+
"rstrip": false,
|
| 233 |
+
"single_word": false,
|
| 234 |
+
"special": true
|
| 235 |
+
},
|
| 236 |
+
"32026": {
|
| 237 |
+
"content": "<extra_id_73>",
|
| 238 |
+
"lstrip": false,
|
| 239 |
+
"normalized": false,
|
| 240 |
+
"rstrip": false,
|
| 241 |
+
"single_word": false,
|
| 242 |
+
"special": true
|
| 243 |
+
},
|
| 244 |
+
"32027": {
|
| 245 |
+
"content": "<extra_id_72>",
|
| 246 |
+
"lstrip": false,
|
| 247 |
+
"normalized": false,
|
| 248 |
+
"rstrip": false,
|
| 249 |
+
"single_word": false,
|
| 250 |
+
"special": true
|
| 251 |
+
},
|
| 252 |
+
"32028": {
|
| 253 |
+
"content": "<extra_id_71>",
|
| 254 |
+
"lstrip": false,
|
| 255 |
+
"normalized": false,
|
| 256 |
+
"rstrip": false,
|
| 257 |
+
"single_word": false,
|
| 258 |
+
"special": true
|
| 259 |
+
},
|
| 260 |
+
"32029": {
|
| 261 |
+
"content": "<extra_id_70>",
|
| 262 |
+
"lstrip": false,
|
| 263 |
+
"normalized": false,
|
| 264 |
+
"rstrip": false,
|
| 265 |
+
"single_word": false,
|
| 266 |
+
"special": true
|
| 267 |
+
},
|
| 268 |
+
"32030": {
|
| 269 |
+
"content": "<extra_id_69>",
|
| 270 |
+
"lstrip": false,
|
| 271 |
+
"normalized": false,
|
| 272 |
+
"rstrip": false,
|
| 273 |
+
"single_word": false,
|
| 274 |
+
"special": true
|
| 275 |
+
},
|
| 276 |
+
"32031": {
|
| 277 |
+
"content": "<extra_id_68>",
|
| 278 |
+
"lstrip": false,
|
| 279 |
+
"normalized": false,
|
| 280 |
+
"rstrip": false,
|
| 281 |
+
"single_word": false,
|
| 282 |
+
"special": true
|
| 283 |
+
},
|
| 284 |
+
"32032": {
|
| 285 |
+
"content": "<extra_id_67>",
|
| 286 |
+
"lstrip": false,
|
| 287 |
+
"normalized": false,
|
| 288 |
+
"rstrip": false,
|
| 289 |
+
"single_word": false,
|
| 290 |
+
"special": true
|
| 291 |
+
},
|
| 292 |
+
"32033": {
|
| 293 |
+
"content": "<extra_id_66>",
|
| 294 |
+
"lstrip": false,
|
| 295 |
+
"normalized": false,
|
| 296 |
+
"rstrip": false,
|
| 297 |
+
"single_word": false,
|
| 298 |
+
"special": true
|
| 299 |
+
},
|
| 300 |
+
"32034": {
|
| 301 |
+
"content": "<extra_id_65>",
|
| 302 |
+
"lstrip": false,
|
| 303 |
+
"normalized": false,
|
| 304 |
+
"rstrip": false,
|
| 305 |
+
"single_word": false,
|
| 306 |
+
"special": true
|
| 307 |
+
},
|
| 308 |
+
"32035": {
|
| 309 |
+
"content": "<extra_id_64>",
|
| 310 |
+
"lstrip": false,
|
| 311 |
+
"normalized": false,
|
| 312 |
+
"rstrip": false,
|
| 313 |
+
"single_word": false,
|
| 314 |
+
"special": true
|
| 315 |
+
},
|
| 316 |
+
"32036": {
|
| 317 |
+
"content": "<extra_id_63>",
|
| 318 |
+
"lstrip": false,
|
| 319 |
+
"normalized": false,
|
| 320 |
+
"rstrip": false,
|
| 321 |
+
"single_word": false,
|
| 322 |
+
"special": true
|
| 323 |
+
},
|
| 324 |
+
"32037": {
|
| 325 |
+
"content": "<extra_id_62>",
|
| 326 |
+
"lstrip": false,
|
| 327 |
+
"normalized": false,
|
| 328 |
+
"rstrip": false,
|
| 329 |
+
"single_word": false,
|
| 330 |
+
"special": true
|
| 331 |
+
},
|
| 332 |
+
"32038": {
|
| 333 |
+
"content": "<extra_id_61>",
|
| 334 |
+
"lstrip": false,
|
| 335 |
+
"normalized": false,
|
| 336 |
+
"rstrip": false,
|
| 337 |
+
"single_word": false,
|
| 338 |
+
"special": true
|
| 339 |
+
},
|
| 340 |
+
"32039": {
|
| 341 |
+
"content": "<extra_id_60>",
|
| 342 |
+
"lstrip": false,
|
| 343 |
+
"normalized": false,
|
| 344 |
+
"rstrip": false,
|
| 345 |
+
"single_word": false,
|
| 346 |
+
"special": true
|
| 347 |
+
},
|
| 348 |
+
"32040": {
|
| 349 |
+
"content": "<extra_id_59>",
|
| 350 |
+
"lstrip": false,
|
| 351 |
+
"normalized": false,
|
| 352 |
+
"rstrip": false,
|
| 353 |
+
"single_word": false,
|
| 354 |
+
"special": true
|
| 355 |
+
},
|
| 356 |
+
"32041": {
|
| 357 |
+
"content": "<extra_id_58>",
|
| 358 |
+
"lstrip": false,
|
| 359 |
+
"normalized": false,
|
| 360 |
+
"rstrip": false,
|
| 361 |
+
"single_word": false,
|
| 362 |
+
"special": true
|
| 363 |
+
},
|
| 364 |
+
"32042": {
|
| 365 |
+
"content": "<extra_id_57>",
|
| 366 |
+
"lstrip": false,
|
| 367 |
+
"normalized": false,
|
| 368 |
+
"rstrip": false,
|
| 369 |
+
"single_word": false,
|
| 370 |
+
"special": true
|
| 371 |
+
},
|
| 372 |
+
"32043": {
|
| 373 |
+
"content": "<extra_id_56>",
|
| 374 |
+
"lstrip": false,
|
| 375 |
+
"normalized": false,
|
| 376 |
+
"rstrip": false,
|
| 377 |
+
"single_word": false,
|
| 378 |
+
"special": true
|
| 379 |
+
},
|
| 380 |
+
"32044": {
|
| 381 |
+
"content": "<extra_id_55>",
|
| 382 |
+
"lstrip": false,
|
| 383 |
+
"normalized": false,
|
| 384 |
+
"rstrip": false,
|
| 385 |
+
"single_word": false,
|
| 386 |
+
"special": true
|
| 387 |
+
},
|
| 388 |
+
"32045": {
|
| 389 |
+
"content": "<extra_id_54>",
|
| 390 |
+
"lstrip": false,
|
| 391 |
+
"normalized": false,
|
| 392 |
+
"rstrip": false,
|
| 393 |
+
"single_word": false,
|
| 394 |
+
"special": true
|
| 395 |
+
},
|
| 396 |
+
"32046": {
|
| 397 |
+
"content": "<extra_id_53>",
|
| 398 |
+
"lstrip": false,
|
| 399 |
+
"normalized": false,
|
| 400 |
+
"rstrip": false,
|
| 401 |
+
"single_word": false,
|
| 402 |
+
"special": true
|
| 403 |
+
},
|
| 404 |
+
"32047": {
|
| 405 |
+
"content": "<extra_id_52>",
|
| 406 |
+
"lstrip": false,
|
| 407 |
+
"normalized": false,
|
| 408 |
+
"rstrip": false,
|
| 409 |
+
"single_word": false,
|
| 410 |
+
"special": true
|
| 411 |
+
},
|
| 412 |
+
"32048": {
|
| 413 |
+
"content": "<extra_id_51>",
|
| 414 |
+
"lstrip": false,
|
| 415 |
+
"normalized": false,
|
| 416 |
+
"rstrip": false,
|
| 417 |
+
"single_word": false,
|
| 418 |
+
"special": true
|
| 419 |
+
},
|
| 420 |
+
"32049": {
|
| 421 |
+
"content": "<extra_id_50>",
|
| 422 |
+
"lstrip": false,
|
| 423 |
+
"normalized": false,
|
| 424 |
+
"rstrip": false,
|
| 425 |
+
"single_word": false,
|
| 426 |
+
"special": true
|
| 427 |
+
},
|
| 428 |
+
"32050": {
|
| 429 |
+
"content": "<extra_id_49>",
|
| 430 |
+
"lstrip": false,
|
| 431 |
+
"normalized": false,
|
| 432 |
+
"rstrip": false,
|
| 433 |
+
"single_word": false,
|
| 434 |
+
"special": true
|
| 435 |
+
},
|
| 436 |
+
"32051": {
|
| 437 |
+
"content": "<extra_id_48>",
|
| 438 |
+
"lstrip": false,
|
| 439 |
+
"normalized": false,
|
| 440 |
+
"rstrip": false,
|
| 441 |
+
"single_word": false,
|
| 442 |
+
"special": true
|
| 443 |
+
},
|
| 444 |
+
"32052": {
|
| 445 |
+
"content": "<extra_id_47>",
|
| 446 |
+
"lstrip": false,
|
| 447 |
+
"normalized": false,
|
| 448 |
+
"rstrip": false,
|
| 449 |
+
"single_word": false,
|
| 450 |
+
"special": true
|
| 451 |
+
},
|
| 452 |
+
"32053": {
|
| 453 |
+
"content": "<extra_id_46>",
|
| 454 |
+
"lstrip": false,
|
| 455 |
+
"normalized": false,
|
| 456 |
+
"rstrip": false,
|
| 457 |
+
"single_word": false,
|
| 458 |
+
"special": true
|
| 459 |
+
},
|
| 460 |
+
"32054": {
|
| 461 |
+
"content": "<extra_id_45>",
|
| 462 |
+
"lstrip": false,
|
| 463 |
+
"normalized": false,
|
| 464 |
+
"rstrip": false,
|
| 465 |
+
"single_word": false,
|
| 466 |
+
"special": true
|
| 467 |
+
},
|
| 468 |
+
"32055": {
|
| 469 |
+
"content": "<extra_id_44>",
|
| 470 |
+
"lstrip": false,
|
| 471 |
+
"normalized": false,
|
| 472 |
+
"rstrip": false,
|
| 473 |
+
"single_word": false,
|
| 474 |
+
"special": true
|
| 475 |
+
},
|
| 476 |
+
"32056": {
|
| 477 |
+
"content": "<extra_id_43>",
|
| 478 |
+
"lstrip": false,
|
| 479 |
+
"normalized": false,
|
| 480 |
+
"rstrip": false,
|
| 481 |
+
"single_word": false,
|
| 482 |
+
"special": true
|
| 483 |
+
},
|
| 484 |
+
"32057": {
|
| 485 |
+
"content": "<extra_id_42>",
|
| 486 |
+
"lstrip": false,
|
| 487 |
+
"normalized": false,
|
| 488 |
+
"rstrip": false,
|
| 489 |
+
"single_word": false,
|
| 490 |
+
"special": true
|
| 491 |
+
},
|
| 492 |
+
"32058": {
|
| 493 |
+
"content": "<extra_id_41>",
|
| 494 |
+
"lstrip": false,
|
| 495 |
+
"normalized": false,
|
| 496 |
+
"rstrip": false,
|
| 497 |
+
"single_word": false,
|
| 498 |
+
"special": true
|
| 499 |
+
},
|
| 500 |
+
"32059": {
|
| 501 |
+
"content": "<extra_id_40>",
|
| 502 |
+
"lstrip": false,
|
| 503 |
+
"normalized": false,
|
| 504 |
+
"rstrip": false,
|
| 505 |
+
"single_word": false,
|
| 506 |
+
"special": true
|
| 507 |
+
},
|
| 508 |
+
"32060": {
|
| 509 |
+
"content": "<extra_id_39>",
|
| 510 |
+
"lstrip": false,
|
| 511 |
+
"normalized": false,
|
| 512 |
+
"rstrip": false,
|
| 513 |
+
"single_word": false,
|
| 514 |
+
"special": true
|
| 515 |
+
},
|
| 516 |
+
"32061": {
|
| 517 |
+
"content": "<extra_id_38>",
|
| 518 |
+
"lstrip": false,
|
| 519 |
+
"normalized": false,
|
| 520 |
+
"rstrip": false,
|
| 521 |
+
"single_word": false,
|
| 522 |
+
"special": true
|
| 523 |
+
},
|
| 524 |
+
"32062": {
|
| 525 |
+
"content": "<extra_id_37>",
|
| 526 |
+
"lstrip": false,
|
| 527 |
+
"normalized": false,
|
| 528 |
+
"rstrip": false,
|
| 529 |
+
"single_word": false,
|
| 530 |
+
"special": true
|
| 531 |
+
},
|
| 532 |
+
"32063": {
|
| 533 |
+
"content": "<extra_id_36>",
|
| 534 |
+
"lstrip": false,
|
| 535 |
+
"normalized": false,
|
| 536 |
+
"rstrip": false,
|
| 537 |
+
"single_word": false,
|
| 538 |
+
"special": true
|
| 539 |
+
},
|
| 540 |
+
"32064": {
|
| 541 |
+
"content": "<extra_id_35>",
|
| 542 |
+
"lstrip": false,
|
| 543 |
+
"normalized": false,
|
| 544 |
+
"rstrip": false,
|
| 545 |
+
"single_word": false,
|
| 546 |
+
"special": true
|
| 547 |
+
},
|
| 548 |
+
"32065": {
|
| 549 |
+
"content": "<extra_id_34>",
|
| 550 |
+
"lstrip": false,
|
| 551 |
+
"normalized": false,
|
| 552 |
+
"rstrip": false,
|
| 553 |
+
"single_word": false,
|
| 554 |
+
"special": true
|
| 555 |
+
},
|
| 556 |
+
"32066": {
|
| 557 |
+
"content": "<extra_id_33>",
|
| 558 |
+
"lstrip": false,
|
| 559 |
+
"normalized": false,
|
| 560 |
+
"rstrip": false,
|
| 561 |
+
"single_word": false,
|
| 562 |
+
"special": true
|
| 563 |
+
},
|
| 564 |
+
"32067": {
|
| 565 |
+
"content": "<extra_id_32>",
|
| 566 |
+
"lstrip": false,
|
| 567 |
+
"normalized": false,
|
| 568 |
+
"rstrip": false,
|
| 569 |
+
"single_word": false,
|
| 570 |
+
"special": true
|
| 571 |
+
},
|
| 572 |
+
"32068": {
|
| 573 |
+
"content": "<extra_id_31>",
|
| 574 |
+
"lstrip": false,
|
| 575 |
+
"normalized": false,
|
| 576 |
+
"rstrip": false,
|
| 577 |
+
"single_word": false,
|
| 578 |
+
"special": true
|
| 579 |
+
},
|
| 580 |
+
"32069": {
|
| 581 |
+
"content": "<extra_id_30>",
|
| 582 |
+
"lstrip": false,
|
| 583 |
+
"normalized": false,
|
| 584 |
+
"rstrip": false,
|
| 585 |
+
"single_word": false,
|
| 586 |
+
"special": true
|
| 587 |
+
},
|
| 588 |
+
"32070": {
|
| 589 |
+
"content": "<extra_id_29>",
|
| 590 |
+
"lstrip": false,
|
| 591 |
+
"normalized": false,
|
| 592 |
+
"rstrip": false,
|
| 593 |
+
"single_word": false,
|
| 594 |
+
"special": true
|
| 595 |
+
},
|
| 596 |
+
"32071": {
|
| 597 |
+
"content": "<extra_id_28>",
|
| 598 |
+
"lstrip": false,
|
| 599 |
+
"normalized": false,
|
| 600 |
+
"rstrip": false,
|
| 601 |
+
"single_word": false,
|
| 602 |
+
"special": true
|
| 603 |
+
},
|
| 604 |
+
"32072": {
|
| 605 |
+
"content": "<extra_id_27>",
|
| 606 |
+
"lstrip": false,
|
| 607 |
+
"normalized": false,
|
| 608 |
+
"rstrip": false,
|
| 609 |
+
"single_word": false,
|
| 610 |
+
"special": true
|
| 611 |
+
},
|
| 612 |
+
"32073": {
|
| 613 |
+
"content": "<extra_id_26>",
|
| 614 |
+
"lstrip": false,
|
| 615 |
+
"normalized": false,
|
| 616 |
+
"rstrip": false,
|
| 617 |
+
"single_word": false,
|
| 618 |
+
"special": true
|
| 619 |
+
},
|
| 620 |
+
"32074": {
|
| 621 |
+
"content": "<extra_id_25>",
|
| 622 |
+
"lstrip": false,
|
| 623 |
+
"normalized": false,
|
| 624 |
+
"rstrip": false,
|
| 625 |
+
"single_word": false,
|
| 626 |
+
"special": true
|
| 627 |
+
},
|
| 628 |
+
"32075": {
|
| 629 |
+
"content": "<extra_id_24>",
|
| 630 |
+
"lstrip": false,
|
| 631 |
+
"normalized": false,
|
| 632 |
+
"rstrip": false,
|
| 633 |
+
"single_word": false,
|
| 634 |
+
"special": true
|
| 635 |
+
},
|
| 636 |
+
"32076": {
|
| 637 |
+
"content": "<extra_id_23>",
|
| 638 |
+
"lstrip": false,
|
| 639 |
+
"normalized": false,
|
| 640 |
+
"rstrip": false,
|
| 641 |
+
"single_word": false,
|
| 642 |
+
"special": true
|
| 643 |
+
},
|
| 644 |
+
"32077": {
|
| 645 |
+
"content": "<extra_id_22>",
|
| 646 |
+
"lstrip": false,
|
| 647 |
+
"normalized": false,
|
| 648 |
+
"rstrip": false,
|
| 649 |
+
"single_word": false,
|
| 650 |
+
"special": true
|
| 651 |
+
},
|
| 652 |
+
"32078": {
|
| 653 |
+
"content": "<extra_id_21>",
|
| 654 |
+
"lstrip": false,
|
| 655 |
+
"normalized": false,
|
| 656 |
+
"rstrip": false,
|
| 657 |
+
"single_word": false,
|
| 658 |
+
"special": true
|
| 659 |
+
},
|
| 660 |
+
"32079": {
|
| 661 |
+
"content": "<extra_id_20>",
|
| 662 |
+
"lstrip": false,
|
| 663 |
+
"normalized": false,
|
| 664 |
+
"rstrip": false,
|
| 665 |
+
"single_word": false,
|
| 666 |
+
"special": true
|
| 667 |
+
},
|
| 668 |
+
"32080": {
|
| 669 |
+
"content": "<extra_id_19>",
|
| 670 |
+
"lstrip": false,
|
| 671 |
+
"normalized": false,
|
| 672 |
+
"rstrip": false,
|
| 673 |
+
"single_word": false,
|
| 674 |
+
"special": true
|
| 675 |
+
},
|
| 676 |
+
"32081": {
|
| 677 |
+
"content": "<extra_id_18>",
|
| 678 |
+
"lstrip": false,
|
| 679 |
+
"normalized": false,
|
| 680 |
+
"rstrip": false,
|
| 681 |
+
"single_word": false,
|
| 682 |
+
"special": true
|
| 683 |
+
},
|
| 684 |
+
"32082": {
|
| 685 |
+
"content": "<extra_id_17>",
|
| 686 |
+
"lstrip": false,
|
| 687 |
+
"normalized": false,
|
| 688 |
+
"rstrip": false,
|
| 689 |
+
"single_word": false,
|
| 690 |
+
"special": true
|
| 691 |
+
},
|
| 692 |
+
"32083": {
|
| 693 |
+
"content": "<extra_id_16>",
|
| 694 |
+
"lstrip": false,
|
| 695 |
+
"normalized": false,
|
| 696 |
+
"rstrip": false,
|
| 697 |
+
"single_word": false,
|
| 698 |
+
"special": true
|
| 699 |
+
},
|
| 700 |
+
"32084": {
|
| 701 |
+
"content": "<extra_id_15>",
|
| 702 |
+
"lstrip": false,
|
| 703 |
+
"normalized": false,
|
| 704 |
+
"rstrip": false,
|
| 705 |
+
"single_word": false,
|
| 706 |
+
"special": true
|
| 707 |
+
},
|
| 708 |
+
"32085": {
|
| 709 |
+
"content": "<extra_id_14>",
|
| 710 |
+
"lstrip": false,
|
| 711 |
+
"normalized": false,
|
| 712 |
+
"rstrip": false,
|
| 713 |
+
"single_word": false,
|
| 714 |
+
"special": true
|
| 715 |
+
},
|
| 716 |
+
"32086": {
|
| 717 |
+
"content": "<extra_id_13>",
|
| 718 |
+
"lstrip": false,
|
| 719 |
+
"normalized": false,
|
| 720 |
+
"rstrip": false,
|
| 721 |
+
"single_word": false,
|
| 722 |
+
"special": true
|
| 723 |
+
},
|
| 724 |
+
"32087": {
|
| 725 |
+
"content": "<extra_id_12>",
|
| 726 |
+
"lstrip": false,
|
| 727 |
+
"normalized": false,
|
| 728 |
+
"rstrip": false,
|
| 729 |
+
"single_word": false,
|
| 730 |
+
"special": true
|
| 731 |
+
},
|
| 732 |
+
"32088": {
|
| 733 |
+
"content": "<extra_id_11>",
|
| 734 |
+
"lstrip": false,
|
| 735 |
+
"normalized": false,
|
| 736 |
+
"rstrip": false,
|
| 737 |
+
"single_word": false,
|
| 738 |
+
"special": true
|
| 739 |
+
},
|
| 740 |
+
"32089": {
|
| 741 |
+
"content": "<extra_id_10>",
|
| 742 |
+
"lstrip": false,
|
| 743 |
+
"normalized": false,
|
| 744 |
+
"rstrip": false,
|
| 745 |
+
"single_word": false,
|
| 746 |
+
"special": true
|
| 747 |
+
},
|
| 748 |
+
"32090": {
|
| 749 |
+
"content": "<extra_id_9>",
|
| 750 |
+
"lstrip": false,
|
| 751 |
+
"normalized": false,
|
| 752 |
+
"rstrip": false,
|
| 753 |
+
"single_word": false,
|
| 754 |
+
"special": true
|
| 755 |
+
},
|
| 756 |
+
"32091": {
|
| 757 |
+
"content": "<extra_id_8>",
|
| 758 |
+
"lstrip": false,
|
| 759 |
+
"normalized": false,
|
| 760 |
+
"rstrip": false,
|
| 761 |
+
"single_word": false,
|
| 762 |
+
"special": true
|
| 763 |
+
},
|
| 764 |
+
"32092": {
|
| 765 |
+
"content": "<extra_id_7>",
|
| 766 |
+
"lstrip": false,
|
| 767 |
+
"normalized": false,
|
| 768 |
+
"rstrip": false,
|
| 769 |
+
"single_word": false,
|
| 770 |
+
"special": true
|
| 771 |
+
},
|
| 772 |
+
"32093": {
|
| 773 |
+
"content": "<extra_id_6>",
|
| 774 |
+
"lstrip": false,
|
| 775 |
+
"normalized": false,
|
| 776 |
+
"rstrip": false,
|
| 777 |
+
"single_word": false,
|
| 778 |
+
"special": true
|
| 779 |
+
},
|
| 780 |
+
"32094": {
|
| 781 |
+
"content": "<extra_id_5>",
|
| 782 |
+
"lstrip": false,
|
| 783 |
+
"normalized": false,
|
| 784 |
+
"rstrip": false,
|
| 785 |
+
"single_word": false,
|
| 786 |
+
"special": true
|
| 787 |
+
},
|
| 788 |
+
"32095": {
|
| 789 |
+
"content": "<extra_id_4>",
|
| 790 |
+
"lstrip": false,
|
| 791 |
+
"normalized": false,
|
| 792 |
+
"rstrip": false,
|
| 793 |
+
"single_word": false,
|
| 794 |
+
"special": true
|
| 795 |
+
},
|
| 796 |
+
"32096": {
|
| 797 |
+
"content": "<extra_id_3>",
|
| 798 |
+
"lstrip": false,
|
| 799 |
+
"normalized": false,
|
| 800 |
+
"rstrip": false,
|
| 801 |
+
"single_word": false,
|
| 802 |
+
"special": true
|
| 803 |
+
},
|
| 804 |
+
"32097": {
|
| 805 |
+
"content": "<extra_id_2>",
|
| 806 |
+
"lstrip": false,
|
| 807 |
+
"normalized": false,
|
| 808 |
+
"rstrip": false,
|
| 809 |
+
"single_word": false,
|
| 810 |
+
"special": true
|
| 811 |
+
},
|
| 812 |
+
"32098": {
|
| 813 |
+
"content": "<extra_id_1>",
|
| 814 |
+
"lstrip": false,
|
| 815 |
+
"normalized": false,
|
| 816 |
+
"rstrip": false,
|
| 817 |
+
"single_word": false,
|
| 818 |
+
"special": true
|
| 819 |
+
},
|
| 820 |
+
"32099": {
|
| 821 |
+
"content": "<extra_id_0>",
|
| 822 |
+
"lstrip": false,
|
| 823 |
+
"normalized": false,
|
| 824 |
+
"rstrip": false,
|
| 825 |
+
"single_word": false,
|
| 826 |
+
"special": true
|
| 827 |
+
}
|
| 828 |
+
},
|
| 829 |
+
"additional_special_tokens": [
|
| 830 |
+
"<extra_id_0>",
|
| 831 |
+
"<extra_id_1>",
|
| 832 |
+
"<extra_id_2>",
|
| 833 |
+
"<extra_id_3>",
|
| 834 |
+
"<extra_id_4>",
|
| 835 |
+
"<extra_id_5>",
|
| 836 |
+
"<extra_id_6>",
|
| 837 |
+
"<extra_id_7>",
|
| 838 |
+
"<extra_id_8>",
|
| 839 |
+
"<extra_id_9>",
|
| 840 |
+
"<extra_id_10>",
|
| 841 |
+
"<extra_id_11>",
|
| 842 |
+
"<extra_id_12>",
|
| 843 |
+
"<extra_id_13>",
|
| 844 |
+
"<extra_id_14>",
|
| 845 |
+
"<extra_id_15>",
|
| 846 |
+
"<extra_id_16>",
|
| 847 |
+
"<extra_id_17>",
|
| 848 |
+
"<extra_id_18>",
|
| 849 |
+
"<extra_id_19>",
|
| 850 |
+
"<extra_id_20>",
|
| 851 |
+
"<extra_id_21>",
|
| 852 |
+
"<extra_id_22>",
|
| 853 |
+
"<extra_id_23>",
|
| 854 |
+
"<extra_id_24>",
|
| 855 |
+
"<extra_id_25>",
|
| 856 |
+
"<extra_id_26>",
|
| 857 |
+
"<extra_id_27>",
|
| 858 |
+
"<extra_id_28>",
|
| 859 |
+
"<extra_id_29>",
|
| 860 |
+
"<extra_id_30>",
|
| 861 |
+
"<extra_id_31>",
|
| 862 |
+
"<extra_id_32>",
|
| 863 |
+
"<extra_id_33>",
|
| 864 |
+
"<extra_id_34>",
|
| 865 |
+
"<extra_id_35>",
|
| 866 |
+
"<extra_id_36>",
|
| 867 |
+
"<extra_id_37>",
|
| 868 |
+
"<extra_id_38>",
|
| 869 |
+
"<extra_id_39>",
|
| 870 |
+
"<extra_id_40>",
|
| 871 |
+
"<extra_id_41>",
|
| 872 |
+
"<extra_id_42>",
|
| 873 |
+
"<extra_id_43>",
|
| 874 |
+
"<extra_id_44>",
|
| 875 |
+
"<extra_id_45>",
|
| 876 |
+
"<extra_id_46>",
|
| 877 |
+
"<extra_id_47>",
|
| 878 |
+
"<extra_id_48>",
|
| 879 |
+
"<extra_id_49>",
|
| 880 |
+
"<extra_id_50>",
|
| 881 |
+
"<extra_id_51>",
|
| 882 |
+
"<extra_id_52>",
|
| 883 |
+
"<extra_id_53>",
|
| 884 |
+
"<extra_id_54>",
|
| 885 |
+
"<extra_id_55>",
|
| 886 |
+
"<extra_id_56>",
|
| 887 |
+
"<extra_id_57>",
|
| 888 |
+
"<extra_id_58>",
|
| 889 |
+
"<extra_id_59>",
|
| 890 |
+
"<extra_id_60>",
|
| 891 |
+
"<extra_id_61>",
|
| 892 |
+
"<extra_id_62>",
|
| 893 |
+
"<extra_id_63>",
|
| 894 |
+
"<extra_id_64>",
|
| 895 |
+
"<extra_id_65>",
|
| 896 |
+
"<extra_id_66>",
|
| 897 |
+
"<extra_id_67>",
|
| 898 |
+
"<extra_id_68>",
|
| 899 |
+
"<extra_id_69>",
|
| 900 |
+
"<extra_id_70>",
|
| 901 |
+
"<extra_id_71>",
|
| 902 |
+
"<extra_id_72>",
|
| 903 |
+
"<extra_id_73>",
|
| 904 |
+
"<extra_id_74>",
|
| 905 |
+
"<extra_id_75>",
|
| 906 |
+
"<extra_id_76>",
|
| 907 |
+
"<extra_id_77>",
|
| 908 |
+
"<extra_id_78>",
|
| 909 |
+
"<extra_id_79>",
|
| 910 |
+
"<extra_id_80>",
|
| 911 |
+
"<extra_id_81>",
|
| 912 |
+
"<extra_id_82>",
|
| 913 |
+
"<extra_id_83>",
|
| 914 |
+
"<extra_id_84>",
|
| 915 |
+
"<extra_id_85>",
|
| 916 |
+
"<extra_id_86>",
|
| 917 |
+
"<extra_id_87>",
|
| 918 |
+
"<extra_id_88>",
|
| 919 |
+
"<extra_id_89>",
|
| 920 |
+
"<extra_id_90>",
|
| 921 |
+
"<extra_id_91>",
|
| 922 |
+
"<extra_id_92>",
|
| 923 |
+
"<extra_id_93>",
|
| 924 |
+
"<extra_id_94>",
|
| 925 |
+
"<extra_id_95>",
|
| 926 |
+
"<extra_id_96>",
|
| 927 |
+
"<extra_id_97>",
|
| 928 |
+
"<extra_id_98>",
|
| 929 |
+
"<extra_id_99>"
|
| 930 |
+
],
|
| 931 |
+
"clean_up_tokenization_spaces": false,
|
| 932 |
+
"eos_token": "</s>",
|
| 933 |
+
"extra_ids": 100,
|
| 934 |
+
"extra_special_tokens": {},
|
| 935 |
+
"legacy": true,
|
| 936 |
+
"model_max_length": 512,
|
| 937 |
+
"pad_token": "<pad>",
|
| 938 |
+
"sp_model_kwargs": {},
|
| 939 |
+
"tokenizer_class": "T5Tokenizer",
|
| 940 |
+
"unk_token": "<unk>"
|
| 941 |
+
}
|
trainer_log_history.csv
ADDED
|
@@ -0,0 +1,22 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
loss,grad_norm,learning_rate,epoch,step,eval_loss,eval_rouge1,eval_rouge2,eval_rougeL,eval_rougeLsum,eval_comp_ratio_mean,eval_comp_ratio_p90,eval_pct_violations,eval_runtime,eval_samples_per_second,eval_steps_per_second,train_runtime,train_samples_per_second,train_steps_per_second,total_flos,train_loss
|
| 2 |
+
1.2576,4.137552261352539,9.993726474278545e-05,1.0,1594,,,,,,,,,,,,,,,,
|
| 3 |
+
,,,1.0,1594,0.6456611752510071,0.8528364838438867,0.6586661877158779,0.819669484356327,0.8199279895987595,0.6625529499788416,0.7735849056603774,0.0,348.4597,6.457,0.809,,,,,
|
| 4 |
+
0.7688,2.573607921600342,8.889585947302385e-05,2.0,3188,,,,,,,,,,,,,,,,
|
| 5 |
+
,,,2.0,3188,0.5727154016494751,0.8689474874470693,0.6851048712818919,0.8344968972913525,0.8348697620320389,0.6647169049402463,0.7693568726355614,0.0,351.9831,6.392,0.801,,,,,
|
| 6 |
+
0.6591,2.8348424434661865,7.778474836191274e-05,3.0,4782,,,,,,,,,,,,,,,,
|
| 7 |
+
,,,3.0,4782,0.540533185005188,0.8750090385391008,0.6962724429732516,0.8413137437626381,0.8416859973545814,0.6684329676796582,0.7692307692307693,0.0,348.7712,6.451,0.809,,,,,
|
| 8 |
+
0.5957,3.485762357711792,6.667363725080162e-05,4.0,6376,,,,,,,,,,,,,,,,
|
| 9 |
+
,,,4.0,6376,0.5332812666893005,0.8771052673801956,0.7001549309721768,0.8437587332299845,0.8440177737862178,0.6599911370905979,0.7659574468085106,0.0,349.0362,6.446,0.808,,,,,
|
| 10 |
+
0.548,3.5571165084838867,5.5562526139690505e-05,5.0,7970,,,,,,,,,,,,,,,,
|
| 11 |
+
,,,5.0,7970,0.5211982727050781,0.8791677438074604,0.705867129952528,0.8467215753256596,0.8470289702216209,0.6616752445040748,0.7647977941176471,0.00044444444444444447,349.8527,6.431,0.806,,,,,
|
| 12 |
+
0.5139,2.7198104858398438,4.4451415028579393e-05,6.0,9564,,,,,,,,,,,,,,,,
|
| 13 |
+
,,,6.0,9564,0.519557535648346,0.8798753890192361,0.7063785492018196,0.8472026138402767,0.8473323428755936,0.659716430748814,0.7636363636363637,0.0,349.511,6.438,0.807,,,,,
|
| 14 |
+
0.4862,3.862455129623413,3.334030391746828e-05,7.0,11158,,,,,,,,,,,,,,,,
|
| 15 |
+
,,,7.0,11158,0.5143899917602539,0.8804832259801447,0.7076066408588376,0.8472581234680618,0.8474001918954426,0.6656473506268799,0.7704918032786885,0.00044444444444444447,354.8833,6.34,0.795,,,,,
|
| 16 |
+
0.466,3.4800119400024414,2.2229192806357174e-05,8.0,12752,,,,,,,,,,,,,,,,
|
| 17 |
+
,,,8.0,12752,0.5157203674316406,0.8819074337590058,0.709796994728016,0.848893205872066,0.8492418483342343,0.6622329738091887,0.7674418604651163,0.0,348.7139,6.452,0.809,,,,,
|
| 18 |
+
0.4499,2.951266288757324,1.1118081695246062e-05,9.0,14346,,,,,,,,,,,,,,,,
|
| 19 |
+
,,,9.0,14346,0.5155828595161438,0.8816110197232148,0.7095513359852312,0.8486239346338051,0.8488813378849979,0.6603841749226899,0.7659574468085106,0.0,351.8852,6.394,0.801,,,,,
|
| 20 |
+
0.4393,2.2352824211120605,6.97058413495051e-09,10.0,15940,,,,,,,,,,,,,,,,
|
| 21 |
+
,,,10.0,15940,0.5180955529212952,0.88198348953885,0.7104253099452262,0.848536449353483,0.8488333749106418,0.6611201077120823,0.7674418604651163,0.0,350.1548,6.426,0.805,,,,,
|
| 22 |
+
,,,10.0,15940,,,,,,,,,,,,7097.9611,17.963,2.246,2976335712768000.0,0.6184487030527074
|
training_args.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f69853457e9c7f766c33cb6572ae7a83647dd580e9309b65ed7e66e400b44226
|
| 3 |
+
size 5752
|
training_loss.png
ADDED
|