Text Classification · Transformers · Safetensors · roberta · Generated from Trainer
Commit 7db2517 (verified), committed by cedricbonhomme
1 Parent(s): 617a19b

End of training

Files changed (2)
  1. README.md +11 -11
  2. emissions.csv +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.5002
- - Accuracy: 0.8264
+ - Loss: 0.4816
+ - Accuracy: 0.8291

  ## Model description

@@ -39,8 +39,8 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 3e-05
- - train_batch_size: 16
- - eval_batch_size: 16
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
@@ -48,13 +48,13 @@ The following hyperparameters were used during training:

  ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:-----:|:------:|:---------------:|:--------:|
- | 0.6004 | 1.0 | 29952 | 0.6255 | 0.7471 |
- | 0.5196 | 2.0 | 59904 | 0.5728 | 0.7776 |
- | 0.4888 | 3.0 | 89856 | 0.5283 | 0.8019 |
- | 0.3788 | 4.0 | 119808 | 0.5072 | 0.8179 |
- | 0.2821 | 5.0 | 149760 | 0.5002 | 0.8264 |
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+ | 0.6353 | 1.0 | 14976 | 0.6263 | 0.7488 |
+ | 0.5703 | 2.0 | 29952 | 0.5596 | 0.7794 |
+ | 0.4598 | 3.0 | 44928 | 0.5079 | 0.8037 |
+ | 0.3756 | 4.0 | 59904 | 0.4848 | 0.8207 |
+ | 0.2471 | 5.0 | 74880 | 0.4816 | 0.8291 |


  ### Framework versions
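The step counts in the two training-results tables line up with the batch-size change: going from `train_batch_size` 16 to 32 halves the number of optimizer steps per epoch. The following is a minimal Python sketch of that arithmetic using only numbers copied from the diff. It assumes each optimizer step consumes exactly one batch of `train_batch_size` examples (no gradient accumulation, no multi-device scaling), which the diff does not confirm. The dictionary layout and the helper name `approx_examples_per_epoch` are hypothetical, chosen for illustration.

```python
# Values copied from the README.md diff (old run vs. new run).
OLD = {"batch_size": 16, "steps_per_epoch": 29952, "eval_loss": 0.5002, "accuracy": 0.8264}
NEW = {"batch_size": 32, "steps_per_epoch": 14976, "eval_loss": 0.4816, "accuracy": 0.8291}


def approx_examples_per_epoch(run):
    """Estimate examples seen per epoch.

    ASSUMPTION: one optimizer step == one batch of `batch_size` examples.
    The true dataset size is not stated in the diff.
    """
    return run["batch_size"] * run["steps_per_epoch"]


old_examples = approx_examples_per_epoch(OLD)
new_examples = approx_examples_per_epoch(NEW)

# Final-epoch deltas between the two runs, rounded to the precision reported.
accuracy_gain = round(NEW["accuracy"] - OLD["accuracy"], 4)
loss_drop = round(OLD["eval_loss"] - NEW["eval_loss"], 4)

print("examples/epoch (old, new):", old_examples, new_examples)
print("accuracy gain:", accuracy_gain)
print("validation loss drop:", loss_drop)
```

Under that assumption both runs cover the same number of examples per epoch, so the differences in final loss and accuracy reflect the batch-size change (and possibly the hardware change recorded in emissions.csv) rather than a change in training data volume.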
emissions.csv CHANGED
@@ -1,2 +1,2 @@
  timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
- 2025-11-24T18:21:38,codecarbon,a5ed7acf-5313-48b2-824e-d9e8e400e830,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,18042.404894045,0.7567710160236923,4.194402134681441e-05,42.5,360.8287762733843,755.7507977485657,0.2128334216121308,3.1922654485325372,3.7842365660835506,7.189335436228231,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0
+ 2025-11-25T16:19:49,codecarbon,aaf6423e-7493-41a9-858d-a5e265a64aae,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,13852.229487212,0.6737052965391285,4.863515271394217e-05,42.5,368.36674694097474,755.7507891654968,0.16323872886144283,3.334605714626882,2.9023654739482674,6.400209917436596,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354377746582,machine,N,1.0
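The two emissions.csv rows can be sanity-checked against each other. The sketch below, in Python, copies a handful of fields from the old and new rows and verifies two internal relationships: `energy_consumed` should equal the sum of the CPU, GPU, and RAM energy columns, and `emissions_rate` should equal `emissions / duration`. It assumes the usual CodeCarbon units (duration in seconds, energy in kWh, emissions in kg CO2eq); the diff itself does not state units. The dictionary names and the helper `check_row` are hypothetical.

```python
import math

# Selected fields copied from the emissions.csv diff.
OLD_ROW = {
    "duration": 18042.404894045,
    "emissions": 0.7567710160236923,
    "emissions_rate": 4.194402134681441e-05,
    "cpu_energy": 0.2128334216121308,
    "gpu_energy": 3.1922654485325372,
    "ram_energy": 3.7842365660835506,
    "energy_consumed": 7.189335436228231,
}
NEW_ROW = {
    "duration": 13852.229487212,
    "emissions": 0.6737052965391285,
    "emissions_rate": 4.863515271394217e-05,
    "cpu_energy": 0.16323872886144283,
    "gpu_energy": 3.334605714626882,
    "ram_energy": 2.9023654739482674,
    "energy_consumed": 6.400209917436596,
}


def check_row(row):
    """Return (energy_ok, rate_ok) for one emissions.csv row."""
    energy_sum = row["cpu_energy"] + row["gpu_energy"] + row["ram_energy"]
    energy_ok = math.isclose(energy_sum, row["energy_consumed"], rel_tol=1e-9)
    rate_ok = math.isclose(
        row["emissions"] / row["duration"], row["emissions_rate"], rel_tol=1e-4
    )
    return energy_ok, rate_ok


old_checks = check_row(OLD_ROW)
new_checks = check_row(NEW_ROW)

# Relative change between runs (negative means the new run is lower).
duration_change = NEW_ROW["duration"] / OLD_ROW["duration"] - 1
emissions_change = NEW_ROW["emissions"] / OLD_ROW["emissions"] - 1

print("old row consistent:", old_checks)
print("new row consistent:", new_checks)
print(f"duration change: {duration_change:.1%}")
print(f"emissions change: {emissions_change:.1%}")
```

Read this way, the new run finished in roughly 3.8 hours instead of roughly 5.0 hours and reported about 11% lower total emissions, while its emissions rate per second is higher because similar energy was spent over a shorter time.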