End of training

Files changed (2) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5002
-- Accuracy: 0.8264
 ## Model description
@@ -39,8 +39,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -48,13 +48,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step   | Validation Loss | Accuracy |
-|:-------------:|:-----:|:------:|:---------------:|:--------:|
-| 0.6004        | 1.0   | 29952  | 0.6255          | 0.7471   |
-| 0.5196        | 2.0   | 59904  | 0.5728          | 0.7776   |
-| 0.4888        | 3.0   | 89856  | 0.5283          | 0.8019   |
-| 0.3788        | 4.0   | 119808 | 0.5072          | 0.8179   |
-| 0.2821        | 5.0   | 149760 | 0.5002          | 0.8264   |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4816
+- Accuracy: 0.8291
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.6353        | 1.0   | 14976 | 0.6263          | 0.7488   |
+| 0.5703        | 2.0   | 29952 | 0.5596          | 0.7794   |
+| 0.4598        | 3.0   | 44928 | 0.5079          | 0.8037   |
+| 0.3756        | 4.0   | 59904 | 0.4848          | 0.8207   |
+| 0.2471        | 5.0   | 74880 | 0.4816          | 0.8291   |
 ### Framework versions

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	- 2025-11-~~24T18~~:21:38,codecarbon,~~a5ed7acf~~-~~5313~~-~~48b2~~-~~824e~~-~~d9e8e400e830~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~18042~~.~~404894045~~,0.~~7567710160236923~~,4.~~194402134681441e~~-05,42.5,~~360~~.~~8287762733843~~,755.~~7507977485657~~,0.~~2128334216121308~~,3.~~1922654485325372~~,3.~~7842365660835506~~,7.~~189335436228231~~,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA ~~H100 NVL~~,6.1661,49.7498,2015.~~3354606628418~~,machine,N,1.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	+ 2025-11-25T16:19:49,codecarbon,aaf6423e-7493-41a9-858d-a5e265a64aae,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,13852.229487212,0.6737052965391285,4.863515271394217e-05,42.5,368.36674694097474,755.7507891654968,0.16323872886144283,3.334605714626882,2.9023654739482674,6.400209917436596,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354377746582,machine,N,1.0