ViT fine-tuned para clasificacion FAKE/REAL

Resumen metodologico

Se utilizo el modelo base google/vit-base-patch16-224-in21k y se realizo fine-tuning con Trainer de Hugging Face. Las imagenes se preprocesaron con AutoImageProcessor y el entrenamiento se ejecuto con early stopping. La seleccion del mejor checkpoint se hizo con base en la metrica F1 de validacion.

Hiperparametros principales

  • learning_rate: 2e-05
  • batch_size: 16
  • num_train_epochs: 8
  • weight_decay: 0.01
  • warmup_ratio: 0.1
  • early_stopping_patience: 2
  • early_stopping_threshold: 0.001

Resultados

Validacion

  • loss: 0.013790015131235123
  • accuracy: 0.9981718464351006
  • precision: 0.9981786173742297
  • recall: 0.9981718464351006
  • f1: 0.998171895325199

Test

  • loss: 0.02344433404505253
  • accuracy: 0.9928952042628775
  • precision: 0.9929184199081565
  • recall: 0.9928952042628775
  • f1: 0.9928942615297829
Downloads last month
33
Safetensors
Model size
85.8M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train djramirezp/vit-face-classification-quiz2