Fashion Segmentation Model (Unet / resnet50)

This model performs binary segmentation to extract upper-body garments and clothing items from images. It was trained on the DeepFashion2 dataset using the Unet architecture with a resnet50 encoder.

Model Details

  • Task: Binary Segmentation (Cloth vs Background)
  • Architecture: Unet
  • Encoder: resnet50 (Weights: imagenet)
  • Decoder Channels: (256, 128, 64, 32, 16)
  • Input Resolution: 768x768
  • Classes: 1 (Binary segmentation)
  • Activation Function: None (None = logits output)
  • Performance:
    • Validation IoU: 0.8964 at epoch 45

Usage

import torch
import segmentation_models_pytorch as smp
from safetensors.torch import load_file

# Load the model
model = smp.Unet(
    encoder_name="resnet50",
    classes=1,
    activation=None,
    decoder_channels=(256, 128, 64, 32, 16)
)

# Load weights from safetensors
state_dict = load_file("model.safetensors")
model.load_state_dict(state_dict)
model.eval()

# Use for inference
with torch.no_grad():
    prediction = model(input_image)  # Input: (B, 3, 768, 768)
    mask = torch.sigmoid(prediction) > 0.5  # Binary mask

Training Details

  • Dataset: DeepFashion2
  • Framework: PyTorch Lightning
  • Run ID: p2avrl3r

Intended Use

This model is designed for virtual try-on applications, fashion image editing, and garment extraction tasks.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train zyuzuguldu/garment-segmentation-unet-resnet50

Space using zyuzuguldu/garment-segmentation-unet-resnet50 1