ADE20K Segmentation Probe: DINOv3 ViT-S/16 @ 128px input

Linear segmentation probe on the spatial features of facebook/dinov3-vits16-pretrain-lvd1689m.

Usage

uv add "canvit-pytorch @ git+https://github.com/m2b3/CanViT-PyTorch.git"

import torch
from canvit_pytorch.probes import SegmentationProbe

probe = SegmentationProbe.from_pretrained("canvit/probe-ade20k-40k-dv3s-128px").eval()

# [B, H, W, D] DINOv3 ViT-S/16 spatial features at 128px input
features = torch.randn(1, 8, 8, 384)
with torch.inference_mode():
    logits = probe(features)    # [B, num_classes, H, W]
assert logits.shape == (1, 150, 8, 8)
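The probe predicts at patch resolution (8 × 8 for a 128px input with 16px patches). One common way to get a per-pixel label map is to upsample the logits bilinearly and take the argmax over classes. The sketch below shows that post-processing step under those assumptions; it is not something the model card prescribes. Random logits stand in for the probe output, and `logits_to_label_map` is a hypothetical helper name.

```python
import torch
import torch.nn.functional as F


def logits_to_label_map(logits: torch.Tensor, size: tuple[int, int]) -> torch.Tensor:
    """Upsample [B, C, h, w] logits to `size` and return [B, H, W] class indices."""
    upsampled = F.interpolate(logits, size=size, mode="bilinear", align_corners=False)
    return upsampled.argmax(dim=1)


# Stand-in for the probe output: [B, num_classes, H, W]
logits = torch.randn(1, 150, 8, 8)
labels = logits_to_label_map(logits, size=(128, 128))
```

Upsampling the logits before the argmax, rather than upsampling the predicted labels, tends to give smoother class boundaries.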

Training

Architecture: Dropout → BatchNorm → Conv1×1.
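For readers who want to see the shape of such a head, here is a minimal PyTorch sketch of a Dropout → BatchNorm → 1×1 convolution stack. It is an illustrative re-implementation, not the code in `canvit_pytorch`: the class name `LinearSegProbeSketch`, the use of `BatchNorm2d`, and the channels-first permute are assumptions. The feature width (384), class count (150), and dropout rate (0.1) are taken from this card.

```python
import torch
from torch import nn


class LinearSegProbeSketch(nn.Module):
    """Hypothetical re-implementation: Dropout -> BatchNorm -> 1x1 conv."""

    def __init__(self, dim: int = 384, num_classes: int = 150, p: float = 0.1):
        super().__init__()
        self.drop = nn.Dropout(p)
        self.norm = nn.BatchNorm2d(dim)
        self.head = nn.Conv2d(dim, num_classes, kernel_size=1)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        # [B, H, W, D] -> [B, D, H, W], the layout BatchNorm2d and Conv2d expect
        x = features.permute(0, 3, 1, 2)
        return self.head(self.norm(self.drop(x)))


probe_sketch = LinearSegProbeSketch().eval()
with torch.inference_mode():
    out = probe_sketch(torch.randn(1, 8, 8, 384))

# Count every element in the state dict (weights plus BatchNorm buffers)
num_state_elems = sum(t.numel() for t in probe_sketch.state_dict().values())
```

Under these assumptions the state dict holds 59,287 elements, which is consistent with the roughly 59.3k parameters reported for the checkpoint, though that agreement does not confirm the exact layer types.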

Hyperparameter   Value
Input size       128 × 128 px
Optimizer        AdamW
Peak LR          3 × 10⁻⁴
Weight decay     10⁻³
LR schedule      1,500-step warmup → cosine decay
Batch size       16
Max steps        40,000
Dropout          0.1
Augmentation     RandomResizedCrop scale [0.5, 2] + HFlip
Precision        bf16 (AMP)
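The learning-rate schedule in the table can be written as a small function of the step index. The sketch below assumes a linear warmup from 0 and a cosine decay that reaches 0 at the final step; the card states neither the warmup shape nor the final learning rate, so both are assumptions, as is the helper name `lr_at`.

```python
import math

PEAK_LR = 3e-4        # from the table
WARMUP_STEPS = 1_500  # from the table
MAX_STEPS = 40_000    # from the table


def lr_at(step: int) -> float:
    """Learning rate at a given optimizer step (0-indexed)."""
    if step < WARMUP_STEPS:
        # Assumed: linear ramp from 0 up to the peak
        return PEAK_LR * step / WARMUP_STEPS
    # Assumed: cosine decay from the peak down to 0 at MAX_STEPS
    progress = (step - WARMUP_STEPS) / (MAX_STEPS - WARMUP_STEPS)
    return 0.5 * PEAK_LR * (1.0 + math.cos(math.pi * min(progress, 1.0)))
```

To drive a PyTorch optimizer with this curve, one option is `torch.optim.lr_scheduler.LambdaLR` with `lr_lambda=lambda s: lr_at(s) / PEAK_LR`, since `LambdaLR` expects a multiplier on the optimizer's base learning rate.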

Model size: 59.3k params (Safetensors, F32)