Skip to content

21ICCV # Emerging Properties in Self-Supervised Vision Transformers (DINO) #44

@XFeiF

Description

@XFeiF

Paper
Code

Authors:
Mathilde Caron, Hugo Touvron, etc.
FBAI.

Highlights:

  • A new proposed self-supervised learning method with KD: a form of knowledge distillation with no labels. Especially, it uses a different way to avoid the collapse solution, that is use the momentum teacher encoder.
  • It encouraging "local-to-global" correspondences by feeding different sizes of views to student and teacher encoders.
  • SSL ViT features explicitly contain the scene layout and, in particular, object boundaries, as shown in the next figure.

Metadata

Metadata

Assignees

No one assigned

    Labels

    CodeCode available.Summary/BriefA breif summary about the paper.area/SSLself-supervised learningtrend/TransformerEvery paper uses transformer...

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions