Implement a gesture classification model #2

@gutzcha

Description

The suggested approach is to use a masked autoencoder (MAE) pre-trained on a facial pose dataset.

There are existing implementations that we can build on. The work breaks down into two tasks:

  • Implement a time-dilated CNN (TDCNN) model
  • Implement an MAE model with a TDCNN embedding and a ViT backbone
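
For reference, here is a rough, framework-free sketch of the two mechanisms these tasks rely on: a dilated 1D convolution along the time axis, and MAE-style random masking of the resulting tokens. It is only an illustration and not the planned implementation. The function names, the toy input, the kernel, and the 0.75 mask ratio are all placeholders chosen for this example, and the real model would presumably use a deep learning framework rather than plain Python lists.

```python
import random


def dilated_conv1d(signal, kernel, dilation=1):
    """Valid (unpadded) 1D cross-correlation with `dilation` frames between taps."""
    span = (len(kernel) - 1) * dilation  # distance between first and last tap
    return [
        sum(w * signal[t + k * dilation] for k, w in enumerate(kernel))
        for t in range(len(signal) - span)
    ]


def receptive_field(kernel_size, dilations):
    """Number of input frames seen by one output of stacked stride-1 dilated layers."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def random_mask(num_tokens, mask_ratio, rng):
    """MAE-style split: return (visible, masked) token indices, each sorted."""
    order = list(range(num_tokens))
    rng.shuffle(order)
    num_masked = int(num_tokens * mask_ratio)
    return sorted(order[num_masked:]), sorted(order[:num_masked])


# Toy input (hypothetical): one pose coordinate tracked over 16 frames.
frames = [float(t) for t in range(16)]

# A 3-tap kernel with dilation 2 compares frame t with frame t + 4.
features = dilated_conv1d(frames, kernel=[1.0, 0.0, -1.0], dilation=2)

# Hide most tokens; the encoder would only see the `visible` ones.
# The 0.75 ratio is a placeholder, not a value decided in this issue.
visible, masked = random_mask(len(features), mask_ratio=0.75, rng=random.Random(0))

print(len(features), receptive_field(3, [1, 2, 4]), len(visible), len(masked))
```

Stacking layers with growing dilation widens the temporal context quickly without adding parameters, which is the usual motivation for a dilated embedding in front of a transformer backbone. The masking split is what the MAE pre-training objective would reconstruct from.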

    Projects

    Status

    🔖 Ready
