Hi Gunnar,
I was just wondering what "ResNet-152 Transfer" stands for.
The papers said "It uses the Charades model to predict the activities in the third person video,
and then uses those labels as supervision for the first-person video."
Does it mean you run the Charades model on CharadesEgo_only3rd videos, then warp the labels to their corresponding 1st person video, and use the labels as supervision to finetune the original Charades model?
Thank you.
Best,
Ziwei
Hi Gunnar,
I was just wondering what "ResNet-152 Transfer" stands for.
The papers said "It uses the Charades model to predict the activities in the third person video,
and then uses those labels as supervision for the first-person video."
Does it mean you run the Charades model on CharadesEgo_only3rd videos, then warp the labels to their corresponding 1st person video, and use the labels as supervision to finetune the original Charades model?
Thank you.
Best,
Ziwei