Improving Unsupervised Label Propagation for Pose Tracking and Video Object Segmentation �Urs Waldmann, Jannik Bamberger, Ole Johannsen, Oliver Deussen, and Bastian Goldlücke�
Contributions
Pipeline
Results
Pipeline
Applications and Case Studies
Joint Tracking and Keypoint Propagation
Pigeon Keypoint Tracking
Unsupervised Zero-Shot VOS
This work was supported by the DFG under Germany’s Excellence Strategy - EXC 2117 - 422037984.
Presented at the German Conference on Pattern Recognition (GCPR), Konstanz, September 2022.
References
[1] M. Caron et al.: Emerging properties in self-supervised vision transformers. In ICCV, 2021
[2] A. Jabri et al.: Space-time correspondence as a contrastive random walk. In NIPS, 2020.
[3] H. Jhuang et al.: Towards understanding action recognition. In ICCV, 2013.
[4] X. Li et al.: Joint-task self-supervised learning for temporal correspondence. In NIPS, 2019.
[5] C. Yang et al.: Self-sup. video object segmentation by motion grouping. In ICCV, 2021
[6] L. Yang et al.: Efficient video object segmentation via network modulation. In CVPR, 2018.
[7] Y. Yang et al.: Unsup. moving object detection via contextual inform. sep.. In CVPR, 2019.
Real-World Dataset | PCK@0.1 | PCK@0.2 |
Core Pipeline | 7.5% | 25.1% |
Joint Pipeline | 81.0% | 97.5% |
| STC [2] | Ours | Ours + Trk | Supervised [6] |
PCK@0.1 | 59.3% | 63.9% | 65.8% | 68.7% |
PCK@0.2 | 84.9% | 82.8% | 84.2% | 92.1% |
Method | Online | Post-Proc. | Motion Only | IoU |
MoGr., unsup. flow [5] | ✅ | ❌ | ✅ | 53.2% |
CIS [7] | ✅ | ❌ | ✅ | 59.2% |
Ours | ✅ | ❌ | ❌ | 61.6% |
MoGr., sup. flow [5] | ✅ | ❌ | ✅ | 68.3% |
CIS + post-proc. [7] | ❌ | ✅ | ✅ | 71.5% |