| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | This is the final in-person program for ECCV 2022. The black-on-white entries in this spreadsheet (about 99% of it) are identical to the printed program. The online version adds a few late-breaking changes: orange marks papers whose authors could no longer attend, and blue marks papers whose authors are able to attend after all. | |||||||||||||||||||||||||
2 | Session Date | Session Time | Start Time | End Time | Session Location | Session Name | Session Title | Paper ID | Status | Session # | Session # | Poster # | Title (Corrected) | Authors (Corrected) | Primary Subject Area | Secondary Subject Areas | ||||||||||
3 | Tuesday, October 25, 2022 | 0930–1100 | 09:30 | 11:00 | Hall D | Oral 1.A.1 | Detection, Recognition, Classification, and Localization in 2D/3D | 640 | Oral | 1 | 1 | Long-Tail Detection with Effective Class-Margins | Jang Hyun Cho; Philipp Krähenbühl | Detection and localization in 2D and/or 3D | ||||||||||||
4 | 1448 | Oral | 1 | Multimodal Object Detection via Probabilistic Ensembling | Yi-Ting Chen; Jinghao Shi; Zelin Ye; Christoph Mertz; Deva Ramanan; Shu Kong | Detection and localization in 2D and/or 3D | Vision for and autonomous vehicles | |||||||||||||||||||
5 | 1791 | Oral | 1 | Improving Robustness by Enhancing Weak Subnets | Yong Guo; David Stutz; Bernt Schiele | Recognition and classification | Optimization and learning methods | |||||||||||||||||||
6 | 6108 | Oral | 1 | Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting | Yangzheng Wu; Mohsen Zand; Ali Etemad; Michael Greenspan | Detection and localization in 2D and/or 3D | ||||||||||||||||||||
7 | 2691 | Oral | 1 | Monocular 3D Object Detection with Depth from Motion | Tai Wang; Jiangmiao Pang; Dahua Lin | Detection and localization in 2D and/or 3D | 3D from a single image and shape-from-x; Scene analysis and understanding; Stereo, 3D from multiview and other sensors; Video analysis and understanding; Vision for and autonomous vehicles | |||||||||||||||||||
8 | Tuesday, October 25, 2022 | 0930–1100 | 09:30 | 11:00 | Hall E | Oral 1.A.2 | Motion and Tracking | 561 | Oral | 2 | 2 | Particle Video Revisited: Tracking through Occlusions Using Point Trajectories | Adam W. Harley; Zhaoyuan Fang; Katerina Fragkiadaki | Motion and tracking | Machine learning architectures and formulations; Video analysis and understanding | |||||||||||
9 | 2385 | Oral | 2 | A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow | Jenny Schmalfuss; Philipp Scholze; Andrés Bruhn | Motion and tracking | Adversarial learning; Optimization and learning methods | |||||||||||||||||||
10 | 2623 | Oral | 2 | Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction | Li-Wu Tsao; Yan-Kai Wang; Hao-Siang Lin; Hong-Han Shuai; Lai-Kuan Wong; Wen-Huang Cheng | Motion and tracking | Semi / Weak / Self / Unsupervised Learning | |||||||||||||||||||
11 | 2874 | Oral | 2 | Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors | Sirui Xu; Yu-Xiong Wang; Liang-Yan Gui | Motion and tracking | Body gestures and pose | |||||||||||||||||||
12 | 4806 | Oral | 2 | TEMOS: Generating Diverse Human Motions from Textual Descriptions | Mathis Petrovich; Michael J. Black; Gül Varol | Motion and tracking | ||||||||||||||||||||
13 | 7092 | Oral | 2 | PREF: Predictability Regularized Neural Motion Fields | Liangchen Song; Xuan Gong; Benjamin Planche; Meng Zheng; David Doermann; Junsong Yuan; Terrence Chen; Ziyan Wu | Motion and tracking | Scene analysis and understanding; Stereo, 3D from multiview and other sensors | |||||||||||||||||||
14 | Tuesday, October 25, 2022 | 1100–1330 | 11:00 | 13:30 | Hall B | Poster 1.A | 1980 | Poster | 3 | 3 | 1 | How Severe Is Benchmark-Sensitivity in Video Self-Supervised Learning? | Fida Mohammad Thoker; Hazel Doughty; Piyush Bagad; Cees G. M. Snoek | Video analysis and understanding | Action and behavior recognition; Datasets and evaluation; Semi / Weak / Self / Unsupervised Learning | |||||||||||
15 | 5549 | Poster | 3 | 2 | Decoupled Contrastive Learning | Chun-Hsiao Yeh; Cheng-Yao Hong; Yen-Chi Hsu; Tyng-Luh Liu; Yubei Chen; Yann LeCun | Representation learning | Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
16 | 5036 | Poster | 3 | 3 | 3D Clothed Human Reconstruction in the Wild | Gyeongsik Moon; Hyeongjin Nam; Takaaki Shiratori; Kyoung Mu Lee | 3D from a single image and shape-from-x | Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
17 | 1353 | Poster | 3 | 4 | MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning | Xiaogang Xu; Hengshuang Zhao; Vibhav Vineet; Ser-Nam Lim; Antonio Torralba | Scene analysis and understanding | Segmentation, grouping, and shape | ||||||||||||||||||
18 | 6454 | Poster | 3 | 6 | Towards Accurate Network Quantization with Equivalent Smooth Regularizer | Kirill Solodskikh; Vladimir Chikin; Ruslan Aydarkhanov; Dehua Song; Irina Zhelavskaya; Jiansheng Wei | Efficient training and inference methods | Optimization and learning methods | ||||||||||||||||||
19 | 6203 | Poster | 3 | 7 | PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation | Bo Sun; Vladimir G. Kim; Noam Aigerman; Qixing Huang; Siddhartha Chaudhuri | 3D shape modeling and processing | |||||||||||||||||||
20 | 5458 | Poster | 3 | 8 | Object Discovery via Contrastive Learning for Weakly Supervised Object Detection | Jinhwan Seo; Wonho Bae; Danica J. Sutherland; Junhyug Noh; Daijin Kim | Semi / Weak / Self / Unsupervised Learning | Detection and localization in 2D and/or 3D; Recognition and classification | ||||||||||||||||||
21 | 7693 | Poster | 3 | 9 | Federated Self-Supervised Learning for Video Understanding | Yasar Abbas Ur Rehman; Yan Gao; Jiajun Shen; Pedro Porto Buarque de Gusmão; Nicholas Lane | Semi / Weak / Self / Unsupervised Learning | Image and video retrieval; Optimization and learning methods; Representation learning | ||||||||||||||||||
22 | 4806 | Oral | 3 | 10 | TEMOS: Generating Diverse Human Motions from Textual Descriptions | Mathis Petrovich; Michael J. Black; Gül Varol | Motion and tracking | |||||||||||||||||||
23 | 6655 | Poster | 3 | 11 | S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning | Jayateja Kalla; Soma Biswas | Recognition and classification | Low-shot learning | ||||||||||||||||||
24 | 7741 | Poster | 3 | 12 | Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration | Christian Tomani; Daniel Cremers; Florian Buettner | Fairness, accountability, transparency, and ethics in vision | Machine learning architectures and formulations | ||||||||||||||||||
25 | 5315 | Poster | 3 | 13 | Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents | Jaskirat Singh; Cameron Smith; Jose Echevarria; Liang Zheng | Image and video synthesis | Vision applications and systems | ||||||||||||||||||
26 | 6817 | Poster | 3 | 14 | Expanded Adaptive Scaling Normalization for End to End Image Compression | Chajin Shin; Hyeongmin Lee; Hanbin Son; Sangjin Lee; Dogyoon Lee; Sangyoun Lee | Image and video synthesis | Low-level and physics-based vision | ||||||||||||||||||
27 | 1469 | Poster | 3 | 15 | ARAH: Animatable Volume Rendering of Articulated Human SDFs | Shaofei Wang; Katja Schwarz; Andreas Geiger; Siyu Tang | Stereo, 3D from multiview and other sensors | 3D shape modeling and processing; Body gestures and pose; Image and video synthesis; Optimization and learning methods | ||||||||||||||||||
28 | 7573 | Poster | 3 | 16 | Scaling Adversarial Training to Large Perturbation Bounds | Sravanti Addepalli; Samyak Jain; Gaurang Sriramanan; R. Venkatesh Babu | Adversarial learning | |||||||||||||||||||
29 | 1596 | Poster | 3 | 17 | DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition | Yuxuan Liang; Pan Zhou; Roger Zimmermann; Shuicheng Yan | Video analysis and understanding | Recognition and classification | ||||||||||||||||||
30 | 6063 | Poster | 3 | 18 | WISE: Whitebox Image Stylization by Example-Based Learning | Winfried Lötzsch; Max Reimann; Martin Büssemeyer; Amir Semmo; Jürgen Döllner; Matthias Trapp | Image and video synthesis | Explainable AI for CV | ||||||||||||||||||
31 | 1114 | Poster | 3 | 19 | CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection | Jyh-Jing Hwang; Henrik Kretzschmar; Joshua Manela; Sean Rafferty; Nicholas Armstrong-Crews; Tiffany Chen; Dragomir Anguelov | Vision for and autonomous vehicles | Detection and localization in 2D and/or 3D; Scene analysis and understanding; Stereo, 3D from multiview and other sensors | ||||||||||||||||||
32 | 561 | Oral | 3 | 20 | Particle Video Revisited: Tracking through Occlusions Using Point Trajectories | Adam W. Harley; Zhaoyuan Fang; Katerina Fragkiadaki | Motion and tracking | Machine learning architectures and formulations; Video analysis and understanding | ||||||||||||||||||
33 | 4222 | Poster | 3 | 21 | Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Haitian Zheng; Zhe Lin; Jingwan Lu; Scott Cohen; Eli Shechtman; Connelly Barnes; Jianming Zhang; Ning Xu; Sohrab Amirghodsi; Jiebo Luo | Image and video synthesis | Neural generative models | ||||||||||||||||||
34 | 1628 | Poster | 3 | 22 | Learning to Detect Every Thing in an Open World | Kuniaki Saito; Ping Hu; Trevor Darrell; Kate Saenko | Recognition and classification | Segmentation, grouping, and shape | ||||||||||||||||||
35 | 4252 | Poster | 3 | 23 | Backbone Is All Your Need: A Simplified Architecture for Visual Object Tracking | Boyu Chen; Peixia Li; Lei Bai; Lei Qiao; Qiuhong Shen; Bo Li; Weihao Gan; Wei Wu; Wanli Ouyang | Motion and tracking | Transfer learning | ||||||||||||||||||
36 | 7039 | Poster | 3 | 24 | Neural Correspondence Field for Object Pose Estimation | Lin Huang; Tomas Hodan; Lingni Ma; Linguang Zhang; Luan Tran; Christopher D. Twigg; Po-Chen Wu; Junsong Yuan; Cem Keskin; Robert Wang | Detection and localization in 2D and/or 3D | 3D from a single image and shape-from-x | ||||||||||||||||||
37 | 4883 | Poster | 3 | 25 | DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks | Shih-Yang Su; Timur Bagautdinov; Helge Rhodin | 3D from a single image and shape-from-x | 3D shape modeling and processing; Neural generative models | ||||||||||||||||||
38 | 3689 | Poster | 3 | 26 | Three Things Everyone Should Know about Vision Transformers | Hugo Touvron; Matthieu Cord; Alaaeldin El-Nouby; Jakob Verbeek; Hervé Jégou | Recognition and classification | Machine learning architectures and formulations; Optimization and learning methods; Transfer learning | ||||||||||||||||||
39 | 6946 | Poster | 3 | 27 | Telepresence Video Quality Assessment | Zhenqiang Ying; Deepti Ghadiyaram; Alan Bovik | Vision + other modalities | Video analysis and understanding | ||||||||||||||||||
40 | 8042 | Poster | 3 | 28 | OCR-Free Document Understanding Transformer | Geewook Kim; Teakgyu Hong; Moonbin Yim; JeongYeon Nam; Jinyoung Park; Jinyeong Yim; Wonseok Hwang; Sangdoo Yun; Dongyoon Han; Seunghyun Park | Scene text and document understanding | Vision applications and systems | ||||||||||||||||||
41 | 2863 | Poster | 3 | 29 | BlobGAN: Spatially Disentangled Scene Representations | Dave Epstein; Taesung Park; Richard Zhang; Eli Shechtman; Alexei A. Efros | Image and video synthesis | Scene analysis and understanding; Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
42 | 1791 | Oral | 3 | 30 | Improving Robustness by Enhancing Weak Subnets | Yong Guo; David Stutz; Bernt Schiele | Recognition and classification | Optimization and learning methods | ||||||||||||||||||
43 | 3685 | Poster | 3 | 31 | VecGAN: Image-to-Image Translation with Interpretable Latent Directions | Yusuf Dalva; Said Fahri Altındiş; Aysegul Dundar | Image and video synthesis | |||||||||||||||||||
44 | 5317 | Poster | 3 | 32 | Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark | Kibok Lee; Hao Yang; Satyaki Chakraborty; Zhaowei Cai; Gurumurthy Swaminathan; Avinash Ravichandran; Onkar Dabeer | Low-shot learning | Datasets and evaluation; Recognition and classification; Transfer learning | ||||||||||||||||||
45 | 7028 | Poster | 3 | 33 | Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification | Jianxiong Shen; Antonio Agudo; Francesc Moreno-Noguer; Adria Ruiz | 3D shape modeling and processing | Stereo, 3D from multiview and other sensors | ||||||||||||||||||
46 | 3132 | Poster | 3 | 34 | Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction | Inhwan Bae; Jin-Hwi Park; Hae-Gon Jeon | Motion and tracking | Action and behavior recognition; Vision for and autonomous vehicles | ||||||||||||||||||
47 | 5770 | Poster | 3 | 35 | ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization | Muhammad Zubair Irshad; Sergey Zakharov; Rareș Ambruș; Thomas Kollar; Zsolt Kira; Adrien Gaidon | 3D from a single image and shape-from-x | Recognition and classification; Scene analysis and understanding | ||||||||||||||||||
48 | 1762 | Poster | 3 | 36 | MoDA: Map Style Transfer for Self-Supervised Domain Adaptation of Embodied Agents | Eun Sun Lee; Junho Kim; SangWon Park; Young Min Kim | Vision for robotics and embodied vision | |||||||||||||||||||
49 | 6512 | Poster | 3 | 37 | BASQ: Branch-Wise Activation-Clipping Search Quantization for Sub-4-Bit Neural Networks | Han-Byul Kim; Eunhyeok Park; Sungjoo Yoo | Efficient training and inference methods | |||||||||||||||||||
50 | 2418 | Poster | 3 | 38 | Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement | Junuk Cha; Muhammad Saqlain; GeonU Kim; Mingyu Shin; Seungryul Baek | Body gestures and pose | 3D from a single image and shape-from-x; 3D shape modeling and processing | ||||||||||||||||||
51 | 222 | Poster | 3 | 39 | Panoptic Scene Graph Generation | Jingkang Yang; Yi Zhe Ang; Zujin Guo; Kaiyang Zhou; Wayne Zhang; Ziwei Liu | Scene analysis and understanding | |||||||||||||||||||
52 | 2623 | Oral | 3 | 40 | Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction | Li-Wu Tsao; Yan-Kai Wang; Hao-Siang Lin; Hong-Han Shuai; Lai-Kuan Wong; Wen-Huang Cheng | Motion and tracking | Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
53 | 1135 | Poster | 3 | 41 | Image-Based CLIP-Guided Essence Transfer | Hila Chefer; Sagie Benaim; Roni Paiss; Lior Wolf | Image and video manipulation detection | Neural generative models; Vision + language | ||||||||||||||||||
54 | 1625 | Poster | 3 | 42 | ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-Verified Image-Caption Associations for MS-COCO | Sanghyuk Chun; Wonjae Kim; Song Park; Minsuk Chang; Seong Joon Oh | Datasets and evaluation | Vision + language | ||||||||||||||||||
55 | 2016 | Poster | 3 | 43 | GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori; Evgeny Krivosheev; Stéphane Lathuilière; Nicu Sebe; Fabio Galasso; Giuseppe Fiameni; Elisa Ricci; Fabio Poiesi | Transfer learning | Semi / Weak / Self / Unsupervised Learning; Vision for and autonomous vehicles | ||||||||||||||||||
56 | 6619 | Poster | 3 | 44 | Fast Two-View Motion Segmentation Using Christoffel Polynomials | Bengisu Ozbay; Octavia Camps; Mario Sznaier | Segmentation, grouping, and shape | |||||||||||||||||||
57 | 2538 | Poster | 3 | 45 | Dual Perspective Network for Audio-Visual Event Localization | Varshanth Rao; Md Ibrahim Khalil; Haoda Li; Peng Dai; Juwei Lu | Video analysis and understanding | Vision + other modalities | ||||||||||||||||||
58 | 3604 | Poster | 3 | 46 | TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation | Junghyuk Lee; Jong-Seok Lee | Neural generative models | Datasets and evaluation | ||||||||||||||||||
59 | 7879 | Poster | 3 | 47 | Grounding Visual Representations with Texts for Domain Generalization | Seonwoo Min; Nokyung Park; Siwon Kim; Seunghyun Park; Jinkyu Kim | Vision + language | Explainable AI for CV; Representation learning | ||||||||||||||||||
60 | 3518 | Poster | 3 | 48 | FrequencyLowCut Pooling – Plug & Play against Catastrophic Overfitting | Julia Grabinski; Steffen Jung; Janis Keuper; Margret Keuper | Image and video manipulation detection | Machine learning architectures and formulations; Recognition and classification | ||||||||||||||||||
61 | 7110 | Poster | 3 | 49 | GigaDepth: Learning Depth from Structured Light with Branching Neural Networks | Simon Schreiberhuber; Jean-Baptiste Weibel; Timothy Patten; Markus Vincze | Stereo, 3D from multiview and other sensors | |||||||||||||||||||
62 | 2874 | Oral | 3 | 50 | Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors | Sirui Xu; Yu-Xiong Wang; Liang-Yan Gui | Motion and tracking | Body gestures and pose | ||||||||||||||||||
63 | 2061 | Poster | 3 | 51 | Hierarchical Average Precision Training for Pertinent Image Retrieval | Elias Ramzi; Nicolas Audebert; Nicolas Thome; Clément Rambour; Xavier Bitot | Image and video retrieval | Machine learning architectures and formulations; Optimization and learning methods | ||||||||||||||||||
64 | 2350 | Poster | 3 | 52 | IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion | Seung-Jun Moon; Gyeong-Moon Park | Image and video synthesis | Faces; Neural generative models | ||||||||||||||||||
65 | 5591 | Poster | 3 | 53 | Semi-Supervised Learning of Optical Flow by Flow Supervisor | Woobin Im; Sebin Lee; Sung-Eui Yoon | Video analysis and understanding | Motion and tracking; Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
66 | 5940 | Poster | 3 | 54 | SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection | Minhyeok Lee; Chaewon Park; Suhwan Cho; Sangyoun Lee | Segmentation, grouping, and shape | Scene analysis and understanding | ||||||||||||||||||
67 | 3904 | Poster | 3 | 55 | CANF-VC: Conditional Augmented Normalizing Flows for Video Compression | Yung-Han Ho; Chih-Peng Chang; Peng-Yu Chen; Alessandro Gnutti; Wen-Hsiao Peng | Image and video synthesis | |||||||||||||||||||
68 | 2649 | Poster | 3 | 56 | Context-Enhanced Stereo Transformer | Weiyu Guo; Zhaoshuo Li; Yongkui Yang; Zheng Wang; Russell H. Taylor; Mathias Unberath; Alan Yuille; Yingwei Li | Stereo, 3D from multiview and other sensors | Machine learning architectures and formulations | ||||||||||||||||||
69 | 1549 | Poster | 3 | 57 | AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction | Zerui Chen; Yana Hasson; Cordelia Schmid; Ivan Laptev | 3D from a single image and shape-from-x | Body gestures and pose | ||||||||||||||||||
70 | 6986 | Poster | 3 | 58 | Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos | Tanqiu Qiao; Qianhui Men; Frederick W. B. Li; Yoshiki Kubotani; Shigeo Morishima; Hubert P. H. Shum | Action and behavior recognition | Body gestures and pose; Recognition and classification; Video analysis and understanding | ||||||||||||||||||
71 | 744 | Poster | 3 | 59 | Streamable Neural Fields | Junwoo Cho; Seungtae Nam; Daniel Rho; Jong Hwan Ko; Eunbyung Park | Machine learning architectures and formulations | 3D shape modeling and processing; Optimization and learning methods | ||||||||||||||||||
72 | 6108 | Oral | 3 | 60 | Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting | Yangzheng Wu; Mohsen Zand; Ali Etemad; Michael Greenspan | Detection and localization in 2D and/or 3D | |||||||||||||||||||
73 | 6912 | Poster | 3 | 61 | PIP: Physical Interaction Prediction via Mental Simulation with Span Selection | Jiafei Duan; Samson Yu; Soujanya Poria; Bihan Wen; Cheston Tan | Video analysis and understanding | Datasets and evaluation; Image and video synthesis; Recognition and classification; Scene analysis and understanding | ||||||||||||||||||
74 | 5293 | Poster | 3 | 62 | DeepMend: Learning Occupancy Functions to Represent Shape for Repair | Nikolas Lamb; Sean Banerjee; Natasha Kholgade Banerjee | 3D shape modeling and processing | 3D from a single image and shape-from-x; Optimization and learning methods; Representation learning | ||||||||||||||||||
75 | 1576 | Poster | 3 | 63 | DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning | Hyounguk Shon; Janghyeon Lee; Seung Hwan Kim; Junmo Kim | Transfer learning | |||||||||||||||||||
76 | 4857 | Poster | 3 | 64 | Self-Supervised Classification Network | Elad Amrani; Leonid Karlinsky; Alex Bronstein | Semi / Weak / Self / Unsupervised Learning | Representation learning | ||||||||||||||||||
77 | 7474 | Poster | 3 | 65 | Towards Metrical Reconstruction of Human Faces | Wojciech Zielonka; Timo Bolkart; Justus Thies | Faces | Motion and tracking | ||||||||||||||||||
78 | 4918 | Poster | 3 | 66 | Learned Vertex Descent: A New Direction for 3D Human Model Fitting | Enric Corona; Gerard Pons-Moll; Guillem Alenyà; Francesc Moreno-Noguer | 3D from a single image and shape-from-x | 3D shape modeling and processing | ||||||||||||||||||
79 | 2596 | Poster | 3 | 67 | Robust Landmark-Based Stent Tracking in X-Ray Fluoroscopy | Luojie Huang; Yikang Liu; Li Chen; Eric Z. Chen; Xiao Chen; Shanhui Sun | Motion and tracking | Detection and localization in 2D and/or 3D; Medical, biological, and cell microscopy; Visual reasoning and logical representation | ||||||||||||||||||
80 | 7043 | Poster | 3 | 68 | The Missing Link: Finding Label Relations across Datasets | Jasper Uijlings; Thomas Mensink; Vittorio Ferrari | Datasets and evaluation | |||||||||||||||||||
81 | 5616 | Poster | 3 | 69 | Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification | Naoki Okamoto; Tsubasa Hirakawa; Takayoshi Yamashita; Hironobu Fujiyoshi | Efficient training and inference methods | Optimization and learning methods; Recognition and classification | ||||||||||||||||||
82 | 1448 | Oral | 3 | 70 | Multimodal Object Detection via Probabilistic Ensembling | Yi-Ting Chen; Jinghao Shi; Zelin Ye; Christoph Mertz; Deva Ramanan; Shu Kong | Detection and localization in 2D and/or 3D | Vision for and autonomous vehicles | ||||||||||||||||||
83 | 7545 | Poster | 3 | 71 | Latent Space Smoothing for Individually Fair Representations | Momchil Peychev; Anian Ruoss; Mislav Balunović; Maximilian Baader; Martin Vechev | Fairness, accountability, transparency, and ethics in vision | Adversarial learning; Explainable AI for CV | ||||||||||||||||||
84 | 570 | Poster | 3 | 72 | SUPR: A Sparse Unified Part-Based Human Representation | Ahmed A. A. Osman; Timo Bolkart; Dimitrios Tzionas; Michael J. Black | 3D shape modeling and processing | Body gestures and pose | ||||||||||||||||||
85 | 514 | Poster | 3 | 73 | PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking? | Aleksandr Kim; Guillem Brasó; Aljoša Ošep; Laura Leal-Taixé | Motion and tracking | Detection and localization in 2D and/or 3D; Vision + other modalities; Vision applications and systems; Vision for and autonomous vehicles; Vision for robotics and embodied vision | ||||||||||||||||||
86 | 1035 | Poster | 3 | 74 | Learning Instance-Specific Adaptation for Cross-Domain Segmentation | Yuliang Zou; Zizhao Zhang; Chun-Liang Li; Han Zhang; Tomas Pfister; Jia-Bin Huang | Transfer learning | Segmentation, grouping, and shape; Vision for and autonomous vehicles | ||||||||||||||||||
87 | 1257 | Poster | 3 | 75 | Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving | Mahyar Najibi; Jingwei Ji; Yin Zhou; Charles R. Qi; Xinchen Yan; Scott Ettinger; Dragomir Anguelov | Vision for and autonomous vehicles | Detection and localization in 2D and/or 3D; Motion and tracking; Semi / Weak / Self / Unsupervised Learning | ||||||||||||||||||
88 | 5297 | Poster | 3 | 76 | Graph Neural Network for Cell Tracking in Microscopy Videos | Tal Ben-Haim; Tammy Riklin Raviv | Medical, biological, and cell microscopy | Motion and tracking | ||||||||||||||||||
89 | 5988 | Poster | 3 | 77 | Correspondence Reweighted Translation Averaging | Lalit Manam; Venu Madhav Govindu | Stereo, 3D from multiview and other sensors | 3D shape modeling and processing | ||||||||||||||||||
90 | 4022 | Poster | 3 | 78 | MotionCLIP: Exposing Human Motion Generation to CLIP Space | Guy Tevet; Brian Gordon; Amir Hertz; Amit H. Bermano; Daniel Cohen-Or | Motion and tracking | Body gestures and pose; Neural generative models; Vision + language | ||||||||||||||||||
91 | 4217 | Poster | 3 | 79 | Learning Audio-Video Modalities from Image Captions | Arsha Nagrani; Paul Hongsuck Seo; Bryan Seybold; Anja Hauth; Santiago Manen; Chen Sun; Cordelia Schmid | Image and video retrieval | Datasets and evaluation; Vision + language; Vision + other modalities | ||||||||||||||||||
92 | 640 | Oral | 3 | 80 | Long-Tail Detection with Effective Class-Margins | Jang Hyun Cho; Philipp Krähenbühl | Detection and localization in 2D and/or 3D | |||||||||||||||||||
93 | 3958 | Poster | 3 | 81 | Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box | Jayeon Yoo; Inseop Chung; Nojun Kwak | Transfer learning | Detection and localization in 2D and/or 3D; Semi / Weak / Self / Unsupervised Learning; Vision for and autonomous vehicles | ||||||||||||||||||
94 | 4998 | Poster | 3 | 82 | NeRF for Outdoor Scene Relighting | Viktor Rudnev; Mohamed Elgharib; William Smith; Lingjie Liu; Vladislav Golyanik; Christian Theobalt | Image and video synthesis | Scene analysis and understanding; Segmentation, grouping, and shape; Semi / Weak / Self / Unsupervised Learning; Stereo, 3D from multiview and other sensors | ||||||||||||||||||
95 | 7070 | Poster | 3 | 83 | A Comparative Study of Graph Matching Algorithms in Computer Vision | Stefan Haller; Lorenz Feineis; Lisa Hutschenreiter; Florian Bernard; Carsten Rother; Dagmar Kainmüller; Paul Swoboda; Bogdan Savchynskyy | Optimization and learning methods | Datasets and evaluation | ||||||||||||||||||
96 | 1493 | Poster | 3 | 84 | Quantized GAN for Complex Music Generation from Dance Videos | Ye Zhu; Kyle Olszewski; Yu Wu; Panos Achlioptas; Menglei Chai; Yan Yan; Sergey Tulyakov | Vision + other modalities | Vision applications and systems | ||||||||||||||||||
97 | 4175 | Poster | 3 | 85 | Visual Prompt Tuning | Menglin Jia; Luming Tang; Bor-Chun Chen; Claire Cardie; Serge Belongie; Bharath Hariharan; Ser-Nam Lim | Transfer learning | Recognition and classification | ||||||||||||||||||
98 | 6827 | Poster | 3 | 86 | Embedding Contrastive Unsupervised Features to Cluster in- and Out-of-Distribution Noise in Corrupted Image Datasets | Paul Albert; Eric Arazo; Noel E. O’Connor; Kevin McGuinness | Semi / Weak / Self / Unsupervised Learning | |||||||||||||||||||
99 | 4593 | Poster | 3 | 87 | Cross-Domain Ensemble Distillation for Domain Generalization | Kyungmoon Lee; Sungyeon Kim; Suha Kwak | Recognition and classification | |||||||||||||||||||
100 | 8001 | Poster | 3 | 88 | Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions | Joaquín Ossandón; Benjamín Earle; Alvaro Soto | Vision + language | Datasets and evaluation; Vision + other modalities; Vision for robotics and embodied vision |