A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Title & Authors | Paper/Project/Video/Slides links | short description | Slides | ||||||||||||||||||||||
2 | 8/24/2022 | |||||||||||||||||||||||||
3 | Dimitris Samaras | Introduction | https://docs.google.com/presentation/d/1wEmFhARFnilBW9-Ogt9NBqvpX_qnIKBnMiZIatuPXgY/edit?usp=sharing | |||||||||||||||||||||||
4 | 8/31/2022 | |||||||||||||||||||||||||
5 | Jingwei | Gigapixel Whole-Slide Images Classification using Locally Supervised Learning | https://arxiv.org/pdf/2207.08267.pdf | MICCAI practice | ||||||||||||||||||||||
6 | Lei | Efficient Geometry-aware {3D} Generative Adversarial Networks | https://arxiv.org/abs/2112.07945 | https://docs.google.com/presentation/d/1vSb7XY_Hpu6YjaIM8UwxL-U2dqkUZY9v6aZiApgzHBE/edit?usp=sharing | ||||||||||||||||||||||
7 | Shilin | FurryGAN: High Quality Foreground-aware Image Synthesis | https://arxiv.org/pdf/2208.10422.pdf | https://docs.google.com/presentation/d/1AgqFnVH248TtzdDhyH2X54UAHY0NH2dpL2Po7C0xYec/edit?usp=sharing | ||||||||||||||||||||||
8 | Ruyi | Neural Correspondence Field for Object Pose Estimation | https://arxiv.org/pdf/2208.00113.pdf | 6DoF object pose estimation from single RGB image based on 3D-3D correspondence estimation with 3D implicit representation | https://docs.google.com/presentation/d/1pNHsATv4Smgrblsr0cZWdWI9exTFNc2QFnuwl9Ho3BU/edit?usp=sharing | |||||||||||||||||||||
9 | 9/7/2022 | |||||||||||||||||||||||||
10 | Yue | High-resolution Face Swapping via Latent Semantics Disentanglement | https://openaccess.thecvf.com/content/CVPR2022/papers/Xu_High-Resolution_Face_Swapping_via_Latent_Semantics_Disentanglement_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1JW4FI0GrrFmEvfshnlCq4N506hQZGpc4alkL0xgktSg/edit?usp=sharing | |||||||||||||||||||||
11 | Hritam | Addressing Class Imbalance in Semi-supervised Image Segmentation: A Study on Cardiac MRI | MICCAI22 paper | https://drive.google.com/file/d/1A4--_03ssu2yoGAQplPJrlLpsKBgQ5S_/view?usp=sharing | ||||||||||||||||||||||
12 | Haoyu | SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse views | https://arxiv.org/abs/2206.05737 | ECCV 2022 | https://docs.google.com/presentation/d/1SvayreRc42d0NyjH0P-4JgXoQXYAcpBdjOPl1oJ5nds/edit?usp=sharing | |||||||||||||||||||||
13 | Moinak | GazeRadar: A Gaze and Radiomics-guided Disease Localization Framework | MICCAI22 paper | https://docs.google.com/presentation/d/1RayaYc-DDZSBwZ-z2hVdZZlpOzzGUKhvvvtiYtLDVx8/edit?usp=sharing | ||||||||||||||||||||||
14 | 9/14/2022 | |||||||||||||||||||||||||
15 | Sounak | On Guiding Visual Attention with Language Specification | https://openaccess.thecvf.com/content/CVPR2022/papers/Petryk_On_Guiding_Visual_Attention_With_Language_Specification_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/19jlZb8ofrrxwLLmOUKz9dvSqtte-hLWm17BhrkOCdNc/edit?usp=sharing | |||||||||||||||||||||
16 | Aishik | Learning What Not to Segment: A New Perspective on Few-Shot Segmentation | https://openaccess.thecvf.com/content/CVPR2022/papers/Lang_Learning_What_Not_To_Segment_A_New_Perspective_on_Few-Shot_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1138Z9SorqjU6T6tW27oUG7s7AijmG6HSFh3Fmk23wzU/edit?usp=sharing | |||||||||||||||||||||
17 | Alfredo | Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations | https://arxiv.org/abs/2209.03494 | 3DV2022 | https://docs.google.com/presentation/d/194Gc_01IFO-jyMTq5MCEde7D5SkweDjNYUNfoYKJEmg/edit?usp=sharing | |||||||||||||||||||||
18 | ||||||||||||||||||||||||||
19 | Peiyao | Frozen CLIP Models are Efficient Video Learners | https://arxiv.org/pdf/2208.03550.pdf | ECCV 2022 | https://docs.google.com/presentation/d/1g8fW-CXqCdNi_CaRJjBY1VZa3gfh2UHk/edit?usp=sharing&ouid=103358536235384126888&rtpof=true&sd=true | |||||||||||||||||||||
20 | 9/21/2022 | |||||||||||||||||||||||||
21 | Mithilesh | EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks | https://arxiv.org/pdf/1905.11946 | ICML 2019 | https://docs.google.com/presentation/d/1QMADlSbdSjKdBFHLUeDQhfvGvGowJ6v1/edit?usp=sharing&ouid=103647986604749576791&rtpof=true&sd=true | |||||||||||||||||||||
22 | Qiaomu | ESCNet: Gaze Target Detection with the Understanding of 3D Scenes | https://openaccess.thecvf.com/content/CVPR2022/papers/Bao_ESCNet_Gaze_Target_Detection_With_the_Understanding_of_3D_Scenes_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/19TeREXqgLnsIDmVHasSvpbhH4g01IUMxNSrOKiZB-5c/edit?usp=sharing | |||||||||||||||||||||
23 | Dichang | Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry | https://arxiv.org/pdf/2112.08177.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1v8U3bL-QqQp-VPXU_krP3II9QVJHe37ayCPweF74_a8/edit?usp=sharing | |||||||||||||||||||||
24 | Meilong | Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning | https://openaccess.thecvf.com/content/CVPR2022/papers/Chen_Scaling_Vision_Transformers_to_Gigapixel_Images_via_Hierarchical_Self-Supervised_Learning_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1kWrRZCUO9wxnRVo4EHzuWDvSDcGaazy611zJu4Idzkc/edit?usp=sharing | |||||||||||||||||||||
25 | 9/28/2022 | |||||||||||||||||||||||||
26 | Saumya | Learning Topological Interactions for Multi-Class Medical Image Segmentation | Paper link : https://arxiv.org/abs/2207.09654 | ECCV 2022 | https://docs.google.com/presentation/d/1s4KmaGUR-H0MgNNAVxbUVJDFcH6vhbdpa5Rc_OQiZgU/edit?usp=sharing | |||||||||||||||||||||
27 | Sagnik | Learning Isometric Surface Parameterization for Texture Unwrapping | https://drive.google.com/file/d/1evpXuRwlNCZc5CAYuFqKL5SpaV2kFa8Y/view | ECCV 2022 | https://drive.google.com/file/d/1aOr7VKfm73LsI3atolze15JY927h2bqw/view?usp=sharing | |||||||||||||||||||||
28 | Moinak | RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification | https://arxiv.org/abs/2202.11781 | ECCV 2022 | https://drive.google.com/file/d/10v8KhX13v5hybemMS8HMfZDjSFk4_L-v/view?usp=sharing | |||||||||||||||||||||
29 | Jinghuan | StARformer: Transformer with State-Action-Reward Representation for Visual Reinforcement Learning | Paper: https://arxiv.org/abs/2110.06206 Video: https://www3.cs.stonybrook.edu/~jishang/starformer/4447.mp4 | ECCV 2022 | https://docs.google.com/presentation/d/1nUCQyUZ8lmJTYoz-i4aA5xou5JkzXUIN/edit?usp=sharing&ouid=105032877246147260285&rtpof=true&sd=true | |||||||||||||||||||||
30 | 10/5/2022 | |||||||||||||||||||||||||
31 | Mithilesh | EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks | https://arxiv.org/pdf/1905.11946 | ICML 2019 | https://docs.google.com/presentation/d/1QMADlSbdSjKdBFHLUeDQhfvGvGowJ6v1/edit?usp=sharing&ouid=103647986604749576791&rtpof=true&sd=true | |||||||||||||||||||||
32 | Khiem Phi | How Do Vision Transformers Work? | https://arxiv.org/pdf/2202.06709.pdf | ICLR 2022 | https://docs.google.com/presentation/d/1TT9oPYmwU3-O7w8_i081i66E77dxKgVH/edit?usp=sharing&ouid=107110547759842100452&rtpof=true&sd=true | |||||||||||||||||||||
33 | Shahira Abousamra | GroupViT: Semantic Segmentation Emerges from Text Supervision | https://arxiv.org/abs/2202.11094 | CVPR 2022 | https://docs.google.com/presentation/d/1s2roFFmX6BwfmCixMR55J3FNlBkVmgOqie5cFM_-lro/edit?usp=sharing | |||||||||||||||||||||
34 | Othmane | Deep Reinforcement Learning for Sarcopenia Assessment | https://arxiv.org/abs/2107.12800 | MLMI - MICCAI 2021 | https://drive.google.com/file/d/1mlxfoh1XKGLVdDzYnDN7Djn6aHTfIbn6/view?usp=sharing | |||||||||||||||||||||
35 | 10/12/2022 | |||||||||||||||||||||||||
36 | Alexandros | Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise | https://arxiv.org/abs/2208.09392 | NeurIPS 2022 | https://docs.google.com/presentation/d/1skoyssqXW35kx3jtct_z7hEdX9LFydCe3QU3ooBSVY4/edit?usp=sharing | |||||||||||||||||||||
37 | Cristina | Learning State-Aware Visual Representations from Audible Interactions | https://arxiv.org/pdf/2209.13583.pdf | Neurips 2022 | https://docs.google.com/presentation/d/1dpDY21btn3oPdOYaDSyvh0k6aWS-UvfSUPQ3Y-hVSk8/edit?usp=sharing | |||||||||||||||||||||
38 | ShahRukh | DreamFusion: Text-to-3D using 2D Diffusion | https://dreamfusion3d.github.io/ | https://docs.google.com/presentation/d/1_G-t-OaM6WCuF7dn8DCxXkSXwB2F8f_zz4WE7FXorTE/edit?usp=sharing | ||||||||||||||||||||||
39 | Xiang | Temporally Consistent Video Transformer for Long-Term Video Prediction | https://wilson1yan.github.io/teco/ | Use transformer to predict the latent trajectory, it is also built on existing fancy methods like VQ-GAN and MaskGIT | https://docs.google.com/presentation/d/1VijoYW95iP7mNTCK2BdAdKd_tMYOQ91cnbJMMx4FxmA/edit?usp=sharing | |||||||||||||||||||||
40 | 10/19/2022 | |||||||||||||||||||||||||
41 | Kanchana | Perceptual Grouping in Vision-Language Models | https://arxiv.org/abs/2210.09996 | ICLR 2023 submission | https://drive.google.com/file/d/1oAoO-WzaoifpABquHdHCNfFt-9NjAYzo/preview | |||||||||||||||||||||
42 | Kumara | Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | https://arxiv.org/abs/2204.00598 | arxiv | https://docs.google.com/presentation/d/1OCbgM32T7RMWHt-nDH_eRkgUnEN-NZEm60_2CbPncOA/edit?usp=sharing | |||||||||||||||||||||
43 | Yunfan | Semantic-Aware Domain Generalized Segmentation | https://openaccess.thecvf.com/content/CVPR2022/papers/Peng_Semantic-Aware_Domain_Generalized_Segmentation_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1KMbTMlPWMKmGilGq-l_SDxkM58lMW-RMDh43GZD46Ak/edit?usp=sharing | |||||||||||||||||||||
44 | ||||||||||||||||||||||||||
45 | 10/26/2022 | |||||||||||||||||||||||||
46 | Jongwoo | GRAFTING VISION TRANSFORMERS | https://openreview.net/pdf?id=48EwqCCosOO | ICLR 2023 submission | https://docs.google.com/presentation/d/15O8brbJJGwaiHAs0CTQ6p7r5TPi8UQrip5_j85hCk3Y/edit?usp=sharing | |||||||||||||||||||||
47 | Aggelina | FaceFormer: Speech-Driven 3D Facial Animation with Transformers | https://arxiv.org/pdf/2112.05329.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1D8ujlvlevm21HU7r6h33d8U2fMt7tVQl_aC1sLofIsI/edit?usp=sharing | |||||||||||||||||||||
48 | Saarthak | What do self-supervised vision transformers learn? | https://openreview.net/pdf?id=azCKuYyS74 | ICLR 2023 submission | https://docs.google.com/presentation/d/1LVP5WXDTFL-gs-qhmaQpxH3sv0L5ux-rRZWtl6GmGWQ/edit?usp=sharing | |||||||||||||||||||||
49 | Prantik | Semi-Supervised Semantic Segmentation via Adaptive Equalizaton Learning | https://arxiv.org/pdf/2110.05474.pdf | NeurIPS 2021 | https://docs.google.com/presentation/d/1ZU4Hr0iWYZKlv8jPPo0U3D61Ho3Uqz93zwLslzXC6ec/edit?usp=sharing | |||||||||||||||||||||
50 | 11/2/2022 | |||||||||||||||||||||||||
51 | Sagnik | From In-the-Wild Images of Documents to High Quality Scans | ||||||||||||||||||||||||
52 | 11/16/2022 | |||||||||||||||||||||||||
53 | Georgi | Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation | https://openaccess.thecvf.com//content/CVPR2022/papers/Rahimi_Reading_To_Listen_at_the_Cocktail_Party_Multi-Modal_Speech_Separation_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/11qjju47QNAXoCgQrlIYKIzvrE6n_eTMX0Uwn74mCfOw/edit?usp=sharing | |||||||||||||||||||||
54 | Manal | Boundary IoU: Improving Object-Centric Image Segmentation Evaluation | https://arxiv.org/pdf/2103.16562.pdf | CVPR 2021 | https://docs.google.com/presentation/d/14WSZ-XhiCTcnZLuLPEsmEiMPj7hyaW6GyPT7uOdnRSI/edit?usp=sharing | |||||||||||||||||||||
55 | Saumya | Slim Scissors: Segmenting Thin Object from Synthetic Background | https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136890375.pdf | ECCV 2022 | https://docs.google.com/presentation/d/1GPEZlOL4giiAn1mZKcUCMgruybk-9CVNGqI0BJWSusM/edit?usp=sharing | |||||||||||||||||||||
56 | Alexandros | NeurIPS Poster | https://drive.google.com/file/d/1xXgt7wL77YF7E_ay2ZjxmNHJ18YaGfCN/view?usp=share_link | |||||||||||||||||||||||
57 | 11/30/2022 | |||||||||||||||||||||||||
58 | Zhilin | Efficient Deep Embedded Subspace Clustering | https://openaccess.thecvf.com/content/CVPR2022/papers/Cai_Efficient_Deep_Embedded_Subspace_Clustering_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1RCXfOPFxxXIYVh2lpaIXPyUJzzJxQTbE-_-9B00Izes/edit?usp=sharing | |||||||||||||||||||||
59 | Vu | Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces | https://arxiv.org/pdf/2211.07292.pdf | https://docs.google.com/presentation/d/1PLXrIrmErh3z-Dvm00jxJesVjhYzOWoOGh8tiNtjFis/edit?usp=sharing | ||||||||||||||||||||||
60 | David | Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast | https://openaccess.thecvf.com/content/CVPR2022/papers/Du_Weakly_Supervised_Semantic_Segmentation_by_Pixel-to-Prototype_Contrast_CVPR_2022_paper.pdf | CVPR 2022 | https://docs.google.com/presentation/d/1fVF3smDpfxFGwpQbYZpui6jlXPrXIFE3mQDMhVLKseI/edit?usp=sharing | |||||||||||||||||||||
61 | ShahRukh | DreamFusion: Text-to-3D using 2D Diffusion | https://dreamfusion3d.github.io/ | https://docs.google.com/presentation/d/1_G-t-OaM6WCuF7dn8DCxXkSXwB2F8f_zz4WE7FXorTE/edit?usp=sharing | ||||||||||||||||||||||
62 | Yash | Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning | https://www.usenix.org/system/files/osdi21-qiao.pdf | Optimising batch size and learning rate for a deep learning model in a cluster training setting | https://docs.google.com/presentation/d/1S6a6MOymNGw33K4FvOXOUJRtnnpPY_u-PH_tnrxOBKY/edit?usp=sharing | |||||||||||||||||||||
63 | ||||||||||||||||||||||||||
64 | ||||||||||||||||||||||||||
65 | ||||||||||||||||||||||||||
66 | ||||||||||||||||||||||||||
67 | ||||||||||||||||||||||||||
68 | ||||||||||||||||||||||||||
69 | ||||||||||||||||||||||||||
70 | ||||||||||||||||||||||||||
71 | ||||||||||||||||||||||||||
72 | ||||||||||||||||||||||||||
73 | ||||||||||||||||||||||||||
74 | ||||||||||||||||||||||||||
75 | ||||||||||||||||||||||||||
76 | ||||||||||||||||||||||||||
77 | ||||||||||||||||||||||||||
78 | ||||||||||||||||||||||||||
79 | ||||||||||||||||||||||||||
80 | ||||||||||||||||||||||||||
81 | ||||||||||||||||||||||||||
82 | ||||||||||||||||||||||||||
83 | ||||||||||||||||||||||||||
84 | ||||||||||||||||||||||||||
85 | ||||||||||||||||||||||||||
86 | ||||||||||||||||||||||||||
87 | ||||||||||||||||||||||||||
88 | ||||||||||||||||||||||||||
89 | ||||||||||||||||||||||||||
90 | ||||||||||||||||||||||||||
91 | ||||||||||||||||||||||||||
92 | ||||||||||||||||||||||||||
93 | ||||||||||||||||||||||||||
94 | ||||||||||||||||||||||||||
95 | ||||||||||||||||||||||||||
96 | ||||||||||||||||||||||||||
97 | ||||||||||||||||||||||||||
98 | ||||||||||||||||||||||||||
99 | ||||||||||||||||||||||||||
100 |